Lee, Chin-Pang; Chiu, Yu-Wen; Chu, Chun-Lin; Chen, Yu; Jiang, Kun-Hao; Chen, Jiun-Liang; Chen, Ching-Yen
2016-12-01
The aging males' symptoms (AMS) scale is an instrument used to determine the health-related quality of life in adult and elderly men. The purpose of this study was to synthesize internal consistency (Cronbach's alpha) and test-retest reliability for the AMS scale and its three subscales. Of the 123 studies reviewed, 12 provided alpha coefficients, which were then used in the meta-analyses of internal consistency. Seven of the 12 included studies provided test-retest coefficients, and these were used in the meta-analyses of test-retest reliability. The AMS scale had excellent internal consistency [α = 0.89 (95% CI 0.88-0.90)]; the mean alpha estimates across the AMS subscales ranged from 0.79 to 0.82. The AMS scale also had good test-retest reliability [r = 0.85 (95% CI 0.82-0.88)]; the test-retest reliability coefficients of the AMS subscales ranged from 0.76 to 0.83. There was significant heterogeneity among the included studies. The AMS scale and the three subscales had fairly good internal consistency and test-retest reliability. Future psychometric studies of the AMS scale should report important characteristics of the participants, details of item scores, and test-retest reliability.
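As a reminder of what the alpha coefficients synthesized above measure, here is a minimal sketch of Cronbach's alpha computed from raw item scores. The function name and the tiny score matrix are illustrative assumptions and do not come from the AMS studies.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total scores))
    """
    k = len(scores[0])
    if k < 2:
        raise ValueError("Cronbach's alpha needs at least two items")
    # Sample variance of each item (column) across respondents.
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of each respondent's summed score.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical data: 4 respondents answering 2 items.
example = [[1, 2], [2, 1], [3, 4], [4, 3]]
print(cronbach_alpha(example))
```

Alpha approaches 1 as items covary more strongly; it is undefined when the total scores have zero variance.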
ERIC Educational Resources Information Center
Yaman, Erkan
2012-01-01
The aim of this research was to develop the Mobbing Impacts Scale and to examine its validity and reliability analyses. The sample of study consisted of 509 teachers from Sakarya. In this study construct validity, internal consistency, test-retest reliabilities and item analysis of the scale were examined. As a result of factor analysis for…
16 CFR 260.5 - Interpretation and substantiation of environmental marketing claims.
Code of Federal Regulations, 2011 CFR
2011-01-01
... reasonable basis substantiating the claim. A reasonable basis consists of competent and reliable evidence. In... reliable scientific evidence, defined as tests, analyses, research, studies or other evidence based on the... qualified to do so, using procedures generally accepted in the profession to yield accurate and reliable...
Timed activity performance in persons with upper limb amputation: A preliminary study.
Resnik, Linda; Borgia, Mathew; Acluche, Frantzy
55 subjects with upper limb amputation were administered the T-MAP twice within one week. To develop a timed measure of activity performance for persons with upper limb amputation (T-MAP); examine the measure's internal consistency, test-retest reliability and validity; and compare scores by prosthesis use. Measures of activity performance for persons with upper limb amputation are needed. The time required to perform daily activities is a meaningful metric that has implications for participation in life roles. Internal consistency and test-retest reliability were evaluated. Construct validity was examined by comparing scores by amputation level. Exploratory analyses compared sub-group scores, and examined correlations with other measures. Scale alpha was 0.77, ICC was 0.93. Timed scores differed by amputation level. Subjects using a prosthesis took longer to perform all tasks. T-MAP was not correlated with other measures of dexterity or activity, but was correlated with pain for non-prosthesis users. The timed scale had adequate internal consistency and excellent test-retest reliability. Analyses support reliability and construct validity of the T-MAP. 2c "outcomes" research. Published by Elsevier Inc.
[KON-2006--Neurotic Personality Questionnaire].
Aleksandrowicz, Jerzy W; Klasa, Katarzyna; Sobański, Jerzy A; Stolarska, Dorota
2007-01-01
Construction of a questionnaire describing personality traits connected to the occurrence and persistence of neurotic disorders. Responses of 794 patients (before treatment) and 520 persons from the control group on items of the constructed personality questionnaire and the symptom checklist "0". Analyses of subscales reliability and item-scale correlations, test-retest and split-half reliability. Factor analyses estimating internal reliability of the questionnaire. Cross-validation with the KO"0" symptom checklist. Psychometric properties of the KON-2006 questionnaire indicate that it is consistent and reliable enough. Validity analyses indicate a large probability that the X-KON coefficient informs on personality dysfunctions related to neurotic disorders. The Neurotic Personality Questionnaire KON-2006 may serve to estimate personality traits connected to the occurrence and persistence of neurotic disorders as well as changes resulting from psychotherapy.
NASA Astrophysics Data System (ADS)
Saini, K. K.; Sehgal, R. K.; Sethi, B. L.
2008-10-01
In this paper major reliability estimators are analyzed and their comparative results are discussed. Their strengths and weaknesses are evaluated in this case study. Each of the reliability estimators has certain advantages and disadvantages. Inter-rater reliability is one of the best ways to estimate reliability when your measure is an observation. However, it requires multiple raters or observers. As an alternative, you could look at the correlation of ratings of the same single observer repeated on two different occasions. Each of the reliability estimators will give a different value for reliability. In general, the test-retest and inter-rater reliability estimates will be lower in value than the parallel forms and internal consistency ones because they involve measuring at different times or with different raters. Since reliability estimates are often used in statistical analyses of quasi-experimental designs.
Yalin Sapmaz, Şermin; Ergin, Dilek; Özek Erkuran, Handan; Şen Celasin, Nesrin; Öztürk, Masum; Karaarslan, Duygu; Köroğlu, Ertuğrul; Aydemir, Ömer
2017-09-01
This study assessed the validity and reliability of the Turkish version of the DSM-5 Posttraumatic Stress Symptom Severity Scale-Child Form for use among the Turkish population. The study group consisted of 30 patients that had been treated in a child psychiatry unit and diagnosed with posttraumatic stress disorder and 83 healthy volunteers that were attending middle or high school during the study period. For reliability analyses, the internal consistency coefficient and the test-retest correlation coefficient were measured. For validity analyses, the exploratory factor analysis and correlation analysis with the Child Posttraumatic Stress Reaction Index for concurrent validity were measured. The Cronbach's alpha (the internal consistency coefficient) of the scale was 0.909, and the test-retest correlation coefficient was 0.663. One factor that could explain 58.5% of the variance was obtained and was congruent with the original construct of the scale. As for concurrent validity, the scale showed high correlation with the Child Posttraumatic Stress Reaction Index. It was concluded that the Turkish version of the DSM-5 Posttraumatic Stress Symptom Severity Scale-Child Form can be used as a valid and reliable tool.
The psychometric properties of the 'Hospital Survey on Patient Safety Culture' in Dutch hospitals.
Smits, Marleen; Christiaans-Dingelhoff, Ingrid; Wagner, Cordula; Wal, Gerrit van der; Groenewegen, Peter P
2008-11-07
In many different countries the Hospital Survey on Patient Safety Culture (HSOPS) is used to assess the safety culture in hospitals. Accordingly, the questionnaire has been translated into Dutch for application in the Netherlands. The aim of this study was to examine the underlying dimensions and psychometric properties of the questionnaire in Dutch hospital settings, and to compare these results with the original questionnaire used in USA hospital settings. The HSOPS was completed by 583 staff members of four general hospitals, three teaching hospitals, and one university hospital in the Netherlands. Confirmatory factor analyses were performed to examine the applicability of the factor structure of the American questionnaire to the Dutch data. Explorative factor analyses were performed to examine whether another composition of items and factors would fit the data better. Supplementary psychometric analyses were performed, including internal consistency and construct validity. The confirmatory factor analyses were based on the 12-factor model of the original questionnaire and resulted in a few low reliability scores. Eleven factors were drawn with explorative factor analyses, with acceptable reliability scores and good construct validity. Two items were removed from the questionnaire. The composition of the factors was very similar to that of the original questionnaire. A few items moved to another factor and two factors turned out to combine into a six-item dimension. All other dimensions consisted of two to five items. The Dutch translation of the HSOPS consists of 11 factors with acceptable reliability and good construct validity, and is similar to the original HSOPS factor structure.
Martignon, Stefania; Bautista-Mendoza, Gloria; González-Carrera, María; Lafaurie-Villamil, Gloria; Morales, Veicy; Santamaría, Ruth
2008-01-01
Designing three instruments for evaluating oral health knowledge, attitudes and practice in parents/caregivers of low social-economic status 0-5 year-olds. Evaluating the instruments' reliability in terms of internal consistency and analysing items. Three instruments were constructed for evaluating low social-economic status 0-5 year-olds' parents/caregivers' oral health knowledge, attitudes and practice in the municipality of Usaquén, Bogotá, Colombia. 47 parents/caregivers were given a test establishing the instrument's reliability in terms of internal consistency and the adults' level of knowledge, attitudes and practice. A sub-sample was qualitatively analysed (content verification and understanding). Reliability was evaluated using Cronbach's alpha coefficient. Items were analysed for improving constructing and understanding the questions, taking four criteria into account: corrected homogeneity index (CHI), response trend, correlation between items and qualitative analysis. Cronbach's alpha coefficient for knowledge, attitudes and practice was 0.82, 0.80 and 0.62, respectively. Participants' level of knowledge, attitudes and practice was acceptable (60%, 55% and 91%, respectively). This study found two out of the three evaluated instruments to be reliable (knowledge and attitudes); all three of them were then redesigned. The resulting instruments represent a valuable tool which can be used in future studies for describing and evaluating preventative programmes.
The reliability paradox of the Parent-Child Conflict Tactics Corporal Punishment Subscale.
Lorber, Michael F; Slep, Amy M Smith
2018-02-01
In the present investigation we consider and explain an apparent paradox in the measurement of corporal punishment with the Parent-Child Conflict Tactics Scale (CTS-PC): How can it have poor internal consistency and still be reliable? The CTS-PC was administered to a community sample of 453 opposite sex couples who were parents of 3- to 7-year-old children. Internal consistency was marginal, yet item response theory analyses revealed that reliability rose sharply with increasing corporal punishment, exceeding .80 in the upper ranges of the construct. The results suggest that the CTS-PC Corporal Punishment subscale reliably discriminates among parents who report average to high corporal punishment (64% of mothers and 56% of fathers in the present sample), despite low overall internal consistency. These results have straightforward implications for the use and reporting of the scale. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Malec, James F; Kragness, Miriam; Evans, Randall W; Finlay, Karen L; Kent, Ann; Lezak, Muriel D
2003-01-01
To evaluate the internal consistency of the Mayo-Portland Adaptability Inventory (MPAI), further refine the instrument, and provide reference data based on a large, geographically diverse sample of persons with acquired brain injury (ABI). 386 persons, most with moderate to severe ABI. Outpatient, community-based, and residential rehabilitation facilities for persons with ABI located in the United States: West, Midwest, and Southeast. Rasch, item cluster, principal components, and traditional psychometric analyses for internal consistency of MPAI data and subscales. With rescoring of rating scales for 4 items, a 29-item version of the MPAI showed satisfactory internal consistency by Rasch (Person Reliability = .88; Item Reliability = .99) and traditional psychometric indicators (Cronbach's alpha = .89). Three rationally derived subscales for Ability, Activity, and Participation demonstrated psychometric properties that were equivalent to subscales derived empirically through item cluster and factor analyses. For the 3 subscales, Person Reliability ranged from .78 to .79; Item Reliability, from .98 to .99; and Cronbach's alpha, from .76 to .83. Subscales correlated moderately (Pearson r = .49-.65) with each other and strongly with the overall scale (Pearson r = .82-.86). Outcome after ABI is represented by the unitary dimension described by the MPAI. MPAI subscales further define regions of this dimension that may be useful for evaluation of clinical cases and program evaluation.
Changes in School Climate in a Long-Term Perspective
ERIC Educational Resources Information Center
Kallestad, Jan Helge
2010-01-01
In a previous report five school climate instruments were explored (1983 and 1985), and four scales were regarded as meaningful climate measures according to suggested criteria. These scales were re-inspected in the present study (1997 and 1998) by analyses of internal consistency, estimates of reliability (unit and aggregated reliability), and…
Parts and Components Reliability Assessment: A Cost Effective Approach
NASA Technical Reports Server (NTRS)
Lee, Lydia
2009-01-01
System reliability assessment is a methodology which incorporates reliability analyses performed at parts and components level such as Reliability Prediction, Failure Modes and Effects Analysis (FMEA) and Fault Tree Analysis (FTA) to assess risks, perform design tradeoffs, and therefore, to ensure effective productivity and/or mission success. The system reliability is used to optimize the product design to accommodate today's mandated budget, manpower, and schedule constraints. Standard-based reliability assessment is an effective approach consisting of reliability predictions together with other reliability analyses for electronic, electrical, and electro-mechanical (EEE) complex parts and components of large systems based on failure rate estimates published by the United States (U.S.) military or commercial standards and handbooks. Many of these standards are globally accepted and recognized. The reliability assessment is especially useful during the initial stages when the system design is still in the development and hard failure data is not yet available or manufacturers are not contractually obliged by their customers to publish the reliability estimates/predictions for their parts and components. This paper presents a methodology to assess system reliability using parts and components reliability estimates to ensure effective productivity and/or mission success in an efficient manner, low cost, and tight schedule.
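One common way to roll part-level failure rate estimates up to a system figure is a parts-count style prediction. The sketch below assumes a series system of independent parts with constant failure rates (exponential lifetimes); the abstract does not say the paper uses exactly this model, and the part names and rates are hypothetical placeholders, not handbook values.

```python
import math


def series_failure_rate(part_rates):
    """System failure rate for independent parts in series.

    With constant failure rates, the rates simply add.
    """
    return sum(part_rates)


def reliability(failure_rate, hours):
    """Probability of surviving `hours` under a constant failure rate."""
    return math.exp(-failure_rate * hours)


# Hypothetical part failure rates, in failures per hour.
parts = {"resistor": 1e-6, "capacitor": 2e-6, "connector": 5e-7}

system_rate = series_failure_rate(parts.values())
print("system failure rate per hour:", system_rate)
print("MTBF in hours:", 1 / system_rate)
print("reliability at 10,000 h:", reliability(system_rate, 10_000))
```

Because the rates add, the part with the largest failure rate dominates the prediction, which is why such estimates are useful for early design tradeoffs.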
Hernansaiz-Garrido, Helena; Alonso-Tapia, Jesús
2017-01-01
Internalized stigma and disclosure concerns are key elements for the study of mental health in people living with HIV. Since no measures of these constructs were available for Spanish population, this study sought to develop such instruments, to analyze their reliability and validity and to provide a short version. A heterogeneous sample of 458 adults from different Spanish-speaking countries completed the HIV-Internalized Stigma Scale and the HIV-Disclosure Concerns Scale, along with the Hospital Anxiety and Depression Scale, Rosenberg's Self-esteem Scale and other socio-demographic variables. Reliability and correlation analyses, exploratory factor analyses, path analyses with latent variables, and ANOVAs were conducted to test the scales' psychometric properties. The scales showed good reliability in terms of internal consistency and temporal stability, as well as good sensitivity and factorial and criterion validity. The HIV-Internalized Stigma Scale and the HIV-Disclosure Concerns Scale are reliable and valid means to assess these variables in several contexts.
Reliability analysis of structural ceramics subjected to biaxial flexure
NASA Technical Reports Server (NTRS)
Chao, Luen-Yuan; Shetty, Dinesh K.
1991-01-01
The reliability of alumina disks subjected to biaxial flexure is predicted on the basis of statistical fracture theory using a critical strain energy release rate fracture criterion. Results on a sintered silicon nitride are consistent with reliability predictions based on pore-initiated penny-shaped cracks with preferred orientation normal to the maximum principal stress. Assumptions with regard to flaw types and their orientations in each ceramic can be justified by fractography. It is shown that there are no universal guidelines for selecting fracture criteria or assuming flaw orientations in reliability analyses.
Kadar, Masne; Ibrahim, Suhaili; Razaob, Nor Afifi; Chai, Siaw Chui; Harun, Dzalani
2018-02-01
The Lawton Instrumental Activities of Daily Living Scale is a tool often used to assess independence among elderly at home. Its suitability to be used with the elderly population in Malaysia has not been validated. This current study aimed to assess the validity and reliability of the Lawton Instrumental Activities of Daily Living Scale - Malay Version to Malay speaking elderly in Malaysia. This study was divided into three phases: (1) translation and linguistic validity involving both forward and backward translations; (2) establishment of face validity and content validity; and (3) establishment of reliability involving inter-rater, test-retest and internal consistency analyses. Data used for these analyses were obtained by interviewing 65 elderly respondents. Percentages of Content Validity Index for 4 criteria were from 88.89 to 100.0. The Cronbach α coefficient for internal consistency was 0.838. Intra-class Correlation Coefficient of inter-rater reliability and test-retest reliability was 0.957 and 0.950 respectively. The result shows that the Lawton Instrumental Activities of Daily Living Scale - Malay Version has excellent reliability and validity for use with the Malay speaking elderly people in Malaysia. This scale could be used by professionals to assess functional ability of elderly who live independently in community. © 2018 Occupational Therapy Australia.
Hajcak, Greg; Meyer, Alexandria; Kotov, Roman
2017-08-01
In the clinical neuroscience literature, between-subjects differences in neural activity are presumed to reflect reliable measures-even though the psychometric properties of neural measures are almost never reported. The current article focuses on the critical importance of assessing and reporting internal consistency reliability-the homogeneity of "items" that comprise a neural "score." We demonstrate how variability in the internal consistency of neural measures limits between-subjects (i.e., individual differences) effects. To this end, we utilize error-related brain activity (i.e., the error-related negativity or ERN) in both healthy and generalized anxiety disorder (GAD) participants to demonstrate options for psychometric analyses of neural measures; we examine between-groups differences in internal consistency, between-groups effect sizes, and between-groups discriminability (i.e., ROC analyses)-all as a function of increasing items (i.e., number of trials). Overall, internal consistency should be used to inform experimental design and the choice of neural measures in individual differences research. The internal consistency of neural measures is necessary for interpreting results and guiding progress in clinical neuroscience-and should be routinely reported in all individual differences studies. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
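The dependence of internal consistency on the number of trials can be illustrated with the Spearman-Brown prophecy formula. This is a standard psychometric formula offered here only as an illustration; the abstract does not state which estimator the authors used, and the single-trial reliability of 0.10 below is a made-up value.

```python
def spearman_brown(single_item_reliability, n_items):
    """Predicted reliability of a score built from n_items parallel items.

    r_n = n * r / (1 + (n - 1) * r), where r is the reliability of one item.
    """
    r = single_item_reliability
    return n_items * r / (1 + (n_items - 1) * r)


# Hypothetical single-trial reliability; show how it grows with more trials.
for n_trials in (2, 6, 10, 20, 40):
    print(n_trials, round(spearman_brown(0.10, n_trials), 3))
```

The curve rises quickly at first and then flattens, which is one reason trial counts are worth considering at the design stage.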
Kilgus, Stephen P; Riley-Tillman, T Chris; Stichter, Janine P; Schoemann, Alexander M; Bellesheim, Katie
2016-09-01
The purpose of this investigation was to evaluate the reliability of Direct Behavior Ratings-Social Competence (DBR-SC) ratings. Participants included 60 students identified as possessing deficits in social competence, as well as their 23 classroom teachers. Teachers used DBR-SC to complete ratings of 5 student behaviors within the general education setting on a daily basis across approximately 5 months. During this time, each student was assigned to 1 of 2 intervention conditions, including the Social Competence Intervention-Adolescent (SCI-A) and a business-as-usual (BAU) intervention. Ratings were collected across 3 intervention phases, including pre-, mid-, and postintervention. Results suggested DBR-SC ratings were highly consistent across time within each student, with reliability coefficients predominantly falling in the .80 and .90 ranges. Findings further indicated such levels of reliability could be achieved with only a small number of ratings, with estimates varying between 2 and 10 data points. Group comparison analyses further suggested the reliability of DBR-SC ratings increased over time, such that student behavior became more consistent throughout the intervention period. Furthermore, analyses revealed that for 2 of the 5 DBR-SC behavior targets, the increase in reliability over time was moderated by intervention grouping, with students receiving SCI-A demonstrating greater increases in reliability relative to those in the BAU group. Limitations of the investigation as well as directions for future research are discussed herein. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Reliability analysis and initial requirements for FC systems and stacks
NASA Astrophysics Data System (ADS)
Åström, K.; Fontell, E.; Virtanen, S.
In the year 2000 Wärtsilä Corporation started an R&D program to develop SOFC systems for CHP applications. The program aims to bring to the market highly efficient, clean and cost competitive fuel cell systems with rated power output in the range of 50-250 kW for distributed generation and marine applications. In the program Wärtsilä focuses on system integration and development. System reliability and availability are key issues determining the competitiveness of the SOFC technology. In Wärtsilä, methods have been implemented for analysing the system in respect to reliability and safety as well as for defining reliability requirements for system components. A fault tree representation is used as the basis for reliability prediction analysis. A dynamic simulation technique has been developed to allow for non-static properties in the fault tree logic modelling. Special emphasis has been placed on reliability analysis of the fuel cell stacks in the system. A method for assessing reliability and critical failure predictability requirements for fuel cell stacks in a system consisting of several stacks has been developed. The method is based on a qualitative model of the stack configuration where each stack can be in a functional, partially failed or critically failed state, each of the states having different failure rates and effects on the system behaviour. The main purpose of the method is to understand the effect of stack reliability, critical failure predictability and operating strategy on the system reliability and availability. An example configuration, consisting of 5 × 5 stacks (series of 5 sets of 5 parallel stacks) is analysed in respect to stack reliability requirements as a function of predictability of critical failures and Weibull shape factor of failure rate distributions.
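For orientation, the 5 × 5 example configuration can be sketched with a much simpler model than the one the abstract describes. The sketch assumes two-state stacks (working or failed), independent Weibull lifetimes, and that a set of 5 parallel stacks survives as long as at least one stack works. All three are simplifying assumptions: the authors' method uses three stack states and dynamic simulation, and the parameter values below are hypothetical.

```python
import math


def weibull_reliability(t, shape, scale):
    """Survival probability at time t for a Weibull lifetime."""
    return math.exp(-((t / scale) ** shape))


def series_of_parallel(stack_reliability, n_parallel=5, n_series=5):
    """Reliability of n_series sets in series, each of n_parallel stacks.

    Assumes independent stacks and that one working stack keeps a set alive.
    """
    set_reliability = 1 - (1 - stack_reliability) ** n_parallel
    return set_reliability ** n_series


# Hypothetical parameters: characteristic life 40,000 h, evaluated at 20,000 h.
for shape in (0.8, 1.0, 2.0):
    r_stack = weibull_reliability(20_000, shape, 40_000)
    print(shape, round(r_stack, 4), round(series_of_parallel(r_stack), 4))
```

Varying the shape factor shows how the same characteristic life leads to different system reliability depending on whether failures are early-life or wear-out dominated.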
Sauer Liberato, Ana Carolina; Cunha Matheus Rodrigues, Roberta; Kim, MyoungJin; Mallory, Caroline
2016-07-01
This study examined the reliability and validity of the Brazilian Portuguese version of the Treatment Satisfaction Questionnaire for Medication (version 1.4) among patients with hypertension. Understanding the patient experience with treatment satisfaction will contribute to improved medication adherence and control of hypertension. Hypertension is a serious problem in Brazil that is associated with chronic illness controlled, in part, by consistent adherence to medications. Patient satisfaction with medication treatment is associated with adherence to medication. The Treatment Satisfaction Questionnaire for Medication (version 1.4) is a promising instrument for measuring medication; however, to date there has been no report of the reliability and validity of the instrument with Portuguese-speaking adults with hypertension in Brazil. Cross-sectional descriptive exploratory study. A convenience sample of 300 patients with hypertension in an outpatient setting in the southeast region of São Paulo state in Brazil completed the Treatment Satisfaction Questionnaire for Medication (version 1.4). The instrument, comprised of four subscales, was evaluated for reliability using correlation analyses and internal consistency. Confirmatory factor analysis was used to determine factorial validity. Correlational analyses, internal consistency (Cronbach's alpha) and hierarchical confirmatory factor analysis demonstrate adequate support for the four-factor dimensionality, reliability and factorial validity of the Treatment Satisfaction Questionnaire for Medication (version 1.4). This study provides modest evidence for internal consistency and factorial validity of the Treatment Satisfaction Questionnaire for Medication (version 1.4) in Portuguese-speaking adult Brazilians with hypertension. Future testing should focus on extending reliability testing, discriminant validity and potential translation and literacy issues in this population. 
Within known limitations, clinicians will find the Treatment Satisfaction Questionnaire for Medication (version 1.4) useful for identifying adult Portuguese-speaking Brazilian patients at risk of poor adherence and tailoring adherence interventions to promote hypertension control. © 2016 John Wiley & Sons Ltd.
Test-Retest Reliability of the Short-Form Survivor Unmet Needs Survey.
Taylor, Karen; Bulsara, Max; Monterosso, Leanne
2018-01-01
Reliable and valid needs assessment measures are important assessment tools in cancer survivorship care. A new 30-item short-form version of the Survivor Unmet Needs Survey (SF-SUNS) was developed and validated with cancer survivors, including hematology cancer survivors; however, test-retest reliability has not been established. The objective of this study was to assess the test-retest reliability of the SF-SUNS with a cohort of lymphoma survivors (n = 40). Test-retest reliability of the SF-SUNS was conducted at two time points: baseline (time 1) and 5 days later (time 2). Test-retest data were collected from lymphoma cancer survivors (n = 40) in a large tertiary cancer center in Western Australia. Intraclass correlation analyses compared data at time 1 (baseline) and time 2 (5 days later). Cronbach's alpha analyses were performed to assess the internal consistency at both time points. The majority (23/30, 77%) of items achieved test-retest reliability scores 0.45-0.74 (fair to good). A high degree of overall internal consistency was demonstrated (time 1 = 0.92, time 2 = 0.95), with scores 0.65-0.94 across subscales for both time points. Mixed test-retest reliability of the SF-SUNS was established. Our results indicate the SF-SUNS is responsive to the changing needs of lymphoma cancer survivors. Routine use of cancer survivorship specific needs-based assessments is required in oncology care today. Nurses are well placed to administer these assessments and provide tailored information and resources. Further assessment of test-retest reliability in hematology and other cancer cohorts is warranted.
Moreno-Murcia, Juan A; Martínez-Galindo, Celestina; Moreno-Pérez, Víctor; Marcos, Pablo J.; Borges, Fernanda
2012-01-01
This study aimed to cross-validate the psychometric properties of the Basic Psychological Needs in Exercise Scale (BPNES) by Vlachopoulos and Michailidou (2006) in a Spanish context. Two studies were conducted. Confirmatory factor analysis results confirmed the hypothesized three-factor solution. In addition, we documented evidence of reliability, analysed as internal consistency and temporal stability. Future studies should analyse the scale's validity and reliability with different populations and check their experimental effect. Key points: The Basic Psychological Needs in Exercise Scale (BPNES) is valid and reliable for measuring basic psychological needs in healthy physical exercise in the Spanish context. The factor structure of three correlated factors has shown minimal invariance across gender. PMID:24149130
Developing scale for colleague solidarity among nurses in Turkey.
Uslusoy, Esin Cetinkaya; Alpar, Sule Ecevit
2013-02-01
There is a need for an appropriate instrument to measure colleague solidarity among nurses. This study was carried out to develop a Colleague Solidarity of Nurses' Scale (CSNS). This study was planned to be descriptive and methodological. The CSNS examined content validity, construct validity, test-retest reliability and internal consistency reliability. The trial form of the CSNS, which was composed of 44 items, was given to 200 nurses, followed by validity and reliability analyses. Following the analyses, 21 items were excluded from the scale, leaving an attitude scale made up of 23 items. Factor analysis of the data showed that the scale has a three sub-factor structure: emotional solidarity, academic solidarity and negative opinions about solidarity. The Cronbach's alpha reliability of the whole scale was 0.80. This study provides evidence that the CSNS possesses robust solidarity among nurses. © 2013 Wiley Publishing Asia Pty Ltd.
Lindström, Eva; Jedenius, Erik; Levander, Sten
2009-01-01
The objective of the study was to validate a self-administrated symptom rating scale for use in patients with schizophrenia spectrum disorders by item analysis, exploration of factor structure, and analyses of reliability and validity. Data on 151 patients, initially treated by risperidone, obtained within the framework of a naturalistic Phase IV longitudinal study, were analysed by comparing patient and clinician ratings of symptoms, side-effects and global indices of illness. The Symptom Self-rating Scale for Schizophrenia (4S) is psychometrically adequate (item analysis, internal consistency, factor structure). Side-effect ratings were reliable. Symptom ratings displayed consistent associations with clinicians' ratings of corresponding symptom dimensions, suggesting construct validity. Patients had most difficulties assessing negative symptom items. Patients were well able to assess their own symptoms and drug side-effects. The factor structure of symptom ratings differs between patients and clinicians as well as how they construe global indices of illness. Clinicians focus on psychotic, patients on affective symptoms. Use of symptom self-ratings is one way to improve communication and thereby strengthen the therapeutic alliance and increase treatment adherence.
Meta-Analysis that Conceals More than It Reveals: Comment on Storm Et Al. (2010)
ERIC Educational Resources Information Center
Hyman, Ray
2010-01-01
Storm, Tressoldi, and Di Risio (2010) rely on meta-analyses to justify their claim that the evidence for psi is consistent and reliable. They manufacture apparent homogeneity and consistency by eliminating many outliers and combining databases whose combined effect sizes are not significantly different--even though these combined effect sizes…
Writing Across the Curriculum: Reliability Testing of a Standardized Rubric.
Minnich, Margo; Kirkpatrick, Amanda J; Goodman, Joely T; Whittaker, Ali; Stanton Chapple, Helen; Schoening, Anne M; Khanna, Maya M
2018-06-01
Rubrics positively affect student academic performance; however, accuracy and consistency of the rubric and its use are imperative. The researchers in this study developed a standardized rubric for use across an undergraduate nursing curriculum, then evaluated the interrater reliability and general usability of the tool. Faculty raters graded papers using the standardized rubric, submitted their independent scoring for interrater reliability analyses, then participated in a focus group discussion regarding their experience of using the rubric. Quantitative analysis of the data showed a high interrater reliability (α = .998). Content analysis of the transcription revealed several positive themes: Consistency, Emphasis on Writing Ability, and Ability to Use the Rubric as a Teaching Tool. Areas for improvement included use of value words and difficulty with point allocation. Investigators recommend effective faculty orientation for rubric use and future work in developing a rubric to assess reflective writing. [J Nurs Educ. 2018;57(6):366-370.]. Copyright 2018, SLACK Incorporated.
Santelmann, Hanno; Franklin, Jeremy; Bußhoff, Jana; Baethge, Christopher
2016-10-01
Schizoaffective disorder is a common diagnosis in clinical practice but its nosological status has been subject to debate ever since it was conceptualized. Although it is key that diagnostic reliability is sufficient, schizoaffective disorder has been reported to have low interrater reliability. Evidence based on systematic review and meta-analysis methods, however, is lacking. Using a highly sensitive literature search in Medline, Embase, and PsycInfo we identified studies measuring the interrater reliability of schizoaffective disorder in comparison to schizophrenia, bipolar disorder, and unipolar disorder. Out of 4126 records screened we included 25 studies reporting on 7912 patients diagnosed by different raters. The interrater reliability of schizoaffective disorder was moderate (meta-analytic estimate of Cohen's kappa 0.57 [95% CI: 0.41-0.73]), and substantially lower than that of its main differential diagnoses (difference in kappa between 0.22 and 0.19). Although there was considerable heterogeneity, analyses revealed that the interrater reliability of schizoaffective disorder was consistently lower in the overwhelming majority of studies. The results remained robust in subgroup and sensitivity analyses (e.g., diagnostic manual used) as well as in meta-regressions (e.g., publication year) and analyses of publication bias. Clinically, the results highlight the particular importance of diagnostic re-evaluation in patients diagnosed with schizoaffective disorder. They also quantify a widely held clinical impression of lower interrater reliability and agree with earlier meta-analysis reporting low test-retest reliability. Copyright © 2016. Published by Elsevier B.V.
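The kappa statistic pooled in the abstract above can be illustrated with a minimal Python sketch. It computes Cohen's kappa for two raters from a cross-classification table. The function name and the example counts are hypothetical and are not taken from the review, which pooled kappa values already reported by the included studies.

```python
def cohens_kappa(table):
    """Cohen's kappa for two raters.

    table[i][j] is the number of cases that rater A assigned to
    category i and rater B assigned to category j.
    """
    n = sum(sum(row) for row in table)
    k = len(table)
    # Observed agreement: share of cases on the diagonal.
    p_observed = sum(table[i][i] for i in range(k)) / n
    # Chance agreement: product of the raters' marginal proportions.
    row_share = [sum(row) / n for row in table]
    col_share = [sum(table[i][j] for i in range(k)) / n for j in range(k)]
    p_expected = sum(r * c for r, c in zip(row_share, col_share))
    return (p_observed - p_expected) / (1 - p_expected)


# Hypothetical counts: diagnosis present/absent according to two raters.
example_table = [[20, 5],
                 [10, 15]]
kappa = cohens_kappa(example_table)
```

With these made-up counts the observed agreement is 0.70 and the chance agreement is 0.50, so kappa works out to 0.40. The correction for chance is why kappa sits well below raw percentage agreement.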
Polcin, Douglas L.; Galloway, Gantt P.; Bond, Jason; Korcha, Rachael; Greenfield, Thomas K.
2008-01-01
The addiction field lacks an accepted definition and reliable measure of confrontation. The Alcohol and Drug Confrontation Scale (ADCS) defines confrontation as warnings about the potential consequences of substance use. To assess psychometric properties, 323 individuals entering recovery houses in U.S. urban and suburban areas were interviewed between 2003 and 2005 (20% women, 68% white). Analyses included test-retest reliability, confirmatory factor analysis, and measures of internal consistency. Findings support the ADCS as a reliable way of assessing two factors: Internal Support and External Intensity. Confrontation was experienced as supportive, accurate and helpful. Additional studies should assess confrontation in different contexts. PMID:20686635
Psychometrics Matter in Health Behavior: A Long-term Reliability Generalization Study.
Pickett, Andrew C; Valdez, Danny; Barry, Adam E
2017-09-01
Despite numerous calls for increased understanding and reporting of reliability estimates, social science research, including the field of health behavior, has been slow to respond and adopt such practices. Therefore, we offer a brief overview of reliability and common reporting errors; we then perform analyses to examine and demonstrate the variability of reliability estimates by sample and over time. Using meta-analytic reliability generalization, we examined the variability of coefficient alpha scores for a well-designed, consistent, nationwide health study, covering a span of nearly 40 years. For each year and sample, reliability varied. Furthermore, reliability was predicted by a sample characteristic that differed among age groups within each administration. We demonstrated that reliability is influenced by the methods and individuals from which a given sample is drawn. Our work echoes previous calls that psychometric properties, particularly reliability of scores, are important and must be considered and reported before drawing statistical conclusions.
Reliability and validity of the Salford-Scott Nursing Values Questionnaire in Turkish.
Ulusoy, Hatice; Güler, Güngör; Yıldırım, Gülay; Demir, Ecem
2018-02-01
Developing professional values among nursing students is important because values are a significant predictor of the quality care that will be provided, the clients' recognition, and consequently the nurses' job satisfaction. The literature analysis showed that there is only one validated tool available in Turkish that examines both the personal and the professional values of nursing students. The aim of this study was to assess the reliability and validity of the Salford-Scott Nursing Values Questionnaire in Turkish. This study was a Turkish linguistic and cultural adaptation of a research tool. Participants and research context: The sample of this study consisted of 627 undergraduate nursing students from different geographical areas of Turkey. Two questionnaires were used for data collection: a socio-demographic form and the Salford-Scott Nursing Values Questionnaire. For the Salford-Scott Nursing Values Questionnaire, construct validity was examined using factor analyses. Ethical considerations: The study was approved by the Cumhuriyet University Faculty of Medicine Research Ethics Board. Students were informed that participation in the study was entirely voluntary and anonymous. Item content validity index ranged from 0.66 to 1.0, and the total content validity index was 0.94. The Kaiser-Meyer-Olkin measure of sampling adequacy was 0.870, and Bartlett's test of sphericity was statistically significant (χ² = 3108.714, p < 0.001). Construct validity was examined using factor analyses and six factors were identified. Cronbach's alpha was used to assess the internal consistency reliability and a value of 0.834 was obtained. Our analyses showed that the Turkish version of the Salford-Scott Nursing Values Questionnaire has high validity and reliability.
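Cronbach's alpha, the internal-consistency coefficient reported in this and many of the surrounding records, can be sketched in a few lines of Python. The function name and the tiny response matrix are hypothetical illustrations. None of the abstracts publish item-level data, so no reported value is reproduced here.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items score matrix.

    scores is a list of respondents, each a list of k item scores.
    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total score))
    """
    k = len(scores[0])
    item_columns = list(zip(*scores))            # one tuple per item
    item_variance_sum = sum(variance(col) for col in item_columns)
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - item_variance_sum / total_variance)


# Hypothetical responses: 3 respondents answering 2 items.
example_scores = [[2, 3],
                  [4, 4],
                  [3, 5]]
alpha = cronbach_alpha(example_scores)
```

Here each item has variance 1 and the total score has variance 3, so alpha is 2 × (1 − 2/3) ≈ 0.67. When all items rank respondents identically, the same function returns 1.0.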
Müller-Staub, Maria; Lunney, Margaret; Lavin, Mary Ann; Needham, Ian; Odenbreit, Matthias; van Achterberg, Theo
2010-04-01
The instrument Q-DIO was developed from 2005 to 2006 to measure the quality of documented nursing diagnoses, interventions, and nursing-sensitive patient outcomes. The aim of the study was to test the psychometric properties of the Q-DIO (Quality of nursing Diagnoses, Interventions and Outcomes). Instrument testing included internal consistency, test-retest reliability, interrater reliability, item analyses, and an assessment of the objectivity. To render variation in scores, a random strata sample of 60 nursing documentations was drawn. The strata represented 30 nursing documentations with and 30 without application of theory-based, standardised nursing language. Internal consistency of the subscale nursing diagnoses as process showed Cronbach's alpha 0.83 [0.78, 0.88]; nursing diagnoses as product 0.98 [0.94, 0.99]; nursing interventions 0.90 [0.85, 0.94]; and nursing-sensitive patient outcomes 0.99 [0.95, 0.99]. With Cohen's kappa of 0.95, the intrarater reliability was good. The interrater reliability showed a kappa of 0.94 [0.90, 0.96]. Item analyses confirmed the fulfilment of criteria for degree of difficulty and discriminative validity of the items. In this study, Q-DIO has been shown to be a reliable instrument. It allows measuring the documented quality of nursing diagnoses, interventions and outcomes with and without implementation of theory-based, standardised nursing languages. Studies for further testing of Q-DIO in other settings are recommended. The results implicitly support the use of nursing classifications such as NANDA, NIC and NOC.
A tool to assess sex-gender when selecting health research projects.
Tomás, Concepción; Yago, Teresa; Eguiluz, Mercedes; Samitier, M A Luisa; Oliveros, Teresa; Palacios, Gemma
2015-04-01
To validate the questionnaire "Gender Perspective in Health Research" (GPIHR) to assess the inclusion of gender perspective in research projects. Validation study in two stages. Feasibility was analysed in the first, and reliability, internal consistency and validity in the second. Aragón Institute of Health Science, Aragón, Spain. GPIHR was applied to 118 research projects funded in national and international competitive tenders from 2003 to 2012. Analysis of inter- and intra-observer reliability with Kappa index and internal consistency with Cronbach's alpha. Content validity analysed through literature review and construct validity with an exploratory factor analysis. Validated GPIHR has 10 questions: 3 in the introduction, 1 for objectives, 3 for methodology and 3 for research purpose. Average time of application was 13 min. Inter-observer reliability (Kappa) varied between 0.35 and 0.94 and intra-observer between 0.40 and 0.94. Theoretical construct is supported in the literature. Factor analysis identifies three levels of GP inclusion: "difference by sex", "gender sensitive" and "feminist research" with an internal consistency of 0.64, 0.87 and 0.81, respectively, which explain 74.78% of variance. GPIHR questionnaire is a valid tool to assess GP and useful for those researchers who would like to include GP in their projects. Copyright © 2014 Elsevier España, S.L.U. All rights reserved.
Judging in Rhythmic Gymnastics at Different Levels of Performance.
Leandro, Catarina; Ávila-Carvalho, Lurdes; Sierra-Palmeiro, Elena; Bobo-Arce, Marta
2017-12-01
This study aimed to analyse the quality of difficulty judging in rhythmic gymnastics, at different levels of performance. The sample consisted of 1152 difficulty scores concerning 288 individual routines, performed in the World Championships in 2013. The data were analysed using the mean absolute judge deviation from the final difficulty score, a Cronbach's alpha coefficient and intra-class correlations, for consistency and reliability assessment. For validity assessment, mean deviations of judges' difficulty scores, the Kendall's coefficient of concordance W and ANOVA eta-squared values were calculated. Overall, the results in terms of consistency (Cronbach's alpha mostly above 0.90) and reliability (intra-class correlations for single and average measures above 0.70 and 0.90, respectively) were satisfactory, in the first and third parts of the ranking on all apparatus. The medium level gymnasts, those in the second part of the ranking, had inferior reliability indices and highest score dispersion. In this part, the minimum of corrected item-total correlation of individual judges was 0.55, with most values well below, and the matrix for between-judge correlations identified remarkable inferior correlations. These findings suggest that the quality of difficulty judging in rhythmic gymnastics may be compromised at certain levels of performance. In future, special attention should be paid to the judging analysis of the medium level gymnasts, as well as the Code of Points applicability at this level.
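The single-measure and average-measure intra-class correlations quoted above (above 0.70 and 0.90) are linked by the Spearman-Brown formula when both come from the same ICC model. The Python sketch below rests on two assumptions that the abstract does not state. First, each routine is taken to have received four difficulty scores, inferred from 1152 scores over 288 routines. Second, the two ICC values are taken to share one model. The function name is illustrative.

```python
def average_measure_icc(single_icc, n_raters):
    """Spearman-Brown step-up: reliability of the mean of n_raters scores,
    given the reliability of a single rater's score."""
    return n_raters * single_icc / (1 + (n_raters - 1) * single_icc)


# Assumption: 1152 difficulty scores / 288 routines = 4 scores per routine.
scores_per_routine = 1152 // 288
stepped_up = average_measure_icc(0.70, scores_per_routine)
```

Under these assumptions a single-judge ICC of 0.70 steps up to about 0.90 for the panel average, which is consistent with the two thresholds reported in the abstract.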
Judging in Rhythmic Gymnastics at Different Levels of Performance
Ávila-Carvalho, Lurdes; Sierra-Palmeiro, Elena; Bobo-Arce, Marta
2017-01-01
This study aimed to analyse the quality of difficulty judging in rhythmic gymnastics, at different levels of performance. The sample consisted of 1152 difficulty scores concerning 288 individual routines, performed in the World Championships in 2013. The data were analysed using the mean absolute judge deviation from the final difficulty score, a Cronbach’s alpha coefficient and intra-class correlations, for consistency and reliability assessment. For validity assessment, mean deviations of judges’ difficulty scores, the Kendall’s coefficient of concordance W and ANOVA eta-squared values were calculated. Overall, the results in terms of consistency (Cronbach’s alpha mostly above 0.90) and reliability (intra-class correlations for single and average measures above 0.70 and 0.90, respectively) were satisfactory, in the first and third parts of the ranking on all apparatus. The medium level gymnasts, those in the second part of the ranking, had inferior reliability indices and highest score dispersion. In this part, the minimum of corrected item-total correlation of individual judges was 0.55, with most values well below, and the matrix for between-judge correlations identified remarkable inferior correlations. These findings suggest that the quality of difficulty judging in rhythmic gymnastics may be compromised at certain levels of performance. In future, special attention should be paid to the judging analysis of the medium level gymnasts, as well as the Code of Points applicability at this level. PMID:29339996
Reliability and validity of the de Morton Mobility Index in individuals with sub-acute stroke.
Braun, Tobias; Marks, Detlef; Thiel, Christian; Grüneberg, Christian
2018-02-04
To establish the validity and reliability of the de Morton Mobility Index (DEMMI) in patients with sub-acute stroke. This cross-sectional study was performed in a neurological rehabilitation hospital. We assessed unidimensionality, construct validity, internal consistency reliability, inter-rater reliability, minimal detectable change and possible floor and ceiling effects of the DEMMI in adult patients with sub-acute stroke. The study included a total sample of 121 patients with sub-acute stroke. We analysed validity (n = 109) and reliability (n = 51) in two sub-samples. Rasch analysis indicated unidimensionality with an overall fit to the model (chi-square = 12.37, p = 0.577). All hypotheses on construct validity were confirmed. Internal consistency reliability (Cronbach's alpha = 0.94) and inter-rater reliability (intraclass correlation coefficient = 0.95; 95% confidence interval: 0.92-0.97) were excellent. The minimal detectable change with 90% confidence was 13 points. No floor or ceiling effects were evident. These results indicate unidimensionality, sufficient internal consistency reliability, inter-rater reliability, and construct validity of the DEMMI in patients with a sub-acute stroke. Advantages of the DEMMI in clinical application are the short administration time, no need for special equipment and interval level data. The de Morton Mobility Index, therefore, may be a useful performance-based bedside test to measure mobility in individuals with a sub-acute stroke across the whole mobility spectrum.
Implications for Rehabilitation: The de Morton Mobility Index (DEMMI) is a unidimensional measurement instrument of mobility in individuals with sub-acute stroke. The DEMMI has excellent internal consistency and inter-rater reliability, and sufficient construct validity. The minimal detectable change of the DEMMI with 90% confidence in stroke rehabilitation is 13 points. The lack of any floor or ceiling effects on hospital admission indicates applicability across the whole mobility spectrum of patients with sub-acute stroke.
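The abstract reports a minimal detectable change at 90% confidence (MDC90) of 13 points but does not say how it was computed. A minimal Python sketch follows, assuming the conventional formulas SEM = SD × √(1 − reliability) and MDC90 = 1.645 × √2 × SEM. The helper names and the standard-deviation example are hypothetical. The "implied" SEM is only a back-calculation under that assumption, not a figure from the study.

```python
import math

Z_90 = 1.645  # approximate two-sided 90% quantile of the standard normal


def sem_from_sd(sd, reliability):
    """Standard error of measurement from a score SD and a reliability coefficient."""
    return sd * math.sqrt(1 - reliability)


def mdc(sem, z=Z_90):
    """Minimal detectable change: smallest difference between two measurements
    that exceeds measurement error at the confidence level implied by z."""
    return z * math.sqrt(2) * sem


def sem_from_mdc(mdc_value, z=Z_90):
    """Invert the MDC formula to recover the SEM it implies."""
    return mdc_value / (z * math.sqrt(2))


# Back-calculation under the assumed formula: SEM implied by an MDC90 of 13 points.
implied_sem = sem_from_mdc(13)

# Hypothetical illustration: SD of 10 points with reliability 0.75 gives SEM = 5.
example_sem = sem_from_sd(10, 0.75)
```

If the conventional formula was indeed used, an MDC90 of 13 points corresponds to an SEM of roughly 5.6 points.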
Bartels, Meike; Cath, Danielle C.; Boomsma, Dorret I.
2008-01-01
The factor structure of the Dutch translation of the Autism-Spectrum Quotient (AQ; a continuous, quantitative measure of autistic traits) was evaluated with confirmatory factor analyses in a large general population and student sample. The criterion validity of the AQ was examined in three matched patient groups (autism spectrum conditions (ASC), social anxiety disorder, and obsessive–compulsive disorder). A two factor model, consisting of a “Social interaction” factor and “Attention to detail” factor could be identified. The internal consistency and test–retest reliability of the AQ were satisfactory. High total AQ and factor scores were specific to ASC patients. Men scored higher than women and science students higher than non-science students. The Dutch translation of the AQ is a reliable instrument to assess autism spectrum conditions. PMID:18302013
Dugosh, Karen Leggett; Festinger, David S; Lynch, Kevin G; Marlowe, Douglas B
2014-10-01
Systematically identifying reasons that clients enter substance abuse treatment may allow clinicians to immediately focus on issues of greatest relevance to the individual and enhance treatment engagement. We developed the Survey of Treatment Entry Pressures (STEP) to identify the specific factors that precipitated an individual's treatment entry. The instrument contains 121 items from 6 psychosocial domains (i.e., family, financial, social, medical, psychiatric, legal). The current study examined the STEP's psychometric properties. A total of 761 participants from various treatment settings and modalities completed the STEP prior to treatment admission and 4-7 days later. Analyses were performed to examine the instrument's psychometric properties including item response rates, test-retest reliability, internal consistency, and factor structure. The items displayed adequate test-retest reliability and internal consistency within each psychosocial domain. Generally, results from exploratory and confirmatory factor analyses support a 2-factor structure reflecting type of reinforcement schedule. The study provides preliminary support for the psychometric properties of the STEP. The STEP may provide a reliable way for clinicians to characterize and capitalize on a client's treatment motivation early on which may serve to improve treatment retention and therapeutic outcomes. © 2014 Wiley Periodicals, Inc.
Yalın Sapmaz, Şermin; Özek Erkuran, Handan; Ergin, Dilek; Öztürk, Masum; Şen Celasin, Nesrin; Karaarslan, Duygu; Aydemir, Ömer
2018-02-23
Background/aim: This study aimed to assess the validity and reliability of the Turkish version of the DSM-5 Generalized Anxiety Disorder Severity Scale - Child Form. Materials and methods: The study sample consisted of 32 patients treated in a child psychiatry unit and diagnosed with generalized anxiety disorder and 98 healthy volunteers who were attending middle or high school during the study period. For the assessment, the Screen for Child Anxiety and Related Emotional Disorders (SCARED) was also used along with the DSM-5 Generalized Anxiety Disorder Severity Scale - Child Form. Results: Regarding reliability analyses, the Cronbach alpha internal consistency coefficient was calculated as 0.932. The test-retest correlation coefficient was calculated as r = 0.707. As for construct validity, one factor that could explain 62.6% of the variance was obtained and this was consistent with the original construct of the scale. As for concurrent validity, the scale showed a high correlation with SCARED. Conclusion: It was concluded that Turkish version of the DSM-5 Generalized Anxiety Disorder Severity Scale - Child Form could be utilized as a valid and reliable tool both in clinical practice and for research purposes.
The brief multidimensional students' life satisfaction scale-college version.
Zullig, Keith J; Huebner, E Scott; Patton, Jon M; Murray, Karen A
2009-01-01
To investigate the psychometric properties of the BMSLSS-College among 723 college students. Internal consistency estimates explored scale reliability, factor analysis explored construct validity, and known-groups validity was assessed using the National College Youth Risk Behavior Survey and Harvard School of Public Health College Alcohol Study. Criterion-related validity was explored through analyses with the CDC's health-related quality of life scale and a social isolation scale. Acceptable internal consistency reliability, construct, known-groups, and criterion-related validity were established. Findings offer preliminary support for the BMSLSS-C; it could be useful in large-scale research studies, applied screening contexts, and for program evaluation purposes toward achieving Healthy People 2010 objectives.
Development and initial validation of the internalization of Asian American stereotypes scale.
Shen, Frances C; Wang, Yu-Wei; Swanson, Jane L
2011-07-01
This research consists of four studies on the initial reliability and validity of the Internalization of Asian American Stereotypes Scale (IAASS), a self-report instrument that measures the degree Asian Americans have internalized racial stereotypes about their own group. The results from the exploratory and confirmatory factor analyses support a stable four-factor structure of the IAASS: Difficulties with English Language Communication, Pursuit of Prestigious Careers, Emotional Reservation, and Expected Academic Success. Evidence for concurrent and discriminant validity is presented. High internal-consistency and test-retest reliability estimates are reported. A discussion of how this scale can contribute to research and practice regarding internalized stereotyping among Asian Americans is provided.
Improved Hip-Based Individual Recognition Using Wearable Motion Recording Sensor
NASA Astrophysics Data System (ADS)
Gafurov, Davrondzhon; Bours, Patrick
In today's society the demand for reliable verification of a user's identity is increasing. Although biometric technologies based on fingerprint or iris can provide accurate and reliable recognition performance, they are inconvenient for periodic or frequent re-verification. In this paper we propose a hip-based user recognition method which can be suitable for implicit and periodic re-verification of identity. In our approach we use a wearable accelerometer sensor attached to the hip of the person, and the measured hip motion signal is then analysed for identity verification purposes. The main analysis steps consist of detecting gait cycles in the signal and matching two sets of detected gait cycles. Evaluating the approach on a hip data set consisting of 400 gait sequences (samples) from 100 subjects, we obtained an equal error rate (EER) of 7.5%, and the identification rate at rank 1 was 81.4%. These numbers are improvements of 37.5% and 11.2%, respectively, over the previous study using the same data set.
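The equal error rate quoted above is the operating point at which the false accept rate equals the false reject rate. The Python sketch below estimates it by sweeping a decision threshold over distance-like match scores, where lower means more similar. The score lists and the function name are hypothetical. The abstract does not describe the paper's gait-cycle matching score or its threshold procedure, so this is only an illustration of the metric.

```python
def equal_error_rate(genuine, impostor):
    """Estimate the EER from distance scores (lower = more similar).

    genuine:  distances between samples of the same person
    impostor: distances between samples of different people
    Returns (eer, threshold) at the threshold where FAR and FRR are closest.
    """
    best = None
    for threshold in sorted(set(genuine) | set(impostor)):
        # False reject: a genuine pair whose distance exceeds the threshold.
        frr = sum(d > threshold for d in genuine) / len(genuine)
        # False accept: an impostor pair whose distance is within the threshold.
        far = sum(d <= threshold for d in impostor) / len(impostor)
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            best = (gap, (far + frr) / 2, threshold)
    return best[1], best[2]


# Hypothetical distances, not data from the paper.
genuine_scores = [0.1, 0.2, 0.3, 0.6]
impostor_scores = [0.4, 0.7, 0.8, 0.9]
eer, eer_threshold = equal_error_rate(genuine_scores, impostor_scores)
```

For these made-up scores the two error rates meet at a threshold of 0.4, where one of four genuine pairs is rejected and one of four impostor pairs is accepted, giving an EER of 25%.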
Measuring Critical Care Providers' Attitudes About Controlled Donation After Circulatory Death.
Rodrigue, James R; Luskin, Richard; Nelson, Helen; Glazier, Alexandra; Henderson, Galen V; Delmonico, Francis L
2018-06-01
Unfavorable attitudes and insufficient knowledge about donation after cardiac death among critical care providers can have important consequences for the appropriate identification of potential donors, consistent implementation of donation after cardiac death policies, and relative strength of support for this type of donation. The lack of reliable and valid assessment measures has hampered research to capture providers' attitudes. Design and Research Aims: Using stakeholder engagement and an iterative process, we developed a questionnaire to measure attitudes of donation after cardiac death in critical care providers (n = 112) and examined its psychometric properties. Exploratory factor analysis, internal consistency, and validity analyses were conducted to examine the measure. A 34-item questionnaire consisting of 4 factors (Personal Comfort, Process Satisfaction, Family Comfort, and System Trust) provided the most parsimonious fit. Internal consistency was acceptable for each of the subscales and the total questionnaire (Cronbach α > .70). A strong association between more favorable attitudes overall and knowledge (r = .43, P < .001) provides evidence of convergent validity. Multivariable regression analyses showed that white race (P = .002) and more experience with donation after cardiac death (P < .001) were significant predictors of more favorable attitudes. Study findings support the utility, reliability, and validity of a questionnaire for measuring attitudes in critical care providers and for isolating targets for additional education on donation after cardiac death.
Construction and Validation of the Perceived Opportunity to Craft Scale.
van Wingerden, Jessica; Niks, Irene M W
2017-01-01
We developed and validated a scale to measure employees' perceived opportunity to craft (POC) in two separate studies conducted in the Netherlands (total N = 2329). POC is defined as employees' perception of their opportunity to craft their job. In Study 1, the perceived opportunity to craft scale (POCS) was developed and tested for its factor structure and reliability in an explorative way. Study 2 consisted of confirmatory analyses of the factor structure and reliability of the scale as well as examination of the discriminant and criterion-related validity of the POCS. The results indicated that the scale consists of one dimension and could be reliably measured with five items. Evidence was found for the discriminant validity of the POCS. The scale also showed criterion-related validity when correlated with job crafting (+), job resources (autonomy +; opportunities for professional development +), work engagement (+), and the inactive construct cynicism (-). We discuss the implications of these findings for theory and practice.
Reliability and validity of the Modified Erikson Psychosocial Stage Inventory in diverse samples.
Leidy, N K; Darling-Fisher, C S
1995-04-01
The Modified Erikson Psychosocial Stage Inventory (MEPSI) is a relatively simple survey measure designed to assess the strength of psychosocial attributes that arise from progression through Erikson's eight stages of development. The purpose of this study was to employ secondary analysis to evaluate the internal-consistency reliability and construct validity of the MEPSI across four diverse samples: healthy young adults, hemophilic men, healthy older adults, and older adults with chronic obstructive pulmonary disease. Special attention was given to the performance of the measure across gender, with exploratory analyses examining possible age cohort and health status effects. Internal-consistency estimates for the aggregate measure were high, whereas subscale reliability levels varied across age groups. Construct validity was supported across samples. Gender, cohort, and health effects offered interesting psychometric and theoretical insights and direction for further research. Findings indicated that the MEPSI might be a useful instrument for operationalizing and testing Eriksonian developmental theory in adults.
Consistency Analysis and Data Consultation of Gas System of Gas-Electricity Network of Latvia
NASA Astrophysics Data System (ADS)
Zemite, L.; Kutjuns, A.; Bode, I.; Kunickis, M.; Zeltins, N.
2018-02-01
In the present research, the main critical points of gas transmission and storage system of Latvia have been determined to ensure secure and reliable gas supply among the Baltic States to fulfil the core objectives of the EU energy policies. Technical data of critical points of the gas transmission and storage system of Latvia have been collected and analysed with the SWOT method and solutions have been provided to increase the reliability of the regional natural gas system.
van der Meulen, Mirja W; Boerebach, Benjamin C M; Smirnova, Alina; Heeneman, Sylvia; Oude Egbrink, Mirjam G A; van der Vleuten, Cees P M; Arah, Onyebuchi A; Lombarts, Kiki M J M H
2017-01-01
Multisource feedback (MSF) instruments are used to, and must feasibly, provide reliable and valid data on physicians' performance from multiple perspectives. The "INviting Co-workers to Evaluate Physicians Tool" (INCEPT) is a multisource feedback instrument used to evaluate physicians' professional performance as perceived by peers, residents, and coworkers. In this study, we report on the validity, reliability, and feasibility of the INCEPT. The performance of 218 physicians was assessed by 597 peers, 344 residents, and 822 coworkers. Using explorative and confirmatory factor analyses, multilevel regression analyses between narrative and numerical feedback, item-total correlations, interscale correlations, Cronbach's α, and generalizability analyses, the psychometric qualities and feasibility of the INCEPT were investigated. For all respondent groups, three factors were identified, although constructed slightly differently: "professional attitude," "patient-centeredness," and "organization and (self)-management." Internal consistency was high for all constructs (Cronbach's α ≥ 0.84 and item-total correlations ≥ 0.52). Confirmatory factor analyses indicated acceptable to good fit. Further validity evidence was given by the associations between narrative and numerical feedback. For reliable total INCEPT scores, three peer, two resident and three coworker evaluations were needed; for subscale scores, evaluations of three peers, three residents and three to four coworkers were sufficient. The INCEPT instrument provides physicians with performance feedback in a valid and reliable way. The number of evaluations needed to establish reliable scores is achievable in a regular clinical department. When interpreting feedback, physicians should consider that respondent groups' perceptions differ, as indicated by the different item clustering per performance factor.
DOT National Transportation Integrated Search
2006-03-01
There have been several studies that have investigated interactions between light and heavy vehicles. These have primarily consisted of crash database analyses where Police Accident Reports have been studied. These approaches are generally reliable, ...
The Multitheoretical List of Therapeutic Interventions - 30 items (MULTI-30).
Solomonov, Nili; McCarthy, Kevin S; Gorman, Bernard S; Barber, Jacques P
2018-01-16
To develop a brief version of the Multitheoretical List of Therapeutic Interventions (MULTI-60) in order to decrease completion time burden by approximately half, while maintaining content coverage. Study 1 aimed to select 30 items. Study 2 aimed to examine the reliability and internal consistency of the MULTI-30. Study 3 aimed to validate the MULTI-30 and ensure content coverage. In Study 1, the sample included 186 therapist and 255 patient MULTI ratings, and 164 ratings of sessions coded by trained observers. Internal consistency (Cronbach's alpha and McDonald's omega) was calculated and confirmatory factor analysis was conducted. Psychotherapy experts rated content relevance. Study 2 included a sample of 644 patient and 522 therapist ratings, and 793 codings of psychotherapy sessions. In Study 3, the sample included 33 codings of sessions. A series of regression analyses was conducted to examine replication of previously published findings using the MULTI-30. The MULTI-30 was found to be valid, reliable, and internally consistent across the 2564 ratings examined in the three studies presented. The MULTI-30 is a brief and reliable process measure. Future studies are required for further validation.
The Brazilian version of the effort-reward imbalance questionnaire to assess job stress.
Chor, Dóra; Werneck, Guilherme Loureiro; Faerstein, Eduardo; Alves, Márcia Guimarães de Mello; Rotenberg, Lúcia
2008-01-01
The effort-reward imbalance (ERI) model has been used to assess the health impact of job stress. We aimed at describing the cross-cultural adaptation of the ERI questionnaire into Portuguese and some psychometric properties, in particular internal consistency, test-retest reliability, and factorial structure. We developed a Brazilian version of the ERI using a back-translation method and tested its reliability. The test-retest reliability study was conducted with 111 health workers and University staff. The current analyses are based on 89 participants, after exclusion of those with missing data. Reproducibility (intraclass correlation coefficients) for the "effort", "reward", and "overcommitment" dimensions of the scale was estimated at 0.76, 0.86, and 0.78, respectively. Internal consistency (Cronbach's alpha) estimates for these same dimensions were 0.68, 0.78, and 0.78, respectively. The exploratory factorial structure was fairly consistent with the model's theoretical components. We conclude that the results of this study represent the first evidence in favor of the application of the Brazilian Portuguese version of the ERI scale in health research in populations with similar socioeconomic characteristics.
İlçin, Nursen; Gürpınar, Barış; Bayraktar, Deniz; Savcı, Sema; Çetin, Pınar; Sarı, İsmail; Akkoç, Nurullah
2016-01-01
[Purpose] This study describes the cultural adaptation, validation, and reliability of the Turkish version of the Pain Catastrophizing Scale in patients with ankylosing spondylitis. [Methods] The validity of the Turkish version of the Pain Catastrophizing Scale was assessed by evaluating data quality (missing data and floor and ceiling effects), principal components analysis, internal consistency (Cronbach’s alpha), and construct validity (Spearman’s rho). Reproducibility analyses included standard measurement error, minimum detectable change, limits of agreement, and intraclass correlation coefficients. [Results] Sixty-four adult patients with ankylosing spondylitis with a mean age of 42.2 years completed the study. Factor analysis revealed that all questionnaire items could be grouped into two factors. Excellent internal consistency was found, with a Cronbach’s alpha value of 0.95. Reliability analyses showed an intraclass correlation coefficient (95% confidence interval) of 0.96 for the total score. There was a low correlation coefficient between the Turkish version of the Pain Catastrophizing Scale and body mass index, pain levels at rest and during activity, health-related quality of life, and fear and avoidance behaviors. [Conclusion] The results of this study indicate that the Turkish version of the Pain Catastrophizing Scale is a valid and reliable clinical and research tool for patients with ankylosing spondylitis. PMID:26957778
Lemons and Leases in the Used Business Aircraft Market.
ERIC Educational Resources Information Center
Gilligan, Thomas W.
2004-01-01
Given adverse selection, durable goods that trade less frequently depreciate more quickly. Consistent with this prediction, I find an inverse relationship between depreciation and trading volume for less reliable brands of used business aircraft. Additionally, recent theoretical analyses suggest that leasing, by increasing the average quality of…
Tsuno, Kanami; Kawakami, Norito; Shimazu, Akihito; Shimada, Kyoko; Inoue, Akiomi; Leiter, Michael P
2017-05-25
Although incivility is a common interpersonal mistreatment and associated with poor mental health, there are few studies about it in Asian countries. The aim of this study was to develop the Japanese version of the modified Work Incivility Scale (J-MWIS), investigate its reliability and validity, and reveal the prevalence of incivility among Japanese employees in comparison with data on Canadian employees. A total of 2,191 Japanese and 1,071 Canadian employees were surveyed, using either the J-MWIS or MWIS. Japanese employees additionally answered questions on civility, worksite social support, workplace bullying, psychological distress, intention to leave, and work engagement to investigate construct validity. At least one form of workplace incivility was experienced by both Japanese (52.3%) and Canadian (86.0%) employees in the previous month. Internal consistency reliability of the J-MWIS was acceptable (α=0.71-0.81), and correlation analyses also confirmed its construct validity as expected. Workplace incivility was associated with lower workgroup civility, lower supervisor and coworker support, higher workplace bullying, higher psychological distress, higher intention to leave, and lower work engagement. Confirmatory factor analyses showed that the original three-factor model (supervisor incivility, coworker incivility, and instigated incivility) fitted moderately in both Japan and Canada data, though the privacy/overfamiliarity factor was additionally extracted from exploratory factor analysis for the J-MWIS. The results of this study suggested that the J-MWIS has moderate internal consistency reliability and good construct validity.
Tsuno, Kanami; Kawakami, Norito; Shimazu, Akihito; Shimada, Kyoko; Inoue, Akiomi; Leiter, Michael P.
2017-01-01
Objectives: Although incivility is a common interpersonal mistreatment and associated with poor mental health, there are few studies about it in Asian countries. The aim of this study was to develop the Japanese version of the modified Work Incivility Scale (J-MWIS), investigate its reliability and validity, and reveal the prevalence of incivility among Japanese employees in comparison with data on Canadian employees. Methods: A total of 2,191 Japanese and 1,071 Canadian employees were surveyed, using either the J-MWIS or MWIS. Japanese employees additionally answered questions on civility, worksite social support, workplace bullying, psychological distress, intention to leave, and work engagement to investigate construct validity. Results: At least one form of workplace incivility was experienced by both Japanese (52.3%) and Canadian (86.0%) employees in the previous month. Internal consistency reliability of the J-MWIS was acceptable (α=0.71-0.81), and correlation analyses also confirmed its construct validity as expected. Workplace incivility was associated with lower workgroup civility, lower supervisor and coworker support, higher workplace bullying, higher psychological distress, higher intention to leave, and lower work engagement. Confirmatory factor analyses showed that the original three-factor model (supervisor incivility, coworker incivility, and instigated incivility) fitted moderately in both Japan and Canada data, though the privacy/overfamiliarity factor was additionally extracted from exploratory factor analysis for the J-MWIS. Conclusions: The results of this study suggested that the J-MWIS has moderate internal consistency reliability and good construct validity. PMID:28302927
Hsu, L-F; Hung, C-L; Kuo, L-J; Tsai, P-S
2017-09-01
No instrument is available to assess the impact of faecal incontinence (FI) on quality of life for the Chinese-speaking population. The purpose of the study was to adapt the Faecal Incontinence Quality of Life Scale (FIQL) for patients with colorectal cancer, assess the factor structure and reduce the items for brevity. A sample of 120 participants was enrolled. Internal consistency, test-retest reliability, and convergent and contrasted-groups validity were assessed. Construct validity was analysed using exploratory and confirmatory factor analyses (CFA). The internal consistency (Cronbach's α of the total scale and four subscales = 0.98 and 0.97, 0.96, 0.92, 0.82 respectively), test-retest reliability (intraclass correlation coefficients ≥.98 for all scales with p < .001) and significant correlations of all scales with selected subscales of the Medical Outcomes Study 36-Item Short-Form Health Survey and the Wexner scale suggested satisfactory reliability and validity. The severe FI group (with a Wexner score ≥9) scored significantly lower on the scale than the less severe FI group (with a Wexner score <9) did (p < .001). The CFA supported a two-factor structure and demonstrated an excellent model fit of the 15-item abbreviated version of the FIQL-Chinese. The FIQL-Chinese has satisfactory validity and reliability and the abbreviated version may be more practical and applicable. © 2016 John Wiley & Sons Ltd.
The inner formal structure of the H-T-P drawings: an exploratory study.
Vass, Z
1998-08-01
The study describes some interrelated patterns of traits of the House-Tree-Person (H-T-P) drawings with the instruments of hierarchical cluster analysis. First, according to the literature 17 formal or structural aspects of the projective drawings were collected, after which a detailed manual for coding was compiled. Second, the interrater reliability and the consistency of this manual were tested. Third, the hierarchical cluster structure of the reliable and consistent formal aspects was analysed. Results are: (a) a psychometrically tested coding manual of the investigated formal-structural aspects, each of them illustrated with drawings that showed the highest interrater agreement; and (b) the hierarchic cluster structure of the formal aspects of the H-T-P drawings of "normal" adults.
Eren, Nurhan
2014-12-01
In this study, we aimed to develop two reliable and valid assessment instruments for investigating the level of difficulties mental health workers experience while working with patients with personality disorders and the attitudes they develop toward the patients. The research was carried out based on the general screening model. The study sample consisted of 332 mental health workers in several mental health clinics of Turkey, with a certain amount of experience in working with personality disorders, who were selected with a random assignment method. In order to collect data, the Personal Information Questionnaire, Difficulty of Working with Personality Disorders Scale (PD-DWS), and Attitudes Towards Patients with Personality Disorders Scale (PD-APS), which are being examined for reliability and validity, were applied. To determine construct validity, the Adjective Check List, Maslach Burnout Inventory, and State and Trait Anxiety Inventory were used. Exploratory factor analysis was used for investigating the structural validity, and Cronbach alpha, Spearman-Brown, and Guttman split-half reliability analyses were utilized to examine the reliability. Also, item reliability and validity computations were carried out by investigating the corrected item-total correlations and discriminative indexes of the items in the scales. For the PD-DWS KMO test, the value was .946; also, a significant difference was found for the Bartlett sphericity test (p<.001). The computed test-retest coefficient reliability was .702; the Cronbach alpha value of the total test score was .952. For PD-APS KMO, the value was .925; a significant difference was found in Bartlett sphericity test (p<.001); the computed reliability coefficient based on continuity was .806; and the Cronbach alpha value of the total test score was .913. Analyses on both scales were based on total scores.
It was found that PD-DWS and PD-APS have good psychometric properties, measuring the structure that is being investigated, are compatible with other scales, have high levels of internal reliability between their items, and are consistent across time. Therefore, it was concluded that both scales are valid and reliable instruments.
Development of a problematic mobile phone use scale for Turkish adolescents.
Güzeller, Cem Oktay; Coşguner, Tolga
2012-04-01
The aim of this study was to evaluate the psychometric properties of the Problematic Mobile Phone Use Scale (PMPUS) for Turkish Adolescents. The psychometric properties of PMPUS were tested in two separate sample groups that consisted of 950 Turkish high school students. The first sample group (n=309) was used to determine the factor structure of the scale. The second sample group (n=461) was used to test data conformity with the identified structure, discriminant validity and concurrent scale validity, internal consistency reliability calculations, and item statistics calculations. The results of exploratory factor analyses indicated that the scale had three factors: interference with negative effect, compulsion/persistence, and withdrawal/tolerance. The results showed that item and construct reliability values yielded satisfactory rates in general for the three-factor construct. On the other hand, the average variance extracted value remained below the scale value for three subscales. The scores for the scale significantly correlated with depression and loneliness. In addition, the discriminant validity value was above the scale in all sub-dimensions except one. Based on these data, the reliability of the PMPUS scale appears to be satisfactory and provides good internal consistency. Therefore, with limited exception, the PMPUS was found to be reliable and valid in the context of Turkish adolescents.
The teamwork in assertive community treatment (TACT) scale: development and validation.
Wholey, Douglas R; Zhu, Xi; Knoke, David; Shah, Pri; Zellmer-Bruhn, Mary; Witheridge, Thomas F
2012-11-01
Team design is meticulously specified for assertive community treatment (ACT) teams, yet performance can vary across ACT teams, even those with high fidelity. By developing and validating the Teamwork in Assertive Community Treatment (TACT) scale, investigators examined the role of team processes in ACT performance. The TACT scale measuring ACT teamwork was developed from a conceptual model grounded in organizational research and adapted for the ACT and mental health context. TACT subscales were constructed after exploratory and confirmatory factor analyses. The reliability, discriminant validity, predictive validity, temporal stability, internal consistency, and within-team agreement were established with surveys from approximately 300 members of 26 Minnesota ACT teams who completed the questionnaire three times, at six-month intervals. Nine TACT subscales emerged from the analyses: exploration, exploitation of new and existing knowledge, psychological safety, goal agreement, conflict, constructive controversy, information accessibility, encounter preparedness, and consumer-centered care. These nine subscales demonstrated fit and temporal stability (confirmatory factor analysis), high internal consistency (Cronbach's alpha), and within-team agreement and between-team differences (rwg and intraclass correlations). Correlational analyses of the subscales revealed that they measure related yet distinctive aspects of ACT team processes, and regression analyses demonstrated predictive validity (encounter preparedness is related to staff outcomes). The TACT scale demonstrated high reliability and validity and can be included in research and evaluation of teamwork in ACT and mental health teams.
Reliability of self-rated tinnitus distress and association with psychological symptom patterns.
Hiller, W; Goebel, G; Rief, W
1994-05-01
Psychological complaints were investigated in two samples of 60 and 138 in-patients suffering from chronic tinnitus. We administered the Tinnitus Questionnaire (TQ), a 52-item self-rating scale which differentiates between dimensions of emotional and cognitive distress, intrusiveness, auditory perceptual difficulties, sleep disturbances and somatic complaints. The test-retest reliability was .94 for the TQ global score and between .86 and .93 for subscales. Three independent analyses were conducted to estimate the split-half reliability (internal consistency), which was only slightly lower than the test-retest values for scales with a relatively small number of items. Reliability was sufficient also on the level of single items. Low correlations between the TQ and the Hopkins Symptom Checklist (SCL-90-R) indicate a distinct quality of tinnitus-related and general psychological disturbances.
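Split-half reliability, as mentioned in the record above, is commonly stepped up to full test length with the Spearman-Brown formula. The sketch below shows that formula only; whether the tinnitus study applied this exact correction is not stated in the abstract, and the function name `spearman_brown` and the example correlation are illustrative assumptions.

```python
def spearman_brown(half_correlation, length_factor=2.0):
    """Spearman-Brown prediction of reliability after lengthening a test.

    `half_correlation` is the correlation between two test halves (or the
    reliability of the shorter test); `length_factor` is how many times
    longer the full test is (2 for the usual split-half correction).
    """
    r = half_correlation
    return length_factor * r / (1 + (length_factor - 1) * r)


# Hypothetical example: two halves of a scale correlate at 0.80.
full_scale_reliability = spearman_brown(0.80)
```

The same formula also explains why subscales with few items tend to show lower internal consistency: shortening a test corresponds to a `length_factor` below 1.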
Reliability Analysis of Uniaxially Ground Brittle Materials
NASA Technical Reports Server (NTRS)
Salem, Jonathan A.; Nemeth, Noel N.; Powers, Lynn M.; Choi, Sung R.
1995-01-01
The fast fracture strength distribution of uniaxially ground, alpha silicon carbide was investigated as a function of grinding angle relative to the principal stress direction in flexure. Both as-ground and ground/annealed surfaces were investigated. The resulting flexural strength distributions were used to verify reliability models and predict the strength distribution of larger plate specimens tested in biaxial flexure. Complete fractography was done on the specimens. Failures occurred from agglomerates, machining cracks, or hybrid flaws that consisted of a machining crack located at a processing agglomerate. Annealing eliminated failures due to machining damage. Reliability analyses were performed using two- and three-parameter Weibull and Batdorf methodologies. The Weibull size effect was demonstrated for machining flaws. Mixed-mode reliability models reasonably predicted the strength distributions of uniaxial flexure and biaxial plate specimens.
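For readers unfamiliar with the Weibull size effect named in the record above, a minimal sketch of the textbook two-parameter form follows. It assumes a uniform stress state and a single flaw population, which is much simpler than the Batdorf and mixed-mode models used in the report; the function names and numeric values are hypothetical.

```python
import math


def weibull_failure_probability(stress, scale, modulus):
    """Two-parameter Weibull probability of failure at a given stress."""
    return 1.0 - math.exp(-((stress / scale) ** modulus))


def size_scaled_strength(strength_1, size_1, size_2, modulus):
    """Weibull size effect: strength at size_2 given strength at size_1.

    `size` is the effective stressed area or volume. Equal failure
    probability requires strength_2 / strength_1 = (size_1 / size_2) ** (1 / m),
    so larger specimens are predicted to be weaker.
    """
    return strength_1 * (size_1 / size_2) ** (1.0 / modulus)


# Hypothetical numbers: a specimen with 16 times the effective area and a
# Weibull modulus of 4 is predicted to have about half the strength.
predicted_strength = size_scaled_strength(100.0, 1.0, 16.0, 4.0)
```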
Multisite Reliability of Cognitive BOLD Data
Brown, Gregory G.; Mathalon, Daniel H.; Stern, Hal; Ford, Judith; Mueller, Bryon; Greve, Douglas N.; McCarthy, Gregory; Voyvodic, Jim; Glover, Gary; Diaz, Michele; Yetter, Elizabeth; Burak Ozyurt, I.; Jorgensen, Kasper W.; Wible, Cynthia G.; Turner, Jessica A.; Thompson, Wesley K.; Potkin, Steven G.
2010-01-01
Investigators perform multi-site functional magnetic resonance imaging studies to increase statistical power, to enhance generalizability, and to improve the likelihood of sampling relevant subgroups. Yet undesired site variation in imaging methods could off-set these potential advantages. We used variance components analysis to investigate sources of variation in the blood oxygen level dependent (BOLD) signal across four 3T magnets in voxelwise and region of interest (ROI) analyses. Eighteen participants traveled to four magnet sites to complete eight runs of a working memory task involving emotional or neutral distraction. Person variance was more than 10 times larger than site variance for five of six ROIs studied. Person-by-site interactions, however, contributed sizable unwanted variance to the total. Averaging over runs increased between-site reliability, with many voxels showing good to excellent between-site reliability when eight runs were averaged and regions of interest showing fair to good reliability. Between-site reliability depended on the specific functional contrast analyzed in addition to the number of runs averaged. Although median effect size was correlated with between-site reliability, dissociations were observed for many voxels. Brain regions where the pooled effect size was large but between-site reliability was poor were associated with reduced individual differences. Brain regions where the pooled effect size was small but between-site reliability was excellent were associated with a balance of participants who displayed consistently positive or consistently negative BOLD responses. Although between-site reliability of BOLD data can be good to excellent, acquiring highly reliable data requires robust activation paradigms, ongoing quality assurance, and careful experimental control. PMID:20932915
Probabilistic simulation of the human factor in structural reliability
NASA Technical Reports Server (NTRS)
Shah, Ashwin R.; Chamis, Christos C.
1991-01-01
Structural failures have occasionally been attributed to human factors in engineering design, analyses, maintenance, and fabrication processes. Every facet of the engineering process is heavily governed by human factors and the degree of uncertainty associated with them. Factors such as societal, physical, professional, psychological, and many others introduce uncertainties that significantly influence the reliability of human performance. Quantifying human factors and associated uncertainties in structural reliability requires: (1) identification of the fundamental factors that influence human performance, and (2) models to describe the interaction of these factors. An approach is being developed to quantify the uncertainties associated with human performance. This approach consists of a multifactor model in conjunction with direct Monte Carlo simulation.
Ehrhart, Mark G.; Torres, Elisa M.; Finn, Natalie K.; Roesch, Scott C.
2016-01-01
There have been recent calls for pragmatic measures to assess factors that influence evidence-based practice (EBP) implementation processes and outcomes. The Implementation Leadership Scale (ILS) is a brief and efficient measure that can be used for research or organizational development purposes to assess leader behaviors and actions that actively support effective EBP implementation. The ILS was developed and validated in mental health settings. This study validates the ILS factor structure with providers in alcohol and other drug (AOD) use treatment agencies. Participants were 323 service providers working in 72 workgroups from three AOD use treatment agencies. Confirmatory factor analyses and reliability analyses were conducted to examine the psychometric properties of the ILS. Convergent and discriminant validity were also assessed. Confirmatory factor analyses demonstrated good fit to the hypothesized first and second order factor structure. Internal consistency reliability was excellent. Convergent and discriminant validity was supported. The ILS psychometric characteristics, reliability, and validity were supported in AOD use treatment agencies. The ILS is a brief and pragmatic measure that can be used for research and practice to assess leadership for EBP implementation in AOD use treatment agencies. PMID:27431044
Aarons, Gregory A; Ehrhart, Mark G; Torres, Elisa M; Finn, Natalie K; Roesch, Scott C
2016-09-01
There have been recent calls for pragmatic measures to assess factors that influence evidence-based practice (EBP) implementation processes and outcomes. The Implementation Leadership Scale (ILS) is a brief and efficient measure that can be used for research or organizational development purposes to assess leader behaviors and actions that actively support effective EBP implementation. The ILS was developed and validated in mental health settings. This study validates the ILS factor structure with providers in alcohol and other drug (AOD) use treatment agencies. Participants were 323 service providers working in 72 workgroups from three AOD use treatment agencies. Confirmatory factor analyses and reliability analyses were conducted to examine the psychometric properties of the ILS. Convergent and discriminant validity were also assessed. Confirmatory factor analyses demonstrated good fit to the hypothesized first and second order factor structure. Internal consistency reliability was excellent. Convergent and discriminant validity was supported. The ILS psychometric characteristics, reliability, and validity were supported in AOD use treatment agencies. The ILS is a brief and pragmatic measure that can be used for research and practice to assess leadership for EBP implementation in AOD use treatment agencies. Copyright © 2016 Elsevier Inc. All rights reserved.
Charalambous, A; Molassiotis, A
2017-01-01
The Short Form Chronic Respiratory Questionnaire (SF-CRQ) is frequently used in patients with obstructive pulmonary disease and it has demonstrated excellent psychometric properties. Since there is no psychometric information for its use with lung cancer patients, this study explored its validity and reliability in this population. Forty-six patients were assessed at two time points (with a 4-week interval) using the SF-CRQ, the modified Borg Scale, five numerical rating scales related to Perceived Severity of Breathlessness, and the Hospital Anxiety and Depression Scale. Internal consistency reliability was investigated by Cronbach's alpha reliability coefficient and test-retest reliability by Spearman-Brown reliability coefficient (P). Content validity was explored, as was convergent validity by Pearson's correlation coefficient between the SF-CRQ and the conceptually similar scales mentioned above. A principal component factor analysis was performed. The internal consistency was high [α = 0.88 (baseline) and 0.91 (after 1 month)]. The SF-CRQ had good stability with test-retest reliability ranging from r = 0.64 to 0.78, P < 0.001. Factor analysis suggests a single construct in this population. The preliminary data analyses supported the convergent, content, and construct validity of the SF-CRQ, providing promising evidence that this can be a valid and reliable instrument for the assessment of quality of life related to breathlessness in lung cancer patients. © 2015 John Wiley & Sons Ltd.
ERIC Educational Resources Information Center
Davies, Patrick T.; Forman, Evan M.; Rasi, Jennifer A.; Stevens, Kristopher I.
2002-01-01
Evaluated new self-report measure assessing children's strategies for preserving emotional security in context of interparental conflict. Factor analyses of the Security in the Interparental Subsystem (SIS) Scale supported a 7-factor solution. The SIS demonstrated satisfactory internal consistency and test-retest reliability. Support for test…
ERIC Educational Resources Information Center
Teten, Andra L.; Hall, Gordon C. Nagayama; Pacifici, Caesar
2005-01-01
The psychometric properties of the Acceptance of Coercive Sexual Behavior (ACSB), a multimedia measure of adolescent dating attitudes, were examined. The ACSB is an interactive instrument that uses video vignettes to depict adolescent dating situations. Analyses of the measure's factor structure, internal consistency, test-retest reliability, and…
ERIC Educational Resources Information Center
Gokce, Asiye Toker
2017-01-01
This study aimed to develop a valid and reliable measurement tool to enhance ethical evaluation literature. The tool consists of two subscales named "Bases of ethical evaluation," and "Grounds of ethical evaluation." In order to determine the factor structure of the scales, both exploratory and confirmatory factor analyses were…
Finn, Natalie K; Torres, Elisa M; Ehrhart, Mark G; Roesch, Scott C; Aarons, Gregory A
2016-08-01
The Implementation Leadership Scale (ILS) is a brief, pragmatic, and efficient measure that can be used for research or organizational development to assess leader behaviors and actions that actively support effective implementation of evidence-based practices (EBPs). The ILS was originally validated with mental health clinicians. This study validates the ILS factor structure with providers in community-based organizations (CBOs) providing child welfare services. Participants were 214 service providers working in 12 CBOs that provide child welfare services. All participants completed the ILS, reporting on their immediate supervisor. Confirmatory factor analyses were conducted to examine the factor structure of the ILS. Internal consistency reliability and measurement invariance were also examined. Confirmatory factor analyses showed acceptable fit to the hypothesized first- and second-order factor structure. Internal consistency reliability was strong and there was partial measurement invariance for the first-order factor structure when comparing child welfare and mental health samples. The results support the use of the ILS to assess leadership for implementation of EBPs in child welfare organizations. © The Author(s) 2016.
NASA Astrophysics Data System (ADS)
Ozaki, Hirokazu; Kara, Atsushi; Cheng, Zixue
2012-05-01
In this article, we investigate the reliability of M-for-N (M:N) shared protection systems. We focus on the reliability that is perceived by an end user of one of N units. We assume that any failed unit is instantly replaced by one of the M units (if available). We describe the effectiveness of such a protection system in a quantitative manner under the condition that the failed units are not repairable. Mathematical analysis gives the closed-form solution of the reliability and mean time to failure (MTTF). We also analyse several numerical examples of the reliability and MTTF. This result can be applied, for example, to the analysis and design of an integrated circuit consisting of redundant backup components. In such a device, repairing a failed component is unrealistic. The analysis provides useful information for the design for general shared protection systems in which the failed units are not repaired.
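The abstract above does not give its closed-form result, so the following is only a sketch under assumptions that are not taken from the article: unit lifetimes are exponential with a common rate, idle spares do not fail, and replacement is instantaneous. Under those assumptions the spares are used up after M failures among N active units, after which the user's own unit runs to failure, giving MTTF = M/(N·λ) + 1/λ. The function names are hypothetical, and a small event-driven simulation is included as a cross-check.

```python
import random


def user_mttf(m_spares, n_units, failure_rate):
    """MTTF seen by one user, assuming exponential lifetimes and cold spares."""
    return m_spares / (n_units * failure_rate) + 1.0 / failure_rate


def simulate_user_lifetime(m_spares, n_units, failure_rate, rng):
    """Simulate one user's time to unrecoverable failure (no repair)."""
    time = 0.0
    spares = m_spares
    other_active = n_units - 1  # working units other than the user's
    while True:
        active = other_active + 1
        # Time to the next failure among all currently active units.
        time += rng.expovariate(active * failure_rate)
        user_failed = rng.random() < 1.0 / active
        if spares > 0:
            spares -= 1  # failed unit is replaced instantly
        elif user_failed:
            return time  # no spare left for the user's unit
        else:
            other_active -= 1  # another user's unit is lost for good


rng = random.Random(12345)
samples = [simulate_user_lifetime(2, 4, 1.0, rng) for _ in range(20000)]
simulated_mttf = sum(samples) / len(samples)
analytic_mttf = user_mttf(2, 4, 1.0)
```

Under these assumptions, adding spares helps each user less as N grows, because the shared pool is drained at a rate proportional to N.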
Development and Psychometric Properties of the OCD Family Functioning (OFF) Scale
Stewart, S. Evelyn; Hu, Yu-Pei; Hezel, Dianne M.; Proujansky, Rachel; Lamstein, Abby; Walsh, Casey; Ben-Joseph, Elana Pearl; Gironda, Christina; Jenike, Michael; Geller, Daniel A.; Pauls, David L.
2013-01-01
Obsessive–compulsive disorder (OCD) influences not only patients but also family members. Although the construct of family accommodation has received attention in OCD literature, no measures of overall family functioning are currently available. The OCD Family Functioning (OFF) Scale was developed to explore the context, extent, and perspectives of functional impairment in families affected by OCD. It is a three-part, self-report measure capturing independent perspectives of patients and relatives. A total of 400 subjects were enrolled between 2008 and 2010 from specialized OCD clinics and OCD research studies. Psychometric properties of this scale were examined including internal consistency, test–retest reliability, convergent and divergent validity, and exploratory factor analyses. Both patient and relative versions of the OFF Scale demonstrated excellent internal consistency (Cronbach’s alpha coefficient = 0.96). The test–retest reliability was also adequate (ICC = 0.80). Factor analyses determined that the OFF Scale comprises a family functioning impairment factor and four OCD symptom factors that were consistent with previously reported OCD symptom dimension studies. The OFF Scale demonstrated excellent convergent validity with the Family Accommodation Scale and the Work and Social Adjustment Scale. Information gathered regarding emotional impact and family role-specific impairment was novel and not captured by other examined scales. The OFF Scale is a reliable and valid instrument for the clinical and research assessment of family functioning in pediatric and adult OCD. This will facilitate the exploration of family functioning impairment as a potential risk factor, as a moderator and as a treatment outcome measure in OCD. PMID:21553962
Psychometric properties of the Thought-Action Fusion Scale in a Turkish sample.
Yorulmaz, Orçun; Yilmaz, A Esin; Gençöz, Tülin
2004-10-01
The aim of the present study was to reveal the cross-cultural utility of the Thought-Action Fusion Scale (TAFS; J. Anxiety Disord. 10 (1996) 379). Thought-action fusion (TAF) refers to the tendency to overvalue the significance and the consequences of thoughts. Two hundred and fifty one undergraduate Turkish students participated in the current study. The reliability and validity analyses of the Turkish version of the scale indicated that the TAFS had adequate psychometric properties in a Turkish sample. Consistent with the original TAF, the Turkish version of TAFS revealed two subscales as TAF-Likelihood and TAF-Morality. Reliability analysis showed that TAF Scale and its factors had adequate internal consistencies and split-half reliability coefficients. Confirming the expectations, TAFS scores were found to be significantly and positively correlated with obsessive-compulsive symptoms, responsibility, and guilt measures. Moreover, it was found that people with high obsessive-compulsive symptoms had higher TAFS scores than those with low symptoms.
Sexual Assertiveness Scale (SAS) for women: development and validation.
Morokoff, P J; Quina, K; Harlow, L L; Whitmire, L; Grimley, D M; Gibson, P R; Burkholder, G J
1997-10-01
Four studies were conducted to develop and validate the Sexual Assertiveness Scale (SAS), a measure of sexual assertiveness in women that consists of factors measuring initiation, refusal, and pregnancy-sexually transmitted disease prevention assertiveness. A total of 1,613 women from both university and community populations were studied. Confirmatory factor analyses demonstrated that the 3 factors remained stable across samples of university and community women. A structural model was tested in 2 samples, indicating that sexual experience, anticipated negative partner response, and self-efficacy are consistent predictors of sexual assertiveness. Sexual assertiveness was found to be somewhat related to relationship satisfaction, power, and length. The community sample was retested after 6 months and 1 year to establish test-retest reliability. The SAS provides a reliable instrument for assessing and understanding women's sexual assertiveness.
Santelmann, Hanno; Franklin, Jeremy; Bußhoff, Jana; Baethge, Christopher
2015-11-01
Schizoaffective disorder is a frequent diagnosis, and its reliability is subject to ongoing discussion. We compared the diagnostic reliability of schizoaffective disorder with its main differential diagnoses. We systematically searched Medline, Embase, and PsycInfo for all studies on the test-retest reliability of the diagnosis of schizoaffective disorder as compared with schizophrenia, bipolar disorder, and unipolar depression. We used meta-analytic methods to describe and compare Cohen's kappa as well as positive and negative agreement. In addition, multiple pre-specified and post hoc subgroup and sensitivity analyses were carried out. Out of 4,415 studies screened, 49 studies were included. Test-retest reliability of schizoaffective disorder was consistently lower than that of schizophrenia (in 39 out of 42 studies), bipolar disorder (27/33), and unipolar depression (29/35). The mean difference in kappa between schizoaffective disorder and the other diagnoses was approximately 0.2, and mean Cohen's kappa for schizoaffective disorder was 0.50 (95% confidence interval: 0.40-0.59). While findings were unequivocal and homogeneous for schizoaffective disorder's diagnostic reliability relative to its three main differential diagnoses (dichotomous: smaller versus larger), heterogeneity was substantial for continuous measures, even after subgroup and sensitivity analyses. In clinical practice and research, schizoaffective disorder's comparatively low diagnostic reliability should lead to increased efforts to correctly diagnose the disorder. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
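Cohen's kappa, the agreement statistic pooled in the meta-analysis above, corrects observed agreement for the agreement expected by chance: kappa = (p_o − p_e) / (1 − p_e). A minimal sketch for a square agreement table follows; the function name `cohens_kappa` and the counts are hypothetical and unrelated to the included studies.

```python
def cohens_kappa(table):
    """Cohen's kappa from a square table of counts.

    table[i][j] is the number of cases given category i at the first
    assessment and category j at the second.
    """
    size = len(table)
    total = sum(sum(row) for row in table)
    # Observed agreement: proportion of cases on the diagonal.
    observed = sum(table[i][i] for i in range(size)) / total
    row_totals = [sum(row) for row in table]
    col_totals = [sum(table[i][j] for i in range(size)) for j in range(size)]
    # Chance agreement from the product of the marginal proportions.
    expected = sum(r * c for r, c in zip(row_totals, col_totals)) / total ** 2
    return (observed - expected) / (1 - expected)


# Hypothetical test-retest table: diagnosis present/absent at two time points.
kappa = cohens_kappa([[20, 5], [10, 15]])
```

In this toy table the raw agreement is 70%, but half of that would be expected by chance, so kappa is about 0.4.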
Reliability, validity, and significance of assessment of sense of contribution in the workplace.
Takaki, Jiro; Taniguchi, Toshiyo; Fujii, Yasuhito
2014-01-29
The purpose of this study was to assess the validity and reliability of the Sense of Contribution Scale (SCS), a newly developed, 7-item questionnaire used to measure sense of contribution in the workplace. Workers at 272 organizations answered questionnaires that included the SCS. Because of non-participation or missing data, the number of subjects included in the analyses for internal consistency and validity varied from 1,675 to 2,462 (response rates 54.6%-80.2%). Fifty-four workers were included in the analysis of test-retest reliability (response rate, 77.1%). The SCS showed high internal consistency (Cronbach's α coefficients in men and women were 0.85 and 0.86, respectively) and test-retest reliability (intraclass correlation coefficient = 0.91). Significant (p < 0.001), positive, moderate correlations were found between the SCS score and scores for organization-based self-esteem and work engagement in both genders, which support the SCS's convergent and discriminant validity. The criterion validity of the SCS was supported by the finding that in both genders, the SCS scores were significantly (p < 0.05) and inversely associated with psychological distress and sleep disturbance in crude and in multivariable analyses that adjusted for demographics, organization-based self-esteem, work engagement, effort-reward ratio, workplace bullying, and procedural and interactional justice. The SCS is a psychometrically satisfactory measure of sense of contribution in the workplace. The SCS provides a new and useful instrument to measure sense of contribution, which is independently associated with mental health in workers, for studies in organizational science, occupational health psychology and occupational medicine.
De Smedt, Delphine; Clays, Els; Doyle, Frank; Kotseva, Kornelia; Prugger, Christof; Pająk, Andrzej; Jennings, Catriona; Wood, David; De Bacquer, Dirk
2013-09-01
To investigate the validity and reliability of the EuroQol-5D (EQ-5D), the 12-item Short-Form Health Survey (SF-12v2), and the Hospital Anxiety and Depression Scale (HADS) in a stable coronary population. Cross-sectional study EUROASPIRE III. Quality of life data (QoL) were available on 8745 patients hospitalized for coronary artery bypass graft (CABG), percutaneous coronary intervention (PCI), acute myocardial infarction (AMI), or myocardial ischemia. They were interviewed and examined at least 6 months after their hospital admission. Reliability and validity of the 3 instruments were tested. Internal consistency, and discriminative, convergent, criterion and construct validity were assessed. Cronbach's alpha indicated good internal consistency for all measures (0.73 to 0.87). Discriminative validity analyses confirmed significant QoL differences between known groups: age, gender, educational level. In addition, all hypothesized correlations between QoL constructs (convergent validity) and items (criterion validity) were confirmed with significant correlations. Confirmatory factor analyses indicated good construct validity for HADS and SF-12v2. On country-specific level, results were roughly similar. The EQ-5D as well as the SF-12v2 and the HADS are reliable and valid instruments for use in a stable coronary population, both on aggregate European level and on country-specific level. However, our results must be generalized with caution, because EUROASPIRE III patients might not be representative for all patients with stable coronary heart disease. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Hutchinson, A; Cooper, K L; Dean, J E; McIntosh, A; Patterson, M; Stride, C B; Laurence, B E; Smith, C M
2006-10-01
To explore the factor structure, reliability, and potential usefulness of a patient safety climate questionnaire in UK health care. Four acute hospital trusts and nine primary care trusts in England. The questionnaire used was the 27-item Teamwork and Safety Climate Survey. Thirty-three healthcare staff commented on the wording and relevance. The questionnaire was then sent to 3650 staff within the 13 NHS trusts, seeking to achieve at least 600 responses as the basis for the factor analysis. 1307 questionnaires were returned (36% response). Factor analyses and reliability analyses were carried out on 897 responses from staff involved in direct patient care, to explore how consistently the questions measured the underlying constructs of safety climate and teamwork. Some questionnaire items related to multiple factors or did not relate strongly to any factor. Five items were discarded. Two teamwork factors were derived from the remaining 11 teamwork items and three safety climate factors were derived from the remaining 11 safety items. Internal consistency reliabilities were satisfactory to good (Cronbach's alpha ≥ 0.69 for all five factors). This is one of the few studies to undertake a detailed evaluation of a patient safety climate questionnaire in UK health care and possibly the first to do so in primary as well as secondary care. The results indicate that a 22-item version of this safety climate questionnaire is usable as a research instrument in both settings, but also demonstrate a more general need for thorough validation of safety climate questionnaires before widespread usage.
Reliability, Validity, and Significance of Assessment of Sense of Contribution in the Workplace
Takaki, Jiro; Taniguchi, Toshiyo; Fujii, Yasuhito
2014-01-01
The purpose of this study was to assess the validity and reliability of the Sense of Contribution Scale (SCS), a newly developed, 7-item questionnaire used to measure sense of contribution in the workplace. Workers at 272 organizations answered questionnaires that included the SCS. Because of non-participation or missing data, the number of subjects included in the analyses for internal consistency and validity varied from 1,675 to 2,462 (response rates 54.6%–80.2%). Fifty-four workers were included in the analysis of test–retest reliability (response rate, 77.1%). The SCS showed high internal consistency (Cronbach’s α coefficients in men and women were 0.85 and 0.86, respectively) and test–retest reliability (intraclass correlation coefficient = 0.91). Significant (p < 0.001), positive, moderate correlations were found between the SCS score and scores for organization-based self-esteem and work engagement in both genders, which support the SCS’s convergent and discriminant validity. The criterion validity of the SCS was supported by the finding that in both genders, the SCS scores were significantly (p < 0.05) and inversely associated with psychological distress and sleep disturbance in crude and in multivariable analyses that adjusted for demographics, organization-based self-esteem, work engagement, effort–reward ratio, workplace bullying, and procedural and interactional justice. The SCS is a psychometrically satisfactory measure of sense of contribution in the workplace. The SCS provides a new and useful instrument to measure sense of contribution, which is independently associated with mental health in workers, for studies in organizational science, occupational health psychology and occupational medicine. PMID:24481035
Aguiar, A S; Bataglion, C; Visscher, C M; Bevilaqua Grossi, D; Chaves, T C
2017-07-01
Fear of movement (kinesiophobia) seems to play an important role in the development of chronic pain. However, for temporomandibular disorders (TMD), there is a scarcity of studies about this topic. The Tampa Scale for Kinesiophobia for TMD (TSK/TMD) is the most widely used instrument to measure fear of movement and it is not available in Brazilian Portuguese. The purpose of this study was to culturally adapt the TSK/TMD to Brazilian Portuguese and to assess its psychometric properties regarding internal consistency, reliability, and construct and structural validity. A total of 100 female patients with chronic TMD participated in the validation process of the TSK/TMD-Br. The intraclass correlation coefficient (ICC) was used for statistical analysis of reliability (test-retest), Cronbach's alpha for internal consistency, Spearman's rank correlation for construct validity and confirmatory factor analysis (CFA) for structural validity. CFA endorsed the pre-specified model with two domains and 12 items (Activity Avoidance - AA/Somatic Focus - SF), and all items obtained a factor loading greater than 0.4. Acceptable levels of reliability were found (ICC > 0.75) for all questions and domains of the TSK/TMD-Br. For internal consistency, a Cronbach's α of 0.78 was found for both domains. Moderate correlations (0.40 < r < 0.60) were observed for 84% of the analyses conducted between TSK/TMD-Br scores and catastrophising, depression and jaw functional limitation. The 12-item, two-factor TSK/TMD-Br demonstrated sound psychometric properties (transcultural validity, reliability, internal consistency and structural validity). The instrument can therefore be used in clinical settings and for research purposes. © 2017 John Wiley & Sons Ltd.
NASA Technical Reports Server (NTRS)
Sproles, Darrell W.; Bavuso, Salvatore J.
1994-01-01
The Hybrid Automated Reliability Predictor (HARP) integrated Reliability (HiRel) tool system for reliability/availability prediction offers a toolbox of integrated reliability/availability programs that can be used to customize the user's application in a workstation or nonworkstation environment. HiRel consists of interactive graphical input/output programs and four reliability/availability modeling engines that provide analytical and simulative solutions to a wide host of highly reliable fault-tolerant system architectures and is also applicable to electronic systems in general. The tool system was designed at the outset to be compatible with most computing platforms and operating systems and some programs have been beta tested within the aerospace community for over 8 years. This document is a user's guide for the HiRel graphical postprocessor program HARPO (HARP Output). HARPO reads ASCII files generated by HARP. It provides an interactive plotting capability that can be used to display alternate model data for trade-off analyses. File data can also be imported to other commercial software programs.
Development of the Seasonal Migrant Agricultural Worker Stress Scale in Sanliurfa, Southeast Turkey.
Simsek, Zeynep; Ersin, Fatma; Kirmizitoprak, Evin
2016-01-01
Stress is one of the main causes of health problems, especially mental disorders. These health problems cause a significant loss of ability and increase costs. It is estimated that by 2020, mental disorders will constitute 15% of the total disease burden, and depression will rank second only to ischemic heart disease. Environmental experiences are paramount in increasing the liability to mental disorders in those who constantly face sustained high levels of stress. The objective of this study was to develop a stress scale for seasonal migrant agricultural workers aged 18 years and older. The sample consisted of 270 randomly selected seasonal migrant agricultural workers. The average age of the participants was 33.1 ± 14 years, and 50.7% were male. The Cronbach alpha coefficient and test-retest methods were used for reliability analyses. Factor analysis was performed to assess the construct validity of the scale, and the Kaiser-Meyer-Olkin coefficient and Bartlett test were used to determine the suitability of the data for factor analysis. In the reliability analyses, the Cronbach alpha coefficient of internal consistency was calculated as .96, and the test-retest reliability coefficient was .81. In the exploratory factor analysis for validity of the scale, four factors were obtained, and the factors represented workplace physical conditions (25.7% of the total variance), workplace psychosocial and economic factors (19.3% of the total variance), workplace health problems (15.2% of the total variance), and school problems (10.1% of the total variance). The four factors explained 70.3% of the total variance. As a result of the expert opinions and analyses, a stress scale with 48 items was developed. The highest possible score on the scale was 144, and the lowest was 0. Higher scores indicate higher stress levels. The findings show that the scale is a valid and reliable assessment instrument that can be used in epidemiological research and planning interventions.
Damschroder, Laura J; Goodrich, David E; Kim, Hyungjin Myra; Holleman, Robert; Gillon, Leah; Kirsh, Susan; Richardson, Caroline R; Lutes, Lesley D
2016-09-01
Practical and valid instruments are needed to assess fidelity of coaching for weight loss. The purpose of this study was to develop and validate the ASPIRE Coaching Fidelity Checklist (ACFC). Classical test theory guided ACFC development. Principal component analyses were used to determine item groupings. Psychometric properties, internal consistency, and inter-rater reliability were evaluated for each subscale. Criterion validity was tested by predicting weight loss as a function of coaching fidelity. The final 19-item ACFC consists of two domains (session process and session structure) and five subscales (sets goals and monitors progress, assesses and personalizes self-regulatory content, manages the session, creates a supportive and empathetic climate, and stays on track). Four of five subscales showed high internal consistency (Cronbach alphas > 0.70) for group-based coaching; only two of five subscales had high internal consistency for phone-based coaching. All five subscales were positively and significantly associated with weight loss for group- but not for phone-based coaching. The ACFC is a reliable and valid instrument that can be used to assess fidelity and guide skill-building for weight management interventionists.
[Psychometric properties of a self-efficacy scale for physical activity in Brazilian adults].
Rech, Cassiano Ricardo; Sarabia, Tais Taiana; Fermino, Rogério César; Hallal, Pedro Curi; Reis, Rodrigo Siqueira
2011-04-01
To test the validity and reliability of a self-efficacy scale for physical activity (PA) in Brazilian adults. A self-efficacy scale was applied jointly with a multidimensional questionnaire through face-to-face interviews with 1,418 individuals (63.4% women) aged ≥ 18 years. The scale was submitted to validity (factorial and construct) and reliability analysis (internal consistency and temporal stability). A test-retest procedure was conducted with 74 individuals to evaluate temporal stability. Exploratory factor analyses revealed two independent factors: self-efficacy for walking and self-efficacy for moderate and vigorous PA (MVPA). Together, these two factors explained 65.4% of the total variance of the scale (20.9% and 44.5% for walking and MVPA, respectively). Cronbach's alpha values were 0.83 for walking and 0.90 for MVPA, indicating high internal consistency. Both factors were significantly and positively correlated (rho ≥ 0.17, P < 0.001) with quality of life indicators (health perception, self-satisfaction, and energy for daily activities), indicating an adequate construct validity. The scale's validity, internal consistency, and reliability were adequate to evaluate self-efficacy for PA in Brazilian adults.
Reliability and Validity of the Work and Well-Being Inventory (WBI) for Employees.
Vendrig, A A; Schaafsma, F G
2018-06-01
Purpose: The purpose of this study was to assess the psychometric properties of the Work and Wellbeing Inventory (WBI) (in Dutch: VAR-2), a screening tool used within occupational health care and rehabilitation. Our research question focused on the reliability and validity of this inventory. Methods: Over the years, seven different samples of workers, patients and sick-listed workers, varying in size between 89 and 912 participants (total: 2514), were used to measure the test-retest reliability, the internal consistency, the construct and concurrent validity, and the criterion and predictive validity. Results: The 13 scales displayed good internal consistency and test-retest reliability. The construct validity of the WBI could clearly be demonstrated in both patients and healthy workers. Confirmatory factor analyses revealed a CFI >.90 for all scales. The depression scale predicted future work absenteeism (>6 weeks) because of a common mental disorder in healthy workers. The job strain scale and the illness behavior scale predicted long-term absenteeism (>3 months) in workers with short-term absenteeism. The illness behavior scale moderately predicted return to work in rehab patients attending an intensive multidisciplinary program. Conclusions: The WBI is a valid and reliable tool for occupational health practitioners to screen for risk factors for prolonged or future sickness absence. With this tool they will have reliable indications for further advice and interventions to restore work ability.
Time-Tagged Risk/Reliability Assessment Program for Development and Operation of Space System
NASA Astrophysics Data System (ADS)
Kubota, Yuki; Takegahara, Haruki; Aoyagi, Junichiro
We have investigated a new method of risk/reliability assessment for the development and operation of space systems. It is difficult to evaluate the risk of spacecraft because of their long operating times, the lack of maintenance opportunities, and the difficulty of testing under ground conditions. Conventional methods include FMECA, FTA, and ETA, among others. These are not sufficient to assess anomalies chronologically, and sharing information during R&D is also a problem. A new method of risk and reliability assessment, T-TRAP (Time-tagged Risk/Reliability Assessment Program), is proposed as a management tool for the development and operation of space systems. T-TRAP, which consists of time-resolved Fault Tree and Criticality Analyses, helps the responsible personnel quickly identify the failure cause and decide on corrective actions when an anomaly occurs in the system. This paper describes the T-TRAP method and its availability.
Dettlaff, Alan J; Christopher Graham, J; Holzman, Jesse; Baumann, Donald J; Fluke, John D
2015-11-01
When children come to the attention of the child welfare system, they become involved in a decision-making process in which decisions are made that have a significant effect on their future and well-being. The decision to remove children from their families is particularly complex; yet surprisingly little is understood about this decision-making process. This paper presents the results of a study to develop an instrument to explore, at the caseworker level, the context of the removal decision, with the objective of understanding the influence of the individual and organizational factors on this decision, drawing from the Decision Making Ecology as the underlying rationale for obtaining the measures. The instrument was based on the development of decision-making scales used in prior decision-making studies and administered to child protection caseworkers in several states. Analyses included reliability analyses, principal components analyses, and inter-correlations among the resulting scales. For one scale regarding removal decisions, a principal components analysis resulted in the extraction of two components, jointly identified as caseworkers' decision-making orientation, described as (1) an internal reference to decision-making and (2) an external reference to decision-making. Reliability analyses demonstrated acceptable to high internal consistency for 9 of the 11 scales. Full details of the reliability analyses, principal components analyses, and inter-correlations among the seven scales are discussed, along with implications for practice and the utility of this instrument to support the understanding of decision-making in child welfare. Copyright © 2015 Elsevier Ltd. All rights reserved.
Measurement of perceived competence in Dutch children with mild intellectual disabilities.
Elias, C; Vermeer, A; 't Hart, H
2005-04-01
Little research has been conducted on the perceived competence of children with mild intellectual disabilities (MID). One of the reasons for the marked absence of research appears to be the lack of reliable and clearly valid measurement instruments for this particular group of children. In the present study, it was examined whether a pictorial scale originally designed to measure perceived competence in typically developing children could successfully be used with children with MID. The pictorial scale was administered to a group of 106 children with MID. The construct validity, reliability and stability of the scale were investigated. The results of the exploratory factor analyses and the confirmatory factor analyses supported the conceptual framework proposed. The construct validity was also supported by the pattern of intercorrelations between the subscales. The scale had adequate internal consistency and the stability analyses showed sufficient stability across a 4-month period. The findings show the psychometric properties of the pictorial scale to justify its use with children with MID.
Measuring family-centred practices of professionals in early intervention services in Taiwan.
Kang, L-J; Palisano, R J; Simeonsson, R J; Hwang, A-W
2017-09-01
Family-centred practices emphasize professional supports for forming partnerships with families in early intervention. The Measure of Processes of Care for Service Providers (MPOC-SP) measures the perceptions of paediatric service providers in supporting children and families. This study aimed to establish reliability of the Chinese version of the MPOC-SP (C-MPOC-SP) and to examine professional perceptions of family-centred practices in relation to professional discipline and years of experience. A convenience sample of 94 physical therapists, occupational therapists, speech-language pathologists, social workers and early childhood educators completed the C-MPOC-SP. Thirty-seven professionals completed the measure a second time within 2-4 weeks for test-retest reliability. Internal consistency and test-retest reliability were examined by Cronbach's α and intra-class correlation coefficient. Comparisons were made across professional disciplines by multivariate analyses of variance followed by analyses of variance. Relationships between years of experience and ratings of family-centred practices were examined by Pearson's correlation coefficients (r). Cronbach's α for items on each of the four scales of the C-MPOC-SP ranged from 0.80 to 0.92, indicating adequate internal consistency. Intra-class correlation coefficient between the initial and repeat completion of the C-MPOC-SP for each scale ranged from 0.56 to 0.77, indicating adequate to excellent test-retest reliability. Mean ratings for the Communicating Specific Information were significantly higher for physical therapists, occupational therapists and speech-language pathologists than for social workers (P = 0.001). The C-MPOC-SP scores were positively correlated with years of experience for all four scales (r = 0.23-0.38; P < 0.05). 
This study established adequate internal consistency and adequate to excellent test-retest reliability of the C-MPOC-SP in measuring perceptions of family centeredness of early intervention service providers. Cross-discipline differences were found in communicating specific information about the child. Higher perceptions of family centeredness were associated with more years of experience. The results support the utility of the C-MPOC-SP in professional education and programme evaluation of early intervention services in Taiwan. © 2017 John Wiley & Sons Ltd.
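Test-retest reliability of the kind reported here is often summarised with an intraclass correlation coefficient computed from a two-way ANOVA decomposition. The abstract does not state which ICC form was used, so the sketch below assumes the two-way random-effects, absolute-agreement, single-measure form, often written ICC(2,1); the function name `icc_2_1` and the scores are hypothetical.

```python
def icc_2_1(rows):
    """ICC(2,1): two-way random effects, absolute agreement, single measure.

    rows: one list per respondent, one score per occasion (or rater).
    """
    n, k = len(rows), len(rows[0])
    grand = sum(map(sum, rows)) / (n * k)
    row_means = [sum(r) / k for r in rows]
    col_means = [sum(r[j] for r in rows) / n for j in range(k)]

    ss_rows = k * sum((m - grand) ** 2 for m in row_means)   # between respondents
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)   # between occasions
    ss_total = sum((x - grand) ** 2 for r in rows for x in r)
    ss_err = ss_total - ss_rows - ss_cols                    # residual

    msr = ss_rows / (n - 1)
    msc = ss_cols / (k - 1)
    mse = ss_err / ((n - 1) * (k - 1))
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)


# Hypothetical scores: every respondent scores exactly one point higher at retest.
shifted = [[1, 2], [2, 3], [3, 4]]
icc_shifted = icc_2_1(shifted)
print(round(icc_shifted, 3))
```

Because this form measures absolute agreement, a uniform shift between occasions lowers the coefficient even though the rank order of respondents is perfectly preserved; a consistency-type ICC would not penalise that shift.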
ERIC Educational Resources Information Center
Huang, Xiaozhong; Li, Weijian; Sun, Binghai; Chen, Haide; Davis, Mark H.
2012-01-01
Psychometric properties of the Chinese version of Interpersonal Reactivity Index (C-IRI) were examined in a sample of 930 teachers in China. The subscales of the C-IRI demonstrated acceptable to good internal consistency and test-retest reliability. Exploratory and confirmatory factor analyses revealed a stable four-factor structure across three…
Assessing guilt toward the former spouse.
Wietzker, Anne; Buysse, Ann
2012-09-01
Divorce is often accompanied by feelings of guilt toward the former spouse. So far, no scale has been available to measure such feelings. For this purpose, the authors developed the Guilt in Separation Scale (GiSS). Content validity was assured by using experts and lay experts to generate and select items. Exploratory analyses were run on samples of 214 divorced individuals and confirmatory analyses on 458 individuals who were in the process of divorcing. Evidence was provided for the reliability and construct validity of the GiSS. The internal consistency was high (α = .91), as were the 6-month and 12-month test-retest reliabilities (r = .72 and r = .76, respectively). The GiSS was related to shame, regret, compassion, locus of cause of the separation, unfaithfulness, and psychological functioning. PsycINFO Database Record (c) 2012 APA, all rights reserved.
Hickman, Ronald L.; Pinto, Melissa D.; Lee, Eunsuk; Daly, Barbara J.
2015-01-01
The Decision Regret Scale (DRS) is a five-item instrument that captures an individual’s regret associated with a healthcare decision. Cross-sectional data were collected from 109 cardiac patients who decided to receive an internal cardioverter defibrillator (ICD). Exploratory and confirmatory factor analyses and assessments of internal consistency reliability (α = .86) and discriminant validity established the DRS as a reliable and valid measure of decision regret in ICD recipients. The DRS, a psychometrically sound instrument, has relevance for clinicians and researchers vested in optimizing the decisional outcomes of ICD recipients. Future research is needed to examine the reliability and validity of the DRS in a larger and more diverse sample of ICD recipients. PMID:22679707
Development and psychometric properties of the Student Worry Questionnaire-30.
Osman, A; Gutierrez, P M; Downs, W R; Kopper, B A; Barrios, F X; Haraburda, C M
2001-02-01
Described are the development and initial psychometric properties (Ns = 50 and 188) of a self-report measure, the Student Worry Questionnaire-30, for use with college undergraduates. Exploratory principal components analyses (Ns = 388, 350, and 396) with oblimin rotation indicated six domains: worrisome thinking, financial-related concerns, significant others' well-being, social adequacy concerns, academic concerns, and general anxiety symptoms. The total score and scale scores showed internal consistency of .80 to .94. Also, test-retest reliability analyses (.75 to .80) support consistency of responses over 4 weeks. Strong evidence for convergent validity was indicated. Confirmatory factor analysis confirmed the fit of the 6-factor oblique model. Limitations of the present studies and directions for research are discussed.
NASA Astrophysics Data System (ADS)
Fisher, W. P., Jr.; Elbaum, B.; Coulter, A.
2010-07-01
Reliability coefficients indicate the proportion of total variance attributable to differences among measures separated along a quantitative continuum by a testing, survey, or assessment instrument. Reliability is usually considered to be influenced by both the internal consistency of a data set and the number of items, though textbooks and research papers rarely evaluate the extent to which these factors independently affect the data in question. Probabilistic formulations of the requirements for unidimensional measurement separate consistency from error by modelling individual response processes instead of group-level variation. The utility of this separation is illustrated via analyses of small sets of simulated data, and of subsets of data from a 78-item survey of over 2,500 parents of children with disabilities. Measurement reliability ultimately concerns the structural invariance specified in models requiring sufficient statistics, parameter separation, unidimensionality, and other qualities that historically have made quantification simple, practical, and convenient for end users. The paper concludes with suggestions for a research program aimed at focusing measurement research more on the calibration and wide dissemination of tools applicable to individuals, and less on the statistical study of inter-variable relations in large data sets.
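The dependence of reliability on both inter-item consistency and the number of items, noted at the start of this abstract, can be seen in the classical standardized-alpha (Spearman-Brown type) formula. This is a classical test theory illustration, not the probabilistic modelling the paper advocates; the function name `standardized_alpha` and the mean inter-item correlation of 0.3 are assumptions made for the example, while the 78-item length echoes the survey mentioned in the abstract.

```python
def standardized_alpha(k, mean_r):
    """Standardized Cronbach's alpha for k items with mean inter-item correlation mean_r."""
    return k * mean_r / (1 + (k - 1) * mean_r)


# Hold the inter-item consistency fixed and vary only the number of items.
for k in (4, 10, 78):
    print(k, round(standardized_alpha(k, 0.3), 3))
```

With consistency held constant, the coefficient rises with test length alone, which is why a high alpha on a long instrument says little by itself about how coherent the items are.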
Reliability of adapted version of Italian Label tobacco Impact Index for the adolescent: ALII.
Guerra, F; Mannocci, A; Colamesta, V; De Luca, G; Fiore, M; Firenze, A; Ferrara, M; Langiano, E; De Vito, E; Bonaccorsi, G; La Torre, G
2017-01-01
The aim of this study was to assess the reliability of the Adolescent Label Impact Index (ALII), an adolescent-adapted version of the Italian LII of tobacco product warnings. A sample of students aged 13-15 years was considered. The ALII consists of 4 items: salience, harm, quitting and forgo. The questionnaire was self-administered to study participants twice, with 3 days between administrations (T1 and T2), to measure reliability. Internal consistency was computed using Cronbach's alpha and Corrected Item-Total Correlations (CITC), and test-retest reliability using Pearson's correlation. Cronbach's alpha ranged from 0.625 at T1 to 0.715 at T2. "Salience" was the item with the lowest CITC value (0.281). Pearson's coefficient was r=0.909 (p<0.001). The instrument is low in cost and easy to administer and analyse in a setting with people aged 13-15 years. The ALII showed acceptable consistency and excellent stability over time. However, attention must be paid when the ALII is administered to non-smoking teens and to those who have never seen tobacco product labels, to allow an appropriate interpretation of the data collected.
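The two internal-consistency statistics used in this abstract, Cronbach's alpha and the corrected item-total correlation, can be computed for a short scale as sketched below. This is a minimal illustration using only the Python standard library; the function names and the six response rows are hypothetical and are not ALII data.

```python
from math import sqrt
from statistics import variance


def pearson(x, y):
    """Pearson correlation between two equal-length score lists."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def cronbach_alpha(rows):
    """rows: one list of item scores per respondent."""
    k = len(rows[0])
    item_variances = sum(variance(col) for col in zip(*rows))
    total_variance = variance([sum(r) for r in rows])
    return k / (k - 1) * (1 - item_variances / total_variance)


def corrected_item_total(rows):
    """Correlate each item with the sum of the remaining items."""
    result = []
    for j in range(len(rows[0])):
        item = [r[j] for r in rows]
        rest = [sum(r) - r[j] for r in rows]
        result.append(pearson(item, rest))
    return result


# Hypothetical responses of six students to a 4-item scale.
responses = [
    [3, 4, 3, 4],
    [2, 2, 1, 2],
    [4, 4, 4, 3],
    [1, 2, 2, 1],
    [3, 3, 4, 4],
    [2, 1, 2, 2],
]
alpha = cronbach_alpha(responses)
citc = corrected_item_total(responses)
print(round(alpha, 3), [round(c, 3) for c in citc])
```

The item is excluded from its own total so that its correlation is not inflated by correlating the item with itself; an item with a low value, like "salience" in the abstract, contributes little to the common score.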
García-Fernández, José M; Inglés, Cándido J; Marzo, Juan C; Martínez-Monteagudo, María C
2014-05-01
The School Anxiety Inventory (SAI) can be applied in different fields of psychology. However, due to the inventory's administration time, it may not be useful in certain situations. To address this concern, the present study developed a short version of the SAI (the SAI-SV). This study examined the reliability and validity evidence drawn from the scores of the School Anxiety Inventory-Short Version (SAI-SV) using a sample of 2,367 (47.91% boys) Spanish secondary school students, ranging from 12 to 18 years of age. To analyze the dimensional structure of the SAI-SV, exploratory and confirmatory factor analyses were applied. Internal consistency and test-retest reliability were calculated for SAI-SV scores. A correlated three-factor structure related to school situations (Anxiety about Aggression, Anxiety about Social Evaluation, and Anxiety about Academic Failure) and a three-factor structure related to the response systems of anxiety (Physiological Anxiety, Cognitive Anxiety, and Behavioral Anxiety) were identified and supported. The internal consistency and test-retest reliability were determined to be appropriate. The reliability and validity evidence based on the internal structure of SAI-SV scores was satisfactory.
Test-retest reliability of the Military Pre-training Questionnaire.
Robinson, M; Stokes, K; Bilzon, J; Standage, M; Brown, P; Thompson, D
2010-09-01
Musculoskeletal injuries are a significant cause of morbidity during military training. A brief, inexpensive and user-friendly tool that demonstrates reliability and validity is warranted to effectively monitor the relationship between multiple predictor variables and injury incidence in military populations. To examine the test-retest reliability of the Military Pre-training Questionnaire (MPQ), designed specifically to assess risk factors for injury among military trainees across five domains (physical activity, injury history, diet, alcohol and smoking). Analyses were based on a convenience sample of 58 male British Army trainees. Kappa (κ), weighted kappa (κw) and intraclass correlation coefficients (ICC) were used to evaluate the 2-week test-retest reliability of the MPQ. For index measures constituting the assessment of a given construct, internal consistency was assessed by Cronbach's alpha (α) coefficients. Reliability of individual items ranged from poor to almost perfect (κ range = 0.45-0.86; κw range = 0.11-0.91; ICC range = 0.34-0.86) with most items demonstrating moderate reliability. Overall scores related to physical activity, diet, alcohol and smoking constructs were reliable between both administrations (ICC = 0.63-0.85). Support for the internal consistency of the incorporated alcohol (α = 0.78) and cigarette (α = 0.75) scales was also provided. The MPQ is a reliable self-report instrument for assessing multiple injury-related risk factors during initial military training. Further assessment of the psychometric properties of the MPQ (e.g. different types of validity) with military populations/samples will support its interpretation and use in future surveillance and epidemiological studies.
Scholl, Isabelle; Kriston, Levente; Dirmaier, Jörg; Härter, Martin
2015-02-01
While there has been a clear move towards shared decision-making (SDM) in the last few years, the measurement of SDM-related constructs remains challenging. There has been a call for further psychometric testing of known scales, especially regarding validity aspects. To test convergent validity of the nine-item Shared Decision-Making Questionnaire (SDM-Q-9) by comparing it to the OPTION Scale. Cross-sectional study. Data were collected in outpatient care practices. Patients suffering from chronic diseases and facing a medical decision were included in the study. Consultations were evaluated using the OPTION Scale. Patients completed the SDM-Q-9 after the consultation. First, the internal consistency of both scales and the inter-rater reliability of the OPTION Scale were calculated. To analyse the convergent validity of the SDM-Q-9, correlation between the patient (SDM-Q-9) and expert ratings (OPTION Scale) was calculated. A total of 21 physicians provided analysable data of consultations with 63 patients. Analyses revealed good internal consistency of the SDM-Q-9 and limited internal consistency of the OPTION Scale. Inter-rater reliability of the latter was less than optimal. Association between the total scores of both instruments was weak with a Spearman correlation of r = 0.19 and did not reach statistical significance. By the use of the OPTION Scale convergent validity of the SDM-Q-9 could not be established. Several possible explanations for this result are discussed. This study shows that the measurement of SDM remains challenging. © 2012 John Wiley & Sons Ltd.
Psychometric analyses to improve the Dutch ICF Activity Inventory.
Bruijning, Janna E; van Rens, Ger; Knol, Dirk; van Nispen, Ruth
2013-08-01
In the past, rehabilitation centers for the visually impaired used unstructured or semistructured methods to assess rehabilitation needs of their patients. Recently, an extensive instrument, the Dutch ICF Activity Inventory (D-AI), was developed to systematically investigate rehabilitation needs of visually impaired adults and to evaluate rehabilitation outcomes. The purpose of this study was to investigate the underlying factor structure and other psychometric properties to shorten and improve the D-AI. The D-AI was administered to 241 visually impaired persons who recently enrolled in a multidisciplinary rehabilitation center. The D-AI uses graded scores to assess the importance and difficulty of 65 rehabilitation goals. For high-priority goals (e.g., daily meal preparation), the difficulty of underlying tasks (e.g., read recipes, cut vegetables) was assessed. To reduce underlying task items (>950), descriptive statistics were investigated and factor analyses were performed for several goals. The internal consistency reliability and test-retest reliability of the D-AI were investigated by calculating Cronbach α and Cohen (weighted) κ. Finally, consensus-based discussions were used to shorten and improve the D-AI. Except for one goal, factor analysis model parameters were at least reasonable. Internal consistency reliability was satisfactory (range, 0.74 to 0.93). In total, 60% of the 65 goal importance items and 84.4% of the goal difficulty items showed moderate to almost perfect κ values (≥0.40). After consensus-based discussions, a new D-AI was produced, containing 48 goals and less than 500 tasks. The analyses were an important step in the validation process of the D-AI and to develop a more feasible assessment tool to investigate rehabilitation needs of visually impaired persons in a systematic way. The D-AI is currently implemented in all Dutch rehabilitation centers serving all visually impaired adults with various rehabilitation needs.
Hedlund, Lena; Gyllensten, Amanda Lundvik; Hansson, Lars
2015-04-01
Fatigue is frequently reported by patients with mental illness. The multidimensional fatigue inventory (MFI-20) is a self-assessment instrument with 20 items including five dimensions of fatigue. The purpose of this study was to examine the test-retest reliability, internal consistency, convergent construct validity and feasibility of using MFI-20 in patients with schizophrenia spectrum disorders. Patients completed two self-assessment instruments, MFI-20 (n = 93) and Visual Analogue Scale (n = 79), twice within 1 week ± 2 days. Fifty-three patients also rated the feasibility of responding to the MFI-20 with a Likert scale. The test-retest reliability and validity were analysed by using Spearman's correlations and internal consistency by calculating Cronbach's α. The test-retest showed a correlation between .66 and .91 for all subscales of MFI. The internal consistency was .92. The analysis of convergent construct validity showed a correlation of .68 (time 1) and .77 (time 2). No item was systematically identified as being difficult to answer.
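Cronbach's α, the internal consistency statistic reported in this and many of the other abstracts, can be sketched as follows. This is a minimal illustration using the standard formula with sample variances; the function name and the small score matrix are hypothetical and unrelated to the MFI-20 data.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    if k < 2 or len(scores) < 2:
        raise ValueError("need at least two items and two respondents")
    if any(len(row) != k for row in scores):
        raise ValueError("every respondent needs a score for every item")
    # Sample variance of each item across respondents.
    item_variances = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of the summed scale score.
    total_variance = variance([sum(row) for row in scores])
    if total_variance == 0:
        raise ValueError("alpha is undefined when total scores do not vary")
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical 5-respondent, 3-item score matrix (illustrative only).
responses = [
    [4, 5, 4],
    [2, 3, 2],
    [5, 5, 4],
    [1, 2, 2],
    [3, 3, 3],
]
alpha = cronbach_alpha(responses)
print(f"alpha = {alpha:.2f}")
```

Alpha approaches 1 when items rise and fall together across respondents and falls toward 0 when they vary independently.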
Kim, Eun-Mi; Kim, Sun-Aee; Lee, Ju-Ry; Burlison, Jonathan D; Oh, Eui Geum
2018-02-13
"Second victims" are defined as healthcare professionals whose wellness is influenced by adverse clinical events. The Second Victim Experience and Support Tool (SVEST) was used to measure the second-victim experience and quality of support resources. Although the reliability and validity of the original SVEST have been established, those of the Korean version have not. The aim of the study was to evaluate the psychometric properties of the Korean version of the SVEST. The study included 305 clinical nurses as participants. The SVEST was translated into Korean via back translation. Content validity was assessed by seven experts, and test-retest reliability was evaluated by 30 clinicians. Internal consistency and construct validity were assessed via confirmatory factor analysis. The analyses were performed using SPSS 23.0 and STATA 13.0 software. The content validity index value demonstrated validity; item- and scale-level content validity index values were both 0.95. Test-retest reliability and internal consistency reliability were satisfactory: the intraclass correlation coefficient was 0.71, and Cronbach α values ranged from 0.59 to 0.87. The CFA showed a significantly good fit for an eight-factor structure (χ² = 578.21, df = 303, comparative fit index = 0.92, Tucker-Lewis index = 0.90, root mean square error of approximation = 0.05). The K-SVEST demonstrated good psychometric properties and adequate validity and reliability. The results showed that the Korean version of SVEST demonstrated the extent of second victimhood and support resources in Korean healthcare workers and could aid in the development of support programs and evaluation of their effectiveness.
Extensive validation of the pain disability index in 3 groups of patients with musculoskeletal pain.
Soer, Remko; Köke, Albère J A; Vroomen, Patrick C A J; Stegeman, Patrick; Smeets, Rob J E M; Coppes, Maarten H; Reneman, Michiel F
2013-04-20
A cross-sectional study was performed. To validate the pain disability index (PDI) extensively in 3 groups of patients with musculoskeletal pain. The PDI is a widely used and studied instrument for disability related to various pain syndromes, although there is conflicting evidence concerning factor structure, test-retest reliability, and missing items. Additionally, an official translation of the Dutch language version has never been performed. For reliability, internal consistency, factor structure, test-retest reliability and measurement error were calculated. Validity was tested with hypothesized correlations with pain intensity, kinesiophobia, Rand-36 subscales, Depression, Roland-Morris Disability Questionnaire, Quality of Life, and Work Status. Structural validity was tested with independent backward translation and approval from the original authors. One hundred seventy-eight patients with acute back pain, 425 patients with chronic low back pain and 365 with widespread pain were included. Internal consistency of the PDI was good. One factor was identified with factor analyses. Test-retest reliability was good for the PDI (intraclass correlation coefficient, 0.76). Standard error of measurement was 6.5 points and smallest detectable change was 17.9 points. The PDI showed little correlation with kinesiophobia and depression, fair correlations with pain intensity, work status, and vitality, and moderate correlations with the Rand-36 subscales and the Roland-Morris Disability Questionnaire. The PDI-Dutch language version is internally consistent as a 1-factor structure, and test-retest reliable. Missing responses appear frequent for the sexual and professional items. Using the PDI as a 2-factor questionnaire has no additional value and is unreliable.
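The abstract reports a standard error of measurement (SEM) of 6.5 points and a smallest detectable change (SDC) of 17.9 points but does not say how one was derived from the other. The sketch below assumes the commonly used relation SDC = 1.96 × √2 × SEM; that formula and the function name are assumptions, not something stated in the source.

```python
import math


def smallest_detectable_change(sem, z=1.96):
    """SDC at the individual level, assuming SDC = z * sqrt(2) * SEM."""
    if sem < 0:
        raise ValueError("SEM must be non-negative")
    # sqrt(2) reflects that a change score combines the error of two measurements.
    return z * math.sqrt(2) * sem


# Reported (rounded) SEM from the abstract.
sdc_from_rounded_sem = smallest_detectable_change(6.5)

# Working backwards from the reported SDC to the SEM it would imply.
implied_sem = 17.9 / (1.96 * math.sqrt(2))

print(f"SDC from SEM 6.5: {sdc_from_rounded_sem:.1f}")
print(f"SEM implied by SDC 17.9: {implied_sem:.2f}")
```

Under this assumption the reported SDC corresponds to an unrounded SEM of about 6.46, which rounds to the published 6.5, so the two figures are mutually consistent.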
Gaete, Jorge; Montero-Marin, Jesus; Rojas-Barahona, Cristian A.; Olivares, Esterbina; Araya, Ricardo
2016-01-01
School membership appears to be an important factor in explaining the relationship between students and schools, including school staff. School membership is associated with several school-related outcomes, such as academic performance and expectations. Most studies on school membership have been conducted in developed countries. The Psychological Sense of School Membership (PSSM) scale (18 items: 13 positively worded items, 5 negatively worded items) has been widely used to measure this construct, but no studies regarding its validity and reliability have been conducted in Spanish-speaking Latin American countries. This study investigates the psychometric properties, factor structure and reliability of this scale in a sample of 1250 early adolescents in Chile. Both exploratory and confirmatory factor analyses provide evidence of an excellent fit for a one-factor solution after removing the negatively worded items. The internal consistency of this new abbreviated version was 0.92. The association analyses demonstrated that high school membership was associated with better academic performance, stronger school bonding, a reduced likelihood of school misbehavior, and reduced likelihood of substance use. Analyses showed support for the reliability and validity of the PSSM among Chilean adolescents. PMID:27999554
The Child Adolescent Bullying Scale (CABS): Psychometric evaluation of a new measure.
Strout, Tania D; Vessey, Judith A; DiFazio, Rachel L; Ludlow, Larry H
2018-06-01
While youth bullying is a significant public health problem, healthcare providers have been limited in their ability to identify bullied youths due to the lack of a reliable and valid instrument appropriate for use in clinical settings. We conducted a multisite study to evaluate the psychometric properties of a new 22-item instrument for assessing youths' experiences of being bullied, the Child Adolescent Bullying Scale (CABS). The 20 items summed to produce the measure's score were evaluated here. Diagnostic performance was assessed through evaluation of sensitivity, specificity, predictive values, and area under receiver operating characteristic (AUROC) curve. A sample of 352 youths from diverse racial, ethnic, and geographic backgrounds (188 female, 159 male, 5 transgender, sample mean age 13.5 years) were recruited from two clinical sites. Participants completed the CABS and existing youth bullying measures. Analyses grounded in classical test theory, including assessments of reliability and validity, item analyses, and principal components analysis, were conducted. The diagnostic performance and test characteristics of the CABS were also evaluated. The CABS is comprised of one component, accounting for 67% of observed variance. Analyses established evidence of internal consistency reliability (Cronbach's α = 0.97), construct and convergent validity. Sensitivity was 84%, specificity was 65%, and the AUROC curve was 0.74 (95% CI: 0.69-0.80). Findings suggest that the CABS holds promise as a reliable, valid tool for healthcare provider use in screening for bullying exposure in the clinical setting. © 2018 Wiley Periodicals, Inc.
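The diagnostic quantities named above (sensitivity, specificity, predictive values) all derive from a 2×2 table of screening result against reference status. A minimal sketch follows; the function name is hypothetical, and the counts are invented so that they reproduce the reported 84% and 65%. They are not the study's actual cell counts, which the abstract does not give.

```python
def screening_metrics(tp, fp, fn, tn):
    """Sensitivity, specificity, PPV and NPV from a 2x2 screening table."""
    if min(tp, fp, fn, tn) < 0:
        raise ValueError("counts must be non-negative")
    if 0 in (tp + fn, tn + fp, tp + fp, tn + fn):
        raise ValueError("every row and column of the table needs at least one case")
    return {
        "sensitivity": tp / (tp + fn),  # true positives among those with the condition
        "specificity": tn / (tn + fp),  # true negatives among those without it
        "ppv": tp / (tp + fp),          # positive screens that are correct
        "npv": tn / (tn + fn),          # negative screens that are correct
    }


# Hypothetical counts: 100 bullied and 100 non-bullied youths (illustrative only).
metrics = screening_metrics(tp=84, fp=35, fn=16, tn=65)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

Unlike sensitivity and specificity, the predictive values depend on how common the condition is in the sample, so the PPV and NPV printed here hold only for the invented 50% prevalence.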
Psychometric properties of stress and anxiety measures among nulliparous women.
Bann, Carla M; Parker, Corette B; Grobman, William A; Willinger, Marian; Simhan, Hyagriv N; Wing, Deborah A; Haas, David M; Silver, Robert M; Parry, Samuel; Saade, George R; Wapner, Ronald J; Elovitz, Michal A; Miller, Emily S; Reddy, Uma M
2017-03-01
To examine the psychometric properties of three measures, the perceived stress scale (PSS), pregnancy experience scale (PES), and state trait anxiety inventory (STAI), for assessing stress and anxiety during pregnancy among a large sample of nulliparous women. The sample included 10,002 pregnant women participating in the Nulliparous Pregnancy Outcomes Study: Monitoring Mothers-to-Be (nMoM2b). Internal consistency reliability was assessed with Cronbach's alpha and factorial validity with confirmatory factor analyses. Intraclass correlations (ICCs) were calculated to determine stability of PSS scales over time. Psychometric properties were examined for the overall sample, as well as subgroups based on maternal age, race/ethnicity and language. All three scales demonstrated good internal consistency reliability. Confirmatory factor analyses supported the factor structures of the PSS and the PES. However, a one-factor solution of the trait-anxiety subscale from the STAI did not fit well; a two-factor solution, splitting the items into factors based on direction of item wording (positive versus negative) provided a better fit. Scores on the PSS were generally stable over time (ICC = 0.60). Subgroup analyses revealed a few items that did not perform well on Spanish versions of the scales. Overall, the scales performed well, suggesting they could be useful tools for identifying women experiencing high levels of stress and anxiety during pregnancy and allowing for the implementation of interventions to help reduce maternal stress and anxiety.
Jorgensen, J E; Rathleff, C R; Rathleff, M S; Andreasen, J
2016-12-01
The Oslo Sports Trauma Research Centre Overuse Injury Questionnaire (OSTRC-O) and the Oslo Sports Trauma Research Centre questionnaire on Health Problems (The OSTRC-H) make it possible to monitor illness and injury at regular intervals capturing prevalence and incidence of acute injury, overuse injury, and illnesses. The aim of this study was to translate, culturally adapt, and establish the face validity of the OSTRC-O and the OSTRC-H into a Danish context (DK) through cognitive interviews and the assessment of test-retest reliability. The OSTRC-O.DK was distributed to 57 heterogeneous respondents; response rate was 89%. The OSTRC-H was distributed to 58 heterogeneous respondents; response rate was 86%. No major disagreements were observed between the original and translated versions of the questionnaires. The OSTRC-O had high internal consistency (Cronbach's alpha 0.80-0.93). The primary reliability analyses, including all participants, showed an ICC of 0.62 (95% CI: 0.42-0.77). The secondary reliability analyses that only included subjects who did not change injury region from the test to the retest showed an ICC of 0.86 (95% CI: 0.77-0.92). The questionnaires were found to be valid, reliable, and acceptable for use in a Danish population. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
The Validation of a Case-Based, Cumulative Assessment and Progressions Examination
Coker, Adeola O.; Copeland, Jeffrey T.; Gottlieb, Helmut B.; Horlen, Cheryl; Smith, Helen E.; Urteaga, Elizabeth M.; Ramsinghani, Sushma; Zertuche, Alejandra; Maize, David
2016-01-01
Objective. To assess content and criterion validity, as well as reliability of an internally developed, case-based, cumulative, high-stakes third-year Annual Student Assessment and Progression Examination (P3 ASAP Exam). Methods. Content validity was assessed through the writing-reviewing process. Criterion validity was assessed by comparing student scores on the P3 ASAP Exam with the nationally validated Pharmacy Curriculum Outcomes Assessment (PCOA). Reliability was assessed with psychometric analysis comparing student performance over four years. Results. The P3 ASAP Exam showed content validity through representation of didactic courses and professional outcomes. Similar scores on the P3 ASAP Exam and PCOA with Pearson correlation coefficient established criterion validity. Consistent student performance using Kuder-Richardson coefficient (KR-20) since 2012 reflected reliability of the examination. Conclusion. Pharmacy schools can implement internally developed, high-stakes, cumulative progression examinations that are valid and reliable using a robust writing-reviewing process and psychometric analyses. PMID:26941435
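For examinations scored right/wrong, the Kuder-Richardson coefficient (KR-20) mentioned above plays the role that Cronbach's α plays for graded items. The following is a minimal sketch using the population-variance convention; the function name and the response matrix are hypothetical and are not P3 ASAP Exam data.

```python
from statistics import pvariance


def kr20(responses):
    """KR-20 for a list of examinees, each a list of k item scores coded 0 or 1."""
    k = len(responses[0])
    n = len(responses)
    if k < 2 or n < 2:
        raise ValueError("need at least two items and two examinees")
    if any(len(row) != k or any(x not in (0, 1) for x in row) for row in responses):
        raise ValueError("every examinee needs k scores coded 0 or 1")
    # p is the proportion answering each item correctly; p * (1 - p) is its variance.
    proportions = [sum(row[i] for row in responses) / n for i in range(k)]
    item_variance_sum = sum(p * (1 - p) for p in proportions)
    # Population variance of total scores, matching the p * (1 - p) convention.
    total_variance = pvariance([sum(row) for row in responses])
    if total_variance == 0:
        raise ValueError("KR-20 is undefined when total scores do not vary")
    return k / (k - 1) * (1 - item_variance_sum / total_variance)


# Hypothetical answers from four examinees on three items (illustrative only).
answers = [
    [1, 1, 1],
    [1, 1, 0],
    [1, 0, 0],
    [0, 0, 0],
]
coefficient = kr20(answers)
print(f"KR-20 = {coefficient:.2f}")
```

Some software uses the sample variance (n − 1) for the total score instead, which gives a slightly different value on small samples.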
First Order Reliability Application and Verification Methods for Semistatic Structures
NASA Technical Reports Server (NTRS)
Verderaime, Vincent
1994-01-01
Escalating risks of aerostructures stimulated by increasing size, complexity, and cost should no longer be ignored by conventional deterministic safety design methods. The deterministic pass-fail concept is incompatible with probability and risk assessments, its stress audits are shown to be arbitrary and incomplete, and it compromises high strength materials performance. A reliability method is proposed which combines first order reliability principles with deterministic design variables and conventional test technique to surmount current deterministic stress design and audit deficiencies. Accumulative and propagation design uncertainty errors are defined and appropriately implemented into the classical safety index expression. The application is reduced to solving for a factor that satisfies the specified reliability and compensates for uncertainty errors, and then using this factor as, and instead of, the conventional safety factor in stress analyses. The resulting method is consistent with current analytical skills and verification practices, the culture of most designers, and with the pace of semistatic structural designs.
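For orientation, the classical safety index that the abstract builds on can be sketched for the textbook case of independent, normally distributed resistance and applied stress. This is only that textbook form: the report's accumulative and propagation uncertainty terms are not reproduced, and the function names and numbers are hypothetical.

```python
import math
from statistics import NormalDist


def safety_index(mean_resistance, sd_resistance, mean_stress, sd_stress):
    """Classical first-order safety index for independent normal resistance and stress."""
    # Standard deviation of the margin (resistance minus stress).
    spread = math.hypot(sd_resistance, sd_stress)
    if spread == 0:
        raise ValueError("at least one standard deviation must be positive")
    return (mean_resistance - mean_stress) / spread


def reliability_from_index(beta):
    """Probability that resistance exceeds stress, given the safety index."""
    return NormalDist().cdf(beta)


# Hypothetical material strength and applied stress, in the same units (illustrative only).
beta = safety_index(mean_resistance=60.0, sd_resistance=4.0,
                    mean_stress=40.0, sd_stress=3.0)
reliability = reliability_from_index(beta)
print(f"safety index = {beta:.1f}, reliability = {reliability:.6f}")
```

The conventional deterministic safety factor would be the ratio of the two means; the index instead scales their difference by the combined scatter, which is what ties it to a probability.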
Boerebach, Benjamin C M; Lombarts, Kiki M J M H; Arah, Onyebuchi A
2016-03-01
The System for Evaluation of Teaching Qualities (SETQ) was developed as a formative system for the continuous evaluation and development of physicians' teaching performance in graduate medical training. It has been seven years since the introduction and initial exploratory psychometric analysis of the SETQ questionnaires. This study investigates the validity and reliability of the SETQ questionnaires across hospitals and medical specialties using confirmatory factor analyses (CFAs), reliability analysis, and generalizability analysis. The SETQ questionnaires were tested in a sample of 3,025 physicians and 2,848 trainees in 46 hospitals. The CFA revealed acceptable fit of the data to the previously identified five-factor model. The high internal consistency estimates suggest satisfactory reliability of the subscales. These results provide robust evidence for the validity and reliability of the SETQ questionnaires for evaluating physicians' teaching performance. © The Author(s) 2014.
Assessment of the psychometric properties of the Family Management Measure.
Knafl, Kathleen; Deatrick, Janet A; Gallo, Agatha; Dixon, Jane; Grey, Margaret; Knafl, George; O'Malley, Jean
2011-06-01
This paper reports development of the Family Management Measure (FaMM) of parental perceptions of family management of chronic conditions. By telephone interview, 579 parents of children age 3 to 19 with a chronic condition (349 partnered mothers, 165 partners, 65 single mothers) completed the FaMM and measures of child functional status and behavioral problems and family functioning. Analyses addressed reliability, factor structure, and construct validity. Exploratory factor analysis yielded six scales: Child's Daily Life, Condition Management Ability, Condition Management Effort, Family Life Difficulty, Parental Mutuality, and View of Condition Impact. Internal consistency reliability ranged from .72 to .91, and test-retest reliability from .71 to .94. Construct validity was supported by significant correlations in hypothesized directions between FaMM scales and established measures. Results support FaMM's reliability and validity, indicating it performs in a theoretically meaningful way and taps distinct aspects of family response to childhood chronic conditions.
Test-Retest Analyses of the Test of English as a Foreign Language. TOEFL Research Reports Report 45.
ERIC Educational Resources Information Center
Henning, Grant
This study provides information about the total and component scores of the Test of English as a Foreign Language (TOEFL). First, the study provides comparative global and component estimates of test-retest, alternate-form, and internal-consistency reliability, controlling for sources of measurement error inherent in the examinees and the testing…
Fossati, Andrea; Widiger, Thomas A; Borroni, Serena; Maffei, Cesare; Somma, Antonella
2017-06-01
To extend the evidence on the reliability and construct validity of the Five-Factor Model Rating Form (FFMRF) in its self-report version, two independent samples of Italian participants, which were composed of 510 adolescent high school students and 457 community-dwelling adults, respectively, were administered the FFMRF in its Italian translation. Adolescent participants were also administered the Italian translation of the Borderline Personality Features Scale for Children-11 (BPFSC-11), whereas adult participants were administered the Italian translation of the Triarchic Psychopathy Measure (TriPM). Cronbach α values were consistent with previous findings; in both samples, average interitem r values indicated acceptable internal consistency for all FFMRF scales. A multidimensional graded item response theory model indicated that the majority of FFMRF items had adequate discrimination parameters; information indices supported the reliability of the FFMRF scales. Both categorical (i.e., item-level) and scale-level regression analyses suggested that the FFMRF scores may predict a nonnegligible amount of variance in the BPFSC-11 total score in adolescent participants, and in the TriPM scale scores in adult participants.
Wyles, Susannah M; Miskovic, Danilo; Ni, Zhifang; Darzi, Ara W; Valori, Roland M; Coleman, Mark G; Hanna, George B
2016-03-01
There is a lack of educational tools available for surgical teaching critique, particularly for advanced laparoscopic surgery. The aim was to develop and implement a tool that assesses training quality and structures feedback for trainers in the English National Training Programme for laparoscopic colorectal surgery. Semi-structured interviews were performed and analysed, and items were extracted. Through the Delphi process, essential items pertaining to desirable trainer characteristics, training structure and feedback were determined. An assessment tool (Structured Training Trainer Assessment Report-STTAR) was developed and tested for feasibility, acceptability and educational impact. Interview transcripts (29 surgical trainers, 10 trainees, four educationalists) were analysed, and item lists created and distributed for consensus opinion (11 trainers and seven trainees). The STTAR consisted of 64 factors, and its web-based version, the mini-STTAR, included 21 factors that were categorised into four groups (training structure, training behaviour, trainer attributes and role modelling) and structured around a training session timeline (beginning, middle and end). The STTAR (six trainers, 48 different assessments) demonstrated good internal consistency (α = 0.88) and inter-rater reliability (ICC = 0.75). The mini-STTAR demonstrated good inter-item reliability (α = 0.79) and intra-observer reliability on comparison of 85 different trainer/trainee combinations (r = 0.701, p < 0.001). Both were found to be feasible and acceptable. The educational report for trainers was found to be useful (4.4 out of 5). An assessment tool that evaluates training quality was developed and shown to be reliable, acceptable and of educational value. It has been successfully implemented into the English National Training Programme for laparoscopic colorectal surgery.
Validation of the Brazilian Portuguese Version of Geriatric Anxiety Inventory--GAI-BR.
Massena, Patrícia Nitschke; de Araújo, Narahyana Bom; Pachana, Nancy; Laks, Jerson; de Pádua, Analuiza Camozzato
2015-07-01
The Geriatric Anxiety Inventory (GAI) is a recently developed scale aiming to evaluate symptoms of anxiety in later life. This 20-item scale uses dichotomous answers highlighting non-somatic anxiety complaints of elderly people. The present study aimed to evaluate the psychometric properties of the Brazilian Portuguese version GAI (GAI-BR) in a sample drawn from the community and an outpatient psychogeriatric clinic. A mixed convenience sample of 72 subjects was recruited to answer the research protocol. The interview was structured around questionnaires on sociodemographic data and clinical health status, previously validated anxiety and depression instruments, the Mini-Mental State Examination, the Mini International Neuropsychiatric Interview, and the GAI-BR. Twenty-two percent of the sample were interviewed twice for test-retest reliability. For internal consistency analyses, the Cronbach's α test was applied. The Spearman correlation test was applied to evaluate the test-retest GAI-BR reliability. A ROC (receiver operating characteristic) curve study was made to estimate the GAI-BR area under curve, cut-off points, sensitivity, and specificity for the Generalized Anxiety Disorder diagnosis. The GAI-BR version showed high internal consistency (Cronbach's α = 0.91) and strong and significant test-retest reliability (ρ = 0.85, p < 0.001). It also showed moderate and significant correlation with the Beck Anxiety Inventory (ρ = 0.68, p < 0.001) and the State-Trait Anxiety Inventory (ρ = 0.61, p < 0.001) showing evidence of concurrent validation. The cut-off point of 13 estimated by ROC curve analyses showed sensitivity of 83.3% and specificity of 84.6% to detect Generalized Anxiety Disorder (DSM-IV). GAI-BR has demonstrated very good psychometric properties and can be a reliable instrument to measure anxiety in Brazilian elderly people.
Thaung, Jörgen; Olseke, Kjell; Ahl, Johan; Sjöstrand, Johan
2014-09-01
The purpose of our study was to establish a practical and quick test for assessing reading performance and to statistically analyse interchart and test-retest reliability of a new standardized Swedish reading chart system consisting of three charts constructed according to the principles available in the literature. Twenty-four subjects with healthy eyes, mean age 65 ± 10 years, were tested binocularly and the reading performance evaluated as reading acuity, critical print size and maximum reading speed. The test charts all consist of 12 short text sentences with a print size ranging from 0.9 to -0.2 logMAR in approximate steps of 0.1 logMAR. Two testing sessions, in two different groups (C1 and C2), were under strict control of luminance and lighting environment. Reading performance tests with chart T1, T2 and T3 were used for evaluation of interchart reliability and test data from a second session 1 month or more apart for the test-retest analysis. The testing of reading performance in adult observers with short sentences of continuous text was quick and practical. The agreement between the tests obtained with the three different test charts was high both within the same test session and at retest. This new Swedish variant of a standardized reading system based on short sentences and logarithmic progression of print size provides reliable measurements of reading performance and preliminary norms in an age group around 65 years. The reading test with three independent reading charts can be useful for clinical studies of reading ability before and after treatment. © 2013 Acta Ophthalmologica Scandinavica Foundation. Published by John Wiley & Sons Ltd.
Development of the PROMIS positive emotional and sensory expectancies of smoking item banks.
Tucker, Joan S; Shadel, William G; Edelen, Maria Orlando; Stucky, Brian D; Li, Zhen; Hansen, Mark; Cai, Li
2014-09-01
The positive emotional and sensory expectancies of cigarette smoking include improved cognitive abilities, positive affective states, and pleasurable sensorimotor sensations. This paper describes development of Positive Emotional and Sensory Expectancies of Smoking item banks that will serve to standardize the assessment of this construct among daily and nondaily cigarette smokers. Data came from daily (N = 4,201) and nondaily (N = 1,183) smokers who completed an online survey. To identify a unidimensional set of items, we conducted item factor analyses, item response theory analyses, and differential item functioning analyses. Additionally, we evaluated the performance of fixed-item short forms (SFs) and computer adaptive tests (CATs) to efficiently assess the construct. Eighteen items were included in the item banks (15 common across daily and nondaily smokers, 1 unique to daily, 2 unique to nondaily). The item banks are strongly unidimensional, highly reliable (reliability = 0.95 for both), and perform similarly across gender, age, and race/ethnicity groups. A SF common to daily and nondaily smokers consists of 6 items (reliability = 0.86). Results from simulated CATs indicated that, on average, fewer than 8 items are needed to assess the construct with adequate precision using the item banks. These analyses identified a new set of items that can assess the positive emotional and sensory expectancies of smoking in a reliable and standardized manner. Considerable efficiency in assessing this construct can be achieved by using the item bank SF, employing computer adaptive tests, or selecting subsets of items tailored to specific research or clinical purposes. © The Author 2014. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco.
Methods Used to Streamline the CAHPS® Hospital Survey
Keller, San; O'Malley, A James; Hays, Ron D; Matthew, Rebecca A; Zaslavsky, Alan M; Hepner, Kimberly A; Cleary, Paul D
2005-01-01
Objective To identify a parsimonious subset of reliable, valid, and consumer-salient items from 33 questions asking for patient reports about hospital care quality. Data Source CAHPS® Hospital Survey pilot data were collected during the summer of 2003 using mail and telephone from 19,720 patients who had been treated in 132 hospitals in three states and discharged from November 2002 to January 2003. Methods Standard psychometric methods were used to assess the reliability (internal consistency reliability and hospital-level reliability) and construct validity (exploratory and confirmatory factor analyses, strength of relationship to overall rating of hospital) of the 33 report items. The best subset of items from among the 33 was selected based on their statistical properties in conjunction with the importance assigned to each item by participants in 14 focus groups. Principal Findings Confirmatory factor analysis (CFA) indicated that a subset of 16 questions proposed to measure seven aspects of hospital care (communication with nurses, communication with doctors, responsiveness to patient needs, physical environment, pain control, communication about medication, and discharge information) demonstrated excellent fit to the data. Scales in each of these areas had acceptable levels of reliability to discriminate among hospitals and internal consistency reliability estimates comparable with previously developed CAHPS instruments. Conclusion Although half the length of the original, the shorter CAHPS hospital survey demonstrates promising measurement properties, identifies variations in care among hospitals, and deals with aspects of the hospital stay that are important to patients' evaluations of care quality. PMID:16316438
Whorfian effects on colour memory are not reliable.
Wright, Oliver; Davies, Ian R L; Franklin, Anna
2015-01-01
The Whorfian hypothesis suggests that differences between languages cause differences in cognitive processes. Support for this idea comes from studies that find that patterns of colour memory errors made by speakers of different languages align with differences in colour lexicons. The current study provides a large-scale investigation of the relationship between colour language and colour memory, adopting a cross-linguistic and developmental approach. Colour memory on a delayed matching-to-sample (XAB) task was investigated in 2 language groups with differing colour lexicons, for 3 developmental stages and 2 regions of colour space. Analyses used a Bayesian technique to provide simultaneous assessment of two competing hypotheses (H1-Whorfian effect present, H0-Whorfian effect absent). Results of the analyses consistently favoured H0. The findings suggest that Whorfian effects on colour memory are not reliable and that the importance of such effects should not be overestimated.
Heo, K H; Squires, J; Yovanoff, P
2008-03-01
Accurate and efficient developmental screening measures are critical for early identification of developmental problems; however, few reliable and valid tests are available in Korea as well as other countries outside the USA. The Ages and Stages Questionnaires (ASQ) was chosen for study with young children in Korea. The ASQ was translated into Korean and necessary cross-cultural adaptations were made. The translated version was then distributed and completed by 3220 parents of young children between the ages of 4 months and 5 years. Reliability was studied including domain correlations, internal consistency, and performance of identification cut-off scores for the Korean population. Rasch analyses including tests of Differential Item Functioning, contrasting Korean and US samples were also performed. In general, internal consistency of the Korean ASQ was high, with overall correlations 0.75 for communication, 0.85 for gross motor, 0.74 for fine motor, 0.72 for problem solving, and 0.65 for personal-social. Validity, including concurrent validity, also had strong evidence. Mean scores of children on the Korean translation of the ASQ and the US normative sample were generally similar. Rasch analyses indicated the majority of items functioned similarly across the Korean sample. In general, the ASQ was translated with cultural appropriateness in mind and functioned as a valid and reliable parent-completed screening test to assist in early identification of young children with developmental delays. Further research is needed to confirm these results with a larger and more diverse Korean sample.
Cruz, Jonas P; Baldacchino, Donia R; Alquwez, Nahed
2016-06-01
Patients often resort to religious and spiritual activities to cope with physical and mental challenges. The effect of spiritual coping on overall health, adaptation and health-related quality of life among patients undergoing haemodialysis (HD) is well documented. Thus, it is essential to establish a valid and reliable instrument that can assess both the religious and non-religious coping methods in patients undergoing HD. This study aimed to assess the validity and reliability of the Spiritual Coping Strategies Scale Arabic version (SCS-A) in Saudi patients undergoing HD. A convenience sample of 60 Saudi patients undergoing HD was recruited for this descriptive, cross-sectional study. Data were collected between May and June 2015. Forward-backward translation was used to formulate the SCS-A. The SCS-A, Muslim Religiosity Scale and the Quality of Life Index Dialysis Version III were used to procure the data. Internal consistency reliability, stability reliability, factor analysis and construct validity tests were performed. Analyses were set at the 0.05 level of significance. The SCS-A showed an acceptable internal consistency and strong stability reliability over time. The EFA produced two factors (non-religious and religious coping). Satisfactory construct validity was established by the convergent and divergent validity and known-groups method. The SCS-A is a reliable and valid tool that can be used to measure the religious and non-religious coping strategies of patients undergoing HD in Saudi Arabia and other Muslim and Arabic-speaking countries. © 2016 European Dialysis and Transplant Nurses Association/European Renal Care Association.
Kadioglu, Hasibe; Erol, Saime; Ergun, Ayse
2015-01-01
The purpose of this research was to examine the psychometric properties of the Turkish version of the situational self-efficacy scale for vegetable and fruit consumption in adolescents. This was a methodological study. The study was conducted in four public secondary schools in Istanbul, Turkey. Subjects were 1586 adolescents. Content and construct validity were assessed to test the validity of the scale. The reliability was assessed in terms of internal consistency and test-retest reliability. For confirmatory factor analysis, χ(2) statistics plus other fit indices were used, including the goodness-of-fit index, the adjusted goodness-of-fit index, the nonnormed fit index, the comparative fit index, the standardized root mean residual, and the root mean square error of approximation. Pearson's correlation was used for test-retest reliability and item total correlation. The internal consistency was assessed by using Cronbach α. Confirmatory factor analysis strongly supported the three-component structure representing positive social situations (α = .81), negative effect situations (α = .93), and difficult situations (α = .78). Psychometric analyses of the Turkish version of the situational self-efficacy scale indicate high reliability and good content and construct validity. Researchers and health professionals will find it useful to employ the Turkish situational self-efficacy scale in evaluating situational self-efficacy for fruit and vegetable consumption in Turkish adolescents.
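For readers unfamiliar with the statistic reported for each subscale, Cronbach's α can be computed from a respondents-by-items score matrix as sketched below. The data are made up for illustration and are not from the study.

```python
from statistics import variance


def cronbach_alpha(scores: list[list[float]]) -> float:
    """Cronbach's alpha; scores[r][i] is respondent r's score on item i."""
    k = len(scores[0])
    # Sample variance of each item across respondents.
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of each respondent's total score.
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1.0 - sum(item_vars) / total_var)


# Hypothetical 5 respondents x 3 items on a 1-5 scale.
toy = [[3, 4, 3], [2, 2, 3], [5, 4, 5], [1, 2, 1], [4, 5, 4]]
print(f"alpha = {cronbach_alpha(toy):.2f}")
```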
Barnett, Lisa M; Robinson, Leah E; Webster, E Kipling; Ridgers, Nicola D
2015-08-01
The purpose was to determine the reliability of an instrument designed to assess young children's perceived movement skill competence in 2 diverse samples. A pictorial instrument assessed 12 perceived Fundamental Movement Skills (FMS) based on the Test of Gross Motor Development 2nd edition. Intra-Class Correlations (ICC) and internal consistency analyses were conducted. Paired sample t tests assessed change in mean perceived skill scores. Bivariate correlations between the intertrial difference and the mean of the trials explored proportional bias. Sample 1 (S1) were culturally diverse Australian children (n = 111; 52% boys) aged 5 to 8 years (mean = 6.4, SD = 1.0) with educated parents. Sample 2 (S2) were racially diverse and socioeconomically disadvantaged American children (n = 110; 57% boys) aged 5 to 10 years (mean = 6.8, SD = 1.1). For all children, the internal consistency for 12 FMS was acceptable (S1 = 0.72, 0.75, S2 = 0.66, 0.67). ICCs were higher in S1 (0.73) than S2 (0.50). Mean changes between trials were small. There was little evidence of proportional bias. Lower values in S2 may be due to differences in study demographic and execution. While the instrument demonstrated reliability/internal consistency, further work is recommended in diverse samples.
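The proportional-bias check described above (correlating the intertrial difference with the mean of the two trials) can be sketched as follows. A correlation near zero suggests that disagreement between trials does not grow or shrink with the size of the score. The scores are hypothetical.

```python
from math import sqrt


def pearson_r(x: list[float], y: list[float]) -> float:
    """Pearson product-moment correlation."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def proportional_bias(trial1: list[float], trial2: list[float]) -> float:
    """Correlation between intertrial difference and intertrial mean."""
    diffs = [b - a for a, b in zip(trial1, trial2)]
    means = [(a + b) / 2 for a, b in zip(trial1, trial2)]
    return pearson_r(diffs, means)


# Hypothetical perceived-competence totals from two administrations.
t1 = [30, 35, 28, 40, 33, 37]
t2 = [31, 34, 30, 39, 35, 36]
print(f"proportional bias r = {proportional_bias(t1, t2):.2f}")
```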
Development of the Therapist Empathy Scale.
Decker, Suzanne E; Nich, Charla; Carroll, Kathleen M; Martino, Steve
2014-05-01
Few measures exist to examine therapist empathy as it occurs in session. A 9-item observer rating scale, called the Therapist Empathy Scale (TES), was developed based on Watson's (1999) work to assess affective, cognitive, attitudinal, and attunement aspects of therapist empathy. The aim of this study was to evaluate the inter-rater reliability, internal consistency, and construct and criterion validity of the TES. Raters evaluated therapist empathy in 315 client sessions conducted by 91 therapists, using data from a multi-site therapist training trial (Martino et al., 2010) in Motivational Interviewing (MI). Inter-rater reliability (ICC = .87 to .91) and internal consistency (Cronbach's alpha = .94) were high. Confirmatory factor analyses indicated some support for single-factor fit. Convergent validity was supported by correlations between TES scores and MI fundamental adherence (r range .50 to .67) and competence scores (r range .56 to .69). Discriminant validity was indicated by negative or nonsignificant correlations between TES and MI-inconsistent behavior (r range .05 to -.33). The TES demonstrates excellent inter-rater reliability and internal consistency. Results indicate some support for a single-factor solution and convergent and discriminant validity. Future studies should examine the use of the TES to evaluate therapist empathy in different psychotherapy approaches and to determine the impact of therapist empathy on client outcome.
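The abstract does not state which form of the intraclass correlation was used for inter-rater reliability. As one possibility, the sketch below computes the two-way random-effects, absolute-agreement, single-rater form, ICC(2,1), from a subjects-by-raters matrix. The example ratings are the small worked data set from Shrout and Fleiss (1979), not data from this study.

```python
def icc_2_1(ratings: list[list[float]]) -> float:
    """ICC(2,1); ratings[s][r] is the rating of subject s by rater r."""
    n, k = len(ratings), len(ratings[0])
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]
    # Two-way ANOVA sums of squares.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_err = ss_total - ss_rows - ss_cols
    msr = ss_rows / (n - 1)               # between-subjects mean square
    msc = ss_cols / (k - 1)               # between-raters mean square
    mse = ss_err / ((n - 1) * (k - 1))    # residual mean square
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)


# Six targets rated by four judges (Shrout & Fleiss, 1979).
data = [[9, 2, 5, 8], [6, 1, 3, 2], [8, 4, 6, 8],
        [7, 1, 2, 6], [10, 5, 6, 9], [6, 2, 4, 7]]
print(f"ICC(2,1) = {icc_2_1(data):.2f}")
```

Because ICC(2,1) penalizes systematic differences between raters, it can be much lower than a consistency-type ICC on the same data.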
Reddy, Linda A; Dudek, Christopher M; Fabiano, Gregory A; Peters, Stephanie
2015-12-01
This article presents information about the construct validity and reliability of a new teacher self-report measure of classroom instructional and behavioral practices (the Classroom Strategies Scales-Teacher Form; CSS-T). The theoretical underpinnings and empirical basis for the instructional and behavioral management scales are presented. Information is provided about the construct validity, internal consistency, test-retest reliability, and freedom from item-bias of the scales. Given previous investigations with the CSS Observer Form, it was hypothesized that internal consistency would be adequate and that confirmatory factor analyses (CFA) of CSS-T data from 293 classrooms would offer empirical support for the CSS-T's Total, Composite and subscales, and yield a similar factor structure to that of the CSS Observer Form. Goodness-of-fit indices of χ2/df, Root Mean Square Error of Approximation, Goodness of Fit Index, and Adjusted Goodness of Fit Index suggested satisfactory fit of proposed CFA models whereas the Comparative Fit Index did not. Internal consistency estimates of .93 and .94 were obtained for the Instructional Strategies and Behavioral Strategies Total scales respectively. Adequate test-retest reliability was found for instructional and behavioral total scales (r = .79, r = .84, percent agreement 93% and 93%). The CSS-T evidences freedom from item bias on important teacher demographics (age, educational degree, and years of teaching experience). Implications of results are discussed. (c) 2015 APA, all rights reserved.
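Two of the fit indices named above follow directly from the model chi-square. The sketch below uses the common point-estimate formula for RMSEA with N - 1 in the denominator (some software uses N instead). The chi-square and degrees of freedom are hypothetical, since the abstract does not report them; only the sample size of 293 classrooms comes from the abstract.

```python
from math import sqrt


def chi2_df_ratio(chi2: float, df: int) -> float:
    """Normed chi-square (chi-square divided by its degrees of freedom)."""
    return chi2 / df


def rmsea(chi2: float, df: int, n: int) -> float:
    """Root mean square error of approximation (point estimate)."""
    return sqrt(max(chi2 - df, 0.0) / (df * (n - 1)))


# Hypothetical chi-square and df; n = 293 classrooms.
print(f"chi2/df = {chi2_df_ratio(180.0, 100):.2f}")
print(f"RMSEA   = {rmsea(180.0, 100, 293):.3f}")
```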
Assessing university students' self-efficacy to employ alcohol-related harm reduction strategies.
Rosenberg, Harold; Bonar, Erin E; Hoffmann, Erica; Kryszak, Elizabeth; Young, Kathleen M; Kraus, Shane W; Ashrafioun, Lisham; Bannon, Erin E; Pavlick, Michelle
2011-01-01
Develop and evaluate key psychometric properties of a self-report questionnaire specifically designed to assess student drinkers' self-confidence to employ a variety of strategies intended to reduce unhealthy consequences of high-risk drinking. Four hundred ninety-eight participants rated their confidence (from "not at all confident" to "completely confident") to employ 17 harm reduction strategies when drinking. Factor analysis and internal consistency reliability analyses indicated that the 17 items constitute a single scale with good test-retest reliability. Consistent with other research examining previous use of such strategies, women in our sample reported significantly higher harm reduction self-efficacy than did men. Harm reduction self-efficacy was also associated with reported number of high-risk drinking episodes in the previous 2 weeks. This brief and easily administered questionnaire holds promise as a clinical tool to identify individuals with low harm reduction self-efficacy and as an outcome measure for health promotion and educational interventions.
Clayson, Peter E; Miller, Gregory A
2017-01-01
Failing to consider psychometric issues related to reliability and validity, differential deficits, and statistical power potentially undermines the conclusions of a study. In research using event-related brain potentials (ERPs), numerous contextual factors (population sampled, task, data recording, analysis pipeline, etc.) can impact the reliability of ERP scores. The present review considers the contextual factors that influence ERP score reliability and the downstream effects that reliability has on statistical analyses. Given the context-dependent nature of ERPs, it is recommended that ERP score reliability be formally assessed on a study-by-study basis. Recommended guidelines for ERP studies include 1) reporting the threshold of acceptable reliability and reliability estimates for observed scores, 2) specifying the approach used to estimate reliability, and 3) justifying how trial-count minima were chosen. A reliability threshold for internal consistency of at least 0.70 is recommended, and a threshold of 0.80 is preferred. The review also advocates the use of generalizability theory for estimating score dependability (the generalizability theory analog to reliability) as an improvement on classical test theory reliability estimates, suggesting that the latter is less well suited to ERP research. To facilitate the calculation and reporting of dependability estimates, an open-source Matlab program, the ERP Reliability Analysis Toolbox, is presented. Copyright © 2016 Elsevier B.V. All rights reserved.
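The review favours generalizability-theory dependability estimates, which its toolbox implements. As a simpler classical illustration of checking a study's ERP scores against the 0.70 and 0.80 thresholds, the sketch below computes an odd-even split-half reliability with the Spearman-Brown correction. This is not the toolbox's method, and the single-trial amplitudes are hypothetical.

```python
from math import sqrt
from statistics import mean


def pearson_r(x: list[float], y: list[float]) -> float:
    """Pearson product-moment correlation."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def odd_even_reliability(trials_by_subject: list[list[float]]) -> float:
    """Split-half reliability of subject means, Spearman-Brown corrected."""
    odd = [mean(t[0::2]) for t in trials_by_subject]   # trials 1, 3, 5, ...
    even = [mean(t[1::2]) for t in trials_by_subject]  # trials 2, 4, 6, ...
    r_half = pearson_r(odd, even)
    return (2 * r_half) / (1 + r_half)


# Hypothetical single-trial amplitudes (microvolts) for four participants.
amplitudes = [[4.1, 3.8, 4.4, 4.0], [6.2, 5.9, 6.5, 6.1],
              [2.0, 2.6, 1.8, 2.2], [5.0, 4.6, 5.3, 4.9]]
rel = odd_even_reliability(amplitudes)
print(f"reliability = {rel:.2f}; meets 0.70: {rel >= 0.70}; meets 0.80: {rel >= 0.80}")
```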
Tepe, Rodger; Tepe, Chabha
2015-03-01
To develop and psychometrically evaluate an information literacy (IL) self-efficacy survey and an IL knowledge test. In this test-retest reliability study, a 25-item IL self-efficacy survey and a 50-item IL knowledge test were developed and administered to a convenience sample of 53 chiropractic students. Item analyses were performed on all questions. The IL self-efficacy survey demonstrated good reliability (test-retest correlation = 0.81) and good/very good internal consistency (mean κ = .56 and Cronbach's α = .92). A total of 25 questions with the best item analysis characteristics were chosen from the 50-item IL knowledge test, resulting in a 25-item IL knowledge test that demonstrated good reliability (test-retest correlation = 0.87), very good internal consistency (mean κ = .69, KR20 = 0.85), and good item discrimination (mean point-biserial = 0.48). This study resulted in the development of three instruments: a 25-item IL self-efficacy survey, a 50-item IL knowledge test, and a 25-item IL knowledge test. The information literacy self-efficacy survey and the 25-item version of the information literacy knowledge test have shown preliminary evidence of adequate reliability and validity to justify continuing study with these instruments.
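KR-20, the internal-consistency index reported for the knowledge test, is the special case of Cronbach's α for items scored right or wrong. A minimal sketch with made-up responses follows. It uses population variances throughout so that the total-score variance is on the same footing as the item p(1 - p) terms, which is one common convention.

```python
def kr20(responses: list[list[int]]) -> float:
    """Kuder-Richardson 20; responses[r][i] is 1 if respondent r got item i right."""
    n, k = len(responses), len(responses[0])
    # Proportion correct per item.
    p = [sum(row[i] for row in responses) / n for i in range(k)]
    sum_pq = sum(pi * (1 - pi) for pi in p)
    totals = [sum(row) for row in responses]
    mean_total = sum(totals) / n
    var_total = sum((t - mean_total) ** 2 for t in totals) / n  # population variance
    return (k / (k - 1)) * (1 - sum_pq / var_total)


# Hypothetical 5 students x 4 items.
toy = [[1, 1, 1, 0], [1, 0, 1, 0], [0, 0, 0, 0], [1, 1, 1, 1], [0, 1, 0, 0]]
print(f"KR-20 = {kr20(toy):.2f}")
```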
Tepe, Rodger; Tepe, Chabha
2015-01-01
Objective To develop and psychometrically evaluate an information literacy (IL) self-efficacy survey and an IL knowledge test. Methods In this test–retest reliability study, a 25-item IL self-efficacy survey and a 50-item IL knowledge test were developed and administered to a convenience sample of 53 chiropractic students. Item analyses were performed on all questions. Results The IL self-efficacy survey demonstrated good reliability (test–retest correlation = 0.81) and good/very good internal consistency (mean κ = .56 and Cronbach's α = .92). A total of 25 questions with the best item analysis characteristics were chosen from the 50-item IL knowledge test, resulting in a 25-item IL knowledge test that demonstrated good reliability (test–retest correlation = 0.87), very good internal consistency (mean κ = .69, KR20 = 0.85), and good item discrimination (mean point-biserial = 0.48). Conclusions This study resulted in the development of three instruments: a 25-item IL self-efficacy survey, a 50-item IL knowledge test, and a 25-item IL knowledge test. The information literacy self-efficacy survey and the 25-item version of the information literacy knowledge test have shown preliminary evidence of adequate reliability and validity to justify continuing study with these instruments. PMID:25517736
An evidence-based decision assistance model for predicting training outcome in juvenile guide dogs.
Harvey, Naomi D; Craigon, Peter J; Blythe, Simon A; England, Gary C W; Asher, Lucy
2017-01-01
Working dog organisations, such as Guide Dogs, need to regularly assess the behaviour of the dogs they train. In this study we developed a questionnaire-style behaviour assessment completed by training supervisors of juvenile guide dogs aged 5, 8 and 12 months old (n = 1,401), and evaluated aspects of its reliability and validity. Specifically, internal reliability, temporal consistency, construct validity, predictive criterion validity (comparing against later training outcome) and concurrent criterion validity (comparing against a standardised behaviour test) were evaluated. Thirty-nine questions were sourced either from previously published literature or created to meet requirements identified via Guide Dogs staff surveys and staff feedback. Internal reliability analyses revealed seven reliable and interpretable trait scales named according to the questions within them as: Adaptability; Body Sensitivity; Distractibility; Excitability; General Anxiety; Trainability and Stair Anxiety. Intra-individual temporal consistency of the scale scores between 5-8, 8-12 and 5-12 months was high. All scales except Body Sensitivity showed some degree of concurrent criterion validity. Predictive criterion validity was supported for all seven scales, since associations were found with training outcome at at least one age. Thresholds of z-scores on the scales were identified that were able to distinguish later training outcome by identifying 8.4% of all dogs withdrawn for behaviour and 8.5% of all qualified dogs, with 84% and 85% specificity. The questionnaire assessment was reliable and could detect traits that are consistent within individuals over time, despite juvenile dogs undergoing development during the study period. By applying thresholds to scores produced from the questionnaire this assessment could prove to be a highly valuable decision-making tool for Guide Dogs.
This is the first questionnaire-style assessment of juvenile dogs that has shown value in predicting the training outcome of individual working dogs.
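To make the threshold idea concrete, the sketch below shows how sensitivity and specificity could be computed for a single z-score cut-off. It assumes, for illustration only, that higher z-scores indicate higher risk of withdrawal; the scores and outcomes are hypothetical, and the abstract does not describe how the authors derived their thresholds.

```python
def sensitivity_specificity(z_scores: list[float], withdrawn: list[bool],
                            threshold: float) -> tuple[float, float]:
    """Flag dogs with z >= threshold and compare against the known outcome."""
    flagged = [z >= threshold for z in z_scores]
    tp = sum(f and w for f, w in zip(flagged, withdrawn))              # correctly flagged
    fn = sum((not f) and w for f, w in zip(flagged, withdrawn))        # missed withdrawals
    tn = sum((not f) and (not w) for f, w in zip(flagged, withdrawn))  # correctly cleared
    fp = sum(f and (not w) for f, w in zip(flagged, withdrawn))        # false alarms
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical scale z-scores and training outcomes for six dogs.
z = [2.0, 1.5, 0.1, -0.3, 0.2, 1.8]
outcome = [True, False, False, False, True, True]
sens, spec = sensitivity_specificity(z, outcome, threshold=1.0)
print(f"sensitivity = {sens:.2f}, specificity = {spec:.2f}")
```

Raising the threshold generally trades sensitivity for specificity, which fits the pattern reported above of few dogs identified at fairly high specificity.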
2012-01-01
Purpose To examine the psychometric properties of the Injection Pen Assessment Questionnaire (IPAQ) including the following: 1) item and scale characteristics (e.g., frequencies, item distributions, and factor structure), 2) reliability, and 3) validity. Methods Focus groups and one-on-one dyad interviews guided the development of the IPAQ. The IPAQ was subsequently tested in 136 parent–child dyads in a Phase 3, 2-month, open-label, multicenter trial for a new Genotropin® disposable pen. Factor analysis was performed to inform the development of a scoring algorithm, and reliability and validity of the IPAQ were evaluated using the data from this 2-month study. Psychometric analyses were conducted separately for each injection pen. Results Confirmatory factor analysis provided evidence supporting a second-order factor solution for four subscales and a total IPAQ score. These factor analysis results support the conceptual framework developed from previous qualitative research in patient dyads using the reusable pen. However, the IPAQ subscales did not consistently meet acceptable internal consistency reliability for some group level comparisons. Cronbach’s alphas for the total IPAQ score for both pens were 0.85, exceeding acceptable levels of reliability for group comparisons. Conclusions The total IPAQ score is a useful measure for evaluating ease of use and preference for injection pens in clinical trials among patient dyads receiving hGH. The psychometric properties of the individual subscales, mainly the lower internal consistency reliability of some of the subscales and the predictive validity findings, do not support the use of subscale scores alone as a primary endpoint. PMID:23046797
Reliability and validity of the Nurse Practitioners' Roles and Competencies Scale.
Lin, Li-Chun; Lee, Sheuan; Ueng, Steve Wen-Neng; Tang, Woung-Ru
2016-01-01
The objective of this study was to test the reliability and construct validity of the Nurse Practitioners' Roles and Competencies Scale. The role of nurse practitioners has attracted international attention. The advanced nursing role played by nurse practitioners varies with national conditions and medical environments. To date, no suitable measurement tool has been available for assessing the roles and competencies of nurse practitioners in Asian countries. Secondary analysis of data from three studies related to nurse practitioners' role competencies. We analysed data from 563 valid questionnaires completed in three studies to identify the factor structure of the Nurse Practitioners' Roles and Competencies Scale. To this end, we performed exploratory factor analysis using principal component analysis extraction with varimax orthogonal rotation. The internal consistency reliabilities of the overall scale and its subscales were examined using Cronbach's alpha coefficient. The scale had six factors: professionalism, direct care, clinical research, practical guidance, medical assistance, as well as leadership and reform. These factors explained 67·5% of the total variance in nurse practitioners' role competencies. Cronbach's alpha coefficient for the overall scale was 0·98, and those of its subscales ranged from 0·83-0·97. The internal consistency reliability and construct validity of the Nurse Practitioners' Roles and Competencies Scale were good. The high internal consistency reliabilities suggest item redundancy, which should be minimised by using item response theory to enhance the applicability of this questionnaire for future academic and clinical studies. The Nurse Practitioners' Roles and Competencies Scale can be used as a tool for assessing the roles and competencies of nurse practitioners in Taiwan. Our findings can also serve as a reference for other Asian countries to develop the nurse practitioner role. © 2015 John Wiley & Sons Ltd.
Reliability and validity of the Japanese version of the Resilience Scale and its short version.
Nishi, Daisuke; Uehara, Ritei; Kondo, Maki; Matsuoka, Yutaka
2010-11-17
The clinical relevance of resilience has received considerable attention in recent years. The aim of this study is to demonstrate the reliability and validity of the Japanese version of the Resilience Scale (RS) and short version of the RS (RS-14). The original English version of RS was translated to Japanese and the Japanese version was confirmed by back-translation. Participants were 430 nursing and university psychology students. The RS, Center for Epidemiologic Studies Depression Scale (CES-D), Rosenberg Self-Esteem Scale (RSES), Social Support Questionnaire (SSQ), Perceived Stress Scale (PSS), and Sheehan Disability Scale (SDS) were administered. Internal consistency, convergent validity and factor loadings were assessed at initial assessment. Test-retest reliability was assessed using data collected from 107 students at 3 months after baseline. Mean score on the RS was 111.19. Cronbach's alpha coefficients for the RS and RS-14 were 0.90 and 0.88, respectively. The test-retest correlation coefficients for the RS and RS-14 were 0.83 and 0.84, respectively. Both the RS and RS-14 were negatively correlated with the CES-D and SDS, and positively correlated with the RSES, SSQ and PSS (all p < 0.05), although the correlation between the RS and CES-D was somewhat lower than that in previous studies. Factor analyses indicated a one-factor solution for RS-14, but as for RS, the result was not consistent with previous studies. This study demonstrates that the Japanese version of RS has psychometric properties with high degrees of internal consistency, high test-retest reliability, and relatively low concurrent validity. RS-14 was equivalent to the RS in internal consistency, test-retest reliability, and concurrent validity. 
Low scores on the RS, a positive correlation between the RS and perceived stress, and a relatively low correlation between the RS and depressive symptoms in this study suggest that validity of the Japanese version of the RS might be relatively low compared with the original English version.
Moran, Robert W; Rushworth, Wendy M; Mason, Jesse
2017-12-01
Healthcare practitioner beliefs influence advice and management provided to patients with back pain. Several instruments measuring practitioner beliefs have been developed but psychometric properties for some have not been investigated. To investigate internal consistency, test-retest reliability and convergent validity of the Fear Avoidance Beliefs Tool (FABT), the Tampa Scale of Kinesiophobia for Health Care Providers (TSK-HC), the Back Pain Attitudes Questionnaire (Back-PAQ), and the Health Care Pain and Impairment Relationship Scale (HC-PAIRS). A secondary aim was to explore beliefs of New Zealand osteopaths and physiotherapists regarding low back pain. FABT, TSK-HC, Back-PAQ, and HC-PAIRS were administered twice, 14 days apart. Data from 91 osteopaths and 35 physiotherapists were analysed. The FABT, TSK-HC and Back-PAQ each demonstrated excellent internal consistency (Cronbach's α = 0.92, 0.91, and 0.91, respectively) and excellent test-retest reliability (lower limit of 95% CI for intraclass correlation coefficient >0.75). Correlations between instruments (Pearson's r = 0.51 to 0.77, p < 0.001) demonstrated good convergent validity. There was a medium to large effect (Cohen's d > 0.47) for mean differences in scores, for all instruments, between professions. This study found excellent internal consistency, test-retest reliability and good convergent validity for the FABT, TSK-HC, and Back-PAQ. Previously reported internal consistency, test-retest and convergent validity of the HC-PAIRS were confirmed, and test-retest reliability was excellent. There were significant scoring differences on each instrument between professions, and while both groups demonstrated fear avoidant beliefs, physiotherapist respondent scores indicated that as a group, they held fewer fear-avoidant beliefs than osteopath respondents. Copyright © 2017 Elsevier Ltd. All rights reserved.
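The between-profession comparison is summarized with Cohen's d. A minimal sketch of the pooled-standard-deviation form follows, using hypothetical scores. The abstract does not say which variant of d was computed, so the pooled form is an assumption.

```python
from math import sqrt
from statistics import mean, variance


def cohens_d(group_a: list[float], group_b: list[float]) -> float:
    """Standardized mean difference using the pooled sample SD."""
    na, nb = len(group_a), len(group_b)
    pooled_var = ((na - 1) * variance(group_a) + (nb - 1) * variance(group_b)) / (na + nb - 2)
    return (mean(group_a) - mean(group_b)) / sqrt(pooled_var)


# Hypothetical belief-questionnaire totals for two professions.
osteopaths = [52.0, 48.0, 55.0, 60.0, 47.0]
physiotherapists = [45.0, 50.0, 43.0, 49.0, 41.0]
print(f"Cohen's d = {cohens_d(osteopaths, physiotherapists):.2f}")
```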
Rosneck, James S; Hughes, Joel; Gunstad, John; Josephson, Richard; Noe, Donald A; Waechter, Donna
2014-01-01
This article describes the systematic construction and psychometric analysis of a knowledge assessment instrument for phase II cardiac rehabilitation (CR) patients measuring risk modification disease management knowledge and behavioral outcomes derived from national standards relevant to secondary prevention and management of cardiovascular disease. First, using adult curriculum based on disease-specific learning outcomes and competencies, a systematic test item development process was completed by clinical staff. Second, a panel of educational and clinical experts used an iterative process to identify test content domain and arrive at consensus in selecting items meeting criteria. Third, the resulting 31-question instrument, the Cardiac Knowledge Assessment Tool (CKAT), was piloted in CR patients to ensure use of application. Validity and reliability analyses were performed on 3638 adults before test administrations with additional focused analyses on 1999 individuals completing both pretreatment and posttreatment administrations within 6 months. Evidence of CKAT content validity was substantiated, with 85% agreement among content experts. Evidence of construct validity was demonstrated via factor analysis identifying key underlying factors. Estimates of internal consistency, for example, Cronbach's α = .852 and Spearman-Brown split-half reliability = 0.817 on pretesting, support test reliability. Item analysis, using point biserial correlation, measured relationships between performance on single items and total score (P < .01). Analyses using item difficulty and item discrimination indices further verified item stability and validity of the CKAT. A knowledge instrument specifically designed for an adult CR population was systematically developed and tested in a large representative patient population, satisfying psychometric parameters, including validity and reliability.
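The item-level statistics mentioned above can be sketched for a single dichotomous item as follows. Item difficulty is the proportion answering correctly, and the point-biserial is the correlation between the 0/1 item score and the total score. Here the total includes the item itself (the uncorrected form), which is an assumption; the data are hypothetical.

```python
from math import sqrt


def item_difficulty(item: list[int]) -> float:
    """Proportion of respondents answering the item correctly."""
    return sum(item) / len(item)


def point_biserial(item: list[int], totals: list[float]) -> float:
    """Point-biserial correlation between a 0/1 item and total scores."""
    n = len(item)
    p = sum(item) / n
    q = 1 - p
    right = [t for i, t in zip(item, totals) if i == 1]
    wrong = [t for i, t in zip(item, totals) if i == 0]
    mean_total = sum(totals) / n
    sd_total = sqrt(sum((t - mean_total) ** 2 for t in totals) / n)  # population SD
    return (sum(right) / len(right) - sum(wrong) / len(wrong)) / sd_total * sqrt(p * q)


# Hypothetical item responses and total test scores for six patients.
item = [1, 1, 0, 1, 0, 0]
totals = [28, 25, 17, 22, 19, 14]
print(f"difficulty = {item_difficulty(item):.2f}, r_pb = {point_biserial(item, totals):.2f}")
```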
Reliability and Validity of Assessing User Satisfaction With Web-Based Health Interventions
Lehr, Dirk; Reis, Dorota; Vis, Christiaan; Riper, Heleen; Berking, Matthias; Ebert, David Daniel
2016-01-01
Background The perspective of users should be taken into account in the evaluation of Web-based health interventions. Assessing the users’ satisfaction with the intervention they receive could enhance the evidence for the intervention effects. Thus, there is a need for valid and reliable measures to assess satisfaction with Web-based health interventions. Objective The objective of this study was to analyze the reliability, factorial structure, and construct validity of the Client Satisfaction Questionnaire adapted to Internet-based interventions (CSQ-I). Methods The psychometric quality of the CSQ-I was analyzed in user samples from 2 separate randomized controlled trials evaluating Web-based health interventions, one from a depression prevention intervention (sample 1, N=174) and the other from a stress management intervention (sample 2, N=111). At first, the underlying measurement model of the CSQ-I was analyzed to determine the internal consistency. The factorial structure of the scale and the measurement invariance across groups were tested by multigroup confirmatory factor analyses. Additionally, the construct validity of the scale was examined by comparing satisfaction scores with the primary clinical outcome. Results Multigroup confirmatory analyses on the scale yielded a one-factorial structure with a good fit (root-mean-square error of approximation =.09, comparative fit index =.96, standardized root-mean-square residual =.05) that showed partial strong invariance across the 2 samples. The scale showed very good reliability, indicated by McDonald omegas of .95 in sample 1 and .93 in sample 2. Significant correlations with change in depressive symptoms (r=−.35, P<.001) and perceived stress (r=−.48, P<.001) demonstrated the construct validity of the scale. 
Conclusions The proven internal consistency, factorial structure, and construct validity of the CSQ-I indicate a good overall psychometric quality of the measure to assess the user’s general satisfaction with Web-based interventions for depression and stress management. Multigroup analyses indicate its robustness across different samples. Thus, the CSQ-I seems to be a suitable measure to consider the user’s perspective in the overall evaluation of Web-based health interventions. PMID:27582341
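McDonald's omega, the reliability index reported here, can be computed from the loadings of a one-factor model. The sketch below assumes standardized loadings and uncorrelated residuals, so each residual variance is 1 minus the squared loading. The loadings are hypothetical and are not the CSQ-I estimates.

```python
def mcdonald_omega(loadings: list[float]) -> float:
    """Omega for a one-factor model with standardized loadings."""
    common = sum(loadings) ** 2                  # variance due to the factor
    unique = sum(1 - l ** 2 for l in loadings)   # summed residual variances
    return common / (common + unique)


# Hypothetical standardized loadings for an 8-item scale.
loadings = [0.82, 0.79, 0.85, 0.77, 0.80, 0.83, 0.76, 0.81]
print(f"omega = {mcdonald_omega(loadings):.2f}")
```

Unlike Cronbach's α, omega does not assume that all items load equally on the factor, which is one reason it is often reported alongside a confirmatory factor model.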
Reliability and Validity of Assessing User Satisfaction With Web-Based Health Interventions.
Boß, Leif; Lehr, Dirk; Reis, Dorota; Vis, Christiaan; Riper, Heleen; Berking, Matthias; Ebert, David Daniel
2016-08-31
The perspective of users should be taken into account in the evaluation of Web-based health interventions. Assessing the users' satisfaction with the intervention they receive could enhance the evidence for the intervention effects. Thus, there is a need for valid and reliable measures to assess satisfaction with Web-based health interventions. The objective of this study was to analyze the reliability, factorial structure, and construct validity of the Client Satisfaction Questionnaire adapted to Internet-based interventions (CSQ-I). The psychometric quality of the CSQ-I was analyzed in user samples from 2 separate randomized controlled trials evaluating Web-based health interventions, one from a depression prevention intervention (sample 1, N=174) and the other from a stress management intervention (sample 2, N=111). At first, the underlying measurement model of the CSQ-I was analyzed to determine the internal consistency. The factorial structure of the scale and the measurement invariance across groups were tested by multigroup confirmatory factor analyses. Additionally, the construct validity of the scale was examined by comparing satisfaction scores with the primary clinical outcome. Multigroup confirmatory analyses on the scale yielded a one-factorial structure with a good fit (root-mean-square error of approximation =.09, comparative fit index =.96, standardized root-mean-square residual =.05) that showed partial strong invariance across the 2 samples. The scale showed very good reliability, indicated by McDonald omegas of .95 in sample 1 and .93 in sample 2. Significant correlations with change in depressive symptoms (r=-.35, P<.001) and perceived stress (r=-.48, P<.001) demonstrated the construct validity of the scale. 
The proven internal consistency, factorial structure, and construct validity of the CSQ-I indicate a good overall psychometric quality of the measure to assess the user's general satisfaction with Web-based interventions for depression and stress management. Multigroup analyses indicate its robustness across different samples. Thus, the CSQ-I seems to be a suitable measure to consider the user's perspective in the overall evaluation of Web-based health interventions.
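As an illustration of the reliability index named in the abstract above, a minimal sketch of McDonald's omega for a one-factor model follows. It assumes standardized loadings and uncorrelated residuals, so each residual variance is 1 minus the squared loading. The loadings are hypothetical and are not taken from the CSQ-I study.

```python
def mcdonald_omega(loadings):
    """McDonald's omega for a one-factor model with standardized loadings.

    Assumption: residuals are uncorrelated, so each item's residual
    variance is 1 - loading**2.
    """
    loading_sum = sum(loadings)
    residual_sum = sum(1.0 - lam * lam for lam in loadings)
    return loading_sum ** 2 / (loading_sum ** 2 + residual_sum)


# Hypothetical loadings, for illustration only
omega = mcdonald_omega([0.8, 0.8, 0.8, 0.8])
print(round(omega, 3))
```

Higher and more uniform loadings push omega toward 1, which is the pattern a value such as .95 would reflect.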
Blouin, Danielle; Day, Andrew G.; Pavlov, Andrey
2011-01-01
Background Although never directly compared, structured interviews are reported as being more reliable than unstructured interviews. This study compared the reliability of both types of interview when applied to a common pool of applicants for positions in an emergency medicine residency program. Methods In 2008, one structured interview was added to the two unstructured interviews traditionally used in our resident selection process. A formal job analysis using the critical incident technique guided the development of the structured interview tool. This tool consisted of 7 scenarios assessing 4 of the domains deemed essential for success as a resident in this program. The traditional interview tool assessed 5 general criteria. In addition to these criteria, the unstructured panel members were asked to rate each candidate on the same 4 essential domains rated by the structured panel members. All 3 panels interviewed all candidates. Main outcomes were the overall, interitem, and interrater reliabilities, the correlations between interview panels, and the dimensionality of each interview tool. Results Thirty candidates were interviewed. The overall reliability reached 0.43 for the structured interview, and 0.81 and 0.71 for the unstructured interviews. Analyses of the variance components showed a high interrater, low interitem reliability for the structured interview, and a high interrater, high interitem reliability for the unstructured interviews. The summary measures from the 2 unstructured interviews were significantly correlated, but neither was correlated with the structured interview. Only the structured interview was multidimensional. Conclusions A structured interview did not yield a higher overall reliability than both unstructured interviews. The lower reliability is explained by a lower interitem reliability, which in turn is due to the multidimensionality of the interview tool. 
Both unstructured panels consistently rated a single dimension, even when prompted to assess the 4 specific domains established as essential to succeed in this residency program. PMID:23205201
Blouin, Danielle; Day, Andrew G; Pavlov, Andrey
2011-12-01
Although never directly compared, structured interviews are reported as being more reliable than unstructured interviews. This study compared the reliability of both types of interview when applied to a common pool of applicants for positions in an emergency medicine residency program. In 2008, one structured interview was added to the two unstructured interviews traditionally used in our resident selection process. A formal job analysis using the critical incident technique guided the development of the structured interview tool. This tool consisted of 7 scenarios assessing 4 of the domains deemed essential for success as a resident in this program. The traditional interview tool assessed 5 general criteria. In addition to these criteria, the unstructured panel members were asked to rate each candidate on the same 4 essential domains rated by the structured panel members. All 3 panels interviewed all candidates. Main outcomes were the overall, interitem, and interrater reliabilities, the correlations between interview panels, and the dimensionality of each interview tool. Thirty candidates were interviewed. The overall reliability reached 0.43 for the structured interview, and 0.81 and 0.71 for the unstructured interviews. Analyses of the variance components showed a high interrater, low interitem reliability for the structured interview, and a high interrater, high interitem reliability for the unstructured interviews. The summary measures from the 2 unstructured interviews were significantly correlated, but neither was correlated with the structured interview. Only the structured interview was multidimensional. A structured interview did not yield a higher overall reliability than both unstructured interviews. The lower reliability is explained by a lower interitem reliability, which in turn is due to the multidimensionality of the interview tool. 
Both unstructured panels consistently rated a single dimension, even when prompted to assess the 4 specific domains established as essential to succeed in this residency program.
Happell, Brenda; Byrne, Louise; Platania-Phung, Chris
2015-01-01
Recovery-oriented services are a goal for policy and practice in the Australian mental health service system. Evidence-based reform requires an instrument to measure knowledge of recovery concepts. The Recovery Knowledge Inventory (RKI) was designed for this purpose; however, its suitability and validity for student health professionals have not been evaluated. The purpose of the current article is to report the psychometric features of the RKI for measuring nursing students' views on recovery. The RKI, a self-report measure, consists of four scales: (I) Roles and Responsibilities, (II) Non-Linearity of the Recovery Process, (III) Roles of Self-Definition and Peers, and (IV) Expectations Regarding Recovery. Confirmatory and exploratory factor analyses of the baseline data (n = 167) were applied to assess validity and reliability. Exploratory factor analyses generally replicated the item structure suggested by the three main scales; however, more stringent analyses (confirmatory factor analysis) did not provide strong support for convergent validity. A refined RKI with 16 items had internal reliabilities of α = .75 for Roles and Responsibilities, α = .49 for Roles of Self-Definition and Peers, and α = .72 for Recovery as Non-Linear Process. If the RKI is to be applied to nursing student populations, the conceptual underpinning of the instrument needs to be reworked, and new items should be generated to evaluate and improve scale validity and reliability.
[Estimators of internal consistency in health research: the use of the alpha coefficient].
da Silva, Franciele Cascaes; Gonçalves, Elizandra; Arancibia, Beatriz Angélica Valdivia; Bento, Gisele Graziele; Castro, Thiago Luis da Silva; Hernandez, Salma Stephany Soleman; da Silva, Rudney
2015-01-01
Academic production has increased in the area of health, increasingly demanding high quality in publications of great impact. One of the ways to consider quality is through methods that increase the consistency of data analysis, such as reliability which, depending on the type of data, can be evaluated by different coefficients, especially the alpha coefficient. Based on this, the present review systematically gathers scientific articles produced in the last five years that used the α coefficient methodologically as a psychometric estimator of internal consistency and reliability in the processes of construction, adaptation and validation of instruments. The identification of the studies was conducted systematically in the databases BioMed Central Journals, Web of Science, Wiley Online Library, Medline, SciELO, Scopus, Journals@Ovid, BMJ and Springer, using inclusion and exclusion criteria. Data analyses were performed by means of triangulation, content analysis and descriptive analysis. It was found that most studies were conducted in Iran (f=3), Spain (f=2) and Brazil (f=2). These studies aimed to test the psychometric properties of instruments, with eight studies using the α coefficient to assess reliability and nine for assessing internal consistency. All studies were classified as methodological research when their objectives were analyzed. In addition, four studies were also classified as correlational and one as descriptive-correlational. It can be concluded that although the α coefficient is widely used as one of the main parameters for assessing internal consistency of questionnaires in health sciences, its use as an estimator of the trustworthiness of the methodology and of internal consistency has drawn some critiques that should be considered.
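For readers unfamiliar with the coefficient discussed in the review above, a minimal sketch of Cronbach's alpha follows. It uses the standard formula based on item variances and the variance of the total score. The response matrix is made up for illustration and does not come from any study listed here.

```python
import numpy as np


def cronbach_alpha(scores):
    """Cronbach's alpha for a (respondents x items) score matrix."""
    scores = np.asarray(scores, dtype=float)
    k = scores.shape[1]                          # number of items
    item_vars = scores.var(axis=0, ddof=1)       # sample variance of each item
    total_var = scores.sum(axis=1).var(ddof=1)   # variance of the summed score
    return (k / (k - 1)) * (1.0 - item_vars.sum() / total_var)


# Made-up responses: 5 respondents, 4 items
demo = np.array([
    [3, 4, 3, 4],
    [2, 2, 3, 2],
    [5, 4, 5, 5],
    [1, 2, 1, 2],
    [4, 4, 4, 3],
])
alpha = cronbach_alpha(demo)
print(round(alpha, 3))
```

Because alpha rises with the number of items as well as with their intercorrelation, a high value alone does not establish unidimensionality, which is one of the critiques the review alludes to.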
Boyacioglu, Inci; Akfirat, Serap
2015-01-01
The purpose of this study is to develop a valid and reliable measure for the phenomenology of autobiographical memories. The psychometric properties of the Autobiographical Memory Characteristics Questionnaire (AMCQ) were tested in three studies: the factor structure of the AMCQ was examined for childhood memories in Study 1 (N = 305); for autobiographical memories related to romantic relationships in Study 2 (N = 197); and for self-defining memories in Study 3 (N = 262). The exploratory factor analyses performed for each memory type demonstrated the consistency of the AMCQ factor structure across all memory types, while a confirmatory factor analysis on the data garnered from all three studies supported the constructs for the autobiographical memory characteristics defined by the researchers. The AMCQ consists of 63 items and 14 factors, and the internal consistency values of all 14 scales ranged between .66 and .97. The relationships between the AMCQ scales related to gender and individual emotions, as well as the intercorrelations among the scales, were consistent with both theoretical expectations and previous findings. The results of all three studies indicated that this new instrument is a reliable and robust measure for memory phenomenology.
A psychometric evaluation of the Rorschach comprehensive system's perceptual thinking index.
Dao, Tam K; Prevatt, Frances
2006-04-01
In this study, we investigated evidence for reliability and validity of the Perceptual Thinking Index (PTI; Exner, 2000a, 2000b) among an adult inpatient population. We conducted reliability and validity analyses on 107 patients who met the Diagnostic and Statistical Manual of Mental Disorders (4th ed., text revision; American Psychiatric Association, 2000) criteria for a schizophrenia-spectrum disorder (SSD) or mood disorder with no psychotic features (MD). Results provided support for interrater reliability as well as internal consistency of the PTI. Furthermore, the PTI was an effective index in differentiating SSD patients from patients diagnosed with an MD. Finally, the PTI demonstrated adequate diagnostic statistics that can be useful in the classification of patients diagnosed with SSD and MD. We discuss methodological issues, implications for assessment practice, and directions for future research.
Chiba, Rie; Umeda, Maki; Goto, Kyohei; Miyamoto, Yuki; Yamaguchi, Sosei; Kawakami, Norito
2017-01-01
The Recovery Knowledge Inventory (RKI) is one of the influential scales to assess knowledge and attitude toward recovery-oriented practices among mental health service providers. In the present study, we aimed to develop a Japanese version of the RKI and examine its validity and reliability. We translated the RKI into Japanese by reference to the guidelines for translating and adapting psychometric scales. A cross-sectional questionnaire survey was conducted with mental health service providers. Of a total of 475 eligible professionals, we used data from the 299 participants without missing values for the analyses (valid response rate = 62.9%). The questionnaire included the Japanese RKI, the Recovery Attitudes Questionnaire, the positive attitudes scale, and the Japanese-language version of the Social Distance Scale. To examine the factorial validity of the RKI, exploratory factor analysis (EFA) and confirmatory factor analysis (CFA) were employed. Convergent validity was assessed by calculating Pearson's correlation coefficients between the total RKI score and the scores for the other three scales. We also calculated Cronbach's α coefficients for the total score and for each domain of the RKI to assess internal consistency reliability. The participants' mean age was 40.4 years and 30.4% were men. The 20-item RKI did not provide any adequate or interpretable factor solution at any number of factors in the EFAs. Thus, four items (#1, 4, 5, and 13) were eliminated in stages, and the resulting 16-item RKI was used for further analyses. EFA with a four-factor structure yielded a marginally interpretable solution. The factors represented, respectively, knowledge regarding psychiatric symptoms and recovery; knowledge about the recovery process; the understanding of what is important for recovery; and the understanding of the challenges and responsibility in recovery. Subsequent CFA suggested good fit to the data.
Good convergent validity and understandable internal consistency reliability were also observed. The Japanese 16-item RKI revealed reasonable factorial validity, good convergent validity, and understandable internal consistency reliability among mental health professionals. Japanese cultural settings seemed to influence the four-factor structure in the present study. It can be used for future study in Japan, while future large-scale research is required to ensure robust verification.
[Developing Perceived Competence Scale (PCS) for Adolescents].
Özer, Arif; Gençtanirim Kurt, Dilek; Kizildağ, Seval; Demırtaş Zorbaz, Selen; Arici Şahın, Fatma; Acar, Tülin; Ergene, Tuncay
2016-01-01
In this study, the Perceived Competence Scale was developed to measure high school students' perceived competence. The scale development process was verified on three different samples. Participants were high school students from Ankara in the 2011-2012 academic terms. The numbers of participants in the exploratory factor analysis, confirmatory factor analysis and test-retest reliability samples were 372, 668 and 75, respectively. Internal consistency coefficients (Cronbach's and stratified α) were calculated separately for each group. The Factor 8.02 and LISREL 8.70 package programs were used for data analysis. According to the results of the analyses, internal consistency coefficients (α) were .90 - .93 for academic competence and .82 - .86 for social competence in the samples on which exploratory and confirmatory factor analyses were performed. For the whole scale, the internal consistency coefficient (stratified α) was .91. In the test-retest reliability analysis, adjusted correlation coefficients (r) were .94 for social competence and .90 for academic competence. In addition to the fit indexes and regression weights obtained from factor analysis, findings related to convergent and discriminant validity indicated that competence can be addressed in two dimensions: academic (16 items) and social (14 items).
Chahoud, M; Chahine, R; Salameh, P; Sauleau, E A
2017-06-01
Our goal is to validate and to verify the reliability of the French and English versions of the Insomnia Severity Index (ISI) in Lebanese adolescents. A cross-sectional study was implemented. 104 Lebanese students aged between 14 and 19 years participated in the study. The English version of the questionnaire was distributed to English-speaking students and the French version was administered to French-speaking students. A scale (1 to 7 with 1 = very well understood and 7 = not at all) was used to identify the level of the students' understanding of each instruction, question and answer of the ISI. The scale's structural validity was assessed. The factor structure of the ISI was evaluated by principal component analysis. The internal consistency of this scale was evaluated by Cronbach's alpha. To assess test-retest reliability, the intraclass correlation coefficient (ICC) was used. The principal component analysis confirmed the presence of a two-component factor structure in the English version and a three-component factor structure in the French version with eigenvalues > 1. The English version of the ISI had an excellent internal consistency (α = 0.90), while the French version had a good internal consistency (α = 0.70). The ICC presented an excellent agreement in the French version (ICC = 0.914, CI = 0.856-0.949) and a good agreement in the English one (ICC = 0.762, CI = 0.481-0.890). The Bland-Altman plots of the two versions of the ISI showed that the responses over two weeks were comparable and very few outliers were detected. The results of our analyses reveal that both English and French versions of the ISI scale have good internal consistency and are reproducible and reliable. Therefore, the ISI can be used to assess the prevalence of insomnia in Lebanese adolescents.
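The two test-retest statistics named in the abstract above can be sketched as follows. The abstract does not say which ICC form was used, so the choice of ICC(2,1) (two-way random effects, absolute agreement, single measurement) is an assumption. The Bland-Altman limits use the conventional mean difference plus or minus 1.96 standard deviations. The scores are made up.

```python
import numpy as np


def icc_2_1(x):
    """ICC(2,1) for a (subjects x occasions) matrix (assumed ICC form)."""
    x = np.asarray(x, dtype=float)
    n, k = x.shape
    grand = x.mean()
    ss_rows = k * ((x.mean(axis=1) - grand) ** 2).sum()   # between subjects
    ss_cols = n * ((x.mean(axis=0) - grand) ** 2).sum()   # between occasions
    ss_err = ((x - grand) ** 2).sum() - ss_rows - ss_cols
    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_err = ss_err / ((n - 1) * (k - 1))
    return (ms_rows - ms_err) / (
        ms_rows + (k - 1) * ms_err + k * (ms_cols - ms_err) / n
    )


def bland_altman(first, second):
    """Return the bias and the 95% limits of agreement."""
    diff = np.asarray(second, dtype=float) - np.asarray(first, dtype=float)
    bias = diff.mean()
    sd = diff.std(ddof=1)
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


# Made-up total scores at test and retest for 5 respondents
test_scores = [10, 12, 15, 20, 8]
retest_scores = [11, 12, 14, 21, 9]

icc = icc_2_1(np.column_stack([test_scores, retest_scores]))
bias, lower, upper = bland_altman(test_scores, retest_scores)
print(round(icc, 3), round(bias, 2), round(lower, 2), round(upper, 2))
```

The ICC summarizes agreement in a single number, while the Bland-Altman limits show how far an individual retest score can be expected to drift from the first, which is why the two are often reported together.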
[Psychometric properties of the Polish version of the Oldenburg Burnout Inventory (OLBI)].
Baka, Łukasz; Basińska, Beata A
2016-01-01
The objective of this study was to test the psychometric properties of the Polish version of the Oldenburg Burnout Inventory (OLBI): its factor structure, reliability, validity and standard norms. The study was conducted on 3 independent samples of 1804, 366 and 48 workers employed in social service and general service professions. To test the OLBI structure, an exploratory factor analysis was conducted. Reliability was assessed by means of Cronbach's α coefficient (internal consistency) and the test-retest method (stability over time), with a 6-week follow-up. The construct validity of the OLBI was tested by means of correlation analysis, using perceived stress and work engagement as the criterion variables. The result of the factor analysis confirmed a 2-factor structure of the Inventory, but the composition of each factor differed from that in the original version of the OLBI. Therefore, 2 separate factor analyses, one for each component of job burnout (exhaustion and disengagement from work), were conducted. The analyses revealed that each of the components consisted of 2 subscales. The reliability of the OLBI was supported by both methods. It was also shown that job burnout and its 2 components, exhaustion and disengagement from work, were positively correlated with perceived stress and negatively correlated with work engagement and its 3 components: vigor, absorption and dedication. Despite certain limitations, the Polish version of the OLBI shows satisfactory psychometric properties and can be used to measure job burnout in Polish conditions. This work is available in Open Access model and licensed under a CC BY-NC 3.0 PL license.
Rofail, Diana; Abetz, Linda; Viala, Muriel; Gait, Claire; Baladi, Jean-Francois; Payne, Krista
2009-01-01
This study assesses satisfaction with iron chelation therapy (ICT) based on a reliable and valid instrument, and explores the relationship between satisfaction and adherence to ICT. Patients in the USA and UK completed a new "Satisfaction with ICT" (SICT) instrument consisting of 28 items, three pertaining to adherence. Simple and multivariate regression analyses assessed the relationship between satisfaction with different aspects of ICT and adherence. First assessments of the SICT instrument indicate its validity and reliability. Recommended thresholds for internal consistency, convergent validity, discriminant validity, and floor and ceiling effects were met. A number of variables were identified in the simple linear regression analyses as significant predictors of "never thinking about stopping ICT," a proxy for adherence. These significant variables were entered into the multivariate model to assess the combined factor effects, explaining 42% of the total variance of "never thinking about stopping ICT." A significant and positive relationship was demonstrated between "never thinking about stopping ICT" and age (P = 0.04), Perceived Effectiveness of ICT (P = 0.003), low Burden of ICT (P = 0.002), and low Side Effects of ICT (P = 0.01). The SICT is a reliable and valid instrument which will be useful in ICT clinical trials. Furthermore, the administration of ICT by slow subcutaneous infusion negatively impacts on satisfaction with ICT which was shown to be a determinant of adherence. This points to the need for new more convenient and less burdensome oral iron chelators to increase adherence, and ultimately to improve patient outcomes.
Kang, Lin-Ju; Yen, Chia-Feng; Bedell, Gary; Simeonsson, Rune J; Liou, Tsan-Hon; Chi, Wen-Chou; Liu, Shu-Wen; Liao, Hua-Fang; Hwang, Ai-Wen
2015-03-01
Measurement of children's participation and environmental factors is a key component of the assessment in the new Disability Evaluation System (DES) in Taiwan. The Child and Adolescent Scale of Environment (CASE) was translated into Traditional Chinese (CASE-C) and used for assessing environmental factors affecting the participation of children and youth with disabilities in the DES. The aim of this study was to validate the CASE-C. Participants were 614 children and youth aged 6.0-17.9 years with disabilities, with the largest condition group comprised of children with intellectual disability (61%). Internal structure, internal consistency, test-retest reliability, convergent validity, and discriminant (known group) validity were examined using exploratory factor analyses, Cronbach's α coefficient, intra-class correlation coefficients (ICC), correlation analyses, and univariate ANOVAs. A three-factor structure (Family/Community Resources, Assistance/Attitude Supports, and Physical Design Access) of the CASE-C was produced with 38% variance explained. The CASE-C had adequate internal consistency (Cronbach's α=.74-.86) and test-retest reliability (ICCs=.73-.90). Children and youth with disabilities who had higher levels of severity of impairment encountered more environmental barriers and those experiencing more environmental problems also had greater restrictions in participation. The CASE-C scores were found to distinguish children on the basis of disability condition and impairment severity, but not on the basis of age or sex. The CASE-C is valid for assessing environmental problems experienced by children and youth with disabilities in Taiwan. Copyright © 2014 Elsevier Ltd. All rights reserved.
Reliability and concurrent validity of the Infant Motor Profile.
Heineman, Kirsten R; Middelburg, Karin J; Bos, Arend F; Eidhof, Lieke; La Bastide-Van Gemert, Sacha; Van Den Heuvel, Edwin R; Hadders-Algra, Mijna
2013-06-01
The Infant Motor Profile (IMP) is a qualitative assessment of motor behaviour in infancy. It consists of five domains: movement variation, variability, fluency, symmetry, and performance. The aim of this study was to assess interobserver reliability and concurrent validity of the IMP with the Alberta Infant Motor Scale (AIMS) and an age-specific neurological examination. Fifty-nine preterm infants (25 females, 34 males; median gestational age 29.7wks, median birthweight 1285g) and 146 term infants (74 females, 72 males; median gestational age 40.1wks, birthweight 3500g) were included. Assessments were performed at corrected ages of 4, 6, 10, 12, and 18 months and consisted of the IMP, AIMS, and an age-specific neurological examination. Interobserver reliability was investigated on a sample of 25 video recordings. Non-parametric statistics were used to analyse the data. Interobserver reliability was high (intraclass correlation coefficient 0.95). At all ages, AIMS scores correlated weakly to fairly with total IMP scores (Spearman's ρ 0.36-0.55), but moderately to strongly with scores on the performance domain of the IMP (Spearman's ρ 0.47-0.84). A clear relation was found between total IMP score and outcome of the neurological examination (Kruskal-Wallis p<0.001 at all ages). Interobserver reliability of the IMP is good. Concurrent validity with the AIMS is best for the IMP performance domain. Concurrent validity with age-specific neurological examination is very good. © The Authors. Developmental Medicine & Child Neurology © 2013 Mac Keith Press.
Yalin Sapmaz, Şermin; Özek Erkuran, Handan; Yalin, Nefize; Önen, Özlem; Öztekin, Siğnem; Kavurma, Canem; Köroğlu, Ertuğrul; Aydemir, Ömer
2017-12-01
This study aimed to assess the validity and reliability of the Turkish version of Diagnostic and Statistical Manual of Mental Disorders (DSM-5) Level 2 Anger Scale. The scale was prepared by translation and back translation of DSM-5 Level 2 Anger Scale. Study groups consisted of a clinical sample of cases diagnosed with depressive disorder and treated in a child and adolescent psychiatry unit and a community sample. The study was continued with 218 children and 160 parents. In the assessment process, child and parent forms of DSM-5 Level 2 Anger Scale and Children's Depression Inventory and Strengths and Difficulties Questionnaire-Parent Form were used. In the reliability analyses, the Cronbach alpha internal consistency coefficient values were found very high regarding child and parent forms. Item-total score correlation coefficients were high and very high, respectively, for child and parent forms indicating a statistical significance. As for construct validity, one factor was maintained for each form and was found to be consistent with the original form of the scale. As for concurrent validity, the child form of the scale showed significant correlation with Children's Depression Inventory, while the parent form showed significant correlation with Strengths and Difficulties Questionnaire-Parent Form. It was found that the Turkish version of DSM-5 Level 2 Anger Scale could be utilized as a valid and reliable tool both in clinical practice and for research purposes.
Newman-Beinart, Naomi A; Norton, Sam; Dowling, Dominic; Gavriloff, Dimitri; Vari, Chiara; Weinman, John A; Godfrey, Emma L
2017-06-01
There is no gold standard for measuring adherence to prescribed home exercise. Self-report diaries are commonly used; however, lack of standardisation, inaccurate recall and self-presentation bias limit their validity. A valid and reliable tool to assess exercise adherence behaviour is required. Consequently, this article reports the development and psychometric evaluation of the Exercise Adherence Rating Scale (EARS). Development of a questionnaire. Secondary care in physiotherapy departments of three hospitals. A focus group consisting of 8 patients with chronic low back pain (CLBP) and 2 physiotherapists was conducted to generate qualitative data. Following on from this, a convenience sample of 224 people with CLBP completed the initial 16-item EARS for purposes of subsequent validity and reliability analyses. Construct validity was explored using exploratory factor analysis and item response theory. Test-retest reliability was assessed 3 weeks later in a sub-sample of patients. An item pool consisting of 6 items was found suitable for factor analysis. Examination of the scale structure of these 6 items revealed a one factor solution explaining a total of 71% of the variance in adherence to exercise. The six items formed a unidimensional scale that showed good measurement properties, including acceptable internal consistency and high test-retest reliability. The EARS enables the measurement of adherence to prescribed home exercise. This may facilitate the evaluation of interventions promoting self-management for both the prevention and treatment of chronic conditions. Copyright © 2017 Chartered Society of Physiotherapy. Published by Elsevier Ltd. All rights reserved.
Developing and testing the CHORDS: Characteristics of Responsible Drinking Survey.
Barry, Adam E; Goodson, Patricia
2011-01-01
Report on the development and psychometric testing of a theoretically and evidence-grounded instrument, the Characteristics of Responsible Drinking Survey (CHORDS). Instrument subjected to four phases of pretesting (cognitive validity, cognitive and motivational qualities, pilot test, and item evaluation) and a final posttest implementation. Large public university in Texas. Randomly selected convenience sample (n = 729) of currently enrolled students. This 78-item questionnaire measures individuals' responsible drinking beliefs, motivations, intentions, and behaviors. Cronbach α, split-half reliability, principal components analysis and Spearman ρ were conducted to investigate reliability, stability, and validity. Measures in the CHORDS exhibited high internal consistency reliability and strong correlations of split-half reliability. Factor analyses indicated five distinct scales were present, as proposed in the theoretical model. Subscale composite scores also exhibited a correlation to alcohol consumption behaviors, indicating concurrent validity. The CHORDS represents the first instrument specifically designed to assess responsible drinking beliefs and behaviors. It was found to elicit valid and reliable data among a college student sample. This instrument holds much promise for practitioners who desire to empirically investigate dimensions of responsible drinking.
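Split-half reliability, one of the statistics listed in the abstract above, can be sketched as below. The abstract does not describe how the halves were formed, so the alternating-item split and the Spearman-Brown step-up are assumptions. The data are made up and unrelated to the CHORDS sample.

```python
import numpy as np


def split_half_reliability(scores):
    """Split-half reliability with the Spearman-Brown correction.

    Assumption: halves are formed by alternating items (1st, 3rd, ...
    versus 2nd, 4th, ...).
    """
    scores = np.asarray(scores, dtype=float)
    half_a = scores[:, 0::2].sum(axis=1)
    half_b = scores[:, 1::2].sum(axis=1)
    r = np.corrcoef(half_a, half_b)[0, 1]   # correlation between half scores
    return 2 * r / (1 + r)                  # step up to full test length


# Made-up responses: 5 respondents, 4 items
demo = np.array([
    [3, 4, 3, 4],
    [2, 2, 3, 2],
    [5, 4, 5, 5],
    [1, 2, 1, 2],
    [4, 4, 4, 3],
])
reliability = split_half_reliability(demo)
print(round(reliability, 3))
```

The correction is needed because each half is only half as long as the full instrument, so the raw correlation between halves understates the reliability of the whole scale.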
First-order reliability application and verification methods for semistatic structures
NASA Astrophysics Data System (ADS)
Verderaime, V.
1994-11-01
Escalating risks of aerostructures stimulated by increasing size, complexity, and cost should no longer be ignored in conventional deterministic safety design methods. The deterministic pass-fail concept is incompatible with probability and risk assessments; stress audits are shown to be arbitrary and incomplete, and the concept compromises the performance of high-strength materials. A reliability method is proposed that combines first-order reliability principles with deterministic design variables and conventional test techniques to surmount current deterministic stress design and audit deficiencies. Accumulative and propagation design uncertainty errors are defined and appropriately implemented into the classical safety-index expression. The application is reduced to solving for a design factor that satisfies the specified reliability and compensates for uncertainty errors, and then using this design factor as, and instead of, the conventional safety factor in stress analyses. The resulting method is consistent with current analytical skills and verification practices, the culture of most designers, and the development of semistatic structural designs.
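The classical safety-index expression mentioned in the abstract above can be illustrated with a short sketch. It assumes independent, normally distributed resistance (strength) and applied stress, which is the textbook first-order case. The uncertainty-error terms the author adds are not given in the abstract and are not modelled here. All numerical values are hypothetical.

```python
import math


def safety_index(mean_resistance, sd_resistance, mean_stress, sd_stress):
    """Classical safety index for independent normal resistance and stress."""
    return (mean_resistance - mean_stress) / math.sqrt(
        sd_resistance ** 2 + sd_stress ** 2
    )


def reliability_from_index(beta):
    """Standard normal CDF evaluated at the safety index."""
    return 0.5 * (1.0 + math.erf(beta / math.sqrt(2.0)))


# Hypothetical strength and stress statistics in consistent units
beta = safety_index(100.0, 5.0, 70.0, 5.0)
reliability = reliability_from_index(beta)
print(round(beta, 3), reliability)
```

A design factor of the kind the abstract describes would be chosen so that the resulting safety index meets the specified reliability; this sketch only shows the forward calculation from the distribution parameters to reliability.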
i-Assess: Evaluating the impact of electronic data capture for OSCE.
Monteiro, Sandra; Sibbald, Debra; Coetzee, Karen
2018-04-01
Tablet-based assessments offer benefits over scannable-paper assessments; however, little is known about their impact on the variability of assessment scores. Two studies were conducted to evaluate changes in rating technology. Rating modality (paper vs tablets) was manipulated between candidates (Study 1) and within candidates (Study 2). Average scores were analyzed using repeated measures ANOVA, Cronbach's alpha and generalizability theory. Post-hoc analyses included a Rasch analysis and McDonald's omega. Study 1 revealed a main effect of modality (F (1,152) = 25.06, p < 0.01). Average tablet-based scores were higher (3.39/5, 95% CI = 3.28 to 3.51) compared with average scan-sheet scores (3.00/5, 95% CI = 2.90 to 3.11). Study 2 also revealed a main effect of modality (F (1, 88) = 15.64, p < 0.01); however, the difference was reduced to 2%, with higher scan-sheet scores (3.36, 95% CI = 3.30 to 3.42) compared with tablet scores (3.27, 95% CI = 3.21 to 3.33). Internal consistency (alpha and omega) remained high (>0.8) and inter-station reliability remained constant (0.3). Rasch analyses showed no relationship between station difficulty and rating modality. Analyses of average scores may be misleading without an understanding of internal consistency and overall reliability of scores. Although updating to tablet-based forms did not result in systematic variations in scores, routine analyses ensured accurate interpretation of the variability of assessment scores. This study demonstrates the importance of ongoing program evaluation and data analysis.
Saad, Karen Ruggeri; Colombo, Alexandra S; João, Silvia M Amado
2009-01-01
The purpose of this study was to investigate the reliability and validity of photogrammetry in measuring the lateral spinal inclination angles. Forty subjects (32 females and 8 males) with a mean age of 23.4 ± 11.2 years had their scoliosis evaluated by radiographs of their trunk, determined by the Cobb angle method, and by photogrammetry. The statistical methods used included Cronbach alpha, Pearson/Spearman correlation coefficients, and regression analyses. The Cronbach alpha values indicated that the photogrammetric measures had high internal consistency, which indicated that the sample was bias free. The radiograph method proved more precise, with intrarater reliabilities of 0.936, 0.975, and 0.945 for the thoracic, lumbar, and thoracolumbar curves, respectively, and interrater reliabilities of 0.942 and 0.879 for the angular measures of the thoracic and thoracolumbar segments, respectively. The regression analyses revealed a high determination coefficient, although limited to the adjusted linear model between the radiographic and photographic measures. It was found that with more severe scoliosis, the lateral curve measures obtained with the photogrammetry were for the thoracic and lumbar regions (R = 0.619 and 0.551). The photogrammetric measures were found to be reproducible in this study and could be used as supplementary information to decrease the number of radiographs necessary for the monitoring of scoliosis.
Reliability of translated measures assessing dating violence among Mexican adolescents.
Hokoda, Audrey; Ramos-Lira, Luciana; Celaya, Patricia; Vilhauer, Keleigh; Angeles, Manuel; Ruíz, Serena; Malcarne, Vanessa L; Mora, Marina Duque
2006-02-01
Research on the prevalence and correlates of dating violence in Mexican teens is challenged by the lack of culturally and linguistically appropriate assessment tools. This study modified, translated, and back-translated the Conflict in Adolescent Dating Relationships Inventory (CADRI; Wolfe et al., 2001) and the Attitudes Towards Dating Violence Scales (Price, Byers, & the Dating Violence Research Team, 1999) for Mexican adolescents. Analyses on 307 adolescents (15-18 years old) from Monterrey and Mexicali, Mexico, revealed that most of the translated CADRI subscales and Attitudes Towards Dating Violence Scales had acceptable internal consistency and test-retest reliability coefficients. The study offers some evidence that the measures may be useful in assessing dating violence in Mexican teens.
Computing Inter-Rater Reliability for Observational Data: An Overview and Tutorial
Hallgren, Kevin A.
2012-01-01
Many research designs require the assessment of inter-rater reliability (IRR) to demonstrate consistency among observational ratings provided by multiple coders. However, many studies use incorrect statistical procedures, fail to fully report the information necessary to interpret their results, or do not address how IRR affects the power of their subsequent analyses for hypothesis testing. This paper provides an overview of methodological issues related to the assessment of IRR with a focus on study design, selection of appropriate statistics, and the computation, interpretation, and reporting of some commonly-used IRR statistics. Computational examples include SPSS and R syntax for computing Cohen’s kappa and intra-class correlations to assess IRR. PMID:22833776
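The paper's own examples are in SPSS and R. Purely as an illustration of one statistic it covers, the Python sketch below computes Cohen's kappa for two coders from its standard definition (observed agreement corrected for chance agreement). The function name and the ratings are hypothetical.

```python
from collections import Counter


def cohens_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two coders rating the same subjects on nominal codes."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("both coders must rate the same non-empty set of subjects")
    n = len(ratings_a)
    # Proportion of subjects on which the two coders agree.
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Agreement expected by chance from each coder's marginal code frequencies.
    counts_a, counts_b = Counter(ratings_a), Counter(ratings_b)
    expected = sum(counts_a[code] * counts_b[code] for code in counts_a) / n ** 2
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement equals 1")
    return (observed - expected) / (1 - expected)


# Hypothetical codes from two coders for ten observations.
coder_1 = ["y", "y", "n", "n", "y", "n", "y", "n", "y", "y"]
coder_2 = ["y", "n", "n", "n", "y", "n", "y", "y", "y", "y"]
print(round(cohens_kappa(coder_1, coder_2), 3))
```

This unweighted form suits nominal codes from exactly two coders; ordinal ratings or more than two coders call for other statistics, such as the intra-class correlations the paper also discusses.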
Design and testing of the Space Station Freedom Propellant Tank Assembly
NASA Technical Reports Server (NTRS)
Dudley, D. D.; Thonet, T. A.; Goforth, A. M.
1992-01-01
Propellant storage and management functions for the Propulsion Module of the U.S. Space Station Freedom are provided by the Propellant Tank Assembly (PTA). The PTA consists of a surface-tension type propellant acquisition device contained within a welded titanium pressure vessel. The PTA design concept was selected with high reliability and low program risk as primary goals in order to meet stringent NASA structural, expulsion, fracture control and reliability requirements. The PTA design makes use of Shuttle Orbital Maneuvering System and Peacekeeper Propellant Storage Assembly design and analysis techniques. This paper summarizes the PTA design solution and discusses the underlying detailed analyses. In addition, design verification and qualification test activities are discussed.
An integrated approach to system design, reliability, and diagnosis
NASA Technical Reports Server (NTRS)
Patterson-Hine, F. A.; Iverson, David L.
1990-01-01
The requirement for ultradependability of computer systems in future avionics and space applications necessitates a top-down, integrated systems engineering approach for design, implementation, testing, and operation. The functional analyses of hardware and software systems must be combined by models that are flexible enough to represent their interactions and behavior. The information contained in these models must be accessible throughout all phases of the system life cycle in order to maintain consistency and accuracy in design and operational decisions. One approach being taken by researchers at Ames Research Center is the creation of an object-oriented environment that integrates information about system components required in the reliability evaluation with behavioral information useful for diagnostic algorithms.
Psychometric evaluation and wording effects on the Chinese version of the parent-proxy Kid-KINDL.
Lee, Chih-Ting; Lin, Chung-Ying; Tsai, Meng-Che; Strong, Carol; Lin, Yi-Ching
2016-09-05
The pediatric quality of life (QoL) questionnaire, the child-rated Kid-KINDL, has wording effects. However, no studies have examined wording effects in its parallel questionnaire, the parent-proxy Kid-KINDL. This study aimed to examine the psychometric properties and wording effects of the parent-proxy Kid-KINDL. Parents with 8- to 12-year-old children (n = 247) completed the parent-proxy Kid-KINDL, 83 of them completed it again 7-14 days later, and 241 of their children completed the child-rated Kid-KINDL. Internal consistency was examined using Cronbach's α; test-retest reliability and concurrent validity, using Pearson correlation coefficients (r); construct validity and wording effects, using confirmatory factor analyses (CFAs). The internal consistency of the parent-proxy Kid-KINDL total score was acceptable (α = .86). Test-retest reliability (r = .33-.60) and concurrent validity (r = .27-.42) were acceptable or nearly acceptable for all subscales and the total score. The CFA models simultaneously accounting for QoL traits and wording effects had satisfactory fit indices, and outperformed the model accounting only for QoL traits. However, four subscales had unsatisfactory internal consistency, which might be attributable to wording effects. When children are unable to complete a QoL questionnaire, the parent-proxy Kid-KINDL can serve as a substitute, with due caution regarding wording effects and inconsistent reliability among different raters.
Krüger-Gottschalk, Antje; Knaevelsrud, Christine; Rau, Heinrich; Dyer, Anne; Schäfer, Ingo; Schellong, Julia; Ehring, Thomas
2017-11-28
The Posttraumatic Stress Disorder (PTSD) Checklist (PCL, now PCL-5) has recently been revised to reflect the new diagnostic criteria of the disorder. A clinical sample of trauma-exposed individuals (N = 352) was assessed with the Clinician Administered PTSD Scale for DSM-5 (CAPS-5) and the PCL-5. Internal consistencies and test-retest reliability were computed. To investigate diagnostic accuracy, we calculated receiver operating curves. Confirmatory factor analyses (CFA) were performed to analyze the structural validity. Results showed high internal consistency (α = .95), high test-retest reliability (r = .91) and a high correlation with the total severity score of the CAPS-5, r = .77. In addition, the recommended cutoff of 33 on the PCL-5 showed high diagnostic accuracy when compared to the diagnosis established by the CAPS-5. CFAs comparing the DSM-5 model with alternative models (the three-factor solution, the dysphoria, anhedonia, externalizing behavior and hybrid model) to account for the structural validity of the PCL-5 remained inconclusive. Overall, the findings show that the German PCL-5 is a reliable instrument with good diagnostic accuracy. However, more research evaluating the underlying factor structure is needed.
Scheffers, Mia; van Duijn, Marijtje A. J.; Bosscher, Ruud J.; Wiersma, Durk; Schoevers, Robert A.; van Busschbach, Jooske T.
2017-01-01
Background Body image has implications for psychosocial functioning and quality of life and its disturbance is reported in a broad range of psychiatric disorders. In view of the lack of instruments in Dutch measuring body image as a broad concept, we set out to make an instrument available that reflects the multidimensional character of this construct by including more dimensions than physical appearance. The Dresden Körperbildfragebogen (DBIQ, Dresden Body Image Questionnaire) particularly served this purpose. The DBIQ consists of 35 items and five subscales: body acceptance, sexual fulfillment, physical contact, vitality, and self-aggrandizement. The main objective of the present study was to evaluate the psychometric properties of the Dutch translation of the Dresden Body Image Questionnaire (DBIQ-NL) in a non-clinical sample. Methods The psychometric properties of the DBIQ-NL were examined in a non-clinical sample of 988 respondents aged between 18 and 65. We investigated the subscales' internal consistency and test-retest reliability. In order to establish construct validity we evaluated the association with a related construct, body cathexis, and with indices of self-esteem and psychological wellbeing. The factor structure of the DBIQ-NL was examined via confirmatory factor analysis (CFA). The equivalence of the measurement model across sex and age was evaluated by multiple-group confirmatory factor analyses. Results Confirmatory factor analyses showed a structure in accordance with the original scale, where model fit was improved significantly by moving one item to another subscale. Multiple-group confirmatory factor analysis across sex and age demonstrated partial strong invariance. Internal consistency was good with little overlap between the subscales. Temporal reliability and construct validity were satisfactory. Conclusion Results indicate that the DBIQ-NL is a reliable and valid instrument for non-clinical subjects. This provides a sound basis for further investigation of the DBIQ-NL in a clinical sample. PMID:28746387
The structure of harassment and abuse in the workplace: a factorial comparison of two measures.
Fendrich, Michael; Woodword, Paul; Richman, Judith A
2002-08-01
The structures of two measures examining negative experiences in the workplace, one focusing primarily on sexual harassment (SEQ) and one focusing on workplace abuse (GWA), were examined in detail. This article investigated whether the five subscales for the relatively unexplored measure (GWA) are reliably measured by a single underlying construct. It also investigated whether the two workplace-based measures are distinct but related constructs and the consistency of their factor structure across genders. Using a large and diverse organizational survey derived from a Midwestern university, analyses supported the distinctiveness of the two measures and showed that the factor structures for the two constructs were remarkably similar across genders. Analyses also suggested that indices of extreme behavior within each of the constructs were not reliably measured. The findings have important implications for data collection strategies in research focused on negative workplace experiences. This study provides considerable support for the continued use of both measures in research investigating the impact of adverse workplace environment on health.
Dayton, Melody; Koskinen, Mikko T; Tom, Bradley K; Mattila, Anna-Maria; Johnston, Eric; Halverson, Joy; Fantin, Dennis; DeNise, Sue; Budowle, Bruce; Smith, David Glenn; Kanthaswamy, Sree
2009-01-01
Aim To develop a reagent kit that enables multiplex polymerase chain reaction (PCR) amplification of 18 short tandem repeats (STR) and the canine sex-determining Zinc Finger marker. Methods Validation studies to determine the robustness and reliability in forensic DNA typing of this multiplex assay included sensitivity testing, reproducibility studies, intra- and inter-locus color balance studies, annealing temperature and cycle number studies, peak height ratio determination, characterization of artifacts such as stutter percentages and dye blobs, mixture analyses, species-specificity, case type samples analyses and population studies. Results The kit robustly amplified domesticated dog samples and consistently generated full 19-locus profiles from as little as 125 pg of dog DNA. In addition, wolf DNA samples could be analyzed with the kit. Conclusion The kit, which produces robust, reliable, and reproducible results, will be made available for the forensic research community after modifications based on this study’s evaluation to comply with the quality standards expected for forensic casework. PMID:19480022
Screening utility of the social anxiety screening scale in Spanish speaking adolescents.
Piqueras, José Antonio; Olivares, José; Hidalgo, María Dolores
2012-07-01
The aim of this study was to analyse the screening utility of the Social Anxiety Screening Scale (SASS/EDAS) in a sample of 227 adolescents with social anxiety disorder and 156 without it (14-17 years). Results showed that the EDAS subscales (Avoidance, Distress and Interference) scores were reliable in terms of internal consistency (alpha > .80). All the subscales discriminated between adolescents with and without the disorder. They also showed a positive and significant correlation with other empirically validated measures of social anxiety. The three subscales indicated relevant sensitivity (69.16-84.14%), specificity (63.46-66.03%) and areas under the curve (.74-.81). Binary logistic regression analyses indicated the adequate predictive utility of EDAS subscales, with the Distress subscale as the best diagnostic predictor. The data provide empirical evidence of the usefulness of EDAS as a screener for adolescent social anxiety disorder in terms of reliability, convergent and discriminant validity, diagnostic accuracy and clinical usefulness.
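Sensitivity and specificity, as reported above, are simple proportions from a screening result cross-tabulated against the diagnosis. The sketch below shows the standard calculation; the data are hypothetical and deliberately unrelated to the study's sample.

```python
def sensitivity_specificity(has_disorder, screen_positive):
    """Return (sensitivity, specificity) from paired boolean lists."""
    pairs = list(zip(has_disorder, screen_positive))
    true_pos = sum(1 for d, s in pairs if d and s)
    false_neg = sum(1 for d, s in pairs if d and not s)
    true_neg = sum(1 for d, s in pairs if not d and not s)
    false_pos = sum(1 for d, s in pairs if not d and s)
    if true_pos + false_neg == 0 or true_neg + false_pos == 0:
        raise ValueError("need at least one case with and one without the disorder")
    sensitivity = true_pos / (true_pos + false_neg)  # cases correctly flagged
    specificity = true_neg / (true_neg + false_pos)  # non-cases correctly cleared
    return sensitivity, specificity


# Hypothetical screening outcome: 10 cases and 10 non-cases.
diagnosis = [True] * 10 + [False] * 10
screen = [True] * 8 + [False] * 2 + [True] * 4 + [False] * 6
print(sensitivity_specificity(diagnosis, screen))
```

Both values depend on the cutoff applied to the scale score, which is why the area under the curve is usually reported alongside them as a summary across all cutoffs.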
Bergmeister, Konstantin D; Gröger, Marion; Aman, Martin; Willensdorfer, Anna; Manzano-Szalai, Krisztina; Salminger, Stefan; Aszmann, Oskar C
2016-08-01
Skeletal muscle consists of different fiber types which adapt to exercise, aging, disease, or trauma. Here we present a protocol for fast staining, automatic acquisition, and quantification of fiber populations with ImageJ. Biceps and lumbrical muscles were harvested from Sprague-Dawley rats. Quadruple immunohistochemical staining was performed on single sections using antibodies against myosin heavy chains and secondary fluorescent antibodies. Slides were scanned automatically with a slide scanner. Manual and automatic analyses were performed and compared statistically. The protocol provided rapid and reliable staining for automated image acquisition. Analyses between manual and automatic data indicated Pearson correlation coefficients for biceps of 0.645-0.841 and 0.564-0.673 for lumbrical muscles. Relative fiber populations were accurate to a degree of ± 4%. This protocol provides a reliable tool for quantification of muscle fiber populations. Using freely available software, it decreases the required time to analyze whole muscle sections. Muscle Nerve 54: 292-299, 2016. © 2016 Wiley Periodicals, Inc.
O'Neil, Margaret E; Fragala-Pinkham, Maria; Lennon, Nancy; George, Ameeka; Forman, Jeffrey; Trost, Stewart G
2016-01-01
Physical therapy for youth with cerebral palsy (CP) who are ambulatory includes interventions to increase functional mobility and participation in physical activity (PA). Thus, reliable and valid measures are needed to document PA in youth with CP. The purpose of this study was to evaluate the inter-instrument reliability and concurrent validity of 3 accelerometer-based motion sensors with indirect calorimetry as the criterion for measuring PA intensity in youth with CP. Fifty-seven youth with CP (mean age=12.5 years, SD=3.3; 51% female; 49.1% with spastic hemiplegia) participated. Inclusion criteria were: aged 6 to 20 years, ambulatory, Gross Motor Function Classification System (GMFCS) levels I through III, able to follow directions, and able to complete the full PA protocol. Protocol activities included standardized activity trials with increasing PA intensity (resting, writing, household chores, active video games, and walking at 3 self-selected speeds), as measured by weight-relative oxygen uptake (in mL/kg/min). During each trial, participants wore bilateral accelerometers on the upper arms, waist/hip, and ankle and a portable indirect calorimeter. Intraclass coefficient correlations (ICCs) were calculated to evaluate inter-instrument reliability (left-to-right accelerometer placement). Spearman correlations were used to examine concurrent validity between accelerometer output (activity and step counts) and indirect calorimetry. Friedman analyses of variance with post hoc pair-wise analyses were conducted to examine the validity of accelerometers to discriminate PA intensity across activity trials. All accelerometers exhibited excellent inter-instrument reliability (ICC=.94-.99) and good concurrent validity (rho=.70-.85). All accelerometers discriminated PA intensity across most activity trials. This PA protocol consisted of controlled activity trials. Accelerometers provide valid and reliable measures of PA intensity among youth with CP. 
© 2016 American Physical Therapy Association.
Güner, Olcay
2017-03-01
The Early Maladaptive Schema Questionnaires Set for Children and Adolescents (SQS) was developed to assess early maladaptive schemas in children between the ages of 10 and 16 in Turkey. The SQS consists of five questionnaires that represent five schema domains in Young's schema theory. Psychometric properties (n = 983) and normative values (n = 2,250) of SQS were investigated in children and adolescents between the ages of 10 and 16. Both exploratory and confirmatory factor analyses were performed. Results revealed 15 schema factors under five schema domains, with good fit indexes. A total of 14 schema factors were in line with Young's early maladaptive schemas. In addition to these factors, one new schema emerged: self-disapproval. Reliability analyses showed that SQS has high internal consistency and consistency over a 1-month interval. Correlations of SQS with the Adjective Check List (ACL), the Inventory of Parent and Peer Attachment (IPPA), the Symptom Assessment (SA-45) and the Young Schema Questionnaire (YSQ) were investigated to assess criterion validity, and the correlations revealed encouraging results. SQS significantly differentiated between children who have clinical diagnoses (n = 78) and children who have no diagnosis (n = 100). Finally, general normative values (n = 2,250) were determined for age groups, gender and age/gender groups. In conclusion, the early maladaptive schema questionnaires set for children and adolescents turned out to be a reliable and valid questionnaire with standard scores. Copyright © 2016 John Wiley & Sons, Ltd. The early maladaptive schema questionnaires set for children and adolescents (SQS) is a psychometrically reliable and valid measure of early maladaptive schemas for children between the ages of 10 and 16. SQS consists of five schema domains that represent Young's schema domains including 15 early maladaptive schemas and 97 items. Normative values for each schema were determined for age, gender and age/gender groups. 
Clinically, SQS presents valuable information about early maladaptive schemas during childhood and adolescence, before such schemas become more pervasive and persistent. Copyright © 2016 John Wiley & Sons, Ltd.
Yalin Sapmaz, Şermin; Ergin, Dilek; Şen Celasin, Nesrin; Karaarslan, Duygu; Öztürk, Masum; Özek Erkuran, Handan; Köroğlu, Ertuğrul; Aydemir, Ömer
2017-12-01
This study aimed to assess the validity and reliability of the Turkish version of the Diagnostic and Statistical Manual of Mental Disorders (5th ed.) (DSM-5) Social Anxiety Disorder Severity Scale - Child Form. The scale was prepared by carrying out the translation and back translation of the DSM-5 Social Anxiety Disorder Severity Scale - Child Form. The study group consisted of 31 patients who had been treated in a child psychiatry unit and diagnosed with social anxiety disorder and 99 healthy volunteers who were attending middle or high school during the study period. For the assessment, the Screen for Child Anxiety and Related Emotional Disorders (SCARED) was also used along with the DSM-5 Social Anxiety Disorder Severity Scale - Child Form. Regarding reliability analyses, Cronbach's alpha internal consistency coefficient was calculated as 0.941, while item-total score correlation coefficients ranged between 0.566 and 0.866. The test-retest correlation coefficient was r = 0.711. As for construct validity, one factor that could explain 66.0% of the variance was obtained. As for concurrent validity, the scale showed a high correlation with the SCARED. It was concluded that the Turkish version of the DSM-5 Social Anxiety Disorder Severity Scale - Child Form could be utilized as a valid and reliable tool both in clinical practice and for research purposes.
Evaluation of the Swedish version of the Child Drawing: Hospital Manual.
Wennström, Berith; Nasic, Salmir; Hedelin, Hans; Bergh, Ingrid
2011-05-01
This paper is a report of psychometric testing of the Swedish version of the Child Drawing: Hospital Manual. Drawings have been shown to be useful in assessing emotional status and anxiety in children because children generally speak to us more clearly and openly through their drawings than they are willing or able to verbally. The Child Drawing: Hospital Manual was translated into Swedish according to World Health Organization guidelines (a routine procedure for translation of English instruments) in order to assess anxiety by analysing the drawings of 59 children (5-11 years), of whom nine were girls and 50 boys, undergoing day surgery during 2007-2009. Inter-rater reliability (five independent scorers) was high and internal consistency reliability was good (coefficient alpha = 0.77). Parts A and C, as well as the total scale score of the Child Drawing: Hospital Manual, discriminated anxiety significantly between the group of children undergoing day surgery and a comparison group of school children, indicating adequate construct validity. For the Swedish version of the Child Drawing: Hospital Manual, our study demonstrates evidence for adequate construct validity in Parts A and C (and total scale score), high inter-rater reliability and acceptable internal consistency reliability. However, some improvements are needed before the instrument will be a clinically useful assessment of anxiety in children undergoing day surgery. © 2011 Blackwell Publishing Ltd.
Risk-Informed Mean Recurrence Intervals for Updated Wind Maps in ASCE 7-16.
McAllister, Therese P; Wang, Naiyu; Ellingwood, Bruce R
2018-05-01
ASCE 7 is moving toward adopting load requirements that are consistent with risk-informed design goals characteristic of performance-based engineering (PBE). ASCE 7-10 provided wind maps that correspond to return periods of 300, 700, and 1,700 years for Risk Categories I, II, and combined III/IV, respectively. The risk targets for Risk Categories III and IV buildings and other structures (designated as essential facilities) are different in PBE. The reliability analyses reported in this paper were conducted using updated wind load data to (1) confirm that the return periods already in ASCE 7-10 were also appropriate for risk-informed PBE, and (2) determine a new risk-based return period for Risk Category IV. The use of data for the wind directionality factor, K_d, which has become available from recent wind tunnel tests, revealed that reliabilities associated with wind load combinations for Risk Category II structures are, in fact, consistent with the reliabilities associated with the ASCE 7 gravity load combinations. This paper shows that the new wind maps in ASCE 7-16, which are based on return periods of 300, 700, 1,700, and 3,000 years for Risk Categories I, II, III, and IV, respectively, achieve the reliability targets in Section 1.3.1.3 of ASCE 7-16 for nonhurricane wind loads.
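To give the return periods some intuition, the sketch below converts a mean recurrence interval into the probability that the mapped wind speed is exceeded at least once during a structure's service life. It assumes independent years with a constant annual exceedance probability of 1/T, a standard simplification rather than the paper's reliability analysis. The 50-year service life is an assumed example value.

```python
def exceedance_probability(return_period_years, exposure_years):
    """Probability of at least one exceedance in the exposure period,
    assuming independent years with annual exceedance probability 1/T."""
    if return_period_years < 1 or exposure_years < 0:
        raise ValueError("return period must be >= 1 year and exposure >= 0 years")
    annual_probability = 1.0 / return_period_years
    return 1.0 - (1.0 - annual_probability) ** exposure_years


# Return periods named in the abstract, with an assumed 50-year service life.
for period in (300, 700, 1700, 3000):
    print(period, round(exceedance_probability(period, 50), 4))
```

Longer return periods for higher risk categories therefore translate directly into lower lifetime exceedance probabilities for the design wind speed.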
Test battery for measuring the perception and recognition of facial expressions of emotion
Wilhelm, Oliver; Hildebrandt, Andrea; Manske, Karsten; Schacht, Annekathrin; Sommer, Werner
2014-01-01
Despite the importance of perceiving and recognizing facial expressions in everyday life, there is no comprehensive test battery for the multivariate assessment of these abilities. As a first step toward such a compilation, we present 16 tasks that measure the perception and recognition of facial emotion expressions, and data illustrating each task's difficulty and reliability. The scoring of these tasks focuses on either the speed or accuracy of performance. A sample of 269 healthy young adults completed all tasks. In general, accuracy and reaction time measures for emotion-general scores showed acceptable and high estimates of internal consistency and factor reliability. Emotion-specific scores yielded lower reliabilities, yet high enough to encourage further studies with such measures. Analyses of task difficulty revealed that all tasks are suitable for measuring emotion perception and emotion recognition related abilities in normal populations. PMID:24860528
Hybrid propulsion technology program
NASA Technical Reports Server (NTRS)
1990-01-01
Technology was identified which will enable application of hybrid propulsion to manned and unmanned space launch vehicles. Two design concepts are proposed. The first is a hybrid propulsion system using the classical method of regression (classical hybrid) resulting from the flow of oxidizer across a fuel grain surface. The second system uses a self-sustaining gas generator (gas generator hybrid) to produce a fuel-rich exhaust that is mixed with oxidizer in a separate combustor. Both systems offer cost and reliability improvement over the existing solid rocket booster and proposed liquid boosters. The designs were evaluated using life cycle cost and reliability. The program consisted of: (1) identification and evaluation of candidate oxidizers and fuels; (2) preliminary evaluation of booster design concepts; (3) preparation of a detailed point design including life cycle costs and reliability analyses; (4) identification of those hybrid specific technologies needing improvement; and (5) preparation of a technology acquisition plan and large scale demonstration plan.
Psychometric evaluation of the Revised Professional Practice Environment (RPPE) scale.
Erickson, Jeanette Ives; Duffy, Mary E; Ditomassi, Marianne; Jones, Dorothy
2009-05-01
The purpose was to examine the psychometric properties of the Revised Professional Practice Environment (RPPE) scale. Despite renewed focus on studying health professionals' practice environments, there are still few reliable and valid instruments available to assist nurse administrators in decision making. A psychometric evaluation using a random-sample cross-validation procedure (calibration sample [CS], n = 775; validation sample [VS], n = 775) was undertaken. Cronbach alpha internal consistency reliability of the total score (r = 0.93 [CS] and 0.92 [VS]), resulting subscale scores (r range: 0.80-0.87 [CS], 0.81-0.88 [VS]), and principal components analyses with Varimax rotation and Kaiser normalization (8 components, 59.2% variance [CS], 59.7% [VS]) produced almost identical results in both samples. The multidimensional RPPE is a psychometrically sound measure of 8 components of the professional practice environment in the acute care setting and sufficiently reliable and valid for use as independent subscales in healthcare research.
Assessing the competences associated with a nursing Bachelor thesis by means of rubrics.
Llaurado-Serra, M; Rodríguez, E; Gallart, A; Fuster, P; Monforte-Royo, C; De Juan, M Á
2018-07-01
Writing a Bachelor thesis is the last step in obtaining a university degree. The thesis may be job- or research-orientated, but it must demonstrate certain degree-level competences. Rubrics are a useful way of unifying the assessment criteria. To design a system of rubrics for assessing the competences associated with the Bachelor thesis of a nursing degree, to examine the system's reliability and validity and to analyse results in relation to the final thesis mark. Cross-sectional and psychometric study conducted between 2012 and 2014. Nursing degree at a Spanish university. Twelve tutors who designed the system of rubrics. Students (n = 76) who wrote their Bachelor thesis during the 2013-2014 academic year. After deciding which aspects would be assessed, who would assess them and when, the tutors developed seven rubrics (drafting process, assessment of the written thesis by the supervisor and by a panel, student self-assessment, peer assessment, tutor evaluation of the peer assessment and panel assessment of the viva). We analysed the reliability (inter-rater and internal consistency) and validity (convergent and discriminant) of the rubrics, and also the relationship between the competences assessed and the final thesis mark. All the rubrics had internal consistency coefficients >0.80. The rubric for oral communication skills (viva) yielded inter-rater reliability of 0.95. Factor analysis indicated a unidimensional structure for all but one of the rubrics, the exception being the rubric for peer assessment, which had a two-factor structure. The main competences associated with a good quality Bachelor thesis were written communication skills and the ability to work independently. The assessment system based on seven rubrics is shown to be valid and reliable. Writing a Bachelor thesis requires a range of degree-level competences and it offers nursing students the opportunity to develop their evidence-based practice skills. Copyright © 2018 Elsevier Ltd. 
All rights reserved.
[Design and validation of a brief questionnaire to assess young people's sexual knowledge].
Leon-Larios, Fátima; Gómez-Baya, Diego
2018-06-01
Very few instruments have been developed to assess sexual knowledge and practices. Most of the research to date has been carried out with adolescent samples, but not with university students, who are also at a particularly risky stage. The aim of this study was to design and validate a brief questionnaire to assess young people's sexual knowledge, practices and behaviors in order to design health education programs in the university context. We created a specific questionnaire about sexual patterns in university students and a brief questionnaire consisting of 9 true/false items about contraception, sexuality and sexually transmitted diseases. We carried out a pilot study, a reliability analysis (KR-20) and validity analyses using factor analysis and by examining the association with other variables. A total of 566 students from the University of Seville participated during 2015/16. One item was eliminated because of poor comprehension (only 13.9% of correct answers) and weak or non-significant associations (p > 0.05). The final scale comprised 8 items and had good internal consistency reliability (KR-20 = 0.57), and both factorial and external validity. A three-factor model showed good data fit, χ2 (14, N=566)=17.48, p= 0.232, Comparative Fit Index (CFI) = 0.97, root mean square error of approximation (RMSEA) = 0.02. Participants with less knowledge about sexuality were those who had not received any information (M=6.82, SD=1.41), had no partner (M=6.87, SD=1.35), had had an abortion (M=6.43, SD=1.95), used no contraceptive method (M=6.66, SD=0.58) or used coitus interruptus (M=6.55, SD=1.39), and had less frequent sexual relationships, e.g., once or twice a year (M=6.49, SD=1.70). This questionnaire is a short instrument to assess students' practices and knowledge about sexuality and contraception. The analyses of reliability and validity have shown the good psychometric properties of this instrument.
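KR-20, the reliability coefficient named in this abstract, is the special case of Cronbach's alpha for dichotomous (true/false) items. A minimal sketch follows, assuming the population variance of total scores (conventions differ on n versus n-1). The response matrix is hypothetical.

```python
from statistics import pvariance


def kr20(responses):
    """Kuder-Richardson 20 for rows of 0/1 item scores (1 = correct answer)."""
    n_items = len(responses[0])
    n_people = len(responses)
    if n_items < 2 or any(len(row) != n_items for row in responses):
        raise ValueError("need at least two items and equal-length rows")
    # p is the proportion answering each item correctly; p * (1 - p) is its variance.
    proportions = [sum(row[j] for row in responses) / n_people for j in range(n_items)]
    item_variance_sum = sum(p * (1 - p) for p in proportions)
    # Population variance of total scores (assumed convention).
    total_variance = pvariance([sum(row) for row in responses])
    return n_items / (n_items - 1) * (1 - item_variance_sum / total_variance)


# Hypothetical answers from 5 students on 4 true/false items.
answers = [[1, 1, 1, 1], [1, 1, 1, 0], [1, 1, 0, 0], [1, 0, 0, 0], [0, 0, 0, 0]]
print(round(kr20(answers), 3))
```

Short scales with heterogeneous content tend to produce lower KR-20 values, which is worth bearing in mind when reading coefficients from brief knowledge tests.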
Eakman, Aaron M; Carlson, Mike E; Clark, Florence A
2010-01-01
The Meaningful Activity Participation Assessment (MAPA), a recently developed 28-item tool designed to measure the meaningfulness of activity, was tested in a sample of 154 older adults. The MAPA evidenced a sufficient level of internal consistency and test-retest reliability and correlated as theoretically predicted with the Life Satisfaction Index-Z, the Satisfaction with Life Scale, the Engagement in Meaningful Activities Survey, the Purpose in Life Test, the Center for Epidemiologic Studies Depression Inventory and the Rand SF-36v2 Health Survey subscales. Zero-order correlations consistently demonstrated meaningful relationships between the MAPA and scales of psychosocial well-being and health-related quality of life. Results from multiple regression analyses further substantiated these findings, as greater meaningful activity participation was associated with better psychological well-being and health-related quality of life. The MAPA appears to be a reliable and valid measure of meaningful activity, incorporating both subjective and objective indicators of activity engagement.
EAKMAN, AARON M.; CARLSON, MIKE E.; CLARK, FLORENCE A.
2011-01-01
The Meaningful Activity Participation Assessment (MAPA), a recently developed 28-item tool designed to measure the meaningfulness of activity, was tested in a sample of 154 older adults. The MAPA evidenced a sufficient level of internal consistency and test-retest reliability and correlated as theoretically predicted with the Life Satisfaction Index-Z, the Satisfaction with Life Scale, the Engagement in Meaningful Activities Survey, the Purpose in Life Test, the Center for Epidemiologic Studies Depression Inventory and the Rand SF-36v2 Health Survey subscales. Zero-order correlations consistently demonstrated meaningful relationships between the MAPA and scales of psychosocial well-being and health-related quality of life. Results from multiple regression analyses further substantiated these findings, as greater meaningful activity participation was associated with better psychological well-being and health-related quality of life. The MAPA appears to be a reliable and valid measure of meaningful activity, incorporating both subjective and objective indicators of activity engagement. PMID:20649161
Rodgers, Rachel F; Schaefer, Lauren M; Thompson, J Kevin; Girard, Marilou; Bertrand, Mélanie; Chabrol, Henri
2016-06-01
This study evaluated the psychometric properties of the Sociocultural Attitudes Towards Appearance Questionnaire-4 (SATAQ-4), a measure of internalization of societal appearance ideals, in French men and women. French college students completed a translation of the 22-item SATAQ-4 and measures of body image and eating concerns. Exploratory analyses among women (N=207) indicated a 20-item scale with the original five factors: Internalization: Thin/Low Body Fat, Internalization: Muscular/Athletic, Pressures: Family, Pressures: Media, Pressures: Peers. This structure was confirmed among a second sample of women (N=227). The SATAQ-4 scores revealed excellent reliability and convergent validity with body image and eating concern scores. A slightly modified factor structure emerged in men, with excellent reliability. Among men, the SATAQ-4 subscales were consistently associated with eating, and shape and weight concerns, although less consistently with general measures of body image. The French SATAQ-4 is a useful measure of internalization of appearance ideals. Copyright © 2016 Elsevier Ltd. All rights reserved.
Giezen, Hilde; Stevens, Martin; van den Akker-Scheek, Inge; Reininga, Inge H F
2017-01-01
The Copenhagen Hip And Groin Outcome Score (HAGOS) was developed to assess disease-specific consequences in young to middle-aged, physically active hip and/or groin patients. The study aimed to determine validity and reliability of the Dutch version of the HAGOS (HAGOS-NL) for middle-aged patients with hip complaints. To assess validity, 117 participants completed five questionnaires: HAGOS-NL, international Hip Outcome Tool (iHOT-12NL), Hip disability and Osteoarthritis Outcome Score (HOOS), RAND-36 Health Survey and Tegner activity scale. Structural validity was determined by conducting confirmatory factor analysis. Construct validity was analyzed by formulating predefined hypotheses regarding relationships between the HAGOS-NL and subscales of the iHOT-12NL, HOOS, RAND-36 and Tegner activity scale. The HAGOS-NL was filled out again by 67 patients to explore test-retest reliability. Reliability was assessed in terms of Cronbach's alpha, Intraclass Correlation Coefficient (ICC), Standard Error of Measurement (SEM) and Minimal Detectable Change (MDC). The Bland and Altman method was used to explore absolute agreement. Factor analysis confirmed that the HAGOS-NL consists of six subscales. All hypotheses were confirmed, indicating good construct validity. Internal consistency was good, with Cronbach's alpha values ranging from 0.89 to 0.98. Test-retest reliability was considered good, with ICC values of 0.80 and higher. The SEM ranged from 6.6 to 12.3, and MDC at individual level from 18.3 to 34.1 and at group level from 2.3 to 4.4. Bland and Altman analyses showed no bias. The HAGOS-NL is a reliable and valid instrument for measuring pain, physical functioning and quality of life in middle-aged patients with hip complaints.
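The relationship between SEM and MDC mentioned above can be sketched in Python using formulas that are commonly applied in reliability studies. The abstract does not state which formulas the authors used, so the functions below (`sem_from_icc`, `mdc_individual`, `mdc_group`) and the 1.96 multiplier are assumptions for illustration only.

```python
import math


def sem_from_icc(sd, icc):
    """Standard error of measurement from a score SD and an ICC."""
    return sd * math.sqrt(1 - icc)


def mdc_individual(sem, z=1.96):
    """Minimal detectable change for one patient (95% level by default)."""
    return z * math.sqrt(2) * sem


def mdc_group(mdc_ind, n):
    """Minimal detectable change for a group of n patients."""
    return mdc_ind / math.sqrt(n)


# Applying the assumed formula to the SEM range quoted in the abstract.
print(round(mdc_individual(6.6), 1))
print(round(mdc_individual(12.3), 1))
```

Under this assumed formula, the reported SEM range of 6.6 to 12.3 gives individual-level MDC values consistent with the 18.3 to 34.1 range stated in the abstract.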
Tsuno, Kanami; Yoshimasu, Kouichi; Hayashi, Takashi; Tatsuta, Nozomi; Ito, Yuki; Kamijima, Michihiro; Nakai, Kunihiko
2018-01-01
Nowadays, attention deficit hyperactivity (ADH) problems are commonly observed among school-age children. However, very few questionnaires are specific to ADH behaviors among preschool children. The aim of this study was to investigate the reliability and validity of the 25-item Behavioral Check List (BCL), which was developed from interviews with parents of children diagnosed with attention-deficit/hyperactivity disorder (ADHD) and which measures ADH behaviors at preschool age. We recruited 22 teachers from 10 nurseries/kindergartens in Miyagi Prefecture, Japan. A total of 138 preschool children were assessed using the BCL. To investigate inter-rater reliability, two teachers from each facility assessed seven to twenty children in their class, and intraclass correlation coefficients (ICCs) were calculated. The teachers additionally answered questions in the 1.5-5 Caregiver-Teacher Report Form (C-TRF) to investigate the criterion validity of the BCL. To investigate structural validity, exploratory factor analysis with promax rotation and confirmatory factor analysis were performed. The internal consistency reliability of the BCL was good (α = 0.92) and correlation analyses also confirmed its excellent criterion validity. Although exploratory factor analysis for the BCL yielded a five-factor model whose factor structure differed from the original one, the results were similar to the original six factors. The ICCs of the BCL ranged from 0.38 to 0.99, indicating that inter-rater reliability was not high enough in some facilities. However, it may be possible to improve it by giving raters adequate explanations when using the BCL. The present study showed acceptable levels of reliability and validity of the BCL among Japanese preschool children.
Martin, T P C; Moualed, D; Paul, A; Ronan, N; Tysome, J R; Donnelly, N P; Cook, R; Axon, P R
2015-04-01
The Cambridge Otology Quality of Life Questionnaire (COQOL) is a patient-recorded outcome measurement (PROM) designed to quantify the quality of life of patients attending otology clinics. Item-reduction model. A systematically designed long-form version (74 items) was tested with patient focus groups before being presented to adult otology patients (n = 137). Preliminary item analysis tested reliability, reducing the COQOL to 24 questions. This was then presented in conjunction with the SF-36 (V1) questionnaire to a total of 203 patients. Subsequently, these were re-presented at T + 3 months, and patients recorded whether they felt their condition had improved, deteriorated or remained the same. Non-responders were contacted by post. A correlation between COQOL scores and patient perception of change was examined to analyse content validity. Teaching hospital and university psychology department. Adult patients attending otology clinics with a wide range of otological conditions. Item reliability measured by item-total correlation, internal consistency and test-retest reliability. Validity measured by correlation between COQOL scores and patient-reported symptom change. Reliability: the COQOL showed excellent internal consistency at both initial presentation (α = 0.90) and 3 months later (α = 0.93). Validity: One-way analysis of variance showed a significant difference between groups reporting change and those reporting no change in quality of life (F(2, 80) = 5.866, P < 0.01). The COQOL is the first otology-specific PROM. Initial studies demonstrate excellent reliability and encouraging preliminary criterion validity: further studies will allow a deeper validation of the instrument.
Chen, Y-W; HajGhanbari, B; Road, J D; Coxson, H O; Camp, P G; Reid, W D
2018-06-08
Pain is prevalent in chronic obstructive pulmonary disease (COPD) and the Brief Pain Inventory (BPI) appears to be a feasible questionnaire to assess this symptom. However, the reliability and validity of the BPI have not been determined in individuals with COPD. This study aimed to determine the internal consistency, test-retest reliability and validity (construct, convergent, divergent and discriminant) of the BPI in individuals with COPD. In order to examine the test-retest reliability, individuals with COPD were recruited from pulmonary rehabilitation programmes to complete the BPI twice 1 week apart. In order to investigate validity, de-identified data was retrieved from two previous studies, including forced expiratory volume in 1-s, age, sex and data from four questionnaires: the BPI, short-form McGill Pain Questionnaire (SF-MPQ), 36-Item Short Form Survey (SF-36) and Community Health Activities Model Program for Seniors (CHAMPS) questionnaire. In total, 123 participants were included in the analyses (eligible data were retrieved from 86 participants and additional 37 participants were recruited). The BPI demonstrated excellent internal consistency and test-retest reliability. It also showed convergent validity with the SF-MPQ and divergent validity with the SF-36. The factor analysis yielded two factors of the BPI, which demonstrated that the two domains of the BPI measure the intended constructs. The BPI can also discriminate pain levels among COPD patients with varied levels of quality of life (SF-36) and physical activity (CHAMPS). The BPI is a reliable and valid pain questionnaire that can be used to evaluate pain in COPD. This study formally established the reliability and validity of the BPI in individuals with COPD, which have not been determined in this patient group. The results of this study provide strong evidence that assessment results from this pain questionnaire are reliable and valid. © 2018 European Pain Federation - EFIC®.
Bryant, Elizabeth; Murtagh, Shemane; Finucane, Laura; McCrum, Carol; Mercer, Christopher; Smith, Toby; Canby, Guy; Rowe, David A; Moore, Ann P
2018-05-11
In response to the need for a freely available, stand-alone, validated outcome measure for use within musculoskeletal (MSK) physiotherapy practice, sensitive enough to measure clinical effectiveness, we developed an MSK patient-reported outcome measure. This study examined the validity and reliability of the newly developed Brighton musculoskeletal Patient-Reported Outcome Measure (BmPROM) within physiotherapy outpatient settings. Two hundred twenty-four patients attending physiotherapy outpatient departments in South East England with an MSK condition participated in this study. The BmPROM was assessed for user friendliness (rated feedback, N = 224), reliability (internal consistency and test-retest reliability, n = 42), validity (internal and external construct validity, N = 224), and responsiveness (internal, n = 25). Exploratory factor analysis indicated that a two-factor model provides a good fit to the data. Factors were representative of "Functionality" and "Wellbeing". Correlations observed between the BmPROM and SF-36 domains provided evidence of convergent validity. Reliability results indicated that both subscales were internally consistent, with alphas above the acceptable limits for both "Functionality" (α = .85, 95% CI [.81, .88]) and "Wellbeing" (α = .80, 95% CI [.75, .84]). Test-retest analyses (n = 42) demonstrated a high degree of reliability for both "Functionality" (ICC = .84; 95% CI [.72, .91]) and "Wellbeing" scores (ICC = .84; 95% CI [.72, .91]). Further examination of test-retest reliability through the Bland-Altman analysis demonstrated that the difference between test scores for "Functionality" and "Wellbeing" did not vary as a function of absolute test score. Large treatment effect sizes were found for both subscales (Functionality d = 1.10; Wellbeing d = 1.03). The BmPROM is a reliable and valid outcome measure for use in evaluating physiotherapy treatment of MSK conditions. Copyright © 2018 John Wiley & Sons, Ltd.
Place prioritization for biodiversity content.
Sarkar, Sahotra; Aggarwal, Anshu; Garson, Justin; Margules, Chris R; Zeidler, Juliane
2002-07-01
The prioritization of places on the basis of biodiversity content is part of any systematic biodiversity conservation planning process. The place prioritization procedure implemented in the ResNet software package is described. This procedure is primarily based on the principles of rarity and complementarity. Application of the procedure is demonstrated with two analyses, one data set consisting of the distributions of termite genera in Namibia, and the other consisting of the distributions of bird species in the Islas Malvinas/Falkland Islands. The attributes that data sets should have for the effective and reliable application of such procedures are discussed. The procedure used here is compared to some others that are also currently in use.
Reliability of intracerebral hemorrhage classification systems: A systematic review.
Rannikmäe, Kristiina; Woodfield, Rebecca; Anderson, Craig S; Charidimou, Andreas; Chiewvit, Pipat; Greenberg, Steven M; Jeng, Jiann-Shing; Meretoja, Atte; Palm, Frederic; Putaala, Jukka; Rinkel, Gabriel Je; Rosand, Jonathan; Rost, Natalia S; Strbian, Daniel; Tatlisumak, Turgut; Tsai, Chung-Fen; Wermer, Marieke Jh; Werring, David; Yeh, Shin-Joe; Al-Shahi Salman, Rustam; Sudlow, Cathie Lm
2016-08-01
Accurately distinguishing non-traumatic intracerebral hemorrhage (ICH) subtypes is important since they may have different risk factors, causal pathways, management, and prognosis. We systematically assessed the inter- and intra-rater reliability of ICH classification systems. We sought all available reliability assessments of anatomical and mechanistic ICH classification systems from electronic databases and personal contacts until October 2014. We assessed included studies' characteristics, reporting quality and potential for bias; summarized reliability with kappa value forest plots; and performed meta-analyses of the proportion of cases classified into each subtype. We included 8 of 2152 studies identified. Inter- and intra-rater reliabilities were substantial to perfect for anatomical and mechanistic systems (inter-rater kappa values: anatomical 0.78-0.97 [six studies, 518 cases], mechanistic 0.89-0.93 [three studies, 510 cases]; intra-rater kappas: anatomical 0.80-1 [three studies, 137 cases], mechanistic 0.92-0.93 [two studies, 368 cases]). Reporting quality varied but no study fulfilled all criteria and none was free from potential bias. All reliability studies were performed with experienced raters in specialist centers. Proportions of ICH subtypes were largely consistent with previous reports suggesting that included studies are appropriately representative. Reliability of existing classification systems appears excellent but is unknown outside specialist centers with experienced raters. Future reliability comparisons should be facilitated by studies following recently published reporting guidelines. © 2016 World Stroke Organization.
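The kappa statistic used above to summarize rater agreement can be illustrated with a short Python sketch of Cohen's kappa for two raters. The function name `cohen_kappa` and the example labels are hypothetical and are not drawn from the reviewed studies, which may also have used other kappa variants (for example, for more than two raters).

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters labelling the same cases."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(rater_a)

    # Observed proportion of cases on which the raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n

    # Agreement expected by chance from each rater's label frequencies.
    count_a = Counter(rater_a)
    count_b = Counter(rater_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    if expected == 1:
        raise ValueError("kappa is undefined when chance agreement is 1")

    return (observed - expected) / (1 - expected)


# Hypothetical anatomical labels assigned by two raters to four cases.
first = ["lobar", "lobar", "deep", "deep"]
second = ["lobar", "lobar", "deep", "lobar"]
print(cohen_kappa(first, second))
```

Kappa corrects raw agreement for the agreement expected by chance, which is why it is usually preferred to simple percentage agreement when subtypes are unevenly distributed.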
Comprehensive Design Reliability Activities for Aerospace Propulsion Systems
NASA Technical Reports Server (NTRS)
Christenson, R. L.; Whitley, M. R.; Knight, K. C.
2000-01-01
This technical publication describes the methodology, model, software tool, input data, and analysis result that support aerospace design reliability studies. The focus of these activities is on propulsion systems mechanical design reliability. The goal of these activities is to support design from a reliability perspective. Paralleling performance analyses in schedule and method, this requires the proper use of metrics in a validated reliability model useful for design, sensitivity, and trade studies. Design reliability analysis in this view is one of several critical design functions. A design reliability method is detailed and two example analyses are provided-one qualitative and the other quantitative. The use of aerospace and commercial data sources for quantification is discussed and sources listed. A tool that was developed to support both types of analyses is presented. Finally, special topics discussed include the development of design criteria, issues of reliability quantification, quality control, and reliability verification.
Psychometric properties of the Brunel Mood Scale in Chinese adolescents and adults.
Zhang, Chun-Qing; Si, Gangyan; Chung, Pak-Kwong; Du, Mengmeng; Terry, Peter C
2014-01-01
Building on the work of Terry and colleagues (Terry, P. C., Lane, A. M., Lane, H. J., & Keohane, L. (1999). Development and validation of a mood measure for adolescents. Journal of Sports Sciences, 17, 861-872; Terry, P. C., Lane, A. M., & Fogarty, G. J. (2003). Construct validity of the Profile of Mood States-Adolescents for use with adults. Psychology of Sport & Exercise, 4, 125-139.), the present study examined the validity and internal consistency reliability of the Chinese version of the Brunel Mood Scale (BRUMS-C) among 2,548 participants, comprising adolescent athletes (n = 520), adult athletes (n = 434), adolescent students (n = 673), and adult students (n = 921). Both adolescent and adult athletes completed the BRUMS-C before, during, or after regular training and both adolescent and adult students completed the BRUMS-C in a classroom setting. Confirmatory factor analyses (CFAs) provided support for the factorial validity of a 23-item six-factor model, with one item removed from the hypothesised measurement model. Internal consistency reliabilities were satisfactory for all subscales across each of the four samples. Criterion validity was supported with strong relationships between the BRUMS-C, abbreviated POMS, and Chinese Affect Scale consistent with theoretical predictions. Multi-sample CFAs showed the BRUMS-C to be invariant at the configural, metric, strong, and structural levels for all samples. Furthermore, latent mean difference analyses showed that athletes reported significantly higher levels of fatigue than students while maintaining almost the same levels of vigour, and adolescent students reported significantly higher levels of depressed mood than the other three samples.
Chabrera, Carolina; Areal, Joan; Font, Albert; Caro, Mónica; Bonet, Marta; Zabalegui, Adelaida
2015-01-01
The aim of this study was to develop a Spanish version of the Satisfaction With Decision scale (SWDs) and analyse its psychometric properties of validity and reliability. An observational, descriptive study and validation of a tool to measure satisfaction with the decision. Urology, Radiation oncology, and Medical oncology Departments of the Hospital Universitari Germans Trias i Pujol, Institut Català d'Oncologia and the Institut Oncològic del Vallès - Hospital General de Catalunya. A total of 170 participants diagnosed with prostate cancer who could read and write in Spanish and gave their informed consent. A translation, back-translation and cross-cultural adaptation to Spanish was performed on the SWDs. The content validity, criterion validity, construct validity and reliability (internal consistency and stability) of the Spanish version were evaluated. The SWDs contains 6 items with 5-point Likert scales. A Spanish version (ESD) was obtained that was linguistically and conceptually equivalent to the original version. Criterion validity, assessed by correlating the ESD with "satisfaction with the decision" measured on a linear analogue scale, was significant (r=0.63, P<.01) for all items. Factor analysis showed a single dimension explaining 82.08% of the variance. The ESD showed excellent results in terms of internal consistency (Cronbach alpha=0.95) and good test-retest reliability, with an intraclass correlation coefficient of 0.711. The ESD is a validated Spanish scale to measure satisfaction with decisions taken in health, and demonstrates adequate validity and reliability. Copyright © 2015 Elsevier España, S.L.U. All rights reserved.
The live donor assessment tool: a psychosocial assessment tool for live organ donors.
Iacoviello, Brian M; Shenoy, Akhil; Braoude, Jenna; Jennings, Tiane; Vaidya, Swapna; Brouwer, Julianna; Haydel, Brandy; Arroyo, Hansel; Thakur, Devendra; Leinwand, Joseph; Rudow, Dianne LaPointe
2015-01-01
Psychosocial evaluation is an important part of the live organ donor evaluation process, yet it is not standardized across institutions, and although tools exist for the psychosocial evaluation of organ recipients, none exist to assess donors. We set out to develop a semistructured psychosocial evaluation tool (the Live Donor Assessment Tool, LDAT) to assess potential live organ donors and to conduct preliminary analyses of the tool's reliability and validity. Review of the literature on the psychosocial variables associated with treatment adherence, quality of life, live organ donation outcome, and resilience, as well as review of the procedures for psychosocial evaluation at our center and other centers around the country, identified 9 domains to address; these domains were distilled into several items each, in collaboration with colleagues at transplant centers across the country, for a total of 29 items. Four raters were trained to use the LDAT, and they retrospectively scored 99 psychosocial evaluations conducted on live organ donor candidates. Reliability of the LDAT was assessed by calculating the internal consistency of the items in the scale and interrater reliability between raters; validity was estimated by comparing LDAT scores between those with a "positive" evaluation outcome and "negative" outcome. The LDAT was found to have good internal consistency, inter-rater reliability, and showed signs of validity: LDAT scores differentiated the positive vs. negative outcome groups. The LDAT demonstrated good reliability and validity, but future research on the LDAT and the ability to implement the LDAT prospectively is warranted. Copyright © 2015 The Academy of Psychosomatic Medicine. Published by Elsevier Inc. All rights reserved.
López-Ortega, Mariana; Torres-Castro, Sara; Rosas-Carrasco, Oscar
2016-12-09
The Satisfaction with Life Scale (SWLS) has been widely used and has proven to be a valid and reliable instrument for assessing satisfaction with life in diverse population groups. However, research on satisfaction with life and validation of different measuring instruments in Mexican adults is still lacking. The objective was to evaluate the psychometric properties of the SWLS in a representative sample of Mexican adults. This is a methodological study to evaluate a satisfaction with life scale in a sample of 13,220 Mexican adults 50 years of age or older from the 2012 Mexican Health and Aging Study. The scale's reliability (internal consistency) was analysed using Cronbach's alpha and inter-item correlations. An exploratory factor analysis was also performed. Known-groups validity was evaluated by comparing good-health and bad-health participants. Comorbidity, perceived financial situation, self-reported general health, depression symptoms, and social support were included to evaluate the validity between these measures and the total score of the scale using Spearman's correlations. The analysis of the scale's reliability showed good internal consistency (α = 0.74). The exploratory factor analysis confirmed the existence of a single-factor structure that explained 54% of the variance. SWLS was related to depression, perceived health, financial situation, and social support, and these relations were all statistically significant (P < .01). There was a significant difference in life satisfaction between the good- and bad-health groups. Results show good internal consistency and construct validity of the SWLS. These results are comparable with results from previous studies. Meeting the study's objective to validate the scale, the results show that the Spanish version of the SWLS is a reliable and valid measure of satisfaction with life in the Mexican context.
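For readers unfamiliar with the internal consistency statistic used above, the following Python sketch shows one standard way to compute Cronbach's alpha from item-level scores. The function name `cronbach_alpha` and the toy scores are illustrative assumptions, not data from the study; sample variances (divisor n - 1) are used throughout.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha.

    scores: list of respondents, each a list of numeric item scores.
    """
    n = len(scores)
    k = len(scores[0])
    if n < 2 or k < 2:
        raise ValueError("need at least two respondents and two items")

    # Sample variance of each item across respondents.
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]

    # Sample variance of the respondents' total scores.
    total_var = variance([sum(row) for row in scores])
    if total_var == 0:
        raise ValueError("total scores have zero variance")

    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Hypothetical toy data: 3 respondents, 2 items that move together exactly.
toy = [[1, 2], [2, 3], [3, 4]]
print(cronbach_alpha(toy))
```

Alpha rises as items covary more strongly relative to their individual variances, so perfectly parallel items, as in the toy data, give the maximum value.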
Gunaydin, Gurkan; Citaker, Seyit; Meray, Jale; Cobanoglu, Gamze; Gunaydin, Ozge Ece; Hazar Kanik, Zeynep
2016-11-01
Validation of a self-report questionnaire. The purpose of this study was to investigate the adaptation, validity, and reliability of the Turkish version of the Bournemouth Questionnaire. Low back pain is one of the most frequent disorders leading to activity limitation, and it affects most people at some point in their lives. Accurate use of assessment questionnaires is essential for evaluating a patient's functional abilities and deciding on a successful therapy procedure. One hundred ten patients with chronic low back pain were included in the present study. To assess reliability, test-retest and internal consistency analyses were applied. The results of the test-retest analysis were assessed using the Intraclass Correlation Coefficient method (95% confidence interval). For internal consistency, the Cronbach alpha value was calculated. Validity of the questionnaire was assessed in terms of construct validity. For construct validity, factor analysis and convergent validity were tested. For convergent validity, total scores of the Bournemouth Questionnaire were compared with the total scores of the Quebec Back Pain Disability Scale and the Roland Morris Disability Questionnaire using Pearson correlation coefficient analysis. The Cronbach alpha value was 0.914, showing that this questionnaire has high internal consistency. The results of the test-retest analysis ranged from 0.851 to 0.927, showing that test-retest results are highly correlated. Factor analysis indicated that this questionnaire had one factor. The Pearson correlation coefficient of the Bournemouth Questionnaire was 0.703 with the Roland Morris Disability Questionnaire and 0.659 with the Quebec Back Pain Disability Scale. These results showed that the Bournemouth Questionnaire correlates very well with the Roland Morris Disability Questionnaire and the Quebec Back Pain Disability Scale. The Turkish version of the Bournemouth Questionnaire is valid and reliable. 3.
Zhang, Yin-Ping; Zhao, Xin-Shuang; Zhang, Bei; Zhang, Lu-Lu; Ni, Chun-Ping; Hao, Nan; Shi, Chang-Bei; Porr, Caroline
2015-07-01
The comprehensive needs assessment tool for cancer caregivers (CNAT-C) is a systematic and comprehensive needs assessment tool for the family caregivers. The purpose of this project was twofold: (1) to adapt the CNAT-C to Mainland China's cultural context and (2) to evaluate the psychometric properties of the newly adapted Chinese CNAT-C. Cross-cultural adaptation of the original CNAT-C was performed according to published guidelines. A pilot study was conducted in Mainland China with 30 Chinese family cancer caregivers. A subsequent validation study was conducted with 205 Chinese cancer caregivers from Mainland China. Construct validity was determined through exploratory and confirmatory factor analyses. Reliability was determined using internal consistency and test-retest reliability. The split-half coefficient for the overall Chinese CNAT-C scale was 0.77. Principal component analysis resulted in an eight-factor structure explaining 68.11 % of the total variance. The comparative fit index (CFI) was 0.91 from the modified model confirmatory factor analysis. The Chi-square divided by degrees of freedom was 1.98, and the root mean squared error of approximation (RMSEA) was 0.079. In relation to the known-group validation, significant differences were found in the Chinese CNAT-C scale according to various caregiver characteristics. Internal consistency was high for the Chinese CNAT-C reaching a Cronbach α value of 0.94. Test-retest reliability was 0.85. The newly adapted Chinese CNAT-C scale possesses adequate validity, test-retest reliability, and internal consistency and therefore may be used to ascertain holistic health and support needs of cancer patients' family caregivers in Mainland China.
Consensuses and discrepancies of basin-scale ocean heat content changes in different ocean analyses
NASA Astrophysics Data System (ADS)
Wang, Gongjie; Cheng, Lijing; Abraham, John; Li, Chongyin
2018-04-01
Inconsistent global/basin ocean heat content (OHC) changes were found in different ocean subsurface temperature analyses, especially in recent studies related to the slowdown in global surface temperature rise. This finding challenges the reliability of the ocean subsurface temperature analyses and motivates a more comprehensive inter-comparison between the analyses. Here we compare the OHC changes in three ocean analyses (Ishii, EN4 and IAP) to investigate the uncertainty in OHC in four major ocean basins from decadal to multi-decadal scales. First, all products show an increase of OHC since 1970 in each ocean basin, revealing a robust warming, although the warming rates are not identical. The geographical patterns, the key modes and the vertical structure of OHC changes are consistent among the three datasets, implying that the main OHC variabilities can be robustly represented. However, large discrepancies are found in the percentage of basinal ocean heating related to the global ocean, with the largest differences in the Pacific and Southern Ocean. Meanwhile, we find a large discrepancy of ocean heat storage in different layers, especially within 300-700 m in the Pacific and Southern Oceans. Furthermore, the near-surface analyses of Ishii and IAP are consistent with sea surface temperature (SST) products, but EN4 is found to underestimate the long-term trend. Compared with ocean heat storage derived from the atmospheric budget equation, all products show consistent seasonal cycles of OHC in the upper 1500 m, especially during 2008 to 2012. Overall, our analyses further the understanding of the observed OHC variations, and we recommend a careful quantification of errors in the ocean analyses.
Burke, Kylie; McCarthy, Maria; Lowe, Cherie; Sanders, Matthew R; Lloyd, Erin; Bowden, Madeleine; Williams, Lauren
2017-03-01
Childhood cancer is associated with child adjustment difficulties, including eating and sleep disturbance, and emotional and other behavioral difficulties. However, there is a lack of validated instruments to measure the specific child adjustment issues associated with pediatric cancer treatments. The aim of this study was to develop and evaluate the reliability and validity of a parent-reported child adjustment scale. One hundred thirty-two parents from two pediatric oncology centers who had children (aged 2-10 years) diagnosed with cancer completed the newly developed measure and additional measures of child behavior, sleep, diet, and quality of life. Children were more than 4 weeks postdiagnosis and less than 12 months postactive treatment. Factor structure, internal consistency, and construct (convergent) validity analyses were conducted. Principal component analysis revealed five distinct and theoretically coherent factors: Sleep Difficulties, Impact of Child's Illness, Eating Difficulties, Hospital-Related Behavior Difficulties, and General Behavior Difficulties. The final 25-item measure, the Children's Oncology Child Adjustment Scale (ChOCs), demonstrated good internal consistency (α = 0.79-0.91). Validity of the ChOCs was demonstrated by significant correlations between the subscales and measures of corresponding constructs. The ChOCs provides a new measure of child adjustment difficulties designed specifically for pediatric oncology. Preliminary analyses indicate strong theoretical and psychometric properties. Future studies are required to further examine reliability and validity of the scale, including test-retest reliability, discriminant validity, as well as change sensitivity and generalizability across different oncology samples and ages of children. The ChOCs shows promise as a measure of child adjustment relevant for oncology clinical settings and research purposes. © 2016 Wiley Periodicals, Inc.
SEQUenCE: a service user-centred quality of care instrument for mental health services.
Hester, Lorraine; O'Doherty, Lorna Jane; Schnittger, Rebecca; Skelly, Niamh; O'Donnell, Muireann; Butterly, Lisa; Browne, Robert; Frorath, Charlotte; Morgan, Craig; McLoughlin, Declan M; Fearon, Paul
2015-08-01
To develop a quality of care instrument that is grounded in the service user perspective and validate it in a mental health service. The instrument (SEQUenCE (SErvice user QUality of CarE)) was developed through analysis of focus group data and clinical practice guidelines, and refined through field-testing and psychometric analyses. All participants were attending an independent mental health service in Ireland. Participants had a diagnosis of bipolar affective disorder (BPAD) or a psychotic disorder. Twenty-nine service users participated in six focus group interviews. Seventy-one service users participated in field-testing: 10 judged the face validity of an initial 61-item instrument; 28 completed a revised 52-item instrument from which 12 items were removed following test-retest and convergent validity analyses; 33 completed the resulting 40-item instrument. Test-retest reliability, internal consistency and convergent validity of the instrument. The final instrument showed acceptable test-retest reliability at 5-7 days (r = 0.65; P < 0.001), good convergent validity with the Verona Service Satisfaction Scale (r = 0.84, P < 0.001) and good internal consistency (Cronbach's alpha = 0.87). SEQUenCE is a valid, reliable scale that is grounded in the service user perspective and suitable for routine use. It may serve as a useful tool in individual care planning, service evaluation and research. The instrument was developed and validated with service users with a diagnosis of either BPAD or a psychotic disorder; it does not yet have established external validity for other diagnostic groups. © The Author 2015. Published by Oxford University Press in association with the International Society for Quality in Health Care; all rights reserved.
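Test-retest reliability of the kind reported here (r at 5-7 days) is typically a Pearson correlation between total scores from the two administrations. A minimal sketch, assuming paired total scores per respondent; the function name is illustrative and the abstract does not describe the authors' actual computation:

```python
from math import sqrt

def pearson_r(first, second):
    """Pearson correlation between paired scores from two administrations."""
    n = len(first)
    mean_x = sum(first) / n
    mean_y = sum(second) / n
    # Sums of squares and cross-products around the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(first, second))
    sxx = sum((x - mean_x) ** 2 for x in first)
    syy = sum((y - mean_y) ** 2 for y in second)
    return sxy / sqrt(sxx * syy)
```

The function is undefined when either administration has zero variance, so a real analysis would check for that case first.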
Cavelti, Marialuisa; Contin, Giuliana; Beck, Eva-Marina; Kvrgic, Sara; Kossowsky, Joe; Stieglitz, Rolf-Dieter; Vauth, Roland
2012-01-01
Because the mere definition of insight from the therapist's viewpoint may not be sufficient to identify treatment targets for adherence enhancement, we need assessment strategies which are more sensitive to the patient's perspective. Illness perception (IP), defined as the beliefs a patient holds about his/her health problems, has been shown to affect coping in the context of a physical or mental illness, e.g. compliance behaviour. To assess IP in people diagnosed with schizophrenia, the Illness Perception Questionnaire for Schizophrenia (IPQS) was developed. The aim of the present study was to analyse the psychometric properties of the German version of the IPQS. The study sample consisted of 128 German-speaking outpatients suffering from chronic schizophrenia or schizoaffective disorder. To achieve comparability with the validation of the English scale version, the same constructs were assessed: psychopathology, depression, and beliefs about medication. Furthermore, insight into one's illness was assessed. Internal consistency, test-retest reliability and construct validity including convergent and discriminant validity were analysed. Five of eight IPQS subscales were found to be internally reliable and all subscales demonstrated high stability over time. Correlations with validity measures indicated that the subscales assess dimensions of a construct, which is distinct from psychopathology, depression, beliefs about medication and insight, except for the Identity subscale which substantially overlapped with measures of insight. The German version of the IPQS is an essentially reliable and valid measure of IP for German-speaking people with a schizophrenia spectrum disorder. This may encourage its usage in further studies investigating the impact of subjective beliefs about mental health problems on outcome and recovery in schizophrenia. Copyright © 2012 S. Karger AG, Basel.
Kim, Jae-Min; Hong, Jin-Pyo; Kim, Sang-Dae; Kang, Hee-Ju; Lee, Yong-Sung
2016-01-01
Objective Cognitive symptoms are an important component of depression and the Perceived Deficits Questionnaire-Depression is one of only a few instruments available for the subjective assessment of cognitive dysfunction in depression. Thus, the present study aimed to validate a Korean version of the PDQ-D (K-PDQ-D) in patients with major depressive disorder (MDD). Methods This study included 128 MDD patients who were assessed at study entry; 86 of these patients then completed 12 weeks of antidepressant monotherapy. All subjects were assessed with the K-PDQ-D, the Montgomery-Asberg Depression Rating Scale (MADRS), the Sheehan Disability Scale (SDS), the EuroQol-5 dimensions questionnaire (EQ-5D), and the number of sick leave days taken in the previous week. The internal consistency, Guttman’s split-half and test-retest reliabilities, factorial analyses, and concurrent and predictive validities of the K-PDQ-D were investigated. Results The K-PDQ-D exhibited excellent internal consistency and reliabilities, and was composed of four factors with high coefficients of determination. The concurrent validity analyses revealed that the K-PDQ-D scores were significantly correlated with the MADRS, SDS, and EQ-5D scores and the number of sick leave days taken. The K-PDQ-D scores at study entry significantly predicted changes in sick leave days and EQ-5D score from study entry to the 12-week endpoint. Conclusion The newly developed K-PDQ-D is a reliable and valid instrument for the evaluation of subjective cognitive symptoms in MDD patients. The K-PDQ-D may assist in the gathering of unique information regarding subjective cognitive complaints, which is important for the comprehensive evaluation of patients with MDD. PMID:26792037
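The Guttman split-half coefficient mentioned in this record can be sketched as follows. The scale is divided into two halves, and the coefficient is 2 * (1 - (var_A + var_B) / var_total), where var_A and var_B are the variances of the half scores and var_total is the variance of their sum. How the items were split in the study is not stated, so the two input lists here are assumed to be precomputed half scores.

```python
from statistics import variance

def guttman_split_half(half_a, half_b):
    """Guttman split-half reliability from per-respondent half-scale sums."""
    totals = [a + b for a, b in zip(half_a, half_b)]
    return 2 * (1 - (variance(half_a) + variance(half_b)) / variance(totals))
```

When the two halves are perfectly parallel the coefficient equals 1; it falls as the halves diverge.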
Development of the PROMIS nicotine dependence item banks.
Shadel, William G; Edelen, Maria Orlando; Tucker, Joan S; Stucky, Brian D; Hansen, Mark; Cai, Li
2014-09-01
Nicotine dependence is a core construct important for understanding cigarette smoking and smoking cessation behavior. This article describes analyses conducted to develop and evaluate item banks for assessing nicotine dependence among daily and nondaily smokers. Using data from a sample of daily (N = 4,201) and nondaily (N = 1,183) smokers, we conducted a series of item factor analyses, item response theory analyses, and differential item functioning analyses (according to gender, age, and race/ethnicity) to arrive at a unidimensional set of nicotine dependence items for daily and nondaily smokers. We also evaluated performance of short forms (SFs) and computer adaptive tests (CATs) to efficiently assess dependence. A total of 32 items were included in the Nicotine Dependence item banks; 22 items are common across daily and nondaily smokers, 5 are unique to daily smokers, and 5 are unique to nondaily smokers. For both daily and nondaily smokers, the Nicotine Dependence item banks are strongly unidimensional, highly reliable (reliability = 0.97 and 0.97, respectively), and perform similarly across gender, age, and race/ethnicity groups. SFs common to daily and nondaily smokers consist of 8 and 4 items (reliability = 0.91 and 0.81, respectively). Results from simulated CATs showed that dependence can be assessed with very good precision for most respondents using fewer than 6 items adaptively selected from the item banks. Nicotine dependence on cigarettes can be assessed on the basis of these item banks via one of the SFs, by using CATs, or through a tailored set of items selected for a specific research purpose. © The Author 2014. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
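The adaptive item selection behind a CAT can be illustrated with a deliberately simplified sketch. It assumes a two-parameter logistic (2PL) model with dichotomous items and picks the unadministered item with the greatest Fisher information at the current trait estimate. The abstract does not specify the model or selection rule used; the item bank, its parameters, and all function names below are hypothetical.

```python
from math import exp

def p_endorse(theta, a, b):
    """2PL probability of endorsing an item with discrimination a and location b."""
    return 1.0 / (1.0 + exp(-a * (theta - b)))

def item_information(theta, a, b):
    """Fisher information of a 2PL item at trait level theta."""
    p = p_endorse(theta, a, b)
    return a * a * p * (1.0 - p)

def next_item(theta, bank, administered):
    """Pick the not-yet-administered item that is most informative at theta."""
    candidates = [name for name in bank if name not in administered]
    return max(candidates, key=lambda name: item_information(theta, *bank[name]))

# Hypothetical three-item bank: name -> (discrimination, location).
bank = {"item1": (1.2, -1.0), "item2": (1.2, 0.0), "item3": (1.2, 1.5)}
first_pick = next_item(0.0, bank, set())
second_pick = next_item(0.0, bank, {"item2"})
```

A full CAT would re-estimate theta after each response and stop once the standard error falls below a chosen threshold; only the selection step is shown here.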
Development of the PROMIS negative psychosocial expectancies of smoking item banks.
Stucky, Brian D; Edelen, Maria Orlando; Tucker, Joan S; Shadel, William G; Cerully, Jennifer; Kuhfeld, Megan; Hansen, Mark; Cai, Li
2014-09-01
Negative psychosocial expectancies of smoking include aspects of social disapproval and disappointment in oneself. This paper describes analyses conducted to develop and evaluate item banks for assessing psychosocial expectancies among daily and nondaily smokers. Using data from a sample of daily (N = 4,201) and nondaily (N = 1,183) smokers, we conducted a series of item factor analyses, item response theory analyses, and differential item functioning analyses (according to gender, age, and race/ethnicity) to arrive at a unidimensional set of psychosocial expectancies items for daily and nondaily smokers. We also evaluated performance of short forms (SFs) and computer adaptive tests (CATs) to efficiently assess psychosocial expectancies. A total of 21 items were included in the Psychosocial Expectancies item banks: 14 items are common across daily and nondaily smokers, 6 are unique to daily, and 1 is unique to nondaily. For both daily and nondaily smokers, the Psychosocial Expectancies item banks are strongly unidimensional, highly reliable (reliability = 0.95 and 0.93, respectively), and perform similarly across gender, age, and race/ethnicity groups. A SF common to daily and nondaily smokers consists of 6 items (reliability = 0.85). Results from simulated CATs showed that, on average, fewer than 8 items are needed to assess psychosocial expectancies with adequate precision when using the item banks. Psychosocial expectancies of smoking can be assessed on the basis of these item banks via the SF, by using CAT, or through a tailored set of items selected for a specific research purpose. © The Author 2014. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Kersten, Paula; Vandal, Alain C; Elder, Hinemoa; McPherson, Kathryn M
2018-04-21
This observational study examines the internal construct validity, internal consistency and cross-informant reliability of the Strengths and Difficulties Questionnaire (SDQ) in a New Zealand preschool population across four ethnicity strata (New Zealand European, Māori, Pasifika, Asian). Rasch analysis was employed to examine internal validity on a subsample of 1000 children. Internal consistency (n=29 075) and cross-informant reliability (n=17 006) were examined using correlations, intraclass correlation coefficients and Cronbach's alpha on the sample available for such analyses. Data were used from a national SDQ database provided by the funder, pertaining to New Zealand domiciled children aged 4 and 5 and scored by their parents and teachers. The five subscales do not fit the Rasch model (as indicated by the overall fit statistics), contain items that are biased (differential item functioning (DIF)) by key variables, suffer from a floor and ceiling effect and have unacceptable internal consistency. After dealing with DIF, the Total Difficulty scale does fit the Rasch model and has good internal consistency. Parent/teacher inter-rater reliability was unacceptably low for all subscales. The five SDQ subscales are not valid and not suitable for use in their own right in New Zealand. We have provided a conversion table for the Total Difficulty scale, which takes account of bias by ethnic group. Clinicians should use this conversion table in order to reconcile DIF by culture in final scores. It is advisable to use both parents and teachers' feedback when considering children's needs for referral of further assessment. Future work should examine whether validity is impacted by different language versions used in the same country. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Rusli, B N; Amrina, K; Trived, S; Loh, K P; Shashi, M
2017-10-01
The 21-item English version of the Depression Anxiety Stress Scale (DASS-21) has been proposed as a method for assessing self-perceived depression, anxiety and stress over the past week in various clinical and nonclinical populations. Several Malay versions of the DASS-21 have been validated in various populations with varying success. One particular Malay version has been validated in various occupational groups (such as nurses and automotive workers) but not among male clinic outpatient attendees in Malaysia. To validate the Malay version of the DASS-21 (Malay-DASS-21) among male outpatient clinic attendees in Johor. A validation study with a random sample of 402 male respondents attending the outpatient clinic of a major public outpatient clinic in Johor Bahru and Segamat was carried out from January to March 2016. Construct validity of the Malay-DASS-21 was examined using Exploratory Factor Analysis (KMO = 0.947; Bartlett's test of sphericity is significant, p<0.001) through Principal Component Analysis and orthogonal (varimax) rotation with Kaiser Normalization to confirm the psychometric properties of the Malay-DASS-21 and the internal consistency reliability using Cronbach's alpha. Construct validity of the Malay-DASS-21 based on eigenvalues and factor loadings to confirm the three factor structure (depression, anxiety, and stress) was acceptable. The internal consistency reliability of the factor construct was very impressive with Cronbach's alpha values in the range of 0.837 to 0.863. The present study showed that the Malay-DASS-21 has acceptable psychometric construct and high internal consistency reliability to measure self-perceived depression, anxiety and stress over the past week in male outpatient clinic attendees in Johor. Further studies are necessary to revalidate the Malay-DASS-21 across different populations and cultures, and using confirmatory factor analyses.
Zumpano, Camila Eugênia; Mendonça, Tânia Maria da Silva; Silva, Carlos Henrique Martins da; Correia, Helena; Arnold, Benjamin; Pinto, Rogério de Melo Costa
2017-01-23
This study aimed to perform the cross-cultural adaptation and validation of the Patient-Reported Outcomes Measurement Information System (PROMIS) Global Health scale in the Portuguese language. The ten Global Health items were cross-culturally adapted by the method proposed in the Functional Assessment of Chronic Illness Therapy (FACIT). The instrument's final version in Portuguese was self-administered by 1,010 participants in Brazil. The scale's precision was verified by floor and ceiling effects analysis, reliability of internal consistency, and test-retest reliability. Exploratory and confirmatory factor analyses were used to assess the construct's validity and instrument's dimensionality. Calibration of the items used the Graded Response Model proposed by Samejima. Four global items required adjustments after the pretest. Analysis of the psychometric properties showed that the Global Health scale has good reliability, with Cronbach's alpha of 0.83 and intra-class correlation of 0.89. Exploratory and confirmatory factor analyses showed good fit in the previously established two-dimensional model. The Global Physical Health and Global Mental Health scale showed good latent trait coverage according to the Graded Response Model. The PROMIS Global Health items showed equivalence in Portuguese compared to the original version and satisfactory psychometric properties for application in clinical practice and research in the Brazilian population.
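The item calibration in this record relies on Samejima's model for ordered response categories, usually called the graded response model. A minimal sketch of its category probabilities follows: the cumulative probability of responding in category k or higher is a logistic function of the trait level, and each category probability is the difference between adjacent cumulative probabilities. The parameter values are invented for illustration and are not the calibrated PROMIS values.

```python
from math import exp

def grm_category_probs(theta, a, thresholds):
    """Category probabilities for one item under the graded response model.

    theta: latent trait level; a: item discrimination;
    thresholds: ordered locations b_1 < ... < b_{m-1} for m response categories.
    """
    # Cumulative probabilities P(X >= k), padded with 1 (lowest) and 0 (beyond highest).
    cumulative = [1.0] + [1.0 / (1.0 + exp(-a * (theta - b))) for b in thresholds] + [0.0]
    return [cumulative[k] - cumulative[k + 1] for k in range(len(cumulative) - 1)]

# Hypothetical five-category item evaluated at theta = 0.3.
probs = grm_category_probs(0.3, 1.5, [-1.0, 0.0, 1.0, 2.0])
```

With ordered thresholds every category probability is positive and the probabilities sum to one.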
An evidence-based decision assistance model for predicting training outcome in juvenile guide dogs
Craigon, Peter J.; Blythe, Simon A.; England, Gary C. W.; Asher, Lucy
2017-01-01
Working dog organisations, such as Guide Dogs, need to regularly assess the behaviour of the dogs they train. In this study we developed a questionnaire-style behaviour assessment completed by training supervisors of juvenile guide dogs aged 5, 8 and 12 months old (n = 1,401), and evaluated aspects of its reliability and validity. Specifically, internal reliability, temporal consistency, construct validity, predictive criterion validity (comparing against later training outcome) and concurrent criterion validity (comparing against a standardised behaviour test) were evaluated. Thirty-nine questions were sourced either from previously published literature or created to meet requirements identified via Guide Dogs staff surveys and staff feedback. Internal reliability analyses revealed seven reliable and interpretable trait scales named according to the questions within them as: Adaptability; Body Sensitivity; Distractibility; Excitability; General Anxiety; Trainability and Stair Anxiety. Intra-individual temporal consistency of the scale scores between 5–8, 8–12 and 5–12 months was high. All scales except Body Sensitivity showed some degree of concurrent criterion validity. Predictive criterion validity was supported for all seven scales, since associations with training outcome were found at at least one age. Thresholds of z-scores on the scales were identified that were able to distinguish later training outcome by identifying 8.4% of all dogs withdrawn for behaviour and 8.5% of all qualified dogs, with 84% and 85% specificity. The questionnaire assessment was reliable and could detect traits that are consistent within individuals over time, despite juvenile dogs undergoing development during the study period. By applying thresholds to scores produced from the questionnaire this assessment could prove to be a highly valuable decision-making tool for Guide Dogs.
This is the first questionnaire-style assessment of juvenile dogs that has shown value in predicting the training outcome of individual working dogs. PMID:28614347
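The threshold approach described in this record can be sketched in a few lines: standardise the scale scores, flag animals whose z-score crosses a cut-off, and compare the flags with the later outcome. The sketch below assumes that higher scores indicate higher risk of withdrawal; the data, the cut-off, and the function names are hypothetical and do not reproduce the study's procedure.

```python
from statistics import mean, stdev

def z_scores(values):
    """Standardise raw scale scores to z-scores (sample standard deviation)."""
    m, s = mean(values), stdev(values)
    return [(v - m) / s for v in values]

def sensitivity_specificity(z, withdrawn, threshold):
    """Flag a dog when its z-score is at or above threshold.

    withdrawn: list of booleans giving the observed training outcome.
    Returns (sensitivity, specificity) of the flag against that outcome.
    """
    tp = sum(1 for s, w in zip(z, withdrawn) if w and s >= threshold)
    fn = sum(1 for s, w in zip(z, withdrawn) if w and s < threshold)
    tn = sum(1 for s, w in zip(z, withdrawn) if not w and s < threshold)
    fp = sum(1 for s, w in zip(z, withdrawn) if not w and s >= threshold)
    return tp / (tp + fn), tn / (tn + fp)

# Hypothetical z-scores and outcomes for six dogs.
demo_z = [-1.0, -0.5, 0.0, 0.5, 2.0, 2.5]
demo_withdrawn = [False, False, False, True, True, False]
sens, spec = sensitivity_specificity(demo_z, demo_withdrawn, 1.5)
```

Raising the threshold trades sensitivity for specificity, which is consistent with the abstract's pattern of a small detected fraction at high specificity.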
Patient adherence to prescribed antimicrobial drug dosing regimens.
Vrijens, Bernard; Urquhart, John
2005-05-01
The aim of this article is to review current knowledge about the clinical impact of patients' variable adherence to prescribed anti-infective drug dosing regimens, with the aim of renewing interest and exploration of this important but largely neglected area of therapeutics. Central to the estimation of a patient's adherence to a prescribed drug regimen is a reliably compiled drug dosing history. Electronic monitoring methods have emerged as the virtual 'gold standard' for compiling drug dosing histories in ambulatory patients. Reliably compiled drug dosing histories are consistently downwardly skewed, with varying degrees of under-dosing. In particular, the consideration of time intervals between protease inhibitor doses has revealed that ambulatory patients' variable execution of prescribed dosing regimens is a leading source of variance in viral response. Such analyses reveal the need for a new discipline, called pharmionics, which is the study of how ambulatory patients use prescription drugs. Properly analysed, reliable data on the time-course of patients' actual intake of prescription drugs can eliminate a major source of unallocated variance in drug responses, including the non-response that occurs and is easily misinterpreted when a patient's complete non-execution of a prescribed drug regimen is unrecognized clinically. As such, reliable compilation of ambulatory patients' drug dosing histories has the promise of being a key step in reducing unallocated variance in drug response and in improving the informational yield of clinical trials. It is also the basis for sound, measurement-guided steps taken to improve a patient's execution of a prescribed dosing regimen.
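An electronically compiled dosing history is essentially a list of timestamps, and the inter-dose intervals discussed in this record follow directly from it. The sketch below computes those intervals and the share that exceed the prescribed interval by a chosen factor. The tolerance factor, the example timestamps, and the function names are assumptions for illustration; the article does not prescribe a specific metric.

```python
from datetime import datetime, timedelta

def dosing_intervals(timestamps):
    """Hours between consecutive recorded dosing events."""
    ordered = sorted(timestamps)
    return [(b - a).total_seconds() / 3600 for a, b in zip(ordered, ordered[1:])]

def fraction_long_gaps(timestamps, prescribed_hours, tolerance=1.5):
    """Share of inter-dose intervals longer than tolerance * prescribed interval."""
    gaps = dosing_intervals(timestamps)
    if not gaps:
        return 0.0
    long_gaps = [g for g in gaps if g > tolerance * prescribed_hours]
    return len(long_gaps) / len(gaps)

# Hypothetical twice-daily regimen with one missed day.
start = datetime(2005, 5, 1, 8, 0)
events = [start + timedelta(hours=h) for h in (0, 12, 24, 60)]
share = fraction_long_gaps(events, prescribed_hours=12)
```

Looking at intervals rather than a simple percentage of doses taken keeps the timing information that the article argues drives variance in drug response.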
Cross-cultural adaptation of the Nordic musculoskeletal questionnaire.
de Barros, E N C; Alexandre, N M C
2003-06-01
Reports in the literature have identified a need for internationally standardized and reliable measurements to analyse musculoskeletal symptoms. Screening of musculoskeletal disorders may serve as a diagnostic tool to evaluate the work environment. The Nordic general questionnaire is a standardized instrument used to analyse musculoskeletal symptoms in an ergonomic or occupational health context. To translate and adapt a version of the Nordic general questionnaire into Brazilian Portuguese and evaluate its reliability. The cross-cultural adaptation was performed according to internationally recommended methodology, using the following guidelines: translation; back-translation; committee review; and pretesting. First, the questionnaire was independently translated into Portuguese by two teachers and one doctor, and a consensus version was generated. Second, two other translators performed a back-translation independently from one another. This version was then submitted to a committee, consisting of six specialists in the area of knowledge of the instrument, to evaluate its equivalence to the original instrument. The final version was pretested on 20 subjects randomly selected in an outpatient clinic. Reliability was assessed by a test-retest procedure at 1-day intervals using the Kappa coefficient in a group of 40 subjects. The Kappa agreement values were calculated for each one of the four questions of the questionnaire. The agreement among the same observers was substantial, varying from 0.88 to 1, according to the Kappa values. These results demonstrated strong agreement of the instrument, suggesting that the Brazilian version of the "Standardized Nordic Questionnaire" offers substantial reliability.
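The Kappa coefficient used for the test-retest comparison corrects raw agreement for the agreement expected by chance. A minimal sketch of Cohen's kappa for two administrations of one categorical question, kappa = (p_o - p_e) / (1 - p_e), is shown below. The answer coding and the function name are illustrative only.

```python
from collections import Counter

def cohen_kappa(first, second):
    """Cohen's kappa for paired categorical answers from two administrations."""
    n = len(first)
    # Observed proportion of identical answers.
    p_observed = sum(a == b for a, b in zip(first, second)) / n
    # Chance agreement from the marginal answer frequencies.
    count_1, count_2 = Counter(first), Counter(second)
    categories = set(count_1) | set(count_2)
    p_expected = sum(count_1[c] * count_2[c] for c in categories) / (n * n)
    return (p_observed - p_expected) / (1 - p_expected)
```

The coefficient is undefined when chance agreement equals 1, for example when every respondent gives the same answer on both occasions.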
Results of a geochemical survey, Wadi Ash Shu'Bah quadrangle, sheet 26E, Kingdom of Saudi Arabia
Miller, W.R.; Arnold, M.A.
1989-01-01
A major problem in the interpretation of the regional data resulted from the incomplete removal of magnetite before analyses. The magnetite can cause anomalous values for Ni, Fe, V, Cu, and Co because of its ability to incorporate these elements into its structure during magmatic crystallization. It is essential that samples be prepared and analyzed in a consistent manner so that the resulting data may be as reliable as possible.
Ghazanfari, Zeinab; Niknami, Shamsaddin; Ghofranipour, Fazlollah; Hajizadeh, Ebrahim; Montazeri, Ali
2010-11-09
This study was carried out to develop a scale for assessing diabetic patients' perceptions about physical activity and to test its psychometric properties (The Physical Activity Questionnaire for Diabetic Patients-PAQ-DP). An item pool extracted from the Theory of Planned Behavior literature was generated. Then an expert panel evaluated the items by assessing content validity index and content validity ratio. Consequently, exploratory factor analysis (EFA) was performed to indicate the scale constructs. In addition, reliability analyses including internal consistency and test-retest analysis were carried out. In all, a sample of 127 women with diabetes participated in the study. Twenty-two items were initially extracted from the literature. A six-factor solution (containing 19 items) emerged as a result of an exploratory factor analysis, namely: instrumental attitude, subjective norm, perceived behavioral control, affective attitude, self-identity, and intention, explaining 60.30% of the variance observed. Additional analyses indicated satisfactory results for internal consistency (Cronbach's alpha ranging from 0.54 to 0.8) and intraclass correlation coefficients (ranging from 0.40 to 0.92). The Physical Activity Questionnaire for Diabetic Patients (PAQ-DP) is the first instrument that applies the Theory of Planned Behavior in its constructs. The findings indicated that the PAQ-DP is a reliable and valid measure for assessing physical activity perceptions and now is available and can be used in future studies. PMID:21062466
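The expert-panel step in this record refers to the content validity ratio and content validity index. The abstract does not give the formulas used; the sketch below assumes the commonly cited definitions, Lawshe's CVR = (n_e - N/2) / (N/2) for n_e of N experts rating an item essential, and an item-level CVI equal to the proportion of experts rating the item 3 or 4 on a four-point relevance scale. Function names and the cut-off parameter are illustrative.

```python
def content_validity_ratio(n_essential, n_experts):
    """Lawshe's CVR: ranges from -1 (no expert says essential) to +1 (all do)."""
    half = n_experts / 2
    return (n_essential - half) / half

def item_cvi(ratings, relevant_min=3):
    """Item-level CVI: share of experts rating the item at or above relevant_min."""
    return sum(1 for r in ratings if r >= relevant_min) / len(ratings)
```

For example, an item judged essential by 8 of 10 experts has a CVR of 0.6, and an item rated 4, 3, 2, 4 by four experts has an item-level CVI of 0.75.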
Ritter, Philip L; Lorig, Kate
2014-11-01
Self-efficacy theory, as developed by Bandura, suggests that self-efficacy is an important predictor of future behavior. The Chronic Disease Self-Management Program was designed to enhance self-efficacy as one approach to improving health behaviors and outcomes for people with varying chronic diseases. The six-item Self-Efficacy to Manage Chronic Disease Scale (SEMCD) and the four-item Spanish-language version (SEMCD-S) were developed to measure changes in self-efficacy in program participants and have been used in numerous evaluations of chronic disease self-management programs. This study describes the development of the scales and their psychometric properties. Secondary analyses of questionnaire data from 2,866 participants in six studies are used to quantify and evaluate the SEMCD. Data from 868 participants in two studies are used for the SEMCD-S. Subjects consisted of individuals with various chronic conditions, who enrolled in chronic disease self-management programs (either small group or Internet based). Subjects came from the United States, England, Canada, Mexico, and Australia. Descriptive statistics are summarized, reliability tested (Cronbach alpha), and principal component analyses applied to items. Baseline and change scores are correlated with baseline and change scores for five medical outcome variables that have been shown to be associated with self-efficacy measures in past studies. Principal component analyses confirmed the one-dimensional structure of the scales. The SEMCD had means ranging from 4.9 to 6.1 and the SEMCD-S 6.1 and 6.2. Internal consistency was high (Cronbach alpha, 0.88-0.95). The scales were sensitive to change and significantly correlated with health outcomes. The SEMCD and SEMCD-S are reliable and appear to be valid instruments for assessing self-efficacy for managing chronic disease. There was remarkable consistency across a range of studies from varying countries using two languages. Copyright © 2014 Elsevier Inc. All rights reserved.
Shi, Qiyun; MacDermid, Joy C; Tang, Kenneth; Sinden, Kathryn E; Walton, Dave; Grewal, Ruby
2017-06-01
Background The long version of the Organizational Policies and Practices (OPP) scale had a high respondent burden, and short versions were developed to address this drawback. The 11-item version showed promise, but the ergonomic subscale was deficient. The OPP-14 was developed by adding three additional items to the ergonomics subscale. The aim of this study is to evaluate the factor structure using confirmatory factor and Rasch analyses in healthy firefighters. Methods A total of 261 firefighters (mean age 42 years, 95% male) were sampled. Confirmatory factor and Rasch analyses were used to assess the internal consistency, factor structure and other psychometric characteristics of the revised OPP-14. Results The OPP-14 demonstrates sound factor structure and internal consistency in firefighters. Confirmatory factor analysis confirmed the consistency of the original 4-domain structure (CFI = 0.97, TLI = 0.96, and RMSEA = 0.053). The 5 items that initially showed misfit with disordered thresholds were rescored. The four subscales satisfied Rasch expectations, with good targeting and acceptable reliability. Conclusions The OPP-14 scale shows a promising factor structure in this sample and remediated deficits found in the OPP-11. This version may be preferable for musculoskeletal concerns or work applications where ergonomic indicators are relevant.
Zhang, Dengke; Pang, Yanxia; Cai, Weixiong; Fazio, Rachel L; Ge, Jianrong; Su, Qiaorong; Xu, Shuiqin; Pan, Yinan; Chen, Sanmei; Zhang, Hongwei
2016-08-01
Impairment of theory of mind (ToM) is a common phenomenon following traumatic brain injury (TBI) that has clear effects on patients' social functioning. A growing body of research has focused on this area, and several methods have been developed to assess ToM deficiency. Although an informant assessment scale would be useful for examining individuals with TBI, very few studies have adopted this approach. The purpose of the present study was to develop an informant assessment scale of ToM for adults with traumatic brain injury (IASToM-aTBI) and to test its reliability and validity with 196 adults with TBI and 80 normal adults. A 44-item scale was developed following a literature review, interviews with patient informants, consultations with experts, item analysis, and exploratory factor analysis (EFA). The following three common factors were extracted: social interaction, understanding of beliefs, and understanding of emotions. The psychometric analyses indicate that the scale has good internal consistency reliability, split-half reliability, test-retest reliability, inter-rater reliability, structural validity, discriminant validity and criterion validity. These results provide preliminary evidence that supports the reliability and validity of the IASToM-aTBI as a ToM assessment tool for adults with TBI.
HDMR methods to assess reliability in slope stability analyses
NASA Astrophysics Data System (ADS)
Kozubal, Janusz; Pula, Wojciech; Vessia, Giovanna
2014-05-01
Stability analyses of complex rock-soil deposits shall be tackled considering the complex structure of discontinuities within rock mass and embedded soil layers. These materials are characterized by a high variability in physical and mechanical properties. Thus, to calculate the slope safety factor in stability analyses two issues must be taken into account: 1) the uncertainties related to structural setting of the rock-slope mass and 2) the variability in mechanical properties of soils and rocks. High Dimensional Model Representation (HDMR) (Chowdhury et al. 2009; Chowdhury and Rao 2010) can be used to carry out the reliability index within complex rock-soil slopes when numerous random variables with high coefficient of variations are considered. HDMR implements the inverse reliability analysis, meaning that the unknown design parameters are sought provided that prescribed reliability index values are attained. Such approach uses implicit response functions according to the Response Surface Method (RSM). The simple RSM can be efficiently applied when less than four random variables are considered; as the number of variables increases, the efficiency in reliability index estimation decreases due to the great amount of calculations. Therefore, HDMR method is used to improve the computational accuracy. In this study, the sliding mechanism in Polish Flysch Carpathian Mountains have been studied by means of HDMR. The Southern part of Poland where Carpathian Mountains are placed is characterized by a rather complicated sedimentary pattern of flysh rocky-soil deposits that can be simplified into three main categories: (1) normal flysch, consisting of adjacent sandstone and shale beds of approximately equal thickness, (2) shale flysch, where shale beds are thicker than adjacent sandstone beds, and (3) sandstone flysch, where the opposite holds. 
Landslides occur in all flysch deposit types; thus, some configurations of possible unstable settings (within fractured rocky-soil masses) resulting in sliding mechanisms have been investigated in this study. The reliability index values obtained with the HDMR method have been compared with conventional approaches such as neural networks: the efficiency of HDMR is shown in the case studied. References Chowdhury R., Rao B.N. and Prasad A.M. 2009. High-dimensional model representation for structural reliability analysis. Commun. Numer. Meth. Engng, 25: 301-337. Chowdhury R. and Rao B. 2010. Probabilistic Stability Assessment of Slopes Using High Dimensional Model Representation. Computers and Geotechnics, 37: 876-884.
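The basic idea behind a first-order (cut-)HDMR approximation can be sketched in a few lines: the response is approximated by its value at a reference point plus one-variable corrections. This is only an illustration under stated assumptions. The limit-state function `g`, the reference point, and all names are hypothetical and are not taken from the study, and the sketch evaluates `g` directly rather than interpolating between sample points as a practical implementation would.

```python
from typing import Callable, List, Sequence


def first_order_cut_hdmr(
    g: Callable[[List[float]], float], ref: Sequence[float]
) -> Callable[[Sequence[float]], float]:
    """Build the approximation g0 + sum_i [g(ref with x_i replaced) - g0]."""
    g0 = g(list(ref))  # response at the reference (cut) point

    def approx(x: Sequence[float]) -> float:
        total = g0
        for i, xi in enumerate(x):
            point = list(ref)
            point[i] = xi  # vary one variable, keep the others at the reference
            total += g(point) - g0
        return total

    return approx


# Hypothetical additive limit state (resistance terms minus load); illustration only.
def g(x: List[float]) -> float:
    cohesion, friction, load = x
    return 2.0 * cohesion + 3.0 * friction - load


approx = first_order_cut_hdmr(g, ref=[10.0, 0.5, 15.0])
```

For an additive function such as this toy `g`, the first-order expansion reproduces the function exactly; interaction terms between variables are what the higher-order components would have to capture.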
Yang, Fang Yu; Zhao, Rong Rong; Liu, Yi Si; Wu, Ying; Jin, Ning Ning; Li, Rui Ying; Shi, Shu Ping; Shao, Yue Ying; Guo, Ming; Arthur, David; Elliott, Malcolm
2013-12-01
A review of the literature showed that the core competencies needed by newly graduated Chinese nurses were as yet undocumented. To develop a psychometrically sound instrument for identifying and measuring the core competencies needed by Chinese nursing baccalaureate graduates. Descriptive correlational and multicentre study. Seven major tertiary teaching hospitals and three major medical universities in Beijing. 790 subjects, including patients, nursing faculty members, doctors and nurses. A reliable and valid self-report instrument, consisting of 58 items, was developed using multiple methods. It was then distributed to 790 subjects to measure nursing competency in a broader Chinese context. The psychometric characteristics of reliability and validity were supported by descriptive and inferential analyses. The final instrument consists of six dimensions with 47 items. The content validity index was 0.90. The overall scale reliability was 0.97, with dimension reliabilities ranging from 0.87 to 0.94. Six domains of core competencies were identified: professionalism; direct care; support and communication; application of professional knowledge; personal traits; and critical thinking and innovation. The findings of this study provide valuable evidence for a psychometrically sound measurement tool, as well as for competency-based nursing curriculum reform. Copyright © 2013 Elsevier Ltd. All rights reserved.
Simple shoulder test and Oxford Shoulder Score: Persian translation and cross-cultural validation.
Naghdi, Soofia; Nakhostin Ansari, Noureddin; Rustaie, Nilufar; Akbari, Mohammad; Ebadi, Safoora; Senobari, Maryam; Hasson, Scott
2015-12-01
To translate, culturally adapt, and validate the simple shoulder test (SST) and Oxford Shoulder Score (OSS) into Persian language using a cross-sectional and prospective cohort design. A standard forward and backward translation was followed to culturally adapt the SST and the OSS into Persian language. Psychometric properties of floor and ceiling effects, construct convergent validity, discriminant validity, internal consistency reliability, test-retest reliability, standard error of the measurement (SEM), smallest detectable change (SDC), and factor structure were determined. One hundred patients with shoulder disorders and 50 healthy subjects participated in the study. The PSST and the POSS showed no missing responses. No floor or ceiling effects were observed. Both the PSST and POSS detected differences between patients and healthy subjects supporting their discriminant validity. Construct convergent validity was confirmed by a very good correlation between the PSST and POSS (r = 0.68). There was high internal consistency for both the PSST (α = 0.73) and the POSS (α = 0.91 and 0.92). Test-retest reliability with 1-week interval was excellent (ICCagreement = 0.94 for PSST and 0.90 for POSS). Factor analyses demonstrated a three-factor solution for the PSST (49.7 % of variance) and a two-factor solution for the POSS (61.6 % of variance). The SEM/SDC was satisfactory for PSST (5.5/15.3) and POSS (6.8/18.8). The PSST and POSS are valid and reliable outcome measures for assessing functional limitations in Persian-speaking patients with shoulder disorders.
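The relation between the SEM and SDC values reported above can be checked with a short sketch. It assumes the commonly used formulas SDC = 1.96 × √2 × SEM and SEM = SD × √(1 − ICC); the abstract does not state which formulas the authors applied, and the function names are illustrative.

```python
import math


def sem_from_icc(sd: float, icc: float) -> float:
    """Standard error of measurement from a score SD and a reliability (ICC)."""
    return sd * math.sqrt(1.0 - icc)


def smallest_detectable_change(sem: float, z: float = 1.96) -> float:
    """SDC at the 95% level for a difference between two measurements."""
    return z * math.sqrt(2.0) * sem


sdc_psst = smallest_detectable_change(5.5)  # SEM reported for the PSST
sdc_poss = smallest_detectable_change(6.8)  # SEM reported for the POSS
```

Under these assumptions the reported SEMs give roughly 15.2 and 18.8, close to the published SDC values of 15.3 and 18.8; the small gap for the PSST may simply reflect rounding of the SEM.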
Bessette, Katie L; Jenkins, Lisanne M; Skerrett, Kristy A; Gowins, Jennifer R; DelDonno, Sophie R; Zubieta, Jon-Kar; McInnis, Melvin G; Jacobs, Rachel H; Ajilore, Olusola; Langenecker, Scott A
2018-01-01
There is substantial variability across studies of default mode network (DMN) connectivity in major depressive disorder, and reliability and time-invariance are not reported. This study evaluates whether DMN dysconnectivity in remitted depression (rMDD) is reliable over time and symptom-independent, and explores convergent relationships with cognitive features of depression. A longitudinal study was conducted with 82 young adults free of psychotropic medications (47 rMDD, 35 healthy controls) who completed clinical structured interviews, neuropsychological assessments, and 2 resting-state fMRI scans across 2 study sites. Functional connectivity analyses from bilateral posterior cingulate and anterior hippocampal formation seeds in DMN were conducted at both time points within a repeated-measures analysis of variance to compare groups and evaluate reliability of group-level connectivity findings. Eleven hyper- (from posterior cingulate) and 6 hypo- (from hippocampal formation) connectivity clusters in rMDD were obtained with moderate to adequate reliability in all but one cluster (ICC's range = 0.50 to 0.76 for 16 of 17). The significant clusters were reduced with a principal component analysis (5 components obtained) to explore these connectivity components, and were then correlated with cognitive features (rumination, cognitive control, learning and memory, and explicit emotion identification). At the exploratory level, for convergent validity, components consisting of posterior cingulate with cognitive control network hyperconnectivity in rMDD were related to cognitive control (inverse) and rumination (positive). Components consisting of anterior hippocampal formation with social emotional network and DMN hypoconnectivity were related to memory (inverse) and happy emotion identification (positive). Thus, time-invariant DMN connectivity differences exist early in the lifespan course of depression and are reliable.
The nuanced results suggest a ventral within-network hypoconnectivity associated with poor memory and a dorsal cross-network hyperconnectivity linked to poorer cognitive control and elevated rumination. Study of early course remitted depression with attention to reliability and symptom independence could lead to more readily translatable clinical assessment tools for biomarkers.
Shou, Juan; Ren, Limin; Wang, Haitang; Yan, Fei; Cao, Xiaoyun; Wang, Hui; Wang, Zhiliang; Zhu, Shanzhu; Liu, Yao
2016-04-01
The 12-item Short-Form Health Survey (SF-12) is the abridged practical version of SF-36. This cross-sectional study aimed to assess the reliability and validity of SF-12 for the health status of the Chinese community elderly population. The Chinese community elderly people in Xujiahui district of Shanghai were investigated. The internal consistency reliability was assessed using Cronbach's alpha and split-half reliability coefficients. Construct validity was analyzed using exploratory factor analysis (EFA) and confirmatory factor analysis (CFA). Spearman's correlation coefficient (ρ) was used for the evaluation of criterion, convergent, and discriminant validity with Spearman's ρ ≥ 0.4 as satisfactory. Comparisons of the SF-12 summary scores among populations that differed in demographics were performed for discriminant validity. A total of 1343 individuals aged ≥60 and <85 years old (response rate: 91.3 %) were analyzed. The Cronbach's α value (0.910) and the split-half reliability coefficient (0.812) reflected satisfactory internal consistency reliability of SF-12. EFA extracted a two-factor model (physical and mental health). About 60.7 % of the total variance was explained by the two factors. CFA showed that the two-factor solution provided a good fit to the data. Good convergent validity and discriminant validity of SF-12 were supported by the correlation analyses (Spearman's ρ > 0.4) and the comparisons of the SF-12 summary scores among populations (P < 0.05). SF-12 summary scores were significantly correlated with the SF-36 summary scores (Spearman's ρ > 0.4, P < 0.05). In conclusion, SF-12 had satisfactory reliability and validity in measuring health status of the Chinese community elderly population in Xujiahui district of Shanghai.
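Cronbach's alpha, the internal-consistency statistic used here and in many of the other records, can be computed from a respondents-by-items score matrix as in the following minimal sketch. The score matrix is hypothetical, and the Spearman-Brown step-up shown for split-half reliability is the textbook correction, not necessarily the exact procedure used in the study.

```python
from statistics import variance
from typing import List, Sequence


def cronbach_alpha(scores: Sequence[Sequence[float]]) -> float:
    """alpha = k/(k-1) * (1 - sum(item variances) / variance(total scores))."""
    k = len(scores[0])  # number of items
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1.0 - sum(item_vars) / total_var)


def spearman_brown(half_correlation: float) -> float:
    """Step a correlation between two half-tests up to full-test reliability."""
    return 2.0 * half_correlation / (1.0 + half_correlation)


# Hypothetical data: 5 respondents (rows) answering 3 items (columns).
scores: List[List[float]] = [[2, 3, 3], [4, 4, 5], [1, 2, 2], [5, 5, 4], [3, 3, 3]]
alpha = cronbach_alpha(scores)
```

Sample variances are used throughout; because alpha depends only on the ratio of variances, using population variances consistently would give the same value.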
de Vries, Merlijn W; Visscher, Corine; Delwel, Suzanne; van der Steen, Jenny T; Pieper, Marjoleine J C; Scherder, Erik J A; Achterberg, Wilco P; Lobbezoo, Frank
2016-01-01
Objectives. The aim of this study was to establish the reliability of the "chewing" subscale of the OPS-NVI, a novel tool designed to estimate presence and severity of orofacial pain in nonverbal patients. Methods. The OPS-NVI consists of 16 items for observed behavior, classified into four categories and a subjective estimate of pain. Two observers used the OPS-NVI for 237 video clips of people with dementia in Dutch nursing homes during their meal to observe their behavior and to estimate the intensity of orofacial pain. Six weeks later, the same observers rated the video clips a second time. Results. Bottom and ceiling effects for some items were found. This resulted in exclusion of these items from the statistical analyses. The categories which included the remaining items (n = 6) showed reliability varying between fair-to-good and excellent (interobserver reliability, ICC: 0.40-0.47; intraobserver reliability, ICC: 0.40-0.92). Conclusions. The "chewing" subscale of the OPS-NVI showed a fair-to-good to excellent interobserver and intraobserver reliability in this dementia population. This study contributes to the validation process of the OPS-NVI as a whole and stresses the need for further assessment of the reliability of the OPS-NVI with subjects that might already show signs of orofacial pain.
Bergeron, Lise; Smolla, Nicole; Berthiaume, Claude; Renaud, Johanne; Breton, Jean-Jacques; St-Georges, Marie; Morin, Pauline; Zavaglia, Elissa; Labelle, Réal
2017-03-01
The Dominic Interactive for Adolescents-Revised (DIA-R) is a multimedia self-report screen for 9 mental disorders, borderline personality traits, and suicidality defined by the fifth edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM-5). This study aimed to examine the reliability and the validity of this instrument. French- and English-speaking adolescents aged 12 to 15 years (N = 447) were recruited from schools and clinical settings in Montreal and were evaluated twice. The internal consistency was estimated by Cronbach alpha coefficients and the test-retest reliability by intraclass correlation coefficients. Cutoff points on the DIA-R scales were determined by using clinically relevant measures for defining external validation criteria: the Schedule for Affective Disorders and Schizophrenia for School-Aged Children, the Beck Hopelessness Scale, and the Abbreviated-Diagnostic Interview for Borderlines. Receiver operating characteristic (ROC) analyses provided accuracy estimates (area under the ROC curve, sensitivity, specificity, likelihood ratio) to evaluate the ability of the DIA-R scales to predict external criteria. For most of the DIA-R scales, reliability coefficients were excellent or moderate. High or moderate accuracy estimates from ROC analyses demonstrated the ability of the DIA-R thresholds to predict psychopathological conditions. These thresholds were generally capable of discriminating between clinical and school subsamples. However, the validity of the obsessions/compulsions scale was too low. Findings clearly support the reliability and the validity of the DIA-R. This instrument may be useful to assess a wide range of adolescents' mental health problems in the continuum of services. This conclusion applies to all scales, except the obsessions/compulsions one.
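The accuracy estimates named above (sensitivity, specificity, likelihood ratio, area under the ROC curve) follow from simple counts and pairwise comparisons. The sketch below is illustrative only; the counts and scores are hypothetical and do not come from the DIA-R study.

```python
from typing import Dict, Sequence


def screening_accuracy(tp: int, fp: int, fn: int, tn: int) -> Dict[str, float]:
    """Sensitivity, specificity and positive likelihood ratio at one cutoff."""
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    # LR+ is unbounded when no negatives are misclassified.
    lr_positive = sensitivity / (1.0 - specificity) if specificity < 1.0 else float("inf")
    return {"sensitivity": sensitivity, "specificity": specificity, "lr_positive": lr_positive}


def auc(pos_scores: Sequence[float], neg_scores: Sequence[float]) -> float:
    """Area under the ROC curve as the share of (case, non-case) pairs
    in which the case scores higher; ties count as one half."""
    wins = 0.0
    for p in pos_scores:
        for n in neg_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos_scores) * len(neg_scores))


# Hypothetical counts at a single cutoff and hypothetical scale scores.
stats = screening_accuracy(tp=40, fp=20, fn=10, tn=80)
area = auc([3, 4, 5], [1, 2, 3])
```

The pairwise definition of the area is equivalent to the Mann-Whitney statistic and avoids building the full ROC curve, at the cost of quadratic time in the sample size.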
Personality traits in companion dogs-Results from the VIDOPET.
Turcsán, Borbála; Wallis, Lisa; Virányi, Zsófia; Range, Friederike; Müller, Corsin A; Huber, Ludwig; Riemer, Stefanie
2018-01-01
Individual behavioural differences in pet dogs are of great interest from a basic and applied research perspective. Most existing dog personality tests have specific (practical) goals in mind and so focused only on a limited aspect of dogs' personality, such as identifying problematic (aggressive or fearful) behaviours, assessing suitability as working dogs, or improving the results of adoption. Here we aimed to create a comprehensive test of personality in pet dogs that goes beyond traditional practical evaluations by exposing pet dogs to a range of situations they might encounter in everyday life. The Vienna Dog Personality Test (VIDOPET) consists of 15 subtests and was performed on 217 pet dogs. A two-step data reduction procedure (principal component analysis on each subtest followed by an exploratory factor analysis on the subtest components) yielded five factors: Sociability-obedience, Activity-independence, Novelty seeking, Problem orientation, and Frustration tolerance. A comprehensive evaluation of reliability and validity measures demonstrated excellent inter- and intra-observer reliability and adequate internal consistency of all factors. Moreover, the test showed good temporal consistency when re-testing a subsample of dogs after an average of 3.8 years, a considerably longer test-retest interval than assessed for any other dog personality test, to our knowledge. The construct validity of the test was investigated by analysing the correlations between the results of video coding and video rating methods and the owners' assessment via a dog personality questionnaire. The results demonstrated good convergent as well as discriminant validity. To conclude, the VIDOPET is not only a highly reliable and valid tool for measuring dog personality, but also the first test to show consistent behavioural traits related to problem solving ability and frustration tolerance in pet dogs.
González, Patricia; Nuñez, Alicia; Merz, Erin; Brintz, Carrie; Weitzman, Orit; Navas, Elena; Camacho, Alvaro; Buelna, Christina; Penedo, Frank J.; Wassertheil-Smoller, Sylvia; Perreira, Krista; Isasi, Carmen; Choca, James; Talavera, Gregory A.; Gallo, Linda C.
2016-01-01
The Center for Epidemiologic Studies Depression Scale (CES-D) is a widely used self-report measure of depression symptomatology. This study evaluated the reliability, validity, and measurement invariance of the CES-D 10 in a diverse cohort of Hispanics/Latinos from the Hispanic Community Health Study/Study of Latinos (HCHS/SOL). The sample consisted of 16,415 Hispanic/Latino adults recruited from four field centers (Miami, FL; San Diego, CA; Bronx, NY; Chicago, IL). Participants completed interviewer-administered measures in English or Spanish. The CES-D 10 was examined for internal consistency, test-retest reliability, convergent validity, and measurement invariance. The total score for the CES-D 10 displayed acceptable internal consistencies (Cronbach α’s = .80 – .86) and test-retest reliability (r’s = .41 – .70) across the total sample, language group and ethnic background group. The total CES-D 10 scores correlated in a theoretically consistent manner with the Spielberger State-Trait Anxiety Inventory (r = .72, p < .001), the Patient Health Questionnaire-9 depression measure (r = .80, p < .001), the Short Form-12’s Mental Component Summary (r = −.65, p < .001) and Physical Component Summary score (r = −.25, p < .001). A confirmatory factor analysis showed that a one-factor model fit the CES-D 10 data well (CFI = .986, RMSEA = .047) after correlating one pair of item residual variances. Multiple group analyses showed the one-factor structure to be invariant across English and Spanish speaking responders and partially invariant across Hispanic/Latino background groups. The total score of the CES-D 10 can be recommended for use with Hispanics/Latinos in English and Spanish. PMID:27295022
Personality traits in companion dogs—Results from the VIDOPET
Wallis, Lisa; Virányi, Zsófia; Range, Friederike; Müller, Corsin A.; Huber, Ludwig; Riemer, Stefanie
2018-01-01
Individual behavioural differences in pet dogs are of great interest from a basic and applied research perspective. Most existing dog personality tests have specific (practical) goals in mind and so focused only on a limited aspect of dogs’ personality, such as identifying problematic (aggressive or fearful) behaviours, assessing suitability as working dogs, or improving the results of adoption. Here we aimed to create a comprehensive test of personality in pet dogs that goes beyond traditional practical evaluations by exposing pet dogs to a range of situations they might encounter in everyday life. The Vienna Dog Personality Test (VIDOPET) consists of 15 subtests and was performed on 217 pet dogs. A two-step data reduction procedure (principal component analysis on each subtest followed by an exploratory factor analysis on the subtest components) yielded five factors: Sociability-obedience, Activity-independence, Novelty seeking, Problem orientation, and Frustration tolerance. A comprehensive evaluation of reliability and validity measures demonstrated excellent inter- and intra-observer reliability and adequate internal consistency of all factors. Moreover the test showed good temporal consistency when re-testing a subsample of dogs after an average of 3.8 years—a considerably longer test-retest interval than assessed for any other dog personality test, to our knowledge. The construct validity of the test was investigated by analysing the correlations between the results of video coding and video rating methods and the owners’ assessment via a dog personality questionnaire. The results demonstrated good convergent as well as discriminant validity. To conclude, the VIDOPET is not only a highly reliable and valid tool for measuring dog personality, but also the first test to show consistent behavioural traits related to problem solving ability and frustration tolerance in pet dogs. PMID:29634747
Space shuttle hypergolic bipropellant RCS engine design study, Bell model 8701
NASA Technical Reports Server (NTRS)
1974-01-01
A research program was conducted to define the level of the current technology base for reaction control system rocket engines suitable for space shuttle applications. The project consisted of engine analyses, design, fabrication, and tests. The specific objectives are: (1) extrapolating current engine design experience to design of an RCS engine with required safety, reliability, performance, and operational capability, (2) demonstration of multiple reuse capability, and (3) identification of current design and technology deficiencies and critical areas for future effort.
Park, Hyeon Jin; Yang, Hyung Kook; Shin, Dong Wook; Kim, Yoon Yi; Kim, Young Ae; Yun, Young Ho; Nam, Byung Ho; Bhatia, Smita; Park, Byung Kiu; Ghim, Thad T; Kang, Hyoung Jin; Park, Kyung Duk; Shin, Hee Young; Ahn, Hyo Seop
2013-12-01
We verified the reliability and validity of the Korean version of the Minneapolis-Manchester Quality of Life Instrument-Adolescent Form (KMMQL-AF) among Korean childhood cancer survivors. A total of 107 childhood cancer patients undergoing cancer treatment and 98 childhood cancer survivors who completed cancer treatment were recruited. To assess the internal structure of the KMMQL-AF, we performed multi-trait scaling analyses and exploratory factor analysis. Additionally, we compared each domain of the KMMQL-AF with those of the Karnofsky Performance Status Scale and the Revised Children's Manifest Anxiety Scale (RCMAS). Internal consistency of the KMMQL-AF was sufficient (Cronbach's alpha: 0.78-0.92). In multi-trait scaling analyses, the KMMQL-AF showed sufficient construct validity. The "physical functioning" domain showed moderate correlation with Karnofsky scores and the "psychological functioning" domain showed moderate-to-high correlation with the RCMAS. The KMMQL-AF discriminated between subgroups of different adolescent cancer survivors depending on treatment completion. The KMMQL-AF is a sufficiently reliable and valid instrument for measuring quality of life among Korean childhood cancer survivors.
Development of the scale of hygiene behaviors for nursing students.
Ipek Coban, Gulay; Bilgin, Sonay
2015-08-21
There is a need for an appropriate instrument to measure the hygiene behaviors of nursing students. This study was carried out to develop a Hygiene Behavior Scale (HBS). The population of the study is composed of the students of a nursing department. A total of 416 participants were included in this study. The students in the sampling group were asked to write a composition containing their feelings and thoughts about hygiene. These compositions were analysed and 87 items about positive and negative behaviors were determined. These items were submitted for expert opinion and, after the necessary revisions, reliability and validity analyses were conducted. The resulting HBS consists of 25 items across the following three domains: personal hygiene, handwashing technique and food-related hygiene. The final model in confirmatory factor analysis showed a good fit for this 25-item HBS. The value of Cronbach's α for the total scale was 0.90. The HBS was found to be a highly valid, reliable, and sufficient measuring instrument for determining the hygiene behaviors of nursing students.
Cole, Jason C; Ito, Diane; Chen, Yaozhu J; Cheng, Rebecca; Bolognese, Jennifer; Li-McLeod, Josephine
2014-09-04
There is a lack of validated instruments to measure the level of burden of Alzheimer's disease (AD) on caregivers. The Impact of Alzheimer's Disease on Caregiver Questionnaire (IADCQ) is a 12-item instrument with a seven-day recall period that measures AD caregiver's burden across emotional, physical, social, financial, sleep, and time aspects. Primary objectives of this study were to evaluate psychometric properties of IADCQ administered on the Web and to determine the most appropriate scoring algorithm. A national sample of 200 unpaid AD caregivers participated in this study by completing the Web-based version of IADCQ and Short Form-12 Health Survey Version 2 (SF-12v2™). The SF-12v2 was used to measure convergent validity of IADCQ scores and to provide an understanding of the overall health-related quality of life of sampled AD caregivers. The IADCQ survey was also completed four weeks later by a randomly selected subgroup of 50 participants to assess test-retest reliability. Confirmatory factor analysis (CFA) was implemented to test the dimensionality of the IADCQ items. Classical item-level and scale-level psychometric analyses were conducted to estimate psychometric characteristics of the instrument. Test-retest reliability was performed to evaluate the instrument's stability and consistency over time. Virtually none (2%) of the respondents had either floor or ceiling effects, indicating the IADCQ covers an ideal range of burden. A single-factor model obtained appropriate goodness of fit and provided evidence that a simple sum score of the 12 items of IADCQ can be used to measure AD caregiver's burden. Scale-level reliability was supported with a coefficient alpha of 0.93 and an intra-class correlation coefficient (for test-retest reliability) of 0.68 (95% CI: 0.50-0.80). Low-moderate negative correlations were observed between the IADCQ and scales of the SF-12v2.
The study findings suggest the IADCQ has appropriate psychometric characteristics as a unidimensional, Web-based measure of AD caregiver burden and is supported by strong model fit statistics from CFA, high degree of item-level reliability, good internal consistency, moderate test-retest reliability, and moderate convergent validity. Additional validation of the IADCQ is warranted to ensure invariance between the paper-based and Web-based administration and to determine an appropriate responder definition.
The application of the statistical theory of extreme values to gust-load problems
NASA Technical Reports Server (NTRS)
Press, Harry
1950-01-01
An analysis is presented which indicates that the statistical theory of extreme values is applicable to the problems of predicting the frequency of encountering the larger gust loads and gust velocities for both specific test conditions as well as commercial transport operations. The extreme-value theory provides an analytic form for the distributions of maximum values of gust load and velocity. Methods of fitting the distribution are given along with a method of estimating the reliability of the predictions. The theory of extreme values is applied to available load data from commercial transport operations. The results indicate that the estimates of the frequency of encountering the larger loads are more consistent with the data and more reliable than those obtained in previous analyses. (author)
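To make the extreme-value idea concrete, the sketch below fits a Gumbel (type I extreme-value) distribution to a set of maxima and evaluates the probability of exceeding a given level. It is a minimal illustration under stated assumptions: the method-of-moments fit is a standard textbook technique and is not claimed to be the fitting method of the report, and the sample of maxima is hypothetical.

```python
import math
from statistics import mean, stdev
from typing import Sequence, Tuple

EULER_GAMMA = 0.5772156649015329  # Euler-Mascheroni constant


def fit_gumbel_moments(maxima: Sequence[float]) -> Tuple[float, float]:
    """Method-of-moments estimates (location mu, scale beta) for a Gumbel law."""
    beta = stdev(maxima) * math.sqrt(6.0) / math.pi
    mu = mean(maxima) - EULER_GAMMA * beta
    return mu, beta


def exceedance_probability(x: float, mu: float, beta: float) -> float:
    """P(maximum > x) under the fitted Gumbel distribution."""
    return 1.0 - math.exp(-math.exp(-(x - mu) / beta))


# Hypothetical per-flight maximum load increments; not data from the report.
maxima = [0.8, 1.1, 0.9, 1.4, 1.0, 1.2, 0.7, 1.6]
mu, beta = fit_gumbel_moments(maxima)
p_exceed = exceedance_probability(2.0, mu, beta)
```

The reciprocal of `p_exceed` gives the expected number of samples (for example, flights) between exceedances of the chosen level, which is the kind of frequency estimate the abstract refers to.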
Hernandez, Ana; Gallardo-Pujol, David; Pereda, Noemí; Arntz, Arnoud; Bernstein, David P; Gaviria, Ana M; Labad, Antonio; Valero, Joaquín; Gutiérrez-Zotes, Jose Alfonso
2013-05-01
The present study examines the internal consistency and factor structure of the Spanish version of the Childhood Trauma Questionnaire-Short Form (CTQ-SF) and the association between the CTQ-SF subscales and parenting style. Cronbach's α and confirmatory factor analyses (CFA) were performed in a female clinical sample (n = 185). Kendall's τ correlations were calculated between the maltreatment and parenting scales in a subsample of 109 patients. The Spanish CTQ-SF showed adequate psychometric properties and a good fit of the 5-factor structure. The neglect and abuse scales were negatively associated with parental care and positively associated with overprotection scales. The results of this study provide initial support for the reliability and validity of the Spanish CTQ-SF.
Grilo, C M
2004-01-01
To examine the factor structure of DSM-IV criteria for obsessive compulsive personality disorder (OCPD) in patients with binge eating disorder (BED). Two hundred and eleven consecutive out-patients with axis I diagnoses of BED were reliably assessed with semi-structured diagnostic interviews. The eight criteria for the OCPD diagnosis were examined with reliability and correlational analyses. Exploratory factor analysis was performed to identify potential components. Cronbach's coefficient alpha for the OCPD criteria was 0.77. Principal components factor analysis with varimax rotation revealed a three-factor solution (rigidity, perfectionism, and miserliness), which accounted for 65% of variance. The DSM-IV criteria for OCPD showed good internal consistency. Exploratory factor analysis, however, revealed three components that may reflect distinct interpersonal, intrapersonal (cognitive), and behavioral features.
Boer, Annemarie; Dutmer, Alisa L; Schiphorst Preuper, Henrica R; van der Woude, Lucas H V; Stewart, Roy E; Deyo, Richard A; Reneman, Michiel F; Soer, Remko
2017-10-01
Validation study with cross-sectional and longitudinal measurements. To translate the US National Institutes of Health (NIH)-minimal dataset for clinical research on chronic low back pain into the Dutch language and to test its validity and reliability among people with chronic low back pain. The NIH developed a minimal dataset to encourage more complete and consistent reporting of clinical research and to be able to compare studies across countries in patients with low back pain. In the Netherlands, the NIH-minimal dataset has not been translated before and measurement properties are unknown. Cross-cultural validity was tested by a formal forward-backward translation. Structural validity was tested with exploratory factor analyses (comparative fit index, Tucker-Lewis index, and root mean square error of approximation). Hypothesis testing was performed to compare subscales of the NIH dataset with the Pain Disability Index and the EuroQol-5D (Pearson correlation coefficients). Internal consistency was tested with Cronbach α and test-retest reliability at 2 weeks was calculated in a subsample of patients with Intraclass Correlation Coefficients and weighted Kappa (κω). In total, 452 patients were included, of whom 52 were included for the test-retest study. Factor analysis for structural validity pointed in the direction of a seven-factor model (Cronbach α = 0.78). Factors and total score of the NIH-minimal dataset showed fair to good correlations with Pain Disability Index (r = 0.43-0.70) and EuroQol-5D (r = -0.41 to -0.64). Reliability: test-retest reliability per item showed substantial agreement (κω=0.65). Test-retest reliability per factor was moderate to good (Intraclass Correlation Coefficient = 0.71). The measurement properties of the Dutch language version of the NIH-minimal dataset were satisfactory.
The development and validation of a test of science critical thinking for fifth graders.
Mapeala, Ruslan; Siew, Nyet Moi
2015-01-01
The paper described the development and validation of the Test of Science Critical Thinking (TSCT) to measure the three critical thinking skill constructs: comparing and contrasting, sequencing, and identifying cause and effect. The initial TSCT consisted of 55 multiple choice test items, each of which required participants to select a correct response and a correct choice of critical thinking used for their response. Data were obtained from a purposive sampling of 30 fifth graders in a pilot study carried out in a primary school in Sabah, Malaysia. Students underwent the sessions of teaching and learning activities for 9 weeks using the Thinking Maps-aided Problem-Based Learning Module before they answered the TSCT test. Analyses were conducted to check on difficulty index (p) and discrimination index (d), internal consistency reliability, content validity, and face validity. Analysis of the test-retest reliability data was conducted separately for a group of fifth graders with similar ability. Findings of the pilot study showed that, out of the initial 55 administered items, only 30 items, with a relatively good difficulty index (p) ranging from 0.40 to 0.60 and a good discrimination index (d) ranging from 0.20 to 1.00, were selected. The Kuder-Richardson reliability value was found to be appropriate and relatively high, with 0.70, 0.73 and 0.92 for identifying cause and effect, sequencing, and comparing and contrasting respectively. The content validity index obtained from three expert judgments equalled or exceeded 0.95. In addition, test-retest reliability showed good, statistically significant correlations ([Formula: see text]). From the above results, the selected 30-item TSCT was found to have sufficient reliability and validity and would therefore represent a useful tool for measuring critical thinking ability among fifth graders in primary science.
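The item statistics mentioned above can be sketched for dichotomously scored (0/1) items as follows. This is an illustration under assumptions: the response matrix is hypothetical, the discrimination index uses a simple upper-half versus lower-half split (the study may have used a different grouping rule), and KR-20 is assumed as the Kuder-Richardson formula since the abstract does not say which one was applied.

```python
from statistics import pvariance
from typing import List, Sequence


def difficulty_index(responses: Sequence[Sequence[int]], item: int) -> float:
    """p: proportion of examinees answering the item correctly."""
    return sum(row[item] for row in responses) / len(responses)


def discrimination_index(responses: Sequence[Sequence[int]], item: int) -> float:
    """d: difficulty in the upper-scoring half minus the lower-scoring half."""
    ranked = sorted(responses, key=sum, reverse=True)
    half = len(ranked) // 2
    return difficulty_index(ranked[:half], item) - difficulty_index(ranked[-half:], item)


def kr20(responses: Sequence[Sequence[int]]) -> float:
    """KR-20 = k/(k-1) * (1 - sum(p*q) / variance of total scores)."""
    k = len(responses[0])
    p_values = [difficulty_index(responses, i) for i in range(k)]
    pq_sum = sum(p * (1.0 - p) for p in p_values)
    total_var = pvariance([sum(row) for row in responses])
    return k / (k - 1) * (1.0 - pq_sum / total_var)


# Hypothetical responses: 6 examinees (rows), 4 items (columns), 1 = correct.
responses: List[List[int]] = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
    [1, 1, 1, 1],
]
reliability = kr20(responses)
```

Items would then be retained when `p` and `d` fall inside chosen bands, such as the 0.40 to 0.60 and 0.20 to 1.00 ranges reported in the abstract.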
Skritskaya, Natalia A; Carson-Wong, Amanda R; Moeller, James R; Shen, Sa; Barsky, Arthur J; Fallon, Brian A
2012-07-01
Clinician-administered measures to assess severity of illness anxiety and response to treatment are few. The authors evaluated a modified version of the hypochondriasis-Y-BOCS (H-YBOCS-M), a 19-item, semistructured, clinician-administered instrument designed to rate severity of illness-related thoughts, behaviors, and avoidance. The scale was administered to 195 treatment-seeking adults with DSM-IV hypochondriasis. Test-retest reliability was assessed in a subsample of 20 patients. Interrater reliability was assessed by 27 interviews independently rated by four raters. Sensitivity to change was evaluated in a subsample of 149 patients. Convergent and discriminant validity was examined by comparing H-YBOCS-M scores to other measures administered. Item clustering was examined with confirmatory and exploratory factor analyses. The H-YBOCS-M demonstrated good internal consistency, interrater and test-retest reliability, and sensitivity to symptom change with treatment. Construct validity was supported by significantly higher correlations with scores on other measures of hypochondriasis than with nonhypochondriacal measures. Improvement over time in response to treatment correlated with improvement both on measures of hypochondriasis and on measures of somatization, depression, anxiety, and functional status. Confirmatory factor analysis did not show adequate fit for a three-factor model. Exploratory factor analysis revealed a five-factor solution with the first two factors consistent with the separation of the H-YBOCS-M items into the subscales of illness-related avoidance and compulsions. H-YBOCS-M appears to be valid, reliable, and appropriate as an outcome measure for treatment studies of illness anxiety. Study results highlight "avoidance" as a key feature of illness anxiety, with potentially important nosologic and treatment implications. © 2012 Wiley Periodicals, Inc.
Skritskaya, Natalia A.; Carson-Wong, Amanda R.; Moeller, James R.; Shen, Sa; Barsky, Arthur J.; Fallon, Brian A.
2012-01-01
Background Clinician-administered measures to assess severity of illness anxiety and response to treatment are few. The authors evaluated a modified version of the hypochondriasis-Y-BOCS (H-YBOCS-M), a 19-item, semistructured, clinician-administered instrument designed to rate severity of illness-related thoughts, behaviors, and avoidance. Methods The scale was administered to 195 treatment-seeking adults with DSM-IV hypochondriasis. Test–retest reliability was assessed in a subsample of 20 patients. Interrater reliability was assessed by 27 interviews independently rated by four raters. Sensitivity to change was evaluated in a subsample of 149 patients. Convergent and discriminant validity was examined by comparing H-YBOCS-M scores to other measures administered. Item clustering was examined with confirmatory and exploratory factor analyses. Results The H-YBOCS-M demonstrated good internal consistency, interrater and test–retest reliability, and sensitivity to symptom change with treatment. Construct validity was supported by significantly higher correlations with scores on other measures of hypochondriasis than with nonhypochondriacal measures. Improvement over time in response to treatment correlated with improvement both on measures of hypochondriasis and on measures of somatization, depression, anxiety, and functional status. Confirmatory factor analysis did not show adequate fit for a three-factor model. Exploratory factor analysis revealed a five-factor solution with the first two factors consistent with the separation of the H-YBOCS-M items into the subscales of illness-related avoidance and compulsions. Conclusions H-YBOCS-M appears to be valid, reliable, and appropriate as an outcome measure for treatment studies of illness anxiety. Study results highlight “avoidance” as a key feature of illness anxiety, with potentially important nosologic and treatment implications. PMID:22504935
Cross-cultural adaptation of VISA-P score for patellar tendinopathy in Turkish population.
Çelebi, Mehmet Mesut; Köse, Serdal Kenan; Akkaya, Zehra; Zergeroglu, Ali Murat
2016-01-01
The VISA-P questionnaire assesses the severity of symptoms and treatment effects in athletes with patellar tendinopathy. The purpose of this study was to translate the VISA-P questionnaire into Turkish and to determine its validity and reliability. The English version of the VISA-P questionnaire was translated into Turkish according to internationally recommended guidelines. Test-retest reliability was determined in 89 participants with a time interval of 24 h. To determine the validity of the Turkish VISA-P, 31 (17 male, 14 female) healthy students, 34 (20 male, 14 female) patients with patellar tendinopathy (diagnosed by physical examination and ultrasonography) and 24 (16 male, 8 female) volleyball players (an at-risk population) completed the VISA-P-Tr. Internal consistency was determined with Cronbach's alpha. Intraclass correlation coefficients (ICCs) were calculated to analyse test-retest reliability. To assess discrimination, VISA-P-Tr scores were compared among all groups using the Mann-Whitney U test. The VISA-P-Tr questionnaire showed good test-retest reliability (Cronbach's alpha was 0.79 and 0.78, respectively, and the ICC was 0.96). The VISA-P-Tr scores (mean ± SD) were 93.7 ± 8.9 and 94.0 ± 8.1 for healthy students, 81.1 ± 13.7 and 80.7 ± 13.4 for volleyball players, and 58.8 ± 12.1 and 58.5 ± 11.0 for athletes with patellar tendinopathy. The translated Turkish version of the VISA-P has good internal consistency and good reliability and validity. Therefore, the VISA-P-Tr is useful for evaluating symptoms and following the treatment effect in athletes with patellar tendinopathy.
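Cronbach's alpha, the internal consistency statistic reported in the record above and in several others in this listing, can be sketched in a few lines. This is a minimal illustration of the standard formula, not the authors' analysis code; the function name `cronbach_alpha` and the toy score matrix are assumptions made for the example.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items score matrix.

    scores: list of rows, one per respondent, each row a list of item scores.
    (Hypothetical helper; the data layout is an assumption for this sketch.)
    """
    k = len(scores[0])
    if k < 2:
        raise ValueError("need at least two items")
    # Sample variance of each item across respondents.
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    # Sample variance of each respondent's total score.
    total_var = variance([sum(row) for row in scores])
    # alpha = k/(k-1) * (1 - sum of item variances / variance of totals)
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Toy data: 3 respondents, 2 items (illustrative only).
toy = [[1, 2], [2, 1], [3, 3]]
alpha = cronbach_alpha(toy)
```

Alpha is undefined when every respondent has the same total score, because the variance of the totals is then zero.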
Vinco, L J; Giacomelli, S; Campana, L; Chiari, M; Vitale, N; Lombardi, G; Veldkamp, T; Hocking, P M
2018-02-01
1. An experiment was conducted to compare 5 different methods for the evaluation of litter moisture. 2. For litter collection and assessment, 55 farms were selected, one shed from each farm was inspected and 9 points were identified within each shed. 3. For each device used for the evaluation of litter moisture, the mean and standard deviation of wetness measures per collection point were assessed. 4. The reliability and overall consistency between the 5 instruments used to measure wetness were high (α = 0.72). 5. Measurement of three out of the 9 collection points was sufficient to provide a reliable assessment of litter moisture throughout the shed. 6. Based on the direct correlation between litter moisture and footpad lesions, litter moisture measurement can be used as a resource-based on-farm animal welfare indicator. 7. Among the 5 methods analysed, visual scoring is the most simple and practical, and therefore the best candidate to be used on-farm for animal welfare assessment.
Demoralization Scale in Spanish-Speaking Palliative Care Patients.
Rudilla, David; Galiana, Laura; Oliver, Amparo; Barreto, Pilar
2016-04-01
Among the approaches to the demoralization syndrome, the one proposed by Kissane et al. is prevalent in the literature. These authors developed the Demoralization Scale (DS) to assess emotional distress, conceived as demoralization. To present the Spanish adaptation of the Demoralization Scale in palliative care patients, with a new and more comprehensive approach to its factorial structure. A cross-sectional study was carried out in 226 Spanish palliative care patients in three different settings: hospital, home care unit, and continued care unit. Outcome measures included the DS and the Hospital Anxiety and Depression Scale. Analyses comprised confirmatory factor analyses to test the original, German, and Irish structure of the DS, exploratory structural equation modeling (ESEM), estimations of internal consistency, and multivariate analyses of variance for criterion-related validity. The confirmatory factor analyses showed inappropriate fit for the previous structures when studied in the Spanish version of the DS. With ESEM, the best fitting structure was the five-factor solution, without item 18. Reliability results offered good estimations of internal consistency for all the dimensions except for sense of failure. Cronbach alpha coefficients were appropriate for the dimensions of loss of meaning (0.86), helplessness (0.79), disheartenment (0.88), and dysphoria (0.80), but low reliability was found for sense of failure (0.62). Convergent and discriminant validity showed positive correlations between demoralization, anxiety, and depression. Patients with higher levels of anxiety had higher scores on every dimension of demoralization, and those with higher levels of depression had higher scores on loss of meaning, disheartenment, and sense of failure, but not on dysphoria or helplessness. The Spanish adaptation of the DS has shown appropriate psychometric properties. 
It has been useful to differentiate between depression and the demoralization syndrome, pointing to helplessness and dysphoria as unique characteristics of demoralized palliative care patients. Copyright © 2016. Published by Elsevier Inc.
Gude, J.A.; Mitchell, M.S.; Russell, R.E.; Sime, C.A.; Bangs, E.E.; Mech, L.D.; Ream, R.R.
2012-01-01
Reliable analyses can help wildlife managers make good decisions, which are particularly critical for controversial decisions such as wolf (Canis lupus) harvest. Creel and Rotella (2010) recently predicted substantial population declines in Montana wolf populations due to harvest, in contrast to predictions made by Montana Fish, Wildlife and Parks (MFWP). We replicated their analyses considering only those years in which field monitoring was consistent, and we considered the effect of annual variation in recruitment on wolf population growth. Rather than assuming constant rates, we used model selection methods to evaluate and incorporate models of factors driving recruitment and human-caused mortality rates in wolf populations in the Northern Rocky Mountains. Using data from 27 area-years of intensive wolf monitoring, we show that variation in both recruitment and human-caused mortality affect annual wolf population growth rates and that human-caused mortality rates have increased with the sizes of wolf populations. We document that recruitment rates have decreased over time, and we speculate that rates have decreased with increasing population sizes and/or that the ability of current field resources to document recruitment rates has recently become less successful as the number of wolves in the region has increased. Estimates of positive wolf population growth in Montana from our top models are consistent with field observations and estimates previously made by MFWP for 2008-2010, whereas the predictions for declining wolf populations of Creel and Rotella (2010) are not. Familiarity with limitations of raw data, obtained first-hand or through consultation with scientists who collected the data, helps generate more reliable inferences and conclusions in analyses of publicly available datasets. 
Additionally, development of efficient monitoring methods for wolves is a pressing need, so that analyses such as ours will be possible in future years when fewer resources will be available for monitoring. © 2011 The Wildlife Society.
Validation of the Spanish version of the Index of Spouse Abuse.
Plazaola-Castaño, Juncal; Ruiz-Pérez, Isabel; Escribà-Agüir, Vicenta; Jiménez-Martín, Juan Manuel; Hernández-Torres, Elisa
2009-04-01
Partner violence against women is a major public health problem. Although there are currently a number of validated screening and diagnostic tools that can be used to evaluate this type of violence, such tools are not available in Spain. The aim of this study is to analyze the validity and reliability of the Spanish version of the Index of Spouse Abuse (ISA). A cross-sectional study was carried out in 2005 in two health centers in Granada, Spain, in 390 women between 18 and 70 years old. Analyses of the factorial structure, internal consistency, test-retest reliability, and construct validity were conducted. Cutoff points for each subscale were also defined. For the construct validity analysis, the SF-36 perceived general health dimension, the Rosenberg Self-Esteem Scale and the Goldberg 12-item General Health Questionnaire were included. The psychometric analysis shows that the instrument has good internal consistency, reproducibility, and construct validity. The scale is useful for the analysis of partner violence against women in both a research setting and a healthcare setting.
Gironés Muriel, Alberto; Campos Segovia, Ana; Ríos Gómez, Patricia
2018-01-01
The study of mediating variables and psychological responses to child surgery involves the evaluation of both the patient and the parents as regards different stressors. To have a reliable, reproducible and valid evaluation tool that assesses the level of paternal involvement in relation to different stressors in the setting of surgery. A self-report questionnaire study was completed by 123 subjects of both sexes, subdivided into 2 populations according to their relationship with the hospital setting. The items were determined by a group of experts and analysed using the Lawshe validity index to determine a first validity of content. Subsequently, the reliability of the tool was determined by an item-re-item analysis of the 2 sub-populations. A factorial analysis was performed to analyse the construct validity, using maximum likelihood with varimax rotation of the factors. A questionnaire of paternal concern was offered, consisting of 21 items with a Cronbach coefficient of 0.97, giving good precision and stability. The subsequent factor analysis gives an adequate validity to the questionnaire, with the determination of 10 common stressors that cover 74.08% of the common and non-common variance of the questionnaire. The proposed questionnaire is reliable, valid and easy to apply, and is developed to assess the level of paternal concern about the surgery of a child and to be able to apply measures and programs through the prior assessment of these elements. Copyright © 2016 Asociación Española de Pediatría. Publicado por Elsevier España, S.L.U. All rights reserved.
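The Lawshe index mentioned in the record above is conventionally computed per item as a content validity ratio, CVR = (n_e - N/2) / (N/2), where n_e is the number of experts rating the item "essential" and N is the panel size. The sketch below assumes that convention; the abstract does not report the panel size or the cutoff used, so the numbers are purely illustrative and the function name is hypothetical.

```python
def content_validity_ratio(n_essential, n_experts):
    """Lawshe's content validity ratio for one item.

    n_essential: experts who rated the item 'essential'.
    n_experts:   total experts on the panel.
    Returns a value in [-1, 1]; 0 means exactly half rated it essential.
    """
    if n_experts <= 0 or not 0 <= n_essential <= n_experts:
        raise ValueError("need 0 <= n_essential <= n_experts and n_experts > 0")
    half = n_experts / 2
    return (n_essential - half) / half


# Illustrative panel of 10 experts (an assumption, not from the study).
cvr_item = content_validity_ratio(8, 10)
```

An item is usually retained only if its CVR exceeds a critical value that depends on the panel size.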
Measuring teamwork and conflict among Emergency Medical Technician personnel
Patterson, P. Daniel; Weaver, Matthew D.; Weaver, Sallie J.; Rosen, Michael A.; Todorova, Gergana; Weingart, Laurie R.; Krackhardt, David; Lave, Judith R.; Arnold, Robert M.; Yealy, Donald M.; Salas, Eduardo
2011-01-01
Objective We sought to develop a reliable and valid tool for measuring teamwork among Emergency Medical Technician (EMT) partnerships. Methods We adapted existing scales and developed new items to measure components of teamwork. After recruiting a convenience sample of 39 agencies, we tested a 122-item draft survey tool. We performed a series of Exploratory Factor Analyses (EFA) and Confirmatory Factor Analysis (CFA) to test reliability and construct validity, describing variation in domain and global scores using descriptive statistics. Results We received 687 completed surveys. The EFA analyses identified a 9-factor solution. We labeled these factors [1] Team Orientation, [2] Team Structure & Leadership, [3] Partner Communication, Team Support, & Monitoring, [4] Partner Trust and Shared Mental Models, [5] Partner Adaptability & Back-Up Behavior, [6] Process Conflict, [7] Strong Task Conflict, [8] Mild Task Conflict, and [9] Interpersonal Conflict. We tested a short form (30-item SF) and long form (45-item LF) version. The CFA analyses determined that both the SF and LF versions possess positive psychometric properties of reliability and construct validity. The EMT-TEAMWORK-SF has positive internal consistency properties with a mean Cronbach’s alpha coefficient ≥0.70 across all 9-factors (mean=0.84; min=0.78, max=0.94). The mean Cronbach’s alpha coefficient for the EMT-TEAMWORK-LF version was 0.87 (min=0.79, max=0.94). There was wide variation in weighted scores across all 9 factors and the global score for the SF and LF versions. Mean scores were lowest for the Team Orientation factor (48.1, SD 21.5 SF; 49.3 SD 19.8 LF) and highest (more positive) for the Interpersonal Conflict factor (87.7 SD 18.1 for both SF and LF). Conclusions We developed a reliable and valid survey to evaluate teamwork between EMT partners. PMID:22128909
Reliable and fast volumetry of the lumbar spinal cord using cord image analyser (Cordial).
Tsagkas, Charidimos; Altermatt, Anna; Bonati, Ulrike; Pezold, Simon; Reinhard, Julia; Amann, Michael; Cattin, Philippe; Wuerfel, Jens; Fischer, Dirk; Parmar, Katrin; Fischmann, Arne
2018-04-30
To validate the precision and accuracy of the semi-automated cord image analyser (Cordial) for lumbar spinal cord (SC) volumetry in 3D T1w MRI data of healthy controls (HC). 40 3D T1w images of 10 HC (w/m: 6/4; age range: 18-41 years) were acquired at one 3T-scanner in two MRI sessions (time interval 14.9±6.1 days). Each subject was scanned twice per session, allowing determination of test-retest reliability both in back-to-back (intra-session) and scan-rescan images (inter-session). Cordial was applied for lumbar cord segmentation twice per image by two raters, allowing for assessment of intra- and inter-rater reliability, and compared to a manual gold standard. While manually segmented volumes were larger (mean: 2028±245 mm³ vs. Cordial: 1636±300 mm³, p<0.001), accuracy assessments between manually and semi-automatically segmented images showed a mean Dice-coefficient of 0.88±0.05. Calculation of within-subject coefficients of variation (COV) demonstrated high intra-session (1.22-1.86%), inter-session (1.26-1.84%), as well as intra-rater (1.73-1.83%) reproducibility. No significant difference was shown between intra- and inter-session reproducibility or between intra-rater reliabilities. Although inter-rater reproducibility (COV: 2.87%) was slightly lower compared to all other reproducibility measures, between-rater consistency was very strong (intraclass correlation coefficient: 0.974). While under-estimating the lumbar SCV, Cordial still provides excellent inter- and intra-session reproducibility showing high potential for application in longitudinal trials. • Lumbar spinal cord segmentation using the semi-automated cord image analyser (Cordial) is feasible. • Lumbar spinal cord is 40-mm cord segment 60 mm above conus medullaris. • Cordial provides excellent inter- and intra-session reproducibility in lumbar spinal cord region. • Cordial shows high potential for application in longitudinal trials.
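The two agreement measures in the record above, the Dice coefficient and the within-subject coefficient of variation, can be illustrated as follows. This is a sketch under stated assumptions: segmentations are represented as sets of voxel coordinates, and the COV is taken as the mean of each subject's SD/mean ratio. The abstract does not say which COV variant was used, and both function names are hypothetical.

```python
from statistics import mean, stdev


def dice_coefficient(mask_a, mask_b):
    """Dice overlap 2|A∩B| / (|A| + |B|) of two voxel sets."""
    a, b = set(mask_a), set(mask_b)
    if not a and not b:
        return 1.0  # two empty segmentations agree trivially
    return 2 * len(a & b) / (len(a) + len(b))


def within_subject_cov(measurements):
    """Mean within-subject coefficient of variation, in percent.

    measurements: one list of repeated values per subject.
    Assumed definition: average over subjects of SD / mean.
    """
    return 100 * mean(stdev(vals) / mean(vals) for vals in measurements)


# Toy voxel sets sharing one of two voxels each (illustrative only).
overlap = dice_coefficient({(0, 0), (0, 1)}, {(0, 1), (0, 2)})
# Toy repeated volumes for a single subject (illustrative only).
cov_percent = within_subject_cov([[90.0, 110.0]])
```

A Dice value of 1 means the two segmentations coincide exactly. A high Dice coefficient is compatible with a systematic volume difference, as the record above shows.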
Jette, Alan M.; McDonough, Christine M.; Haley, Stephen M.; Ni, Pengsheng; Olarsch, Sippy; Latham, Nancy; Hambleton, Ronald K.; Felson, David; Kim, Young-jo; Hunter, David
2012-01-01
Objective To develop and evaluate a prototype measure (OA-DISABILITY-CAT) for osteoarthritis research using Item Response Theory (IRT) and Computer Adaptive Test (CAT) methodologies. Study Design and Setting We constructed an item bank consisting of 33 activities commonly affected by lower extremity (LE) osteoarthritis. A sample of 323 adults with LE osteoarthritis reported their degree of limitation in performing everyday activities and completed the Health Assessment Questionnaire-II (HAQ-II). We used confirmatory factor analyses to assess scale unidimensionality and IRT methods to calibrate the items and examine the fit of the data. Using CAT simulation analyses, we examined the performance of OA-DISABILITY-CATs of different lengths compared to the full item bank and the HAQ-II. Results One distinct disability domain was identified. The 10-item OA-DISABILITY-CAT demonstrated a high degree of accuracy compared with the full item bank (r=0.99). The item bank and the HAQ-II scales covered a similar estimated scoring range. In terms of reliability, 95% of OA-DISABILITY reliability estimates were over 0.83 versus 0.60 for the HAQ-II. Except at the highest scores the 10-item OA-DISABILITY-CAT demonstrated superior precision to the HAQ-II. Conclusion The prototype OA-DISABILITY-CAT demonstrated promising measurement properties compared to the HAQ-II, and is recommended for use in LE osteoarthritis research. PMID:19216052
Counting pollen grains using readily available, free image processing and analysis software.
Costa, Clayton M; Yang, Suann
2009-10-01
Although many methods exist for quantifying the number of pollen grains in a sample, there are few standard methods that are user-friendly, inexpensive and reliable. The present contribution describes a new method of counting pollen using readily available, free image processing and analysis software. Pollen was collected from anthers of two species, Carduus acanthoides and C. nutans (Asteraceae), then illuminated on slides and digitally photographed through a stereomicroscope. Using ImageJ (NIH), these digital images were processed to remove noise and sharpen individual pollen grains, then analysed to obtain a reliable total count of the number of grains present in the image. A macro was developed to analyse multiple images together. To assess the accuracy and consistency of pollen counting by ImageJ analysis, counts were compared with those made by the human eye. Image analysis produced pollen counts in 60 s or less per image, considerably faster than counting with the human eye (5-68 min). In addition, counts produced with the ImageJ procedure were similar to those obtained by eye. Because count parameters are adjustable, this image analysis protocol may be used for many other plant species. Thus, the method provides a quick, inexpensive and reliable solution to counting pollen from digital images, not only reducing the chance of error but also substantially lowering labour requirements.
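The threshold-then-count idea behind the image-based pollen counts in the record above can be sketched without any imaging library. This is not ImageJ's particle analysis and not the authors' macro. It is a minimal connected-component count on a small grid, and it assumes that grains appear brighter than the background. The function name, threshold and `min_size` parameter are hypothetical.

```python
def count_blobs(image, threshold, min_size=1):
    """Count 4-connected groups of pixels with value >= threshold.

    image:     list of rows of grayscale values.
    threshold: pixels at or above this value count as foreground.
    min_size:  groups smaller than this are ignored as noise.
    """
    rows, cols = len(image), len(image[0])
    seen = [[False] * cols for _ in range(rows)]
    count = 0
    for r in range(rows):
        for c in range(cols):
            if seen[r][c] or image[r][c] < threshold:
                continue
            # Flood fill from this unvisited foreground pixel.
            stack = [(r, c)]
            seen[r][c] = True
            size = 0
            while stack:
                y, x = stack.pop()
                size += 1
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and not seen[ny][nx]
                            and image[ny][nx] >= threshold):
                        seen[ny][nx] = True
                        stack.append((ny, nx))
            if size >= min_size:
                count += 1
    return count


# Toy 4x5 image with three bright groups (illustrative only).
toy_image = [
    [0, 9, 9, 0, 0],
    [0, 9, 0, 0, 8],
    [0, 0, 0, 0, 8],
    [7, 0, 0, 0, 0],
]
n_grains = count_blobs(toy_image, threshold=5)
```

Touching grains merge into one group under this scheme, so the count is reliable only when grains are well separated or when a separation step such as a watershed is added.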
Kujawa, Autumn; Carroll, Ashley; Mumper, Emma; Mukherjee, Dahlia; Kessel, Ellen M; Olino, Thomas; Hajcak, Greg; Klein, Daniel N
2017-11-04
Brain regions involved in reward processing undergo developmental changes from childhood to adolescence, and alterations in reward-related brain function are thought to contribute to the development of psychopathology. Event-related potentials (ERPs), such as the reward positivity (RewP) component, are valid measures of reward responsiveness that are easily assessed across development and provide insight into temporal dynamics of reward processing. Little work has systematically examined developmental changes in ERPs sensitive to reward. In this longitudinal study of 75 youth assessed 3 times across 6 years, we used principal components analyses (PCA) to differentiate ERPs sensitive to monetary reward and loss feedback in late childhood, early adolescence, and middle adolescence. We then tested reliability of, and developmental changes in, ERPs. A greater number of ERP components differentiated reward and loss feedback in late childhood compared to adolescence, but components in childhood accounted for only a small proportion of variance. A component consistent with RewP was the only one to consistently emerge at each of the 3 assessments. RewP demonstrated acceptable reliability, particularly from early to middle adolescence, though reliability estimates varied depending on scoring approach and developmental period. The magnitude of the RewP component did not significantly change across time. Results provide insight into developmental changes in the structure of ERPs sensitive to reward, and indicate that RewP is a consistently observed and relatively stable measure of reward responsiveness, particularly across adolescence. Copyright © 2017. Published by Elsevier B.V.
Psychometric Properties of “Community Assessment of Psychic Experiences”: Review and Meta-analyses
Mark, Winifred; Toulopoulou, Timothea
2016-01-01
The Community Assessment of Psychic Experiences (CAPE) has been used extensively as a measurement for psychosis proneness in clinical and research settings. However, no prior review and meta-analysis have comprehensively examined psychometric properties (reliability and validity) of CAPE scores across different studies. To study CAPE’s internal reliability—ie, how well scale items correlate with one another—111 studies were reviewed. Of these, 18 reported unique internal reliability coefficients using data at hand, which were aggregated in a meta-analysis. Furthermore, to confirm the number and nature of factors tapped by CAPE, 17 factor analytic studies were reviewed and subjected to meta-analysis in cases of discrepancy. Results suggested that CAPE scores were psychometrically reliable—ie, scores obtained could be attributed to true score variance. Our review of factor analytic studies supported a 3-factor model for CAPE consisting of “Positive”, “Negative”, and “Depressive” subscales; and a tripartite structure for the Negative dimension consisting of “Social withdrawal”, “Affective flattening”, and “Avolition” subdimensions. Meta-analysis of factor analytic studies of the Positive dimension revealed a tridimensional structure consisting of “Bizarre experiences”, “Delusional ideations”, and “Perceptual anomalies”. Information on reliability and validity of CAPE scores is important for ensuring accurate measurement of the psychosis proneness phenotype, which in turn facilitates early detection and intervention for psychotic disorders. Apart from enhancing the understanding of psychometric properties of CAPE scores, our review revealed questionable reporting practices possibly reflecting insufficient understanding regarding the significance of psychometric properties. We recommend increased focus on psychometrics in psychology programmes and clinical journals. PMID:26150674
The development and testing of a qualitative instrument designed to assess critical thinking
NASA Astrophysics Data System (ADS)
Clauson, Cynthia Louisa
This study examined a qualitative approach to assess critical thinking. An instrument was developed that incorporates an assessment process based on Dewey's (1933) concepts of self-reflection and critical thinking as problem solving. The study was designed to pilot test the critical thinking assessment process with writing samples collected from a heterogeneous group of students. The pilot test included two phases. Phase 1 was designed to determine the validity and inter-rater reliability of the instrument using two experts in critical thinking, problem solving, and literacy development. Validity of the instrument was addressed by requesting both experts to respond to ten questions in an interview. The inter-rater reliability was assessed by analyzing the consistency of the two experts' scorings of the 20 writing samples to each other, as well as to my scoring of the same 20 writing samples. Statistical analyses included the Spearman Rho and the Kuder-Richardson (Formula 20). Phase 2 was designed to determine the validity and reliability of the critical thinking assessment process with seven science teachers. Validity was addressed by requesting the teachers to respond to ten questions in a survey and interview. Inter-rater reliability was addressed by comparing the seven teachers' scoring of five writing samples with my scoring of the same five writing samples. Again, the Spearman Rho and the Kuder-Richardson (Formula 20) were used to determine the inter-rater reliability. The validity results suggest that the instrument is helpful as a guide for instruction and provides a systematic method to teach and assess critical thinking while problem solving with students in the classroom. The reliability results show the critical thinking assessment instrument to possess fairly high reliability when used by the experts, but weak reliability when used by classroom teachers. 
A major conclusion was drawn that teachers, as well as students, would need to receive instruction in critical thinking and in how to use the assessment process in order to gain more consistent interpretations of the six problem-solving steps. Specific changes needing to be made in the instrument to improve the quality are included.
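Spearman's rho, one of the two statistics used above to compare raters' scorings, is the Pearson correlation of the ranks. The sketch below assumes tied scores receive their average rank, which is the usual convention. The rater data are invented for illustration and the function names are hypothetical.

```python
from math import sqrt
from statistics import mean


def average_ranks(values):
    """Rank values from 1..n, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman_rho(x, y):
    """Spearman rank correlation between two raters' scores."""
    return pearson(average_ranks(x), average_ranks(y))


# Invented scores from two raters on four writing samples.
rho = spearman_rho([1, 2, 3, 4], [10, 20, 30, 40])
```

Rho measures agreement in rank order only. Two raters can correlate perfectly while one scores consistently higher than the other.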
Validating the Why/How Contrast for Functional MRI Studies of Theory of Mind
Spunt, Robert P.; Adolphs, Ralph
2014-01-01
The ability to impute mental states to others, or Theory of Mind (ToM), has been the subject of hundreds of neuroimaging studies. Although reviews and meta-analyses of these studies have concluded that ToM recruits a coherent brain network, mounting evidence suggests that this network is an abstraction based on pooling data from numerous studies, most of which use different behavioral tasks to investigate ToM. Problematically, this means that no single behavioral task can be used to reliably measure ToM Network function as currently conceived. To make ToM Network function scientifically tractable, we need standardized tasks capable of reliably measuring specific aspects of its functioning. Here, our goal is to validate the Why/How Task for this purpose. Several prior studies have found that when compared to answering how-questions about another person's behavior, answering why-questions about that same behavior activates a network that is anatomically consistent with meta-analytic definitions of the ToM Network. In the version of the Why/How Task presented here, participants answer yes/no Why (e.g., Is the person helping someone?) and How (e.g., Is the person lifting something?) questions about pretested photographs of naturalistic human behaviors. Across three fMRI studies, we show that the task elicits reliable performance measurements and modulates a left-lateralized network that is consistently localized across studies. While this network is convergent with meta-analyses of ToM studies, it is largely distinct from the network identified by the widely used False-Belief Localizer, the most common ToM task. Our new task is publicly available, and can be used as an efficient functional localizer to provide reliable identification of single-subject responses in most regions of the network. 
Our results validate the Why/How Task, both as a standardized protocol capable of producing maximally comparable data across studies, and as a flexible foundation for programmatic research on the neurobiological foundations of a basic manifestation of human ToM. PMID:24844746
Riquelme, Arnoldo; Herrera, Cristian; Aranis, Carolina; Oporto, Jorge; Padilla, Oslando
2009-06-01
The Spanish version of the Postgraduate Hospital Educational Environment Measure (PHEEM) was evaluated in this study to determine its psychometric properties, validity and internal consistency to measure the clinical learning environment in the hospital setting of Pontificia Universidad Católica de Chile Medical School's Internship. The 40-item PHEEM questionnaire was translated from English to Spanish and retranslated to English. Content validity was tested by a focus group and minor differences in meaning were adjusted. The PHEEM was administered to clerks in years 6 and 7. Construct validity was assessed using exploratory factor analysis followed by a Varimax rotation. Internal consistency was measured using Cronbach's alpha. A total of 125 out of 220 students responded to the PHEEM. The overall response rate was 56.8% and compliance with each item ranged from 99.2% to 100%. Analyses indicate a five-factor instrument accounting for 58% of the variance; the internal consistency of the 40-item questionnaire was 0.955 (Cronbach's alpha). The 40-item questionnaire had a mean score of 98.21 ± 21.2 (maximum score of 160). The Spanish version of PHEEM is a multidimensional, valid and highly reliable instrument measuring the educational environment among undergraduate medical students working in hospital-based clerkships.
Development of the "Treatment beliefs in knee and hip OsteoArthritis (TOA)" questionnaire.
Selten, Ellen M H; Vriezekolk, Johanna E; Schers, Henk J; Nijhof, Marc W; van der Laan, Willemijn H; van der Meulen-Dilling, Roelien G; Geenen, Rinie; van den Ende, Cornelia H M
2017-09-19
Use of conservative treatment modalities in osteoarthritis (OA) is suboptimal, which appears to be partly due to patients' beliefs about treatments. The aim of this study was to develop a research instrument assessing patients' beliefs about various treatment modalities of hip and knee OA: the 'Treatment beliefs in OA (TOA) questionnaire'. The item pool that was retrieved from interviews with patients and healthcare providers comprised beliefs regarding five treatment modalities: physical activity, pain medication, physiotherapy, injections and arthroplasty. After an extensive selection procedure, a draft questionnaire with 200 items was constructed. Descriptive analyses and exploratory factor analyses with oblique rotation were conducted for each treatment modality separately to decide upon the final questionnaire. Internal consistency and test-retest reliability were determined. The final questionnaire comprised 60 items. It was completed by 351 patients with knee or hip OA. Each of the five treatment modalities yielded a two factor solution with 37% to 51% explained variance and high face validity. Factor I included 'positive treatment beliefs' and factor II 'negative treatment beliefs'. Internal consistency (Cronbach α's from 0.72 to 0.87) and test-retest reliability (i.e. intraclass correlation coefficient from 0.66-0.88; standard error of measurement from 0.06-0.11) were satisfactory to good. The TOA questionnaire is the first questionnaire assessing positive and negative treatment beliefs regarding five treatment modalities for knee and hip OA. The instrument will help to understand whether and to what extent treatment beliefs influence treatment choices.
Arheart, Kristopher L; Sly, David F; Trapido, Edward J; Rodriguez, Richard D; Ellestad, Amy J
2004-11-01
To identify multi-item attitude/belief scales associated with the theoretical foundations of an anti-tobacco counter-marketing campaign and assess their reliability and validity. The data analyzed are from two state-wide, random, cross-sectional telephone surveys [n(S1)=1,079, n(S2)=1,150]. Items forming attitude/belief scales are identified using factor analysis. Reliability is assessed with Cronbach's alpha. Relationships among scales are explored using Pearson correlation. Validity is assessed by testing associations derived from the Centers for Disease Control and Prevention's (CDC) logic model for tobacco control program development and evaluation linking media exposure to attitudes/beliefs, and attitudes/beliefs to smoking-related behaviors. Adjusted odds ratios are employed for these analyses. Three factors emerged: traditional attitudes/beliefs about tobacco and tobacco use, tobacco industry manipulation and anti-tobacco empowerment. Reliability coefficients are in the range of 0.70 and vary little between age groups. The factors are correlated with one another as hypothesized. Associations between media exposure and the attitude/belief scales and between these scales and behaviors are consistent with the CDC logic model. Using reliable, valid multi-item scales is theoretically and methodologically more sound than employing single-item measures of attitudes/beliefs. Methodological, theoretical and practical implications are discussed.
Meylan, Grégoire; Reck, Barbara K; Rechberger, Helmut; Graedel, Thomas E; Schwab, Oliver
2017-10-17
Decision-makers traditionally expect "hard facts" from scientific inquiry, an expectation that the results of material flow analyses (MFAs) can hardly meet. MFA limitations are attributable to incompleteness of flowcharts, limited data quality, and model assumptions. Moreover, MFA results are, for the most part, based less on empirical observation than on social knowledge construction processes. Developing, applying, and improving the means of evaluating and communicating the reliability of MFA results is imperative. We apply two recently proposed approaches for making quantitative statements on MFA reliability to national minor metals systems: rhenium, gallium, and germanium in the United States in 2012. We discuss the reliability of results in policy and management contexts. The first approach consists of assessing data quality based on systematic characterization of MFA data and the associated meta-information and quantifying the "information content" of MFAs. The second is a quantification of data inconsistencies indicated by the "degree of data reconciliation" between the data and the model. A high information content and a low degree of reconciliation indicate reliable or certain MFA results. This article contributes to reliability and uncertainty discourses in MFA, exemplifying the usefulness of the approaches in policy and management, and to raw material supply discussions by providing country-level information on three important minor metals often considered critical.
Reliability of tristimulus colourimetry in the assessment of cutaneous bruise colour.
Scafide, Katherine N; Sheridan, Daniel J; Taylor, Laura A; Hayat, Matthew J
2016-06-01
Bruising is one of the most common types of injury clinicians observe among victims of violence and other trauma patients. However, research has shown that the commonly used qualitative description of cutaneous bruise colour via the naked eye is subjective and unreliable. No published work has formally evaluated the reliability of tristimulus colourimetry as an alternative for assessing bruise colour, despite its clinical and research applications in accurately assessing skin colour. The purpose of this study was to systematically evaluate the test-retest and inter-observer reliability of tristimulus colourimetry in the assessment of cutaneous bruise colour. Two researchers obtained repeated tristimulus colourimetry measures of cutaneous bruises with participants of diverse skin colour. Measures were obtained using the Minolta CR-400 Chroma Meter. Commission Internationale d'Eclairage (CIE) L*a*b* colour space was used. Data were analysed using intraclass correlation coefficients (ICC), Cronbach's alpha, and minimal detectable change (MDC) on all three L*a*b* values. The colorimeter demonstrated excellent test-retest or intra-rater reliability (L* ICC=0.999; a* ICC=0.973; b* ICC=0.892) and inter-rater reliability (L* ICC=0.997; a* ICC=0.976; b* ICC=0.982). With consistent placement, tristimulus colourimetry is reliable for the objective assessment and documentation of cutaneous bruise colour for purposes of clinical practice and research. Recommendations for use in practice/research are provided. Copyright © 2016 Elsevier Ltd. All rights reserved.
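The abstract above names minimal detectable change (MDC) alongside ICC but does not say how MDC was derived. One commonly used formulation, shown here only as an illustration, takes the standard error of measurement as SEM = SD * sqrt(1 - ICC) and MDC95 = 1.96 * sqrt(2) * SEM. The standard deviation below is a hypothetical value; only the ICC of 0.973 comes from the abstract, and the authors may have used a different formula.

```python
import math


def sem_from_icc(sd, icc):
    """Standard error of measurement from a sample SD and an ICC."""
    return sd * math.sqrt(1.0 - icc)


def mdc95(sd, icc):
    """Minimal detectable change at the 95% confidence level."""
    return 1.96 * math.sqrt(2.0) * sem_from_icc(sd, icc)


# Hypothetical SD of a* readings; ICC = 0.973 is the a* test-retest value reported above
hypothetical_sd = 5.0
print(round(mdc95(hypothetical_sd, 0.973), 2))
```

Under this formulation a higher ICC shrinks the SEM, so smaller colour changes can be distinguished from measurement noise.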
[Turkish validity and reliability study of fear of pain questionnaire-III].
Ünver, Seher; Turan, Fatma Nesrin
2018-01-01
This study aimed to develop a Turkish version of the Fear of Pain Questionnaire-III developed by McNeil and Rainwater (1998) and examine its validity and reliability indicators. The study was conducted with 459 university students studying in the nursing department. The Turkish translation of the scale was conducted by language experts and the original scale owner. Expert opinions were obtained for language validity, and Lawshe's content validity ratio formula was used to calculate the content validity. Exploratory factor analysis was used to assess the construct validity. The factors were rotated using the Varimax rotation (orthogonal) method. For reliability indicators of the questionnaire, the internal consistency coefficient and test-retest reliability were utilized. Exploratory factor analysis using the three-factor model (explaining 50.5% of the total variance) revealed that the item factor loadings were above the limit value of 0.30, which indicated that the questionnaire had good construct validity. The Cronbach's alpha value for the total questionnaire was 0.938, and the test-retest value was 0.846 for the total scale. The Turkish version of the Fear of Pain Questionnaire-III had sufficiently high reliability and validity to be used as a tool in evaluating the fear of pain among the young Turkish population.
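Lawshe's content validity ratio, mentioned in the abstract above, is usually written as CVR = (n_e - N/2) / (N/2), where n_e is the number of experts rating an item "essential" and N is the panel size. The sketch below is a minimal illustration; the panel sizes in the example are hypothetical, since the abstract does not report them.

```python
def content_validity_ratio(n_essential, n_experts):
    """Lawshe's CVR for one item: ranges from -1 (no expert) to +1 (every expert)."""
    if n_experts <= 0 or not 0 <= n_essential <= n_experts:
        raise ValueError("n_essential must lie between 0 and n_experts")
    half = n_experts / 2
    return (n_essential - half) / half


# Hypothetical panel of 10 experts, 8 of whom rate the item as essential
print(content_validity_ratio(8, 10))
```

A CVR of 0 means exactly half the panel judged the item essential; items are typically retained only above a critical value that depends on panel size.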
The Validity and reliability of the Comprehensive Home Environment Survey (CHES).
Pinard, Courtney A; Yaroch, Amy L; Hart, Michael H; Serrano, Elena L; McFerren, Mary M; Estabrooks, Paul A
2014-01-01
Few comprehensive measures exist to assess contributors to childhood obesity within the home, specifically among low-income populations. The current study describes the modification and psychometric testing of the Comprehensive Home Environment Survey (CHES), an inclusive measure of the home food, physical activity, and media environment related to childhood obesity. The items were tested for content relevance by an expert panel and piloted in the priority population. The CHES was administered to low-income parents of children 5 to 17 years (N = 150), including a subsample of parents a second time and additional caregivers to establish test-retest and interrater reliabilities. Children older than 9 years (n = 95), as well as parents (N = 150) completed concurrent assessments of diet and physical activity behaviors (predictive validity). Analyses and item trimming resulted in 18 subscales and a total score, which displayed adequate internal consistency (α = .74-.92) and high test-retest reliability (r ≥ .73, ps < .01) and interrater reliability (r ≥ .42, ps < .01). The CHES score and a validated screener for the home environment were correlated (r = .37, p < .01; concurrent validity). CHES subscales were significantly correlated with behavioral measures (r = -.20-.55, p < .05; predictive validity). The CHES shows promise as a valid/reliable assessment of the home environment related to childhood obesity, including healthy diet and physical activity.
Dysphagia in Multiple Sclerosis: Evaluation and Validation of the DYMUS Questionnaire.
Alali, Dalal; Ballard, Kirrie; Vucic, Steve; Bogaardt, Hans
2018-06-01
The 10-item Dysphagia in Multiple Sclerosis (DYMUS) questionnaire is a self-administered tool used to identify swallowing problems in adults with MS. The questionnaire was not validated against other existing questionnaires to assess its convergent validity. Moreover, its test-retest reliability was not measured previously. Therefore, the purpose of this study was to assess the factor analysis, internal consistency and test-retest reliability of the DYMUS, as well as its convergent validity against an established and validated questionnaire, the EAT-10. English-speaking adults with MS in New South Wales, Australia who were seen for routine medical check-ups were invited to complete two questionnaires across two phases. One hundred participants completed phase 1, while 55 completed phase 2. Statistical analyses were performed to investigate the psychometric properties of the DYMUS questionnaire. Internal consistency (Cronbach's Alpha) reduced the DYMUS questionnaire from ten to five items. The shortened version of the DYMUS showed high internal consistency (alpha = 0.904). It also showed satisfactory reproducibility, and adequate correlation with the 10-item Eating Assessment Tool (EAT-10). Evaluation of the DYMUS resulted in a shortened version of the questionnaire with five questions related to dysphagia. This shortened version is considered an easy and useful tool in identifying patients with MS-related dysphagia.
Phenotypic regional fMRI activation patterns during memory encoding in MCI and AD
Browndyke, Jeffrey N.; Giovanello, Kelly; Petrella, Jeffrey; Hayden, Kathleen; Chiba-Falek, Ornit; Tucker, Karen A.; Burke, James R.; Welsh-Bohmer, Kathleen A.
2014-01-01
Background Reliable blood-oxygen-level-dependent (BOLD) fMRI phenotypic biomarkers of Alzheimer's disease (AD) or mild cognitive impairment (MCI) are likely to emerge only from a systematic, quantitative, and aggregate examination of the functional neuroimaging research literature. Methods A series of random-effects, activation likelihood estimation (ALE) meta-analyses were conducted on studies of episodic memory encoding operations in AD and MCI samples relative to normal controls. ALE analyses were based upon a thorough literature search for all task-based functional neuroimaging studies in AD and MCI published up to January 2010. Analyses covered 16 fMRI studies, which yielded 144 distinct foci for ALE meta-analysis. Results ALE results indicated several regional task-based BOLD consistencies in MCI and AD patients relative to normal controls across the aggregate BOLD functional neuroimaging research literature. Patients with AD and those at significant risk (MCI) showed statistically significant consistent activation differences during episodic memory encoding in the medial temporal lobe (MTL), specifically parahippocampal gyrus, as well as superior frontal gyrus, precuneus, and cuneus, relative to normal controls. Conclusions ALE consistencies broadly support the presence of frontal compensatory activity, MTL activity alteration, and posterior midline “default mode” hyperactivation during episodic memory encoding attempts in the diseased or prospective pre-disease condition. Taken together, these robust commonalities may form the foundation for a task-based fMRI phenotype of memory encoding in AD. PMID:22841497
The displaced aggression questionnaire.
Denson, Thomas F; Pedersen, William C; Miller, Norman
2006-06-01
Previous measures of aggressive personality have focused on direct aggression (i.e., retaliation toward the provoking agent). An original self-report measure of trait displaced aggression is presented. Exploratory and confirmatory factor analyses provided support for a 3-factor conceptualization of the construct. These analyses identified an affective dimension (angry rumination), a cognitive dimension (revenge planning), and a behavioral dimension (general tendency to engage in displaced aggression). The trait measure demonstrated good internal consistency and test-retest reliability as well as convergent and discriminant construct validity. Unlike other related personality measures, trait displaced aggression significantly predicted indirect indicators of real-world displaced aggression (i.e., self-reported domestic abuse and road rage) as well as laboratory displaced aggression in 2 experiments. Copyright 2006 APA, all rights reserved.
Demaria, T P; Kassinove, H; Dill, C A
1989-01-01
Test consistency and confirmatory factor analyses were performed on the Survey of Personal Beliefs, a new measure of irrational thinking based on rational-emotive personality theory. The survey, which was logically derived, includes a general rationality factor and subscales measuring five hypothesized core categories of irrational beliefs. Subjects included a nonclinical sample of 130 men and 150 women, with a mean age of 46. Results indicated that the Survey of Personal Beliefs had satisfactory total and scale reliability. The confirmatory analyses supported a higher order factor model including 5 first-order factors (awfulizing, self-directed shoulds, other-directed shoulds, low frustration tolerance, and self-worth) and 1 second-order or general factor.
Mantarova, Stefka G; Velcheva, Irena V; Georgieva, Spaska O; Stambolieva, Katerina I
2013-01-01
The last twenty years have witnessed a surge of interest in the autonomic symptoms in Parkinson's disease (PD) and the possibilities to diagnose and treat them. The specialized questionnaire assessing the autonomic symptoms in Parkinson's disease (SCOPA-AUT) has been validated and is available in English, Dutch and Spanish. In this study we aim at evaluating the validity, reliability and applicability of the Bulgarian version of SCOPA-AUT (SCOPA-AUT-BG). The study included 55 patients with idiopathic PD (mean age 64.4 +/- 8.9 yrs), and 40 healthy controls (mean age 58.5 +/- 9.4 yrs). Clinical severity and disease stage were assessed by the Unified Parkinson's Disease Rating Scale (UPDRS) and the Hoehn and Yahr scale (H&Y). Thirty-two of the PD patients completed SCOPA-AUT-BG again after a 7-day interval. Questionnaire reliability was analyzed by determining the internal consistency, homogeneity, discriminatory and construct validity and test-retest reliability. Analyses showed good internal consistency of the summary evaluation of SCOPA-AUT-BG (Cronbach's alpha coefficient = 0.79), which indicates the high reliability of the questionnaire. The lowest Cronbach's alpha coefficient (0.53) was found for the subscale "cardiovascular functions". A dominant role belongs to the subscales for gastrointestinal and urinary functions (Cronbach's alpha > 0.7), where a significantly high correlation of PD with the UPDRS scale was observed. We found high test-retest reliability based on the responses associated with dysfunction of the gastrointestinal, urinary, thermoregulatory and pupillary autonomic systems. The correlation of the results of SCOPA-AUT-BG with UPDRS is higher than that with H&Y, and the construct validity is high except for the cardiovascular and pupillomotor functions subscales. The results of this study show that SCOPA-AUT-BG is a valid and reliable specialized questionnaire to evaluate autonomic function in patients with Parkinson's disease. Using it allows for more detailed clinical evaluation of these patients and justifies the need to refer them to specialized examination of autonomic functions.
2010-01-01
Background The primary aim of this study was to develop and psychometrically test a Greek-language instrument for measuring satisfaction with home care. The first empirical evidence about the level of satisfaction with these services in Greece is also provided. Methods The questionnaire resulted from literature search, on-site observation and cognitive interviews. It was applied in 2006 to a sample of 201 enrollees of five home care programs in the city of Thessaloniki and contains 31 items that measure satisfaction with individual service attributes and are expressed on a 5-point Likert scale. The latter has been usually considered in practice as an interval scale, although it is in principle ordinal. We thus treated the variable as an ordinal one, but also employed the traditional approach in order to compare the findings. Our analysis was therefore based on ordinal measures such as the polychoric correlation, Kendall's Tau b coefficient and ordinal Cronbach's alpha. Exploratory factor analysis was followed by an assessment of internal consistency reliability, test-retest reliability, construct validity and sensitivity. Results Analyses with ordinal and interval scale measures produced in essence very similar results and identified four multi-item scales. Three of these were found to be reliable and valid: socioeconomic change, staff skills and attitudes and service appropriateness. A fourth dimension (service planning) had lower internal consistency reliability and yet very satisfactory test-retest reliability, construct validity and floor and ceiling effects. The global satisfaction scale created was also quite reliable. Overall, participants were satisfied, yet not very satisfied, with home care services. More room for improvement seems to exist for the socio-economic and planning aspects of care and less for staff skills and attitudes and appropriateness of provided services. Conclusions The methods developed seem to be a promising tool for the measurement of home care satisfaction in Greece. PMID:20602759
Detecting long-term growth trends using tree rings: a critical evaluation of methods.
Peters, Richard L; Groenendijk, Peter; Vlam, Mart; Zuidema, Pieter A
2015-05-01
Tree-ring analysis is often used to assess long-term trends in tree growth. A variety of growth-trend detection methods (GDMs) exist to disentangle age/size trends in growth from long-term growth changes. However, these detrending methods strongly differ in approach, with possible implications for their output. Here, we critically evaluate the consistency, sensitivity, reliability and accuracy of the four most widely used GDMs: conservative detrending (CD) applies mathematical functions to correct for decreasing ring widths with age; basal area correction (BAC) transforms diameter into basal area growth; regional curve standardization (RCS) detrends individual tree-ring series using average age/size trends; and size class isolation (SCI) calculates growth trends within separate size classes. First, we evaluated whether these GDMs produce consistent results applied to an empirical tree-ring data set of Melia azedarach, a tropical tree species from Thailand. Three GDMs yielded similar results - a growth decline over time - but the widely used CD method did not detect any change. Second, we assessed the sensitivity (probability of correct growth-trend detection), reliability (100% minus probability of detecting false trends) and accuracy (whether the strength of imposed trends is correctly detected) of these GDMs, by applying them to simulated growth trajectories with different imposed trends: no trend, strong trends (-6% and +6% change per decade) and weak trends (-2%, +2%). All methods except CD showed high sensitivity, reliability and accuracy to detect strong imposed trends. However, these were considerably lower in the weak or no-trend scenarios. BAC showed good sensitivity and accuracy, but low reliability, indicating uncertainty of trend detection using this method. Our study reveals that the choice of GDM influences results of growth-trend studies. We recommend applying multiple methods when analysing trends and encourage performing sensitivity and reliability analysis. Finally, we recommend SCI and RCS, as these methods showed highest reliability to detect long-term growth trends. © 2014 John Wiley & Sons Ltd.
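The sensitivity and reliability definitions given in the abstract above can be illustrated with a small tally over simulated runs. This is only a sketch under stated assumptions: each run is reduced to the sign of the imposed trend and the sign of the detected trend, and a "false trend" is taken to be any detected trend whose sign differs from the imposed one (including any detection in a no-trend run). The abstract does not spell out these details, and the run data are hypothetical.

```python
def sensitivity_and_reliability(runs):
    """runs: list of (imposed, detected) pairs, each value -1, 0 or +1.

    Returns (sensitivity %, reliability %) under the assumptions stated above.
    """
    with_trend = [(imp, det) for imp, det in runs if imp != 0]
    if not runs or not with_trend:
        raise ValueError("need at least one run with an imposed trend")
    # Sensitivity: share of imposed trends detected with the correct sign
    correct = sum(1 for imp, det in with_trend if det == imp)
    # False trend: something was detected, but it does not match what was imposed
    false_trends = sum(1 for imp, det in runs if det != 0 and det != imp)
    sensitivity = 100.0 * correct / len(with_trend)
    reliability = 100.0 - 100.0 * false_trends / len(runs)
    return sensitivity, reliability


# Hypothetical outcomes of eight simulated growth trajectories
simulated = [(1, 1), (1, 1), (1, 0), (-1, -1), (-1, 1), (0, 0), (0, 0), (0, -1)]
sens, rel = sensitivity_and_reliability(simulated)
print(sens, rel)
```

A method can score well on one measure and poorly on the other, which is the pattern the abstract reports for BAC.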
Rocket engine system reliability analyses using probabilistic and fuzzy logic techniques
NASA Technical Reports Server (NTRS)
Hardy, Terry L.; Rapp, Douglas C.
1994-01-01
The reliability of rocket engine systems was analyzed by using probabilistic and fuzzy logic techniques. Fault trees were developed for integrated modular engine (IME) and discrete engine systems, and then were used with the two techniques to quantify reliability. The IRRAS (Integrated Reliability and Risk Analysis System) computer code, developed for the U.S. Nuclear Regulatory Commission, was used for the probabilistic analyses, and FUZZYFTA (Fuzzy Fault Tree Analysis), a code developed at NASA Lewis Research Center, was used for the fuzzy logic analyses. Although both techniques provided estimates of the reliability of the IME and discrete systems, probabilistic techniques emphasized uncertainty resulting from randomness in the system whereas fuzzy logic techniques emphasized uncertainty resulting from vagueness in the system. Because uncertainty can have both random and vague components, both techniques were found to be useful tools in the analysis of rocket engine system reliability.
Rath, Hilke M; Steimann, Monika; Ullrich, Anneke; Rotsch, Martin; Zurborn, Karl-Heinz; Koch, Uwe; Kriston, Levente; Bergelt, Corinna
2015-02-01
Although the Occupational Stress and Coping Inventory (AVEM) questionnaire is used to assess work behaviour during occupation-related oncological rehabilitation, little is known about its psychometric characteristics in cancer patients. Therefore, we analysed the psychometric properties of the AVEM in this group. The AVEM was administered to 477 cancer patients at the beginning of rehabilitation. The AVEM consists of 11 subscales that categorise patients into one of four types of work behaviour. We obtained data from several subgroups and analysed reliability using Cronbach's α. We performed a confirmatory factor analysis (CFA) of the dimensional structure proposed by the authors of the AVEM. In addition, we analysed the AVEM's predictive validity by examining work-related outcomes one year after the end of rehabilitation (N = 336). Similar to a population-based reference sample, half of the patients exhibited work behaviours that might be problematic in stressful working situations. The AVEM proved to be a reliable instrument, and the CFA supported the factor structure of the AVEM. The analyses of predictive validity suggest that work behaviour and mental health characteristics, that involve the tendency to feel overwhelmed and less motivated at work, might lead to an increased level of occupational stress one year post-rehabilitation. The AVEM can be used during rehabilitation to assess the extent to which patients report work behaviours associated with occupational stress and dissatisfaction. Patients who exhibit the tendency to feel overwhelmed and helpless in stressful work situations should be identified early so they can be offered support.
[Validating the Spanish version of the Nursing Activities Score].
Sánchez-Sánchez, M M; Arias-Rivera, S; Fraile-Gamo, M P; Thuissard-Vasallo, I J; Frutos-Vivar, F
2015-01-01
Validating workload scores ensures that they are appropriate for the purpose for which they were developed. To validate the Nursing Activities Score (NAS) Spanish version. Observational and prospective study. 1,045 patients who were admitted to a medical-surgical unit and a serious burns unit in 2006 were included. The nurse in charge assessed patient workloads by the Nine Equivalents of Nursing Manpower Use Score and NAS. To assess the internal consistency of the measurements of NAS, item-test correlations, Cronbach's α and Cronbach's α corrected by omitting each of the items were calculated. The intraobserver and interobserver reliability were assessed with the intraclass correlation coefficient by viewing recordings and Kappa (interobserver reliability) was estimated. For the analysis of internal validity, a factorial principal components analysis was performed. Convergent validity was assessed using the Spearman correlation coefficient values obtained from the Nine Equivalents of Nursing Manpower Use Score and Spanish-NAS scales. For internal consistency, 164 questionnaires were analysed and a Cronbach's α of 0.373 was calculated. The intraclass correlation coefficient for intraobserver reliability estimate was 0.837 (95% CI: 0.466-0.950) and 0.662 (95% CI: 0.033-0.882) for interobserver reliability. The estimated kappa was 0.371. For internal validity, exploratory factor analysis showed that the first item explained 58.9% of the variance of the questionnaire. For convergent validity 1006 questionnaires were included and a Spearman correlation coefficient of 0.746 was observed. The psychometric properties of Spanish-NAS are acceptable. Copyright © 2014 Elsevier España, S.L.U. y SEEIUC. All rights reserved.
Psychometric Properties of the Persian Translation of the Sexual Quality of Life–Male Questionnaire
Maasoumi, Raziyeh; Mokarami, Hamidreza; Nazifi, Morteza; Stallones, Lorann; Taban, Abrahim; Yazdani Aval, Mohsen; Samimi, Kazem
2016-01-01
Sexual dysfunction has been demonstrated to be related to a poor quality of life. These dysfunctions are especially prevalent among men. This cross-sectional study aimed to investigate the psychometric properties of the Persian translation of the Sexual Quality of Life–Male (SQOL-M), translated and adapted to measure sexual quality of life among Iranian men. Forward–backward procedures were applied in translating the original SQOL-M into Persian, and then the psychometric properties of the Persian translation of the SQOL-M were studied. A total of 181 participants (23-60 years old) were included in the study. Validity was assessed by construct validity using confirmatory factor analysis, convergent validity, and content validity. The international index of erectile function (IIEF) and the work ability index were used to study the convergent validity. Reliability was evaluated through internal consistency and test–retest reliability analyses. The results from confirmatory factor analysis confirmed a one-factor solution for the Persian version of the SQOL-M. Content validity of the translated measure was endorsed by 10 specialists. Pearson correlations indicated that work ability index score, dimensions of the IIEF, and the IIEF total score were positively correlated with the Persian version of the SQOL-M (p < .001). Reliability evaluation indicated a high internal consistency and test–retest reliability. The Cronbach’s alpha coefficient and intraclass correlation coefficients were .96 and .95, respectively. Results indicated that the Persian version of the SQOL-M has good to excellent psychometric properties and can be used to assess the sexual quality of life among Iranian men. PMID:26856758
Psychometric Properties of the Persian Translation of the Sexual Quality of Life-Male Questionnaire.
Maasoumi, Raziyeh; Mokarami, Hamidreza; Nazifi, Morteza; Stallones, Lorann; Taban, Abrahim; Yazdani Aval, Mohsen; Samimi, Kazem
2017-05-01
Sexual dysfunction has been demonstrated to be related to a poor quality of life. These dysfunctions are especially prevalent among men. This cross-sectional study aimed to investigate the psychometric properties of the Persian translation of the Sexual Quality of Life-Male (SQOL-M), translated and adapted to measure sexual quality of life among Iranian men. Forward-backward procedures were applied in translating the original SQOL-M into Persian, and then the psychometric properties of the Persian translation of the SQOL-M were studied. A total of 181 participants (23-60 years old) were included in the study. Validity was assessed by construct validity using confirmatory factor analysis, convergent validity, and content validity. The international index of erectile function (IIEF) and the work ability index were used to study the convergent validity. Reliability was evaluated through internal consistency and test-retest reliability analyses. The results from confirmatory factor analysis confirmed a one-factor solution for the Persian version of the SQOL-M. Content validity of the translated measure was endorsed by 10 specialists. Pearson correlations indicated that work ability index score, dimensions of the IIEF, and the IIEF total score were positively correlated with the Persian version of the SQOL-M ( p < .001). Reliability evaluation indicated a high internal consistency and test-retest reliability. The Cronbach's alpha coefficient and intraclass correlation coefficients were .96 and .95, respectively. Results indicated that the Persian version of the SQOL-M has good to excellent psychometric properties and can be used to assess the sexual quality of life among Iranian men.
Questionnaire-based assessment of executive functioning: Psychometrics.
Castellanos, Irina; Kronenberger, William G; Pisoni, David B
2018-01-01
The psychometric properties of the Learning, Executive, and Attention Functioning (LEAF) scale were investigated in an outpatient clinical pediatric sample. As a part of clinical testing, the LEAF scale, which broadly measures neuropsychological abilities related to executive functioning and learning, was administered to parents of 118 children and adolescents referred for psychological testing at a pediatric psychology clinic; 85 teachers also completed LEAF scales to assess reliability across different raters and settings. Scores on neuropsychological tests of executive functioning and academic achievement were abstracted from charts. Psychometric analyses of the LEAF scale demonstrated satisfactory internal consistency, parent-teacher inter-rater reliability in the small to large effect size range, and test-retest reliability in the large effect size range, similar to values for other executive functioning checklists. Correlations between corresponding subscales on the LEAF and other behavior checklists were large, while most correlations with neuropsychological tests of executive functioning and achievement were significant but in the small to medium range. Results support the utility of the LEAF as a reliable and valid questionnaire-based assessment of delays and disturbances in executive functioning and learning. Applications and advantages of the LEAF and other questionnaire measures of executive functioning in clinical neuropsychology settings are discussed.
Object-oriented fault tree evaluation program for quantitative analyses
NASA Technical Reports Server (NTRS)
Patterson-Hine, F. A.; Koen, B. V.
1988-01-01
Object-oriented programming can be combined with fault tree techniques to give a significantly improved environment for evaluating the safety and reliability of large complex systems for space missions. Deep knowledge about system components and interactions, available from reliability studies and other sources, can be described using objects that make up a knowledge base. This knowledge base can be interrogated throughout the design process, during system testing, and during operation, and can be easily modified to reflect design changes in order to maintain a consistent information source. An object-oriented environment for reliability assessment has been developed on a Texas Instruments (TI) Explorer LISP workstation. The program, which directly evaluates system fault trees, utilizes the object-oriented extension to LISP called Flavors that is available on the Explorer. The object representation of a fault tree facilitates the storage and retrieval of information associated with each event in the tree, including tree structural information and intermediate results obtained during the tree reduction process. Reliability data associated with each basic event are stored in the fault tree objects. The object-oriented environment on the Explorer also includes a graphical tree editor which was modified to display and edit the fault trees.
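The idea of representing a fault tree as objects that hold both reliability data and intermediate results can be sketched in a few lines. The example below is written in Python rather than the LISP Flavors system the abstract describes, and it is not the authors' program: the class names, the component names and the probabilities are all hypothetical, and the gates assume statistically independent input events.

```python
from dataclasses import dataclass, field
from math import prod
from typing import List, Optional


@dataclass
class BasicEvent:
    name: str
    probability: float  # failure probability stored on the event object

    def evaluate(self) -> float:
        return self.probability


@dataclass
class Gate:
    name: str
    kind: str  # "AND" or "OR"
    inputs: List[object] = field(default_factory=list)
    result: Optional[float] = None  # intermediate result kept on the object

    def evaluate(self) -> float:
        probs = [child.evaluate() for child in self.inputs]
        if self.kind == "AND":
            # All inputs must fail (independence assumed)
            self.result = prod(probs)
        elif self.kind == "OR":
            # At least one input fails (independence assumed)
            self.result = 1.0 - prod(1.0 - p for p in probs)
        else:
            raise ValueError(f"unknown gate kind: {self.kind}")
        return self.result


# Hypothetical system: fails if both pumps fail, or if the valve fails
pump_a = BasicEvent("pump_a", 0.01)
pump_b = BasicEvent("pump_b", 0.02)
valve = BasicEvent("valve", 0.001)
both_pumps = Gate("both_pumps_fail", "AND", [pump_a, pump_b])
top = Gate("system_fails", "OR", [both_pumps, valve])
top_probability = top.evaluate()
print(top_probability)
```

After evaluation each gate retains its own result, so intermediate probabilities can be inspected without recomputing the tree.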
Falkum, Erik; Pedersen, Geir; Karterud, Sigmund
2009-01-01
This article examines reliability and validity aspects of the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition (DSM-IV) paranoid personality disorder (PPD) diagnosis. Patients with personality disorders (n = 930) from the Norwegian network of psychotherapeutic day hospitals, of which 114 had PPD, were included in the study. Frequency distribution, χ², correlations, reliability statistics, exploratory, and confirmatory factor analyses were performed. The distribution of PPD criteria revealed no distinct boundary between patients with and without PPD. Diagnostic category membership was obtained in 37 of 64 theoretically possible ways. The PPD criteria formed a separate factor in a principal component analysis, whereas a confirmatory factor analysis indicated that the DSM-IV PPD construct consists of 2 separate dimensions: suspiciousness and hostility. The reliability of the unitary PPD scale was only 0.70, probably partly due to the apparent 2-dimensionality of the construct. Persistent unwarranted doubts about the loyalty of friends had the highest diagnostic efficiency, whereas unwarranted accusations of infidelity of partner had particularly poor indicator properties. The reliability and validity of the unitary PPD construct may be questioned. The 2-dimensional PPD model should be further explored.
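The figure of 64 theoretically possible ways to meet the diagnosis can be reproduced with a short count. The sketch assumes, as background not stated in the abstract, that DSM-IV PPD lists seven criteria and requires at least four of them; the variable names are illustrative only.

```python
from itertools import combinations
from math import comb

N_CRITERIA = 7   # assumed number of DSM-IV PPD criteria
THRESHOLD = 4    # assumed minimum number of criteria for the diagnosis

# Closed form: C(7,4) + C(7,5) + C(7,6) + C(7,7)
ways = sum(comb(N_CRITERIA, k) for k in range(THRESHOLD, N_CRITERIA + 1))

# Cross-check by enumerating every qualifying subset of criteria
enumerated = sum(
    1
    for k in range(THRESHOLD, N_CRITERIA + 1)
    for _ in combinations(range(N_CRITERIA), k)
)
print(ways, enumerated)
```

Under these assumptions the count is 35 + 21 + 7 + 1 = 64, which matches the total given in the abstract; the study observed 37 of those combinations.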
Sun, Zong-Liang; Zhang, Yu-Zhen; Zhang, Feng; Zhang, Jia-Wei; Zheng, Guo-Can; Tan, Ling; Wang, Chong-Zhi; Zhou, Lian-Di; Zhang, Qi-Hui; Yuan, Chun-Su
2018-06-22
An efficient method combined with fingerprint and chemometric analyses was developed to evaluate the quality of the traditional Chinese medicine plant Penthorum chinense Pursh. Nine samples were collected from different regions during different harvest periods, and 17 components in the form of extracts were simultaneously examined to assess quality by using high-performance liquid chromatography. The hepatoprotective effects of components were investigated by assessing the inhibition of SMMC-7721 cell growth. The results indicated that the quality control method was accurate, stable, and reliable, and the hierarchical heat-map cluster and the principal component analyses confirmed that the classification of all nine samples was consistent. Quercetin and ellagitannins including pinocembrin-7-O-[3''-O-galloyl-4'',6''-hexahydroxydiphenoyl]-β-glucose (PGHG), thonningianin A, thonningianin B, and other flavonoids were abundant in the extracts, and significantly contributed to the hepatoprotective effects.
The development and psychometric analysis of the Chinese HIV-Related Fatigue Scale.
Li, Su-Yin; Wu, Hua-Shan; Barroso, Julie
2016-04-01
To develop a Chinese version of the human immunodeficiency virus-related Fatigue Scale and examine its reliability and validity. Fatigue is found in more than 70% of people infected with human immunodeficiency virus. However, a scale to assess fatigue in human immunodeficiency virus-positive people has not yet been developed for use in Chinese-speaking countries. A methodologic study involving instrument development and psychometric evaluation was used. The human immunodeficiency virus-related Fatigue Scale was examined through a two-step procedure: (1) translation and back translation and (2) psychometric analysis. A sample of 142 human immunodeficiency virus-positive patients was recruited from the Infectious Disease Outpatient Clinic in central Taiwan. Their fatigue data were analysed with Cronbach's α for internal consistency. Two weeks later, the data of a random sample of 28 patients from the original 142 were analysed for test-retest reliability. The correlation between the World Health Organization Quality of Life Assessment-Human Immunodeficiency Virus and the Chinese version of the human immunodeficiency virus-related Fatigue Scale was analysed for concurrent validity. The Chinese version of the human immunodeficiency virus-related Fatigue Scale scores of human immunodeficiency virus-positive patients with highly active antiretroviral therapy and those without were compared to demonstrate construct validity. The internal consistency and test-retest reliability of the Chinese version of the human immunodeficiency virus-related Fatigue Scale were 0·97 and 0·686, respectively. In regard to concurrent validity, a negative correlation was found between the scores of the Chinese version of the human immunodeficiency virus-related Fatigue Scale and the World Health Organization Quality of Life Assessment-Human Immunodeficiency Virus. 
Additionally, the Chinese version of the human immunodeficiency virus-related Fatigue Scale could be used to effectively distinguish fatigue differences between the human immunodeficiency virus-positive patients with highly active antiretroviral therapy and those without. The Chinese version of the human immunodeficiency virus-related Fatigue Scale presents good reliability and validity through a robust psychometric analysis. This scale can be appropriately applied to human immunodeficiency virus-positive patients by clinical staff and case managers in Chinese-speaking countries. The Chinese version of the human immunodeficiency virus-related Fatigue Scale is an effective and comprehensive tool that can help clinical professionals measure the frequency, strength and impact on the quality of life of fatigue in Chinese human immunodeficiency virus-positive patients. © 2016 John Wiley & Sons Ltd.
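The two reliability statistics reported in the abstract above, Cronbach's α for internal consistency and a test-retest correlation, can be sketched in a few lines. This is a minimal illustration only: the response data and the function names `cronbach_alpha` and `pearson_r` are hypothetical, and the abstract does not state which correlation coefficient the authors used for test-retest reliability, so Pearson's r is an assumption here.

```python
from statistics import mean, variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    k = len(scores[0])                      # number of items
    items = list(zip(*scores))              # one tuple of scores per item
    item_var = sum(variance(col) for col in items)
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - item_var / total_var)


def pearson_r(x, y):
    """Pearson correlation between two administrations of the same scale."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


# Hypothetical data: 5 respondents answering 3 items (not from the study).
responses = [[3, 4, 3], [2, 2, 3], [5, 4, 5], [1, 2, 1], [4, 4, 4]]
alpha = cronbach_alpha(responses)

# Hypothetical total scores at baseline and two weeks later.
baseline = [10, 7, 14, 4, 12]
retest = [11, 8, 13, 5, 12]
r = pearson_r(baseline, retest)
```

Alpha rises as the items covary more strongly relative to their individual variances, which is why a long scale with highly similar items can reach values near 0.97.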
Validation of the Chinese Version of the Quality of Nursing Work Life Scale
Fu, Xia; Xu, Jiajia; Song, Li; Li, Hua; Wang, Jing; Wu, Xiaohua; Hu, Yani; Wei, Lijun; Gao, Lingling; Wang, Qiyi; Lin, Zhanyi; Huang, Huigen
2015-01-01
Quality of Nursing Work Life (QNWL) serves as a predictor of a nurse’s intent to leave and hospital nurse turnover. However, QNWL measurement tools that have been validated for use in China are lacking. The present study evaluated the construct validity of the QNWL scale in China. A cross-sectional study using convenience sampling was conducted from June 2012 to January 2013 at five hospitals in Guangzhou, which employ 1938 nurses. The participants were asked to complete the QNWL scale and the World Health Organization Quality of Life abbreviated version (WHOQOL-BREF). A total of 1922 nurses provided the final data used for analyses. Sixty-five nurses from the first investigated division were re-measured two weeks later to assess the test-retest reliability of the scale. The internal consistency reliability of the QNWL scale was assessed using Cronbach’s α. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC). Criterion-related validity was assessed using the correlation of the total scores of the QNWL and the WHOQOL-BREF. Construct validity was assessed with the following indices: χ2 statistics and degrees of freedom; root mean square error of approximation (RMSEA); the Akaike information criterion (AIC); the consistent Akaike information criterion (CAIC); the goodness-of-fit index (GFI); the adjusted goodness of fit index; and the comparative fit index (CFI). The findings demonstrated high internal consistency (Cronbach’s α = 0.912) and test-retest reliability (intra-class correlation coefficient = 0.74) for the QNWL scale. The chi-square test (χ2 = 13879.60, df [degrees of freedom] = 813, P = 0.0001) was significant. The RMSEA value was 0.091, and AIC = 1806.00, CAIC = 7730.69, CFI = 0.93, and GFI = 0.74. The correlation coefficient between the QNWL total scores and the WHOQOL-BREF total scores was 0.605 (p<0.01).
The QNWL scale was reliable and valid in Chinese-speaking nurses and could be used as a clinical and research instrument for measuring work-related factors among nurses in China. PMID:25950838
Introducing English and German versions of the Adolescent Time Attitude Scale.
Worrell, Frank C; Mello, Zena R; Buhl, Monika
2013-08-01
In this study, the authors report on the development of English and German versions of the Adolescent Time Attitude Scale (ATAS). The ATAS consists of six subscales assessing Past Positive, Past Negative, Present Positive, Present Negative, Future Positive, and Future Negative time attitudes. The authors describe the development of the scales and present data on the reliability and structural validity of ATAS scores in samples of American (N = 300) and German (N = 316) adolescents. Internal consistency estimates for scores on the English and German versions of the ATAS were in the .70 to .80 range. Confirmatory factor analyses indicated that a six-factor structure yielded the best fit for scores and that the scores were invariant across samples.
Mikaeili, Fattaneh; Mathis, Alexander; Deplazes, Peter; Mirhendi, Hossein; Barazesh, Afshin; Ebrahimi, Sepideh; Kia, Eshrat Beigom
2017-09-26
The definitive genetic identification of Toxocara species is currently based on PCR/sequencing. The objectives of the present study were to design and conduct an in silico polymerase chain reaction-restriction fragment length polymorphism method for identification of Toxocara species. In silico analyses using the DNASIS and NEBcutter softwares were performed with rDNA internal transcribed spacers, and mitochondrial cox1 and nad1 sequences obtained in our previous studies along with relevant sequences deposited in GenBank. Consequently, RFLP profiles were designed and all isolates of T. canis and T. cati collected from dogs and cats in different geographical areas of Iran were investigated with the RFLP method using some of the identified suitable enzymes. The findings of in silico analyses predicted that on the cox1 gene only the MboII enzyme is appropriate for PCR-RFLP to reliably distinguish the two species. No suitable enzyme for PCR-RFLP on the nad1 gene was identified that yields the same pattern for all isolates of a species. DNASIS software showed that there are 241 suitable restriction enzymes for the differentiation of T. canis from T. cati based on ITS sequences. RsaI, MvaI and SalI enzymes were selected to evaluate the reliability of the in silico PCR-RFLP. The sizes of restriction fragments obtained by PCR-RFLP of all samples consistently matched the expected RFLP patterns. The ITS sequences are usually conserved and the PCR-RFLP approach targeting the ITS sequence is recommended for the molecular differentiation of Toxocara species and can provide a reliable tool for identification purposes particularly at the larval and egg stages.
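The in silico PCR-RFLP step described above amounts to locating every occurrence of an enzyme's recognition site in an amplicon and listing the resulting fragment lengths. The sketch below is a hedged illustration, not the DNASIS or NEBcutter workflow the authors used: the function `digest` and both toy amplicons are hypothetical, and only the RsaI recognition site (GTAC, blunt cut between T and A) is taken as known.

```python
def digest(sequence, site, cut_offset):
    """Return fragment lengths after cutting a linear sequence at each site.

    cut_offset is the number of bases of the recognition site that stay on
    the left-hand fragment (2 for RsaI, which cuts GT^AC).
    """
    sequence = sequence.upper()
    cuts = []
    start = 0
    while True:
        idx = sequence.find(site, start)
        if idx == -1:
            break
        cuts.append(idx + cut_offset)
        start = idx + 1
    bounds = [0] + cuts + [len(sequence)]
    return [right - left for left, right in zip(bounds, bounds[1:])]


# Toy amplicons (hypothetical, not real Toxocara ITS sequences).
amplicon_with_site = "AAAAGTACCCCCCC"
amplicon_without_site = "AAAACCCCCCCCCC"

pattern_a = digest(amplicon_with_site, "GTAC", 2)
pattern_b = digest(amplicon_without_site, "GTAC", 2)
```

Two species are distinguishable by an enzyme when their amplicons yield different fragment-length lists, and the enzyme is only useful if all isolates of a species share the same list. Because GTAC is palindromic, scanning one strand is enough; a non-palindromic site would also need a search on the reverse complement.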
Ciciriello, Sabina; Buchbinder, Rachelle; Osborne, Richard H; Wicks, Ian P
2014-02-01
To develop and test an evidence-based, multimedia patient education program (MPEP) about methotrexate (MTX) treatment for rheumatoid arthritis (RA) and a new measure of patient knowledge [Methotrexate in Rheumatoid Arthritis Knowledge test (MiRAK)]. The content of the MPEP and MiRAK was guided by concept-mapping workshops with patients (N = 24), literature review, health professional, and expert linguistic input. The MPEP and MiRAK underwent multiple stages of testing and revision with patients and health professionals. The MiRAK was administered to RA patients (N = 169) and its properties examined using the Rasch analyses. A subset of respondents (N = 131) repeated the MiRAK to determine test-retest reliability. A before-after pilot study with patients who had recently started MTX (N = 31) tested responsiveness of the MiRAK and feasibility and acceptability of the MPEP. A DVD of 24-minutes duration was produced that presents detailed, evidence-based information about MTX. The Rasch analyses of the 60 MiRAK items revealed that these could be summated into a single score. The MiRAK had good model fit, supporting internal construct validity, good internal consistency (person separation index; 0.84), test-retest reliability (ICC; 0.89), and ability to detect change (ES; 2.38). The before-after study suggested that patients could self-administer the MPEP, with the majority finding it informative and easy to use. We developed a MPEP about MTX treatment for RA, which was found to be user-friendly and easily implementable. The MiRAK is a new scale, testing a broad spectrum of MTX knowledge. Analyses revealed strong evidence for its validity and reliability. Copyright © 2014 Elsevier Inc. All rights reserved.
Development of the PROMIS health expectancies of smoking item banks.
Edelen, Maria Orlando; Tucker, Joan S; Shadel, William G; Stucky, Brian D; Cerully, Jennifer; Li, Zhen; Hansen, Mark; Cai, Li
2014-09-01
Smokers' health-related outcome expectancies are associated with a number of important constructs in smoking research, yet there are no measures currently available that focus exclusively on this domain. This paper describes the development and evaluation of item banks for assessing the health expectancies of smoking. Using data from a sample of daily (N = 4,201) and nondaily (N = 1,183) smokers, we conducted a series of item factor analyses, item response theory analyses, and differential item functioning analyses (according to gender, age, and race/ethnicity) to arrive at a unidimensional set of health expectancies items for daily and nondaily smokers. We also evaluated the performance of short forms (SFs) and computer adaptive tests (CATs) to efficiently assess health expectancies. A total of 24 items were included in the Health Expectancies item banks; 13 items are common across daily and nondaily smokers, 6 are unique to daily, and 5 are unique to nondaily. For both daily and nondaily smokers, the Health Expectancies item banks are unidimensional, reliable (reliability = 0.95 and 0.96, respectively), and perform similarly across gender, age, and race/ethnicity groups. A SF common to daily and nondaily smokers consists of 6 items (reliability = 0.87). Results from simulated CATs showed that health expectancies can be assessed with good precision with an average of 5-6 items adaptively selected from the item banks. Health expectancies of smoking can be assessed on the basis of these item banks via SFs, CATs, or through a tailored set of items selected for a specific research purpose. © The Author 2014. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Kolotkin, Ronette L; Crosby, Ross D
2002-03-01
The short form of impact of weight on quality of life (IWQOL)-Lite is a 31-item, self-report, obesity-specific measure of health-related quality of life (HRQOL) that consists of a total score and scores on each of five scales--physical function, self-esteem, sexual life, public distress, and work--and that exhibits strong psychometric properties. This study was undertaken in order to assess test-retest reliability and discriminant validity in a heterogeneous sample of individuals not in treatment. Individuals were recruited from the community to complete questionnaires that included the IWQOL-Lite, SF-36, Rosenberg self-esteem (RSE) scale, Marlowe-Crowne social desirability scale, global ratings of quality of life, and sexual functioning and public distress ratings. Persons currently enrolled in weight loss programs or with a body mass index (BMI) of less than 18.5 were dropped from the analyses, leaving 341 females and 153 males for analysis, with an average BMI of 27.4. For test-retest reliability, 112 participants completed the IWQOL-Lite again. ANOVA revealed significant main effects for BMI for all IWQOL-Lite scales and total score. Females showed greater impairment than males on all scales except public distress. Internal consistency ranged from 0.816 to 0.944 for IWQOL-Lite scales and was 0.958 for total score. Test-retest reliability ranged from 0.814 to 0.877 for scales and was 0.937 for total score. Internal consistency and test-retest results for overweight/obese subjects were similar to those obtained for the total sample. There was strong evidence for convergent and discriminant validity of the IWQOL-Lite in overweight/obese subjects. As in previous studies conducted on treatment-seeking obese persons, the IWQOL-Lite appears to be a reliable and valid measure of obesity-specific quality of life in overweight/obese persons not seeking treatment.
Miller, M; Hamilton, J; Scupham, R; Matwiejczyk, L; Prichard, I; Farrer, O; Yaxley, A
2018-01-01
Food service staff are integral to delivery of quality food in aged care homes yet measurement of their satisfaction is unable to be performed due to an absence of a valid and reliable questionnaire. The aim of this study was to develop and perform psychometric testing for a new Food Service Satisfaction Questionnaire developed in Australia specifically for use by food service staff working in residential aged care homes (Flinders FSSQFSAC). A mixed methods design utilizing both a qualitative (in-depth interviews, focus groups) and a quantitative approach (cross sectional survey) was used. Content validity was determined from focus groups and interviews with food service staff currently working in aged care homes, related questionnaires from the literature and consultation with an expert panel. The questionnaire was tested for construct validity and internal consistency using data from food service staff currently working in aged care homes that responded to an electronic invitation circulated to Australian aged care homes using a national database of email addresses. Construct validity was tested via principal components analysis and internal consistency through Cronbach's alpha. Temporal stability of the questionnaire was determined from food service staff undertaking the Flinders FSSQFSAC on two occasions, two weeks apart, and analysed using Pearson's correlations. Content validity for the Flinders FSSQFSAC was established from a panel of experts and stakeholders. Principal components analysis revealed food service staff satisfaction was represented by 61 items divided into eight domains: job satisfaction (α=0.832), food quality (α=0.871), staff training (α=0.922), consultation (α=0.840), eating environment (α=0.777), reliability (α=0.695), family expectations (α=0.781) and resident relationships (α=0.429), establishing construct validity in all domains, and internal consistency in all (α>0.5) except for "resident relationships" (α=0.429).
Test-retest reliability coefficients ranged from 0.276 to 0.826 dependent on domain, with test-retest reliability established in seven domains at r>0.4; an exception was "reliability" at r=0.276. The newly developed Flinders FSSQFSAC has acceptable validity and reliability and thereby the potential to measure satisfaction of food service staff working in residential aged care homes, identify areas for strategic change, measure improvements and in turn, improve the satisfaction and quality of life of both food service staff and residents of aged care homes.
A review of lunar chronology revealing a preponderance of 4.34-4.37 Ga ages
Borg, Lars E.; Gaffney, Amy M.; Shearer, Charles K.
2014-11-24
In this study, data obtained from Sm-Nd and Rb-Sr isotopic measurements of lunar highlands’ samples are renormalized to common standard values and then used to define ages with a common isochron regression algorithm. The reliability of these ages is evaluated using five criteria that include whether: (1) the ages are defined by multiple isotopic systems, (2) the data demonstrate limited scatter outside uncertainty, (3) initial isotopic compositions are consistent with the petrogenesis of the samples, (4) the ages are defined by an isotopic system that is resistant to disturbance by impact metamorphism, and (5) the rare-earth element abundances determined by isotope dilution of bulk of mineral fractions match those measured by in situ analyses. From this analysis, it is apparent that the oldest highlands’ rock ages are some of the least reliable, and that there is little support for crustal ages older than ~4.40 Ga. A model age for ur-KREEP formation calculated using the most reliable Mg-suite Sm-Nd isotopic systematics, in conjunction with Sm-Nd analyses of KREEP basalts, is 4389 ± 45 Ma. This age is a good match to the Lu-Hf model age of 4353 ± 37 Ma determined using a subset of this sample suite, the average model age of 4353 ± 25 Ma determined on mare basalts with the 146Sm-142Nd isotopic system, with a peak in Pb-Pb ages observed in lunar zircons of ~4340 ± 20 Ma, and the oldest terrestrial zircon age of 4374 ± 6 Ma. The preponderance of ages between 4.34 and 4.37 Ga reflects either primordial solidification of a lunar magma ocean or a widespread secondary magmatic event on the lunar nearside. The first scenario is not consistent with the oldest ages reported for lunar zircons, whereas the second scenario does not account for concordance between ages of crustal rocks and mantle reservoirs.
Painter, J; Trevithick, L; Hastings, R P; Ingham, B; Roy, A
2016-12-01
In meeting the needs of individuals with intellectual disabilities (ID) who access health services, a brief, holistic assessment of need is useful. This study outlines the development and testing of the Learning Disabilities Needs Assessment Tool (LDNAT), a tool intended for this purpose. An existing mental health (MH) tool was extended by a multidisciplinary group of ID practitioners. Additional scales were drafted to capture needs across six ID treatment domains that the group identified. LDNAT ratings were analysed for the following: item redundancy, relevance, construct validity and internal consistency (n = 1692); test-retest reliability (n = 27); and concurrent validity (n = 160). All LDNAT scales were deemed clinically relevant with little redundancy apparent. Principal component analysis indicated three components (developmental needs, challenging behaviour, MH and well-being). Internal consistency was good (Cronbach alpha 0.80). Individual item test-retest reliability was substantial to near perfect for 20 scales and slight to fair for three scales. Overall reliability was near perfect (intra-class correlation = 0.91). There were significant associations with five of six condition-specific measures, i.e. the Waisman Activities of Daily Living Scale (general ability/disability), Threshold Assessment Grid (risk), Behaviour Problems Inventory for Individuals with Intellectual Disabilities-Short Form (challenging behaviour), Social Communication Questionnaire (autism) and a bespoke physical health questionnaire. Additionally, the statistically significant correlations between these tools and the LDNAT components made sense clinically. There were no statistically significant correlations with the Psychiatric Assessment Schedules for Adults with Developmental Disabilities (a measure of MH symptoms in people with ID). The LDNAT had clinical utility when rating the needs of people with ID prior to condition-specific assessment(s).
Analyses of internal and external validity were promising. Further evaluation of its sensitivity to changes in needs is now required. © 2016 MENCAP and International Association of the Scientific Study of Intellectual and Developmental Disabilities and John Wiley & Sons Ltd.
van der Meulen, Ineke; van de Sandt-Koenderman, W Mieke E; Duivenvoorden, Hugo J; Ribbers, Gerard M
2010-01-01
This study explores the psychometric qualities of the Scenario Test, a new test to assess daily-life communication in severe aphasia. The test is innovative in that it: (1) examines the effectiveness of verbal and non-verbal communication; and (2) assesses patients' communication in an interactive setting, with a supportive communication partner. To determine the reliability, validity, and sensitivity to change of the Scenario Test and discuss its clinical value. The Scenario Test was administered to 122 persons with aphasia after stroke and to 25 non-aphasic controls. Analyses were performed for the entire group of persons with aphasia, as well as for a subgroup of persons unable to communicate verbally (n = 43). Reliability (internal consistency, test-retest reliability, inter-judge, and intra-judge reliability) and validity (internal validity, convergent validity, known-groups validity) and sensitivity to change were examined using standard psychometric methods. The Scenario Test showed high levels of reliability. Internal consistency (Cronbach's alpha = 0.96; item-rest correlations = 0.58-0.82) and test-retest reliability (ICC = 0.98) were high. Agreement between judges in total scores was good, as indicated by the high inter- and intra-judge reliability (ICC = 0.86-1.00). Agreement in scores on the individual items was also good (square-weighted kappa values 0.61-0.92). The test demonstrated good levels of validity. A principal component analysis for categorical data identified two dimensions, interpreted as general communication and communicative creativity. Correlations with three other instruments measuring communication in aphasia, that is, Spontaneous Speech interview from the Aachen Aphasia Test (AAT), Amsterdam-Nijmegen Everyday Language Test (ANELT), and Communicative Effectiveness Index (CETI), were moderate to strong (0.50-0.85) suggesting good convergent validity. 
Group differences were observed between persons with aphasia and non-aphasic controls, as well as between persons with aphasia unable to use speech to convey information and those able to communicate verbally; this indicates good known-groups validity. The test was sensitive to changes in performance, measured over a period of 6 months. The data support the reliability and validity of the Scenario Test as an instrument for examining daily-life communication in aphasia. The test focuses on multimodal communication; its psychometric qualities enable future studies on the effect of Alternative and Augmentative Communication (AAC) training in aphasia.
Development and validation of a Malawian version of the primary care assessment tool.
Dullie, Luckson; Meland, Eivind; Hetlevik, Øystein; Mildestvedt, Thomas; Gjesdal, Sturla
2018-05-16
Malawi does not have validated tools for assessing primary care performance from patients' experience. The aim of this study was to develop a Malawian version of Primary Care Assessment Tool (PCAT-Mw) and to evaluate its reliability and validity in the assessment of the core primary care dimensions from adult patients' perspective in Malawi. A team of experts assessed the South African version of the primary care assessment tool (ZA-PCAT) for face and content validity. The adapted questionnaire underwent forward and backward translation and a pilot study. The tool was then used in an interviewer administered cross-sectional survey in Neno district, Malawi, to test validity and reliability. Exploratory factor analysis was performed on a random half of the sample to evaluate internal consistency, reliability and construct validity of items and scales. The identified constructs were then tested with confirmatory factor analysis. Likert scale assumption testing and descriptive statistics were done on the final factor structure. The PCAT-Mw was further tested for intra-rater and inter-rater reliability. From the responses of 631 patients, a 29-item PCAT-Mw was constructed comprising seven multi-item scales, representing five primary care dimensions (first contact, continuity, comprehensiveness, coordination and community orientation). All the seven scales achieved good internal consistency, item-total correlations and construct validity. Cronbach's alpha coefficient ranged from 0.66 to 0.91. A satisfactory goodness of fit model was achieved (GFI = 0.90, CFI = 0.91, RMSEA = 0.05, PCLOSE = 0.65). The full range of possible scores was observed for all scales. Scaling assumptions tests were achieved for all except the two comprehensiveness scales. Intra-class correlation coefficient (ICC) was 0.90 (n = 44, 95% CI 0.81-0.94, p < 0.001) for intra-rater reliability and 0.84 (n = 42, 95% CI 0.71-0.96, p < 0.001) for inter-rater reliability. 
Comprehensive metric analyses supported the reliability and validity of PCAT-Mw in assessing the core concepts of primary care from adult patients' experience. This tool could be used for health service research in primary care in Malawi.
The development and validation of the Incivility from Customers Scale.
Wilson, Nicole L; Holmvall, Camilla M
2013-07-01
Scant research has examined customers as sources of workplace incivility, despite evidence suggesting that mistreatment is more common from organizational outsiders, including customers, than from organizational members (Grandey, Kern, & Frone, 2007; Schat & Kelloway, 2005). As an important step in extending the literature on customer incivility, we conducted two studies to develop and validate a measure of this construct. Study 1 used focus groups of retail and restaurant employees (n = 30) to elicit a list of uncivil customer behaviors, based on which we wrote initial scale items. Study 2 used a correlational survey design (n = 439) to pare down the number of scale items to 10 and to garner reliability and validity evidence for the scale. Exploratory and confirmatory factor analyses show that the scale is unidimensional and distinguishable from measures of the related, but distinct, constructs of interpersonal justice and psychological aggression from customers. Reliability analyses show that the scale is internally consistent. Significant correlations between the scale and individuals' job satisfaction, turnover intentions, and general and job-specific psychological strain provide evidence of criterion-related validity. Hierarchical regression analyses show that the scale significantly predicts three of four organizational and personal strain outcomes over and above a workplace incivility measure adapted for customer incivility, providing some evidence of incremental validity. Limitations and future research directions are discussed. PsycINFO Database Record (c) 2013 APA, all rights reserved.
Sayer, Nina A; Frazier, Patricia; Orazem, Robert J; Murdoch, Maureen; Gravely, Amy; Carlson, Kathleen F; Hintz, Samuel; Noorbaloochi, Siamak
2011-12-01
The primary objective of this study was to describe the development, reliability, and construct validity of scores on the Military to Civilian Questionnaire (M2C-Q), a 16-item self-report measure of postdeployment community reintegration difficulty. We surveyed a national, stratified sample of 1,226 Iraq and Afghanistan veterans who used U.S. Department of Veterans Affairs (VA) medical care; 745 completed the M2C-Q and validated mental health screening measures. All analyses were based on weighted estimates. The internal consistency of the M2C-Q was .95 in this sample. Factor analyses indicated a single total score was the best-fitting model. Total scores were associated with measures theoretically related to reintegration difficulties including perception of overall difficulty readjusting back into civilian life (R(2) = .49), probable PTSD (d = 1.07), probable problem drug or alcohol use (d = 0.34), and overall mental health (r = -.83). Subgroup analyses revealed a similar pattern of findings in those who screened negative for PTSD. Nonwhite and unemployed veterans reported greater community reintegration difficulty (d = 0.20 and 0.45, respectively). Findings offer preliminary support for the reliability and construct validity of M2C-Q scores. Published 2011. This article is a US Government work and is in the public domain in the USA.
Głowacka, Katarzyna; Kromdijk, Johannes; Leonelli, Lauriebeth; Niyogi, Krishna K.; Clemente, Tom E.
2016-01-01
Stable transformation of plants is a powerful tool for hypothesis testing. A rapid and reliable evaluation method of the transgenic allele for copy number and homozygosity is vital in analysing these transformations. Here the suitability of Southern blot analysis, thermal asymmetric interlaced (TAIL‐)PCR, quantitative (q)PCR and digital droplet (dd)PCR to estimate T‐DNA copy number, locus complexity and homozygosity were compared in transgenic tobacco. Southern blot analysis and ddPCR on three generations of transgenic offspring with contrasting zygosity and copy number were entirely consistent, whereas TAIL‐PCR often underestimated copy number. qPCR deviated considerably from the Southern blot results and had lower precision and higher variability than ddPCR. Comparison of segregation analyses and ddPCR of T1 progeny from 26 T0 plants showed that at least 19% of the lines carried multiple T‐DNA insertions per locus, which can lead to unstable transgene expression. Segregation analyses failed to detect these multiple copies, presumably because of their close linkage. This shows the importance of routine T‐DNA copy number estimation. Based on our results, ddPCR is the most suitable method, because it is as reliable as Southern blot analysis yet much faster. A protocol for this application of ddPCR to large plant genomes is provided. PMID:26670088
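The abstract above does not give the arithmetic behind a ddPCR copy number estimate, so the following is only a sketch of the standard Poisson correction used in droplet digital PCR generally, not the authors' protocol. The function names, the droplet counts, and the choice of `ref_copies=2` for the reference gene are all assumptions for illustration; the appropriate reference copy number depends on the reference gene and genome.

```python
import math


def copies_per_droplet(positive, total):
    """Mean target copies per droplet from the fraction of positive droplets.

    Uses the Poisson correction lambda = -ln(1 - positive/total), which
    accounts for droplets that contain more than one template molecule.
    """
    if not 0 <= positive < total:
        raise ValueError("need 0 <= positive < total")
    return -math.log(1 - positive / total)


def copy_number(target_pos, target_total, ref_pos, ref_total, ref_copies=2):
    """Transgene copies per genome, scaled by a reference gene of known copy number."""
    if ref_pos == 0:
        raise ValueError("reference assay has no positive droplets")
    target = copies_per_droplet(target_pos, target_total)
    reference = copies_per_droplet(ref_pos, ref_total)
    return ref_copies * target / reference


# Hypothetical droplet counts (not data from the study).
estimate = copy_number(target_pos=2600, target_total=15000,
                       ref_pos=4700, ref_total=15000)
```

Rounding the estimate to the nearest integer or half-integer is a common way to call copy number and zygosity, but the thresholds for doing so are a laboratory-specific choice.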
The Interpersonal Shame Inventory for Asian Americans: Scale Development and Psychometric Properties
Wong, Y. Joel; Kim, Bryan S. K.; Nguyen, Chi P.; Cheng, Janice Ka Yan; Saw, Anne
2016-01-01
This article reports the development and psychometric properties of the Interpersonal Shame Inventory (ISI), a culturally salient and clinically relevant measure of interpersonal shame for Asian Americans. Across 4 studies involving Asian American college students, the authors provided evidence for this new measure’s validity and reliability. Exploratory factor analyses and confirmatory factor analyses provided support for a model with 2 correlated factors: external shame (arising from concerns about others’ negative evaluations) and family shame (arising from perceptions that one has brought shame to one’s family), corresponding to 2 subscales: ISI-E and ISI-F, respectively. Evidence for criterion-related, concurrent, discriminant, and incremental validity was demonstrated by testing the associations between external shame and family shame and immigration/international status, generic state shame, face concerns, thwarted belongingness, perceived burdensomeness, self-esteem, depressive symptoms, and suicide ideation. External shame and family shame also exhibited differential relations with other variables. Mediation findings were consistent with a model in which family shame mediated the effects of thwarted belongingness on suicide ideation. Further, the ISI subscales demonstrated high alpha coefficients and test–retest reliability. These findings are discussed in light of the conceptual, methodological, and clinical contributions of the ISI. PMID:24188650
Antunes, Ana Cristina; Caetano, António; Pina E Cunha, Miguel
2017-06-01
The Psychological Capital Questionnaire (PCQ) is the most commonly used measure for assessing psychological capital in work settings. Although several studies confirmed its factorial validity, most validation studies only examined the four-factor structure preconized by Luthans, Youssef, and Avolio, not attending to empirical evidence on alternative factorial structures. The present study aimed to test the psychometric properties of the Portuguese version of the PCQ, by using two independent samples (NS1 = 542; NS2 = 115) of Portuguese employees. We conducted a series of confirmatory factor analyses and found that, unlike previous findings, a five-factor solution of the PCQ best fitted the data. The evidence obtained also supported the existence of a second-order factor, psychological capital. The coefficients of internal consistency, as measured by Cronbach's alpha, were adequate and test-retest reliability suggested that the PCQ presented a lower stability than personality factors. Convergent validity, assessed with average variance extracted, revealed problems in the optimism subscale. The discriminant validity of the PCQ was confirmed by its correlations with Positive and Negative Affect and Big Five personality factors. Hierarchical regression analyses showed that this measure has incremental validity over personality and affect when predicting job performance.
Reliability and validity of the adapted Resistance Training Skills Battery for Children.
Furzer, Bonnie J; Bebich-Philip, Marc D; Wright, Kemi E; Reid, Siobhan L; Thornton, Ashleigh L
2017-12-29
Resistance training (RT) is emerging as a training modality to improve motor function and facilitate physical activity participation in children across the motor proficiency spectrum. Although RT competency assessments have been established and validated among adolescent cohorts, the extent to which these methods are suitable for assessing children's RT skills is unknown. This project aimed to assess the psychometric properties of the adapted Resistance Training Skills Battery for Children (RTSBc), in children with varying motor proficiency. Repeated measures design with 40 participants (M age=8.2±1.7years) displaying varying levels of motor proficiency. Participants performed the adapted RTSBc on two occasions, receiving a score for their execution of each component, in addition to an overall RT skill quotient child (RTSQc). Cronbach's alpha, intra-class correlation (ICC), Bland-Altman analysis, and typical error were used to assess test-retest reliability. To examine construct validity, exploratory factor analysis was performed alongside computing correlations between participants' muscle strength, motor proficiency, age, lean muscle mass, and RTSQc. The RTSBc displayed an acceptable level of internal consistency (alpha=0.86) and test-retest reliability (ICC range=0.86-0.99). Exploratory factor analysis supported internal test structure, with all six RT skills loading strongly on a single factor (range 0.56-0.89). Analyses of structural validity revealed positive correlations for RTSQc in relation to motor proficiency (r=0.52, p<0.001) and strength scores (r=0.61, p<0.001). Analyses revealed support for the construct validity and test-retest reliability of the RTSBc, providing preliminary evidence that the RTSBc is appropriate for use in the assessment of children's RT competency. Copyright © 2018 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
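Cronbach's alpha, reported here and in many of the other records in this listing, can be computed directly from a participants-by-items score table. The following is a minimal sketch of the standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The function name and the toy data are hypothetical and unrelated to the RTSBc data.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of rows (one row of item scores per participant)."""
    n_items = len(scores[0])
    if n_items < 2:
        raise ValueError("at least two items are required")
    # Transpose so each tuple holds all participants' scores on one item.
    items = list(zip(*scores))
    item_variances = [variance(item) for item in items]
    total_variance = variance([sum(row) for row in scores])
    return n_items / (n_items - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical scores: four participants, two items.
example = [[2, 3], [4, 4], [3, 5], [5, 4]]
print(round(cronbach_alpha(example), 3))
```

The sketch uses sample variances (n - 1 denominator). Using population variances for both the items and the totals gives the same alpha, because the denominators cancel in the ratio.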
Lei, Pingguang; Lei, Guanghe; Tian, Jianjun; Zhou, Zengfen; Zhao, Miao; Wan, Chonghua
2014-10-01
This paper aims to develop the irritable bowel syndrome (IBS) scale of the system of Quality of Life Instruments for Chronic Diseases (QLICD-IBS) by the modular approach and to validate it using both classical test theory and generalizability theory. The QLICD-IBS was developed based on programmed decision procedures with multiple nominal and focus group discussions, in-depth interviews, and quantitative statistical procedures. One hundred twelve inpatients with IBS provided the data, with QOL measured three times before and after treatments. The psychometric properties of the scale were evaluated with respect to validity, reliability, and responsiveness employing correlation analysis, factor analyses, multi-trait scaling analysis, t tests and also G studies and D studies of generalizability theory analysis. Multi-trait scaling analysis, correlation, and factor analyses confirmed good construct validity and criterion-related validity when using SF-36 as a criterion. Test-retest reliability coefficients (Pearson r and intra-class correlation (ICC)) for the overall score and all domains were higher than 0.80; the internal consistency α for all domains at two measurements was higher than 0.70 except for the social domain (0.55 and 0.67, respectively). The overall score and scores for all domains/facets had statistically significant changes after treatments, with moderate or higher effect sizes (standardized response mean, SRM) ranging from 0.72 to 1.02 at domain levels. G coefficients and index of dependability (Ф coefficients) confirmed the reliability of the scale further with more exact variance components. The QLICD-IBS has good validity, reliability, responsiveness, and some highlights and can be used as the quality of life instrument for patients with IBS.
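The standardized response mean (SRM) used above as the responsiveness effect size is conventionally the mean of the paired score changes divided by the standard deviation of those changes. A minimal sketch under that conventional definition follows. The function name and the before/after scores are hypothetical.

```python
from statistics import mean, stdev


def standardized_response_mean(before, after):
    """SRM = mean(after - before) / sample SD of (after - before)."""
    if len(before) != len(after):
        raise ValueError("before and after must be paired")
    changes = [a - b for b, a in zip(before, after)]
    return mean(changes) / stdev(changes)


# Hypothetical domain scores for four patients before and after treatment.
before_scores = [50, 55, 60, 65]
after_scores = [58, 60, 70, 69]
print(round(standardized_response_mean(before_scores, after_scores), 2))
```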
Cancela Carral, José María; Lago Ballesteros, Joaquín; Ayán Pérez, Carlos; Mosquera Morono, María Belén
2016-01-01
To analyse the reliability and validity of the Weekly Activity Checklist (WAC), the One Week Recall (OWR), and the Godin-Shephard Leisure Time Exercise Questionnaire (GLTEQ) in Spanish adolescents. A total of 78 adolescents wore a pedometer for one week, filled out the questionnaires at the end of this period and underwent a test to estimate their maximal oxygen consumption (VO2max). The reliability of the questionnaires was determined by means of a factor analysis. Convergent validity was obtained by comparing the questionnaires' scores against the amount of physical activity quantified by the pedometer and the VO2max reported. The questionnaires showed a weak internal consistency (WAC: α=0.59-0.78; OWR: α=0.53-0.73; GLTEQ: α=0.60). Moderate statistically significant correlations were found between the pedometer and the WAC (r=0.69; p <0.01) and the OWR (r=0.42; p <0.01), while a low statistically significant correlation was found for the GLTEQ (r=0.36; p=0.01). The estimated VO2max showed a low level of association with the WAC results (r=0.30; p <0.05), and the OWR results (r=0.29; p <0.05). When classifying the participants as active or inactive, the level of agreement with the pedometer was moderate for the WAC (k=0.46) and the OWR (k=0.44), and slight for the GLTEQ (k=0.20). Of the three questionnaires analysed, the WAC showed the best psychometric performance as it was the only one with respectable convergent validity, while sharing low reliability with the OWR and the GLTEQ. Copyright © 2016 SESPAS. Publicado por Elsevier España, S.L.U. All rights reserved.
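Agreement between two active/inactive classifications, such as questionnaire versus pedometer, is usually summarised with Cohen's kappa, which corrects the observed agreement for agreement expected by chance. The sketch below implements that standard definition. The function name and the labels are invented for illustration and do not reproduce the study's data.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two equally long sequences of categorical labels."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label sequences must be non-empty and of equal length")
    n = len(labels_a)
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    counts_a, counts_b = Counter(labels_a), Counter(labels_b)
    categories = set(counts_a) | set(counts_b)
    # Chance agreement from the marginal proportions of each rater.
    expected = sum(counts_a[c] * counts_b[c] for c in categories) / n ** 2
    if expected == 1:
        raise ValueError("kappa is undefined when chance agreement equals 1")
    return (observed - expected) / (1 - expected)


# Hypothetical classifications of four adolescents.
questionnaire = ["active", "active", "inactive", "inactive"]
pedometer = ["active", "inactive", "inactive", "inactive"]
print(cohen_kappa(questionnaire, pedometer))
```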
Thermal-Structural Analysis of PICA Tiles for Solar Tower Test
NASA Technical Reports Server (NTRS)
Agrawal, Parul; Empey, Daniel M.; Squire, Thomas H.
2009-01-01
Thermal protection materials used in spacecraft heatshields are subjected to severe thermal and mechanical loading environments during re-entry into Earth's atmosphere. In order to investigate the reliability of PICA tiles in the presence of high thermal gradients as well as mechanical loads, the authors designed and conducted solar-tower tests. This paper presents the design and analysis work for this test series. Coupled non-linear thermal-mechanical finite element analyses were conducted to estimate in-depth temperature distribution and stress contours for various cases. The first set of analyses, performed on an isolated PICA tile, showed that stresses generated during the tests were below the PICA allowable limit and should not lead to any catastrophic failure during the test. The test results were consistent with analytical predictions. The temperature distribution and magnitude of the measured strains were also consistent with predicted values. The second test series is designed to test the arrayed PICA tiles with various gap-filler materials. A nonlinear contact method is used to model the complex geometry with various tiles. The analyses for these coupons predict the stress contours in PICA and inside gap fillers. Suitable mechanical loads for this architecture will be predicted, which can be applied during the test to exceed the allowable limits and demonstrate failure modes. Thermocouple and strain-gauge data obtained from the solar tower tests will be used for subsequent analyses and validation of FEM models.
Portuguese Version of the Pain Beliefs and Perceptions Inventory: A Multicenter Validation Study.
Azevedo, Luís Filipe; Sampaio, Rute; Camila Dias, Cláudia; Romão, José; Lemos, Laurinda; Agualusa, Luís; Vaz-Serra, Sílvia; Patto, Teresa; Costa-Pereira, Altamiro; Castro-Lopes, José Manuel
2017-07-01
We aimed to perform the translation, cultural adaptation, and validation of the Pain Beliefs and Perceptions Inventory (PBPI) for the European Portuguese language and chronic pain population. This is a longitudinal multicenter validation study. A Portuguese version of the PBPI (PBPI-P) was created through a process of translation, back translation, and expert panel evaluation. The PBPI-P was administered to a total of 122 patients from 13 chronic pain clinics in Portugal, at baseline and after 7 days. Internal consistency and test-retest reliability were assessed by Cronbach's alpha (α) and intraclass correlation coefficient (ICC), respectively. Construct (convergent and discriminant) validity was assessed based on a set of previously developed theoretical hypotheses about interrelations between the PBPI-P and other measures. Exploratory and confirmatory factor analyses were performed to test the theoretical structure of the PBPI-P. The internal consistency and test-retest reliability coefficients for each respective subscale were α = 0.620 and ICC = 0.801 for mystery; α = 0.744 and ICC = 0.841 for permanence; α = 0.778 and ICC = 0.791 for constancy; and α = 0.764 and ICC = 0.881 for self-blame. Exploratory and confirmatory factor analysis revealed a four-factor structure (permanence, constancy, self-blame, and mystery) that explained 63% of the variance. The construct validity of the PBPI-P was shown to be adequate, with more than 90% of the previously defined hypotheses regarding interrelations with other measures confirmed. The PBPI-P has been shown to be adequate and to have excellent reliability, internal consistency, and validity. It may contribute to a better pain assessment and is suitable for research and clinical use. © 2016 World Institute of Pain.
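Test-retest ICCs like those reported above are derived from a two-way analysis of variance of a subjects-by-occasions table. The abstract does not say which ICC form was used, so the sketch below assumes the single-measure consistency form, often written ICC(3,1): (MSR - MSE) / (MSR + (k - 1) * MSE), where MSR is the between-subjects mean square and MSE the residual mean square. The function name and the data are hypothetical.

```python
def icc_consistency(table):
    """ICC(3,1) for a list of rows: one row per subject, one column per occasion."""
    n, k = len(table), len(table[0])
    grand = sum(sum(row) for row in table) / (n * k)
    row_means = [sum(row) / k for row in table]
    col_means = [sum(row[j] for row in table) / n for j in range(k)]
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in table for x in row)
    ss_error = ss_total - ss_rows - ss_cols
    ms_rows = ss_rows / (n - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (ms_rows + (k - 1) * ms_error)


# Hypothetical scores for four patients at baseline and at retest.
retest_table = [[1, 2], [2, 1], [3, 4], [4, 3]]
print(round(icc_consistency(retest_table), 3))
```

The consistency form ignores a uniform shift between occasions. An absolute-agreement form would penalise such a shift and would generally give a lower value.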
Reliability and coverage analysis of non-repairable fault-tolerant memory systems
NASA Technical Reports Server (NTRS)
Cox, G. W.; Carroll, B. D.
1976-01-01
A method was developed for the construction of probabilistic state-space models for nonrepairable systems. Models were developed for several systems which achieved reliability improvement by means of error-coding, modularized sparing, massive replication and other fault-tolerant techniques. From the models developed, sets of reliability and coverage equations for the systems were developed. Comparative analyses of the systems were performed using these equation sets. In addition, the effects of varying subunit reliabilities on system reliability and coverage were described. The results of these analyses indicated that a significant gain in system reliability may be achieved by use of combinations of modularized sparing, error coding, and software error control. For sufficiently reliable system subunits, this gain may far exceed the reliability gain achieved by use of massive replication techniques, yet result in a considerable saving in system cost.
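The comparison between replication and sparing described above can be illustrated with two textbook closed-form expressions. These are not the state-space models of the report. The sketch assumes independent, identical subunits with reliability r, a k-of-n replicated system with a perfect voter, and a single active spare whose takeover succeeds with a coverage probability. All function names and numbers are illustrative.

```python
from math import comb


def k_of_n_reliability(k, n, r):
    """Probability that at least k of n independent units (each reliability r) work."""
    return sum(comb(n, i) * r ** i * (1 - r) ** (n - i) for i in range(k, n + 1))


def one_spare_reliability(r, coverage):
    """Primary works, or primary fails, the fault is covered, and the spare works."""
    return r + (1 - r) * coverage * r


unit_reliability = 0.9
tmr = k_of_n_reliability(2, 3, unit_reliability)           # triple replication, majority vote
spared = one_spare_reliability(unit_reliability, coverage=1.0)
spared_imperfect = one_spare_reliability(unit_reliability, coverage=0.5)
print(round(tmr, 3), round(spared, 3), round(spared_imperfect, 3))
```

Under these simplified assumptions, one well-covered spare (two units in total) gives a higher system reliability than triple replication (three units), and the advantage shrinks as coverage drops. This is in line with the abstract's emphasis on both reliability and coverage.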
Quasi-Static Probabilistic Structural Analyses Process and Criteria
NASA Technical Reports Server (NTRS)
Goldberg, B.; Verderaime, V.
1999-01-01
Current deterministic structural methods are easily applied to substructures and components, and analysts have built great design insights and confidence in them over the years. However, deterministic methods cannot support systems risk analyses, and it was recently reported that deterministic treatment of statistical data is inconsistent with error propagation laws that can result in unevenly conservative structural predictions. Assuming normal distributions and using statistical data formats throughout prevailing stress deterministic processes lead to a safety factor in statistical format, which integrated into the safety index, provides a safety factor and first order reliability relationship. The embedded safety factor in the safety index expression allows a historically based risk to be determined and verified over a variety of quasi-static metallic substructures consistent with the traditional safety factor methods and NASA Std. 5001 criteria.
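A textbook form of the relationship between a safety factor and the first-order safety index is sketched below. It assumes independent, normally distributed resistance R and applied stress S. The symbols and the central safety factor SF = mu_R / mu_S are assumptions made for illustration, and the report's own statistical safety factor may be defined differently (for example, from tolerance limits rather than means).

```latex
% Safety index for independent normal resistance R and stress S
\beta = \frac{\mu_R - \mu_S}{\sqrt{\sigma_R^{2} + \sigma_S^{2}}}

% Dividing numerator and denominator by \mu_S, with SF = \mu_R / \mu_S and
% coefficients of variation \eta_R = \sigma_R / \mu_R, \eta_S = \sigma_S / \mu_S:
\beta = \frac{SF - 1}{\sqrt{SF^{2}\,\eta_R^{2} + \eta_S^{2}}}

% First-order reliability and failure probability (\Phi is the standard normal CDF)
\mathcal{R} = \Phi(\beta), \qquad P_f = 1 - \Phi(\beta)
```

In this form the safety factor is embedded in the safety index, so a chosen SF together with the two coefficients of variation fixes the implied reliability.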
Sorsdahl, Katherine; Stein, Dan J; Myers, Bronwyn
2017-04-01
The Social Problem Solving Inventory-Revised Short-Form (SPSI-R:SF) has been used in several countries to identify problem-solving deficits among clinical and general populations in order to guide cognitive-behavioural interventions. Yet, very few studies have evaluated its psychometric properties. Three language versions of the questionnaire were administered to a general population sample comprising 1000 participants (771 English-, 178 Afrikaans- and 101 Xhosa-speakers). Of these participants, 210 were randomly selected to establish test-retest reliability (70 in each language). Principal component analysis was performed to examine the applicability of the factor structure of the original questionnaire to the South African data. Supplementary psychometric analyses were performed, including internal consistency and test-retest reliability. Collectively, results provide initial evidence of the reliability and validity of the SPSI-R:SF for the assessment of problem solving deficits in South Africa. Further studies that explore how the Afrikaans language version of the SPSI-R:SF can be improved and that establish the predictive validity of scores on the SPSI-R:SF are needed. © 2015 International Union of Psychological Science.
Franklin, Ashley E; Burns, Paulette; Lee, Christopher S
2014-10-01
In 2006, the National League for Nursing published three measures related to novice nurses' beliefs about self-confidence, scenario design, and educational practices associated with simulation. Despite the extensive use of these measures, little is known about their reliability and validity. The psychometric properties of the Student Satisfaction and Self-Confidence in Learning Scale, Simulation Design Scale, and Educational Practices Questionnaire were studied among a sample of 2200 surveys completed by novice nurses from a liberal arts university in the southern United States. Psychometric tests included item analysis, confirmatory and exploratory factor analyses in randomly-split subsamples, concordant and discordant validity, and internal consistency. All three measures have sufficient reliability and validity to be used in education research. There is room for improvement in content validity with the Student Satisfaction and Self-Confidence in Learning and Simulation Design Scale. This work provides robust evidence to ensure that judgments made about self-confidence after simulation, simulation design and educational practices are valid and reliable. Copyright © 2014 Elsevier Ltd. All rights reserved.
Rubio-Ochoa, J; Benítez-Martínez, J; Lluch, E; Santacruz-Zaragozá, S; Gómez-Contreras, P; Cook, C E
2016-02-01
It has been suggested that differential diagnosis of headaches should consist of a robust subjective examination and a detailed physical examination of the cervical spine. Cervicogenic headache (CGH) is a form of headache that involves referred pain from the neck. To our knowledge, no studies have summarized the reliability and diagnostic accuracy of physical examination tests for CGH. The aim of this study was to summarize the reliability and diagnostic accuracy of physical examination tests used to diagnose CGH. A systematic review following PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines was performed in four electronic databases (MEDLINE, Web of Science, Embase and Scopus). Full text reports concerning physical tests for the diagnosis of CGH which reported the clinometric properties for assessment of CGH, were included and screened for methodological quality. Quality Appraisal for Reliability Studies (QAREL) and Quality Assessment of Studies of Diagnostic Accuracy (QUADAS-2) scores were completed to assess article quality. Eight articles were retrieved for quality assessment and data extraction. Studies investigating diagnostic reliability of physical examination tests for CGH scored poorer on methodological quality (higher risk of bias) than those of diagnostic accuracy. There is sufficient evidence showing high levels of reliability and diagnostic accuracy of the selected physical examination tests for the diagnosis of CGH. The cervical flexion-rotation test (CFRT) exhibited both the highest reliability and the strongest diagnostic accuracy for the diagnosis of CGH. Copyright © 2015 Elsevier Ltd. All rights reserved.
Automated Segmentability Index for Layer Segmentation of Macular SD-OCT Images.
Lee, Kyungmoo; Buitendijk, Gabriëlle H S; Bogunovic, Hrvoje; Springelkamp, Henriët; Hofman, Albert; Wahle, Andreas; Sonka, Milan; Vingerling, Johannes R; Klaver, Caroline C W; Abràmoff, Michael D
2016-03-01
To automatically identify which spectral-domain optical coherence tomography (SD-OCT) scans will provide reliable automated layer segmentations for more accurate layer thickness analyses in population studies. Six hundred ninety macular SD-OCT image volumes (6.0 × 6.0 × 2.3 mm³) were obtained from one eye of each of 690 subjects (74.6 ± 9.7 [mean ± SD] years, 37.8% male) randomly selected from the population-based Rotterdam Study. The dataset consisted of 420 OCT volumes with successful automated retinal nerve fiber layer (RNFL) segmentations obtained from our previously reported graph-based segmentation method and 270 volumes with failed segmentations. To evaluate the reliability of the layer segmentations, we have developed a new metric, segmentability index SI, which is obtained from a random forest regressor based on 12 features using OCT voxel intensities, edge-based costs, and on-surface costs. The SI was compared with well-known quality indices, quality index (QI), and maximum tissue contrast index (mTCI), using receiver operating characteristic (ROC) analysis. The area under the curve (AUC) was 0.713 (95% confidence interval [CI] 0.621 to 0.805) for the QI, 0.756 (95% CI 0.673 to 0.838) for the mTCI, and 0.852 (95% CI 0.784 to 0.920) for the SI. The SI AUC is significantly larger than either the QI or mTCI AUC (P < 0.01). The segmentability index SI is well suited to identify SD-OCT scans for which successful automated intraretinal layer segmentations can be expected. Interpreting the quantification of SD-OCT images requires the underlying segmentation to be reliable, but standard SD-OCT quality metrics do not predict which segmentations are reliable and which are not. The segmentability index SI presented in this study does allow reliable segmentations to be identified, which is important for more accurate layer thickness analyses in research and population studies.
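The AUC used above to compare quality indices equals the probability that a randomly chosen scan with a successful segmentation receives a higher index value than a randomly chosen scan with a failed one, with ties counted as one half. The sketch below computes it by direct pairwise comparison. The function name and the scores are hypothetical, and confidence intervals are not covered.

```python
def roc_auc(scores_success, scores_failure):
    """AUC as the fraction of (success, failure) pairs ranked correctly; ties count 0.5."""
    if not scores_success or not scores_failure:
        raise ValueError("both groups must be non-empty")
    wins = 0.0
    for s in scores_success:
        for f in scores_failure:
            if s > f:
                wins += 1.0
            elif s == f:
                wins += 0.5
    return wins / (len(scores_success) * len(scores_failure))


# Hypothetical index values for scans with successful and failed segmentations.
successful = [0.9, 0.8, 0.4]
failed = [0.7, 0.3, 0.4]
print(round(roc_auc(successful, failed), 3))
```

The pairwise loop is quadratic in the number of scans. For a dataset of the size described (420 versus 270 volumes) that is still only about 113,000 comparisons.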
Togari, Taisuke; Yamazaki, Yoshihiko; Koide, Syotaro; Miyata, Ayako
2006-01-01
In community and workplace health plans, the Perceived Health Competence Scale (PHCS) is employed as an index of health competency. The purpose of this research was to examine the reliability and validity of a modified Japanese PHCS. Interviews were sought with 3,000 randomly selected Japanese individuals using a two-step stratified method. Valid PHCS responses were obtained from 1,910 individuals, yielding a 63.7% response rate. Reliability was assessed using Cronbach's alpha coefficient (henceforth, alpha) to evaluate internal consistency, and by employing item-total correlation and alpha coefficient analyses to assess the effect of removal of variables from the model. To examine content validity, we assessed the correlation between the PHCS score and four respondent attribute characteristics, that is, sex, age, the presence of chronic disease, and the existence of chronic disease at age 18. The correlation between PHCS score and commonly employed healthy lifestyle indices was examined to assess construct validity. General linear model statistical analysis was employed. The modified Japanese PHCS demonstrated a satisfactory alpha coefficient of 0.869. Moreover, reliability was confirmed by item-total correlation and alpha coefficient analyses after removal of variables from the model. Differences in PHCS scores were seen between individuals 60 years and older, and younger individuals. Those with current chronic disease, or who had had a chronic disease at age 18, tended to have lower PHCS scores. After controlling for the presence of current or age 18 chronic disease, age, and sex, significant correlations were seen between PHCS scores and tobacco use, dietary habits, and exercise, but not alcohol use or frequency of medical consultation. This study supports the reliability and validity, and hence supports the use, of the modified Japanese PHCS. 
Future longitudinal research is needed to evaluate the predictive power of modified Japanese PHCS scores, to examine factors influencing the development of perceived health competence, and to assess the effects of interventions on perceived health competence.
Wan, Chonghua; Li, Hezhan; Fan, Xuejin; Yang, Ruixue; Pan, Jiahua; Chen, Wenru; Zhao, Rong
2014-06-04
Quality of life (QOL) for patients with coronary heart disease (CHD) is now a worldwide concern, yet disease-specific instruments are scarce and none has been developed by the modular approach. This paper aims to develop the CHD scale of the system of Quality of Life Instruments for Chronic Diseases (QLICD-CHD) by the modular approach and to validate it using both classical test theory and Generalizability Theory. The QLICD-CHD was developed based on programmed decision procedures with multiple nominal and focus group discussions, in-depth interviews, pre-testing and quantitative statistical procedures. A total of 146 inpatients with CHD provided the data, with QOL measured three times before and after treatments. The psychometric properties of the scale were evaluated with respect to validity, reliability and responsiveness employing correlation analysis, factor analyses, multi-trait scaling analysis, t-tests and also G studies and D studies of Generalizability Theory analysis. Multi-trait scaling analysis, correlation and factor analyses confirmed good construct validity and criterion-related validity when using SF-36 as a criterion. The internal consistency α and test-retest reliability coefficients (Pearson r and intra-class correlation, ICC) for the overall instrument and all domains were higher than 0.70 and 0.80, respectively. The overall score and all domains except for the social domain had statistically significant changes after treatments, with moderate effect sizes (standardized response mean, SRM) ranging from 0.32 to 0.67. G-coefficients and index of dependability (Ф coefficients) confirmed the reliability of the scale further with more exact variance components. The QLICD-CHD has good validity, reliability, and moderate responsiveness and some highlights, and can be used as the quality of life instrument for patients with CHD. 
However, in order to obtain better reliability, the number of items in the social domain should be increased, or the quality of the items, rather than their quantity, should be improved.
Validation of a Malay Version of the Smartphone Addiction Scale among Medical Students in Malaysia.
Ching, Siew Mooi; Yee, Anne; Ramachandran, Vasudevan; Sazlly Lim, Sazlyna Mohd; Wan Sulaiman, Wan Aliaa; Foo, Yoke Loong; Hoo, Fan Kee
2015-01-01
This study was initiated to determine the psychometric properties of the Smart Phone Addiction Scale (SAS) by translating and validating this scale into the Malay language (SAS-M), which is the main language spoken in Malaysia. This study can distinguish smart phone and internet addiction among multi-ethnic Malaysian medical students. In addition, the reliability and validity of the SAS were also demonstrated. A total of 228 participants were selected between August 2014 and September 2014 to complete a set of questionnaires, including the SAS and the modified Kimberly Young Internet addiction test (IAT) in the Malay language. There were 99 males and 129 females with ages ranging from 19 to 22 years old (21.7±1.1) included in this study. Descriptive and factor analyses, intra-class coefficients, t-tests and correlation analyses were conducted to verify the reliability and validity of the SAS. Bartlett's test of sphericity was significant (p <0.01), and the Kaiser-Meyer-Olkin measure of sampling adequacy for the SAS-M was 0.92, indicating that factor analysis was appropriate. The internal consistency and concurrent validity of the SAS-M were verified (Cronbach's alpha = 0.94). All of the subscales of the SAS-M, except for positive anticipation, were significantly related to the Malay version of the IAT. This study developed the first smart phone addiction scale among medical students. This scale was shown to be reliable and valid in the Malay language.
Noorbakhsh, Simasadat; Shams, Jamal; Faghihimohamadi, Mohamadmahdi; Zahiroddin, Hanieh; Hallgren, Mats; Kallmen, Hakan
2018-01-30
Iran is a developing and Islamic country where the consumption of alcoholic beverages is banned. However, psychiatric disorders and alcohol use disorders are often co-occurring. We used the Alcohol Use Disorders Identification Test (AUDIT) to estimate the prevalence of alcohol use and examined the psychometric properties of the test among psychiatric outpatients in Teheran, Iran. AUDIT was completed by 846 consecutive (sequential) patients. Descriptive statistics, internal consistency (Cronbach alpha), confirmatory and exploratory factor analyses were used to analyze the prevalence of alcohol use, reliability and construct validity. 12% of men and 1% of women were hazardous alcohol consumers. Internal reliability of the Iranian version of AUDIT was excellent. Confirmatory factor analyses showed that the construct validity and the fit of previous factor structures (1, 2 and 3 factors) to data were not good and seemingly contradicted results from the explorative principal axis factoring, which showed that a 1-factor solution explained 77% of the co-variances. We could not reproduce the suggested factor structure of AUDIT, probably due to the skewed distribution of alcohol consumption. Only 19% of men and 3% of women scored above 0 on AUDIT. This could be explained by the fact that alcohol is illegal in Iran. In conclusion the AUDIT exhibited good internal reliability when used as a single scale. The prevalence estimates according to AUDIT were somewhat higher among psychiatric patients compared to what was reported by WHO regarding the general population.
Cebolla, Ausias; Luciano, Juan V; DeMarzo, Marcelo Piva; Navarro-Gil, Mayte; Campayo, Javier Garcia
2013-01-14
Mindful-based interventions improve functioning and quality of life in fibromyalgia (FM) patients. The aim of the study is to perform a psychometric analysis of the Spanish version of the Mindful Attention Awareness Scale (MAAS) in a sample of patients diagnosed with FM. The following measures were administered to 251 Spanish patients with FM: the Spanish version of MAAS, the Chronic Pain Acceptance Questionnaire, the Pain Catastrophising Scale, the Injustice Experience Questionnaire, the Psychological Inflexibility in Pain Scale, the Fibromyalgia Impact Questionnaire and the Euroqol. Factorial structure was analysed using Confirmatory Factor Analyses (CFA). Cronbach's α coefficient was calculated to examine internal consistency, and the intraclass correlation coefficient (ICC) was calculated to assess the test-retest reliability of the measures. Pearson's correlation tests were run to evaluate univariate relationships between scores on the MAAS and criterion variables. The MAAS scores in our sample were low (M = 56.7; SD = 17.5). CFA confirmed a two-factor structure, with the following fit indices: SB χ² = 172.34 (p < 0.001), CFI = 0.95, GFI = 0.90, SRMR = 0.05, RMSEA = 0.06. MAAS was found to have high internal consistency (Cronbach's α = 0.90) and adequate test-retest reliability at a 1-2 week interval (ICC = 0.90). It showed significant and expected correlations with the criterion measures with the exception of the Euroqol (Pearson = 0.15). Psychometric properties of the Spanish version of the MAAS in patients with FM are adequate. The dimensionality of the MAAS found in this sample and directions for future research are discussed.
Kirouac, Megan; Stein, Elizabeth R; Pearson, Matthew R; Witkiewitz, Katie
2017-11-01
Quality of life is an outcome often examined in treatment research contexts such as biomedical trials, but has been studied less often in alcohol use disorder (AUD) treatment. The importance of considering QoL in substance use treatment research has recently been voiced, and measures of QoL have been administered in large AUD treatment trials. Yet, the viability of popular QoL measures has never been evaluated in AUD treatment samples. Accordingly, the present manuscript describes a psychometric examination of and prospective changes in the World Health Organization Quality of Life measure (WHOQOL-BREF) in a large sample (N = 1383) of patients with AUD recruited for the COMBINE Study. Specifically, we examined the construct validity (via confirmatory factor analyses), measurement invariance across time, internal consistency reliability, convergent validity, and effect sizes of post-treatment changes in the WHOQOL-BREF. Confirmatory factor analyses of the WHOQOL-BREF provided acceptable fit to the current data and this model was invariant across time. Internal consistency reliability was excellent (α > .9) for the full WHOQOL-BREF for each timepoint; the WHOQOL-BREF had good convergent validity, and medium effect size improvements were found in the full COMBINE sample across time. These findings suggest that the WHOQOL-BREF is an appropriate measure to use in samples with AUD, that the WHOQOL-BREF scores may be examined over time (e.g., from pre- to post-treatment), and the WHOQOL-BREF may be used to assess improvements in quality of life in AUD research.
Greeven, Anja; Spinhoven, Philip; van Balkom, Anton J L M
2009-01-01
This study investigated the psychometric properties of the first clinician-administered semi-structured interview for assessing the severity of hypochondriacal symptoms. The Hypochondriasis Yale-Brown Obsessive-Compulsive Scale (H-YBOCS) consisted of three a priori dimensions: hypochondriacal obsessions, compulsions and avoidance. The 16-item interview was conducted with 112 participants with Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, hypochondriasis. We analysed factor analytic structure, reliability, construct validity and sensitivity to change. Factor analysis supported a three-factor model similar to the a priori dimensions. Internal consistency ranged from satisfactory to good. Inter-rater reliability was excellent. The construct validity was low to moderate. The H-YBOCS was sensitive for measuring changes in symptom severity. The H-YBOCS is a (factorially) valid and coherent interview with a high level of agreement across different raters. The relatively low discriminant validity could be due to co-morbid anxiety and depressive disorders. Overall, the H-YBOCS seems to be a promising contribution to the assessment of hypochondriasis. *The hypochondriasis Y-BOCS is a feasible clinician rated interview to assess the severity of hypochondriacal complaints.
Reblin, Maija; Clayton, Margaret F; John, Kevin K; Ellington, Lee
2016-07-01
In this article, we present strategies for collecting and coding a large longitudinal communication data set collected across multiple sites, consisting of more than 2000 hours of digital audio recordings from approximately 300 families. We describe our methods within the context of implementing a large-scale study of communication during cancer home hospice nurse visits, but this procedure could be adapted to communication data sets across a wide variety of settings. This research is the first study designed to capture home hospice nurse-caregiver communication, a highly understudied location and type of communication event. We present a detailed example protocol encompassing data collection in the home environment, large-scale, multisite secure data management, the development of theoretically-based communication coding, and strategies for preventing coder drift and ensuring reliability of analyses. Although each of these challenges has the potential to undermine the utility of the data, reliability between coders is often the only issue consistently reported and addressed in the literature. Overall, our approach demonstrates rigor and provides a "how-to" example for managing large, digitally recorded data sets from collection through analysis. These strategies can inform other large-scale health communication research.
Dittmann, Ralf W; Wehmeier, Peter M; Schacht, Alexander; Lehmann, Martin; Lehmkuhl, Gerd
2009-12-01
To report on (1) psychometric properties of the Rosenberg Self-Esteem Scale (SES) studied in adolescents with ADHD, (2) correlations of SES with ADHD scale scores, and (3) change in patient-reported self-esteem with atomoxetine treatment. ADHD patients (12-17 years), treated in an open-label study for 24 weeks. Secondary analyses on ADHD symptoms (assessed with ADHD-RS, CGI, GIPD scales) and self-esteem (SES) were performed. One hundred and fifty-nine patients were treated. A dichotomous structure of the SES could be confirmed. Reliability and internal consistency were moderate to excellent. Highest coefficients were found for the correlation between SES and GIPD scores. Self-esteem significantly increased over time, accompanied by an improvement of ADHD symptoms and related perceived difficulties. The Rosenberg SES was shown to be internally consistent, reliable, and sensitive to treatment-related changes of self-esteem. According to these findings, self-esteem may be an important individual patient outcome beyond the core symptoms of ADHD. © The Author(s) 2009. This article is published with open access at Springerlink.com
Castillo, Isabel; Tomás, Inés; Ntoumanis, Nikos; Bartholomew, Kimberley; Duda, Joan L; Balaguer, Isabel
2014-01-01
The purpose of this research was to translate into Spanish and examine the psychometric properties of the Spanish version of the Controlling Coach Behaviors Scale (CCBS) in male soccer players. The CCBS is a questionnaire designed to assess athletes' perceptions of sports coaches' controlling interpersonal style from the perspective of the self-determination theory. Study 1 tested the factorial structure of the translated scale using confirmatory factor analysis (CFA) and provided evidence of discriminant validity. Studies 2 and 3 examined the invariance across time and across competitive level via multi-sample CFA. Reliability analyses were also conducted. The CFA results revealed that a four-factor model was acceptable, indicating that a controlling interpersonal style is a multidimensional construct represented by four separate and related controlling coaching strategies. Further, results supported the invariance of the CCBS factor structure across time and competitive level and provided support for the internal consistency of the scale. Overall, the CCBS demonstrated adequate internal consistency, as well as good factorial validity. The Spanish version of the CCBS represents a valid and reliable adaptation of the instrument, which can be confidently used to measure soccer players' perceptions of their coaches' controlling interpersonal style.
Rapid screening for perceived cognitive impairment in major depressive disorder.
Iverson, Grant L; Lam, Raymond W
2013-05-01
Subjectively experienced cognitive impairment is common in patients with mood disorders. The British Columbia Cognitive Complaints Inventory (BC-CCI) is a 6-item scale that measures perceived cognitive problems. The purpose of this study is to examine the reliability of the scale in healthy volunteers and depressed patients and to evaluate the sensitivity of the measure to perceived cognitive problems in depression. Participants were 62 physician-diagnosed inpatients or outpatients with depression, who had independently confirmed diagnoses on the Structured Clinical Interview for DSM-IV, and a large sample of healthy community volunteers (n=112). The internal consistency reliability of the BC-CCI was α=.86 for patients with depression and α=.82 for healthy controls. Principal components analyses revealed a one-factor solution accounting for 54% of the total variability in the control sample and a 2-factor solution (cognitive impairment and difficulty with expressive language) accounting for 76% of the variance in the depression sample. The total score difference between the groups was very large (Cohen's d=2.2). The BC-CCI has high internal consistency in both depressed patients and community controls, despite its small number of items. The test is sensitive to cognitive complaints in patients with depression.
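The group difference above is expressed as Cohen's d. The abstract does not say which standardizer was used; the sketch below assumes the common pooled-standard-deviation form, and the function name and example values are illustrative only.

```python
from math import sqrt
from statistics import mean, variance

def cohens_d(group_a, group_b):
    """Cohen's d using the pooled sample standard deviation (an assumed convention)."""
    n_a, n_b = len(group_a), len(group_b)
    pooled_var = (
        (n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)
    ) / (n_a + n_b - 2)
    return (mean(group_a) - mean(group_b)) / sqrt(pooled_var)

# Hypothetical scores, not BC-CCI data
d = cohens_d([4, 5, 6], [1, 2, 3])
```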
Keilmann, Annerose; Friese, Barbara; Lässig, Anne; Hoffmann, Vanessa
2018-04-01
The introduction of neonatal hearing screening and the increasingly early age at which children can receive a cochlear implant have intensified the need for a validated questionnaire to assess the speech production of children aged 0‒18 months. Such a questionnaire has been created: the LittlEARS® Early Speech Production Questionnaire (LEESPQ). This study aimed to validate a second, revised edition of the LEESPQ. Questionnaires were returned for 362 children with normal hearing. Completed questionnaires were analysed to determine whether the LEESPQ is reliable, prognostically accurate, and internally consistent, and whether gender or multilingualism affects total scores. Total scores correlated positively with age. The LEESPQ is reliable, accurate, and consistent, and independent of gender or lingual status. A norm curve was created. This second version of the LEESPQ is a valid tool to assess the speech production development of children with normal hearing, aged 0‒18 months, regardless of their gender. As such, the LEESPQ may be a useful tool to monitor the development of paediatric hearing device users. The second version of the LEESPQ is a valid instrument for assessing early speech production of children aged 0‒18 months.
Reblin, Maija; Clayton, Margaret F; John, Kevin K; Ellington, Lee
2015-01-01
In this paper, we present strategies for collecting and coding a large longitudinal communication dataset collected across multiple sites, consisting of over 2000 hours of digital audio recordings from approximately 300 families. We describe our methods within the context of implementing a large-scale study of communication during cancer home hospice nurse visits, but this procedure could be adapted to communication datasets across a wide variety of settings. This research is the first study designed to capture home hospice nurse-caregiver communication, a highly understudied location and type of communication event. We present a detailed example protocol encompassing data collection in the home environment, large-scale, multi-site secure data management, the development of theoretically-based communication coding, and strategies for preventing coder drift and ensuring reliability of analyses. Although each of these challenges has the potential to undermine the utility of the data, reliability between coders is often the only issue consistently reported and addressed in the literature. Overall, our approach demonstrates rigor and provides a “how-to” example for managing large, digitally-recorded data sets from collection through analysis. These strategies can inform other large-scale health communication research. PMID:26580414
Savoia, Elena; Biddinger, Paul D; Burstein, Jon; Stoto, Michael A
2010-01-01
As proxies for actual emergencies, drills and exercises can raise awareness, stimulate improvements in planning and training, and provide an opportunity to examine how different components of the public health system would combine to respond to a challenge. Despite these benefits, there remains a substantial need for widely accepted and prospectively validated tools to evaluate agencies' and hospitals' performance during such events. Unfortunately, to date, few studies have focused on addressing this need. The purpose of this study was to assess the validity and reliability of a qualitative performance assessment tool designed to measure hospitals' communication and operational capabilities during a functional exercise. The study population included 154 hospital personnel representing nine hospitals that participated in a functional exercise in Massachusetts in June 2008. A 25-item questionnaire was developed to assess the following three hospital functional capabilities: (1) inter-agency communication; (2) communication with the public; and (3) disaster operations. Analyses were conducted to examine internal consistency, associations among scales, the empirical structure of the items, and inter-rater agreement. Twenty-two questions were retained in the final instrument, which demonstrated reliability with alpha coefficients of 0.83 or higher for all scales. A three-factor solution from the principal components analysis accounted for 57% of the total variance, and the factor structure was consistent with the original hypothesized domains. Inter-rater agreement between participants' self reported scores and external evaluators' scores ranged from moderate to good. The resulting 22-item performance measurement tool reliably measured hospital capabilities in a functional exercise setting, with preliminary evidence of concurrent and criterion-related validity.
Lin, Chung-Ying; Pakpour, Amir H
2017-02-01
Mood disorders are a critical problem in people with epilepsy, so a useful screening tool needs to be validated for this population. The Hospital Anxiety and Depression Scale (HADS) has been used in this population and has been shown to be a satisfactory screening tool. However, more evidence on its construct validity is needed. A total of 1041 people with epilepsy were recruited in this study, and each completed the HADS. Confirmatory factor analysis (CFA) and Rasch analysis were used to examine the construct validity of the HADS. In addition, internal consistency was tested using Cronbach's α, person separation reliability, and item separation reliability. Ordering of the response descriptors and differential item functioning (DIF) were examined using the Rasch models. Based on the HADS cutoffs, 55.3% of our participants had anxiety and 56.0% had depression. CFA and Rasch analyses both showed satisfactory construct validity of the HADS; the internal consistency was also acceptable (α=0.82 in anxiety and 0.79 in depression; person separation reliability=0.82 in anxiety and 0.73 in depression; item separation reliability=0.98 in anxiety and 0.91 in depression). The difficulties of the four-point Likert scale categories used in the HADS increased monotonically, which indicates no disordered response categories. No items in the HADS displayed DIF across male and female patients or across types of epilepsy. The HADS has promising psychometric properties in terms of construct validity in people with epilepsy. Moreover, the additive item score is supported for calculating the cutoff. Copyright © 2016 British Epilepsy Association. Published by Elsevier Ltd. All rights reserved.
Kwan, Yu Heng; Fong, Warren Weng Seng; Lui, Nai Lee; Yong, Si Ting; Cheung, Yin Bun; Malhotra, Rahul; Østbye, Truls; Thumboo, Julian
2016-12-01
The Short Form 36 Health Survey (SF-36) is a popular health-related quality of life (HrQoL) tool. However, few studies have assessed its psychometric properties in patients with spondyloarthritis (SpA). We therefore aimed to assess the reliability and validity of the SF-36 in patients with SpA in Singapore. Cross-sectional data from a registry of 196 SpA patients recruited from a dedicated tertiary referral clinic in Singapore from 2011 to 2014 were used. Analyses were guided by the COnsensus-based Standards for the selection of health Measurement INstruments framework. Internal consistency reliability was assessed using Cronbach's alpha. Construct validity was assessed through 33 a priori hypotheses by correlations of the eight subscales and two summary scores of SF-36 with other health outcomes. Known-group construct validity was assessed by comparison of the means of the subscales and summary scores of the SF-36 of SpA patients and the general population of Singapore using Student's t tests. Among 196 patients (155 males (79.0%), median (range) age: 36 (17-70), 166 Chinese (84.6%)), SF-36 scales showed high internal consistency ranging from 0.88 to 0.90. Convergent construct validity was supported as shown by fulfillment of all hypotheses. Divergent construct validity was supported, as SF-36 MCS was not associated with PGA, pain and HAQ. Known-group construct validity showed SpA patients had lower scores of 3.8-12.5 when compared to the general population at p < 0.001. This study supports the SF-36 as a valid and reliable measure of HrQoL for use in patients with SpA at a single time point.
Reliability studies of Integrated Modular Engine system designs
NASA Technical Reports Server (NTRS)
Hardy, Terry L.; Rapp, Douglas C.
1993-01-01
A study was performed to evaluate the reliability of Integrated Modular Engine (IME) concepts. Comparisons were made between networked IME systems and non-networked discrete systems using expander cycle configurations. Both redundant and non-redundant systems were analyzed. Binomial approximation and Markov analysis techniques were employed to evaluate total system reliability. In addition, Failure Modes and Effects Analyses (FMEA), Preliminary Hazard Analyses (PHA), and Fault Tree Analysis (FTA) were performed to allow detailed evaluation of the IME concept. A discussion of these system reliability concepts is also presented.
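The abstract above mentions a binomial approximation for redundant versus non-redundant systems. A minimal sketch of that general idea follows, assuming identical, independent components in a k-out-of-n arrangement; the function name, the engine counts and the component reliability are hypothetical and do not come from the report.

```python
from math import comb

def k_out_of_n_reliability(n, k, p):
    """Probability that at least k of n independent components work,
    when each component works with probability p (binomial model)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))

# Hypothetical comparison: three units with one spare vs. two units, no spare
redundant = k_out_of_n_reliability(n=3, k=2, p=0.9)
non_redundant = k_out_of_n_reliability(n=2, k=2, p=0.9)
```

Under these assumptions the redundant arrangement tolerates one failure, which is why its computed reliability exceeds that of the non-redundant one; a Markov analysis would be needed to model repair or dependent failures.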
Reliability studies of integrated modular engine system designs
NASA Astrophysics Data System (ADS)
Hardy, Terry L.; Rapp, Douglas C.
1993-06-01
A study was performed to evaluate the reliability of Integrated Modular Engine (IME) concepts. Comparisons were made between networked IME systems and non-networked discrete systems using expander cycle configurations. Both redundant and non-redundant systems were analyzed. Binomial approximation and Markov analysis techniques were employed to evaluate total system reliability. In addition, Failure Modes and Effects Analyses (FMEA), Preliminary Hazard Analyses (PHA), and Fault Tree Analysis (FTA) were performed to allow detailed evaluation of the IME concept. A discussion of these system reliability concepts is also presented.
Qi, Bing-Bing; Resnick, Barbara
2014-01-01
To assess the psychometric properties of the Chinese versions of the self-efficacy and outcome expectations on osteoporosis medication adherence (SEOMA-C and OEOMA-C) scales. Back-translated tools were assessed by internal consistency and R² by structural equation modeling, confirmatory factor analyses, hypothesis testing, and criterion-related validity among 110 (81 females, 29 males) Mandarin-speaking immigrants (mean age = 63.44, SD = 9.63). The Cronbach's alpha for SEOMA-C and OEOMA-C was .904 and .937, respectively. There was fair to good fit of the measurement model to the data. Previous bone mineral density (BMD) testing, calcaneus BMD, self-efficacy for exercise, and osteoporosis medication adherence were positively related to SEOMA-C scores. These scales showed preliminary evidence of validity and reliability. More refined and culturally sensitive items could be explored and added.
Finite element analysis of wirelessly interrogated implantable bio-MEMS
NASA Astrophysics Data System (ADS)
Dissanayake, Don W.; Al-Sarawi, Said F.; Lu, Tien-Fu; Abbott, Derek
2008-12-01
Wirelessly interrogated bio-MEMS devices are becoming more popular as a way to address challenges such as improving diagnosis, monitoring, and patient wellbeing. The authors present here a passive, low-power and small-area device, which can be interrogated wirelessly using a uniquely coded signal for secure and reliable operation. The proposed new approach relies on converting the interrogating coded signal to a surface acoustic wave that is then correlated with an embedded code. The suggested method is implemented to operate a micropump, which consists of a specially designed corrugated microdiaphragm to modulate the fluid flow in microchannels. Finite Element Analysis of the micropump operation is presented and its performance is analysed. Design parameters of the diaphragm were fine-tuned for optimal performance, and different polymer-based materials were used in various parts of the micropump to allow for better flexibility and high reliability.
Psychophysical measurements in children: challenges, pitfalls, and considerations.
Witton, Caroline; Talcott, Joel B; Henning, G Bruce
2017-01-01
Measuring sensory sensitivity is important in studying development and developmental disorders. However, with children, there is a need to balance reliable but lengthy sensory tasks with the child's ability to maintain motivation and vigilance. We used simulations to explore the problems associated with shortening adaptive psychophysical procedures, and suggest how these problems might be addressed. We quantify how adaptive procedures with too few reversals can over-estimate thresholds, introduce substantial measurement error, and make estimates of individual thresholds less reliable. The associated measurement error also obscures group differences. Adaptive procedures with children should therefore use as many reversals as possible, to reduce the effects of both Type 1 and Type 2 errors. Differences in response consistency, resulting from lapses in attention, further increase the over-estimation of threshold. Comparisons between data from individuals who may differ in lapse rate are therefore problematic, but measures to estimate and account for lapse rates in analyses may mitigate this problem.
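The kind of simulation described above can be sketched with a simulated observer and a transformed staircase. The sketch below assumes a 2-down-1-up rule, a logistic two-alternative forced-choice observer and a threshold estimate taken as the mean of the reversal levels; none of these choices, names or parameter values is stated in the abstract, so treat them as illustrative.

```python
import random
from math import exp

def p_correct(stimulus, threshold, slope=1.0, lapse=0.0):
    """Assumed 2AFC observer: guesses on lapse trials, otherwise logistic detection."""
    p_detect = 1.0 / (1.0 + exp(-slope * (stimulus - threshold)))
    return lapse * 0.5 + (1.0 - lapse) * (0.5 + 0.5 * p_detect)

def run_staircase(threshold, n_reversals, start=10.0, step=1.0, lapse=0.0, rng=None):
    """2-down-1-up staircase; returns the mean stimulus level at the reversals."""
    rng = rng or random.Random()
    level, streak, last_direction = start, 0, 0
    reversals = []
    while len(reversals) < n_reversals:
        correct = rng.random() < p_correct(level, threshold, lapse=lapse)
        if correct:
            streak += 1
            if streak < 2:
                continue  # need two correct in a row before stepping down
            streak, direction = 0, -1
        else:
            streak, direction = 0, +1
        # A reversal is a change of direction relative to the previous step
        if last_direction != 0 and direction != last_direction:
            reversals.append(level)
        last_direction = direction
        level += direction * step
    return sum(reversals) / len(reversals)

# Hypothetical use: compare a short and a long track for the same observer
short_run = run_staircase(threshold=5.0, n_reversals=4, rng=random.Random(1))
long_run = run_staircase(threshold=5.0, n_reversals=40, rng=random.Random(1))
```

Repeating such runs many times, with and without a non-zero `lapse`, is one way to see how the spread and bias of the estimates depend on the number of reversals.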
Measuring hope among families impacted by cognitive impairment
Hunsaker, Amanda E.; Terhorst, Lauren; Gentry, Amanda; Lingler, Jennifer H.
2014-01-01
The current exploratory investigation aims to establish the reliability and validity of a hope measure, the Herth Hope Index (HHI), among families impacted by early cognitive impairment (N=96). Exploratory factor analysis was used to examine the dimensionality of the measure. Bivariate analyses were used to examine construct validity. The sample had moderately high hope scores. A two-factor structure emerged from the factor analysis, explaining 51.44% of the variance. Both factors exhibited strong internal consistency (Cronbach’s alphas ranged from .83 to .86). Satisfaction with social support was positively associated with hope, supporting convergent validity. Neurocognitive status, illness insight and depression were not associated with hope, indicating discriminant validity. Families impacted by cognitive impairment may maintain hope in the face of a potentially progressive illness, regardless of cognitive status. The HHI can be utilized as a reliable and valid measure of hope by practitioners providing support to families impacted by cognitive impairment. PMID:24784938
Lonsdale, Chris; Hodge, Ken; Rose, Elaine A
2008-06-01
The purpose of the four studies described in this article was to develop and test a new measure of competitive sport participants' intrinsic motivation, extrinsic motivation, and amotivation (self-determination theory; Deci & Ryan, 1985). The items for the new measure, named the Behavioral Regulation in Sport Questionnaire (BRSQ), were constructed using interviews, expert review, and pilot testing. Analyses supported the internal consistency, test-retest reliability, and factorial validity of the BRSQ scores. Nomological validity evidence was also supportive, as BRSQ subscale scores were correlated in the expected pattern with scores derived from measures of motivational consequences. When directly compared with scores derived from the Sport Motivation Scale (SMS; Pelletier, Fortier, Vallerand, Tuson, & Blais, 1995) and a revised version of that questionnaire (SMS-6; Mallett, Kawabata, Newcombe, Otero-Forero, & Jackson, 2007), BRSQ scores demonstrated equal or superior reliability and factorial validity as well as better nomological validity.
Zupančič, Maja; Inglés, Candido S; Bajec, Boštjan; Puklek Levpušček, Melita
2011-06-01
This study analyzed the psychometric properties of scores on the Slovene version of the Questionnaire about Interpersonal Difficulties for Adolescents (QIDA) in a sample of 1,334 adolescents (44% boys), ranging in age from 12 to 18 years (M = 15.61). Confirmatory factor analyses replicated the correlated five-factor structure of the QIDA: Assertiveness, Heterosexual Relationships, Public Speaking, Family Relationships, and Close Friendships. Internal consistency and test-retest reliability were reasonable. Correlations of scores on the QIDA with scores of neuroticism, low extraversion, and low openness, as measured by the Inventory of Child/Adolescent Individual Differences, and scores of fear of negative evaluation, and tension and inhibition in social contacts, as measured by the Social Anxiety Scale for Adolescents were found, revealing differential links with QIDA subscale scores. Girls reported more difficulties than boys. Age differences showed a small but significant decrease in QIDA total score over adolescence.
A Persian version of the parental bonding instrument: factor structure and psychometric properties.
Behzadi, Behnaz; Parker, Gordon
2015-02-28
The Parental Bonding Instrument (PBI) is a widely used self-report measure for quantifying key parenting styles as perceived by the child during its first 16 years. While its development study identified two key parental dimensions, subsequent studies have variably confirmed those two or argued for one or more additional parental constructs. We developed a Persian translation of the PBI and administered it to a sample of 340 high school students. The construct validity of the Persian PBI was examined by Exploratory Factor Analysis while Confirmatory Factor Analysis was used to identify the most adequate model. Analyses of the Persian PBI favored a four-factor model for both parental forms. The Persian PBI has a factorial structure consistent with constructs identified in western cultures, as well as high internal consistency and test-retest reliability. Multivariate analyses indicated significant differences between boys and girls across some factors. The PBI appears an acceptable and appropriate measure for quantifying parent-child bonding in Iranian samples. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Bayesian methods in reliability
NASA Astrophysics Data System (ADS)
Sander, P.; Badoux, R.
1991-11-01
The present proceedings from a course on Bayesian methods in reliability encompasses Bayesian statistical methods and their computational implementation, models for analyzing censored data from nonrepairable systems, the traits of repairable systems and growth models, the use of expert judgment, and a review of the problem of forecasting software reliability. Specific issues addressed include the use of Bayesian methods to estimate the leak rate of a gas pipeline, approximate analyses under great prior uncertainty, reliability estimation techniques, and a nonhomogeneous Poisson process. Also addressed are the calibration sets and seed variables of expert judgment systems for risk assessment, experimental illustrations of the use of expert judgment for reliability testing, and analyses of the predictive quality of software-reliability growth models such as the Weibull order statistics.
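As a generic illustration of Bayesian rate estimation of the kind mentioned above (for example, a pipeline leak rate), the sketch below uses the textbook conjugate gamma prior for a Poisson event rate. The proceedings' actual model is not described in the abstract, and the prior values, counts and exposure here are invented for illustration.

```python
def gamma_poisson_update(prior_shape, prior_rate, events, exposure):
    """Conjugate update for a Poisson rate with a Gamma(shape, rate) prior.

    Returns (posterior shape, posterior rate, posterior mean of the rate).
    """
    post_shape = prior_shape + events
    post_rate = prior_rate + exposure
    return post_shape, post_rate, post_shape / post_rate

# Hypothetical: prior worth 2 leaks in 10 km-years, then 3 leaks seen in 40 km-years
shape, rate, mean_rate = gamma_poisson_update(2.0, 10.0, events=3, exposure=40.0)
```

The posterior mean is a weighted blend of the prior mean and the observed rate, with the prior's weight shrinking as exposure accumulates.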
Validation of the Greek Translation of the Nursing Dimensions Inventory questionnaire (NDI-35)
Kotrotsiou, Evagelia; Gouva, Mary; Kotrotsiou, Stiliani; Malliarou, Maria; Paralikas, Theodosios
2014-01-01
Context: The concept of care is a fundamental issue in nursing science. Therefore the development and the use of tools for assessing care is an imperative for the nursing profession. The NDI-35 questionnaire is one such tool for assessing the nursing care. Objectives: The purpose of this paper is to adapt and use the NDI-35 questionnaire in Greek nursing practice. A translation and validation of NDI-35 questionnaire is performed. Methods: Exploratory factor analyses, as well as internal consistency and test–retest analyses, were conducted. Forward translations from English were produced by three independent Greek translators and then back translations by five independent bilingual translators. The Greek NDI-35 questionnaire that was produced was administered to 200 nurses (144 women and 56 men) from tertiary and secondary health care facilities. Data were analyzed using principal component analysis and Cronbach’s alpha. Results: One hundred and eighty four nurses that answered the NDI-35 questionnaire were graduates from the Technological Educational Institute (T.E.I.) and 64% of the respondents had more than 15 years of professional experience. Two subscales arbitrarily called “clinical work” and “patient needs” emerged, with the mean “clinical work” subscale score being at 70.16 ±12.90 (a maximum of 85) and mean “patient needs” subscale at 21.49± 6.16. Considerable differences in scoring among different items were observed when the NDI-35 answers were compared to their Greek counterparts’. Results confirmed that: (a) the translated versions are an accurate translation of the original, (b) factor analyses established similar factor solutions as that of the English versions, (c) reliability coefficients are satisfactory (i.e., Cronbach’s α coefficients and test–retests), and (d) construct validity revealed similarities between English and Greek versions, replications consistent with past research, as well as differences explained through theoretical frameworks. 
Therefore, both scales were accepted as valid and reliable measures in Greek-speaking populations. Conclusion: Alphas and test-retest correlation suggest the Greek translated and validated NDI-35 questionnaire is a reliable tool for assessing nursing care. Factor analysis and focus group input suggest it is a valid tool. Nurses in different settings may perceive nursing care differently. The findings of the current paper are discussed in the context of nurse education and assessment of care. PMID:25168976
Psychometric properties of the Brief Symptom Inventory-18 in a Spanish breast cancer sample.
Galdón, Ma José; Durá, Estrella; Andreu, Yolanda; Ferrando, Maite; Murgui, Sergio; Pérez, Sandra; Ibañez, Elena
2008-12-01
The objective of this work was to study the psychometric and structural properties of the Brief Symptom Inventory-18 (BSI-18) in a sample of breast cancer patients (N=175). Confirmatory factor analyses were conducted. Two models were tested: the theoretical model with the original structure (three-dimensional), and the empirical model (a four-factor structure) obtained through exploratory factor analysis initially performed by the authors of the BSI-18. The eligible structure was the original proposal consisting of three dimensions: somatization, depression, and anxiety scores. These measures also showed good internal consistency. The results of this study support the reliability and structural validity of the BSI-18 as a standardized instrument for screening purposes in breast cancer patients, with the added benefits of simplicity and ease of application.
Goode, N; Salmon, P M; Taylor, N Z; Lenné, M G; Finch, C F
2017-10-01
One factor potentially limiting the uptake of Rasmussen's (1997) Accimap method by practitioners is the lack of a contributing factor classification scheme to guide accident analyses. This article evaluates the intra- and inter-rater reliability and criterion-referenced validity of a classification scheme developed to support the use of Accimap by led outdoor activity (LOA) practitioners. The classification scheme has two levels: the system level describes the actors, artefacts and activity context in terms of 14 codes; the descriptor level breaks the system level codes down into 107 specific contributing factors. The study involved 11 LOA practitioners using the scheme on two separate occasions to code a pre-determined list of contributing factors identified from four incident reports. Criterion-referenced validity was assessed by comparing the codes selected by LOA practitioners to those selected by the method creators. Mean intra-rater reliability scores at the system (M = 83.6%) and descriptor (M = 74%) levels were acceptable. Mean inter-rater reliability scores were not consistently acceptable for both coding attempts at the system level (M T1 = 68.8%; M T2 = 73.9%), and were poor at the descriptor level (M T1 = 58.5%; M T2 = 64.1%). Mean criterion referenced validity scores at the system level were acceptable (M T1 = 73.9%; M T2 = 75.3%). However, they were not consistently acceptable at the descriptor level (M T1 = 67.6%; M T2 = 70.8%). Overall, the results indicate that the classification scheme does not currently satisfy reliability and validity requirements, and that further work is required. The implications for the design and development of contributing factors classification schemes are discussed. Copyright © 2017 Elsevier Ltd. All rights reserved.
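The reliability scores above are reported as percentages. The abstract does not state the exact index used; one plausible reading is mean pairwise percentage agreement between raters, sketched below with hypothetical codes and a hypothetical function name.

```python
from itertools import combinations

def mean_pairwise_agreement(codings):
    """Mean percentage agreement over all rater pairs.

    codings: one list of codes per rater, all of the same length.
    """
    pair_scores = []
    for a, b in combinations(codings, 2):
        matches = sum(x == y for x, y in zip(a, b))
        pair_scores.append(100.0 * matches / len(a))
    return sum(pair_scores) / len(pair_scores)

# Hypothetical codes assigned by three raters to four contributing factors
raters = [
    ["A", "B", "C", "D"],
    ["A", "B", "C", "A"],
    ["A", "B", "B", "A"],
]
agreement = mean_pairwise_agreement(raters)
```

Percentage agreement does not correct for chance, so a chance-corrected index such as kappa may give a less favourable picture of the same codings.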
Kaux, Jean-François; Delvaux, François; Schaus, Jean; Demoulin, Christophe; Locquet, Médéa; Buckinx, Fanny; Beaudart, Charlotte; Dardenne, Nadia; Van Beveren, Julien; Croisier, Jean-Louis; Forthomme, Bénédicte; Bruyère, Olivier
Translation and validation of an algo-functional questionnaire. Lateral elbow tendinopathy is a common injury in tennis players and physical workers. The Patient-Rated Tennis Elbow Evaluation (PRTEE) Questionnaire was specifically designed to measure pain and functional limitations in patients with lateral epicondylitis (tennis elbow). First developed in English, this questionnaire has since been translated into several languages. The aims of the study were to translate and cross-culturally adapt the PRTEE questionnaire into French and to evaluate the reliability and validity of this translated version of the questionnaire (PRTEE-F). The PRTEE was translated and cross-culturally adapted into French according to international guidelines. To assess the reliability and validity of the PRTEE-F, 115 participants were asked to fill in the PRTEE-F twice, and the Disabilities of Arm, Shoulder and Hand Questionnaire (DASH) and the Short Form Health Survey (SF-36) once. Internal consistency (using Cronbach's alpha), test-retest reliability (using the intraclass correlation coefficient (ICC), standard error of measurement and minimal detectable change), and convergent and divergent validity (using Spearman's correlation coefficients with the DASH and with some subscales of the SF-36, respectively) were assessed. The PRTEE was translated into French without any problems. The PRTEE-F showed good test-retest reliability for the overall score (ICC 0.86) and for each item (ICC 0.8-0.96) and high internal consistency (Cronbach's alpha = 0.98). The correlation analyses revealed high correlation coefficients between PRTEE-F and DASH (convergent validity) and, as expected, a low or moderate correlation with the divergent subscales of the SF-36 (discriminant validity). There was no floor or ceiling effect. The PRTEE questionnaire was successfully cross-culturally adapted into French. 
The PRTEE-F is reliable and valid for evaluating French-speaking patients with lateral elbow tendinopathy. Copyright © 2016 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
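The reliability statistics named in this abstract follow standard textbook formulas. The sketch below is only an illustration under those usual definitions: Cronbach's alpha from item and total-score variances, SEM = SD × sqrt(1 − ICC), and MDC95 = 1.96 × sqrt(2) × SEM. The function names, the toy item scores and the standard deviation of 10 points are assumptions made for the example, not values reported by the study.

```python
import math


def sample_variance(values):
    """Unbiased sample variance (n - 1 in the denominator)."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / (len(values) - 1)


def cronbach_alpha(scores):
    """scores: one row per respondent, each row a list of item scores."""
    k = len(scores[0])
    item_variances = [sample_variance([row[j] for row in scores]) for j in range(k)]
    total_variance = sample_variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


def sem_from_icc(sd, icc):
    """Standard error of measurement from a score SD and a test-retest ICC."""
    return sd * math.sqrt(1 - icc)


def mdc95(sem):
    """Minimal detectable change at the 95% confidence level."""
    return 1.96 * math.sqrt(2) * sem


if __name__ == "__main__":
    toy_scores = [[3, 4, 3], [5, 5, 4], [1, 2, 2], [4, 4, 5]]  # hypothetical data
    print("alpha:", round(cronbach_alpha(toy_scores), 3))
    sem = sem_from_icc(sd=10.0, icc=0.86)  # SD of 10 is an assumed value
    print("SEM:", round(sem, 2), "MDC95:", round(mdc95(sem), 2))
```

Because the SEM scales with the score SD, the same ICC of 0.86 implies a larger minimal detectable change in a more heterogeneous sample.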
Zhu, Junya; Li, Liping; Zhao, Hailei; Han, Guangshu; Wu, Albert W; Weingart, Saul N
2014-10-01
Existing patient safety climate instruments, most of which have been developed in the USA, may not accurately reflect the conditions in the healthcare systems of other countries. To develop and evaluate a patient safety climate instrument for healthcare workers in Chinese hospitals. Based on a review of existing instruments, expert panel review, focus groups and cognitive interviews, we developed items relevant to patient safety climate in Chinese hospitals. The draft instrument was distributed to 1700 hospital workers from 54 units in six hospitals in five Chinese cities between July and October 2011, and 1464 completed surveys were received. We performed exploratory and confirmatory factor analyses and estimated internal consistency reliability, within-unit agreement, between-unit variation, unit-mean reliability, correlation between multi-item composites, and association between the composites and two single items of perceived safety. The final instrument included 34 items organised into nine composites: institutional commitment to safety, unit management support for safety, organisational learning, safety system, adequacy of safety arrangements, error reporting, communication and peer support, teamwork and staffing. All composites had acceptable unit-mean reliabilities (≥0.74) and within-unit agreement (Rwg ≥0.71), and exhibited significant between-unit variation with intraclass correlation coefficients ranging from 9% to 21%. Internal consistency reliabilities ranged from 0.59 to 0.88 and were ≥0.70 for eight of the nine composites. Correlations between composites ranged from 0.27 to 0.73. All composites were positively and significantly associated with the two perceived safety items. The Chinese Hospital Survey on Patient Safety Climate demonstrates adequate dimensionality, reliability and validity. The integration of qualitative and quantitative methods is essential to produce an instrument that is culturally appropriate for Chinese hospitals. 
Published by the BMJ Publishing Group Limited.
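Between-unit variation and unit-mean reliability of the kind reported above are commonly derived from a one-way analysis of variance. The following sketch assumes the usual ICC(1) and ICC(2) definitions and equal numbers of respondents per unit; the function name and the toy scores are hypothetical and are not taken from the survey data.

```python
def one_way_icc(units):
    """units: list of units, each a list of respondent scores.

    Assumes every unit has the same number of respondents (k).
    Returns (ICC(1), unit-mean reliability ICC(2)).
    """
    k = len(units[0])
    n_units = len(units)
    grand_mean = sum(sum(u) for u in units) / (n_units * k)
    unit_means = [sum(u) / k for u in units]

    # Between-unit and within-unit mean squares from a one-way ANOVA.
    ss_between = k * sum((m - grand_mean) ** 2 for m in unit_means)
    ss_within = sum((x - m) ** 2 for u, m in zip(units, unit_means) for x in u)
    ms_between = ss_between / (n_units - 1)
    ms_within = ss_within / (n_units * (k - 1))

    icc1 = (ms_between - ms_within) / (ms_between + (k - 1) * ms_within)
    unit_mean_reliability = (ms_between - ms_within) / ms_between
    return icc1, unit_mean_reliability


if __name__ == "__main__":
    toy_units = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]  # hypothetical unit scores
    print(one_way_icc(toy_units))
```

ICC(1) describes how much of an individual response is attributable to the unit, while the unit-mean reliability describes how dependable the unit average is; with many respondents per unit the second can be high even when the first is modest.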
Gilet, Hélène; Arnould, Benoit; Fofana, Fatoumata; Clerson, Pierre; Colombel, Jean-Frédéric; D'Hondt, Olivier; Faure, Patrick; Hagège, Hervé; Nachury, Maria; Nahon, Stéphane; Tucat, Gilbert; Vandromme, Luc; Cazala-Telinge, Ines; Thibout, Emmanuel
2014-01-01
Severe Crohn's disease management includes anti-tumor necrosis factor (anti-TNF) drugs that differ from early-stage treatments regarding efficacy, safety, and convenience. This study aimed to finalize and psychometrically validate the Satisfaction for PAtients in Crohn's diseasE Questionnaire (SPACE-Q(©)), developed to measure satisfaction with anti-TNF treatment in patients with severe Crohn's disease. A total of 279 patients with severe Crohn's disease receiving anti-TNF therapy completed the SPACE-Q 62-item pilot version at inclusion and 12 and 13 weeks after first anti-TNF injection. The final SPACE-Q scoring was defined using multitrait and regression analyses and clinical relevance considerations. Psychometric validation included clinical validity against Harvey-Bradshaw score, concurrent validity against Treatment Satisfaction Questionnaire for Medication (TSQM), internal consistency reliability, test-retest reliability, and responsiveness against the patient global impression of change (PGIC). Quality of completion was good (55%-67% of patients completed all items). Four items were removed from the questionnaire. Eleven scores were defined within the final 58-item SPACE-Q: disease control; symptoms, anal symptoms, and quality of life transition scales; tolerability; convenience; expectation confirmation toward efficacy, side effects, and convenience; satisfaction with treatment; and motivation. Scores met standards for concurrent validity (correlation between SPACE-Q satisfaction with treatment and TSQM satisfaction scores =0.59), internal consistency reliability (Cronbach's α=0.67-0.93), test-retest reliability (intraclass correlations =0.62-0.91), and responsiveness (improvement in treatment experience assessed by the SPACE-Q for patients reporting improvement on the PGIC). Significantly different mean scores were observed between groups of patients with different Harvey-Bradshaw disease severity scores. 
The SPACE-Q is a valid, reliable, and responsive instrument to measure satisfaction with anti-TNF treatment in patients with severe Crohn's disease and for use in future studies.
Zheng, Jing; You, Li-Ming; Lou, Tan-Qi; Chen, Nian-Chang; Lai, De-Yuan; Liang, Yan-Yi; Li, Ying-Na; Gu, Ying-Ming; Lv, Shao-Fen; Zhai, Cui-Qiu
2010-02-01
Perceptions of exercise benefits and barriers affect exercise behavior. Because of the clinical course and treatment, dialysis patients differ from the general population in their perceptions of exercise benefits and barriers, especially the latter. At present, no valid instruments for assessing perceived exercise benefits and barriers in dialysis patients are available. Our goal was to develop and test the psychometric properties of the Dialysis patient-perceived Exercise Benefits and Barriers Scale (DPEBBS). A literature review and two focus groups were conducted to generate the initial item pool. An expert panel examined the content validity. Then, 269 Chinese hemodialysis patients were recruited by convenience sampling. Exploratory and confirmatory factor analyses were used to test construct validity. Finally, internal consistency and test-retest reliability were assessed. The expert panel determined that the content validity index was satisfactory. The final 24-item scale consisted of six factors explaining 57% of the total variance in the data. Confirmatory factor analysis supported the six-factor structure and a higher-order model. Cronbach's alpha was 0.87 for the total scale, and the test-retest reliability coefficient was 0.84. The DPEBBS was a valid and reliable instrument for evaluating dialysis patients' perceived benefits and barriers to exercise. The application value of this scale remains to be investigated by increasing the sample size and evaluating patients undergoing different dialysis modalities and coming from different regions and cultural backgrounds. Copyright 2009 Elsevier Ltd. All rights reserved.
Maduz, Roman; Kugelmeier, Patrick; Meili, Severin; Döring, Robert; Meier, Christoph; Wahl, Peter
2017-04-01
The Abbreviated Injury Scale (AIS) and the Injury Severity Score (ISS) are increasingly used to assess trauma burden and to perform interhospital benchmarking through trauma registries. Since 2015, public resource allocation in Switzerland is even to be derived from such data. As every trauma centre is responsible for its own coding and data input, this study aims to evaluate the interobserver reliability of AIS and ISS coding. Interobserver reliability of the AIS and ISS is analysed in a cohort of 50 consecutive severely injured patients treated in 2012 at our institution, coded retrospectively by 3 independent and specifically trained observers. With a cutoff of ISS≥16, only 38/50 patients (76%) were uniformly identified as polytraumatised or not. With the cutoff raised to ≥20, this increased to 41/50 patients (82%). A difference in the AIS of ≥1 was present in 261 (16%) of possible codes. Excluding the vast majority of uninjured body regions, uniformly identical AIS severity values were attributed in 67/193 (35%) body regions, or 318/579 (55%) possible observer pairings. Injury severity all too often is neither identified correctly nor consistently when using the AIS. This leads to wrong identification of severely injured patients using the ISS. Improving consistency of coding through centralisation is recommended before scores based on the AIS are used for interhospital benchmarking and resource allocation in the treatment of severely injured patients. Copyright © 2017. Published by Elsevier Ltd.
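To see how small AIS disagreements propagate into the ISS, the sketch below applies the standard ISS rule (sum of the squares of the three highest regional AIS severities, with any AIS of 6 giving an ISS of 75) to three hypothetical codings of one patient and counts identical codes across observer pairs. With 3 observers there are 3 pairings per body region, which matches the 193 × 3 = 579 pairings in the abstract. All patient data and function names here are invented for illustration.

```python
from itertools import combinations


def injury_severity_score(region_ais):
    """region_ais: dict mapping body region -> highest AIS severity (0-6)."""
    severities = sorted(region_ais.values(), reverse=True)
    if severities and severities[0] == 6:
        return 75  # by convention, an unsurvivable injury sets the ISS to 75
    return sum(s * s for s in severities[:3])


def pairwise_agreement(codings):
    """Count identical region codes over all observer pairs.

    codings: list of dicts (one per observer) sharing the same region keys.
    Returns (identical pairings, possible pairings).
    """
    regions = list(codings[0].keys())
    pairs = list(combinations(codings, 2))
    identical = sum(a[r] == b[r] for a, b in pairs for r in regions)
    return identical, len(pairs) * len(regions)


if __name__ == "__main__":
    # Hypothetical codings of the same patient by three observers.
    observers = [
        {"head": 4, "chest": 3, "abdomen": 2},
        {"head": 3, "chest": 3, "abdomen": 2},
        {"head": 3, "chest": 2, "abdomen": 1},
    ]
    scores = [injury_severity_score(o) for o in observers]
    print("ISS per observer:", scores)
    print("polytrauma (ISS >= 16):", [s >= 16 for s in scores])
    print("identical pairings:", pairwise_agreement(observers))
```

Because severities are squared, a one-point difference in a single region can move a patient across the ISS≥16 threshold, which is the effect the study quantifies.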
Osman, Augustine; Wong, Jane L; Bagge, Courtney L; Freedenthal, Stacey; Gutierrez, Peter M; Lozano, Gregorio
2012-12-01
We conducted two studies to examine the dimensions, internal consistency reliability estimates, and potential correlates of the Depression Anxiety Stress Scales-21 (DASS-21; Lovibond & Lovibond, 1995). Participants in Study 1 included 887 undergraduate students (363 men and 524 women, aged 18 to 35 years; mean [M] age = 19.46, standard deviation [SD] = 2.17) recruited from two public universities to assess the specificity of the individual DASS-21 items and to evaluate estimates of internal consistency reliability. Participants in a follow-up study (Study 2) included 410 students (168 men and 242 women, aged 18 to 47 years; M age = 19.65, SD = 2.88) recruited from the same universities to further assess factorial validity and to evaluate potential correlates of the original DASS-21 total and scale scores. Item bifactor and confirmatory factor analyses revealed that a general factor accounted for the greatest proportion of common variance in the DASS-21 item scores (Study 1). In Study 2, the fit statistics showed good fit for the bifactor model. In addition, the DASS-21 total scale score correlated more highly with scores on a measure of mixed depression and anxiety than with scores on the proposed specific scales of depression or anxiety. Coefficient omega estimates for the DASS-21 scale scores were good. Further investigations of the bifactor structure and psychometric properties of the DASS-21, specifically its incremental and discriminant validity, using known clinical groups are needed. © 2012 Wiley Periodicals, Inc.
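Coefficient omega for a bifactor solution can be computed directly from standardized loadings. The sketch below assumes the common formulas for omega total and omega hierarchical, with item uniqueness taken as 1 minus the squared general and specific loadings; the loadings, group labels and function name are hypothetical and are not the estimates reported for the DASS-21.

```python
from collections import defaultdict


def omega_from_bifactor(items):
    """items: list of (general_loading, group_name, specific_loading) tuples,
    using standardized loadings from a bifactor model.

    Returns (omega_total, omega_hierarchical).
    """
    general_sum = sum(g for g, _, _ in items)
    specific_sums = defaultdict(float)
    for _, group, s in items:
        specific_sums[group] += s

    # Variance components of the unit-weighted total score.
    general_var = general_sum ** 2
    specific_var = sum(v ** 2 for v in specific_sums.values())
    uniqueness = sum(1 - g ** 2 - s ** 2 for g, _, s in items)
    total_var = general_var + specific_var + uniqueness

    omega_total = (general_var + specific_var) / total_var
    omega_hierarchical = general_var / total_var
    return omega_total, omega_hierarchical


if __name__ == "__main__":
    # Hypothetical loadings: four items, two specific factors.
    toy_items = [
        (0.6, "depression", 0.4),
        (0.6, "depression", 0.4),
        (0.6, "anxiety", 0.4),
        (0.6, "anxiety", 0.4),
    ]
    print(omega_from_bifactor(toy_items))
```

A large gap between omega total and omega hierarchical indicates that much of the reliable variance belongs to the specific factors rather than the general one.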
Spanish validation of the social stigma scale: Community Attitudes towards Mental Illness.
Ochoa, Susana; Martínez-Zambrano, Francisco; Vila-Badia, Regina; Arenas, Oti; Casas-Anguera, Emma; García-Morales, Esther; Villellas, Raúl; Martín, José Ramón; Pérez-Franco, María Belén; Valduciel, Tamara; García-Franco, Mar; Miguel, Jose; Balsera, Joaquim; Pascual, Gemma; Julia, Eugènia; Casellas, Diana; Haro, Josep Maria
2016-01-01
The stigma against people with mental illness is very high. In Spain there are currently no tools to assess this construct. The aim of this study was to validate the Spanish version of the Community Attitudes towards Mental Illness questionnaire in an adolescent population and to determine its internal consistency and temporal stability. A translation and back-translation of the Community Attitudes towards Mental Illness was performed. A total of 150 students aged between 14 and 18 years were evaluated with this tool at two time points. Internal consistency was tested using Cronbach's α, and the intraclass correlation coefficient was used for test-retest reliability. Gender-stratified analyses were also performed. Cronbach's α was 0.861 for the first evaluation and 0.909 for the second evaluation. The values of the intraclass correlation coefficient ranged from 0.339 to 0.775 in the item-by-item analysis, and from 0.81 to 0.88 in the subscales. In the analysis by gender, intraclass correlation coefficients ranged from 0.797 to 0.863 for girls and from 0.774 to 0.889 for boys. In conclusion, the Community Attitudes towards Mental Illness is a reliable tool for the assessment of social stigma. Although reliable results were found for both boys and girls, the analysis showed some gender differences. Copyright © 2014 SEP y SEPB. Published by Elsevier España. All rights reserved.
Bemister, Taryn B; Brooks, Brian L; Kirton, Adam
2014-07-01
Perinatal stroke is a leading cause of cerebral palsy and lifelong disability, although parent and family outcomes have not yet been studied in this specific population. The Alberta Perinatal Stroke Project Parental Outcome Measure was developed as a 26-item questionnaire on the impact of perinatal stroke on parents and families. The items were derived from expert opinion and scientific literature on issues salient to parents of children with perinatal stroke, including guilt and blame, which are not well captured in existing measures of family impact. Data were collected from 82 mothers and 28 fathers who completed the Parental Outcome Measure and related questionnaires (mean age, 39.5 years; mean child age, 7.4 years). Analyses examined the Parental Outcome Measure's internal consistency, test-retest reliability, validity, and factor structure. The Parental Outcome Measure demonstrated three unique theoretical constructs: Psychosocial Impact, Guilt, and Blame. The Parental Outcome Measure has excellent internal consistency (Cronbach α = 0.91) and very good test-retest reliability over 2-5 weeks (r = 0.87). Regarding validity, the Parental Outcome Measure is sensitive to condition severity, accounts for additional variance in parent outcomes, and strongly correlates with measures of anxiety, depression, stress, quality of life, family functioning, and parent adjustment. The Parental Outcome Measure contributes to the literature as the first brief measure of family impact designed for parents of children with perinatal stroke. Copyright © 2014 Elsevier Inc. All rights reserved.
Assessing the internal consistency of the event-related potential: An example analysis.
Thigpen, Nina N; Kappenman, Emily S; Keil, Andreas
2017-01-01
ERPs are widely and increasingly used to address questions in psychophysiological research. As discussed in this special issue, a renewed focus on questions of reliability and stability marks the need for intuitive, quantitative descriptors that allow researchers to communicate the robustness of ERP measures used in a given study. This report argues that well-established indices of internal consistency and effect size meet this need and can be easily extracted from most ERP datasets, as demonstrated with example analyses using a representative dataset from a feature-based visual selective attention task. We demonstrate how to measure the internal consistency of three aspects commonly considered in ERP studies: voltage measurements for specific time ranges at selected sensors, voltage dynamics across all time points of the ERP waveform, and the distribution of voltages across the scalp. We illustrate methods for quantifying the robustness of experimental condition differences, by calculating effect size for different indices derived from the ERP. The number of trials contributing to the ERP waveform was manipulated to examine the relationship between signal-to-noise ratio (SNR), internal consistency, and effect size. In the present example dataset, satisfactory consistency (Cronbach's alpha > 0.7) of individual voltage measurements was reached at lower trial counts than were required to reach satisfactory effect sizes for differences between experimental conditions. Comparing different metrics of robustness, we conclude that the internal consistency and effect size of ERP findings greatly depend on the quantification strategy, the comparisons and analyses performed, and the SNR. © 2016 Society for Psychophysiological Research.
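The link between trial count and internal consistency can be illustrated with a split-half estimate. Note that this swaps in a simpler technique than the Cronbach's alpha analysis the report describes: per-subject voltages are averaged over odd and even trials, the two halves are correlated across subjects, and the Spearman-Brown formula projects the reliability of the full-length (or longer) measurement. The data layout and function names are assumptions made for this sketch.

```python
import math


def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return cov / math.sqrt(sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y))


def spearman_brown(r, factor=2.0):
    """Projected reliability when the number of trials is multiplied by `factor`."""
    return factor * r / (1 + (factor - 1) * r)


def split_half_reliability(trials_by_subject):
    """trials_by_subject: one list of single-trial voltages per subject
    (for example, the mean amplitude in a fixed time window at one sensor)."""
    odd_means = [sum(t[0::2]) / len(t[0::2]) for t in trials_by_subject]
    even_means = [sum(t[1::2]) / len(t[1::2]) for t in trials_by_subject]
    return spearman_brown(pearson(odd_means, even_means))


if __name__ == "__main__":
    # Hypothetical single-trial amplitudes (microvolts) for four subjects.
    toy = [
        [1.2, 0.8, 1.5, 1.1],
        [2.9, 3.4, 3.1, 2.6],
        [0.2, 0.9, 0.4, 0.1],
        [2.0, 1.6, 2.2, 2.5],
    ]
    print("split-half reliability:", round(split_half_reliability(toy), 3))
    print("projected with 4x the trials:", round(spearman_brown(0.5, 4), 3))
```

The Spearman-Brown projection makes the abstract's point concrete: a measure with modest reliability at a low trial count can reach a conventional 0.7 threshold once enough trials are averaged.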
Assessing the internal consistency of the event-related potential: An example analysis
Thigpen, Nina; Kappenman, Emily; Keil, Andreas
2017-01-01
Event-related potentials (ERPs) are widely and increasingly used to address questions in psychophysiological research. As discussed in this special issue, a renewed focus on questions of reliability and stability marks the need for intuitive, quantitative descriptors that allow researchers to communicate the robustness of ERP measures used in a given study. This report argues that well-established indices of internal consistency and effect size meet this need and can be easily extracted from most ERP data sets, as demonstrated with example analyses using a representative data set from a feature-based visual selective attention task. We demonstrate how to measure the internal consistency of three aspects commonly considered in ERP studies: voltage measurements for specific time ranges at selected sensors, voltage dynamics across all time points of the ERP waveform, and the distribution of voltages across the scalp. We illustrate methods for quantifying the robustness of experimental condition differences, by calculating effect size for different indices derived from the ERP. The number of trials contributing to the ERP waveform was manipulated to examine the relationship between signal-to-noise ratio, internal consistency, and effect size. In the present example data set, satisfactory consistency (Cronbach’s alpha > 0.7) of individual voltage measurements was reached at lower trial counts than were required to reach satisfactory effect sizes for differences between experimental conditions. Comparing different metrics of robustness, we conclude that the internal consistency and effect size of ERP findings greatly depend on the quantification strategy, the comparisons and analyses performed, and the signal-to-noise ratio. PMID:28000264
Li, Y; Morris, S; Cole, J; Dube', S; Smith, J A M; Burbridge, C; Symonds, T; Hudgens, S; Wang, W
2017-05-18
The Multidimensional Daily Diary of Fatigue-Fibromyalgia-17 instrument (MDF-Fibro-17) has been developed for use in fibromyalgia (FM) clinical studies and includes 5 domains: Global Fatigue Experience, Cognitive Fatigue, Physical Fatigue, Motivation, and Impact on Function. Psychometric properties of the MDF-Fibro-17 needed to demonstrate the appropriateness of using this instrument in clinical studies are presented. Psychometric analyses were conducted to evaluate the factor structure, reliability, validity, and responsiveness of the MDF-Fibro-17 using data from a Phase 2 clinical study of FM patients (N = 381). Confirmatory factor analyses (CFA) were performed to ensure understanding of the multidimensional domain structure, and a secondary factor analysis of the domains examined the appropriateness of calculating a total score in addition to domain scores. Longitudinal psychometric analyses (test-retest reliability and responder analysis) were also conducted on the data from Baseline to Week 6. The CFA supported the 17-item, 5 domain structure of this instrument as the best fit of the data: comparative fit index (CFI) and non-normed fit index (NNFI) were 0.997 and 0.992 respectively, standardized root mean square residual (SRMR) was 0.010 and the root mean square error of approximation (RMSEA) was 0.06. In addition, total score (CFI and NNFI both 0.95) met required standards. For the total and 5 domain scores, reliability and validity data were acceptable: test-retest and internal consistency were above 0.9; correlations were as expected with the Global Fatigue Index (GFI) (0.62-0.75), Fibromyalgia Impact Questionnaire (FIQ) Total (0.59-0.71), and 36-Item Short Form Health Survey (SF-36) vitality (VT) (0.43-0.53); and discrimination was shown using quintile scores for the GFI, FIQ Total, and Pain Numeric Rating Scale (NRS) quartiles. In addition, sensitivity to change was demonstrated with an overall mean responder score of -2.59 using anchor-based methods. 
The MDF-Fibro-17 reliably measures 5 domains of FM-related fatigue and psychometric evaluation confirms that this measure meets or exceeds each of the predefined acceptable thresholds for evidence of reliability, validity, and responsiveness to changes in clinical status. This suggests that the MDF-Fibro-17 is an appropriate and responsive measure of FM-related fatigue in clinical studies.
Validation of Turkish version of brief negative symptom scale.
Polat Nazlı, Irmak; Ergül, Ceylan; Aydemir, Ömer; Chandhoke, Swati; Üçok, Alp; Gönül, Ali Saffet
2016-11-01
Negative symptoms in schizophrenia have been assessed by many instruments. However, a current consensus on these symptoms has been built and new tools, such as the Brief Negative Symptom Scale (BNSS), have been developed. This study aimed to evaluate the reliability and validity of the Turkish version of the BNSS. The scale was translated into Turkish and back-translated into English. After the approval of the translation, 75 schizophrenia patients were interviewed with the BNSS, the Positive and Negative Syndrome Scale (PANSS), the Calgary Depression Scale for Schizophrenia (CDSS) and the Extrapyramidal Symptom Rating Scale (ESRS). Reliability and validity analyses were then performed. In the reliability analysis, the Cronbach's alpha coefficient was 0.96 and item-total score correlation coefficients were between 0.655 and 0.884. The intraclass correlation coefficient was 0.665. The inter-rater reliability was 0.982 (p < 0.0001). In the validity analysis, the total score of BNSS-TR was correlated with PANSS Total Score, Positive Symptoms Subscale, Negative Symptoms Subscale, and General Psychopathology Subscale. CDSS and ESRS were not correlated with BNSS-TR. The factor structure of the scale consisted of the same items as in the original version. Our study confirms that the Turkish version of BNSS is an applicable tool for the evaluation of negative symptoms in schizophrenia.
An integrated approach to system design, reliability, and diagnosis
NASA Technical Reports Server (NTRS)
Patterson-Hine, F. A.; Iverson, David L.
1990-01-01
The requirement for ultradependability of computer systems in future avionics and space applications necessitates a top-down, integrated systems engineering approach for design, implementation, testing, and operation. The functional analyses of hardware and software systems must be combined by models that are flexible enough to represent their interactions and behavior. The information contained in these models must be accessible throughout all phases of the system life cycle in order to maintain consistency and accuracy in design and operational decisions. One approach being taken by researchers at Ames Research Center is the creation of an object-oriented environment that integrates information about system components required in the reliability evaluation with behavioral information useful for diagnostic algorithms. Procedures have been developed at Ames that perform reliability evaluations during design and failure diagnoses during system operation. These procedures utilize information from a central source, structured as object-oriented fault trees. Fault trees were selected because they are a flexible model widely used in aerospace applications and because they give a concise, structured representation of system behavior. The utility of this integrated environment for aerospace applications in light of our experiences during its development and use is described. The techniques for reliability evaluation and failure diagnosis are discussed, and current extensions of the environment and areas requiring further development are summarized.
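The idea of an object-oriented fault tree used for reliability evaluation can be sketched in a few lines. This is an illustrative model only and not the Ames environment: the class names, the example events and their probabilities are hypothetical, and the calculation assumes that basic events are statistically independent and that no basic event appears under more than one gate.

```python
from dataclasses import dataclass
from math import prod
from typing import List


class Node:
    """Common interface for every element of the fault tree."""

    def failure_probability(self) -> float:
        raise NotImplementedError


@dataclass
class BasicEvent(Node):
    name: str
    p_fail: float  # probability that this component fails

    def failure_probability(self) -> float:
        return self.p_fail


@dataclass
class AndGate(Node):
    name: str
    children: List[Node]

    def failure_probability(self) -> float:
        # The gate output fails only if every input fails.
        return prod(c.failure_probability() for c in self.children)


@dataclass
class OrGate(Node):
    name: str
    children: List[Node]

    def failure_probability(self) -> float:
        # The gate output fails if at least one input fails.
        return 1 - prod(1 - c.failure_probability() for c in self.children)


if __name__ == "__main__":
    # Hypothetical system: redundant processors plus a single power bus.
    tree = OrGate("system failure", [
        AndGate("both processors fail", [
            BasicEvent("processor A", 0.1),
            BasicEvent("processor B", 0.2),
        ]),
        BasicEvent("power bus", 0.05),
    ])
    print("top-event probability:", round(tree.failure_probability(), 4))
```

Keeping the tree as objects means the same structure that yields the top-event probability during design could also be traversed at run time to list the basic events consistent with an observed failure, which is the kind of reuse the abstract describes.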
An integrated approach to system design, reliability, and diagnosis
NASA Astrophysics Data System (ADS)
Patterson-Hine, F. A.; Iverson, David L.
1990-12-01
The requirement for ultradependability of computer systems in future avionics and space applications necessitates a top-down, integrated systems engineering approach for design, implementation, testing, and operation. The functional analyses of hardware and software systems must be combined by models that are flexible enough to represent their interactions and behavior. The information contained in these models must be accessible throughout all phases of the system life cycle in order to maintain consistency and accuracy in design and operational decisions. One approach being taken by researchers at Ames Research Center is the creation of an object-oriented environment that integrates information about system components required in the reliability evaluation with behavioral information useful for diagnostic algorithms. Procedures have been developed at Ames that perform reliability evaluations during design and failure diagnoses during system operation. These procedures utilize information from a central source, structured as object-oriented fault trees. Fault trees were selected because they are a flexible model widely used in aerospace applications and because they give a concise, structured representation of system behavior. The utility of this integrated environment for aerospace applications in light of our experiences during its development and use is described. The techniques for reliability evaluation and failure diagnosis are discussed, and current extensions of the environment and areas requiring further development are summarized.
Uggioni, Paula Lazzarin; Salay, Elisabete
2012-04-01
Validated and reliable instruments for measuring consumer attitudes regarding food quality certifications are lacking, but the measurement of consumer attitude could be an important tool for understanding consumer behavior. Thus the objective of this study was to develop an instrument for measuring consumer attitudes regarding private food safety certifications for commercial restaurants. To this end, the following steps were carried out: development of the interview items; complete pilot testing; item analyses (influence of social desirability and total-item correlation); reliability test (internal consistency and test-retest); and validity assessment (content and discriminative validity and exploratory and confirmatory factor analysis). The subjects, all over the age of 18 and drawn from six non-probabilistic samples (n=7-350) in the city of Campinas, Brazil, were all subjected to an interview. The final scale included 24 items and had a Cronbach's alpha coefficient of 0.79 and a content validation coefficient of 0.99, both within acceptable limits. The confirmatory factor analysis validated a model with five factors and the final instrument discriminated reasonably well between the groups and showed satisfactory reproducibility (r=0.955). Furthermore, the scale validity and reliability were satisfactory, suggesting it could also be applied to future studies. Copyright © 2011 Elsevier Ltd. All rights reserved.
Güler, Sibel; Turan, F Nesrin
2015-09-30
Our aim was to translate the Quality of Life in Essential Tremor Questionnaire (QUEST) developed by Troster (2005) and to analyse the validity and reliability of this questionnaire. Two hundred twelve consecutive patients with essential tremor (ET) and forty-three control subjects were included in the study. Permission for the translation and validation of the QUEST scale was obtained. The translation was performed according to the guidelines provided by the publisher. After the translation, the final version of the scale was administered to both groups to determine its reliability and validity. The QUEST Physical, Psychosocial, Communication, Hobbies/leisure and Work/finance scores were 0.967, 0.968, 0.933, 0.964 and 0.925, respectively. There were good correlations between each of the QUEST scores that were indicative of good internal consistency. Additionally, we observed that all of the QUEST scores were most strongly related to the right and left arms (p=0.0001). However, we observed that all of the QUEST scores were weakly related to the voice, head and right leg (p=0.0001). These findings support the notion that the Turkish version of the Quality of Life in Essential Tremor (QUEST) questionnaire is a valid and reliable tool for the assessment of the quality of life of patients with ET.
Study on the Validity and Reliability of Melbourne Decision Making Scale in Turkey
ERIC Educational Resources Information Center
Çolakkadioglu, Oguzhan; Deniz, M. Engin
2015-01-01
This study analyzes the validity and reliability of the Melbourne Decision Making Questionnaire (MDMQ). The sample consisted of 650 university students. The structural validity of the MDMQ, as well as correlations among its sub-scales, measure-bound validity, internal consistency, item total correlations and test-retest reliability coefficients…
ERIC Educational Resources Information Center
Davenport, Ernest C.; Davison, Mark L.; Liou, Pey-Yan; Love, Quintin U.
2015-01-01
This article uses definitions provided by Cronbach in his seminal paper for coefficient α to show the concepts of reliability, dimensionality, and internal consistency are distinct but interrelated. The article begins with a critique of the definition of reliability and then explores mathematical properties of Cronbach's α. Internal consistency…
NASA Astrophysics Data System (ADS)
Iskandar, Ismed; Satria Gondokaryono, Yudi
2016-02-01
In reliability theory, the most important problem is to determine the reliability of a complex system from the reliability of its components. The weakness of most reliability theories is that the systems are described and explained as simply functioning or failed. In many real situations, failures may arise from many causes, depending on the age and the environment of the system and its components. Another problem in reliability theory is that of estimating the parameters of the assumed failure models. The estimation may be based on data collected from censored or uncensored life tests. In many reliability problems, the failure data are simply quantitatively inadequate, especially in engineering design and maintenance systems. Bayesian analyses are more beneficial than classical ones in such cases. Bayesian estimation allows us to combine past knowledge or experience, in the form of an a priori distribution, with life test data to make inferences about the parameter of interest. In this paper, we have investigated the application of Bayesian estimation to competing risk systems. The cases are limited to models with independent causes of failure, using the Weibull distribution as our model. A simulation is conducted for this distribution with the objectives of verifying the models and the estimators and investigating the performance of the estimators for varying sample sizes. The simulation data are analyzed using Bayesian and maximum likelihood analyses. The simulation results show that changing the true value of one parameter relative to another changes the standard deviation in the opposite direction. With perfect information on the prior distribution, the Bayesian estimation methods are better than those of maximum likelihood. The sensitivity analyses show some sensitivity to shifts in the prior locations. 
They also show the robustness of the Bayesian analysis within the range between the true value and the maximum likelihood estimated value lines.
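A competing-risks system with independent Weibull causes fails at the earliest of its cause-specific failure times, so its reliability is the product of the individual Weibull survival functions. The sketch below simulates such a system and checks the simulated survival fraction against the closed form; it does not reproduce the paper's Bayesian or maximum likelihood estimators, and the scale and shape values are hypothetical.

```python
import math
import random


def system_reliability(t, causes):
    """Probability that no cause has produced a failure by time t.

    causes: list of (scale, shape) Weibull parameters, one pair per
    independent cause of failure.
    """
    return math.exp(-sum((t / scale) ** shape for scale, shape in causes))


def simulate_failure(causes, rng):
    """Draw one system lifetime; returns (failure time, index of the cause)."""
    times = [rng.weibullvariate(scale, shape) for scale, shape in causes]
    t = min(times)
    return t, times.index(t)


if __name__ == "__main__":
    causes = [(100.0, 1.5), (250.0, 0.8)]  # hypothetical (scale, shape) pairs
    rng = random.Random(42)
    draws = [simulate_failure(causes, rng) for _ in range(10_000)]
    t0 = 50.0
    simulated = sum(t > t0 for t, _ in draws) / len(draws)
    print("analytic R(50):", round(system_reliability(t0, causes), 3))
    print("simulated R(50):", round(simulated, 3))
    print("share of failures from cause 0:",
          round(sum(c == 0 for _, c in draws) / len(draws), 3))
```

Recording which cause produced each failure is what makes the data "competing risk" data: an estimator for one cause treats failures from the other causes as censored observations.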
Use of Internal Consistency Coefficients for Estimating Reliability of Experimental Tasks Scores
Green, Samuel B.; Yang, Yanyun; Alt, Mary; Brinkley, Shara; Gray, Shelley; Hogan, Tiffany; Cowan, Nelson
2017-01-01
Reliabilities of scores for experimental tasks are likely to differ from one study to another to the extent that the task stimuli change, the number of trials varies, the type of individuals taking the task changes, the administration conditions are altered, or the focal task variable differs. Given reliabilities vary as a function of the design of these tasks and the characteristics of the individuals taking them, making inferences about the reliability of scores in an ongoing study based on reliability estimates from prior studies is precarious. Thus, it would be advantageous to estimate reliability based on data from the ongoing study. We argue that internal consistency estimates of reliability are underutilized for experimental task data and in many applications could provide this information using a single administration of a task. We discuss different methods for computing internal consistency estimates with a generalized coefficient alpha and the conditions under which these estimates are accurate. We illustrate use of these coefficients using data for three different tasks. PMID:26546100
Internal Consistency, Retest Reliability, and their Implications For Personality Scale Validity
McCrae, Robert R.; Kurtz, John E.; Yamagata, Shinji; Terracciano, Antonio
2010-01-01
We examined data (N = 34,108) on the differential reliability and validity of facet scales from the NEO Inventories. We evaluated the extent to which (a) psychometric properties of facet scales are generalizable across ages, cultures, and methods of measurement; and (b) validity criteria are associated with different forms of reliability. Composite estimates of facet scale stability, heritability, and cross-observer validity were broadly generalizable. Two estimates of retest reliability were independent predictors of the three validity criteria; none of three estimates of internal consistency was. Available evidence suggests the same pattern of results for other personality inventories. Internal consistency of scales can be useful as a check on data quality, but appears to be of limited utility for evaluating the potential validity of developed scales, and it should not be used as a substitute for retest reliability. Further research on the nature and determinants of retest reliability is needed. PMID:20435807
The reliability of vertical jump tests between the Vertec and My Jump phone application
Yingling, Vanessa R.; Castro, Dimitri A.; Duong, Justin T.; Malpartida, Fiorella J.; Usher, Justin R.; O, Jenny
2018-01-01
Background The vertical jump is used to estimate sports performance capabilities and physical fitness in children, elderly, non-athletic and injured individuals. Different jump techniques and measurement tools are available to assess vertical jump height and peak power; however, their use is limited by access to laboratory settings, excessive cost and/or time constraints thus making these tools oftentimes unsuitable for field assessment. A popular field test uses the Vertec and the Sargent vertical jump with countermovement; however, new low cost, easy to use tools are becoming available, including the My Jump iOS mobile application (app). The purpose of this study was to assess the reliability of the My Jump relative to values obtained by the Vertec for the Sargent stand and reach vertical jump (VJ) test. Methods One hundred and thirty-five healthy participants aged 18–39 years (94 males, 41 females) completed three maximal Sargent VJ with countermovement that were simultaneously measured using the Vertec and the My Jump. Jump heights were quantified for each jump and peak power was calculated using the Sayers equation. Four separate ICC estimates and their 95% confidence intervals were used to assess reliability. Two analyses (with jump height and calculated peak power as the dependent variables, respectively) were based on a single rater, consistency, two-way mixed-effects model, while two others (with jump height and calculated peak power as the dependent variables, respectively) were based on a single rater, absolute agreement, two-way mixed-effects model. Results Moderate to excellent reliability relative to the degree of consistency between the Vertec and My Jump values was found for jump height (ICC = 0.813; 95% CI [0.747–0.863]) and calculated peak power (ICC = 0.926; 95% CI [0.897–0.947]). 
However, poor to good reliability relative to absolute agreement for VJ height (ICC = 0.665; 95% CI [0.050–0.859]) and poor to excellent reliability relative to absolute agreement for peak power (ICC = 0.851; 95% CI [0.272–0.946]) between the Vertec and My Jump values were found; Vertec VJ height, and thus, Vertec calculated peak power values, were significantly higher than those calculated from My Jump values (p < 0.0001). Discussion The My Jump app may provide a reliable measure of vertical jump height and calculated peak power in multiple field and laboratory settings without the need of costly equipment such as force plates or Vertec. The reliability relative to degree of consistency between the Vertec and My Jump app was moderate to excellent. However, the reliability relative to absolute agreement between Vertec and My Jump values contained significant variation (based on CI values), thus, it is recommended that either the My Jump or the Vertec be used to assess VJ height in repeated measures within subjects’ designs; these measurement tools should not be considered interchangeable within subjects or in group measurement designs. PMID:29692955
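The difference between the consistency and absolute agreement results can be seen in a small sketch. It uses the usual mean-square formulas for single-measure ICCs in a two-way layout (subjects by devices) and does not compute confidence intervals. The function name and the toy data are invented for illustration and are not from the study.

```python
def icc_single_measure(ratings):
    """Single-measure ICCs for a two-way layout.

    ratings: list of rows, one per subject, one column per device or rater.
    Returns (consistency ICC, absolute-agreement ICC).
    """
    n = len(ratings)
    k = len(ratings[0])
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)              # between-subject mean square
    ms_cols = ss_cols / (k - 1)              # between-device mean square
    ms_error = ss_error / ((n - 1) * (k - 1))

    consistency = (ms_rows - ms_error) / (ms_rows + (k - 1) * ms_error)
    agreement = (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )
    return consistency, agreement
```

If one device reads a constant amount higher than the other, subjects keep the same rank order, so the consistency ICC stays near 1 while the absolute-agreement ICC drops. That is the pattern reported here, where Vertec heights were systematically higher than My Jump heights.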
Cross-cultural validity of a dietary questionnaire for studies of dental caries risk in Japanese.
Shinga-Ishihara, Chikako; Nakai, Yukie; Milgrom, Peter; Murakami, Kaori; Matsumoto-Nakano, Michiyo
2014-01-02
Diet is a major modifiable contributing factor in the etiology of dental caries. The purpose of this paper is to examine the reliability and cross-cultural validity of the Japanese version of the Food Frequency Questionnaire to assess dietary intake in relation to dental caries risk in Japanese. The 38-item Food Frequency Questionnaire, in which Japanese food items were added to increase content validity, was translated into Japanese, and administered to two samples. The first sample comprised 355 pregnant women with mean age of 29.2 ± 4.2 years for the internal consistency and criterion validity analyses. Factor analysis (principal components with Varimax rotation) was used to determine dimensionality. The dietary cariogenicity score was calculated from the Food Frequency Questionnaire and used for the analyses. Salivary mutans streptococci level was used as a semi-quantitative assessment of dental caries risk and measured by Dentocult SM. Dentocult SM scores were compared with the dietary cariogenicity score computed from the Food Frequency Questionnaire to examine criterion validity, and assessed by Spearman's correlation coefficient (rs) and Kruskal-Wallis test. Test-retest reliability of the Food Frequency Questionnaire was assessed with a second sample of 25 adults with mean age of 34.0 ± 3.0 years using intraclass correlation coefficient analysis. The Japanese language version of the Food Frequency Questionnaire showed high test-retest reliability (ICC = 0.70) and good criterion validity assessed by relationship with salivary mutans streptococci levels (rs = 0.22; p < 0.001). Factor analysis revealed four subscales that make up the questionnaire (solid sugars, solid and starchy sugars, liquid and semisolid sugars, sticky and slowly dissolving sugars). Internal consistency was low to acceptable (Cronbach's alpha = 0.67 for the total scale, 0.46-0.61 for each subscale).
Mean dietary cariogenicity scores were 50.8 ± 19.5 in the first sample, and 47.4 ± 14.1 and 40.6 ± 11.3 for the first and second administrations, respectively, in the second sample. The distribution of Dentocult SM score was 6.8% (score = 0), 34.4% (score = 1), 39.4% (score = 2), and 19.4% (score = 3). Participants with higher scores were more likely to have higher dietary cariogenicity scores (p < 0.001; Kruskal-Wallis test). These results provide preliminary evidence for the reliability and validity of the Japanese language Food Frequency Questionnaire.
The psychometric properties of the WHOQOL-BREF in Japanese couples
Sun, Yi; Sugawara, Masumi; Matsumoto, Satoko; Sakai, Atsushi; Takaoka, Junko; Goto, Noriko
2015-01-01
This study investigated the psychometric properties of the Japanese version of the WHOQOL-BREF among 10,693 community-based married Japanese men and women (4376 couples) who were either expecting or raising a child. Analyses of item-response distributions, internal consistency, criterion validity, and discriminant validity indicated that the scale had acceptable reliability and performed well in preliminary tests of validity. Furthermore, dyadic confirmatory factor analysis revealed that the theoretical factor structure was valid and similar across partners, suggesting that men and women define and value quality of life in a similar way. PMID:28070365
Computing Lives And Reliabilities Of Turboprop Transmissions
NASA Technical Reports Server (NTRS)
Coy, J. J.; Savage, M.; Radil, K. C.; Lewicki, D. G.
1991-01-01
Computer program PSHFT calculates lifetimes of variety of aircraft transmissions. Consists of main program, series of subroutines applying to specific configurations, generic subroutines for analysis of properties of components, subroutines for analysis of system, and common block. Main program selects routines used in analysis and causes them to operate in desired sequence. Series of configuration-specific subroutines put in configuration data, perform force and life analyses for components (with help of generic component-property-analysis subroutines), fill property array, call up system-analysis routines, and finally print out results of analysis for system and components. Written in FORTRAN 77(IV).
Markup of temporal information in electronic health records.
Hyun, Sookyung; Bakken, Suzanne; Johnson, Stephen B
2006-01-01
Temporal information plays a critical role in the understanding of clinical narrative (i.e., free text). We developed a representation for marking up temporal information in a narrative, consisting of five elements: 1) reference point, 2) direction, 3) number, 4) time unit, and 5) pattern. We identified 254 temporal expressions from 50 discharge summaries and represented them using our scheme. The overall inter-rater reliability among raters applying the representation model was 75 percent agreement. The model can contribute to temporal reasoning in computer systems for decision support, data mining, and process and outcomes analyses by providing structured temporal information.
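The five-element representation and the percent-agreement figure can be pictured with a short sketch. The field names follow the five elements listed in the abstract, but the field types, the allowed values, and the helper function are assumptions made for illustration. The abstract does not describe the actual encoding.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class TemporalMarkup:
    """One temporal expression, coded with the five elements of the scheme."""
    reference_point: Optional[str]  # e.g. "admission" (hypothetical value)
    direction: Optional[str]        # e.g. "before" or "after" (hypothetical)
    number: Optional[float]         # e.g. 3
    time_unit: Optional[str]        # e.g. "day"
    pattern: Optional[str]          # e.g. "daily"; None when not applicable


def percent_agreement(rater_a: Sequence[TemporalMarkup],
                      rater_b: Sequence[TemporalMarkup]) -> float:
    """Percentage of expressions that two raters coded identically."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must code the same non-empty set")
    matches = sum(a == b for a, b in zip(rater_a, rater_b))
    return 100.0 * matches / len(rater_a)
```

Under this coding, a phrase such as "3 days before admission" might become TemporalMarkup("admission", "before", 3, "day", None). Two codings count as agreeing only when all five elements match, which is a stricter rule than the study may have applied.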
Ponterotto, Joseph G; Ruckdeschel, Daniel E
2007-12-01
The present article addresses issues in reliability assessment that are often neglected in psychological research such as acceptable levels of internal consistency for research purposes, factors affecting the magnitude of coefficient alpha (alpha), and considerations for interpreting alpha within the research context. A new reliability matrix anchored in classical test theory is introduced to help researchers judge adequacy of internal consistency coefficients with research measures. Guidelines and cautions in applying the matrix are provided.
Loeding, B L; Greenan, J P
1998-12-01
The study examined the validity and reliability of four assessments, with three instruments per domain. Domains included generalizable mathematics, communication, interpersonal relations, and reasoning skills. Participants were deaf, legally blind, or visually impaired students enrolled in vocational classes at residential secondary schools. The researchers estimated the internal consistency reliability, test-retest reliability, and construct validity correlations of three subinstruments: student self-ratings, teacher ratings, and performance assessments. The data suggest that these instruments are highly internally consistent measures of generalizable vocational skills. Four performance assessments have high-to-moderate test-retest reliability estimates, and were generally considered to possess acceptable validity and reliability.
20 CFR 220.14 - Weighing of evidence.
Code of Federal Regulations, 2010 CFR
2010-04-01
... capacity evaluation is based upon functional objective tests with high validity and reliability; (2) The... consists of objective findings of exams that have poor reliability or validity; (7) The evidence consists...
[The reliability of a questionnaire regarding Colombian children's physical activity].
Herazo-Beltrán, Aliz Y; Domínguez-Anaya, Regina
2012-10-01
This study reports the test-retest reliability and internal consistency of the Physical Activity Questionnaire for school children (PAQ-C). This was a descriptive study of 100 school children aged 9 to 11 years attending a school in Cartagena, Colombia. The sample was randomly selected. The PAQ-C was given twice, one week apart, after the informed consent forms had been signed by the children's parents and school officials. Cronbach's alpha coefficient was used to assess internal consistency and an intra-class correlation coefficient to assess test-retest reliability; SPSS (version 17.0) was used for statistical analysis. Internal consistency was 0.73 at the first measurement and 0.78 at the second; the intra-class correlation coefficient was 0.60. There were differences between boys and girls at both measurements. The PAQ-C had acceptable internal consistency and test-retest reliability, making it useful for measuring children's self-reported physical activity and a valuable tool for population studies in Colombia.
The Validity and Reliability of the Mobbing Scale (MS)
ERIC Educational Resources Information Center
Yaman, Erkan
2009-01-01
The aim of this research is to develop the Mobbing Scale and examine its validity and reliability. The sample of the study consisted of 515 persons from Sakarya and Bursa. In this study, construct validity, internal consistency, test-retest reliability, and item analysis of the scale were examined. As a result of factor analysis for construct…
Development of the multiple sclerosis (MS) early mobility impairment questionnaire (EMIQ).
Ziemssen, Tjalf; Phillips, Glenn; Shah, Ruchit; Mathias, Adam; Foley, Catherine; Coon, Cheryl; Sen, Rohini; Lee, Andrew; Agarwal, Sonalee
2016-10-01
The Early Mobility Impairment Questionnaire (EMIQ) was developed to facilitate early identification of mobility impairments in multiple sclerosis (MS) patients. We describe the initial development of the EMIQ with a focus on the psychometric evaluation of the questionnaire using classical and item response theory methods. The initial 20-item EMIQ was constructed by clinical specialists and qualitatively tested among people with MS and physicians via cognitive interviews. Data from an observational study was used to make additional updates to the instrument based on exploratory factor analysis (EFA) and item response theory (IRT) analysis, and psychometric analyses were performed to evaluate the reliability and validity of the final instrument's scores and screening properties (i.e., sensitivity and specificity). Based on qualitative interview analyses, a revised 15-item EMIQ was included in the observational study. EFA, IRT and item-to-item correlation analyses revealed redundant items which were removed leading to the final nine-item EMIQ. The nine-item EMIQ performed well with respect to: test-retest reliability (ICC = 0.858); internal consistency (α = 0.893); convergent validity; and known-groups methods for construct validity. A cut-point of 41 on the 0-to-100 scale resulted in sufficient sensitivity and specificity statistics for viably identifying patients with mobility impairment. The EMIQ is a content valid and psychometrically sound instrument for capturing MS patients' experience with mobility impairments in a clinical practice setting. Additional research is suggested to further confirm the EMIQ's screening properties over time.
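How a cut-point translates into sensitivity and specificity can be shown with a minimal sketch. It assumes that scores at or above the cut-point flag impairment. The abstract does not state the direction of the scale, so that convention, the function name, and any data passed in are hypothetical.

```python
def screening_stats(scores, impaired, cut_point=41):
    """Sensitivity and specificity of a score threshold.

    scores: questionnaire scores on the 0-to-100 scale.
    impaired: booleans from a reference classification, same order as scores.
    Assumption: score >= cut_point means the screen is positive.
    """
    tp = fp = tn = fn = 0
    for score, truth in zip(scores, impaired):
        flagged = score >= cut_point
        if flagged and truth:
            tp += 1
        elif flagged and not truth:
            fp += 1
        elif not flagged and truth:
            fn += 1
        else:
            tn += 1
    sensitivity = tp / (tp + fn)  # share of impaired patients flagged
    specificity = tn / (tn + fp)  # share of unimpaired patients not flagged
    return sensitivity, specificity
```

Sweeping cut_point over the score range and recording both values is one common way to choose a threshold that balances the two.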
Delimiting Coefficient α from Internal Consistency and Unidimensionality
ERIC Educational Resources Information Center
Sijtsma, Klaas
2015-01-01
I discuss the contribution by Davenport, Davison, Liou, & Love (2015) in which they relate reliability represented by coefficient α to formal definitions of internal consistency and unidimensionality, both proposed by Cronbach (1951). I argue that coefficient α is a lower bound to reliability and that concepts of internal consistency and…
Brennan, Sue E; McKenzie, Joanne E; Turner, Tari; Redman, Sally; Makkar, Steve; Williamson, Anna; Haynes, Abby; Green, Sally E
2017-01-17
Capacity building strategies are widely used to increase the use of research in policy development. However, a lack of well-validated measures for policy contexts has hampered efforts to identify priorities for capacity building and to evaluate the impact of strategies. We aimed to address this gap by developing SEER (Seeking, Engaging with and Evaluating Research), a self-report measure of individual policymakers' capacity to engage with and use research. We used the SPIRIT Action Framework to identify pertinent domains and guide development of items for measuring each domain. Scales covered (1) individual capacity to use research (confidence in using research, value placed on research, individual perceptions of the value their organisation places on research, supporting tools and systems), (2) actions taken to engage with research and researchers, and (3) use of research to inform policy (extent and type of research use). A sample of policymakers engaged in health policy development provided data to examine scale reliability (internal consistency, test-retest) and validity (relation to measures of similar concepts, relation to a measure of intention to use research, internal structure of the individual capacity scales). Response rates were 55% (150/272 people, 12 agencies) for the validity and internal consistency analyses, and 54% (57/105 people, 9 agencies) for test-retest reliability. The individual capacity scales demonstrated adequate internal consistency reliability (alpha coefficients > 0.7, all four scales) and test-retest reliability (intra-class correlation coefficients > 0.7 for three scales and 0.59 for fourth scale). Scores on individual capacity scales converged as predicted with measures of similar concepts (moderate correlations of > 0.4), and confirmatory factor analysis provided evidence that the scales measured related but distinct concepts. 
Items in each of these four scales related as predicted to concepts in the measurement model derived from the SPIRIT Action Framework. Evidence about the reliability and validity of the research engagement actions and research use scales was equivocal. Initial testing of SEER suggests that the four individual capacity scales may be used in policy settings to examine current capacity and identify areas for capacity building. The relation between capacity, research engagement actions and research use requires further investigation.
Alber, Julia M; Bernhardt, Jay M; Stellefson, Michael; Weiler, Robert M; Anderson-Lewis, Charkarra; Miller, M David; MacInnes, Jann
2015-01-01
Background Social media can promote healthy behaviors by facilitating engagement and collaboration among health professionals and the public. Thus, social media is quickly becoming a vital tool for health promotion. While guidelines and trainings exist for public health professionals, there are currently no standardized measures to assess individual social media competency among Certified Health Education Specialists (CHES) and Master Certified Health Education Specialists (MCHES). Objective The aim of this study was to design, develop, and test the Social Media Competency Inventory (SMCI) for CHES and MCHES. Methods The SMCI was designed in three sequential phases: (1) Conceptualization and Domain Specifications, (2) Item Development, and (3) Inventory Testing and Finalization. Phase 1 consisted of a literature review, concept operationalization, and expert reviews. Phase 2 involved an expert panel (n=4) review, think-aloud sessions with a small representative sample of CHES/MCHES (n=10), a pilot test (n=36), and classical test theory analyses to develop the initial version of the SMCI. Phase 3 included a field test of the SMCI with a random sample of CHES and MCHES (n=353), factor and Rasch analyses, and development of SMCI administration and interpretation guidelines. Results Six constructs adapted from the unified theory of acceptance and use of technology and the integrated behavioral model were identified for assessing social media competency: (1) Social Media Self-Efficacy, (2) Social Media Experience, (3) Effort Expectancy, (4) Performance Expectancy, (5) Facilitating Conditions, and (6) Social Influence. The initial item pool included 148 items. After the pilot test, 16 items were removed or revised because of low item discrimination (r<.30), high interitem correlations (Ρ>.90), or based on feedback received from pilot participants. 
During the psychometric analysis of the field test data, 52 items were removed due to low discrimination, evidence of content redundancy, low R-squared value, or poor item infit or outfit. Psychometric analyses of the data revealed acceptable reliability evidence for the following scales: Social Media Self-Efficacy (alpha=.98, item reliability=.98, item separation=6.76), Social Media Experience (alpha=.98, item reliability=.98, item separation=6.24), Effort Expectancy (alpha=.74, item reliability=.95, item separation=4.15), Performance Expectancy (alpha=.81, item reliability=.99, item separation=10.09), Facilitating Conditions (alpha=.66, item reliability=.99, item separation=16.04), and Social Influence (alpha=.66, item reliability=.93, item separation=3.77). There was some evidence of local dependence among the scales, with several observed residual correlations above |.20|. Conclusions Through the multistage instrument-development process, sufficient reliability and validity evidence was collected in support of the purpose and intended use of the SMCI. The SMCI can be used to assess the readiness of health education specialists to effectively use social media for health promotion research and practice. Future research should explore associations across constructs within the SMCI and evaluate the ability of SMCI scores to predict social media use and performance among CHES and MCHES. PMID:26399428
Phylogenomic analyses data of the avian phylogenomics project.
Jarvis, Erich D; Mirarab, Siavash; Aberer, Andre J; Li, Bo; Houde, Peter; Li, Cai; Ho, Simon Y W; Faircloth, Brant C; Nabholz, Benoit; Howard, Jason T; Suh, Alexander; Weber, Claudia C; da Fonseca, Rute R; Alfaro-Núñez, Alonzo; Narula, Nitish; Liu, Liang; Burt, Dave; Ellegren, Hans; Edwards, Scott V; Stamatakis, Alexandros; Mindell, David P; Cracraft, Joel; Braun, Edward L; Warnow, Tandy; Jun, Wang; Gilbert, M Thomas Pius; Zhang, Guojie
2015-01-01
Determining the evolutionary relationships among the major lineages of extant birds has been one of the biggest challenges in systematic biology. To address this challenge, we assembled or collected the genomes of 48 avian species spanning most orders of birds, including all Neognathae and two of the five Palaeognathae orders. We used these genomes to construct a genome-scale avian phylogenetic tree and perform comparative genomic analyses. Here we present the datasets associated with the phylogenomic analyses, which include sequence alignment files consisting of nucleotides, amino acids, indels, and transposable elements, as well as tree files containing gene trees and species trees. Inferring an accurate phylogeny required generating: 1) A well annotated data set across species based on genome synteny; 2) Alignments with unaligned or incorrectly overaligned sequences filtered out; and 3) Diverse data sets, including genes and their inferred trees, indels, and transposable elements. Our total evidence nucleotide tree (TENT) data set (consisting of exons, introns, and UCEs) gave what we consider our most reliable species tree when using the concatenation-based ExaML algorithm or when using statistical binning with the coalescence-based MP-EST algorithm (which we refer to as MP-EST*). Other data sets, such as the coding sequence of some exons, revealed other properties of genome evolution, namely convergence. The Avian Phylogenomics Project is the largest vertebrate phylogenomics project to date that we are aware of. The sequence, alignment, and tree data are expected to accelerate analyses in phylogenomics and other related areas.
Barbosa, Taís de Souza; Gavião, Maria Beatriz Duarte
2015-01-01
To test the validity and reliability of the Brazilian Portuguese version of the Parental-Caregiver Perceptions Questionnaire (P-CPQ) (Aim 1) and to assess the agreement between parents and children concerning the child's oral health-related quality of life (OHRQoL) (Aim 2). The P-CPQ and the Brazilian Portuguese versions of the Child Perceptions Questionnaires (CPQ8-10 and CPQ11-14) were used. Aim 1 was addressed in a study that involved 210 (validity and internal reliability) and 20 (test-retest reliability) parents, and Aim 2 in a study that involved 210 pairs of parents and children. Construct validity was calculated using Spearman's correlation and the Mann-Whitney/Kruskal-Wallis tests. Reliability was determined using Cronbach's alpha and the intraclass correlation coefficient (ICC). Agreement between overall and subscale scores derived from the P-CPQ and CPQ was assessed in comparison and correlation analyses. The P-CPQ discriminated among the categories of malocclusion and dmft. The P-CPQ showed good construct validity, good internal consistency reliability, and excellent test-retest reliability. There was systematic under- and overreporting in parents' assessments for younger and older children, respectively. However, the magnitude of the directional differences was small. At the individual level, agreement between parents and children was excellent. However, it ranged from excellent to moderate or substantial in subscales for the CPQ8-10 and CPQ11-14 groups, respectively. The Portuguese version of the P-CPQ is valid and reliable. Some parents have limited knowledge about child OHRQoL. Given that parental and child reports measure different realities concerning the child's OHRQoL, information provided by parents can complement the child's evaluation. © 2015 American Association of Public Health Dentistry.
Reliability of reference distances used in photogrammetry.
Aksu, Muge; Kaya, Demet; Kocadereli, Ilken
2010-07-01
To determine the reliability of the reference distances used for photogrammetric assessment. The sample consisted of 100 subjects with a mean age of 22.97 ± 2.98 years. Five lateral and four frontal parameters were measured directly on the subjects' faces. For photogrammetric assessment, two reference distances for the profile view and three reference distances for the frontal view were established. Standardized photographs were taken and all the parameters that had been measured directly on the face were measured on the photographs. The reliability of the reference distances was checked by comparing direct and indirect values of the parameters obtained from the subjects' faces and photographs. Repeated-measures analysis of variance (ANOVA) and Bland-Altman analyses were used for statistical assessment. For profile measurements, the indirect values measured were statistically different from the direct values except for Sn-Sto in male subjects and Prn-Sn and Sn-Sto in female subjects. The indirect values of Prn-Sn and Sn-Sto were reliable in both sexes. The poorest results were obtained in the indirect values of the N-Sn parameter for female subjects and the Sn-Me parameter for male subjects according to the Sa-Sba reference distance. For frontal measurements, the indirect values were statistically different from the direct values in both sexes except for one in male subjects. The indirect values measured were not statistically different from the direct values for Go-Go. The indirect values of Ch-Ch were reliable in male subjects. The poorest results were obtained according to the P-P reference distance. For profile assessment, the T-Ex reference distance was reliable for Prn-Sn and Sn-Sto in both sexes. For frontal assessment, Ex-Ex and En-En reference distances were reliable for Ch-Ch in male subjects.
Lohrer, Heinz; Nauck, Tanja; Korakakis, Vasileios; Malliaropoulos, Nikos
2016-10-24
The FASH (Functional Assessment Scale for Acute Hamstring Injuries) questionnaire has been recently developed as a disease-specific self-administered questionnaire for use in Greek, English, and German languages. Its psychometric qualities (validity and reliability) were tested only in Greek-speaking patients mainly representing track and field athletes. As hamstring injuries represent the most common football injury, we tested the validity and reliability of the FASH-G (G = German version) questionnaire in German-speaking footballers suffering from acute hamstring injuries. The FASH-G questionnaire was tested for reliability and validity, in 16 footballers with hamstring injuries (patients' group), 77 asymptomatic footballers (healthy group), and 19 field hockey players (at-risk group). Known-group validity was tested by comparing the total FASH-G scores of the injured and non-injured groups. Reliability of the FASH-G questionnaire was analysed in 18 asymptomatic footballers using the intra-class coefficient. Known-group validity was demonstrated by significant differences between injured and non-injured participants (p < 0.001). The FASH-G exhibited very good test-retest reliability (intra-class correlation coefficient = 0.982, p < 0.001). Internal consistency was excellent (α = 0.938). Compared with the results presented in the original publication, no statistical differences were found between healthy athletes (p = 0.257), but patients' groups and at-risk groups presented scoring differences (p = 0.040 and <0.001, respectively). The FASH-G is a valid and reliable instrument to assess and determine the severity of hamstring injuries in German footballers.
Validation of a Malay Version of the Smartphone Addiction Scale among Medical Students in Malaysia
Sazlly Lim, Sazlyna Mohd; Wan Sulaiman, Wan Aliaa; Foo, Yoke Loong; Hoo, Fan kee
2015-01-01
Introduction This study was initiated to determine the psychometric properties of the Smart Phone Addiction Scale (SAS) by translating this scale into the Malay language (SAS-M), the main language spoken in Malaysia, and validating the translation. This study can distinguish smart phone and internet addiction among multi-ethnic Malaysian medical students. In addition, the reliability and validity of the SAS were also demonstrated. Materials and Methods A total of 228 participants were selected between August 2014 and September 2014 to complete a set of questionnaires, including the SAS and the modified Kimberly Young Internet addiction test (IAT) in the Malay language. Results There were 99 males and 129 females with ages ranging from 19 to 22 years old (21.7±1.1) included in this study. Descriptive and factor analyses, intra-class coefficients, t-tests and correlation analyses were conducted to verify the reliability and validity of the SAS. Bartlett’s test of sphericity was significant (p < 0.01), and the Kaiser-Meyer-Olkin measure of sampling adequacy for the SAS-M was 0.92, indicating that the factor analysis was appropriate. The internal consistency and concurrent validity of the SAS-M were verified (Cronbach’s alpha = 0.94). All of the subscales of the SAS-M, except for positive anticipation, were significantly related to the Malay version of the IAT. Conclusions This study developed the first smart phone addiction scale among medical students. This scale was shown to be reliable and valid in the Malay language. PMID:26431511
Moser, Debra K; Riegel, Barbara; McKinley, Sharon; Doering, Lynn V; Meischke, Hendrika; Heo, Seongkum; Lennie, Terry A; Dracup, Kathleen
2009-01-01
Perceived control is a construct with important theoretical and clinical implications for healthcare providers, yet practical application of the construct in research and clinical practice awaits development of an easily administered instrument to measure perceived control with evidence of reliability and validity. To test the psychometric properties of the Control Attitudes Scale-Revised (CAS-R) using a sample of 3,396 individuals with coronary heart disease, 513 patients with acute myocardial infarction, and 146 patients with heart failure. Analyses were done separately in each patient group. Reliability was assessed using Cronbach's alpha to determine internal consistency, and item homogeneity was assessed using item-total and interitem correlations. Validity was examined using principal component analysis and testing hypotheses about known associations. Cronbach's alpha values for the CAS-R in patients with coronary heart disease, acute myocardial infarction, and heart failure were all greater than .70. Item-total and interitem correlation coefficients for all items were acceptable in the groups. In factor analyses, the same single factor was extracted in all groups, and all items were loaded moderately or strongly to the factor in each group. As hypothesized in the final construct validity test, in all groups, patients with higher levels of perceived control had less depression and less anxiety compared with those of patients who had lower levels of perceived control. This study provides evidence of the reliability and validity of the 8-item CAS-R as a measure of perceived control in patients with cardiac illness and provides important insight into a key patient construct.
Nurses' Empowerment Scale for ICU patients' families: an instrument development study.
Li, Hong; Liu, Ya-Lan; Qiu, Li; Chen, Qiao-Ling; Wu, Jing-Bing; Chen, Li-Li; Li, Na
2016-09-01
Family members provide essential support for ICU patients, contributing to their mental and physical recovery. Empowering ICU patients' families may help them overcome inadequacies and meet their own and patients' acknowledged needs. Nursing should understand and address patients' families' empowerment status. To develop a tool, the Nurses' Empowerment Scale for Intensive Care Unit (ICU) Patients' Families (NESIPF), to help ICU nursing staff assess the empowerment status of patients' families. Four-phase instrument development study. A 19-item instrument was initially generated based on literature review and interviews with family members of ICU patients. The Delphi research method was applied to gain expert opinion and consensus via rounds of questionnaires. A panel of 27 experts experienced in critical care medicine, nursing and psychology participated in two Delphi rounds and their input helped formulate an 18-item pretest instrument. Families of 20 patients were recruited to examine instrument readability. After a 2-week interval, another 20 patients' families were recruited to examine test-retest reliability. Two hundred questionnaires were then administered and analysed to examine the instrument's construct validity, criterion-related validity and internal consistency. Expert authority coefficients of two Delphi rounds reached 0·89 and 0·91. Kendall's W coefficients of 0·113 (P < 0·001) in round 1 and 0·220 (P < 0·001) in round 2 indicated slight to fair agreement among experts. Content validity index (CVI) reached 1·0 for 12 items; the CVI for item 13 was <0·7 so it was excluded. Cronbach's α coefficient was 0·92, indicating acceptable internal consistency reliability. The coefficient of internal consistency of each dimension was 0·717-0·921. The Pearson correlation coefficient >0·9 (P < 0·05) showed an acceptable test-retest reliability.
The instrument has acceptable reliability and validity and can assess the empowerment status of families of critically ill patients. Knowledge of families' empowerment status may help to address their psychological needs and their ability to provide family support. © 2014 British Association of Critical Care Nurses.
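As an illustration of the agreement statistic reported in the abstract above, the following is a minimal sketch of Kendall's coefficient of concordance (W). It assumes each rater supplies a complete ranking of the same items with no ties, which is a simplification; the function name `kendalls_w` and the example rankings are hypothetical and are not taken from the study.

```python
def kendalls_w(rankings):
    """Kendall's W for m raters who each rank the same n items (1..n, no ties).

    Assumption: untied ranks. Rating data with ties would need the
    tie-corrected formula, which this sketch does not implement.
    """
    m = len(rankings)            # number of raters
    n = len(rankings[0])         # number of items ranked
    # Sum of the ranks each item received across all raters
    rank_sums = [sum(r[i] for r in rankings) for i in range(n)]
    # Expected rank sum per item if raters agreed no better than chance
    mean_sum = m * (n + 1) / 2
    # Sum of squared deviations of the rank sums from their mean
    s = sum((rs - mean_sum) ** 2 for rs in rank_sums)
    return 12 * s / (m ** 2 * (n ** 3 - n))


# Hypothetical rankings from three raters over four items
example = [[1, 2, 3, 4], [1, 3, 2, 4], [2, 1, 3, 4]]
print(round(kendalls_w(example), 3))
```

W runs from 0 (no agreement) to 1 (identical rankings), so values such as 0·113 and 0·220 sit near the low end of the scale.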
Hosford, Charles C; Siders, William A
2010-10-01
Strategies to facilitate learning include using knowledge of students' learning style preferences to inform students and their teachers. Aims of this study were to evaluate the factor structure, internal consistency, and temporal stability of medical student responses to the Index of Learning Styles (ILS) and determine its appropriateness as an instrument for medical education. The ILS assesses preferences on four dimensions: sensing/intuitive information perceiving, visual/verbal information receiving, active/reflective information processing, and sequential/global information understanding. Students entering the 2002-2007 classes completed the ILS; some completed the ILS again after 2 and 4 years. Analyses of responses supported the ILS's intended structure and moderate reliability. Students had moderate preferences for sensing and visual learning. This study provides evidence supporting the appropriateness of the ILS for assessing learning style preferences in medical students.
ERIC Educational Resources Information Center
Green, Samuel B.; Yang, Yanyun
2015-01-01
In the lead article, Davenport, Davison, Liou, & Love demonstrate the relationship among homogeneity, internal consistency, and coefficient alpha, and also distinguish among them. These distinctions are important because too often coefficient alpha--a reliability coefficient--is interpreted as an index of homogeneity or internal consistency.…
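Because several records in this listing report coefficient alpha, a minimal sketch of how it is usually computed may be helpful. It uses the standard formula α = k/(k − 1) · (1 − Σ item variances / variance of total score); the function name `cronbach_alpha` and the example scores are hypothetical and do not come from any of the cited studies.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Coefficient alpha for a respondents-by-items table of scores.

    scores: list of rows, one row per respondent, each holding k item scores.
    """
    k = len(scores[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Sample variance of each item across respondents
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    # Sample variance of each respondent's total (summed) score
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Hypothetical two-item scale answered by four respondents
example = [[1, 2], [2, 1], [3, 3], [4, 4]]
print(round(cronbach_alpha(example), 3))
```

The formula uses only item and total-score variances, which is consistent with the point made in the abstract above: alpha is a reliability coefficient and does not by itself show that the items are homogeneous.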
Hagman, Brett T
2017-11-01
The Diagnostic and Statistical Manual of Mental Disorders (5th edition) Alcohol Use Disorder (DSM-5 AUD) criteria have been modified to reflect a single, continuous disorder. It is critical that we develop brief assessment measures that can accurately assess DSM-5 AUD criteria in college students to assist in screening, referral, and brief intervention services implemented on college campuses. The present study sought to develop a brief 13-item measure designed to capture the full spectrum of the DSM-5 AUD criteria in a sample of college students and to assess its psychometric properties. Participants were past-year drinkers (N = 923) between the ages of 18 and 30 enrolled at 3 universities. Respondents completed a 30-min anonymous battery of questionnaires online. The Brief DSM-5 AUD Assessment consisted of 13 items designed to reflect the DSM-5 AUD criteria. Results indicated a high degree of internal consistency reliability with high item-to-scale correlations. Confirmatory factor analyses indicated that a dominant single factor emerged with good model fit. The Item Response Theory (IRT) analyses indicated that the difficulty parameters for each criterion were intermixed along the upper portion of the underlying AUD severity continuum, and the discrimination parameters were all high. Additional analysis indicated that those with a DSM-5 AUD had greater levels of alcohol and other drug use and problem severity in comparison to those without a DSM-5 AUD. Study findings provide empirical support for the reliability and validity of the Brief 13-item DSM-5 Assessment. It should be routinely included in research and clinical practice efforts. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
2013-01-01
Background Mindful-based interventions improve functioning and quality of life in fibromyalgia (FM) patients. The aim of the study is to perform a psychometric analysis of the Spanish version of the Mindful Attention Awareness Scale (MAAS) in a sample of patients diagnosed with FM. Methods The following measures were administered to 251 Spanish patients with FM: the Spanish version of MAAS, the Chronic Pain Acceptance Questionnaire, the Pain Catastrophising Scale, the Injustice Experience Questionnaire, the Psychological Inflexibility in Pain Scale, the Fibromyalgia Impact Questionnaire and the Euroqol. Factorial structure was analysed using Confirmatory Factor Analyses (CFA). Cronbach's α coefficient was calculated to examine internal consistency, and the intraclass correlation coefficient (ICC) was calculated to assess the test-retest reliability of the measures. Pearson’s correlation tests were run to evaluate univariate relationships between scores on the MAAS and criterion variables. Results The MAAS scores in our sample were low (M = 56.7; SD = 17.5). CFA confirmed a two-factor structure, with the following fit indices [SBχ² = 172.34 (p < 0.001), CFI = 0.95, GFI = 0.90, SRMR = 0.05, RMSEA = 0.06]. MAAS was found to have high internal consistency (Cronbach’s α = 0.90) and adequate test-retest reliability at a 1–2 week interval (ICC = 0.90). It showed significant and expected correlations with the criterion measures with the exception of the Euroqol (Pearson = 0.15). Conclusion Psychometric properties of the Spanish version of the MAAS in patients with FM are adequate. The dimensionality of the MAAS found in this sample and directions for future research are discussed. PMID:23317306
Bayard, Sophie; Lebrun, Cindy; Maudarbocus, Khaalid Hassan; Schellaert, Vanessa; Joffre, Alicia; Ferrante, Esther; Le Louedec, Marie; Cournoulat, Alice; Gely-Nargeot, Marie-Christine; Luik, Annemarie I
2017-12-01
Insomnia disorder is frequent in the population, yet there is no French screening instrument available that is based on the updated DSM-5 criteria. We evaluated the validity and reliability of the French version of an insomnia screening instrument based on DSM-5 criteria, the Sleep Condition Indicator, in a population-based sample of adults. A total of 366 community-dwelling participants completed a face-to-face clinical interview to determine insomnia disorder against DSM-5 criteria and several questionnaires including the French Sleep Condition Indicator version. Three-hundred and twenty-nine participants completed the Sleep Condition Indicator again after 1 month. Statistical analyses were performed to determine the reliability, construct validity, divergent validity and temporal stability of the French translation of the Sleep Condition Indicator. In addition, an explanatory factor analysis was performed to assess the underlying structure. The internal consistency (α = 0.87) and temporal stability (r = 0.86, P < 0.001) of the French Sleep Condition Indicator were high. When using the previously defined cut-off value of ≤ 16, the area under the receiver operating characteristic curve was 0.93 with a sensitivity of 95% and a specificity of 75%. Additionally, good construct and divergent validity were demonstrated. The factor analyses showed a two-factor structure with a focus on sleep and daytime effects. The French version of the Sleep Condition Indicator demonstrates satisfactory psychometric properties while being a useful instrument in detecting cases of insomnia disorder, consistent with features of DSM-5, in the general population. © 2017 European Sleep Research Society.
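The cut-off analysis described above can be sketched as follows. This is only an illustration: it assumes that a lower Sleep Condition Indicator score means poorer sleep, so that a score at or below the cut-off counts as a positive screen, and the function name `screen_accuracy` and the scores are hypothetical rather than study data.

```python
def screen_accuracy(scores, has_disorder, cutoff=16):
    """Sensitivity and specificity of a 'score <= cutoff' screening rule.

    scores: questionnaire totals; has_disorder: reference diagnosis (bool),
    e.g. from a clinical interview. Both lists are aligned by participant.
    """
    tp = fn = tn = fp = 0
    for score, diagnosed in zip(scores, has_disorder):
        screened_positive = score <= cutoff
        if diagnosed and screened_positive:
            tp += 1          # case correctly flagged
        elif diagnosed:
            fn += 1          # case missed by the screen
        elif screened_positive:
            fp += 1          # non-case wrongly flagged
        else:
            tn += 1          # non-case correctly cleared
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical scores and interview diagnoses for six participants
sens, spec = screen_accuracy([10, 16, 17, 20, 12, 25],
                             [True, True, True, False, False, False])
print(round(sens, 2), round(spec, 2))
```

Repeating this calculation over every candidate cut-off and plotting sensitivity against 1 − specificity gives the receiver operating characteristic curve whose area is reported in the abstract.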
ERIC Educational Resources Information Center
Park, Bitnara Jasmine; Irvin, P. Shawn; Lai, Cheng-Fei; Alonzo, Julie; Tindal, Gerald
2012-01-01
In this technical report, we present the results of a reliability study of the fifth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
ERIC Educational Resources Information Center
Lai, Cheng-Fei; Irvin, P. Shawn; Alonzo, Julie; Park, Bitnara Jasmine; Tindal, Gerald
2012-01-01
In this technical report, we present the results of a reliability study of the second-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
ERIC Educational Resources Information Center
Park, Bitnara Jasmine; Irvin, P. Shawn; Alonzo, Julie; Lai, Cheng-Fei; Tindal, Gerald
2012-01-01
In this technical report, we present the results of a reliability study of the fourth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
ERIC Educational Resources Information Center
Irvin, P. Shawn; Alonzo, Julie; Park, Bitnara Jasmine; Lai, Cheng-Fei; Tindal, Gerald
2012-01-01
In this technical report, we present the results of a reliability study of the sixth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
ERIC Educational Resources Information Center
Irvin, P. Shawn; Alonzo, Julie; Lai, Cheng-Fei; Park, Bitnara Jasmine; Tindal, Gerald
2012-01-01
In this technical report, we present the results of a reliability study of the seventh-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
ERIC Educational Resources Information Center
Lai, Cheng-Fei; Irvin, P. Shawn; Park, Bitnara Jasmine; Alonzo, Julie; Tindal, Gerald
2012-01-01
In this technical report, we present the results of a reliability study of the third-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Lehotkay, R; Saraswathi Devi, T; Raju, M V R; Bada, P K; Nuti, S; Kempf, N; Carminati, G Galli
2015-03-01
In this study, carried out in collaboration with the department of psychology and parapsychology of Andhra University, the Aberrant Behavior Checklist-Community (ABC-C) was validated in Telugu, the official language of Andhra Pradesh, one of India's 28 states. To assess the factor validity and reliability of this Telugu version, 120 participants with moderate to profound intellectual disability (94 men and 26 women, mean age 25.2, SD 7.1) were rated by the staff of the Lebenshilfe Institution for Mentally Handicapped in Visakhapatnam, Andhra Pradesh, India. Rating data were analysed with a confirmatory factor analysis. The internal consistency was estimated by Cronbach's alpha. To confirm the test-retest reliability, 50 participants were rated twice with an interval of 4 weeks, and 50 were rated by pairs of raters to assess inter-rater reliability. Confirmatory factor analysis revealed that the root mean square error of approximation (RMSEA) was equal to 0.06, the comparative fit index (CFI) was equal to 0.77, and the Tucker Lewis index (TLI) was equal to 0.77, which indicated that the model with five correlated factors had a good fit. Coefficient alpha ranged from 0.85 to 0.92 across the five subscales. Spearman's rank correlation coefficients for inter-rater reliability tests ranged from 0.65 to 0.75, and the correlations for test-retest reliability ranged from 0.58 to 0.76. All reliability coefficients were statistically significant (P < 0.01). The Telugu version of the ABC-C showed factor validity and reliability comparable to the original English version and appears to be useful for assessing behaviour disorders in Indian people with intellectual disabilities. © 2014 MENCAP and International Association of the Scientific Study of Intellectual and Developmental Disabilities and John Wiley & Sons Ltd.
Teresi, Jeanne A; Ocepek-Welikson, Katja; Cook, Karon F; Kleinman, Marjorie; Ramirez, Mildred; Reid, M Carrington; Siu, Albert
2016-01-01
Reducing the response burden of standardized pain measures is desirable, particularly for individuals who are frail or live with chronic illness, e.g., those suffering from cancer and those in palliative care. The Patient Reported Outcome Measurement Information System ® (PROMIS ® ) project addressed this issue with the provision of computerized adaptive tests (CAT) and short form measures that can be used clinically and in research. Although there has been substantial evaluation of PROMIS item banks, little is known about the performance of PROMIS short forms, particularly in ethnically diverse groups. Reviewed in this article are findings related to the differential item functioning (DIF) and reliability of the PROMIS pain interference short forms across diverse sociodemographic groups. DIF hypotheses were generated for the PROMIS short form pain interference items. Initial analyses tested item response theory (IRT) model assumptions of unidimensionality and local independence. Dimensionality was evaluated using factor analytic methods; local dependence (LD) was tested using IRT-based LD indices. Wald tests were used to examine group differences in IRT parameters, and to test DIF hypotheses. A second DIF-detection method used in sensitivity analyses was based on ordinal logistic regression with a latent IRT-derived conditioning variable. Magnitude and impact of DIF were investigated, and reliability and item and scale information statistics were estimated. The reliability of the short form item set was excellent. However, there were a few items with high local dependency, which affected the estimation of the final discrimination parameters. As a result, the item, "How much did pain interfere with enjoyment of social activities?" was excluded in the DIF analyses for all subgroup comparisons. 
No items were hypothesized to show DIF for race and ethnicity; however, five items showed DIF after adjustment for multiple comparisons in both primary and sensitivity analyses: ability to concentrate, enjoyment of recreational activities, tasks away from home, participation in social activities, and socializing with others. The magnitude of DIF was small and the impact negligible. Three items were consistently identified with DIF for education: enjoyment of life, ability to concentrate, and enjoyment of recreational activities. No item showed DIF above the magnitude threshold and the impact of DIF on the overall measure was minimal. No item showed gender DIF after correction for multiple comparisons in the primary analyses. Four items showed consistent age DIF: enjoyment of life, ability to concentrate, day to day activities, and enjoyment of recreational activities, none with primary magnitude values above threshold. Conditional on the pain state, Spanish speakers were hypothesized to report less pain interference on one item, enjoyment of life. The DIF findings confirmed the hypothesis; however, the magnitude was small. Using an arbitrary cutoff point of theta ( θ ) ≥ 1.0 to classify respondents with acute pain interference, the highest number of changes were for the education groups analyses. There were 231 respondents (4% of the total sample) who changed from the designation of no acute pain interference to acute interference after the DIF adjustment. There was no change in the designations for race/ethnic subgroups, and a small number of changes for respondents aged 65 to 84. Although significant DIF was observed after correction for multiple comparisons, all DIF was of low magnitude and impact. However, some individual-level impact was observed for low education groups. Reliability estimates were high. 
Thus, the PROMIS short form pain items examined in this ethnically diverse sample performed relatively well; although one item was problematic and removed from the analyses. It is concluded that the majority of the PROMIS pain interference short form items can be recommended for use among ethnically diverse groups, including those in palliative care and with cancer and chronic illness.
Teresi, Jeanne A.; Ocepek-Welikson, Katja; Cook, Karon F.; Kleinman, Marjorie; Ramirez, Mildred; Reid, M. Carrington; Siu, Albert
2017-01-01
Reducing the response burden of standardized pain measures is desirable, particularly for individuals who are frail or live with chronic illness, e.g., those suffering from cancer and those in palliative care. The Patient Reported Outcome Measurement Information System® (PROMIS®) project addressed this issue with the provision of computerized adaptive tests (CAT) and short form measures that can be used clinically and in research. Although there has been substantial evaluation of PROMIS item banks, little is known about the performance of PROMIS short forms, particularly in ethnically diverse groups. Reviewed in this article are findings related to the differential item functioning (DIF) and reliability of the PROMIS pain interference short forms across diverse sociodemographic groups. Methods DIF hypotheses were generated for the PROMIS short form pain interference items. Initial analyses tested item response theory (IRT) model assumptions of unidimensionality and local independence. Dimensionality was evaluated using factor analytic methods; local dependence (LD) was tested using IRT-based LD indices. Wald tests were used to examine group differences in IRT parameters, and to test DIF hypotheses. A second DIF-detection method used in sensitivity analyses was based on ordinal logistic regression with a latent IRT-derived conditioning variable. Magnitude and impact of DIF were investigated, and reliability and item and scale information statistics were estimated. Results The reliability of the short form item set was excellent. However, there were a few items with high local dependency, which affected the estimation of the final discrimination parameters. As a result, the item, “How much did pain interfere with enjoyment of social activities?” was excluded in the DIF analyses for all subgroup comparisons. 
No items were hypothesized to show DIF for race and ethnicity; however, five items showed DIF after adjustment for multiple comparisons in both primary and sensitivity analyses: ability to concentrate, enjoyment of recreational activities, tasks away from home, participation in social activities, and socializing with others. The magnitude of DIF was small and the impact negligible. Three items were consistently identified with DIF for education: enjoyment of life, ability to concentrate, and enjoyment of recreational activities. No item showed DIF above the magnitude threshold and the impact of DIF on the overall measure was minimal. No item showed gender DIF after correction for multiple comparisons in the primary analyses. Four items showed consistent age DIF: enjoyment of life, ability to concentrate, day to day activities, and enjoyment of recreational activities, none with primary magnitude values above threshold. Conditional on the pain state, Spanish speakers were hypothesized to report less pain interference on one item, enjoyment of life. The DIF findings confirmed the hypothesis; however, the magnitude was small. Using an arbitrary cutoff point of theta (θ) ≥ 1.0 to classify respondents with acute pain interference, the highest number of changes were for the education groups analyses. There were 231 respondents (4% of the total sample) who changed from the designation of no acute pain interference to acute interference after the DIF adjustment. There was no change in the designations for race/ethnic subgroups, and a small number of changes for respondents aged 65 to 84. Conclusions Although significant DIF was observed after correction for multiple comparisons, all DIF was of low magnitude and impact. However, some individual-level impact was observed for low education groups. Reliability estimates were high. 
Thus, the PROMIS short form pain items examined in this ethnically diverse sample performed relatively well; although one item was problematic and removed from the analyses. It is concluded that the majority of the PROMIS pain interference short form items can be recommended for use among ethnically diverse groups, including those in palliative care and with cancer and chronic illness. PMID:28983449
Skinner, Ian W; Hübscher, Markus; Moseley, G Lorimer; Lee, Hopin; Wand, Benedict M; Traeger, Adrian C; Gustin, Sylvia M; McAuley, James H
2017-08-15
Eyetracking is commonly used to investigate attentional bias. Although some studies have investigated the internal consistency of eyetracking, data are scarce on the test-retest reliability and agreement of eyetracking to investigate attentional bias. This study reports the test-retest reliability, measurement error, and internal consistency of 12 commonly used outcome measures thought to reflect the different components of attentional bias: overall attention, early attention, and late attention. Healthy participants completed a preferential-looking eyetracking task that involved the presentation of threatening (sensory words, general threat words, and affective words) and nonthreatening words. We used intraclass correlation coefficients (ICCs) to measure test-retest reliability (ICC > .70 indicates adequate reliability). The ICCs(2, 1) ranged from -.31 to .71. Reliability varied according to the outcome measure and threat word category. Sensory words had a lower mean ICC (.08) than either affective words (.32) or general threat words (.29). A longer exposure time was associated with higher test-retest reliability. All of the outcome measures, except second-run dwell time, demonstrated low measurement error (<6%). Most of the outcome measures reported high internal consistency (α > .93). Recommendations are discussed for improving the reliability of eyetracking tasks in future research.
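For readers unfamiliar with the ICC(2, 1) statistic used above, here is a minimal sketch based on the Shrout and Fleiss two-way random-effects, absolute-agreement, single-measurement form. The function name `icc_2_1` and the example data are hypothetical; the study's own software and data are not known from the abstract.

```python
def icc_2_1(scores):
    """ICC(2, 1) from a subjects-by-sessions table (rows = participants).

    Uses the two-way ANOVA mean squares:
    (MSR - MSE) / (MSR + (k - 1) * MSE + k * (MSC - MSE) / n)
    """
    n = len(scores)              # participants
    k = len(scores[0])           # sessions (or raters)
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]
    # Sums of squares for participants, sessions, and the residual
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_error = ss_total - ss_rows - ss_cols
    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


# Hypothetical dwell times (arbitrary units) for three participants, two sessions
print(round(icc_2_1([[1, 2], [2, 3], [3, 4]]), 3))
```

In the example every participant scores exactly one unit higher in the second session, so the rank order is perfectly preserved but the ICC is still below 1; absolute-agreement forms penalise systematic shifts between sessions. The estimate can also fall below zero when within-participant variation exceeds between-participant variation, as in the lower end of the range reported above.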
Quality and rigor of the concept mapping methodology: a pooled study analysis.
Rosas, Scott R; Kane, Mary
2012-05-01
The use of concept mapping in research and evaluation has expanded dramatically over the past 20 years. Researchers in academic, organizational, and community-based settings have applied concept mapping successfully without the benefit of systematic analyses across studies to identify the features of a methodologically sound study. Quantitative characteristics and estimates of quality and rigor that may guide future studies are lacking. To address this gap, we conducted a pooled analysis of 69 concept mapping studies to describe characteristics across study phases, generate specific indicators of validity and reliability, and examine the relationship between select study characteristics and quality indicators. Individual study characteristics and estimates were pooled and quantitatively summarized, describing the distribution, variation and parameters for each. In addition, variation in the concept mapping data collection in relation to characteristics and estimates was examined. Overall, results suggest concept mapping yields strong internal representational validity and very strong sorting and rating reliability estimates. Validity and reliability were consistently high despite variation in participation and task completion percentages across data collection modes. The implications of these findings as a practical reference to assess the quality and rigor for future concept mapping studies are discussed. Copyright © 2011 Elsevier Ltd. All rights reserved.
Overgaauw, Sandy; Rieffe, Carolien; Broekhof, Evelien; Crone, Eveline A.; Güroğlu, Berna
2017-01-01
Empathy plays a crucial role in healthy social functioning and in maintaining positive social relationships. In this study, 1250 children and adolescents (10–15 year olds) completed the newly developed Empathy Questionnaire for Children and Adolescents (EmQue-CA) that was tested on reliability, construct validity, convergent validity, and concurrent validity. The EmQue-CA aims to assess empathy using the following scales: affective empathy, cognitive empathy, and intention to comfort. A Principal Components Analysis, which was directly tested with a Confirmatory Factor Analysis, confirmed the proposed three-factor model resulting in 14 final items. Reliability analyses demonstrated high internal consistency of the scales. Furthermore, the scales showed high convergent validity, as they were positively correlated with related scales of the Interpersonal Reactivity Index (Davis, 1983). With regard to concurrent validity, higher empathy was related to more attention to others’ emotions, higher friendship quality, less focus on own affective state, and lower levels of bullying behavior. Taken together, we show that the EmQue-CA is a reliable and valid instrument to measure empathy in typically developing children and adolescents aged 10 and older. PMID:28611713
Development, validity, and reliability of a ballet-specific aerobic fitness test.
Twitchett, Emily; Nevill, Alan; Angioi, Manuela; Koutedakis, Yiannis; Wyon, Matthew
2011-09-01
The aim of this study was to develop and assess the reliability and validity of a multi-stage, ballet-specific aerobic fitness test to be used in a dance studio setting. The test consists of five stages, each four minutes long, that increase in intensity. It uses classical ballet movement of an intermediate level of difficulty, thus emphasizing physiological demand rather than skill. The demand of each stage was determined by calculating the mean oxygen uptake during its final minute using a portable gas analyser. After an initial familiarization period, eight female subjects performed the test twice within seven days. The results showed significant differences in oxygen consumption between stages (p < 0.001), but not between trials. Pearson correlation coefficients produced a very good linear relationship between trials (r = 0.998, p < 0.001). Bland-Altman reliability analysis revealed the 95% limits of agreement to be ± 6.2 ml·kg⁻¹·min⁻¹, showing good agreement between trials. The oxygen uptake in our subjects equated positively to previous estimates for class and performance, confirming validity. It was concluded that the test is suitable for use among classical ballet dancers, with many possible applications.
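Bland-Altman 95% limits of agreement, as used above, are conventionally the mean of the between-trial differences plus or minus 1.96 times their sample standard deviation. A minimal sketch of that calculation follows. The function name and the oxygen-uptake values are hypothetical and are not the study's data.

```python
from statistics import mean, stdev


def bland_altman_limits(trial1, trial2):
    """Return (bias, lower limit, upper limit) for paired measurements."""
    diffs = [a - b for a, b in zip(trial1, trial2)]
    bias = mean(diffs)                  # systematic difference between trials
    half_width = 1.96 * stdev(diffs)    # 1.96 x sample SD of the differences
    return bias, bias - half_width, bias + half_width


# Hypothetical oxygen uptake (ml/kg/min) for four dancers on two trials.
trial_a = [40.0, 42.0, 38.0, 45.0]
trial_b = [39.0, 43.0, 38.0, 44.0]
bias, lower, upper = bland_altman_limits(trial_a, trial_b)
print(f"bias={bias:.2f}, limits=({lower:.2f}, {upper:.2f})")
```

Unlike a Pearson correlation, these limits are expressed in the measurement's own units, which is why the two statistics are usually reported together.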
A novel and reliable computational intelligence system for breast cancer detection.
Zadeh Shirazi, Amin; Seyyed Mahdavi Chabok, Seyyed Javad; Mohammadi, Zahra
2018-05-01
Cancer is the second most important cause of morbidity and mortality among women, and breast cancer is the most frequently occurring type. This paper suggests a hybrid computational intelligence model based on unsupervised and supervised learning techniques, i.e., self-organizing map (SOM) and complex-valued neural network (CVNN), for reliable detection of breast cancer. The dataset used in this paper consists of 822 patients with five features (patient's breast mass shape, margin, density, patient's age, and Breast Imaging Reporting and Data System assessment). The proposed model, used here for the first time, operates in two stages. In the first stage, the SOM technique was used to cluster the most similar patients on the basis of the input features. In the second stage, for each cluster, the patients' features were fed to the complex-valued neural network to classify breast cancer severity (benign or malignant). The results obtained for each patient were compared with the medical diagnosis results using receiver operating characteristic analyses and a confusion matrix. In the testing phase, health and disease detection ratios were 94 and 95%, respectively. Accordingly, the proposed model was judged superior and can be used for reliable and robust detection of breast cancer.
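The health and disease detection ratios quoted above can be read from a confusion matrix. The sketch below assumes they correspond to specificity (benign cases correctly identified) and sensitivity (malignant cases correctly identified); that mapping is an interpretation, not something the abstract states. The function name and labels are illustrative.

```python
def detection_ratios(actual, predicted, positive="malignant"):
    """Return (sensitivity, specificity) from paired label lists."""
    tp = fn = tn = fp = 0
    for truth, guess in zip(actual, predicted):
        if truth == positive:
            if guess == positive:
                tp += 1      # disease present and detected
            else:
                fn += 1      # disease present but missed
        else:
            if guess == positive:
                fp += 1      # healthy but flagged as disease
            else:
                tn += 1      # healthy and correctly cleared
    sensitivity = tp / (tp + fn)   # assumed "disease detection ratio"
    specificity = tn / (tn + fp)   # assumed "health detection ratio"
    return sensitivity, specificity


# Hypothetical diagnoses versus model output for five patients.
truth = ["malignant", "malignant", "benign", "benign", "benign"]
model = ["malignant", "benign", "benign", "benign", "malignant"]
print(detection_ratios(truth, model))
```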
Fault tree models for fault tolerant hypercube multiprocessors
NASA Technical Reports Server (NTRS)
Boyd, Mark A.; Tuazon, Jezus O.
1991-01-01
Three candidate fault tolerant hypercube architectures are modeled, their reliability analyses are compared, and the resulting implications of these methods of incorporating fault tolerance into hypercube multiprocessors are discussed. In the course of performing the reliability analyses, the use of HARP and fault trees in modeling sequence dependent system behaviors is demonstrated.
NASA Astrophysics Data System (ADS)
Erdogan, Ibrahim; Campbell, Todd; Hashidah Abd-Hamid, Nor
2011-07-01
This study describes the development of an instrument to investigate the extent to which student-centered actions are occurring in science classrooms. The instrument was developed through the following five stages: (1) student action identification, (2) use of both national and international content experts to establish content validity, (3) refinement of the item pool based on reviewer comments, (4) pilot testing of the instrument, and (5) statistical reliability and item analysis leading to additional refinement and finalization of the instrument. In the field test, the instrument consisted of 26 items separated into four categories originally derived from student-centered instruction literature and used by the authors to sort student actions in previous research. The SACS was administered across 22 Grade 6-8 classrooms by 22 groups of observers, with a total of 67 SACS ratings completed. The finalized instrument was found to be internally consistent, with acceptable estimates from inter-rater intraclass correlation reliability coefficients at the p < 0.01 level. After the final stage of development, the SACS instrument consisted of 24 items separated into three categories, which aligned with the factor analysis clustering of the items. Additionally, concurrent validity of the SACS was established with the Reformed Teaching Observation Protocol. Based on the analyses completed, the SACS appears to be a useful instrument for inclusion in comprehensive assessment packages for illuminating the extent to which student-centered actions are occurring in science classrooms.
Le, Minh Thi Hong; Tran, Thach Duc; Holton, Sara; Nguyen, Huong Thanh; Wolfe, Rory; Fisher, Jane
2017-01-01
To assess the internal consistency, latent structure and convergent validity of the Depression, Anxiety and Stress Scale-21 (DASS-21) among adolescents in Vietnam. An anonymous, self-completed questionnaire was conducted among 1,745 high school students in Hanoi, Vietnam between October, 2013 and January, 2014. Confirmatory factor analyses were performed to assess the latent structure of the DASS-21. Factorial invariance between girls and boys was examined. Cronbach alphas and correlation coefficients between DASS-21 factor scores and the domain scores of the Duke Health Profile Adolescent Vietnamese validated version (ADHP-V) were calculated to assess DASS-21 internal consistency and convergent validity. A total of 1,606/1,745 (92.6%) students returned the questionnaire. Of those, 1,387 students provided complete DASS-21 data. The scale demonstrated adequate internal consistency (Cronbach α: 0.761 to 0.906). A four-factor model showed the best fit to the data. Items loaded significantly on a common general distress factor, the depression, and the anxiety factors, but few on the stress factor (p<0.05). DASS-21 convergent validity was confirmed with moderate correlation coefficients (-0.47 to -0.66) between its factor scores and the ADHP-V mental health related domains. The DASS-21 is reliable and suitable for use to assess symptoms of common mental health problems, especially depression and anxiety among Vietnamese adolescents. However, its ability in detecting stress among these adolescents may be limited. Further research is warranted to explore these results.
Cazenave, N; Paquette, L
2010-10-01
In French-speaking countries, the concept of sensation seeking has been most widely assessed using the Zuckerman Sensation Seeking Scale form V (SSS), since this instrument was validated (in French) more than 15 years ago. This instrument has received several criticisms that bear on its internal and external consistency. Indeed, although many researchers have found the SSSV to be valid and useful, five limitations of conception and form (e.g., tautological relationships, a forced-choice format, out-of-date language in some items) could weaken the conclusions that can be drawn from studies in which it has been used. Arnett thus developed a new measurement (Arnett Inventory of Sensation Seeking, AISS) based on a new conceptualization of sensation seeking, which is characterized by the need for novelty and intensity of stimulation, whereas sensation seeking, as developed by Zuckerman, is marked by a need for novelty and complexity of stimulation. The AISS has been translated and validated in Spanish and in German. Both studies found support for the bi-dimensional structure of the instrument. Currently, there is no French-speaking version of the AISS, and because of the cultural differences between English- and French-speaking populations, we cannot simply translate the instrument without examining the reliability and the factorial validity. Hence, we followed the seven steps of the cross-cultural validation methodology for psychological questionnaires presented by Vallerand. Questionnaires were distributed to 782 young adults. Out of these questionnaires, 737 (94%) were returned. One hundred and sixteen questionnaires were removed because of missing data. Thus, a total of 621 young adults were included in the study. They were aged from 18 to 28 years (M=23.32, SD=2.79). They completed the SSS and the AISS.
We conducted a confirmatory factor analysis (CFA) on the data set, using Amos 6.0, to assess the validity of the bi-dimensional structure; we also examined the internal consistencies, and tested the potential gender differences. The analyses show that the fit indices, associated with the model with 20 items proposed by Arnett, were poor. We therefore had to modify it and delete some items in order to provide a more satisfactory account of the data. The fit indices from the confirmatory factor analysis were adequate for a two-factor structure with six items on each subscale. Pearson's correlation coefficients supported convergent validity of the questionnaire. Internal consistency reliabilities (Cronbach's α) were calculated for each of the factors and for the total scale. The reliability coefficients for the Intensity and Novelty subscales were 0.621 and 0.567, respectively, whereas the reliability of the overall scale was 0.646. In order to assess the differences between the sexes, we carried out a multivariate analysis of variance with gender as the independent variable, and intensity, novelty and the total score of the revised AISS as dependent variables. Men scored higher than women on the Total Scale and on the Intensity subscale, but no gender difference was found on the Novelty subscale. These findings replicated research supporting the construct validity and reliability of the AISS in previous psychometric examinations. The results of this preliminary study yielded sufficient support for the validity of the French translation of the AISS, but further analyses, such as test-retest reliability and discriminant validity, should be conducted. Copyright © 2010 L'Encéphale, Paris. Published by Elsevier Masson SAS. All rights reserved.
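Cronbach's α, reported above for each subscale and for the total scale, is commonly computed as k/(k − 1) × (1 − Σ item variances / variance of the total score), where k is the number of items. The sketch below applies that standard formula to a respondents-by-items table. The function name and the responses are hypothetical and are not the study's data.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a table with one row per respondent, one column per item."""
    k = len(responses[0])   # number of items on the (sub)scale
    item_variances = [variance([row[j] for row in responses]) for j in range(k)]
    total_variance = variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical answers from three respondents to a two-item subscale.
answers = [
    [1, 2],
    [2, 1],
    [3, 3],
]
print(round(cronbach_alpha(answers), 3))
```

With only six items per subscale, as in the revised instrument, α tends to be lower than for longer scales even when inter-item correlations are similar, which is worth keeping in mind when reading the 0.567 to 0.646 range.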
de Souza, Jonas A; Yap, Bonnie J; Wroblewski, Kristen; Blinder, Victoria; Araújo, Fabiana S; Hlubocky, Fay J; Nicholas, Lauren H; O'Connor, Jeremy M; Brockstein, Bruce; Ratain, Mark J; Daugherty, Christopher K; Cella, David
2017-02-01
Cancer and its treatment lead to increased financial distress for patients. To the authors' knowledge, to date, no standardized patient-reported outcome measure has been validated to assess this distress. Patients with AJCC Stage IV solid tumors receiving chemotherapy for at least 2 months were recruited. Financial toxicity was measured by the COmprehensive Score for financial Toxicity (COST) measure. The authors collected data regarding patient characteristics, clinical trial participation, health care use, willingness to discuss costs, psychological distress (Brief Profile of Mood States [POMS]), and health-related quality of life (HRQOL) as measured by the Functional Assessment of Cancer Therapy: General (FACT-G) and the European Organization for Research and Treatment of Cancer (EORTC) QOL questionnaires. Test-retest reliability, internal consistency, and validity of the COST measure were assessed using standard-scale construction techniques. Associations between the resulting factors and other variables were assessed using multivariable analyses. A total of 375 patients with advanced cancer were approached, 233 of whom (62.1%) agreed to participate. The COST measure demonstrated high internal consistency and test-retest reliability. Factor analyses revealed a coherent, single, latent variable (financial toxicity). COST values were found to be correlated with income (correlation coefficient [r] = 0.28; P<.001), psychosocial distress (r = -0.26; P<.001), and HRQOL, as measured by the FACT-G (r = 0.42; P<.001) and by the EORTC QOL instruments (r = 0.33; P<.001). Independent factors found to be associated with financial toxicity were race (P = .04), employment status (P<.001), income (P = .003), number of inpatient admissions (P = .01), and psychological distress (P = .003). Willingness to discuss costs was not found to be associated with the degree of financial distress (P = .49). 
The COST measure demonstrated reliability and validity in measuring financial toxicity. Its correlation with HRQOL indicates that financial toxicity is a clinically relevant patient-centered outcome. Cancer 2017;123:476-484. © 2016 American Cancer Society. © 2016 The Authors. Cancer published by Wiley Periodicals, Inc. on behalf of American Cancer Society.
Peipert, John D; Beaumont, Jennifer L; Bode, Rita; Cella, Dave; Garcia, Sofia F; Hahn, Elizabeth A
2014-04-01
To develop and validate a new functional assessment of chronic illness therapy (FACIT) measure of satisfaction with treatment for chronic illnesses such as cancer and HIV/AIDS. To define domains and generate items, a literature review informed creation of semi-structured interview guides for patients and an international expert panel of clinicians and researchers. Patients and experts also rated 15 areas of satisfaction for relevance. The final list of items underwent further refinement by the original expert panel and a new group of clinical experts. Items were tested in four studies (primarily lung cancer) and data were pooled for analysis. Exploratory and confirmatory factor analyses (CFA), and item response theory modeling were conducted to evaluate dimensionality. Internal consistency reliability and test-retest reliability were both evaluated. Validity was evaluated by correlating the FACIT subscale scores and measures of comparable concepts and by testing the scales' ability to distinguish people according to their overall treatment satisfaction. Two instruments were created: the FACIT TS-general (G), an overall evaluation of current treatment, and the FACIT TS-patient satisfaction (PS), a measure of patient satisfaction. CFA results were not optimal for a five-factor solution for PS. Internal consistency reliability met psychometric standards (≥0.70) for all PS subscales. Construct validity was established for the PS subscales: Physician Communication, Treatment Staff Communication, Technical Competence, Confidence and Trust, and Nurse Communication. The two instruments generated here offer a new way to assess several key dimensions of patient satisfaction with treatment, especially for people with lung cancer.
Ishii, Hitoshi; Shimatsu, Akira; Okimura, Yasuhiko; Tanaka, Toshiaki; Hizuka, Naomi; Kaji, Hidesuke; Hanew, Kunihiko; Oki, Yutaka; Yamashiro, Sayuri; Takano, Koji; Chihara, Kazuo
2012-01-01
To develop and validate the Adult Hypopituitarism Questionnaire (AHQ) as a disease-specific, self-administered questionnaire for evaluation of quality of life (QOL) in adult patients with hypopituitarism. We developed and validated this new questionnaire, using a standardized procedure which included item development, pilot-testing and psychometric validation. Of the patients who participated in psychometric validation, those whose clinical conditions were judged to be stable were asked to answer the survey questionnaire twice, in order to assess test-retest reliability. Content validity of the initial questionnaire was evaluated via two pilot tests. After these tests, we made minor revisions and finalized the initial version of the questionnaire. The questionnaire was constructed with two domains, one psycho-social and the other physical. For psychometric assessment, analyses were performed on the responses of 192 adult patients with various types of hypopituitarism. The intraclass correlations of the respective domains were 0.91 and 0.95, and the Cronbach's alpha coefficients were 0.96 and 0.95, indicating adequate test-retest reliability and internal consistency for each domain. For known-group validity, patients with hypopituitarism due to hypothalamic disorder showed significantly lower scores in 11 out of 13 sub-domains compared to those who had hypopituitarism due to pituitary disorder. Regarding construct validity, the domain structure was found to be almost the same as that initially hypothesized. Exploratory factor analysis (n = 228) demonstrated that each domain consisted of six and seven sub-domains. The AHQ showed good reliability and validity for evaluating QOL in adult patients with hypopituitarism.
Development of an instrument to measure self-efficacy in caregivers of people with advanced cancer.
Ugalde, Anna; Krishnasamy, Meinir; Schofield, Penelope
2013-06-01
Informal caregivers of people with advanced cancer experience many negative impacts as a result of their role. There is a lack of suitable measures specifically designed to assess their experience. This study aimed to develop a new measure to assess self-efficacy in caregivers of people with advanced cancer. The development and testing of the new measure consisted of four separate, sequential phases: generation of issues, development of issues into items, pilot testing and field testing. In the generation of issues, 17 caregivers were interviewed to generate data. These data were analysed to generate codes, which were then systematically developed into items to construct the instrument. The instrument was pilot tested with 14 health professionals and five caregivers. It was then administered to a large sample for field testing to establish the psychometric properties, with established measures including the Brief Cope and the Family Appraisals for Caregiving Questionnaire for Palliative Care. Ninety-four caregivers completed the questionnaire booklet to establish the factor structure, reliability and validity. The factor analysis resulted in a 21-item, four-factor instrument, with the subscales being termed Resilience, Self-Maintenance, Emotional Connectivity and Instrumental Caregiving. The test-retest reliability and internal consistency were both excellent, ranging from 0.73 to 0.85 and 0.81 to 0.94, respectively. Six convergent and divergent hypotheses were made, and five were supported. This study has developed a new instrument to assess self-efficacy in caregivers of people with advanced cancer. The result is a four-factor, 21-item instrument with demonstrated reliability and validity. Copyright © 2012 John Wiley & Sons, Ltd.
NASA Astrophysics Data System (ADS)
Campbell, Todd; Abd-Hamid, Nor Hashidah
2013-08-01
This study describes the development of an instrument to investigate the extent to which technology is integrated in science instruction in ways aligned to science reform outlined in standards documents. The instrument was developed by: (a) creating items consistent with the five dimensions identified in science education literature, (b) establishing content validity with both national and international content experts, (c) refining the item pool based on content expert feedback, (d) pilot testing the instrument, (e) checking statistical reliability and item analysis, and (f) subsequently refining and finalizing the instrument. The TUSI was administered in a field test across eleven classrooms by three observers, with a total of 33 TUSI ratings completed. The finalized instrument was found to have acceptable inter-rater intraclass correlation reliability estimates. After the final stage of development, the TUSI instrument consisted of 26 items separated into the original five categories, which aligned with the exploratory factor analysis clustering of the items. Additionally, concurrent validity of the TUSI was established with the Reformed Teaching Observation Protocol. Finally, a subsequent set of 17 different classrooms were observed during the spring of 2011, and for the 9 classrooms where technology integration was observed, an overall Cronbach alpha reliability coefficient of 0.913 was found. Based on the analyses completed, the TUSI appears to be a useful instrument for measuring how technology is integrated into science classrooms and is seen as one mechanism for measuring the intersection of technological, pedagogical, and content knowledge in science classrooms.
Chevat, Catherine; Viala-Danten, Muriel; Dias-Barbosa, Carla; Nguyen, Van Hung
2009-01-01
Background: Influenza is among the most common infectious diseases. The main protection against influenza is vaccination. A self-administered questionnaire was developed and validated for use in clinical trials to assess subjects' perception and acceptance of influenza vaccination and its subsequent injection site reactions (ISR). Methods: The VAPI questionnaire was developed based on interviews with vaccinees. The initial version was administered to subjects in international clinical trials comparing intradermal with intramuscular influenza vaccination. Item reduction and scale construction were carried out using principal component and multitrait analyses (n = 549). Psychometric validation of the final version was conducted per country (n = 5,543) and included construct and clinical validity and internal consistency reliability. All subjects gave their written informed consent before being interviewed or included in the clinical studies. Results: The final questionnaire comprised 4 dimensions ("bother from ISR"; "arm movement"; "sleep"; "acceptability") grouping 16 items, and 5 individual items (anxiety before vaccination; bother from pain during vaccination; satisfaction with injection system; willingness to be vaccinated next year; anxiety about vaccination next year). Construct validity was confirmed for all scales in most of the countries. Internal consistency reliability was good for all versions (Cronbach's alpha ranging from 0.68 to 0.94), as was clinical validity: scores were positively correlated with the severity of ISR and pain. Conclusion: The VAPI questionnaire is a valid and reliable tool, assessing the acceptance of vaccine injection and reactions following vaccination. Trial registration: NCT00258934, NCT00383526, NCT00383539. PMID:19261173
Jäger, B; Schmid-Ott, G; Ernst, G; Dölle-Lange, E; Sack, M
2012-06-01
The aim of this study was to construct and validate a short self-rating questionnaire for the assessment of ego functions and ability of self regulation. An item pool of 120 items covering 6 postulated dimensions was reduced in two steps in independent samples (n = 136 + 470) via factor and item analyses to the final version consisting of 35 items. The 5 resulting questionnaire scales "interpersonal disturbances", "frustration tolerance and impulse control", "identity disturbances", "affect differentiation and affect tolerance" and "self-esteem" were well interpretable and showed in confirmatory factor analysis the best fit to the data (χ²/df = 3.48; RMSEA = 0.73). Total scores were found to differentiate well between diagnostic groups of patients with more or less ego pathology (ANOVA: F = 9.8; df = 11; p < 0.001), thus proving good concurrent validity. Reliability was shown by testing internal consistency and test-retest correlations. The "Hannover self-regulation questionnaire" (HSRQ) evidently is an appropriate and reliable screening instrument for assessing ego functions and capacities of self regulation in an economical and user-friendly way. The scale structure allows differentiated diagnostics of weak vs. stable ego functions and may be used for detailed therapy planning. © Georg Thieme Verlag KG Stuttgart · New York.
Synkinesis assessment in facial palsy: validation of the Dutch Synkinesis Assessment Questionnaire.
Kleiss, Ingrid J; Beurskens, Carien H G; Stalmeier, Peep F M; Ingels, Koen J A O; Marres, Henri A M
2016-06-01
The objective of this study is to validate an existing health-related quality of life questionnaire for patients with synkinesis in facial palsy for implementation in the Dutch language and culture. The Synkinesis Assessment Questionnaire was translated into the Dutch language using a forward-backward translation method. A pilot test with the translated questionnaire was performed in 10 patients with facial palsy and 10 normal subjects. Finally, cross-cultural adaptation was accomplished at our outpatient clinic for facial palsy. Analyses for internal consistency, test-retest reliability, and construct validity were performed. Sixty-six patients completed the Dutch Synkinesis Assessment Questionnaire and the Dutch Facial Disability Index. Cronbach's α, representing internal consistency, was 0.80. Test-retest reliability was 0.53 (Spearman's correlation coefficient, P < 0.01). Correlations with the House-Brackmann score, Sunnybrook score, Facial Disability Index physical function, and social/well-being function were -0.29, 0.20, -0.29, and -0.32, respectively. Correlation with the Sunnybrook synkinesis subscore was 0.50 (Spearman's correlation coefficient). The Dutch Synkinesis Assessment Questionnaire shows good psychometric values and can be implemented in the management of Dutch-speaking patients with facial palsy and synkinesis in the Netherlands. Translation of the instrument into other languages may lead to widespread use, making evaluation and comparison possible among different providers.
Sarma-based key-group method for rock slope reliability analyses
NASA Astrophysics Data System (ADS)
Yarahmadi Bafghi, A. R.; Verdel, T.
2005-08-01
The methods used in conducting static stability analyses have remained pertinent to this day for reasons of both simplicity and speed of execution. The most well-known of these methods for purposes of stability analysis of fractured rock masses is the key-block method (KBM). This paper proposes an extension to the KBM, called the key-group method (KGM), which combines not only individual key-blocks but also groups of collapsible blocks into an iterative and progressive analysis of the stability of discontinuous rock slopes. To take intra-group forces into account, the Sarma method has been implemented within the KGM in order to generate a Sarma-based KGM, abbreviated SKGM. We will discuss herein the hypothesis behind this new method, details regarding its implementation, and validation through comparison with results obtained from the distinct element method. Furthermore, as an alternative to deterministic methods, reliability analyses or probabilistic analyses have been proposed to take account of the uncertainty in analytical parameters and models. The FOSM and ASM probabilistic methods could be implemented within the KGM and SKGM framework in order to take account of the uncertainty due to physical and mechanical data (density, cohesion and angle of friction). We will then show how such reliability analyses can be introduced into SKGM to give rise to the probabilistic SKGM (PSKGM) and how it can be used for rock slope reliability analyses. Copyright
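The first-order second-moment (FOSM) method mentioned above approximates the mean and variance of a performance function from a first-order Taylor expansion about the mean input values, and the reliability index is the ratio of that mean to the standard deviation. The sketch below is a generic illustration of the idea, assuming uncorrelated inputs and central finite-difference derivatives. It is not the paper's SKGM implementation, and the `safety_margin` function and its numbers are hypothetical.

```python
import math


def fosm_reliability_index(g, means, stds, rel_step=1e-6):
    """First-order second-moment reliability index for uncorrelated inputs.

    g:     performance function taking a list of inputs; failure when g < 0
    means: mean value of each input
    stds:  standard deviation of each input
    """
    g_at_mean = g(means)
    var_g = 0.0
    for i, (mu, sigma) in enumerate(zip(means, stds)):
        step = rel_step * max(abs(mu), 1.0)
        up = list(means)
        down = list(means)
        up[i] = mu + step
        down[i] = mu - step
        # Central finite-difference estimate of the partial derivative.
        slope = (g(up) - g(down)) / (2 * step)
        var_g += (slope * sigma) ** 2
    return g_at_mean / math.sqrt(var_g)


def safety_margin(x):
    """Hypothetical margin: resisting force minus driving force."""
    resisting, driving = x
    return resisting - driving


beta = fosm_reliability_index(safety_margin, [10.0, 6.0], [3.0, 4.0])
print(round(beta, 3))
```

In a rock-slope setting the inputs would be quantities such as density, cohesion and friction angle, and `g` would wrap the stability calculation itself. Correlated inputs would require adding covariance terms to the variance sum.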
ERIC Educational Resources Information Center
Lim, Young-Jin
2015-01-01
The aim of this study was to examine the internal consistency reliability, test-retest reliability, factorial structure validity, and convergent validity of a Korean version of the Satisfaction With Life Scale adapted for children (K-SWLS-C). Participants consisted of 653 elementary school students (48% were male). The internal consistency of the…
hEIDI: An Intuitive Application Tool To Organize and Treat Large-Scale Proteomics Data.
Hesse, Anne-Marie; Dupierris, Véronique; Adam, Claire; Court, Magali; Barthe, Damien; Emadali, Anouk; Masselon, Christophe; Ferro, Myriam; Bruley, Christophe
2016-10-07
Advances in high-throughput proteomics have led to a rapid increase in the number, size, and complexity of the associated data sets. Managing and extracting reliable information from such large series of data sets require the use of dedicated software organized in a consistent pipeline to reduce, validate, exploit, and ultimately export data. The compilation of multiple mass-spectrometry-based identification and quantification results obtained in the context of a large-scale project represents a real challenge for developers of bioinformatics solutions. In response to this challenge, we developed a dedicated software suite called hEIDI to manage and combine both identifications and semiquantitative data related to multiple LC-MS/MS analyses. This paper describes how, through a user-friendly interface, hEIDI can be used to compile analyses and retrieve lists of nonredundant protein groups. Moreover, hEIDI allows direct comparison of series of analyses, on the basis of protein groups, while ensuring consistent protein inference and also computing spectral counts. hEIDI ensures that validated results are compliant with MIAPE guidelines as all information related to samples and results is stored in appropriate databases. Thanks to the database structure, validated results generated within hEIDI can be easily exported in the PRIDE XML format for subsequent publication. hEIDI can be downloaded from http://biodev.extra.cea.fr/docs/heidi.
Atwal, Anita; McIntyre, Anne
2017-01-01
Introduction: High quality guidance in home strategies is needed to enable older people to measure their home environment and become involved in the provision of assistive devices and to promote consistency among professionals. This study aims to investigate the reliability of such guidance and its ability to promote accuracy of results when measurements are taken by both older people and professionals. Method: Twenty-five health professionals and 26 older people participated in a within-group design to test the accuracy of measurements taken (that is, person’s popliteal height, baths, toilets, beds, stairs and chairs). Data were analysed with descriptive analysis and the Wilcoxon test. The intra-rater reliability was assessed by correlating measurements taken at two different times with guidance use. Results: The intra-rater reliability analysis revealed statistical significance (P < 0.05) for all measurements except for the bath internal width. The guidance enabled participants to take 90% of measurements that they were not able to complete otherwise, 80.55% of which lay within the acceptable suggested margin of variation. Accuracy was supported by the significant reduction in the standard deviation of the actual measurements and accuracy scores. Conclusion: This evidence-based guidance can be used in its current format by older people and professionals to facilitate appropriate measurements. Yet, some users might need help from carers or specialists depending on their impairments. PMID:29386701
Spiliotopoulou, Georgia; Atwal, Anita; McIntyre, Anne
2018-01-01
High quality guidance in home strategies is needed to enable older people to measure their home environment and become involved in the provision of assistive devices and to promote consistency among professionals. This study aims to investigate the reliability of such guidance and its ability to promote accuracy of results when measurements are taken by both older people and professionals. Twenty-five health professionals and 26 older people participated in a within-group design to test the accuracy of measurements taken (that is, person's popliteal height, baths, toilets, beds, stairs and chairs). Data were analysed with descriptive analysis and the Wilcoxon test. The intra-rater reliability was assessed by correlating measurements taken at two different times with guidance use. The intra-rater reliability analysis revealed statistical significance (P < 0.05) for all measurements except for the bath internal width. The guidance enabled participants to take 90% of measurements that they were not able to complete otherwise, 80.55% of which lay within the acceptable suggested margin of variation. Accuracy was supported by the significant reduction in the standard deviation of the actual measurements and accuracy scores. This evidence-based guidance can be used in its current format by older people and professionals to facilitate appropriate measurements. Yet, some users might need help from carers or specialists depending on their impairments.
Martín-Nieto, A; Atín-Arratibel, M Á; Bravo-Llatas, C; Moreno-Bermejo, M I; Martín-Casas, P
2018-06-08
The aim of this study was to develop and validate a Spanish-language version of the Scale for Contraversive Pushing, used to diagnose and measure pusher behaviour in stroke patients. Translation-back translation was used to create the Spanish-language Scale for Contraversive Pushing; we subsequently evaluated its validity and reliability by administering it to a sample of patients. We also analysed its sensitivity to change in patients identified as pushers who received neurological physiotherapy. Experts indicated that the content of the scale was valid. Internal consistency was very good (Cronbach's alpha of 0.94). The intraclass correlation coefficient showed high intra- and interobserver reliability (0.999 and 0.994, respectively). The Kappa and weighted Kappa coefficients were used to measure the reliability of each item; the majority obtained values above 0.9. Lastly, the differences between baseline and final evaluations of pushers were significant (paired sample t test), showing that the scale is sensitive to changes obtained through physical therapy. The Spanish-language version of the Scale for Contraversive Pushing is valid and reliable for measuring pusher behaviour in stroke patients. In addition, it is able to evaluate the ongoing changes in patients who have received physical therapy. Copyright © 2018 Sociedad Española de Neurología. Publicado por Elsevier España, S.L.U. All rights reserved.
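Several of the records in this listing report internal consistency as Cronbach's alpha without stating the formula. As a minimal sketch, assuming the usual definition (alpha = k/(k-1) × (1 − sum of item variances / variance of total scores)) and using invented scores rather than data from any study listed here, the coefficient can be computed as follows. The function name `cronbach_alpha` is hypothetical.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    Uses sample variances (n - 1 denominator) throughout.
    """
    n_respondents = len(scores)
    k = len(scores[0])
    if n_respondents < 2 or k < 2:
        raise ValueError("need at least 2 respondents and 2 items")
    # Variance of each item across respondents.
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    # Variance of each respondent's total score.
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Invented example: 5 respondents answering 3 items on a 0-3 scale.
example = [
    [0, 1, 0],
    [1, 1, 2],
    [2, 2, 2],
    [3, 2, 3],
    [3, 3, 3],
]
alpha = cronbach_alpha(example)
```

Alpha approaches 1 when items rise and fall together across respondents, which is why very high values (such as the 0.94 reported above) are sometimes read as a sign of item redundancy as well as consistency.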
INOUE, Akiomi; KAWAKAMI, Norito; SHIMOMITSU, Teruichi; TSUTSUMI, Akizumi; HARATANI, Takashi; YOSHIKAWA, Toru; SHIMAZU, Akihito; ODAGIRI, Yuko
2014-01-01
This study aimed to investigate the reliability and construct validity of a new version of the Brief Job Stress Questionnaire (New BJSQ), which measures an extended set of psychosocial factors at work by adding new scales/items to the current version of the BJSQ. Additional scales/items were extensively collected from theoretical job stress models and similar questionnaires in several countries. Scales/items were field-tested and refined through a pilot internet survey. Finally, an 84-item questionnaire (141 items in total when combined with the current BJSQ) was developed. A nationally representative survey was administered to employees in Japan (n=1,633) to examine the reliability and construct validity. Most scales showed acceptable levels of internal consistency and test-retest reliability. Principal component analyses showed that the first factor explained 50% or greater proportion of the variance in most scales. A scale factor analysis and a correlation analysis showed that these scales fit the theoretical expectations. These findings provided a piece of evidence that the New BJSQ scales are reliable and valid. Although more detailed content and construct validity should be examined in future study, the New BJSQ is a useful instrument to evaluate psychosocial work environment and positive mental health outcomes in the current workplace. PMID:24492763
Development of a direct observation Measure of Environmental Qualities of Activity Settings.
King, Gillian; Rigby, Patty; Batorowicz, Beata; McMain-Klein, Margot; Petrenchik, Theresa; Thompson, Laura; Gibson, Michelle
2014-08-01
The aim of this study was to develop an observer-rated measure of aesthetic, physical, social, and opportunity-related qualities of leisure activity settings for young people (with or without disabilities). Eighty questionnaires were completed by sets of raters who independently rated 22 community/home activity settings. The scales of the 32-item Measure of Environmental Qualities of Activity Settings (MEQAS; Opportunities for Social Activities, Opportunities for Physical Activities, Pleasant Physical Environment, Opportunities for Choice, Opportunities for Personal Growth, and Opportunities to Interact with Adults) were determined using principal components analyses. Test-retest reliability was determined for eight activity settings, rated twice (4-6wk interval) by a trained rater. The factor structure accounted for 80% of the variance. The Kaiser-Meyer-Olkin Measure of Sampling Adequacy was 0.73. Cronbach's alphas for the scales ranged from 0.76 to 0.96, and interrater reliabilities (ICCs) ranged from 0.60 to 0.93. Test-retest reliabilities ranged from 0.70 to 0.90. Results suggest that the MEQAS has a sound factor structure and preliminary evidence of internal consistency, interrater, and test-retest reliability. The MEQAS is the first observer-completed measure of environmental qualities of activity settings. The MEQAS allows researchers to assess comprehensively qualities and affordances of activity settings, and can be used to design and assess environmental qualities of programs for young people. © 2014 Mac Keith Press.
Stability of physical assessment of older drivers over 1 year.
Smith, Andrew; Marshall, Shawn; Porter, Michelle; Ha, Linda; Bédard, Michel; Gélinas, Isabelle; Man-Son-Hing, Malcolm; Mazer, Barbara; Rapoport, Mark; Tuokko, Holly; Vrkljan, Brenda
2013-12-01
Older adults represent the fastest-growing population of drivers with a valid driver's licence. Also common in this age group are multiple chronic medical conditions that may have an effect on physical function and driving ability. Determining the reliability of physical measures used to assess older drivers' functional ability is important to identifying those who are safe to continue driving. Most previous reliability studies of clinical physical measures of health used test-retest intervals shorter than those between patient visits with a clinician. In the present study we examined a more clinically representative interval of 1 year to determine the stability of commonly used physical measures collected during the Candrive II prospective cohort study of older drivers. Reliability statistics indicate that the sequential finger-thumb opposition, rapid pace walk and the Pelli-Robson contrast sensitivity tests have adequate stability over 1 year. Poor stability was observed for the one-legged stance and Snellen visual acuity test. Several assessments with nominal data (Marottoli method [functional neck range of motion], whispered voice test, range of motion and strength testing) lacked sufficient variability to conduct reliability analyses; however, a lack of variability between test days suggests consistency over a 1-year time frame. Our results provide evidence that specific physical measures are stable in monitoring functional ability over the course of a year. Copyright © 2013 Elsevier Ltd. All rights reserved.
Llerena, Katiah; Wynn, Jonathan K; Hajcak, Greg; Green, Michael F; Horan, William P
2016-07-01
Accurately monitoring one's performance on daily life tasks, and integrating internal and external performance feedback are necessary for guiding productive behavior. Although internal feedback processing, as indexed by the error-related negativity (ERN), is consistently impaired in schizophrenia, initial findings suggest that external performance feedback processing, as indexed by the feedback negativity (FN), may actually be intact. The current study evaluated internal and external feedback processing task performance and test-retest reliability in schizophrenia. 92 schizophrenia outpatients and 63 healthy controls completed a flanker task (ERN) and a time estimation task (FN). Analyses examined the ΔERN and ΔFN defined as difference waves between correct/positive versus error/negative feedback conditions. A temporal principal component analysis was conducted to distinguish the ΔERN and ΔFN from overlapping neural responses. We also assessed test-retest reliability of ΔERN and ΔFN in patients over a 4-week interval. Patients showed reduced ΔERN accompanied by intact ΔFN. In patients, test-retest reliability for both ΔERN and ΔFN over a four-week period was fair to good. Individuals with schizophrenia show a pattern of impaired internal, but intact external, feedback processing. This pattern has implications for understanding the nature and neural correlates of impaired feedback processing in schizophrenia. Published by Elsevier B.V.
The Hip Sports Activity Scale (HSAS) for patients with femoroacetabular impingement.
Naal, Florian D; Miozzari, Hermes H; Kelly, Bryan T; Magennis, Erin M; Leunig, Michael; Noetzli, Hubert P
2013-01-01
To develop and validate a sports activity scale for patients with a diagnosis of femoroacetabular impingement (FAI). A nine level Hip Sports Activity Scale (HSAS) was constructed both in German and English languages. Fifty-nine consecutive patients undergoing surgical treatment for FAI at two centers in Switzerland and in the US completed a questionnaire set consisting of the HSAS, the University of California at Los Angeles (UCLA) activity scale and different hip joint-specific and generic outcome tools. For reliability assessment, the HSAS was completed twice about nine days apart. Evidence of reliability, validity and responsiveness was investigated by classical psychometric analyses. Reliability was excellent for both the German and the English versions with intraclass correlation coefficients of 0.94 and 0.96, respectively. Evidence of convergent validity was supported by moderate to high correlations with the UCLA activity scale and with the joint-specific measures used. Evidence of divergent validity was supported by low correlations with the SF-12 Mental Component Scale and the WOMAC stiffness subscale. The standardised response mean was 0.69. The HSAS is a reliable and valid tool to determine sports levels in patients suffering from FAI. Its use in future studies investigating outcomes in young patients with hip disease can be recommended. Level III, Diagnostic Studies - An independent, masked comparison with an appropriate population of patients, but reference standard not applied to all study patients.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Grabaskas, David; Brunett, Acacia J.; Passerini, Stefano
GE Hitachi Nuclear Energy (GEH) and Argonne National Laboratory (Argonne) participated in a two year collaboration to modernize and update the probabilistic risk assessment (PRA) for the PRISM sodium fast reactor. At a high level, the primary outcome of the project was the development of a next-generation PRA that is intended to enable risk-informed prioritization of safety- and reliability-focused research and development. A central Argonne task during this project was a reliability assessment of passive safety systems, which included the Reactor Vessel Auxiliary Cooling System (RVACS) and the inherent reactivity feedbacks of the metal fuel core. Both systems were examined utilizing a methodology derived from the Reliability Method for Passive Safety Functions (RMPS), with an emphasis on developing success criteria based on mechanistic system modeling while also maintaining consistency with the Fuel Damage Categories (FDCs) of the mechanistic source term assessment. This paper provides an overview of the reliability analyses of both systems, including highlights of the FMEAs, the construction of best-estimate models, uncertain parameter screening and propagation, and the quantification of system failure probability. In particular, special focus is given to the methodologies to perform the analysis of uncertainty propagation and the determination of the likelihood of violating FDC limits. Additionally, important lessons learned are also reviewed, such as optimal sampling methodologies for the discovery of low likelihood failure events and strategies for the combined treatment of aleatory and epistemic uncertainties.
Hunger, Matthias; Sabariego, Carla; Stollenwerk, Björn; Cieza, Alarcos; Leidl, Reiner
2012-09-01
To analyse the psychometric properties of the EQ-5D in German stroke survivors undergoing neurological rehabilitation. The EQ-5D, the Hospital Anxiety and Depression Scale (HADS) and the Stroke Impact Scale (SIS) were completed before (210 subjects) and after (183 subjects) a patient education programme in seven rehabilitation clinics in Bavaria, Germany. A postal follow-up was conducted after 6 months. Acceptance, validity, reliability and responsiveness of the EQ-5D were tested. The SIS subscales were used as external anchors to classify the patients into change groups between the measurements. The proportion of missing answers ranged from 4.7 to 8.6%. Between 16 and 19% reported no problems in any EQ-5D dimension. At baseline, correlations between EQ-5D index and the SIS subscales ranged from 0.15 (communication) to 0.60 (mobility). Correlations with the EQ VAS were slightly smaller. All scores were reliable in test-retest with intraclass correlations ranging from 0.67 to 0.81. EQ-5D index and EQ VAS were consistently responsive only to improvements in health, showing small- to medium effect sizes (0.27-0.42). The EQ-5D has shown reasonable validity, reliability and, more limited, responsiveness in stroke patients with mild to moderate limitations of functional status, allowing it to be used in clinical trials in rehabilitation.
Inoue, Akiomi; Kawakami, Norito; Shimomitsu, Teruichi; Tsutsumi, Akizumi; Haratani, Takashi; Yoshikawa, Toru; Shimazu, Akihito; Odagiri, Yuko
2014-01-01
This study aimed to investigate the reliability and construct validity of a new version of the Brief Job Stress Questionnaire (New BJSQ), which measures an extended set of psychosocial factors at work by adding new scales/items to the current version of the BJSQ. Additional scales/items were extensively collected from theoretical job stress models and similar questionnaires in several countries. Scales/items were field-tested and refined through a pilot internet survey. Finally, an 84-item questionnaire (141 items in total when combined with the current BJSQ) was developed. A nationally representative survey was administered to employees in Japan (n=1,633) to examine the reliability and construct validity. Most scales showed acceptable levels of internal consistency and test-retest reliability. Principal component analyses showed that the first factor explained 50% or greater proportion of the variance in most scales. A scale factor analysis and a correlation analysis showed that these scales fit the theoretical expectations. These findings provided a piece of evidence that the New BJSQ scales are reliable and valid. Although more detailed content and construct validity should be examined in future study, the New BJSQ is a useful instrument to evaluate psychosocial work environment and positive mental health outcomes in the current workplace.
Universal first-order reliability concept applied to semistatic structures
NASA Technical Reports Server (NTRS)
Verderaime, V.
1994-01-01
A reliability design concept was developed for semistatic structures which combines the prevailing deterministic method with the first-order reliability method. The proposed method surmounts deterministic deficiencies in providing uniformly reliable structures and improved safety audits. It supports risk analyses and reliability selection criterion. The method provides a reliability design factor derived from the reliability criterion which is analogous to the current safety factor for sizing structures and verifying reliability response. The universal first-order reliability method should also be applicable for air and surface vehicles semistatic structures.
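The abstract above does not give the formulation it uses. As a rough illustration only, the textbook first-order second-moment case, assuming independent and normally distributed resistance and applied stress, defines a reliability (safety) index beta and maps it to a reliability through the standard normal CDF. The function names and the numbers below are hypothetical and are not taken from the report.

```python
from math import erf, sqrt


def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + erf(z / sqrt(2.0)))


def reliability_index(mean_resistance, sd_resistance, mean_stress, sd_stress):
    """Safety index beta for independent normal resistance and stress."""
    return (mean_resistance - mean_stress) / sqrt(sd_resistance**2 + sd_stress**2)


def reliability(mean_resistance, sd_resistance, mean_stress, sd_stress):
    """Probability that resistance exceeds stress under the same assumptions."""
    beta = reliability_index(mean_resistance, sd_resistance, mean_stress, sd_stress)
    return normal_cdf(beta)


# Invented numbers (arbitrary consistent units): strength 60 +/- 4, stress 40 +/- 3.
beta = reliability_index(60.0, 4.0, 40.0, 3.0)
r = reliability(60.0, 4.0, 40.0, 3.0)
```

In this simplified picture a deterministic safety factor fixes only the ratio of the means, whereas beta also depends on the scatter of both quantities, which is one way to see why equal safety factors need not give equal reliability.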
Universal first-order reliability concept applied to semistatic structures
NASA Astrophysics Data System (ADS)
Verderaime, V.
1994-07-01
A reliability design concept was developed for semistatic structures which combines the prevailing deterministic method with the first-order reliability method. The proposed method surmounts deterministic deficiencies in providing uniformly reliable structures and improved safety audits. It supports risk analyses and reliability selection criterion. The method provides a reliability design factor derived from the reliability criterion which is analogous to the current safety factor for sizing structures and verifying reliability response. The universal first-order reliability method should also be applicable for air and surface vehicles semistatic structures.
Psychometric properties of a four-component Norwegian Organizational Justice Scale.
Olsen, Olav Kjellevold; Myrseth, Helga; Eidhamar, Are; Hystad, Sigurd W
2012-04-01
Organizational justice has attracted attention as a predictor of employees' mental and physical health as well as commitment and work outcomes. The lack of a Norwegian translation of an organizational justice scale has precluded its use in Norway. Four dimensions of the organizational justice construct were examined in a Norwegian military context, including facet measures of distributional, interpersonal, and informational justice developed by Colquitt in 2001, in addition to procedural justice developed by Moorman in 1991. Confirmatory factor analyses supported a four-dimensional structure with good internal consistency. Follow-up analyses have suggested that the four dimensions were nested beneath a general, latent organizational justice factor. A positive relationship between organizational justice and self-sacrificial behavior was found, indicating satisfactory construct validity. The results demonstrate that the Norwegian Organizational Justice Scale is a reliable and construct-valid measure of organizational justice in a Norwegian setting.
Adapting the academic motivation scale for use in pre-tertiary mathematics classrooms
NASA Astrophysics Data System (ADS)
Lim, Siew Yee; Chapman, Elaine
2015-09-01
The Academic Motivation Scale (AMS) is a comprehensive and widely used instrument for assessing motivation based on the self-determination theory. Currently, no such comprehensive instrument exists to assess the different domains of motivation (stipulated by the self-determination theory) in mathematics education at the pre-tertiary level (grades 11 and 12) in Asia. This study adapted the AMS for this use and assessed the properties of the adapted instrument with 1610 students from Singapore. Exploratory and confirmatory factor analyses indicated a five-factor structure for the modified instrument (the three original AMS intrinsic subscales collapsed into a single factor). Additionally, the modified instrument exhibited good internal consistency (mean α = .88), and satisfactory test-retest reliability over a 1-month interval (mean r_xx = .73). The validity of the modified AMS was further demonstrated through correlational analyses among scores on its subscales, and with scores on other instruments measuring mathematics attitudes, anxiety and achievement.
Stop Using the Modified Work APGAR to Measure Job Satisfaction
Mielenz, Thelma J.; DeVellis, Robert F.; Battie, Michele C.; Carey, Timothy S.
2011-01-01
Background. The psychometric properties of the Modified Work APGAR (MWA) scale are not established, yet researchers use this scale as an overall measure of job satisfaction. Objective. Perform psychometric analyses on the MWA scale using data from two populations. Methods. Two populations with low back pain were studied: a landmark occupational cohort and a clinical cohort. The first five items of the MWA scale measure social support from coworkers, one item measures dissatisfaction with job tasks, and the sixth item measures lack of social support from a supervisor. Exploratory principal components analyses were conducted in both cohorts. Results. In both cohorts, the first five items of the MWA scale loaded consistently onto one factor, social support from coworkers subscale. Conclusions. Unless researchers are interested in measuring social support from coworkers only, future studies should use other reliable and valid instruments to measure a broad range of psychosocial work characteristics. PMID:22191021
Gebremariam, Mekdes K; Lien, Nanna; Torheim, Liv Elin; Andersen, Lene F; Melbye, Elisabeth L; Glavin, Kari; Hausken, Solveig E S; Sleddens, Ester F C; Bjelland, Mona
2016-08-17
The existence of socioeconomic differences in dietary behaviors is well documented. However, studies exploring the mechanisms behind these differences among adolescents using comprehensive and reliable measures of mediators are lacking. The aims of this study were (a) to assess the psychometric properties of new scales assessing the perceived rules and accessibility related to the consumption of vegetables and soft drinks and (b) to explore their mediating role in the association between parental education and the corresponding dietary behaviors. A cross-sectional survey including 440 adolescents from three counties in Norway (mean age 14.3 years (SD = 0.6)) was conducted using a web-based questionnaire. Principal component analysis, test-retest and internal reliability analysis were conducted. The mediating role of perceived accessibility and perceived rules in the association between parental education and the dietary behaviors was explored using linear regression analyses. Factor analyses confirmed two separate subscales, named "accessibility" and "rules", both for vegetables and soft drinks (factor loadings >0.60). The scales had good internal consistency reliability (0.70-0.87). The test-retest reliability of the scales was moderate to good (0.44-0.62). Parental education was inversely related to the consumption of soft drinks and positively related to the consumption of vegetables. Perceived accessibility and perceived rules related to soft drink consumption were found to mediate the association between parental education and soft drink consumption (47.5 and 8.5 % of total effect mediated). Accessibility of vegetables was found to mediate the association between parental education and the consumption of vegetables (51 % of total effect mediated). The new scales developed in this study are comprehensive and have adequate validity and reliability; they are therefore considered appropriate for use among 13-15 year-olds. 
Parents, in particular those with a low educational background, should be encouraged to increase the accessibility of vegetables and to decrease the accessibility of soft drinks, in particular during dinner. Enforcing parental rules limiting soft drink intake in families with low parental education also appears relevant.
Reliability, Validity, and Responsiveness of the QuickDASH in Patients With Upper Limb Amputation.
Resnik, Linda; Borgia, Matthew
2015-09-01
To examine the internal consistency, test-retest reliability, validity, and responsiveness of the shortened version of the Disabilities of the Arm, Shoulder and Hand (QuickDASH) questionnaire in persons with upper limb amputation. Cross-sectional and longitudinal. Three sites participating in the U.S. Department of Veterans Affairs Home Study of the DEKA Arm. A convenience sample of upper limb amputees (N=44). Training with a multifunction upper limb prosthesis. Multiple outcome measures including the QuickDASH were administered twice within 1 week, and for a subset of 20 persons, after completion of in-laboratory training with the DEKA Arm. Scale alphas and intraclass correlation coefficient type 3,1 (ICC3,1) were used to examine reliability. Minimum detectable change (MDC) scores were calculated. Analyses of variance, comparing QuickDASH scores by the amount of prosthetic use and amputation level, were used for known-group validity analyses with alpha set at .05. Pairwise correlations between QuickDASH and other measures were used to examine concurrent validity. Responsiveness was measured by effect size (ES) and standardized response mean (SRM). QuickDASH alpha was .83, and ICC was .87 (95% confidence interval, .77-.93). MDC at the 95% confidence level (MDC95%) was 17.4. Full- or part-time prosthesis users had better QuickDASH scores compared with nonprosthesis users (P=.021), as did those with more distal amputations at both baseline (P=.042) and with the DEKA Arm (P=.024). The QuickDASH was correlated with concurrent measures of activity limitation as expected. The ES and SRM after training with the DEKA Arm were 0.6. This study provides evidence of reliability and validity of the QuickDASH in persons with upper limb amputation. Results provide preliminary evidence of responsiveness to prosthetic device type/training. Further research with a larger sample is needed to confirm results. Copyright © 2015 American Congress of Rehabilitation Medicine. 
Published by Elsevier Inc. All rights reserved.
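The QuickDASH abstract above reports ICC(3,1) and MDC95 without the underlying formulas. The sketch below assumes the standard Shrout and Fleiss two-way consistency form for ICC(3,1) and the commonly used relations SEM = SD × sqrt(1 − ICC) and MDC95 = 1.96 × sqrt(2) × SEM; whether the authors used exactly these forms is an assumption. Function names and the test-retest scores are invented for illustration.

```python
from math import sqrt
from statistics import mean


def icc_3_1(data):
    """ICC(3,1): two-way mixed effects, consistency, single measurement.

    data: list of n subjects, each a list of k scores (one per session or rater).
    """
    n, k = len(data), len(data[0])
    grand = mean([x for row in data for x in row])
    row_means = [mean(row) for row in data]
    col_means = [mean([row[j] for row in data]) for j in range(k)]
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)    # between subjects
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)    # between sessions
    ss_total = sum((x - grand) ** 2 for row in data for x in row)
    ss_error = ss_total - ss_rows - ss_cols                   # residual
    ms_rows = ss_rows / (n - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (ms_rows + (k - 1) * ms_error)


def sem(sd, icc):
    """Standard error of measurement from a score SD and a reliability coefficient."""
    return sd * sqrt(1.0 - icc)


def mdc95(sd, icc):
    """Minimum detectable change at the 95% confidence level."""
    return 1.96 * sqrt(2.0) * sem(sd, icc)


# Invented test-retest scores for 5 people (session 1, session 2).
scores = [[20, 24], [35, 33], [50, 55], [62, 60], [80, 78]]
icc = icc_3_1(scores)
```

Because ICC(3,1) measures consistency rather than absolute agreement, a constant shift between sessions does not lower it; a change smaller than the MDC95 cannot be distinguished from measurement error at the 95% level.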
Allen Gomes, Ana; Ruivo Marques, Daniel; Meia-Via, Ana Maria; Meia-Via, Mariana; Tavares, José; Fernandes da Silva, Carlos; Pinto de Azevedo, Maria Helena
2015-04-01
Based on successive samples totaling more than 5000 higher education students, we scrutinized the reliability, structure, initial validity and normative scores of a brief self-report seven-item scale to screen for the continuum of nighttime insomnia complaints/perceived sleep quality, used by our team for more than a decade, henceforth labeled the Basic Scale on Insomnia complaints and Quality of Sleep (BaSIQS). In study/sample 1 (n = 1654), the items were developed based on part of a larger survey on higher education sleep-wake patterns. The test-retest study was conducted in an independent small group (n = 33) with a 2-8 week gap. In study/sample 2 (n = 360), focused mainly on validity, the BaSIQS was completed together with the Pittsburgh Sleep Quality Index (PSQI). In study 3, a large recent sample of students from universities all over the country (n = 2995) answered the BaSIQS items, based on which normative scores were determined, and an additional question on perceived sleep problems in order to further analyze the scale's validity. Regarding reliability, Cronbach alpha coefficients were systematically higher than 0.7, and the test-retest correlation coefficient was greater than 0.8. Structure analyses revealed consistently satisfactory two-factor and single-factor solutions. Concerning validity analyses, BaSIQS scores were significantly correlated with PSQI component scores and overall score (r = 0.652 corresponding to a large association); mean scores were significantly higher in those students classifying themselves as having sleep problems (p < 0.0001, d = 0.99 corresponding to a large effect size). In conclusion, the BaSIQS is very easy to administer, and appears to be a reliable and valid scale in higher education students. It might be a convenient short tool in research and applied settings to rapidly assess sleep quality or screen for insomnia complaints, and it may be easily used in other populations with minor adaptations.
How Many Sleep Diary Entries Are Needed to Reliably Estimate Adolescent Sleep?
Arora, Teresa; Gradisar, Michael; Taheri, Shahrad; Carskadon, Mary A.
2017-01-01
Study Objectives: To investigate (1) how many nights of sleep diary entries are required for reliable estimates of five sleep-related outcomes (bedtime, wake time, sleep onset latency [SOL], sleep duration, and wake after sleep onset [WASO]) and (2) the test–retest reliability of sleep diary estimates of school night sleep across 12 weeks. Methods: Data were drawn from four adolescent samples (Australia [n = 385], Qatar [n = 245], United Kingdom [n = 770], and United States [n = 366]), who provided 1766 eligible sleep diary weeks for reliability analyses. We performed reliability analyses for each cohort using complete data (7 days), one to five school nights, and one to two weekend nights. We also performed test–retest reliability analyses on 12-week sleep diary data available from a subgroup of 55 US adolescents. Results: Intraclass correlation coefficients for bedtime, SOL, and sleep duration indicated good-to-excellent reliability from five weekday nights of sleep diary entries across all adolescent cohorts. Four school nights was sufficient for wake times in the Australian and UK samples, but not the US or Qatari samples. Only Australian adolescents showed good reliability for two weekend nights of bedtime reports; estimates of SOL were adequate for UK adolescents based on two weekend nights. WASO was not reliably estimated using 1 week of sleep diaries. We observed excellent test–retest reliability across 12 weeks of sleep diary data in a subsample of US adolescents. Conclusion: We recommend at least five weekday nights of sleep diary entries to be made when studying adolescent bedtimes, SOL, and sleep duration. Adolescent sleep patterns were stable across 12 consecutive school weeks. PMID:28199718
How Many Sleep Diary Entries Are Needed to Reliably Estimate Adolescent Sleep?
Short, Michelle A; Arora, Teresa; Gradisar, Michael; Taheri, Shahrad; Carskadon, Mary A
2017-03-01
To investigate (1) how many nights of sleep diary entries are required for reliable estimates of five sleep-related outcomes (bedtime, wake time, sleep onset latency [SOL], sleep duration, and wake after sleep onset [WASO]) and (2) the test-retest reliability of sleep diary estimates of school night sleep across 12 weeks. Data were drawn from four adolescent samples (Australia [n = 385], Qatar [n = 245], United Kingdom [n = 770], and United States [n = 366]), who provided 1766 eligible sleep diary weeks for reliability analyses. We performed reliability analyses for each cohort using complete data (7 days), one to five school nights, and one to two weekend nights. We also performed test-retest reliability analyses on 12-week sleep diary data available from a subgroup of 55 US adolescents. Intraclass correlation coefficients for bedtime, SOL, and sleep duration indicated good-to-excellent reliability from five weekday nights of sleep diary entries across all adolescent cohorts. Four school nights was sufficient for wake times in the Australian and UK samples, but not the US or Qatari samples. Only Australian adolescents showed good reliability for two weekend nights of bedtime reports; estimates of SOL were adequate for UK adolescents based on two weekend nights. WASO was not reliably estimated using 1 week of sleep diaries. We observed excellent test-retest reliability across 12 weeks of sleep diary data in a subsample of US adolescents. We recommend at least five weekday nights of sleep diary entries to be made when studying adolescent bedtimes, SOL, and sleep duration. Adolescent sleep patterns were stable across 12 consecutive school weeks. © Sleep Research Society 2017. Published by Oxford University Press on behalf of the Sleep Research Society. All rights reserved. For permissions, please e-mail journals.permissions@oup.com.
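The abstract above does not say how the required number of nights was derived. One common way to reason about such questions, offered here only as an assumption and not as the authors' procedure, is the Spearman-Brown formula, which projects the reliability of a k-night average from a single-night reliability. The function names and the input values below are hypothetical.

```python
def spearman_brown(single_night_reliability, nights):
    """Projected reliability of the mean of `nights` parallel measurements."""
    r = single_night_reliability
    return nights * r / (1.0 + (nights - 1) * r)


def nights_needed(single_night_reliability, target, max_nights=365):
    """Smallest number of nights whose averaged reliability reaches `target`."""
    if not 0.0 < single_night_reliability < 1.0 or not 0.0 < target < 1.0:
        raise ValueError("reliabilities must lie strictly between 0 and 1")
    for nights in range(1, max_nights + 1):
        if spearman_brown(single_night_reliability, nights) >= target:
            return nights
    raise ValueError("target not reached within max_nights")


# Invented single-night reliability of 0.45 and a target of 0.80.
required = nights_needed(0.45, 0.80)
```

The formula assumes that each night is an equally good, independent measurement of the same underlying habit, which may not hold when school nights and weekend nights differ systematically.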
Difficult Decisions Made Easier
NASA Technical Reports Server (NTRS)
2006-01-01
NASA missions are extremely complex and prone to sudden, catastrophic failure if equipment falters or if an unforeseen event occurs. For these reasons, NASA trains to expect the unexpected. It tests its equipment and systems in extreme conditions, and it develops risk-analysis tests to foresee any possible problems. The Space Agency recently worked with an industry partner to develop reliability analysis software capable of modeling complex, highly dynamic systems, taking into account variations in input parameters and the evolution of the system over the course of a mission. The goal of this research was multifold. It included performance and risk analyses of complex, multiphase missions, like the insertion of the Mars Reconnaissance Orbiter; reliability analyses of systems with redundant and/or repairable components; optimization analyses of system configurations with respect to cost and reliability; and sensitivity analyses to identify optimal areas for uncertainty reduction or performance enhancement.
Finding online health-related information: usability issues of health portals.
Gurel Koybasi, Nergis A; Cagiltay, Kursat
2012-01-01
As the Internet and computers become widespread, health portals offering online health-related information become more popular. The most important point for health portals is presenting reliable and valid information. In addition, a portal needs to be usable in order to serve information to users effectively. This study aims to determine the usability issues that emerge when health-related information is searched for on a health portal. User-based usability tests were conducted, and eye movement analyses were used in addition to traditional performance measures. Results revealed that users prefer systematic, simple and consistent designs offering interactive tools. Moreover, content and partitions need to be shaped according to the medical knowledge of target users.
ERIC Educational Resources Information Center
Yelboga, Atilla; Tavsancil, Ezel
2010-01-01
In this research, the classical test theory and generalizability theory analyses were carried out with the data obtained by a job performance scale for the years 2005 and 2006. The reliability coefficients obtained (estimated) from the classical test theory and generalizability theory analyses were compared. In classical test theory, test retest…
Reliability and Factor Analyses of a Teacher Efficacy Scale for Nigerian Secondary School Teachers
ERIC Educational Resources Information Center
Faleye, Bamidele Abiodun
2008-01-01
Introduction: The suitability of 52 items for measuring Teacher Efficacy was investigated with the aim of developing and validating a Teacher Efficacy Scale (TES) for Nigerian secondary school teachers. Method: The TES was administered on 2400 teachers (mean age = 36.75 years). Data were subjected to factor and reliability analyses. Results:…
Reliability Problems of the Datum: Solutions for Questionnaire Responses.
ERIC Educational Resources Information Center
Bastick, Tony
Questionnaires often ask for estimates, and these estimates are given with different reliabilities. It is difficult to know the different reliabilities of single estimates and to take these into account in subsequent analyses. This paper contains a practical example to show that not taking the reliability of different responses into account can…
Validation of the Center for Epidemiological Studies Depression Scale among Korean Adolescents.
Heo, Eun-Hye; Choi, Kyeong-Sook; Yu, Je-Chun; Nam, Ji-Ae
2018-02-01
The Center for Epidemiological Studies Depression Scale (CES-D) is designed to measure the current level of depressive symptomatology in the general population. However, no review has examined whether the scale is reliable and valid among children and adolescents in Korea. The purpose of this study was to test whether the Korean form of the CES-D is valid in adolescents. Data were obtained from 1,884 adolescents attending grades 1-3 in Korean middle schools. Reliability was evaluated by internal consistency (Cronbach's alpha). Concurrent validity was evaluated by a correlation analysis between the CES-D and other scales. Construct validity was evaluated by exploratory factor and confirmatory factor analyses. The internal consistency coefficient for the entire group was 0.88. The CES-D was positively correlated with scales that measure negative psychological constructs, such as the State Anxiety Inventory for Children, the Korean Social Anxiety Scale for Children and Adolescents, and the Reynold Suicidal Ideation Questionnaire, but it was negatively correlated with scales that measure positive psychological constructs, such as the Korean version of the Rosenberg Self-Esteem Scale and the Connor-Davidson Resilience Scale-2. The CES-D was examined by three-dimensional exploratory factor analysis, and the three-factor structure of the scale explained 53.165% of the total variance. The variance explained by factor I was 24.836%, that explained by factor II was 15.988%, and that explained by factor III was 12.341%. The construct validity of the CES-D was tested by confirmatory factor analysis, and we applied the entire group's data using a three-factor hierarchical model. The fit index showed a level similar to those of other countries' adolescent samples. The CES-D has high internal consistency and addresses psychological constructs similar to those addressed by other scales. The CES-D showed a three-factor structure in an exploratory factor analysis. 
The present findings suggest that the CES-D is a useful and reliable tool for measuring depression in Korean adolescents.
Inconsistencies in authoritative national paediatric workforce data sources.
Allen, Amy R; Doherty, Richard; Hilton, Andrew M; Freed, Gary L
2017-12-01
Objective National health workforce data are used in workforce projections, policy and planning. If data to measure the current effective clinical medical workforce are not consistent, accurate and reliable, policy options pursued may not be aligned with Australia's actual needs. The aim of the present study was to identify any inconsistencies and contradictions in the numerical count of paediatric specialists in Australia, and discuss issues related to the accuracy of collection and analysis of medical workforce data. Methods This study compared respected national data sources regarding the number of medical practitioners in eight fields of paediatric speciality medical (non-surgical) practice. It also counted the number of doctors listed on the websites of speciality paediatric hospitals and clinics as practicing in these eight fields. Results Counts of medical practitioners varied markedly for all specialties across the data sources examined. In some fields examined, the range of variability across data sources exceeded 450%. Conclusions The national datasets currently available from federal and speciality sources do not provide consistent or reliable counts of the number of medical practitioners. The lack of an adequate baseline for the workforce prevents accurate predictions of future needs to provide the best possible care of children in Australia. What is known about the topic? Various national data sources contain counts of the number of medical practitioners in Australia. These data are used in health workforce projections, policy and planning. What does this paper add? The present study found that the current data sources do not provide consistent or reliable counts of the number of practitioners in eight selected fields of paediatric speciality practice. There are several potential issues in the way workforce data are collected or analysed that cause the variation between sources to occur. What are the implications for practitioners? 
Without accurate data on which to base decision making, policy options may not be aligned with the actual needs of children with various medical needs, in various geographic areas or the nation as a whole.
[Consistency and Reliability of MDK Expertise Examining the Encoding in the German DRG System].
Gaertner, T; Lehr, F; Blum, B; van Essen, J
2015-09-01
Hospital inpatient stays are reimbursed on the basis of German diagnosis-related groups (G-DRG). The G-DRG classification system is based on complex coding guidelines. The Medical Review Board of the Statutory Health Insurance Funds (MDK) examines the encoding by hospitals and delivers individual expertises on behalf of the German statutory health insurance companies in cases in which irregularities are suspected. A study was conducted on the inter-rater reliability of the MDK expertises regarding the scope of the assessment. A representative sample of 212 MDK expertises was taken from a selected pool of 1 392 MDK expertises in May 2013. This representative sample underwent a double-examination by 2 independent MDK experts using special software based on the 3M™ G-DRG Grouper 2013 of 3M Medica, Germany. The following items encoded by the hospitals were examined: DRG, principal diagnosis, secondary diagnoses, procedures and additional payments. It was analysed whether the results of MDK expertises were consistent, reliable and correct. 202 expertises were eligible for evaluation, containing a total of 254 questions regarding one or more of the 5 items encoded by hospitals. The double-examination by 2 independent MDK experts showed matching results in 187 questions (73.6%), meaning they had been examined consistently and correctly. 59 questions (23.2%) did not show matching results; nevertheless, they had been examined correctly regarding the scope of the assessment. None of the principal diagnoses was significantly affected by inconsistent or wrong judgment. A representative sample of MDK expertises examining the DRG encoding by hospitals showed a very high percentage of correct examination by the MDK experts. Identical MDK expertises cannot be achieved in all cases due to the scope of the assessment. 
Further improvement and simplification of codes and coding guidelines are required to reduce the scope of assessment with regard to correct DRG encoding and its examination. © Georg Thieme Verlag KG Stuttgart · New York.
Steagall, Paulo V M; Monteiro, Beatriz P; Lavoie, Anne-Marie; Frank, Diane; Troncy, Eric; Luna, Stelio P L; Brondani, Juliana T
2017-01-01
Validation of the French version of the UNESP-Botucatu multidimensional composite pain scale for assessing postoperative pain in cats. The aim of this study was to validate the French version of the UNESP-Botucatu multidimensional composite pain scale (MCPS-Fr) to assess postoperative pain in cats. Two veterinarians and one DVM student identified three domains of behavior based on video analyses: "psychomotor change", "protection of the painful area" and "physiological variables". Internal consistency was excellent (Cronbach's alpha coefficient of 0.94, 0.90 and 0.61, respectively). Criterion validity was good to very good when evaluations from the three observers were compared with a "gold standard". Inter- and intra-rater reliability for each scale item were good to very good. The optimal cut-off point identified with a ROC curve was > 7 (scale range 0-30 points), with a sensitivity of 97.8% and specificity of 99.1%. The MCPS-Fr is a valid, reliable and responsive instrument for assessing acute pain in cats undergoing ovariohysterectomy. (Translated by Dr. Beatriz Monteiro.)
A simple behavioral test for locomotor function after brain injury in mice.
Tabuse, Masanao; Yaguchi, Masae; Ohta, Shigeki; Kawase, Takeshi; Toda, Masahiro
2010-11-01
To establish a simple and reliable test for assessing locomotor function in mice with brain injury, we developed a new method, the rotarod slip test, in which the number of slips of the paralytic hind limb from a rotarod is counted. Brain injuries of different severity were created in adult C57BL/6 mice, by inflicting 1-point, 2-point and 4-point cryo-injuries. These mice were subjected to the rotarod slip test, the accelerating rotarod test and the elevated body swing test (EBST). Histological analyses were performed to assess the severity of the brain damage. Significant and consistent correlations between test scores and severity were observed for the rotarod slip test and the EBST. Only the rotarod slip test detected the mild hindlimb paresis in the acute and sub-acute phase after injury. Our results suggest that the rotarod slip test is the most sensitive and reliable method for assessing locomotor function after brain damage in mice. Copyright © 2010 Elsevier Ltd. All rights reserved.
Lee, Michele D; Kaidonis, Georgia; Kim, Alice Y; Shields, Ryan A; Leng, Theodore
2017-09-01
Choroidal nevi are common benign intraocular tumors with a small risk of malignant transformation. This retrospective study investigates the use of en face spectral-domain optical coherence tomography angiography (SD-OCTA) in determining the clinical features and measurement of choroidal nevi. Patients with choroidal nevi were imaged with both OCTA and a fundus photography device. Greatest longitudinal dimension (GLD), perpendicular dimension (PD), and the GLD/PD ratio were assessed on each device. Inter-device variation and intra- and inter-rater reliability analyses were performed. Fourteen patients with choroidal nevi were included. No significant difference between the GLD/PD ratio as measured by all three devices was found (Chi-square = 2.8, 2 df, P = .247). Intraclass correlation coefficients were greater than 0.7 for repeated measures on all devices, suggesting good repeatability and reproducibility. This study demonstrated inter-device consistency and high intra- and inter-rater reliability when measuring choroidal nevi. [Ophthalmic Surg Lasers Imaging Retina. 2017;48:741-747.]. Copyright 2017, SLACK Incorporated.
“Connectedness to Nature Scale”: Validity and Reliability in the French Context
Navarro, Oscar; Olivos, Pablo; Fleury-Bahi, Ghozlane
2017-01-01
Connectedness to nature represents the relationship of the self with the natural environment and has been operationalized using different scales. One of the most systematically studied in the Anglo-Saxon context is the Connectedness to Nature Scale (CNS). In an attempt to study the psychometric properties of this instrument in a French-speaking context, three studies (Study 1 n = 204, Study 2 n = 153, and Study 3 n = 322) were carried out in France to provide evidence of the internal consistency of the CNS, as well as its convergent, discriminant, and predictive validity. Moreover, as anticipated, positive correlations between the CNS and the environmental identity and environmental concerns scales were observed. Based on maximum likelihood factor analyses and reliability analyses, an improvement in the psychometric properties was identified by eliminating three items. Through confirmatory factor analysis, the factorial structure and the psychometric properties of the French version of the CNS were confirmed, as well as its significant prediction of eudaimonic wellbeing in regression analysis. PMID:29312052
Factor structure and psychometric properties of the Fertility Problem Inventory–Short Form
Zurlo, Maria Clelia; Cattaneo Della Volta, Maria Franscesca; Vallone, Federica
2017-01-01
The study analyses factor structure and psychometric properties of the Italian version of the Fertility Problem Inventory–Short Form. A sample of 206 infertile couples completed the Italian version of Fertility Problem Inventory (46 items) with demographics, State Anxiety Scale of State-Trait Anxiety Inventory (Form Y), Edinburgh Depression Scale and Dyadic Adjustment Scale, used to assess convergent and discriminant validity. Confirmatory factor analysis was unsatisfactory (comparative fit index = 0.87; Tucker-Lewis Index = 0.83; root mean square error of approximation = 0.17), and Cronbach’s α (0.95) revealed a redundancy of items. Exploratory factor analysis was carried out deleting cross-loading items, and Mokken scale analysis was applied to verify the items homogeneity within the reduced subscales of the questionnaire. The Fertility Problem Inventory–Short Form consists of 27 items, tapping four meaningful and reliable factors. Convergent and discriminant validity were confirmed. Findings indicated that the Fertility Problem Inventory–Short Form is a valid and reliable measure to assess infertility-related stress dimensions. PMID:29379625
Llerena, Katiah; Park, Stephanie G; McCarthy, Julie M; Couture, Shannon M; Bennett, Melanie E; Blanchard, Jack J
2013-07-01
The Clinical Assessment Interview for Negative Symptoms (CAINS) is an empirically developed interview measure of negative symptoms. Building on prior work, this study examined the reliability and validity of a self-report measure based on the CAINS-the Motivation and Pleasure Scale-Self-Report (MAP-SR)-that assesses the motivation and pleasure domain of negative symptoms. Thirty-seven participants with schizophrenia or schizoaffective disorder completed the 18-item MAP-SR, the CAINS, and other measures of functional outcome. Item analyses revealed three items that performed poorly. The revised 15-item MAP-SR demonstrated good internal consistency and convergent validity with the clinician-rated Motivation and Pleasure scale of the CAINS, as well as good discriminant validity, with little association with psychotic symptoms or depression/anxiety. MAP-SR scores were related to social anhedonia, social closeness, and clinician-rated social functioning. The MAP-SR is a promising self-report measure of severity of negative symptoms. Copyright © 2013 Elsevier Inc. All rights reserved.
Psychometrics of the Personal Questionnaire: A client-generated outcome measure.
Elliott, Robert; Wagner, John; Sales, Célia M D; Rodgers, Brian; Alves, Paula; Café, Maria J
2016-03-01
We present a range of evidence for the reliability and validity of data generated by the Personal Questionnaire (PQ), a client-generated individualized outcome measure, using 5 data sets from 3 countries. Overall pretherapy mean internal consistency (alpha) across clients was .80, and within-client alphas averaged .77; clients typically had 1 or 2 items that did not vary with the other items. Analyses of temporal structure indicated high levels of between-clients variance (58%), moderate pretherapy test-retest correlation (r = .57), and high session-to-session Lag-1 autocorrelation (.82). Scores on the PQ provided clear evidence of convergence with a range of outcome measures (within-client r = .41). Mean pre-post effects were large (d = 1.25). The results support a revised caseness cutoff of 3.25 and a reliable change index interval of 1.67. We conclude that PQ data meet criteria for evidence-based, norm-referenced measurement of client psychological distress for supporting psychotherapy practice and research. (c) 2016 APA, all rights reserved.
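Internal consistency (Cronbach's alpha) is the statistic reported in this abstract and in most of the scale-validation entries in this listing. For orientation, here is a minimal sketch of the standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The function name `cronbach_alpha` and the item scores are hypothetical and are not data from any of the listed studies.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items score matrix.

    scores: list of rows, one per respondent, each with k item scores.
    Uses sample variances (n - 1 denominator) throughout.
    """
    k = len(scores[0])
    # Variance of each item across respondents.
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Variance of each respondent's total score.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical ratings: five respondents, four items on a 1-7 scale.
ratings = [
    [5, 6, 5, 4],
    [3, 3, 4, 3],
    [6, 7, 6, 6],
    [2, 3, 2, 3],
    [4, 4, 5, 4],
]
print(f"alpha = {cronbach_alpha(ratings):.2f}")
```

Alpha grows with the number of items as well as with their intercorrelation, which is one reason a very high value on a long scale can point to redundant items rather than to better measurement.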
Andreu, Yolanda; Galdon, Maria J; Durá, Estrella; Ferrando, Maite; Pascual, Juan; Turk, Dennis C; Jiménez, Yolanda; Poveda, Rafael
2006-01-01
Background This paper seeks to analyse the psychometric and structural properties of the Multidimensional Pain Inventory (MPI) in a sample of temporomandibular disorder patients. Methods The internal consistency of the scales was obtained. Confirmatory Factor Analysis was carried out to test the MPI structure section by section in a sample of 114 temporomandibular disorder patients. Results Nearly all scales obtained good reliability indexes. The original structure could not be totally confirmed. However, with a few adjustments we obtained a satisfactory structural model of the MPI which was slightly different from the original: certain items and the Self control scale were eliminated; in two cases, two original scales were grouped in one factor, Solicitous and Distracting responses on the one hand, and Social activities and Away from home activities, on the other. Conclusion The MPI has been demonstrated to be a reliable tool for the assessment of pain in temporomandibular disorder patients. Some divergences to be taken into account have been clarified. PMID:17169143
Jin, Xinhong; Jin, Yahong; Zhou, Shi; Li, Xinhao; Yang, Shun-Nan; Yang, Donglin; Nieuwoudt, Johanna E; Yao, Jiaxin
2015-06-01
Muscle dysmorphia (MD) is the distorted perception of men's own muscle appearance. The increasing popularity of weightlifting in Chinese men suggests the presence of MD. The study assessed the validity and reliability of the Muscle Appearance Satisfaction Scale (MASS) for its use on adult Chinese males. Exploratory and confirmatory factor analyses of responses from 225 and 592 participants confirmed the same five factors for the 17-item Chinese version as the original MASS (CFI=.931, RMSEA=.052). The internal consistency for all factors was acceptable (Cronbach's α=.636 to .737). Correlation levels of its subscales with converging measurements indicated that the revised MASS is effective in assessing MD in Chinese male weightlifters. Differences in the importance of the factors suggest an influence of Chinese culture on the symptoms of MD and the need to assess the MASS with populations from distinct demographics in China and from different cultures. Copyright © 2015 Elsevier Ltd. All rights reserved.
Measuring hope among families impacted by cognitive impairment.
Hunsaker, Amanda E; Terhorst, Lauren; Gentry, Amanda; Lingler, Jennifer H
2016-07-01
The current exploratory investigation aims to establish the reliability and validity of a hope measure, the Herth Hope Index, among families impacted by early cognitive impairment (N = 96). Exploratory factor analysis was used to examine the dimensionality of the measure. Bivariate analyses were used to examine construct validity. The sample had moderately high hope scores. A two-factor structure emerged from the factor analysis, explaining 51.44% of the variance. Both factors exhibited strong internal consistency (Cronbach's alphas ranged from .83 to .86). Satisfaction with social support was positively associated with hope, supporting convergent validity. Neurocognitive status, illness insight, and depression were not associated with hope, indicating discriminant validity. Families impacted by cognitive impairment may maintain hope in the face of a potentially progressive illness, regardless of cognitive status. The Herth Hope Index can be utilized as a reliable and valid measure of hope by practitioners providing support to families impacted by cognitive impairment. © The Author(s) 2014.
Initial development and reliability of a motivation for weight loss scale.
Meyer, Andrea H; Weissen-Schelling, Simone; Munsch, Simone; Margraf, Jürgen
2010-06-01
We aimed at developing and evaluating a questionnaire assessing health and appearance as the two main reasons for weight loss in overweight and obese individuals. Using data from two representative telephone surveys in Switzerland, the factorial structure of this questionnaire was analyzed by exploratory and confirmatory factor analysis. The model obtained was cross-validated with data from a second representative Swiss survey and multigroup analyses according to sex, age, BMI and regional language subgroups were performed. This led to a 24-item, 3-factor solution, with factors labeled 'health', 'appearance in relation to others', and 'appearance in relation to oneself'. Internal consistency and test-retest reliability were good. To the best of our knowledge, this is the first validated questionnaire assessing overweight and obese individuals' reasons for weight loss. It should be further tested whether using this questionnaire as a pretreatment assessment device will help in tailoring treatments to individuals, thereby increasing treatment adherence and success. Copyright (c) 2010 S. Karger AG, Basel.
Brief reasons for living inventory: a psychometric investigation.
Cwik, Jan Christopher; Siegmann, Paula; Willutzki, Ulrike; Nyhuis, Peter; Wolter, Marcus; Forkmann, Thomas; Glaesmer, Heide; Teismann, Tobias
2017-11-06
The present study aimed at validating the German version of the Brief Reasons for Living inventory (BRFL). Validity and reliability were established in a community (n = 339) and a clinical sample (n = 272). Convergent and discriminant validity were investigated, and confirmatory factor analyses were conducted for the complete BRFL as well as for a 10-item version excluding conditional items on child-related concerns. Furthermore, it was assessed how BRFL scores moderate the association between depression and suicide ideation. Results indicated an adequate fit of the data to the original factor structure. The total scale and the subscales of the German version of the BRFL had sufficient internal consistency, as well as good convergent and divergent validity. The BRFL demonstrated clinical utility by differentiating between participants with vs. without suicide ideation. Reasons for living proved to moderate the association between depression and suicide ideation. Results provide preliminary evidence that the BRFL may be a reliable and valid measure of adaptive reasons for living that can be used in clinic and research settings.
Zurlo, Maria Clelia; Pes, Daniela; Siegrist, Johannes
2010-08-01
This study explores the explicative potential of effort-reward imbalance Model to unveil the dimensions involved in teacher stress process and analyses the psychometric characteristics of the Italian version of the ERI Questionnaire (Siegrist, J Occup Health Psychol 1:27-43, 1996) with respect to a homogeneous occupational group: Italian school teachers. The Italian version of the ERI Questionnaire was submitted to 673 teachers randomly drawn from a cross-section of school types. Internal consistency, reliability, discriminative validity, and factorial structure were evaluated. Predictive validity was explored with respect to a measure of perceived strain, the Crown-Crisp Experiential Index. Discriminative validity was explored with respect to age, gender, education, type of school, the presence/absence of physical pains in the last 12 months before the survey, and teachers' intention to leave the profession. Item-total correlations are for all items included between 0.30 and 0.80 (p < 0.01). Mean inter-item correlation is 0.26. Cronbach's alpha for the whole questionnaire reaches the value of 0.89. The factor analysis identified four reliable factors that accounted for 44.8 per cent of the total variance and which confirmed the basic structure emerged from previous studies yet highlighting two instead of three different components for reward. Higher efforts (T = -3.82, p < 0.001) and both lower material (T = 3.23, p < 0.001) and immaterial rewards (T = 3.17, p < 0.005) characterised the group of teachers, which reported to suffer for physical pains. Higher efforts (T = -5.26, p < 0.001), higher overcommitment (T = -3.15, p < 0.005), and both lower material (T = 4.63, p < 0.001) and immaterial rewards (T = 4.00, p < 0.001) were observed in the group of teachers inclined to give up the job. 
Multiple regression analyses have highlighted that higher efforts, higher overcommitment, and lower rewards are significantly predictive of higher levels of free-floating and somatic anxiety as well as depression and global psychological strain. This preliminary analysis of the reliability and validity of the Italian version of the ERI Questionnaire reveals that it constitutes a useful and reliable measure to analyse work-related stress with respect to the school setting. The validity of the ERI model to describe the dimensions involved in teacher's stress and to highlight those associated to leaving intentions and to several physical and psychological strain outcomes in Italian school teachers has been confirmed.
Bermúdez-de-Alvear, Rosa M; Gálvez-Ruiz, Pablo; Martínez-Arquero, A Ginés; Rando-Márquez, Sara; Fernández-Contreras, Elena
2018-06-11
This study aimed to analyze the psychometric properties of the Spanish version of the Voice Activity and Participation Profile (SVAPP) questionnaire. A randomized, cross-sectional sampling strategy with controls was used. Two samples with a total of 169 participants were analyzed, specifically 61 men (mean age 37.02) and 108 women (mean age 37.78). Of these participants, 112 were patients and 57 were controls. The instrument was submitted to reliability (internal consistency and corrected item-total correlations) and reproducibility analyses. Validation assessment was based on the construct validity, convergent validity, discriminant validity, and concurrent validity. The global internal consistency was excellent (Cronbach's α = 0.976), corrected item-total correlations were satisfactory and ranged 0.63-0.89, and factor loadings were above 0.50. The different subscales showed good internal consistency (alpha coefficients ranged 0.830-0.956) and test-retest values were consistently associated. The exploratory factor analysis evidenced a strongly defined five-factor internal structure, with factor loadings ranging 0.51-0.86. Convergent validity demonstrated that all subscales and scores were very strongly correlated (Pearson r above 0.735) and significantly associated. The discriminant validity analysis showed that the SVAPP had good specificity to distinguish dysphonic from healthy voice subjects. Concurrent validity with the Spanish version of the Voice Handicap Index (SVHI) showed very strong correlations between total scores, and between the SVHI total score and the SVAPP Daily and Social Communication subscales; correlations between the subscales of both tests were strong; only between the SVAPP Work and SVHI Physical sections were correlations moderate. The findings of the present study demonstrated evidence for the SVAPP questionnaire's reliability and validity, and provided insight into the implications of voice disorders for Spanish patients' quality of life. 
However, further investigations are required. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Reliabilities of mental rotation tasks: limits to the assessment of individual differences.
Hirschfeld, Gerrit; Thielsch, Meinald T; Zernikow, Boris
2013-01-01
Mental rotation tasks with objects and body parts as targets are widely used in cognitive neuropsychology. Even though these tasks are well established to study between-groups differences, the reliability on an individual level is largely unknown. We present a systematic study on the internal consistency and test-retest reliability of individual differences in mental rotation tasks comparing different target types and orders of presentations. In total n = 99 participants (n = 63 for the retest) completed the mental rotation tasks with hands, feet, faces, and cars as targets. Different target types were presented in either randomly mixed blocks or blocks of homogeneous targets. Across all target types, the consistency (split-half reliability) and stability (test-retest reliabilities) were good or acceptable both for intercepts and slopes. At the level of individual targets, only intercepts showed acceptable reliabilities. Blocked presentations resulted in significantly faster and numerically more consistent and stable responses. Mental rotation tasks-especially in blocked variants-can be used to reliably assess individual differences in global processing speed. However, the assessment of the theoretically important slope parameter for individual targets requires further adaptations to mental rotation tests.
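The abstract above reports consistency as split-half reliability. As a generic illustration, and not the authors' pipeline (they analysed intercepts and slopes of response times), the sketch below assumes an odd/even split of trials, a Pearson correlation between the two half-scores, and the Spearman-Brown correction to full test length. All function names and the response times are hypothetical.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman_brown(r_half):
    """Step a half-test correlation up to the reliability of the full test."""
    return 2 * r_half / (1 + r_half)


def split_half_reliability(trials):
    """trials: one row per participant, one score per trial (odd/even split assumed)."""
    first_half = [sum(row[0::2]) for row in trials]   # trials 1, 3, 5, ...
    second_half = [sum(row[1::2]) for row in trials]  # trials 2, 4, 6, ...
    return spearman_brown(pearson_r(first_half, second_half))


# Hypothetical response times (seconds) for four participants over six trials.
rts = [
    [1.2, 1.3, 1.1, 1.4, 1.2, 1.3],
    [0.9, 0.8, 1.0, 0.9, 0.8, 0.9],
    [1.6, 1.5, 1.7, 1.6, 1.8, 1.5],
    [1.1, 1.0, 1.2, 1.1, 1.0, 1.2],
]
print(f"split-half reliability = {split_half_reliability(rts):.2f}")
```

The correction matters because each half contains only half the trials, so the raw correlation between halves understates the reliability of the full task.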
NASA Astrophysics Data System (ADS)
Wallace, Jon Michael
2003-10-01
Reliability prediction of components operating in complex systems has historically been conducted in a statistically isolated manner. Current physics-based, i.e. mechanistic, component reliability approaches focus more on component-specific attributes and mathematical algorithms and not enough on the influence of the system. The result is that significant error can be introduced into the component reliability assessment process. The objective of this study is the development of a framework that infuses the needs and influence of the system into the process of conducting mechanistic-based component reliability assessments. The formulated framework consists of six primary steps. The first three steps, identification, decomposition, and synthesis, are primarily qualitative in nature and employ system reliability and safety engineering principles to construct an appropriate starting point for the component reliability assessment. The following two steps are the most unique. They involve a step to efficiently characterize and quantify the system-driven local parameter space and a subsequent step using this information to guide the reduction of the component parameter space. The local statistical space quantification step is accomplished using two proposed multivariate probability models: Multi-Response First Order Second Moment and Taylor-Based Inverse Transformation. Where existing joint probability models require preliminary distribution and correlation information of the responses, these models combine statistical information of the input parameters with an efficient sampling of the response analyses to produce the multi-response joint probability distribution. Parameter space reduction is accomplished using Approximate Canonical Correlation Analysis (ACCA) employed as a multi-response screening technique. 
The novelty of this approach is that each individual local parameter and even subsets of parameters representing entire contributing analyses can now be rank ordered with respect to their contribution to not just one response, but the entire vector of component responses simultaneously. The final step of the framework is the actual probabilistic assessment of the component. Although the same multivariate probability tools employed in the characterization step can be used for the component probability assessment, variations of this final step are given to allow for the utilization of existing probabilistic methods such as response surface Monte Carlo and Fast Probability Integration. The overall framework developed in this study is implemented to assess the finite-element based reliability prediction of a gas turbine airfoil involving several failure responses. Results of this implementation are compared to results generated using the conventional 'isolated' approach as well as a validation approach conducted through large sample Monte Carlo simulations. The framework resulted in a considerable improvement to the accuracy of the part reliability assessment and an improved understanding of the component failure behavior. Considerable statistical complexity in the form of joint non-normal behavior was found and accounted for using the framework. Future applications of the framework elements are discussed.
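For readers unfamiliar with first-order second-moment propagation, the sketch below shows the textbook version of the idea for several responses at once: the response means are approximated by evaluating the model at the input means, and the response covariance by J Σ Jᵀ, where J is the Jacobian at the mean. This is not the Multi-Response First Order Second Moment model proposed in the thesis, whose details the abstract does not give; the function name and the finite-difference step are assumptions.

```python
def first_order_moments(g, mu, cov, h=1e-6):
    """Textbook first-order approximation of response mean and covariance.

    g   : function mapping a list of inputs to a list of responses
    mu  : list of input means
    cov : input covariance matrix as a list of lists
    h   : relative step for central differences (assumed value)
    """
    base = g(mu)
    n_in, n_out = len(mu), len(base)

    # Jacobian J[i][j] = d g_i / d x_j, estimated by central differences.
    jac = [[0.0] * n_in for _ in range(n_out)]
    for j in range(n_in):
        step = h * max(1.0, abs(mu[j]))
        up, down = list(mu), list(mu)
        up[j] += step
        down[j] -= step
        g_up, g_down = g(up), g(down)
        for i in range(n_out):
            jac[i][j] = (g_up[i] - g_down[i]) / (2 * step)

    # Response covariance J * cov * J^T.
    out_cov = [
        [
            sum(jac[i][a] * cov[a][b] * jac[k][b]
                for a in range(n_in) for b in range(n_in))
            for k in range(n_out)
        ]
        for i in range(n_out)
    ]
    return base, out_cov
```

The approximation is exact for linear responses and degrades as the responses become more nonlinear over the spread of the inputs, which is one reason sampling-based checks such as the Monte Carlo validation mentioned in the abstract are useful.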
Interformat reliability of digital psychiatric self-report questionnaires: a systematic review.
Alfonsson, Sven; Maathz, Pernilla; Hursti, Timo
2014-12-03
Research on Internet-based interventions typically uses digital versions of pen and paper self-report symptom scales. However, adaptation into the digital format could affect the psychometric properties of established self-report scales. Several studies have investigated differences between digital and pen and paper versions of instruments, but no systematic review of the results has yet been done. This review aims to assess the interformat reliability of self-report symptom scales used in digital or online psychotherapy research. Three databases (MEDLINE, Embase, and PsycINFO) were systematically reviewed for studies investigating the reliability between digital and pen and paper versions of psychiatric symptom scales. From a total of 1504 publications, 33 were included in the review, and interformat reliability of 40 different symptom scales was assessed. Significant differences in mean total scores between formats were found in 10 of 62 analyses. These differences were found in just a few studies, which indicates that the results were due to study effects and sample effects rather than unreliable instruments. The interformat reliability ranged from r=.35 to r=.99; however, the majority of instruments showed a strong correlation between format scores. The quality of the included studies varied, and several studies had insufficient power to detect small differences between formats. When digital versions of self-report symptom scales are compared to pen and paper versions, most scales show high interformat reliability. This supports the reliability of results obtained in psychotherapy research on the Internet and the comparability of the results to traditional psychotherapy research. There are, however, some instruments that consistently show low interformat reliability, suggesting that these conclusions cannot be generalized to all questionnaires. Most studies had at least some methodological issues, with insufficient statistical power being the most common issue.
Future studies should preferably provide information about the transformation of the instrument into digital format and the procedure for data collection in more detail.
Scale for positive aspects of caregiving experience: development, reliability, and factor structure.
Kate, N; Grover, S; Kulhara, P; Nehra, R
2012-06-01
OBJECTIVE. To develop an instrument (Scale for Positive Aspects of Caregiving Experience [SPACE]) that evaluates positive caregiving experience and assess its psychometric properties. METHODS. Available scales which assess some aspects of positive caregiving experience were reviewed and a 50-item questionnaire with a 5-point rating was constructed. In all, 203 primary caregivers of patients with severe mental disorders were asked to complete the questionnaire. Internal consistency, test-retest reliability, cross-language reliability, split-half reliability, and face validity were evaluated. Principal component factor analysis was run to assess the factorial validity of the scale. RESULTS. The scale developed as part of the study was found to have good internal consistency, test-retest reliability, cross-language reliability, split-half reliability, and face validity. Principal component factor analysis yielded a 4-factor structure, which also had good test-retest reliability and cross-language reliability. There was a strong correlation between the 4 factors obtained. CONCLUSION. The SPACE developed as part of this study has good psychometric properties.
Wang, Yuan; Bao, Shan; Du, Wenjun; Ye, Zhirui; Sayer, James R
2017-11-17
This article investigated and compared frequency domain and time domain characteristics of drivers' behaviors before and after the start of distracted driving. Data from an existing naturalistic driving study were used. Fast Fourier transform (FFT) was applied for the frequency domain analysis to explore drivers' behavior pattern changes between nondistracted (prestarting of visual-manual task) and distracted (poststarting of visual-manual task) driving periods. Average relative spectral power in a low frequency range (0-0.5 Hz) and the standard deviation in a 10-s time window of vehicle control variables (i.e., lane offset, yaw rate, and acceleration) were calculated and further compared. Sensitivity analyses were also applied to examine the reliability of the time and frequency domain analyses. Results of the mixed model analyses from the time and frequency domain analyses all showed significant degradation in lateral control performance after engaging in visual-manual tasks while driving. Results of the sensitivity analyses suggested that the frequency domain analysis was less sensitive to the frequency bandwidth, whereas the time domain analysis was more sensitive to the time intervals selected for variation calculations. Different time interval selections can result in significantly different standard deviation values, whereas average spectral power analysis on yaw rate in both low and high frequency bandwidths showed consistent results, that higher variation values were observed during distracted driving when compared to nondistracted driving. This study suggests that driver state detection needs to consider the behavior changes during the prestarting periods, instead of only focusing on periods with physical presence of distraction, such as cell phone use. Lateral control measures can be a better indicator of distraction detection than longitudinal controls. 
In addition, frequency domain analyses proved to be a more robust and consistent method in assessing driving performance compared to time domain analyses.
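The two measures compared in this abstract can be sketched as follows. This is an illustrative reading, not the authors' code: it assumes the signal mean is removed before the transform, that the band is defined by an upper edge only, and that windows are non-overlapping. A naive discrete Fourier transform is used so the sketch needs only the standard library; for real data an FFT routine would be the practical choice.

```python
import cmath
import math
from statistics import stdev


def one_sided_power(signal):
    """Power in DFT bins k = 1..n//2 of the mean-removed signal (naive O(n^2))."""
    n = len(signal)
    mean = sum(signal) / n
    centered = [v - mean for v in signal]
    power = []
    for k in range(1, n // 2 + 1):
        coeff = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                    for i, x in enumerate(centered))
        power.append(abs(coeff) ** 2)
    return power


def relative_band_power(signal, fs, f_hi=0.5):
    """Share of total power at frequencies up to f_hi (Hz); fs is the sampling rate."""
    n = len(signal)
    power = one_sided_power(signal)
    total = sum(power)
    if total == 0:
        return 0.0
    in_band = sum(p for k, p in enumerate(power, start=1) if k * fs / n <= f_hi)
    return in_band / total


def windowed_std(signal, fs, window_s=10.0):
    """Sample standard deviation in consecutive non-overlapping windows."""
    size = int(round(window_s * fs))
    return [stdev(signal[s:s + size])
            for s in range(0, len(signal) - size + 1, size)]
```

The window length is an explicit parameter of `windowed_std`, which makes it easy to repeat the calculation with different intervals, in the spirit of the sensitivity analysis the abstract describes.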
The development and validation of a neuropathy- and foot ulcer-specific quality of life instrument.
Vileikyte, Loretta; Peyrot, Mark; Bundy, Christine; Rubin, Richard R; Leventhal, Howard; Mora, Pablo; Shaw, Jonathan E; Baker, Paul; Boulton, Andrew J M
2003-09-01
The purpose of this study was to develop a questionnaire that measures patients' perceptions of the impact of diabetic peripheral neuropathy and foot ulcers on their quality of life and to assess the psychometric properties of this instrument in a sample of patients with varying severity and symptomatology of diabetic peripheral neuropathy. The neuropathy- and foot ulcer-specific quality of life instrument (NeuroQoL), generated from interviews with patients with (n = 47) and without (n = 15) diabetic peripheral neuropathy, was administered to 418 consecutive patients with diabetic peripheral neuropathy (35% with foot ulcer history) attending either U.K. (n = 290) or U.S. (n = 128) diabetes centers. Psychometric tests of NeuroQoL included factor analyses and internal consistency of scales; a series of multivariate analyses were performed to establish its criterion, construct, and incremental validity. Results were compared with those obtained using the Short Form (SF)-12 measure of health-related functioning. Factor analyses of NeuroQoL revealed three physical symptom measures and two psychosocial functioning measures with good reliability (alpha = 0.86-0.95). NeuroQoL was more strongly associated with measures of neuropathic severity than SF-12, more fully mediated the relationship of diabetic peripheral neuropathy with overall quality of life, and significantly increased explained variance in overall quality of life over SF-12. NeuroQoL reliably captures the key dimensions of the patients' experience of diabetic peripheral neuropathy and is a valid tool for studying the impact of neuropathy and foot ulceration on quality of life.
A validity and reliability study of the coping self-efficacy scale
Chesney, Margaret A.; Neilands, Torsten B.; Chambers, Donald B.; Taylor, Jonelle M.; Folkman, Susan
2006-01-01
Objectives: Investigate the psychometric characteristics of the coping self-efficacy (CSE) scale, a 26-item measure of one’s confidence in performing coping behaviors when faced with life challenges. Design: Data came from two randomized clinical trials (N1 = 149, N2 = 199) evaluating a theory-based Coping Effectiveness Training (CET) intervention in reducing psychological distress and increasing positive mood in persons coping with chronic illness. Methods: The 348 participants were HIV-seropositive men with depressed mood who have sex with men. Participants were randomly assigned to intervention and comparison conditions and assessed pre- and post-intervention. Outcome variables included the CSE scale, ways of coping, and measures of social support and psychological distress and well-being. Results: Exploratory (EFA) and confirmatory factor analyses (CFA) revealed a 13-item reduced form of the CSE scale with three factors: Use problem-focused coping (6 items, α = .91), stop unpleasant emotions and thoughts (4 items, α = .91), and get support from friends and family (3 items, α = .80). Internal consistency and test–retest reliability are strong for all three factors. Concurrent validity analyses showed these factors assess self-efficacy for different types of coping. Predictive validity analyses showed that residualized change scores in using problem- and emotion-focused coping skills were predictive of reduced psychological distress and increased psychological well-being over time. Conclusions: The CSE scale provides a measure of a person’s perceived ability to cope effectively with life challenges, as well as a way to assess changes in CSE over time in intervention research. PMID:16870053
Brothers, Allyson; Chui, Helena; Diehl, Manfred
2014-01-01
Purpose of the Study: Despite calls for the consideration of future time perspective (FTP) as a multidimensional construct, mostly unidimensional measurement instruments have been used. This study had two objectives: (a) to develop a brief multidimensional questionnaire for assessing FTP in adulthood and evaluate its psychometric properties; and (b) to examine age associations and age-group differences of the dimensions of FTP. Design and Methods: Data were collected from 625 community-residing adults between the ages of 18 and 93, representing young, middle-aged, and older adults. The psychometric evaluation involved exploratory factor analyses (EFA) and confirmatory FA (CFA), reliability and validity analyses, and measurement invariance testing. Zero-order and partial correlations were used to examine the association of the dimensions of FTP with age, and multivariate analysis of variance was used to examine age-group differences. Results: EFA and CFA supported a three-factor solution: Future as Open, Future as Limited, and Future as Ambiguous. Metric measurement invariance for this factor structure was confirmed across the three age groups. Reliability and validity analyses provided evidence of sound psychometric properties of the brief questionnaire. Age was negatively associated with Future as Open and positively associated with Future as Limited. Young adults exhibited significantly greater ambiguity toward the future than middle-aged or older adults. Implications: This study provides evidence in support of the psychometric properties of a new brief multidimensional FTP scale. It also provides evidence for a pattern of age associations and age-group differences consistent with life-span developmental theory. PMID:25063938
Reliability of resting-state microstate features in electroencephalography.
Khanna, Arjun; Pascual-Leone, Alvaro; Farzan, Faranak
2014-01-01
Electroencephalographic (EEG) microstate analysis is a method of identifying quasi-stable functional brain states ("microstates") that are altered in a number of neuropsychiatric disorders, suggesting their potential use as biomarkers of neurophysiological health and disease. However, use of EEG microstates as neurophysiological biomarkers requires assessment of the test-retest reliability of microstate analysis. We analyzed resting-state, eyes-closed, 30-channel EEG from 10 healthy subjects over 3 sessions spaced approximately 48 hours apart. We identified four microstate classes and calculated the average duration, frequency, and coverage fraction of these microstates. Using Cronbach's α and the standard error of measurement (SEM) as indicators of reliability, we examined: (1) the test-retest reliability of microstate features using a variety of different approaches; (2) the consistency between TAAHC and k-means clustering algorithms; and (3) whether microstate analysis can be reliably conducted with 19 and 8 electrodes. The approach of identifying a single set of "global" microstate maps showed the highest reliability (mean Cronbach's α > 0.8, SEM ≈ 10% of mean values) compared to microstates derived by each session or each recording. There was notably low reliability in features calculated from maps extracted individually for each recording, suggesting that the analysis is most reliable when maps are held constant. Features were highly consistent across clustering methods (Cronbach's α > 0.9). All features had high test-retest reliability with 19 and 8 electrodes. High test-retest reliability and cross-method consistency of microstate features suggest their potential as biomarkers for assessment of the brain's neurophysiological health.
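The two reliability indicators named in this abstract can be computed as in the sketch below. It assumes the usual textbook definitions: Cronbach's α from a subjects-by-sessions table, with sessions playing the role of items, and SEM as the standard deviation times the square root of one minus the reliability. Which standard deviation the authors used is not stated, so it is left as an input; the function names are hypothetical.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a table with one row per subject.

    Each row holds that subject's value in every session (or item).
    Sample variances (n - 1 denominator) are used throughout.
    """
    k = len(rows[0])
    item_vars = [variance([row[j] for row in rows]) for j in range(k)]
    total_var = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


def standard_error_of_measurement(sd, reliability):
    """SEM = SD * sqrt(1 - reliability); the choice of SD is left to the caller."""
    return sd * (1 - reliability) ** 0.5
```

For the design described here, `rows` would hold one microstate feature (for example average duration of one class) for each of the 10 subjects across the 3 sessions.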
Li, Jie; Stroebe, Magaret; Chan, Cecilia L W; Chow, Amy Y M
2017-06-01
The rationale, development, and validation of the Bereavement Guilt Scale (BGS) are described in this article. The BGS was based on a theoretically developed, multidimensional conceptualization of guilt. Part 1 describes the generation of the item pool, derived from in-depth interviews, and review of the scientific literature. Part 2 details statistical analyses for further item selection (Sample 1, N = 273). Part 3 covers the psychometric properties of the emergent-BGS (Sample 2, N = 600, and Sample 3, N = 479). Confirmatory factor analysis indicated that a five-factor model fit the data best. Correlations of BGS scores with depression, anxiety, self-esteem, self-forgiveness, and mode of death were consistent with theoretical predictions, supporting the construct validity of the measure. The internal consistency and test-retest reliability were also supported. Thus, initial testing or examination suggests that the BGS is a valid tool to assess multiple components of bereavement guilt. Further psychometric testing across cultures is recommended.
Further refinement of the Escherichia coli brain abscess model in rat.
Nazzaro, J M; Pagel, M A; Neuwelt, E A
1992-09-01
The rat brain abscess model provides a substrate for the modeling of delivery of therapeutic agents to intracerebral mass lesions. We now report refinement of the Escherichia coli brain abscess model in rat. A K1 surface antigen-negative E. coli isolated from human blood culture was stereotaxically inoculated into deep brain sites. Histopathologic analyses and quantitative cultures demonstrated the consistent production of lesions. No animal in this consecutive series developed meningitis, ventriculitis or sepsis. By contrast, prior experience with E. coli abscess production resulted in a 25% rate of failure of abscess production or death from sepsis. This improvement in the model may be attributable to specific characteristics of the bacteria used, modification of the inoculation method or the intracerebral placement technique. The present work suggests a reliable and consistent brain abscess model, which may be further used to study brain suppuration.
Cheong, A T; Tong, S F; Sazlina, S G
2015-01-01
The Hill-Bone compliance to high blood pressure therapy scale (HBTS) is one of the useful scales in primary care settings. It has been tested in America, Africa and Turkey with variable validity and reliability. The aim of this paper was to determine the validity and reliability of the Malay version of HBTS (HBTS-M) for the Malaysian population. HBTS comprises three subscales assessing compliance to medication, appointment and salt intake. The content validity of HBTS to the local population was agreed through consensus of an expert panel. The 14 items used in the HBTS were adapted to reflect the local situations. It was translated into Malay and then back-translated into English. The translated version was piloted in 30 participants. This was followed by structural and predictive validity, and internal consistency testing in 262 patients with hypertension, who were on antihypertensive agent(s) for at least 1 year in two primary healthcare clinics in Kuala Lumpur, Malaysia. Exploratory factor analyses and the correlation between HBTS-M total score and blood pressure were performed. The Cronbach's alpha was calculated accordingly. Factor analysis revealed a three-component structure represented by two components on medication adherence and one on salt intake adherence. The Kaiser-Meyer-Olkin statistic was 0.764. The variance explained by the three factors was 23.6%, 10.4% and 9.8%, respectively. However, the internal consistency for each component was suboptimal with Cronbach's alpha of 0.64, 0.55 and 0.29, respectively. Although there were two components representing medication adherence, the theoretical concepts underlying each concept cannot be differentiated. In addition, there was no correlation between the HBTS-M total score and blood pressure. HBTS-M did not conform to the structural and predictive validity of the original scale. Its reliability in assessing medication and salt intake adherence would most probably be suboptimal in the Malaysian primary care setting.
Feenstra, Heleen E M; Murre, Jaap M J; Vermeulen, Ivar E; Kieffer, Jacobien M; Schagen, Sanne B
2018-04-01
To facilitate large-scale assessment of a variety of cognitive abilities in clinical studies, we developed a self-administered online neuropsychological test battery: the Amsterdam Cognition Scan (ACS). The current studies evaluate in a group of adult cancer patients: test-retest reliability of the ACS and the influence of test setting (home or hospital), and the relationship between our online and a traditional test battery (concurrent validity). Test-retest reliability was studied in 96 cancer patients (57 female; M age = 51.8 years) who completed the ACS twice. Intraclass correlation coefficients (ICCs) were used to assess consistency over time. The test setting was counterbalanced between home and hospital; influence on test performance was assessed by repeated measures analyses of variance. Concurrent validity was studied in 201 cancer patients (112 female; M age = 53.5 years) who completed both the online and an equivalent traditional neuropsychological test battery. Spearman or Pearson correlations were used to assess consistency between online and traditional tests. ICCs of the online tests ranged from .29 to .76, with an ICC of .78 for the ACS total score. These correlations are generally comparable with the test-retest correlations of the traditional tests as reported in the literature. Correlating online and traditional test scores, we observed medium to large concurrent validity (r/ρ = .42 to .70; total score r = .78), except for a visuospatial memory test (ρ = .36). Correlations were affected, as expected, by design differences between online tests and their offline counterparts. Although development and optimization of the ACS is an ongoing process, and reliability can be optimized for several tests, our results indicate that it is a highly usable tool to obtain (online) measures of various cognitive abilities. The ACS is expected to facilitate efficient gathering of data on cognitive functioning in the near future.
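The abstract does not say which form of the intraclass correlation was used. As a hedged illustration, the sketch below computes two common single-measure forms from a two-way layout (subjects by sessions): a consistency form, often written ICC(3,1), and an absolute-agreement form, often written ICC(2,1). The function name is hypothetical.

```python
def icc_single(rows):
    """Single-measure ICCs from a subjects-by-sessions table.

    Returns (consistency, agreement). Which of these, if either,
    matches the study's analysis is an assumption.
    """
    n, k = len(rows), len(rows[0])
    grand = sum(sum(r) for r in rows) / (n * k)
    row_means = [sum(r) / k for r in rows]
    col_means = [sum(r[j] for r in rows) / n for j in range(k)]

    # Mean squares of the two-way ANOVA without replication.
    ms_rows = k * sum((m - grand) ** 2 for m in row_means) / (n - 1)
    ms_cols = n * sum((m - grand) ** 2 for m in col_means) / (k - 1)
    ss_err = sum(
        (rows[i][j] - row_means[i] - col_means[j] + grand) ** 2
        for i in range(n) for j in range(k)
    )
    ms_err = ss_err / ((n - 1) * (k - 1))

    consistency = (ms_rows - ms_err) / (ms_rows + (k - 1) * ms_err)
    agreement = (ms_rows - ms_err) / (
        ms_rows + (k - 1) * ms_err + k * (ms_cols - ms_err) / n
    )
    return consistency, agreement
```

The two forms differ when scores shift systematically between sessions, for example through practice effects: the consistency form ignores such a shift, while the agreement form penalises it.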
Patients' expectations of orthodontic treatment: part 1 - development of a questionnaire.
Sayers, M S; Newton, J T
2006-12-01
The development of a questionnaire to measure patients' and their parents' expectations before orthodontic treatment, and to test the reliability and validity of this measure. A two-stage methodology, with open-ended interviews to identify themes and concepts followed by development and testing of the questionnaire. GKT Orthodontic Department, King's College Dental Hospital. The sample consisted of 140 participants, 70 patients aged 12-14 years, who had been referred to the orthodontic department for treatment. One parent of each patient was also recruited. The study was in two phases. In the first phase 30 participants (15 new patients and their 15 parents) participated in open-ended interviews, which were analysed qualitatively. Information from these interviews was used to construct a questionnaire. During the second phase, the questionnaire was piloted on 10 participants, five new consecutive patients and their parents. The questionnaire was then distributed to 174 subjects (87 new patients and their 87 parents). Seventy-eight subjects (39 new patients and their 39 parents) completed the questionnaire before their orthodontic consultation. Another 96 subjects (48 new patients and their 48 parents) were invited to complete the questionnaire prior to and at their orthodontic consultation. Test-retest analysis was conducted on 22 participants (11 patients and their 11 parents), who completed the questionnaire prior to and at their orthodontic consultation, and contributed to the psychometric validation of this questionnaire. A questionnaire was devised using the key themes and concepts identified in the open-ended interviews. As a result, 10 questions, some with sub-questions, were constructed using a visual analogue scale as the response format. The questionnaire developed had good face validity.
Internal consistency of the questionnaire using Cronbach's alpha, produced an overall inter-item reliability > 0.7 along with item-total correlations > 0.3 in over 50% of questions. Test-retest reliability was statistically significant using Spearman's correlation. This study provides a valid and reliable measure of orthodontic expectations in participants aged 12-14 years and their parents.
Development and psychometric testing of the active aging scale for Thai adults.
Thanakwang, Kattika; Isaramalai, Sang-Arun; Hatthakit, Urai
2014-01-01
Active aging is central to enhancing the quality of life for older adults, but its conceptualization is not often made explicit for Asian elderly people. Little is known about active aging in older Thai adults, and there has been no development of scales to measure the expression of active aging attributes. The aim of this study was to develop a culturally relevant composite scale of active aging for Thai adults (AAS-Thai) and to evaluate its reliability and validity. Eight steps of scale development were followed: 1) using focus groups and in-depth interviews, 2) gathering input from existing studies, 3) developing preliminary quantitative measures, 4) reviewing for content validity by an expert panel, 5) conducting cognitive interviews, 6) pilot testing, 7) performing a nationwide survey, and 8) testing psychometric properties. In a nationwide survey, 500 subjects were randomly recruited using a stratified sampling technique. Statistical analyses included exploratory factor analysis, item analysis, and measures of internal consistency, concurrent validity, and test-retest reliability. Principal component factor analysis with varimax rotation resulted in a final 36-item scale consisting of seven factors of active aging: 1) being self-reliant, 2) being actively engaged with society, 3) developing spiritual wisdom, 4) building up financial security, 5) maintaining a healthy lifestyle, 6) engaging in active learning, and 7) strengthening family ties to ensure care in later life. These factors explained 69% of the total variance. Cronbach's alpha coefficient for the overall AAS-Thai was 0.95 and varied between 0.81 and 0.91 for the seven subscales. Concurrent validity and test-retest reliability were confirmed. The AAS-Thai demonstrated acceptable overall validity and reliability for measuring the multidimensional attributes of active aging in a Thai context.
This newly developed instrument is ready for use as a screening tool to assess active aging levels among older Thai adults in both community and clinical practice settings.
Caçola, Priscila M; Gabbard, Carl; Montebelo, Maria I L; Santos, Denise C C
2015-06-01
Affordances in the home environment may play a significant role in infant motor development. The purpose of this study was to further develop and validate the Affordances in the Home Environment for Motor Development-Infant Scale (AHEMD-IS), an inventory that measures the quantity and quality of motor affordances in the home. A cross-sectional study was conducted to evaluate criteria for content validity, reliability, internal consistency, floor and ceiling effects, and interpretability of the instrument. A pilot version of the inventory with 5 dimensions was used for expert panel analysis and administered to parents of infants (N=419). Data were analyzed with Cronbach alpha, intraclass correlation coefficients (ICCs), ceiling and floor effects, and item and dimension interpretability analyses for creation of a scoring system with descriptive categories for each dimension and total score. Average agreement among the expert panel was 95% across all evaluation criteria. Cronbach alpha values with the 41-item scale ranged between .639 and .824 for the separate dimensions, with a total value of .824 (95% confidence interval [95% CI]=.781, .862). The ICC values were .990 for interrater reliability and .949 for intrarater reliability. There was a ceiling effect on 3 questions for the Inside Space dimension and on 3 questions for the Variety of Stimulation dimension. These results demonstrated the need for reduction in total items (from 41 to 35) and the combination of space dimensions. After removal of questions, internal consistency was .766 (95% CI=.729, .800) for total score. Overall assessment categories were created as: less than adequate, moderately adequate, adequate, and excellent. The inventory does not determine specific use (time, frequency) of affordances in the home, and it does not account for infants' out-of-home activities. The AHEMD-IS is a reliable and valid instrument to assess affordances in the home environment that promote infant motor development. 
© 2015 American Physical Therapy Association.
Jaipuria, Jiten; Suryavanshi, Manav; Sen, Tridib K
2016-12-01
To assess the reliability of the Guy's Stone Score, the Seoul National University Renal Stone Complexity (S-ReSC) score and the S.T.O.N.E. scores in percutaneous nephrolithotomy (PCNL), and assess their utility in discriminating outcomes [stone free rate (SFR), complications, need for multiple PCNL sessions, and auxiliary procedures] valid across parameters of experience of surgeon, independence from surgical approach, and variations in institution-specific instrumentation. A prospectively maintained database of two tertiary institutions was analysed (606 cases). Institutes differed in instrumentation, while the overall surgical team comprised: two trainees (experience <100 cases), two junior consultants (experience 100-200 cases), and two senior surgeons (experience >1000 cases). Scores were assigned and re-assigned after 4 months by one trainee and an expert surgeon. Inter-rater and test-retest agreement were analysed by Cohen's κ and intraclass correlation coefficient. Multivariate logistic regression models were created adjusting outcomes for the institution, comorbidity, Amplatz size, access tract location, the number of punctures, the experience level of the surgeon, and individual scoring system, and receiver operating curves were analysed for comparison. Despite some areas of inconsistencies, individually all scores had excellent inter-rater and test-retest concordance. On multivariable analyses, while the experience of the surgeon and surgical approach characteristics (such as access tract location, Amplatz size, and number of punctures) remained independently associated with different outcomes in varying combinations, calculus complexity scores were found consistently to be independently associated with all outcomes. The S-ReSC score had a superior association with SFR, the need for multiple PCNL sessions, and auxiliary procedures. Individually all scoring systems performed well. 
On cross comparison, the S-ReSC score consistently emerged as more strongly associated with all outcomes, signifying the importance of the distributional complexity of the calculus (which also indirectly amalgamates the influence of stone number, size, and anatomical location) in discriminating outcomes. Our study proves the utility of scoring systems in prognosticating multiple outcomes and also clarifies important aspects of their practical application including future roles such as benchmarking, audit, training, and objective assessment of surgical technique modifications. © 2016 The Authors BJU International © 2016 BJU International Published by John Wiley & Sons Ltd.
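Inter-rater agreement of the kind reported in this abstract is often summarised with Cohen's κ, which corrects the observed agreement for the agreement expected by chance. The sketch below shows the unweighted form for two raters. Whether the study used a weighted or unweighted κ is not stated, so the unweighted choice is an assumption; the function name is hypothetical.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa for two raters scoring the same cases."""
    n = len(rater_a)
    observed = sum(x == y for x, y in zip(rater_a, rater_b)) / n

    # Chance agreement from each rater's marginal category frequencies.
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / n ** 2

    if expected == 1:
        # Both raters used one identical category throughout; kappa is undefined.
        return float("nan")
    return (observed - expected) / (1 - expected)
```

For ordinal scores such as stone-complexity grades, a weighted κ that gives partial credit to near-misses is a common alternative to the unweighted form shown here.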
Lyon, Aaron R; Pullmann, Michael D; Dorsey, Shannon; Martin, Prerna; Grigore, Alexandra A; Becker, Emily M; Jensen-Doss, Amanda
2018-05-11
Measurement-based care (MBC) is an increasingly popular, evidence-based practice, but there are no tools with established psychometrics to evaluate clinician use of MBC practices in mental health service delivery. The current study evaluated the reliability, validity, and factor structure of scores generated from a brief, standardized tool to measure MBC practices, the Current Assessment Practice Evaluation-Revised (CAPER). Survey data from a national sample of 479 mental health clinicians were used to conduct exploratory and confirmatory factor analyses, as well as reliability and validity analyses (e.g., relationships between CAPER subscales and clinician MBC attitudes). Analyses revealed competing two- and three-factor models. Regardless of the model used, scores from CAPER subscales demonstrated good reliability and convergent and divergent validity with MBC attitudes in the expected directions. The CAPER appears to be a psychometrically sound tool for assessing clinician MBC practices. Future directions for development and application of the tool are discussed.
ERIC Educational Resources Information Center
Keller, Lisa A.; Clauser, Brian E.; Swanson, David B.
2010-01-01
In recent years, demand for performance assessments has continued to grow. However, performance assessments are notorious for lower reliability, and in particular, low reliability resulting from task specificity. Since reliability analyses typically treat the performance tasks as randomly sampled from an infinite universe of tasks, these estimates…
7 CFR 1788.2 - General insurance requirements.
Code of Federal Regulations, 2010 CFR
2010-01-01
... consistent with cost-effectiveness, reliability, safety, and expedition. It is recognized that Prudent... accomplish the desired result at the lowest reasonable cost consistent with cost-effectiveness, reliability... which is used or useful in the borrower's business and which shall be covered by insurance, unless each...
Fossils and living taxa agree on patterns of body mass evolution: a case study with Afrotheria.
Puttick, Mark N; Thomas, Gavin H
2015-12-22
Most of life is extinct, so incorporating some fossil evidence into analyses of macroevolution is typically seen as necessary to understand the diversification of life and patterns of morphological evolution. Here we test the effects of inclusion of fossils in a study of the body size evolution of afrotherian mammals, a clade that includes the elephants, sea cows and elephant shrews. We find that the inclusion of fossil tips has little impact on analyses of body mass evolution; from a small ancestral size (approx. 100 g), there is a shift in rate and an increase in mass leading to the larger-bodied Paenungulata and Tubulidentata, regardless of whether fossils are included or excluded from analyses. For Afrotheria, the inclusion of fossils and morphological character data affect phylogenetic topology, but these differences have little impact upon patterns of body mass evolution and these body mass evolutionary patterns are consistent with the fossil record. The largest differences between our analyses result from the evolutionary model, not the addition of fossils. For some clades, extant-only analyses may be reliable to reconstruct body mass evolution, but the addition of fossils and careful model selection is likely to increase confidence and accuracy of reconstructed macroevolutionary patterns. © 2015 The Authors. PMID:26674947
The psychometric testing of the diabetes health promotion self-care scale.
Wang, Ruey-Hsia; Lin, Li-Ying; Cheng, Chung-Ping; Hsu, Min-Tao; Kao, Chia-Chan
2012-06-01
Health-promoting behavior is an important strategy to maintain and enhance the health of patients with Type 2 diabetes. Few instruments have been developed to measure the health promotion self-care behavior of patients with Type 2 diabetes. This study developed and psychometrically tested the Chinese version of the Diabetes Health Promotion Self-Care Scale (DHPSC) for patients with Type 2 diabetes. Four hundred and eighty-nine patients with Type 2 diabetes were recruited from endocrine clinics in four hospitals in Kaohsiung City in southern Taiwan. Exploratory and confirmatory factor analyses were used to assess the construct validity of the scale. Correlations between the DHPSC and the satisfaction subscale of Diabetes Quality of Life, Diabetes Empowerment Scale, and HbA1c were calculated to evaluate concurrent validity. Internal consistency and test-retest reliability were used to assess the reliability of the scale. The study was conducted in 2007 and 2008. A proposed second-order factor model with seven subscales and 26 items fit the data well. The seven subscales were interpersonal relationships, diet, blood glucose self-monitoring, personal health responsibility, exercise, adherence to the recommended regimens, and foot care. The DHPSC correlated statistically significantly with the satisfaction subscale of Diabetes Quality of Life and the Diabetes Empowerment Scale. HbA1c correlated statistically significantly only with the personal health responsibility subscale. Reliability was supported by acceptable Cronbach's alpha (range, .78-.94) and test-retest reliability (range, .76-.95). The DHPSC has satisfactory reliability and validity. Healthcare providers can use the DHPSC to comprehensively assess the health promotion self-care behaviors of patients with Type 2 diabetes.
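Many records in this listing report internal consistency as Cronbach's alpha. As a reading aid, the following is a minimal sketch of the standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The function name `cronbach_alpha` and the toy responses are illustrative assumptions and are not taken from any of the studies listed here.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items table.

    scores: list of rows, one per respondent, each row holding k item scores.
    Uses sample variances (n - 1 denominator) throughout.
    """
    k = len(scores[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Variance of each item (column) across respondents.
    item_variances = [variance(column) for column in zip(*scores)]
    # Variance of each respondent's total score.
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical data: three items that move together perfectly.
consistent = [[1, 1, 1], [2, 2, 2], [3, 3, 3]]
# Hypothetical data: two items that are uncorrelated.
unrelated = [[1, 1], [2, 1], [1, 2], [2, 2]]

print(cronbach_alpha(consistent))
print(cronbach_alpha(unrelated))
```

In this sketch, items that rise and fall together push alpha toward 1, while unrelated items push it toward 0, which is why alpha is read as a measure of how consistently a set of items measures one construct.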
McKay, J; Murphy, D J; Bowie, P; Schmuck, M-L; Lough, M; Eva, K W
2007-04-01
To establish the content validity and specific aspects of reliability for an assessment instrument designed to provide formative feedback to general practitioners (GPs) on the quality of their written analysis of a significant event. Content validity was quantified by application of a content validity index. Reliability testing involved a nested design, with 5 cells, each containing 4 assessors, rating 20 unique significant event analysis (SEA) reports (10 each from experienced GPs and GPs in training) using the assessment instrument. The variance attributable to each identified variable in the study was established by analysis of variance. Generalisability theory was then used to investigate the instrument's ability to discriminate among SEA reports. Content validity was demonstrated with at least 8 of 10 experts endorsing all 10 items of the assessment instrument. The overall G coefficient for the instrument was moderate to good (G>0.70), indicating that the instrument can provide consistent information on the standard achieved by the SEA report. There was moderate inter-rater reliability (G>0.60) when four raters were used to judge the quality of the SEA. This study provides the first steps towards validating an instrument that can provide educational feedback to GPs on their analysis of significant events. The key area identified to improve instrument reliability is variation among peer assessors in their assessment of SEA reports. Further validity and reliability testing should be carried out to provide GPs, their appraisers and contractual bodies with a validated feedback instrument on this aspect of the general practice quality agenda.
Lee, Miyoung; Zhu, Weimo; Ackley-Holbrook, Elizabeth; Brower, Diana G; McMurray, Bryan
2014-07-01
It is critical to employ accurate measures when assessing physical activity (PA) barriers in any subpopulation, yet existing measures are not appropriate for persons with blindness or visual impairment (PBVI) due to a lack of validity or reliability evidence. This study aimed to develop and calibrate a PA barrier scale for PBVI. An expert panel (n = 3) and 18 PBVI were recruited to establish content validity for a PA barriers subscale; 160 PBVI (96 females) completed the scale along with the Physical Activity Scale for Individuals with Physical Disabilities for calibration. To establish construct-related validity evidence, confirmatory factor analysis (CFA) and Rasch analysis were applied. To investigate internal consistency and reliability, Cronbach's alpha and the reliability coefficient (R) were employed, respectively. Following CFA and Rasch analyses, five items were eliminated due to misfits; reliability coefficients were unchanged upon deletion of these items. The barriers perceived by PBVI to have the most negative impact on PA included "lack of self-discipline" (logit = 1.40) and "lack of motivation" (logit = 1.27). "Too many stairs in the exercise facility" (logit = -1.49) was perceived to have the least impact. The newly developed scale was found to be a valid and reliable tool for evaluating PA barriers in PBVI. To enhance promotion of health-producing levels of PA in PBVI, practitioners should consider applying this new tool as a precursor to programs aimed at improving PA participation in this group. Copyright © 2014 Elsevier Inc. All rights reserved.
Reliability of measures of transient evoked otoacoustic emissions with contralateral suppression.
Stuart, Andrew; Cobb, Kensi M
2015-01-01
The reliability of measures of transient evoked otoacoustic emissions (TEOAEs) with contralateral suppression was examined. The effect of test session (i.e., initial test; retest without probe removal; retest with probe removal; and retest 1-2 days post initial test), gender, and ear was examined in 14 young adult females and 14 young adult males. TEOAEs were obtained bilaterally with 60 dB peSPL linear click stimuli with and without a contralateral 65 dB SPL broadband noise suppressor. Absolute TEOAE suppression and a normalized index of TEOAE suppression (i.e., percentage of suppression) were examined. Reliability of these measures was assessed with repeated measures linear mixed model analysis of variance, a coefficient of reliability, and Bland-Altman analyses. There were no statistically significant (p>0.05) main effects of test, gender, or ear, or interactions, for either absolute dB or % TEOAE suppression values. Cronbach's α values were greater than 0.90 across the four tests for both TEOAE measures. Mean test differences or bias (i.e., between the initial and subsequent tests) for absolute and % TEOAE suppression ranged from -0.05 to 0.11 dB and -1.5% to 1.1%, respectively. There was no proportional/systematic bias with the mean differences of the first and subsequent measurements. Data herein were consistent with the view that bilateral TEOAE suppression measures are reliable across test sessions of 1-2 days among females and males and may provide a method to monitor medial olivocochlear efferent reflex status over time. Copyright © 2015 Elsevier Inc. All rights reserved.
A pilot study examining density of suppression measurement in strabismus.
Piano, Marianne; Newsham, David
2015-01-01
Establish whether the Sbisa bar, Bagolini filter (BF) bar, and neutral density filter (NDF) bar, used to measure density of suppression, are equivalent and possess test-retest reliability. Determine whether density of suppression is altered when measurement equipment/testing conditions are changed. Our pilot study had 10 subjects aged ≥18 years with childhood-onset strabismus, no ocular pathologies, and no binocular vision when manifest. Density of suppression upon repeated testing, with clinic lights on/off, and using a full/reduced intensity light source, was investigated. Results were analysed for test-retest reliability, equivalence, and changes with alteration of testing conditions. Test-retest reliability issues were present for the BF bar (median 6 filter change from first to final test, p = 0.021) and NDF bar (median 5 filter change from first to final test, p = 0.002). Density of suppression was unaffected by environmental illumination or fixation light intensity variations. Density of suppression measurements were higher when measured with the NDF bar (e.g. NDF bar = 1.5, medium suppression, vs BF bar = 6.5, light suppression). Test-retest reliability issues may be present for the two filter bars currently still under manufacture. Changes in testing conditions do not significantly affect test results, provided the same filter bar is used consistently for testing. Further studies in children with strabismus having active amblyopia treatment would be of benefit. Despite extensive use of these tests in the UK, this is to our knowledge the first study evaluating filter bar equivalence/reliability.
Assessing the psychometric properties of two food addiction scales.
Lemeshow, Adina R; Gearhardt, Ashley N; Genkinger, Jeanine M; Corbin, William R
2016-12-01
While food addiction is well accepted in popular culture and mainstream media, its scientific validity as an addictive behavior is still under investigation. This study evaluated the reliability and validity of the Yale Food Addiction Scale and Modified Yale Food Addiction Scale using data from two community-based convenience samples. We assessed the internal and test-retest reliability of the Yale Food Addiction Scale and Modified Yale Food Addiction Scale, and estimated the sensitivity and negative predictive value of the Modified Yale Food Addiction Scale using the Yale Food Addiction Scale as the benchmark. We calculated Cronbach's alphas and 95% confidence intervals (CIs) for internal reliability and Cohen's Kappa coefficients and 95% CIs for test-retest reliability. Internal consistency (n=232) was marginal to good, ranging from α=0.63 to 0.84. The test-retest reliability (n=45) for food addiction diagnosis was substantial, with Kappa=0.73 (95% CI, 0.48-0.88) (Yale Food Addiction Scale) and 0.79 (95% CI, 0.66-1.00) (Modified Yale Food Addiction Scale). Sensitivity and negative predictive value for classifying food addiction status were excellent: compared to the Yale Food Addiction Scale, the Modified Yale Food Addiction Scale's sensitivity was 92.3% (95% CI, 64%-99.8%), and the negative predictive value was 99.5% (95% CI, 97.5%-100%). Our analyses suggest that the Modified Yale Food Addiction Scale may be an appropriate substitute for the Yale Food Addiction Scale when a brief measure is needed, and support the continued use of both scales to investigate food addiction. Copyright © 2016 Elsevier Ltd. All rights reserved.
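Sensitivity and negative predictive value, as reported in the record above, are both read off a 2x2 table that compares a brief measure against a benchmark. The sketch below shows the arithmetic only; the function name `screening_metrics` and the counts are hypothetical and are not the study's data.

```python
def screening_metrics(tp, fp, fn, tn):
    """Diagnostic accuracy measures from a 2x2 table.

    tp: benchmark positive, test positive
    fp: benchmark negative, test positive
    fn: benchmark positive, test negative
    tn: benchmark negative, test negative
    """
    return {
        # Share of benchmark-positive cases the test detects.
        "sensitivity": tp / (tp + fn),
        # Share of benchmark-negative cases the test clears.
        "specificity": tn / (tn + fp),
        # Share of test-positive cases that are truly positive.
        "ppv": tp / (tp + fp),
        # Share of test-negative cases that are truly negative.
        "npv": tn / (tn + fn),
    }


# Hypothetical counts, chosen only to illustrate the calculation.
metrics = screening_metrics(tp=45, fp=10, fn=5, tn=140)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Note that predictive values depend on how common the condition is in the sample, so a high negative predictive value in a low-prevalence community sample does not by itself carry over to a clinical population.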
Hanskamp-Sebregts, Mirelle; Zegers, Marieke; Vincent, Charles; van Gurp, Petra J; de Vet, Henrica C W; Wollersheim, Hub
2016-01-01
Objectives Record review is the most used method to quantify patient safety. We systematically reviewed the reliability and validity of adverse event detection with record review. Design A systematic review of the literature. Methods We searched PubMed, EMBASE, CINAHL, PsycINFO and the Cochrane Library from their inception through February 2015. We included all studies that aimed to describe the reliability and/or validity of record review. Two reviewers conducted data extraction. We pooled κ values (κ) and analysed the differences in subgroups according to number of reviewers, reviewer experience and training level, adjusted for the prevalence of adverse events. Results In 25 studies, the psychometric data of the Global Trigger Tool (GTT) and the Harvard Medical Practice Study (HMPS) were reported and 24 studies were included for statistical pooling. The inter-rater reliability of the GTT and HMPS showed a pooled κ of 0.65 and 0.55, respectively. The inter-rater agreement was statistically significantly higher when the group of reviewers within a study consisted of a maximum of five reviewers. We found no studies reporting on the validity of the GTT and HMPS. Conclusions The reliability of record review is moderate to substantial and improved when a small group of reviewers carried out record review. The validity of the record review method has never been evaluated, while clinical data registries, autopsy or direct observations of patient care are potential reference methods that can be used to test concurrent validity. PMID:27550650
Ausserhofer, Dietmar; Anderson, Ruth A; Colón-Emeric, Cathleen; Schwendimann, René
2013-08-01
The Safety Organizing Scale is a valid and reliable measure of safety behaviors and practices in hospitals. This study aimed to explore the psychometric properties of the Safety Organizing Scale-Nursing Home version (SOS-NH). In a cross-sectional analysis of staff survey data, we examined validity and reliability of the 9-item SOS-NH using American Educational Research Association guidelines. This substudy of a larger trial used baseline survey data collected from staff members (n = 627) in a variety of work roles in 13 nursing homes (NHs) in North Carolina and Virginia. Psychometric evaluation of the SOS-NH revealed good response patterns with a low average of missing values across all items (3.05%). Analyses of the SOS-NH's internal structure (eg, comparative fit indices = 0.929, standardized root mean square error of approximation = 0.045) and consistency (composite reliability = 0.94) suggested its 1-dimensionality. Significant between-facility variability, intraclass correlations, within-group agreement, and design effect confirmed appropriateness of the SOS-NH for measurement at the NH level, justifying data aggregation. The SOS-NH showed discriminant validity from one related concept: communication openness. Initial evidence regarding validity and reliability of the SOS-NH supports its utility in measuring safety behaviors and practices among a wide range of NH staff members, including those with low literacy. Further psychometric evaluation should focus on testing concurrent and criterion validity, using resident outcome measures (eg, patient fall rates). Copyright © 2013 American Medical Directors Association, Inc. All rights reserved.
[Validity and reliability of a scale to assess self-efficacy for physical activity in elderly].
Borges, Rossana Arruda; Rech, Cassiano Ricardo; Meurer, Simone Teresinha; Benedetti, Tânia Rosane Bertoldo
2015-04-01
This study aimed to analyze the confirmatory factor validity and reliability of a self-efficacy scale for physical activity in a sample of 118 elderly (78% women) from 60 to 90 years of age. Mplus 6.1 was used to evaluate the confirmatory factor analysis. Reliability was tested by internal consistency and temporal stability. The original scale consisted of five items with dichotomous answers (yes/no), independently for walking and moderate and vigorous physical activity. The analysis excluded the item related to confidence in performing physical activities when on vacation. Two constructs were identified, called "self-efficacy for walking" and "self-efficacy for moderate and vigorous physical activity", with a factor load ≥ 0.50. Internal consistency was adequate both for walking (> 0.70) and moderate and vigorous physical activity (> 0.80), and temporal stability was adequate for all the items. In conclusion, the self-efficacy scale for physical activity showed adequate validity, reliability, and internal consistency for evaluating this construct in elderly Brazilians.
Lopes, Fernando Rocha Lucena; Monteiro, Karolinne Souza; Figueiredo, Thalita; Wanderley, Thyago da Costa; Pequeno, Thiago de Almeida; Lima, Shirley; Santos, Silvana
2017-05-02
In Brazil, community health workers have gathered monthly information on people with disabilities to maintain the Primary Care Information System since 1998; however, few studies have used this database for scientific or public health policy purposes. This study aimed to evaluate the reliability of information on people with disabilities gathered by community health workers in primary care services. This was a cross-sectional population-based study conducted in two highly consanguineous communities, involving a population of 18,458 inhabitants in Northeastern Brazil. To study the prevalence of people with disabilities, estimations performed by health workers were compared with those obtained by researchers who interviewed 15.6% of the total population. To study the agreement of the information, data on 106 people with disabilities completed independently by researchers and health workers were compared to evaluate the degree of agreement for 28 variables analysed. Kappa statistics (κ) were used to calculate the inter-rater agreement. The prevalence of disability estimated by community health workers was 3.01 and 2.00% for city A and B, respectively, while the percentages obtained by researchers were 6.72 and 5.65%, respectively, showing an underestimation of prevalence according to community health workers. The Kappa index value obtained for all data analysed (2,589 items excluding losses) was 0.808 (p < 0.01), indicating an almost perfect consistency of information collected by health workers compared to by researchers. Community health workers collected information with a high degree of reliability, although the identification of the prevalence of disabled individuals was potentially impaired due to the work process.
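The kappa statistic used above for inter-rater agreement corrects observed agreement for the agreement expected by chance: kappa = (p_o - p_e) / (1 - p_e). The following is a minimal sketch for two raters and categorical labels; the function name `cohens_kappa` and the ratings are illustrative assumptions, not data from the study.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters labelling the same items."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must label the same non-empty item set")
    n = len(rater_a)
    # Observed agreement: share of items given the same label by both raters.
    p_observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement: product of each rater's marginal label frequencies.
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    labels = set(counts_a) | set(counts_b)
    p_chance = sum(counts_a[label] * counts_b[label] for label in labels) / n**2
    return (p_observed - p_chance) / (1 - p_chance)


# Hypothetical ratings (1 = disability recorded, 0 = not recorded).
researcher = [1, 1, 0, 0]
health_worker = [1, 0, 0, 0]
print(cohens_kappa(researcher, health_worker))  # 0.5
```

Here the raters agree on 3 of 4 items (p_o = 0.75) while chance agreement is 0.5, giving kappa = 0.5. The sketch is undefined when chance agreement equals 1, for example when both raters use a single label for every item.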
Sakakibara, Brodie M.; Miller, William C.; Backman, Catherine L.
2012-01-01
Objective To explore shortened response formats for use with the Activities-specific Balance Confidence scale and then: 1) evaluate the unidimensionality of the scale; 2) evaluate the item difficulty; 3) evaluate the scale for redundancy and content gaps; and 4) evaluate the item standard error of measurement (SEM) and internal consistency reliability among aging individuals (≥50 years) with a lower-limb amputation living in the community. Design Secondary analysis of cross-sectional survey and chart review data. Setting Out-patient amputee clinics, Ontario, Canada. Participants Four hundred forty-eight community-living adults, at least 50 years old (mean = 68 years), who have used a prosthesis for at least 6 months for a major unilateral lower limb amputation. Three hundred twenty-five (72.5%) were men. Intervention N/A Main Outcome Measure(s) Activities-specific Balance Confidence Scale. Results A 5-option response format outperformed 4- and 6-option formats. Factor analyses confirmed a unidimensional scale. The distance between response options is not the same for all items on the scale, as evidenced by the Partial Credit Model (PCM) having a better fit to the data than the Rating Scale Model. Two items, however, did not fit the PCM within statistical reason. Revising the wording of the two items may resolve the misfit, improve the construct validity, and lower the SEM. Overall, the difficulty of the scale’s items is appropriate for use with aging individuals with lower-limb amputation, and the scale is most reliable (Cronbach's α = 0.94) for use with individuals with moderately low balance confidence levels. Conclusions The ABC-scale with a simplified 5-option response format is a valid and reliable measure of balance confidence for use with individuals aging with a lower limb amputation. PMID:21704978
NASA Astrophysics Data System (ADS)
Welch, Anita G.; Cakir, Mustafa; Peterson, Claudette M.; Ray, Chris M.
2012-04-01
Background Studies exploring the relationship between students' achievement and the quality of the classroom learning environments have shown that there is a strong relationship between these two concepts. Learning environment instruments are constantly being revised and updated, including for use in different cultures, which requires continued validation efforts. Purpose The purpose of this study was to establish cross-cultural reliability and validity of the Technology-Rich Outcomes-Focused Learning Environment Inventory (TROFLEI) in both Turkey and the USA. Sample Approximately 980 students attending grades 9-12 in Turkey and 130 students attending grades 9-12 in the USA participated in the study. Design and method Scale reliability analyses and confirmatory factor analysis (CFA) were performed separately for Turkish and US participants for both actual and preferred responses to each scale to confirm the structure of the TROFLEI across these two distinct samples. Results Cronbach's alpha reliability coefficients, ranging from α = 0.820 to 0.931 for Turkish participants and from α = 0.778 to 0.939 for US participants, indicated that all scales have satisfactory internal consistency for both samples. Confirmatory factor analyses resulted in evidence of adequate model fit across both samples for both actual and preferred responses, with the root mean square error of approximation ranging from 0.052 to 0.057 and the comparative fit index ranging from 0.920 to 0.982. Conclusions This study provides initial evidence that the TROFLEI is valid for use in both the Turkish and US high-school populations (grades 9-12). However, the psychometric properties should be examined further with different populations, such as middle-school students (grades 6-8).
Stoll, C; Kapfhammer, H P; Rothenhäusler, H B; Haller, M; Briegel, J; Schmidt, M; Krauseneck, T; Durst, K; Schelling, G
1999-07-01
Many survivors of critical illness and intensive care unit (ICU) treatment have traumatic memories such as nightmares, panic or pain which can be associated with the development of posttraumatic stress disorder (PTSD). In order to simplify the rapid and early detection of PTSD in such patients, we modified an existing questionnaire for diagnosis of PTSD and validated the instrument in a cohort of ARDS patients after long-term ICU therapy. Follow-up cohort study. The 20-bed ICU of a university teaching hospital. A cohort of 52 long-term survivors of the acute respiratory distress syndrome (ARDS). The questionnaire was administered to the study cohort at two time points 2 years apart. At the second evaluation, the patients underwent a structured interview with two trained psychiatrists to diagnose PTSD according to Diagnostic and Statistical Manual of Mental Disorders, 4th edition (DSM-IV) criteria. The reliability and validity of the questionnaire were then estimated and its specificity, sensitivity and optimal decision threshold determined using receiver operating characteristic (ROC) curve analyses. The questionnaire showed a high internal consistency (Cronbach's alpha = 0.93) and a high test-retest reliability (intraclass correlation coefficient alpha = 0.89). There was evidence of construct validity by a linear relationship between scores and the number of traumatic memories from the ICU the patients described (Spearman's rho = 0.48, p < 0.01). Criterion validity was demonstrated by ROC curve analyses resulting in a sensitivity of 77.0% and a specificity of 97.5% for the diagnosis of PTSD. The questionnaire was found to be a responsive, valid and reliable instrument to screen survivors of intensive care for PTSD.
Evaluation of a respiratory symptom diary for clinical studies of idiopathic pulmonary fibrosis.
Bacci, Elizabeth Dansie; O'Quinn, Sean; Leidy, Nancy Kline; Murray, Lindsey; Vernon, Margaret
2018-01-01
There are no validated patient diaries for evaluating respiratory symptoms in idiopathic pulmonary fibrosis (IPF). To evaluate the performance properties of the chronic obstructive pulmonary disease (COPD) Evaluating Respiratory Symptoms™ (E-RS™: COPD) measure in patients with IPF. Concept elicitation and cognitive interviews were conducted with IPF patients to evaluate content validity, including comprehensiveness, relevance, and interpretability of E-RS™ items in this patient population. Secondary analyses of IPF clinical study data were performed to evaluate the scoring structure of the tool. With modifications, reliability, validity, and responsiveness of the instrument (E-RS™: IPF) were evaluated. Qualitative interviews (n = 30) were conducted. During the elicitation interviews (n = 20), concept saturation for IPF respiratory symptoms was achieved; all respiratory symptoms covered by the E-RS™ were endorsed by ≥ 30% of the sample. During cognitive interviews (n = 10), all participants found the items interpretable and relevant. Factor analyses conducted via secondary analysis of IPF clinical study data identified no total score and four symptom scales: Chest, Breathlessness, Cough, and Sputum. Reliability of each scale was high (internal consistency [α] >0.85; 2-day reproducibility [ICC] >0.88). Validity was supported through significant (P < 0.0001) relationships with the St. George's Respiratory Questionnaire (SGRQ), the University of California, San Diego Shortness of Breath Questionnaire (UCSD-SOBQ), and other variables. The scales were responsive to change when evaluated using SGRQ Symptoms, UCSD-SOBQ, and Patient Global Impression of Change as anchors (P < 0.01 to P < 0.0001). The E-RS™: IPF is a valid, reliable, and responsive tool for evaluating respiratory symptoms in patients with IPF. Copyright © 2017 The Authors. Published by Elsevier Ltd. All rights reserved.
Kong, X-b; Guan, H-t; Li, H-g; Zhou, Y; Xiong, C-l
2014-11-01
In this study, the ageing males' symptoms (AMS) scale was translated into Chinese following methodological recommendations for linguistic and cultural adaptation. To confirm the reliability, validity and applicability of the simplified Chinese version of the scale (CN-AMS) in older Chinese men, a free health screening for men older than 40 years was conducted. All participants completed a health questionnaire, which consisted of personal health information, the AMS scale, the generic quality of life (QoL) instrument SF36 and the Beck Depression Inventory (BDI). The fasting blood samples of participants were collected on the day of completing the health questionnaire. Serum total testosterone (TT), albumin and sex hormone-binding globulin levels were measured and the level of free testosterone was calculated (calculated free testosterone, CFT). A total of 244 men (mean age: 52 ± 7.3 years, range: 40-79 years) were involved in the investigation and provided informed consent before their participation. The reliability of the CN-AMS was analysed as internal consistency reliability (Cronbach's alpha was 0.91) as well as 4-week-interval test-retest stability (Pearson's correlation was 0.83) and found to be good. The validity of the CN-AMS was analysed through internal structure analysis (Pearson's correlation between total score and each item score r = 0.48-0.75), total-domain correlation (among the three domains r = 0.47-0.68, p < 0.01; domains with the total score r = 0.81-0.88, p < 0.01), and cross-validation with other scales (with SF36 r = -0.59, p < 0.01; with BDI r = 0.50, p < 0.01). Androgen deficiency (AD) was defined as the presence of three sexual symptoms (decreased frequency of morning erections, sexual thoughts and erectile dysfunction) in combination with TT < 11 nmol/L and CFT < 220 pmol/L, and the sensitivity and specificity of the CN-AMS were 68.8% and 6.8%, respectively. The CN-AMS had sufficient sensitivity in screening for AD in older men, but its low specificity made it unsuitable for adoption as a diagnostic criterion. The screening capability of the AMS scale for AD shows a downward trend with ageing, and a hypothesis is proposed to give a possible reason for this new finding. © 2014 American Society of Andrology and European Academy of Andrology.
Cross-cultural validity of a dietary questionnaire for studies of dental caries risk in Japanese
2014-01-01
Background Diet is a major modifiable contributing factor in the etiology of dental caries. The purpose of this paper is to examine the reliability and cross-cultural validity of the Japanese version of the Food Frequency Questionnaire to assess dietary intake in relation to dental caries risk in Japanese. Methods The 38-item Food Frequency Questionnaire, to which Japanese food items were added to increase content validity, was translated into Japanese and administered to two samples. The first sample comprised 355 pregnant women with a mean age of 29.2 ± 4.2 years and was used for the internal consistency and criterion validity analyses. Factor analysis (principal components with Varimax rotation) was used to determine dimensionality. The dietary cariogenicity score was calculated from the Food Frequency Questionnaire and used for the analyses. Salivary mutans streptococci level, measured by Dentocult SM, was used as a semi-quantitative assessment of dental caries risk. To examine criterion validity, Dentocult SM scores were compared with the dietary cariogenicity score computed from the Food Frequency Questionnaire using Spearman’s correlation coefficient (rs) and the Kruskal-Wallis test. Test-retest reliability of the Food Frequency Questionnaire was assessed in a second sample of 25 adults with a mean age of 34.0 ± 3.0 years using the intraclass correlation coefficient (ICC). Results The Japanese language version of the Food Frequency Questionnaire showed high test-retest reliability (ICC = 0.70) and good criterion validity as assessed by its relationship with salivary mutans streptococci levels (rs = 0.22; p < 0.001). Factor analysis revealed four subscales that make up the questionnaire (solid sugars, solid and starchy sugars, liquid and semisolid sugars, sticky and slowly dissolving sugars). Internal consistency was low to acceptable (Cronbach’s alpha = 0.67 for the total scale, 0.46-0.61 for each subscale).
Mean dietary cariogenicity scores were 50.8 ± 19.5 in the first sample, and 47.4 ± 14.1 and 40.6 ± 11.3 for the first and second administrations, respectively, in the second sample. The distribution of Dentocult SM score was 6.8% (score = 0), 34.4% (score = 1), 39.4% (score = 2), and 19.4% (score = 3). Participants with higher scores were more likely to have higher dietary cariogenicity scores (p < 0.001; Kruskal-Wallis test). Conclusions These results provide preliminary evidence for the reliability and validity of the Japanese language Food Frequency Questionnaire. PMID:24383547
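For readers unfamiliar with the internal-consistency statistic quoted in this record, the following is a minimal sketch of how Cronbach's alpha is commonly computed from a respondents-by-items score matrix. The function name `cronbach_alpha` and the toy scores are hypothetical illustrations, not data or code from the study.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total scores))
    """
    k = len(scores[0])
    # Sample variance of each item (column) across respondents.
    item_variances = [variance(column) for column in zip(*scores)]
    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical toy data: 5 respondents x 3 items (not from the study).
toy_scores = [[2, 3, 3], [4, 4, 5], [1, 2, 1], [3, 3, 4], [5, 4, 5]]
alpha = cronbach_alpha(toy_scores)
print(round(alpha, 3))
```

Alpha rises when items covary strongly relative to their individual variances, which is one reason short subscales such as those described above often show lower values than the total scale.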
Test-retest reliability of the eating disorder examination-questionnaire (EDE-Q) in a college sample
2013-01-01
Background The Eating Disorder Examination-Questionnaire (EDE-Q), a widely used self-report instrument, is often used for measuring change in eating disorder symptoms over the course of treatment. However, limited data exist about test-retest reliability, particularly for men. The current study evaluated EDE-Q 7-day test-retest reliability in male (n = 47) and female (n = 44) undergraduate students together and separately by gender. Results Internal consistency was consistently higher for women and at Time 2, but remained acceptable for both men and women at both time points. Cronbach’s α ranged from .75 (Restraint at Time 1) to .93 (Shape Concern at Time 2) for women and from .73 (Eating Concern at Time 2) to .89 (Shape Concern at Time 2) for men. With the exception of some of the eating disorder behaviors, test-retest reliability was fairly strong for both men and women. Test-retest reliability was highest for Shape Concern and the global EDE-Q score in both men and women (Spearman’s rho > 0.89, with the exception of Shape Concern for women, for which Spearman’s rho = .86). Test-retest reliability was lower for the eating disorder behavior measures, particularly for men, for whom Kendall’s tau-b for frequency and phi for occurrence were less than 0.70 for all but objective bulimic episodes. Conclusions Results were consistent with past research for women, indicating strong test-retest reliability in attitudinal features of eating disorders, but lower test-retest reliability in behavioral features. Internal consistency and test-retest reliability were good for the attitudinal features of eating disorders in men, but tended to be lower for men than for women. The EDE-Q appears to be a reliable instrument for assessing eating disorder attitudes in both male and female undergraduate students, but is less reliable for assessing eating disorder behaviors, particularly in men. PMID:24999420
Reliability of a structured interview for admission to an emergency medicine residency program.
Blouin, Danielle
2010-10-01
Interviews are most important in resident selection. Structured interviews are more reliable than unstructured ones. We sought to measure the interrater reliability of a newly designed structured interview during the selection process to an Emergency Medicine residency program. The critical incident technique was used to extract the desired dimensions of performance. The interview tool consisted of 7 clinical scenarios and 1 global rating. Three trained interviewers marked each candidate on all scenarios without discussing candidates' responses. Interitem consistency and estimates of variance were computed. Twenty-eight candidates were interviewed. The generalizability coefficient was 0.67. Removing the central tendency ratings increased the coefficient to 0.74. Coefficients of interitem consistency ranged from 0.64 to 0.74. The structured interview tool provided good although suboptimal interrater reliability. Increasing the number of scenarios improves reliability as does applying differential weights to the rating scale anchors. The latter would also facilitate the identification of those candidates with extreme ratings.
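The claim that adding scenarios improves reliability can be illustrated with the Spearman-Brown prophecy formula. This is a sketch under stated assumptions: the abstract reports a generalizability coefficient, and applying Spearman-Brown to it presumes that any added scenarios behave like the existing ones; the abstract does not say the author used this formula. The function names and the 0.80 target are hypothetical.

```python
import math


def spearman_brown(reliability, length_factor):
    """Predicted reliability when a test is lengthened by `length_factor`."""
    return length_factor * reliability / (1 + (length_factor - 1) * reliability)


def required_length_factor(reliability, target):
    """Lengthening factor needed to move from `reliability` to `target`."""
    return target * (1 - reliability) / (reliability * (1 - target))


current = 0.67  # coefficient reported in the abstract for 7 scenarios
doubled = spearman_brown(current, 2)            # 14 scenarios instead of 7
factor = required_length_factor(current, 0.80)  # 0.80 is a hypothetical target
scenarios_needed = math.ceil(7 * factor)
print(round(doubled, 3), scenarios_needed)
```

Under these assumptions, roughly doubling the number of scenarios would be needed to reach a coefficient near 0.80, which suggests why lengthening an interview is a costly route to higher reliability.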
Mammoth and Mastodon collagen sequences; survival and utility
NASA Astrophysics Data System (ADS)
Buckley, M.; Larkin, N.; Collins, M.
2011-04-01
Near-complete collagen (I) sequences are proposed for elephantid and mammutid taxa, based upon available African elephant genomic data and supported with LC-MALDI-MS/MS and LC-ESI-MS/MS analyses of collagen digests from proboscidean bone. Collagen sequence coverage was investigated from several specimens of two extinct mammoths (Mammuthus trogontherii and Mammuthus primigenius), the extinct American mastodon (Mammut americanum), the extinct straight-tusked elephant (Elephas (Palaeoloxodon) antiquus) and extant Asian (Elephas maximus) and African (Loxodonta africana) elephants and compared between the two ionization techniques used. Two suspected mammoth fossils from the British Middle Pleistocene (Cromerian) deposits of the West Runton Forest Bed were analysed to investigate the potential use of peptide mass spectrometry for fossil identification. Despite the age of the fossils, sufficient peptides were obtained to identify these as elephantid, and sufficient sequence variation to discriminate elephantid and mammutid collagen (I). In-depth LC-MS analyses further failed to identify a peptide that could be used to reliably distinguish between the three genera of elephantids (Elephas, Loxodonta and Mammuthus), an observation consistent with predicted amino acid substitution rates between these species.
Halogen Bonding versus Hydrogen Bonding: A Molecular Orbital Perspective
Wolters, Lando P; Bickelhaupt, F Matthias
2012-01-01
We have carried out extensive computational analyses of the structure and bonding mechanism in trihalides DX⋅⋅⋅A− and the analogous hydrogen-bonded complexes DH⋅⋅⋅A− (D, X, A=F, Cl, Br, I) using relativistic density functional theory (DFT) at zeroth-order regular approximation ZORA-BP86/TZ2P. One purpose was to obtain a set of consistent data from which reliable trends in structure and stability can be inferred over a large range of systems. The main objective was to achieve a detailed understanding of the nature of halogen bonds, how they resemble, and also how they differ from, the better understood hydrogen bonds. Thus, we present an accurate physical model of the halogen bond based on quantitative Kohn–Sham molecular orbital (MO) theory, energy decomposition analyses (EDA) and Voronoi deformation density (VDD) analyses of the charge distribution. It appears that the halogen bond in DX⋅⋅⋅A− arises not only from classical electrostatic attraction but also receives substantial stabilization from HOMO–LUMO interactions between the lone pair of A− and the σ* orbital of D–X. PMID:24551497
2011-01-01
Background In this article, the psychometric properties of the Forensic Inpatient Observation Scale (FIOS) were examined. This instrument was developed to observe behavioral functioning of forensic psychiatric patients. Until now, it has only been used among adult forensic psychiatric patients, and this is the first study in which the FIOS is used with youngsters. Methods Data were gathered from 133 patients. The FIOS was routinely used to assess the psychiatric condition of youngsters at fixed intervals, with a three-month period between measurements. Ward staff working in close contact with the patient conducted the assessments. Of these 133 patients, a YSR/ASR questionnaire was available for 96 and a TRF for 110. For the descriptive, reliability and validity analyses, SPSS version 16.0 was used. Factor analyses were performed by means of Mplus Version 5.2. Results A series of confirmatory and exploratory factor analyses revealed a five-factor structure for the FIOS. The five-factor structure consisted of the following scales: self-care, social behavior, oppositional behavior, verbal skills and distress. The insight scale of the original factor structure could not be replicated in the youth sample. Cronbach's alphas of the five scales ranged from .70 to .85. The self-care, verbal skills and oppositional behavior scales of the FIOS showed no relation with emotional and behavior problems reported by the patients themselves or their teachers. The distress scale of the FIOS did show a relation with the emotional problems reported by patients themselves, and the social behavior scale with behavioral problems as reported by teachers. Conclusions The internal consistency of the FIOS was sufficient and the factor structure in the present sample of youngsters was in general comparable to the original factor structure in an adult sample. Its value lies in the focus on behavioral functioning of youngsters with judicial measures.
What remains to be seen is whether this instrument is sensitive enough to register all aspects of behavioral changes, whether the interrater reliability is sufficient, and whether it has predictive validity to relapse and recidivism. PMID:21951650
Racial/Ethnic Disparities in Consistent Reporting of Smoking-Related Behaviors
Soulakova, Julia N; Huang, Huang; Crockett, Lisa J
2016-01-01
This study assessed the effect of race/ethnicity on the prevalence of inconsistent reports regarding ever smoking, time since smoking cessation, and age of initiating regular smoking. We used the Tobacco Use Supplement to the Current Population Survey data, which came from a test-retest reliability study, and considered three racial/ethnic subpopulations, Hispanics, Non-Hispanic (NH) Blacks and NH Whites. Initial exploration of highly disagreeing reports of time since smoking cessation and age of onset of regular smoking initiation indicated that the majority of these reports corresponded to NH Whites. However, the proportion of the extremely discrepant reports was very small (less than 0.8%), and these reports were not included in the main analyses. Univariate analyses revealed that for each smoking measure, NH Whites tended to report most consistently when compared to Hispanics and NH Blacks. However, the only statistically significant result was that Hispanics were more likely to report their regular smoking initiation age inconsistently than were NH Whites. Analyses that adjusted for other factors confirmed this finding, i.e., Hispanics were 1.8 times more likely to provide inconsistent reports of their age of onset of regular smoking than were NH Whites. Furthermore, these analyses showed that the impact of race/ethnicity on the prevalence of inconsistent reporting may depend on other factors, e.g., age and employment status. For example, non-employed NH Blacks were 1.9 times more likely to recant ever smoking than were non-employed NH Whites. The lower consistency in reports by Hispanics and NH Blacks underscores the importance of developing new survey design and research strategies for detecting relatively small differences in reporting among the racial/ethnic minorities. 
Additional efforts to motivate racial/ethnic minorities to participate in national surveys may not only help increase representation of these subpopulations in study samples but also help improve overall data quality. PMID:27088100
Cloud-based bioinformatics workflow platform for large-scale next-generation sequencing analyses
Liu, Bo; Madduri, Ravi K; Sotomayor, Borja; Chard, Kyle; Lacinski, Lukasz; Dave, Utpal J; Li, Jianqiang; Liu, Chunchen; Foster, Ian T
2014-01-01
Due to the upcoming deluge of genome data, the need for storing and processing large-scale genome data, easy access to biomedical analysis tools, and efficient data sharing and retrieval presents significant challenges. The variability in data volume results in variable computing and storage requirements; therefore, biomedical researchers are pursuing more reliable, dynamic and convenient methods for conducting sequencing analyses. This paper proposes a Cloud-based bioinformatics workflow platform for large-scale next-generation sequencing analyses, which enables reliable and highly scalable execution of sequencing analysis workflows in a fully automated manner. Our platform extends the existing Galaxy workflow system by adding data management capabilities for transferring large quantities of data efficiently and reliably (via Globus Transfer), domain-specific analysis tools preconfigured for immediate use by researchers (via user-specific tools integration), automatic deployment on Cloud for on-demand resource allocation and pay-as-you-go pricing (via Globus Provision), a Cloud provisioning tool for auto-scaling (via HTCondor scheduler), and support for validating the correctness of workflows (via semantic verification tools). Two bioinformatics workflow use cases as well as a performance evaluation are presented to validate the feasibility of the proposed approach. PMID:24462600
ERIC Educational Resources Information Center
Parkes, Jay
2007-01-01
Reliability consists of both important social and scientific values and methods for evidencing those values, though in practice methods are often conflated with the values. With the two distinctly understood, a reliability argument can be made that articulates the particular reliability values most relevant to the particular measurement situation…
Odetunde, Marufat Oluyemisi; Akinpelu, Aderonke Omobonike; Odole, Adesola Christiana
2017-10-19
Psychometric evidence is necessary to establish scientific integrity and clinical usefulness of translations and cultural adaptations of the Stroke-Specific Quality of Life (SS-QoL) scale. However, the limited evidence on psychometrics of the Yoruba version of SS-QoL 2.0 (SS-QoL(Y)) is a significant shortcoming. This study assessed the test-retest reliability, internal consistency, convergent, divergent, discriminant and known-group validity of the SS-QoL(Y). The Yoruba version of the WHOQoL-BREF was used to test the convergent and divergent validity of the SS-QoL(Y) among 100 consenting stroke survivors. The WHOQoL-BREF and SS-QoL(Y) were administered in random order to eliminate bias. The test-retest reliability of the SS-QoL(Y) was assessed among 68 of the respondents within an interval of 7 days. All respondents were purposively recruited from selected secondary and tertiary health facilities in South-west Nigeria. Data were analysed using descriptive statistics of mean and standard deviation, and inferential statistics of Spearman correlation, Cronbach's alpha, Intra-class Correlation Coefficient (ICC), Independent t-test and One-way ANOVA. Alpha level was set at p < 0.05. The physical health, psychological health, social relationship and environment domains on the WHOQoL-BREF showed significant correlations with similar domains on the SS-QoL(Y), with correlation coefficients ranging from 0.214 to 0.360. Dissimilar domains between the two scales had r values from 0.035 to 0.366. Discriminant validity of the SS-QoL(Y) showed that items' r values ranged from 0.711 to 0.920 with their hypothesized domains. The scale demonstrated moderate to strong test-retest reliability with Intra-class correlation coefficient (ICC) for the domains and overall scores (r = 0.47 to 0.81) and moderate to high internal consistency (Cronbach's alpha = 0.61 to 0.82) for domain scores. These correlations were also significant for the domains and overall scores (p < 0.05).
There were no significant differences across different age groups or gender for the domains or overall scores of SS-QoL(Y). Discriminant and known-group validity, test-retest reliability and internal consistency of the Yoruba version of the Stroke Specific Quality of Life 2.0 are adequate while the convergent and divergent validity are low but acceptable. The SS-QoL(Y) is recommended for assessing health-related quality of life among Yoruba stroke survivors.
On modeling human reliability in space flights - Redundancy and recovery operations
NASA Astrophysics Data System (ADS)
Aarset, M.; Wright, J. F.
The reliability of humans is of paramount importance to the safety of space flight systems. This paper describes why 'back-up' operators might not be the best solution, and in some cases, might even degrade system reliability. The problem associated with human redundancy calls for special treatment in reliability analyses. The concept of Standby Redundancy is adopted, and psychological and mathematical models are introduced to improve the way such problems can be estimated and handled. In the past, human reliability has practically been neglected in most reliability analyses, and, when included, the humans have been modeled as a component and treated numerically the way technical components are. This approach is not wrong in itself, but it may lead to systematic errors if too simple analogies from the technical domain are used in the modeling of human behavior. In this paper redundancy in a man-machine system will be addressed. It will be shown how simplification from the technical domain, when applied to human components of a system, may give non-conservative estimates of system reliability.
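A toy calculation can show how a back-up operator might lower system reliability, as the abstract argues. This is a minimal sketch and not the authors' model: it assumes a simple standby arrangement in which the back-up acts only if the primary operator's error is detected, and it assumes the primary's reliability drops slightly when a back-up is present. The function name and every probability below are hypothetical.

```python
def standby_reliability(p_primary, p_detect, p_backup):
    """Success probability of a primary operator with a standby back-up.

    The system succeeds if the primary succeeds, or if the primary fails,
    the failure is detected, and the back-up then succeeds.
    """
    return p_primary + (1 - p_primary) * p_detect * p_backup


# All values are hypothetical, chosen only for illustration.
alone = 0.99                       # primary operator working without a back-up
naive = standby_reliability(0.99, 1.0, 0.99)       # hardware-style analogy
with_backup = standby_reliability(0.98, 0.5, 0.8)  # degraded primary, imperfect hand-over
print(round(naive, 4), round(with_backup, 4), with_backup < alone)
```

With these assumed numbers the hardware-style estimate is optimistic, while the version with human dependencies falls below the single-operator figure, which is the kind of non-conservative error the abstract warns about.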
Schoenmakers, Birgitte; Wens, Johan
2014-03-04
To investigate whether the psychometric qualities of an OSCE consisting of more complex simulated patient encounters remain valid and reliable in the assessment of postgraduate trainees in general practice. In this intervention study without a control group, the traditional OSCE was formally replaced by the new, complex version. The study population comprised all postgraduate trainees (second and third phase) in general practice during the ongoing academic year. Data were collected and handled as part of the formal assessment program. Univariate analyses, the variance of scores and multivariate analyses were performed to assess the test qualities. A total of 340 students participated. Average final scores were slightly higher for third-phase students (t-test, p = 0.05). Overall test scores were equally distributed on station level, circuit level and phase level. A multiple regression analysis revealed that test scores were dependent on the stations and circuits, but not on the master phase. In a changing learning environment, assessment and evaluation strategies require reorientation. The reliability and validity of the OSCE remain subject to discussion. In particular, when it comes to content and design, the traditional OSCE might underestimate the performance level of postgraduate trainees in general practice. A reshaping of this OSCE to a more sophisticated design with more complex patient encounters appears to restore the validity of the test results.
Validation of the Chinese version of EORTC QLQ-BN20 for patients with brain cancer.
Zhang, K; Tian, J; He, Z; Sun, W; Pekbay, B; Lin, Y; Wu, D; Zhang, J; Chen, P; Guo, H; Wan, Y; Wang, M; Yang, S; Zheng, J; Zhang, L
2018-03-01
This single-centre study in mainland China aimed to evaluate the reliability, validity and responsiveness of the Chinese version of the EORTC QLQ-BN20, designed by the European Organization for Research and Treatment of Cancer Quality of Life Group to evaluate the quality of life of patients with brain tumour, cancer or metastases. One hundred and eighty-eight patients with primary or secondary brain cancer at Hunan Provincial Tumor Hospital between September 2013 and June 2014 completed the Chinese EORTC QLQ-C30/BN20 questionnaires, which were developed by translation, back translation and cultural adaptation. Results were statistically analysed using SPSS 17.0. The internal consistency (Cronbach's α coefficient) was between .753 and .869, the correlation coefficients between items and their own dimension were greater than .4, and all items correlated more strongly with their own dimension. Spearman's correlation was used to analyse the correlation of each dimension between the EORTC QLQ-BN20 and the EORTC QLQ-C30; the results showed that individual dimensions were moderately correlated, while other dimensions were weakly correlated. In conclusion, the Chinese version of the EORTC QLQ-BN20 questionnaire had great relevance, reliability, convergent validity and discriminant validity. It provides a valuable tool for the assessment of health-related quality of life in clinical studies of Chinese patients with primary or secondary brain cancer. © 2018 John Wiley & Sons Ltd.