Reliability and risk assessment of structures
NASA Technical Reports Server (NTRS)
Chamis, C. C.
1991-01-01
Development of reliability and risk assessment of structural components and structures is a major activity at Lewis Research Center. It consists of five program elements: (1) probabilistic loads; (2) probabilistic finite element analysis; (3) probabilistic material behavior; (4) assessment of reliability and risk; and (5) probabilistic structural performance evaluation. Recent progress includes: (1) the evaluation of the various uncertainties in terms of cumulative distribution functions for various structural response variables based on known or assumed uncertainties in primitive structural variables; (2) evaluation of the failure probability; (3) reliability and risk-cost assessment; and (4) an outline of an emerging approach for eventual certification of man-rated structures by computational methods. Collectively, the results demonstrate that the structural durability/reliability of man-rated structural components and structures can be effectively evaluated by using formal probabilistic methods.
Structural reliability assessment capability in NESSUS
NASA Technical Reports Server (NTRS)
Millwater, H.; Wu, Y.-T.
1992-01-01
The principal capabilities of NESSUS (Numerical Evaluation of Stochastic Structures Under Stress), an advanced computer code developed for probabilistic structural response analysis, are reviewed, and its structural reliability assessed. The code combines flexible structural modeling tools with advanced probabilistic algorithms in order to compute probabilistic structural response and resistance, component reliability and risk, and system reliability and risk. An illustrative numerical example is presented.
Structural reliability assessment capability in NESSUS
NASA Astrophysics Data System (ADS)
Millwater, H.; Wu, Y.-T.
1992-07-01
The principal capabilities of NESSUS (Numerical Evaluation of Stochastic Structures Under Stress), an advanced computer code developed for probabilistic structural response analysis, are reviewed, and its structural reliability assessed. The code combines flexible structural modeling tools with advanced probabilistic algorithms in order to compute probabilistic structural response and resistance, component reliability and risk, and system reliability and risk. An illustrative numerical example is presented.
Costa, Y M; Morita-Neto, O; de Araújo-Júnior, E N S; Sampaio, F A; Conti, P C R; Bonjardim, L R
2017-03-01
Assessing the reliability of medical measurements is a crucial step towards the elaboration of an applicable clinical instrument. There are few studies that evaluate the reliability of somatosensory assessment and pain modulation of masticatory structures. This study estimated the test-retest reliability, that is over time, of the mechanical somatosensory assessment of anterior temporalis, masseter and temporomandibular joint (TMJ) and the conditioned pain modulation (CPM) using the anterior temporalis as the test site. Twenty healthy women were evaluated in two sessions (1 week apart) by the same examiner. Mechanical detection threshold (MDT), mechanical pain threshold (MPT), wind-up ratio (WUR) and pressure pain threshold (PPT) were assessed on the skin overlying the anterior temporalis, masseter and TMJ of the dominant side. CPM was tested by comparing PPT before and during the hand immersion in a hot water bath. anova and intra-class correlation coefficients (ICCs) were applied to the data (α = 5%). The overall ICCs showed acceptable values for the test-retest reliability of mechanical somatosensory assessment of masticatory structures. The ICC values of 75% of all quantitative sensory measurements were considered fair to excellent (fair = 8·4%, good = 33·3% and excellent = 33·3%). However, the CPM paradigm presented poor reliability (ICC = 0·25). The mechanical somatosensory assessment of the masticatory structures, but not the proposed CPM protocol, can be considered sufficiently reliable over time to evaluate the trigeminal sensory function. © 2016 John Wiley & Sons Ltd.
Wafer level reliability for high-performance VLSI design
NASA Technical Reports Server (NTRS)
Root, Bryan J.; Seefeldt, James D.
1987-01-01
As very large scale integration architecture requires higher package density, reliability of these devices has approached a critical level. Previous processing techniques allowed a large window for varying reliability. However, as scaling and higher current densities push reliability to its limit, tighter control and instant feedback becomes critical. Several test structures developed to monitor reliability at the wafer level are described. For example, a test structure was developed to monitor metal integrity in seconds as opposed to weeks or months for conventional testing. Another structure monitors mobile ion contamination at critical steps in the process. Thus the reliability jeopardy can be assessed during fabrication preventing defective devices from ever being placed in the field. Most importantly, the reliability can be assessed on each wafer as opposed to an occasional sample.
Blouin, Danielle; Day, Andrew G.; Pavlov, Andrey
2011-01-01
Background Although never directly compared, structured interviews are reported as being more reliable than unstructured interviews. This study compared the reliability of both types of interview when applied to a common pool of applicants for positions in an emergency medicine residency program. Methods In 2008, one structured interview was added to the two unstructured interviews traditionally used in our resident selection process. A formal job analysis using the critical incident technique guided the development of the structured interview tool. This tool consisted of 7 scenarios assessing 4 of the domains deemed essential for success as a resident in this program. The traditional interview tool assessed 5 general criteria. In addition to these criteria, the unstructured panel members were asked to rate each candidate on the same 4 essential domains rated by the structured panel members. All 3 panels interviewed all candidates. Main outcomes were the overall, interitem, and interrater reliabilities, the correlations between interview panels, and the dimensionality of each interview tool. Results Thirty candidates were interviewed. The overall reliability reached 0.43 for the structured interview, and 0.81 and 0.71 for the unstructured interviews. Analyses of the variance components showed a high interrater, low interitem reliability for the structured interview, and a high interrater, high interitem reliability for the unstructured interviews. The summary measures from the 2 unstructured interviews were significantly correlated, but neither was correlated with the structured interview. Only the structured interview was multidimensional. Conclusions A structured interview did not yield a higher overall reliability than both unstructured interviews. The lower reliability is explained by a lower interitem reliability, which in turn is due to the multidimensionality of the interview tool. Both unstructured panels consistently rated a single dimension, even when prompted to assess the 4 specific domains established as essential to succeed in this residency program. PMID:23205201
Blouin, Danielle; Day, Andrew G; Pavlov, Andrey
2011-12-01
Although never directly compared, structured interviews are reported as being more reliable than unstructured interviews. This study compared the reliability of both types of interview when applied to a common pool of applicants for positions in an emergency medicine residency program. In 2008, one structured interview was added to the two unstructured interviews traditionally used in our resident selection process. A formal job analysis using the critical incident technique guided the development of the structured interview tool. This tool consisted of 7 scenarios assessing 4 of the domains deemed essential for success as a resident in this program. The traditional interview tool assessed 5 general criteria. In addition to these criteria, the unstructured panel members were asked to rate each candidate on the same 4 essential domains rated by the structured panel members. All 3 panels interviewed all candidates. Main outcomes were the overall, interitem, and interrater reliabilities, the correlations between interview panels, and the dimensionality of each interview tool. Thirty candidates were interviewed. The overall reliability reached 0.43 for the structured interview, and 0.81 and 0.71 for the unstructured interviews. Analyses of the variance components showed a high interrater, low interitem reliability for the structured interview, and a high interrater, high interitem reliability for the unstructured interviews. The summary measures from the 2 unstructured interviews were significantly correlated, but neither was correlated with the structured interview. Only the structured interview was multidimensional. A structured interview did not yield a higher overall reliability than both unstructured interviews. The lower reliability is explained by a lower interitem reliability, which in turn is due to the multidimensionality of the interview tool. Both unstructured panels consistently rated a single dimension, even when prompted to assess the 4 specific domains established as essential to succeed in this residency program.
Scale for positive aspects of caregiving experience: development, reliability, and factor structure.
Kate, N; Grover, S; Kulhara, P; Nehra, R
2012-06-01
OBJECTIVE. To develop an instrument (Scale for Positive Aspects of Caregiving Experience [SPACE]) that evaluates positive caregiving experience and assess its psychometric properties. METHODS. Available scales which assess some aspects of positive caregiving experience were reviewed and a 50-item questionnaire with a 5-point rating was constructed. In all, 203 primary caregivers of patients with severe mental disorders were asked to complete the questionnaire. Internal consistency, test-retest reliability, cross-language reliability, split-half reliability, and face validity were evaluated. Principal component factor analysis was run to assess the factorial validity of the scale. RESULTS. The scale developed as part of the study was found to have good internal consistency, test-retest reliability, cross-language reliability, split-half reliability, and face validity. Principal component factor analysis yielded a 4-factor structure, which also had good test-retest reliability and cross-language reliability. There was a strong correlation between the 4 factors obtained. CONCLUSION. The SPACE developed as part of this study has good psychometric properties.
The Reliability and Structure of the Classroom Assessment Scoring System in German Pre-Schools
ERIC Educational Resources Information Center
Stuck, Andrea; Kammermeyer, Gisela; Roux, Susanna
2016-01-01
This study examined the reliability and structure of the Classroom Assessment Scoring System (CLASS; Pianta, R. C., K. M. La Paro, and B. K. Hamre. 2008. "Classroom Assessment Scoring System. Manual Pre-K." Baltimore, MD: Brookes) and the quality of interactional processes in a German pre-school setting, drawing on a sample of 390…
2011-11-01
assessment to quality of localization/characterization estimates. This protocol includes four critical components: (1) a procedure to identify the...critical factors impacting SHM system performance; (2) a multistage or hierarchical approach to SHM system validation; (3) a model -assisted evaluation...Lindgren, E. A ., Buynak, C. F., Steffes, G., Derriso, M., “ Model -assisted Probabilistic Reliability Assessment for Structural Health Monitoring
Probabilistic Assessment of National Wind Tunnel
NASA Technical Reports Server (NTRS)
Shah, A. R.; Shiao, M.; Chamis, C. C.
1996-01-01
A preliminary probabilistic structural assessment of the critical section of National Wind Tunnel (NWT) is performed using NESSUS (Numerical Evaluation of Stochastic Structures Under Stress) computer code. Thereby, the capabilities of NESSUS code have been demonstrated to address reliability issues of the NWT. Uncertainties in the geometry, material properties, loads and stiffener location on the NWT are considered to perform the reliability assessment. Probabilistic stress, frequency, buckling, fatigue and proof load analyses are performed. These analyses cover the major global and some local design requirements. Based on the assumed uncertainties, the results reveal the assurance of minimum 0.999 reliability for the NWT. Preliminary life prediction analysis results show that the life of the NWT is governed by the fatigue of welds. Also, reliability based proof test assessment is performed.
Development of a probabilistic analysis methodology for structural reliability estimation
NASA Technical Reports Server (NTRS)
Torng, T. Y.; Wu, Y.-T.
1991-01-01
The novel probabilistic analysis method for assessment of structural reliability presented, which combines fast-convolution with an efficient structural reliability analysis, can after identifying the most important point of a limit state proceed to establish a quadratic-performance function. It then transforms the quadratic function into a linear one, and applies fast convolution. The method is applicable to problems requiring computer-intensive structural analysis. Five illustrative examples of the method's application are given.
NASA Astrophysics Data System (ADS)
Yu, Bo; Ning, Chao-lie; Li, Bing
2017-03-01
A probabilistic framework for durability assessment of concrete structures in marine environments was proposed in terms of reliability and sensitivity analysis, which takes into account the uncertainties under the environmental, material, structural and executional conditions. A time-dependent probabilistic model of chloride ingress was established first to consider the variations in various governing parameters, such as the chloride concentration, chloride diffusion coefficient, and age factor. Then the Nataf transformation was adopted to transform the non-normal random variables from the original physical space into the independent standard Normal space. After that the durability limit state function and its gradient vector with respect to the original physical parameters were derived analytically, based on which the first-order reliability method was adopted to analyze the time-dependent reliability and parametric sensitivity of concrete structures in marine environments. The accuracy of the proposed method was verified by comparing with the second-order reliability method and the Monte Carlo simulation. Finally, the influences of environmental conditions, material properties, structural parameters and execution conditions on the time-dependent reliability of concrete structures in marine environments were also investigated. The proposed probabilistic framework can be implemented in the decision-making algorithm for the maintenance and repair of deteriorating concrete structures in marine environments.
Study of structural reliability of existing concrete structures
NASA Astrophysics Data System (ADS)
Druķis, P.; Gaile, L.; Valtere, K.; Pakrastiņš, L.; Goremikins, V.
2017-10-01
Structural reliability of buildings has become an important issue after the collapse of a shopping center in Riga 21.11.2013, caused the death of 54 people. The reliability of a building is the practice of designing, constructing, operating, maintaining and removing buildings in ways that ensure maintained health, ward suffered injuries or death due to use of the building. Evaluation and improvement of existing buildings is becoming more and more important. For a large part of existing buildings, the design life has been reached or will be reached in the near future. The structures of these buildings need to be reassessed in order to find out whether the safety requirements are met. The safety requirements provided by the Eurocodes are a starting point for the assessment of safety. However, it would be uneconomical to require all existing buildings and structures to comply fully with these new codes and corresponding safety levels, therefore the assessment of existing buildings differs with each design situation. This case study describes the simple and practical procedure of determination of minimal reliability index β of existing concrete structures designed by different codes than Eurocodes and allows to reassess the actual reliability level of different structural elements of existing buildings under design load.
NASA Technical Reports Server (NTRS)
1973-01-01
A study was conducted to determine the configuration and performance of a space tug. Detailed descriptions of the insulation, meteoroid protection, primary structure, and ground support equipment are presented. Technical assessments leading to the concept selection are analyzed. The tug mass properties, reliability, and safety assessments are included.
A reliability analysis of the revised competitiveness index.
Harris, Paul B; Houston, John M
2010-06-01
This study examined the reliability of the Revised Competitiveness Index by investigating the test-retest reliability, interitem reliability, and factor structure of the measure based on a sample of 280 undergraduates (200 women, 80 men) ranging in age from 18 to 28 years (M = 20.1, SD = 2.1). The findings indicate that the Revised Competitiveness Index has high test-retest reliability, high inter-item reliability, and a stable factor structure. The results support the assertion that the Revised Competitiveness Index assesses competitiveness as a stable trait rather than a dynamic state.
Wyles, Susannah M; Miskovic, Danilo; Ni, Zhifang; Darzi, Ara W; Valori, Roland M; Coleman, Mark G; Hanna, George B
2016-03-01
There is a lack of educational tools available for surgical teaching critique, particularly for advanced laparoscopic surgery. The aim was to develop and implement a tool that assesses training quality and structures feedback for trainers in the English National Training Programme for laparoscopic colorectal surgery. Semi-structured interviews were performed and analysed, and items were extracted. Through the Delphi process, essential items pertaining to desirable trainer characteristics, training structure and feedback were determined. An assessment tool (Structured Training Trainer Assessment Report-STTAR) was developed and tested for feasibility, acceptability and educational impact. Interview transcripts (29 surgical trainers, 10 trainees, four educationalists) were analysed, and item lists created and distributed for consensus opinion (11 trainers and seven trainees). The STTAR consisted of 64 factors, and its web-based version, the mini-STTAR, included 21 factors that were categorised into four groups (training structure, training behaviour, trainer attributes and role modelling) and structured around a training session timeline (beginning, middle and end). The STTAR (six trainers, 48 different assessments) demonstrated good internal consistency (α = 0.88) and inter-rater reliability (ICC = 0.75). The mini-STTAR demonstrated good inter-item reliability (α = 0.79) and intra-observer reliability on comparison of 85 different trainer/trainee combinations (r = 0.701, p = <0.001). Both were found to be feasible and acceptable. The educational report for trainers was found to be useful (4.4 out of 5). An assessment tool that evaluates training quality was developed and shown to be reliable, acceptable and of educational value. It has been successfully implemented into the English National Training Programme for laparoscopic colorectal surgery.
Assessing Student Learning Online: Overcoming Reliability Issues
ERIC Educational Resources Information Center
Arnold, Stephen D.
2012-01-01
Assessing students in online university courses poses challenges to the reliability factor of the measures being utilized. Some programs have the latitude to incorporate proctored assessments, but this is not always practical in asynchronously structured courses reaching out across a broad geographic region. This paper explores digital audio and…
Probabilistic simulation of uncertainties in thermal structures
NASA Technical Reports Server (NTRS)
Chamis, Christos C.; Shiao, Michael
1990-01-01
Development of probabilistic structural analysis methods for hot structures is a major activity at Lewis Research Center. It consists of five program elements: (1) probabilistic loads; (2) probabilistic finite element analysis; (3) probabilistic material behavior; (4) assessment of reliability and risk; and (5) probabilistic structural performance evaluation. Recent progress includes: (1) quantification of the effects of uncertainties for several variables on high pressure fuel turbopump (HPFT) blade temperature, pressure, and torque of the Space Shuttle Main Engine (SSME); (2) the evaluation of the cumulative distribution function for various structural response variables based on assumed uncertainties in primitive structural variables; (3) evaluation of the failure probability; (4) reliability and risk-cost assessment, and (5) an outline of an emerging approach for eventual hot structures certification. Collectively, the results demonstrate that the structural durability/reliability of hot structural components can be effectively evaluated in a formal probabilistic framework. In addition, the approach can be readily extended to computationally simulate certification of hot structures for aerospace environments.
NASA Technical Reports Server (NTRS)
1991-01-01
The technical effort and computer code enhancements performed during the sixth year of the Probabilistic Structural Analysis Methods program are summarized. Various capabilities are described to probabilistically combine structural response and structural resistance to compute component reliability. A library of structural resistance models is implemented in the Numerical Evaluations of Stochastic Structures Under Stress (NESSUS) code that included fatigue, fracture, creep, multi-factor interaction, and other important effects. In addition, a user interface was developed for user-defined resistance models. An accurate and efficient reliability method was developed and was successfully implemented in the NESSUS code to compute component reliability based on user-selected response and resistance models. A risk module was developed to compute component risk with respect to cost, performance, or user-defined criteria. The new component risk assessment capabilities were validated and demonstrated using several examples. Various supporting methodologies were also developed in support of component risk assessment.
Desmarais, Sarah L.; Nicholls, Tonia L.; Wilson, Catherine M.; Brink, Johann
2012-01-01
The Short-Term Assessment of Risk and Treatability (START) is a relatively new structured professional judgment guide for the assessment and management of short-term risks associated with mental, substance use, and personality disorders. The scheme may be distinguished from other violence risk instruments because of its inclusion of 20 dynamic factors that are rated in terms of both vulnerability and strength. This study examined the reliability and validity of START assessments in predicting inpatient aggression. Research assistants completed START assessments for 120 male forensic psychiatric patients through review of hospital files. They additionally completed Historical-Clinical-Risk Management – 20 (HCR-20) and the Hare Psychopathy Checklist: Screening Version (PCL:SV) assessments. Outcome data was coded from hospital files for a 12-month follow-up period using the Overt Aggression Scale (OAS). START assessments evidenced excellent interrater reliability and demonstrated both predictive and incremental validity over the HCR-20 Historical subscale scores and PCL:SV total scores. Overall, results support the reliability and validity of START assessments, and use of the structured professional judgment approach more broadly, as well as the value of using dynamic risk and protective factors to assess violence risk. PMID:22250595
Structural Reliability and Monte Carlo Simulation.
ERIC Educational Resources Information Center
Laumakis, P. J.; Harlow, G.
2002-01-01
Analyzes a simple boom structure and assesses its reliability using elementary engineering mechanics. Demonstrates the power and utility of Monte-Carlo simulation by showing that such a simulation can be implemented more readily with results that compare favorably to the theoretical calculations. (Author/MM)
Quinn, Amity E; Rosen, Rochelle K; McGeary, John E; Amoa, Francine; Kranzler, Henry R; Francazio, Sarah; McGarvey, Stephen T; Swift, Robert M
2014-01-01
The aims of this study were to develop a bilingual version of the Semi-Structured Assessment for Drug Dependence and Alcoholism (SSADDA) in English and Samoan and determine the reliability of assessments of alcohol dependence in American Samoa. The study consisted of development and reliability-testing phases. In the development phase, the SSADDA alcohol module was translated and the translation was evaluated through cognitive interviews. In the reliability-testing phase, the bilingual SSADDA was administered to 40 ethnic Samoans, including a sub-sample of 26 individuals who were retested. Cognitive interviews indicated the initial translation was culturally and linguistically appropriate except items pertaining to alcohol tolerance, which were modified to reflect Samoan concepts. SSADDA reliability testing indicated diagnoses of DSM-III-R and DSM-IV alcohol dependence were reliable. Reliability varied by language of administration. The English/Samoan version of the SSADDA is appropriate for the diagnosis of DSM-III-R alcohol dependence, which may be useful in advancing research and public health efforts to address alcohol problems in American Samoa and the Western Pacific. The translation methods may inform researchers translating diagnostic and assessment tools into different languages and cultures. © The Author 2014. Medical Council on Alcohol and Oxford University Press. All rights reserved.
Neurology objective structured clinical examination reliability using generalizability theory
Park, Yoon Soo; Lukas, Rimas V.; Brorson, James R.
2015-01-01
Objectives: This study examines factors affecting reliability, or consistency of assessment scores, from an objective structured clinical examination (OSCE) in neurology through generalizability theory (G theory). Methods: Data include assessments from a multistation OSCE taken by 194 medical students at the completion of a neurology clerkship. Facets evaluated in this study include cases, domains, and items. Domains refer to areas of skill (or constructs) that the OSCE measures. G theory is used to estimate variance components associated with each facet, derive reliability, and project the number of cases required to obtain a reliable (consistent, precise) score. Results: Reliability using G theory is moderate (Φ coefficient = 0.61, G coefficient = 0.64). Performance is similar across cases but differs by the particular domain, such that the majority of variance is attributed to the domain. Projections in reliability estimates reveal that students need to participate in 3 OSCE cases in order to increase reliability beyond the 0.70 threshold. Conclusions: This novel use of G theory in evaluating an OSCE in neurology provides meaningful measurement characteristics of the assessment. Differing from prior work in other medical specialties, the cases students were randomly assigned did not influence their OSCE score; rather, scores varied in expected fashion by domain assessed. PMID:26432851
Neurology objective structured clinical examination reliability using generalizability theory.
Blood, Angela D; Park, Yoon Soo; Lukas, Rimas V; Brorson, James R
2015-11-03
This study examines factors affecting reliability, or consistency of assessment scores, from an objective structured clinical examination (OSCE) in neurology through generalizability theory (G theory). Data include assessments from a multistation OSCE taken by 194 medical students at the completion of a neurology clerkship. Facets evaluated in this study include cases, domains, and items. Domains refer to areas of skill (or constructs) that the OSCE measures. G theory is used to estimate variance components associated with each facet, derive reliability, and project the number of cases required to obtain a reliable (consistent, precise) score. Reliability using G theory is moderate (Φ coefficient = 0.61, G coefficient = 0.64). Performance is similar across cases but differs by the particular domain, such that the majority of variance is attributed to the domain. Projections in reliability estimates reveal that students need to participate in 3 OSCE cases in order to increase reliability beyond the 0.70 threshold. This novel use of G theory in evaluating an OSCE in neurology provides meaningful measurement characteristics of the assessment. Differing from prior work in other medical specialties, the cases students were randomly assigned did not influence their OSCE score; rather, scores varied in expected fashion by domain assessed. © 2015 American Academy of Neurology.
ERIC Educational Resources Information Center
Marshall, Margaret J.; Duffy, Ashlee Mills; Powell, Stephen; Bartlett, Lesley Erin
2017-01-01
An ePortfolio Assessment Institute (AI) structured as a faculty development opportunity was undertaken to increase faculty confidence in teaching and assessing ePortfolios and to collect reliable data about student performance on four learning outcomes associated with an institutionwide ePortfolio initiative. Faculty raters participated in the…
Probabilistic Assessment of Fracture Progression in Composite Structures
NASA Technical Reports Server (NTRS)
Chamis, Christos C.; Minnetyan, Levon; Mauget, Bertrand; Huang, Dade; Addi, Frank
1999-01-01
This report describes methods and corresponding computer codes that are used to evaluate progressive damage and fracture and to perform probabilistic assessment in built-up composite structures. Structural response is assessed probabilistically, during progressive fracture. The effects of design variable uncertainties on structural fracture progression are quantified. The fast probability integrator (FPI) is used to assess the response scatter in the composite structure at damage initiation. The sensitivity of the damage response to design variables is computed. The methods are general purpose and are applicable to stitched and unstitched composites in all types of structures and fracture processes starting from damage initiation to unstable propagation and to global structure collapse. The methods are demonstrated for a polymer matrix composite stiffened panel subjected to pressure. The results indicated that composite constituent properties, fabrication parameters, and respective uncertainties have a significant effect on structural durability and reliability. Design implications with regard to damage progression, damage tolerance, and reliability of composite structures are examined.
Bachmann, Monica; de Boer, Wout; Schandelmaier, Stefan; Leibold, Andrea; Marelli, Renato; Jeger, Joerg; Hoffmann-Richter, Ulrike; Mager, Ralph; Schaad, Heinz; Zumbrunn, Thomas; Vogel, Nicole; Bänziger, Oskar; Busse, Jason W; Fischer, Katrin; Kunz, Regina
2016-07-29
Work capacity evaluations by independent medical experts are widely used to inform insurers whether injured or ill workers are capable of engaging in competitive employment. In many countries, evaluation processes lack a clearly structured approach, standardized instruments, and an explicit focus on claimants' functional abilities. Evaluation of subjective complaints, such as mental illness, present additional challenges in the determination of work capacity. We have therefore developed a process for functional evaluation of claimants with mental disorders which complements usual psychiatric evaluation. Here we report the design of a study to measure the reliability of our approach in determining work capacity among patients with mental illness applying for disability benefits. We will conduct a multi-center reliability study, in which 20 psychiatrists trained in our functional evaluation process will assess 30 claimants presenting with mental illness for eligibility to receive disability benefits [Reliability of Functional Evaluation in Psychiatry, RELY-study]. The functional evaluation process entails a five-step structured interview and a reporting instrument (Instrument of Functional Assessment in Psychiatry [IFAP]) to document the severity of work-related functional limitations. We will videotape all evaluations which will be viewed by three psychiatrists who will independently rate claimants' functional limitations. Our primary outcome measure is the evaluation of claimant's work capacity as a percentage (0 to 100 %), and our secondary outcomes are the 12 mental functions and 13 functional capacities assessed by the IFAP-instrument. Inter-rater reliability of four psychiatric experts will be explored using multilevel models to estimate the intraclass correlation coefficient (ICC). Additional analyses include subgroups according to mental disorder, the typicality of claimants, and claimant perceived fairness of the assessment process. We hypothesize that a structured functional approach will show moderate reliability (ICC ≥ 0.6) of psychiatric evaluation of work capacity. Enrollment of actual claimants with mental disorders referred for evaluation by disability/accident insurers will increase the external validity of our findings. Finding moderate levels of reliability, we will continue with a randomized trial to test the reliability of a structured functional approach versus evaluation-as-usual.
Elison, Sarah; Davies, Glyn; Ward, Jonathan
2016-07-28
There is a growing literature around substance use disorder treatment outcomes measures. Various constructs have been suggested as being appropriate for measuring recovery outcomes, including "recovery capital" and "treatment progression." However, these previously proposed constructs do not measure changes in psychosocial functioning during the recovery process. Therefore, a new psychometric assessment, the "Recovery Progression Measure" (RPM), has been developed to measure this recovery oriented psychosocial change. The aims of this study were to evaluate the reliability and factor structure of the RPM via data collected from 2218 service users being treated for their substance dependence. Data were collected from service users accessing the Breaking Free Online (BFO) substance use disorder treatment and recovery program, which has within its baseline assessment a 36-item psychometric measure previously developed by the authors to assess the six areas of functioning described in the RPM. Reliability analyses and exploratory factor analyses (EFA) were conducted to examine the underlying factor structure of the RPM measure. Internal reliability of the RPM measure was found to be excellent (α > .70) with the overall assessment to have reliability α = .89, with item-total correlations revealing moderate-excellent reliability of individual items. EFA revealed the RPM to contain an underlying factor structure of eight components. This study provides initial data to support the reliability of the RPM as a recovery measure. Further work is now underway to extend these findings, including convergent and predictive validity analyses.
Exploring the reliability and validity of the social-moral awareness test.
Livesey, Alexandra; Dodd, Karen; Pote, Helen; Marlow, Elizabeth
2012-11-01
The aim of the study was to explore the validity of the social-moral awareness test (SMAT) a measure designed for assessing socio-moral rule knowledge and reasoning in people with learning disabilities. Comparisons between Theory of Mind and socio-moral reasoning allowed the exploration of construct validity of the tool. Factor structure, reliability and discriminant validity were also assessed. Seventy-one participants with mild-moderate learning disabilities completed the two scales of the SMAT and two False Belief Tasks for Theory of Mind. Reliability of the SMAT was very good, and the scales were shown to be uni-dimensional in factor structure. There was a significant positive relationship between Theory of Mind and both SMAT scales. There is early evidence of the construct validity and reliability of the SMAT. Further assessment of the validity of the SMAT will be required. © 2012 Blackwell Publishing Ltd.
[Inter-rater reliability and validity of the OPD-CA axes structure and conflict].
Benecke, Cord; Bock, Astrid; Wieser, Elke; Tschiesner, Reinhard; Lochmann, Martha; Küspert, Felicia; Schorn, Robert; Viertler, Bernhard; Steinmayr-Gensluckner, Maria
2011-01-01
The manual of the Operationalized Psychodynamic Diagnostics in childhood and adolescence (OPD-CA) is an instrument meanwhile widespread in the clinical practice to assess psychodynamic dimensions. Publications of inter-rater agreement and validity are still outstanding. This study assessed the interrater-reliability and validity for the axis structure and the axis conflict. 60 adolescents between 14 and 17 years, with and without psychic disorders, were diagnosed with the Operationalized Psychodynamic Diagnostics in childhood and adolescence (Arbeitskreis OPD-KJ, 2007) and SCID-II-interviews and questionnaires. A partial sample of 36 OPD-CA-interviews was the data basis for the assessment of inter-rater agreement. Calculations of validity for axis structure and axis conflict were made with the whole sample. Inter-rater agreement for the axis structure and the axis conflict showed good to very good weighted Kappa coefficients among the trained raters. Validity of the axis structure showed good results. The Operationalized Psychodynamic Diagnostics in childhood and adolescence (OPD-CA) allows a reliable diagnostic of axis structure and axis conflict, if the ratings are done on the basis of semistructured videotaped interviews by trained raters. The axis structure shows validity, while the results concerning the validity of the axis conflict remain unclear.
Structural Validation of the Holistic Wellness Assessment
ERIC Educational Resources Information Center
Brown, Charlene; Applegate, E. Brooks; Yildiz, Mustafa
2015-01-01
The Holistic Wellness Assessment (HWA) is a relatively new assessment instrument based on an emergent transdisciplinary model of wellness. This study validated the factor structure identified via exploratory factor analysis (EFA), assessed test-retest reliability, and investigated concurrent validity of the HWA in three separate samples. The…
Schwartz, Karen T G; Bowling, Amanda A; Dickerson, John F; Lynch, Frances L; Brent, David A; Porta, Giovanna; Iyengar, Satish; Weersing, V Robin
2018-05-24
The current study evaluated the interrater reliability of the Child and Adolescent Services Assessment (CASA), a widely used structured interview measuring pediatric mental health service use. Interviews (N = 72) were randomly selected from a pediatric effectiveness trial, and audio was coded by an independent rater. Regressions were employed to identify predictors of rater disagreement. Interrater reliability was high for items (> 94%) and summary metrics (ICC > .79) across service sectors. Predictors of disagreement varied by domain; significant predictors indexed higher clinical severity or social disadvantage. Results support the CASA as a reliable and robust assessment of pediatric service use, but administrators should be alert when assessing vulnerable populations.
Factor Structure and Reliability of Test Items for Saudi Teacher Licence Assessment
ERIC Educational Resources Information Center
Alsadaawi, Abdullah Saleh
2017-01-01
The Saudi National Assessment Centre administers the Computer Science Teacher Test for teacher certification. The aim of this study is to explore gender differences in candidates' scores, and investigate dimensionality, reliability, and differential item functioning using confirmatory factor analysis and item response theory. The confirmatory…
Reliability of the Structured Clinical Interview for DSM-5 Sleep Disorders Module.
Taylor, Daniel J; Wilkerson, Allison K; Pruiksma, Kristi E; Williams, Jacob M; Ruggero, Camilo J; Hale, Willie; Mintz, Jim; Organek, Katherine Marczyk; Nicholson, Karin L; Litz, Brett T; Young-McCaughan, Stacey; Dondanville, Katherine A; Borah, Elisa V; Brundige, Antoinette; Peterson, Alan L
2018-03-15
To develop and demonstrate interrater reliability for a Structured Clinical Interview for Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5) Sleep Disorders (SCISD). The SCISD was designed to be a brief, reliable, and valid interview assessment of adult sleep disorders as defined by the DSM-5. A sample of 106 postdeployment active-duty military members seeking cognitive behavioral therapy for insomnia in a randomized clinical trial were assessed with the SCISD prior to treatment to determine eligibility. Audio recordings of these interviews were double-scored for interrater reliability. The interview is 8 pages long, includes 20 to 51 questions, and takes 10 to 20 minutes to administer. Of the nine major disorders included in the SCISD, six had prevalence rates high enough (ie, n ≥ 5) to include in analyses. Cohen kappa coefficient (κ) was used to assess interrater reliability for insomnia, hypersomnolence, obstructive sleep apnea hypopnea (OSAH), circadian rhythm sleep-wake, nightmare, and restless legs syndrome disorders. There was excellent interrater reliability for insomnia (1.0) and restless legs syndrome (0.83); very good reliability for nightmare disorder (0.78) and OSAH (0.73); and good reliability for hypersomnolence (0.50) and circadian rhythm sleep-wake disorders (0.50). The SCISD is a brief, structured clinical interview that is easy for clinicians to learn and use. The SCISD showed moderate to excellent interrater reliability for six of the major sleep disorders in the DSM-5 among active duty military seeking cognitive behavioral therapy for insomnia in a randomized clinical trial. Replication and extension studies are needed. Registry: ClinicalTrials.gov; Title: Comparing Internet and In-Person Brief Cognitive Behavioral Therapy of Insomnia; Identifier: NCT01549899; URL: https://clinicaltrials.gov/ct2/show/NCT01549899. © 2018 American Academy of Sleep Medicine.
Chaudhary, Richa; Grover, Chander; Bhattacharya, S N; Sharma, Arun
2017-01-01
The assessment of dermatology undergraduates is being done through computer assisted objective structured clinical examination at our institution for the last 4 years. We attempted to compare objective structured clinical examination (OSCE) and computer assisted objective structured clinical examination (CA-OSCE) as assessment tools. To assess the relative effectiveness of CA-OSCE and OSCE as assessment tools for undergraduate dermatology trainees. Students underwent CA-OSCE as well as OSCE-based evaluation of equal weightage as an end of posting assessment. The attendance as well as the marks in both the examination formats were meticulously recorded and statistically analyzed using SPSS version 20.0. Intercooled Stata V9.0 was used to assess the reliability and internal consistency of the examinations conducted. Feedback from both students and examiners was also recorded. The mean attendance for the study group was 77% ± 12.0%. The average score on CA- OSCE and OSCE was 47.4% ± 19.8% and 53.5% ± 18%, respectively. These scores showed a mutually positive correlation, with Spearman's coefficient being 0.593. Spearman's rank correlation coefficient between attendance scores and assessment score was 0.485 for OSCE and 0.451 for CA-OSCE. The Cronbach's alpha coefficient for all the tests ranged from 0.76 to 0.87 indicating high reliability. The comparison was based on a single batch of 139 students. Such an evaluation on more students in larger number of batches over successive years could help throw more light on the subject. Computer assisted objective structured clinical examination was found to be a valid, reliable and effective format for dermatology assessment, being rated as the preferred format by examiners.
Ponton-Carss, Alicia; Hutchison, Carol; Violato, Claudio
2011-10-01
The purpose of this study was to investigate the reliability and validity of a performance assessment of communication, professionalism, and surgical skills competencies for surgery residents. Fourteen residents from the general surgery program of the University of Calgary were assessed in 7 surgical simulation stations that included communication and professionalism skills. The internal consistency reliability of the checklists and global rating scales combined was adequate for communication (α = .75-.92) and surgical skills (α = .86-.96), but not for professionalism (α = 0). There was evidence of validity as surgical skills performance improved as a function of postgraduate year level but not for the professionalism checklist. Surgical skills and communication correlated in the 2 stations assessed (r = .55 and .57; P < .05). There is evidence for both reliability and validity for simultaneously assessing surgical skills and communication skills. Further instrument development is required to assess professionalism in a structured examination context. Copyright © 2011 Elsevier Inc. All rights reserved.
Aarons, Gregory A; McDonald, Elizabeth J; Connelly, Cynthia D; Newton, Rae R
2007-12-01
The purpose of this study was to examine the factor structure, reliability, and validity of the Family Assessment Device (FAD) among a national sample of Caucasian and Hispanic American families receiving public sector mental health services. A confirmatory factor analysis conducted to test model fit yielded equivocal findings. With few exceptions, indices of model fit, reliability, and validity were poorer for Hispanic Americans compared with Caucasian Americans. Contrary to our expectation, an exploratory factor analysis did not result in a better fitting model of family functioning. Without stronger evidence supporting a reformulation of the FAD, we recommend against such a course of action. Findings highlight the need for additional research on the role of culture in measurement of family functioning.
Huang, Lijie; Huang, Taicheng; Zhen, Zonglei; Liu, Jia
2016-03-15
We present a test-retest dataset for evaluation of long-term reliability of measures from structural and resting-state functional magnetic resonance imaging (sMRI and rfMRI) scans. The repeated scan dataset was collected from 61 healthy adults in two sessions using highly similar imaging parameters at an interval of 103-189 days. However, as the imaging parameters were not completely identical, the reliability estimated from this dataset shall reflect the lower bounds of the true reliability of sMRI/rfMRI measures. Furthermore, in conjunction with other test-retest datasets, our dataset may help explore the impact of different imaging parameters on reliability of sMRI/rfMRI measures, which is especially critical for assessing datasets collected from multiple centers. In addition, intelligence quotient (IQ) was measured for each participant using Raven's Advanced Progressive Matrices. The data can thus be used for purposes other than assessing reliability of sMRI/rfMRI alone. For example, data from each single session could be used to associate structural and functional measures of the brain with the IQ metrics to explore brain-IQ association.
Recent advances in computational structural reliability analysis methods
NASA Astrophysics Data System (ADS)
Thacker, Ben H.; Wu, Y.-T.; Millwater, Harry R.; Torng, Tony Y.; Riha, David S.
1993-10-01
The goal of structural reliability analysis is to determine the probability that the structure will adequately perform its intended function when operating under the given environmental conditions. Thus, the notion of reliability admits the possibility of failure. Given the fact that many different modes of failure are usually possible, achievement of this goal is a formidable task, especially for large, complex structural systems. The traditional (deterministic) design methodology attempts to assure reliability by the application of safety factors and conservative assumptions. However, the safety factor approach lacks a quantitative basis in that the level of reliability is never known and usually results in overly conservative designs because of compounding conservatisms. Furthermore, problem parameters that control the reliability are not identified, nor their importance evaluated. A summary of recent advances in computational structural reliability assessment is presented. A significant level of activity in the research and development community was seen recently, much of which was directed towards the prediction of failure probabilities for single mode failures. The focus is to present some early results and demonstrations of advanced reliability methods applied to structural system problems. This includes structures that can fail as a result of multiple component failures (e.g., a redundant truss), or structural components that may fail due to multiple interacting failure modes (e.g., excessive deflection, resonate vibration, or creep rupture). From these results, some observations and recommendations are made with regard to future research needs.
Recent advances in computational structural reliability analysis methods
NASA Technical Reports Server (NTRS)
Thacker, Ben H.; Wu, Y.-T.; Millwater, Harry R.; Torng, Tony Y.; Riha, David S.
1993-01-01
The goal of structural reliability analysis is to determine the probability that the structure will adequately perform its intended function when operating under the given environmental conditions. Thus, the notion of reliability admits the possibility of failure. Given the fact that many different modes of failure are usually possible, achievement of this goal is a formidable task, especially for large, complex structural systems. The traditional (deterministic) design methodology attempts to assure reliability by the application of safety factors and conservative assumptions. However, the safety factor approach lacks a quantitative basis in that the level of reliability is never known and usually results in overly conservative designs because of compounding conservatisms. Furthermore, problem parameters that control the reliability are not identified, nor their importance evaluated. A summary of recent advances in computational structural reliability assessment is presented. A significant level of activity in the research and development community was seen recently, much of which was directed towards the prediction of failure probabilities for single mode failures. The focus is to present some early results and demonstrations of advanced reliability methods applied to structural system problems. This includes structures that can fail as a result of multiple component failures (e.g., a redundant truss), or structural components that may fail due to multiple interacting failure modes (e.g., excessive deflection, resonate vibration, or creep rupture). From these results, some observations and recommendations are made with regard to future research needs.
ERIC Educational Resources Information Center
Ling, Guangming
2012-01-01
To assess the value of individual students' subscores on the Major Field Test in Business (MFT Business), I examined the test's internal structure with factor analysis and structural equation model methods, and analyzed the subscore reliabilities using the augmented scores method. Analyses of the internal structure suggested that the MFT Business…
ERIC Educational Resources Information Center
Lakshmipathy, K.
2015-01-01
The objectives of the present study were to 1) assess student attitudes to physiology, 2) evaluate student opinions about the influence of an objective structured practical examination (OSPE) on competence, and 3) assess the validity and reliability of an indigenously designed feedback questionnaire. A structured questionnaire containing 16 item…
Xiao, Yuan-mei; Wang, Zhi-ming; Wang, Mian-zhen; Lan, Ya-jia
2005-06-01
To test the reliability and validity of two mental workload assessment scales, i.e. subjective workload assessment technique (SWAT) and NASA task load index (NASA-TLX). One thousand two hundred and sixty-eight mental workers were sampled from various kinds of occupations, such as scientific research, education, administration and medicine, etc, with randomized cluster sampling. The re-test reliability, split-half reliability, Cronbach's alpha coefficient and correlation coefficients between item score and total score were adopted to test the reliability. The test of validity included structure validity. The re-test reliability coefficients of these two scales and their items were ranged from 0.516 to 0.753 (P < 0.01), indicating the two scales had good re-test reliability; the split-half reliability of SWAT was 0.645, and its Cronbach's alpha coefficient was more than 0.80, all the correlation coefficients between its items score and total score were more than 0.70; as for NASA-TLX, both the split-half reliability and Cronbach's alpha coefficient were more than 0.80, the correlation coefficients between its items score and total score were all more than 0.60 (P < 0.01) except the item of performance. Both scales had good inner consistency. The Pearson correlation coefficient between the two scales was 0.492 (P < 0.01), implying the results of the two scales had good consistency. Factor analysis showed that the two scales had good structure validity. Both SWAT and NASA-TLX have good reliability and validity and may be used as a valid tool to assess mental workload in China after being revised properly.
Pérez V, Cristhian; Ortiz M, Liliana; Fasce H, Eduardo; Parra P, Paula; Matus B, Olga; McColl C, Peter; Torres A, Graciela; Meyer K, Andrea; Márquez U, Carolina; Ortega B, Javiera
2015-11-01
Academic Involvement Questionnaire, Expectations version (CIA-A), assesses the expectations of involvement in studies. It is a relevant predictor of student success. However, the evidence of its validity and reliability in Chile is low, and in the case of Medical students, there is no evidence at all. To evaluate the factorial structure and internal consistency of the CIA-A in Chilean Medical school freshmen. The survey was applied to 340 Medicine freshmen, chosen by non-probability quota sampling. They answered a back-translated version of CIA-A from Portuguese to Spanish, plus a sociodemographic questionnaire. For psychometric analysis of the CIA-A, an exploratory factor analysis was carried on, the reliability of the factors was calculated, a descriptive analysis was conducted and their correlation was assessed. Five factors were identified: vocational, institutional and social involvement, use of resources and student participation. Their reliabilities ranged between Cronbach's alpha values of 0.71 to 0.87. Factors also showed statistically significant correlations between each other. Identified factor structure is theoretically consistent with the structure of original version. It just disagrees in one factor. In addition, the factors' internal consistency were adequate for using them in research. This supports the construct validity and reliability of the CIA-A to assess involvement expectations in medical school freshmen.
Kumar, A; Bridgham, R; Potts, M; Gushurst, C; Hamp, M; Passal, D
2001-01-01
To determine consistency of assessment in a new paper case-based structured oral examination in a multi-community pediatrics clerkship, and to identify correctable problems in the administration of examination and assessment process. Nine paper case-based oral examinations were audio-taped. From audio-tapes five community coordinators scored examiner behaviors and graded student performance. Correlations among examiner behaviors scores were examined. Graphs identified grading patterns of evaluators. The effect of exam-giving on evaluators was assessed by t-test. Reliability of grades was calculated and the effect of reducing assessment problems was modeled. Exam-givers differed most in their "teaching-guiding" behavior, and this negatively correlated with student grades. Exam reliability was lowered mainly by evaluator differences in leniency and grading pattern; less important was absence of standardization in cases. While grade reliability was low in early use of the paper case-based oral examination, modeling of plausible effects of training and monitoring for greater uniformity in administration of the examination and assigning scores suggests that more adequate reliabilities can be attained.
Williams, Janet B W; Kobak, Kenneth A
2008-01-01
The Montgomery-Asberg Depression Rating Scale (MADRS) is often used in clinical trials to select patients and to assess treatment efficacy. The scale was originally published without suggested questions for clinicians to use in gathering the information necessary to rate the items. Structured and semi-structured interview guides have been found to improve reliability with other scales. To describe the development and test-retest reliability of a structured interview guide for the MADRS (SIGMA). A total of 162 test-retest interviews were conducted by 81 rater pairs. Each patient was interviewed twice, once by each rater conducting an independent interview. The intraclass correlation for total score between raters using the SIGMA was r=0.93, P<0.0001. All ten items had good to excellent interrater reliability. Use of the SIGMA can result in high reliability of MADRS scores in evaluating patients with depression.
Lyon, Aaron R; Pullmann, Michael D; Dorsey, Shannon; Martin, Prerna; Grigore, Alexandra A; Becker, Emily M; Jensen-Doss, Amanda
2018-05-11
Measurement-based care (MBC) is an increasingly popular, evidence-based practice, but there are no tools with established psychometrics to evaluate clinician use of MBC practices in mental health service delivery. The current study evaluated the reliability, validity, and factor structure of scores generated from a brief, standardized tool to measure MBC practices, the Current Assessment Practice Evaluation-Revised (CAPER). Survey data from a national sample of 479 mental health clinicians were used to conduct exploratory and confirmatory factor analyses, as well as reliability and validity analyses (e.g., relationships between CAPER subscales and clinician MBC attitudes). Analyses revealed competing two- and three-factor models. Regardless of the model used, scores from CAPER subscales demonstrated good reliability and convergent and divergent validity with MBC attitudes in the expected directions. The CAPER appears to be a psychometrically sound tool for assessing clinician MBC practices. Future directions for development and application of the tool are discussed.
An overview of reliability assessment and control for design of civil engineering structures
DOE Office of Scientific and Technical Information (OSTI.GOV)
Field, R.V. Jr.; Grigoriadis, K.M.; Bergman, L.A.
1998-06-01
Random variations, whether they occur in the input signal or the system parameters, are phenomena that occur in nearly all engineering systems of interest. As a result, nondeterministic modeling techniques must somehow account for these variations to ensure validity of the solution. As might be expected, this is a difficult proposition and the focus of many current research efforts. Controlling seismically excited structures is one pertinent application of nondeterministic analysis and is the subject of the work presented herein. This overview paper is organized into two sections. First, techniques to assess system reliability, in a context familiar to civil engineers,more » are discussed. Second, and as a consequence of the first, active control methods that ensure good performance in this random environment are presented. It is the hope of the authors that these discussions will ignite further interest in the area of reliability assessment and design of controlled civil engineering structures.« less
Sustainability of transport structures - some aspects of the nonlinear reliability assessment
NASA Astrophysics Data System (ADS)
Pukl, Radomír; Sajdlová, Tereza; Strauss, Alfred; Lehký, David; Novák, Drahomír
2017-09-01
Efficient techniques for both nonlinear numerical analysis of concrete structures and advanced stochastic simulation methods have been combined in order to offer an advanced tool for assessment of realistic behaviour, failure and safety assessment of transport structures. The utilized approach is based on randomization of the non-linear finite element analysis of the structural models. Degradation aspects such as carbonation of concrete can be accounted in order predict durability of the investigated structure and its sustainability. Results can serve as a rational basis for the performance and sustainability assessment based on advanced nonlinear computer analysis of the structures of transport infrastructure such as bridges or tunnels. In the stochastic simulation the input material parameters obtained from material tests including their randomness and uncertainty are represented as random variables or fields. Appropriate identification of material parameters is crucial for the virtual failure modelling of structures and structural elements. Inverse analysis using artificial neural networks and virtual stochastic simulations approach is applied to determine the fracture mechanical parameters of the structural material and its numerical model. Structural response, reliability and sustainability have been investigated on different types of transport structures made from various materials using the above mentioned methodology and tools.
Long, Clive G; Banyard, Ellen; Fulton, Barbara; Hollin, Clive R
2014-09-01
Arson and fire-setting are highly prevalent among patients in secure psychiatric settings but there is an absence of valid and reliable assessment instruments and no evidence of a significant approach to intervention. To develop a semi-structured interview assessment specifically for fire-setting to augment structured assessments of risk and need. The extant literature was used to frame interview questions relating to the antecedents, behaviour and consequences necessary to formulate a functional analysis. Questions also covered readiness to change, fire-setting self-efficacy, the probability of future fire-setting, barriers to change, and understanding of fire-setting behaviour. The assessment concludes with indications for assessment and a treatment action plan. The inventory was piloted with a sample of women in secure care and was assessed for comprehensibility, reliability and validity. Staff rated the St Andrews Fire and Risk Instrument (SAFARI) as acceptable to patients and easy to administer. SAFARI was found to be comprehensible by over 95% of the general population, to have good acceptance, high internal reliability, substantial test-retest reliability and validity. SAFARI helps to provide a clear explanation of fire-setting in terms of the complex interplay of antecedents and consequences and facilitates the design of an individually tailored treatment programme in sympathy with a cognitive-behavioural approach. Further studies are needed to verify the reliability and validity of SAFARI with male populations and across settings.
Stirling Convertor Fasteners Reliability Quantification
NASA Technical Reports Server (NTRS)
Shah, Ashwin R.; Korovaichuk, Igor; Kovacevich, Tiodor; Schreiber, Jeffrey G.
2006-01-01
Onboard Radioisotope Power Systems (RPS) being developed for NASA s deep-space science and exploration missions require reliable operation for up to 14 years and beyond. Stirling power conversion is a candidate for use in an RPS because it offers a multifold increase in the conversion efficiency of heat to electric power and reduced inventory of radioactive material. Structural fasteners are responsible to maintain structural integrity of the Stirling power convertor, which is critical to ensure reliable performance during the entire mission. Design of fasteners involve variables related to the fabrication, manufacturing, behavior of fasteners and joining parts material, structural geometry of the joining components, size and spacing of fasteners, mission loads, boundary conditions, etc. These variables have inherent uncertainties, which need to be accounted for in the reliability assessment. This paper describes these uncertainties along with a methodology to quantify the reliability, and provides results of the analysis in terms of quantified reliability and sensitivity of Stirling power conversion reliability to the design variables. Quantification of the reliability includes both structural and functional aspects of the joining components. Based on the results, the paper also describes guidelines to improve the reliability and verification testing.
Coefficient Alpha: A Reliability Coefficient for the 21st Century?
ERIC Educational Resources Information Center
Yang, Yanyun; Green, Samuel B.
2011-01-01
Coefficient alpha is almost universally applied to assess reliability of scales in psychology. We argue that researchers should consider alternatives to coefficient alpha. Our preference is for structural equation modeling (SEM) estimates of reliability because they are informative and allow for an empirical evaluation of the assumptions…
Structural reliability assessment of the Oman India Pipeline
DOE Office of Scientific and Technical Information (OSTI.GOV)
Al-Sharif, A.M.; Preston, R.
1996-12-31
Reliability techniques are increasingly finding application in design. The special design conditions for the deep water sections of the Oman India Pipeline dictate their use since the experience basis for application of standard deterministic techniques is inadequate. The paper discusses the reliability analysis as applied to the Oman India Pipeline, including selection of a collapse model, characterization of the variability in the parameters that affect pipe resistance to collapse, and implementation of first and second order reliability analyses to assess the probability of pipe failure. The reliability analysis results are used as the basis for establishing the pipe wall thicknessmore » requirements for the pipeline.« less
Increased Authenticity in Practical Assessment Using Emergency Case OSCE Stations
ERIC Educational Resources Information Center
Ruesseler, Miriam; Weinlich, Michael; Byhahn, Christian; Muller, Michael P.; Junger, Jana; Marzi, Ingo; Walcher, Felix
2010-01-01
In case of an emergency, a fast and structured patient management is crucial for patient's outcome. The competencies needed should be acquired and assessed during medical education. The objective structured clinical examination (OSCE) is a valid and reliable assessment format to evaluate practical skills. However, traditional OSCE stations examine…
ERIC Educational Resources Information Center
Woodburn, Jim; Sutcliffe, Nick
1996-01-01
The Objective Structured Clinical Examination (OSCE), initially developed for undergraduate medical education, has been adapted for assessment of clinical skills in podiatry students. A 12-month pilot study found the test had relatively low levels of reliability, high construct and criterion validity, and good stability of performance over time.…
Evaluation of a model of violence risk assessment among forensic psychiatric patients.
Douglas, Kevin S; Ogloff, James R P; Hart, Stephen D
2003-10-01
This study tested the interrater reliability and criterion-related validity of structured violence risk judgments made by using one application of the structured professional judgment model of violence risk assessment, the HCR-20 violence risk assessment scheme, which assesses 20 key risk factors in three domains: historical, clinical, and risk management. The HCR-20 was completed for a sample of 100 forensic psychiatric patients who had been found not guilty by reason of a mental disorder and were subsequently released to the community. Violence in the community was determined from multiple file-based sources. Interrater reliability of structured final risk judgments of low, moderate, or high violence risk made on the basis of the structured professional judgment model was acceptable (weighted kappa=.61). Structured final risk judgments were significantly predictive of postrelease community violence, yielding moderate to large effect sizes. Event history analyses showed that final risk judgments made with the structured professional judgment model added incremental validity to the HCR-20 used in an actuarial (numerical) sense. The findings support the structured professional judgment model of risk assessment as well as the HCR-20 specifically and suggest that clinical judgment, if made within a structured context, can contribute in meaningful ways to the assessment of violence risk.
Wood, David L; Sawicki, Gregory S; Miller, M David; Smotherman, Carmen; Lukens-Bull, Katryne; Livingood, William C; Ferris, Maria; Kraemer, Dale F
2014-01-01
National consensus statements recommend that providers regularly assess the transition readiness skills of adolescent and young adults (AYA). In 2010 we developed a 29-item version of Transition Readiness Assessment Questionnaire (TRAQ). We reevaluated item performance and factor structure, and reassessed the TRAQ's reliability and validity. We surveyed youth from 3 academic clinics in Jacksonville, Florida; Chapel Hill, North Carolina; and Boston, Massachusetts. Participants were AYA with special health care needs aged 14 to 21 years. From a convenience sample of 306 patients, we conducted item reduction strategies and exploratory factor analysis (EFA). On a second convenience sample of 221 patients, we conducted confirmatory factor analysis (CFA). Internal reliability was assessed by Cronbach's alpha and criterion validity. Analyses were conducted by the Wilcoxon rank sum test and mixed linear models. The item reduction and EFA resulted in a 20-item scale with 5 identified subscales. The CFA conducted on a second sample provided a good fit to the data. The overall scale has high reliability overall (Cronbach's alpha = .94) and good reliability for 4 of the 5 subscales (Cronbach's alpha ranging from .90 to .77 in the pooled sample). Each of the 5 subscale scores were significantly higher for adolescents aged 18 years and older versus those younger than 18 (P < .0001) in both univariate and multivariate analyses. The 20-item, 5-factor structure for the TRAQ is supported by EFA and CFA on independent samples and has good internal reliability and criterion validity. Additional work is needed to expand or revise the TRAQ subscales and test their predictive validity. Copyright © 2014 Academic Pediatric Association. Published by Elsevier Inc. All rights reserved.
Ultimate Limit State Assessment of Timber Bolt Connection Subjected to Double Unequal Shears
NASA Astrophysics Data System (ADS)
Musilek, Josef; Plachy, Jan
2017-10-01
Nowadays the problems occur when a structure engineer need to assess the ultimate limit state of timber bolt connection which is subjected to double unequal shears. This assessment of ultimate limit state shows the reliability of these connections. In assessing the reliability of this connection in ultimate limit state is a problem, because the formulas and equations that are currently available in design standards and available literature, describing only connections loaded symmetrically - this mean that they describe the timber bolt connection subjected to double equal shears. This fact causes problems because structural engineers have no available support, according to which they could assess reliability of the connection in terms of the ultimate limit state. They must therefore often report following an asymmetrically loaded connections carry about using formulas, which are primarily designed for checking connections loaded symmetrically. This leads logically to the fact that it is not respected by the actual behaviour of the connection in the ultimate limit state. Formulas derived in this paper provide the possibility to assess the ultimate limit state for such connection. The formulas derived in this article allow to carry out a reliability assessment of the ultimate limit state of timber bolt connection subjected to double shear. The using of the formulas derived in this paper leads to better description of the behaviour of this type of connection and also to the more economic design. An example of using these derived formulas is shown. There is shown in this example, how to assess the reliability of timber bolt connection subjected to double unequal shears in terms of ultimate limit states.
Bridge reliability assessment based on the PDF of long-term monitored extreme strains
NASA Astrophysics Data System (ADS)
Jiao, Meiju; Sun, Limin
2011-04-01
Structural health monitoring (SHM) systems can provide valuable information for the evaluation of bridge performance. As the development and implementation of SHM technology in recent years, the data mining and use has received increasingly attention and interests in civil engineering. Based on the principle of probabilistic and statistics, a reliability approach provides a rational basis for analysis of the randomness in loads and their effects on structures. A novel approach combined SHM systems with reliability method to evaluate the reliability of a cable-stayed bridge instrumented with SHM systems was presented in this paper. In this study, the reliability of the steel girder of the cable-stayed bridge was denoted by failure probability directly instead of reliability index as commonly used. Under the assumption that the probability distributions of the resistance are independent to the responses of structures, a formulation of failure probability was deduced. Then, as a main factor in the formulation, the probability density function (PDF) of the strain at sensor locations based on the monitoring data was evaluated and verified. That Donghai Bridge was taken as an example for the application of the proposed approach followed. In the case study, 4 years' monitoring data since the operation of the SHM systems was processed, and the reliability assessment results were discussed. Finally, the sensitivity and accuracy of the novel approach compared with FORM was discussed.
Fatigue reliability of deck structures subjected to correlated crack growth
NASA Astrophysics Data System (ADS)
Feng, G. Q.; Garbatov, Y.; Guedes Soares, C.
2013-12-01
The objective of this work is to analyse fatigue reliability of deck structures subjected to correlated crack growth. The stress intensity factors of the correlated cracks are obtained by finite element analysis and based on which the geometry correction functions are derived. The Monte Carlo simulations are applied to predict the statistical descriptors of correlated cracks based on the Paris-Erdogan equation. A probabilistic model of crack growth as a function of time is used to analyse the fatigue reliability of deck structures accounting for the crack propagation correlation. A deck structure is modelled as a series system of stiffened panels, where a stiffened panel is regarded as a parallel system composed of plates and are longitudinal. It has been proven that the method developed here can be conveniently applied to perform the fatigue reliability assessment of structures subjected to correlated crack growth.
Fiori, Simona; Cioni, Giovanni; Klingels, Katrjin; Ortibus, Els; Van Gestel, Leen; Rose, Stephen; Boyd, Roslyn N; Feys, Hilde; Guzzetta, Andrea
2014-09-01
To describe the development of a novel rating scale for classification of brain structural magnetic resonance imaging (MRI) in children with cerebral palsy (CP) and to assess its interrater and intrarater reliability. The scale consists of three sections. Section 1 contains descriptive information about the patient and MRI. Section 2 contains the graphical template of brain hemispheres onto which the lesion is transposed. Section 3 contains the scoring system for the quantitative analysis of the lesion characteristics, grouped into different global scores and subscores that assess separately side, regions, and depth. A larger interrater and intrarater reliability study was performed in 34 children with CP (22 males, 12 females; mean age at scan of 9 y 5 mo [SD 3 y 3 mo], range 4 y-16 y 11 mo; Gross Motor Function Classification System level I, [n=22], II [n=10], and level III [n=2]). Very high interrater and intrarater reliability of the total score was found with indices above 0.87. Reliability coefficients of the lobar and hemispheric subscores ranged between 0.53 and 0.95. Global scores for hemispheres, basal ganglia, brain stem, and corpus callosum showed reliability coefficients above 0.65. This study presents the first visual, semi-quantitative scale for classification of brain structural MRI in children with CP. The high degree of reliability of the scale supports its potential application for investigating the relationship between brain structure and function and examining treatment response according to brain lesion severity in children with CP. © 2014 Mac Keith Press.
Is It Safe? Reliability and Validity of Structured versus Unstructured Child Safety Judgments
ERIC Educational Resources Information Center
Bartelink, Cora; de Kwaadsteniet, Leontien; ten Berge, Ingrid J.; Witteman, Cilia L. M.
2017-01-01
Background: The LIRIK, an instrument for the assessment of child safety and risk, is designed to improve assessments by guiding professionals through a structured evaluation of relevant signs, risk factors, and protective factors. Objective: We aimed to assess the interrater agreement and the predictive validity of professionals' judgments made…
ERIC Educational Resources Information Center
Bogo, Marion; Regehr, Cheryl; Logie, Carmen; Katz, Ellen; Mylopoulos, Maria; Regehr, Glenn
2011-01-01
The development of standardized, valid, and reliable methods for assessment of students' practice competence continues to be a challenge for social work educators. In this study, the Objective Structured Clinical Examination (OSCE), originally used in medicine to assess performance through simulated interviews, was adapted for social work to…
NASA Technical Reports Server (NTRS)
Yunis, Isam S.; Carney, Kelly S.
1993-01-01
A new aerospace application of structural reliability techniques is presented, where the applied forces depend on many probabilistic variables. This application is the plume impingement loading of the Space Station Freedom Photovoltaic Arrays. When the space shuttle berths with Space Station Freedom it must brake and maneuver towards the berthing point using its primary jets. The jet exhaust, or plume, may cause high loads on the photovoltaic arrays. The many parameters governing this problem are highly uncertain and random. An approach, using techniques from structural reliability, as opposed to the accepted deterministic methods, is presented which assesses the probability of failure of the array mast due to plume impingement loading. A Monte Carlo simulation of the berthing approach is used to determine the probability distribution of the loading. A probability distribution is also determined for the strength of the array. Structural reliability techniques are then used to assess the array mast design. These techniques are found to be superior to the standard deterministic dynamic transient analysis, for this class of problem. The results show that the probability of failure of the current array mast design, during its 15 year life, is minute.
Doering, Stephan; Burgmer, Markus; Heuft, Gereon; Menke, Dina; Bäumer, Brigitta; Lübking, Margit; Feldmann, Marcus; Schneider, Gudrun
2014-01-01
The assessment of personality functioning has recently become a focus of psychiatric diagnostics. The interview-based Operationalized Psychodynamic Diagnosis (OPD-2) provides a 'structure axis' for the assessment of personality functioning. One hundred twenty-four psychiatric patients were diagnosed by means of the Structured Clinical Interviews for DSM-IV (SCID-I and SCID-II), underwent OPD-2 interviews, and completed 9 questionnaires. The OPD-2 structure axis shows good interrater reliability (intraclass correlation = 0.793). Correlations between the OPD-2 structure axis domains and a priori selected questionnaire scales were of medium size and significant. Patients with a personality disorder (PD) showed significantly worse personality functioning than those without. In cluster B PD, personality functioning was more severely impaired than in cluster C PD. The OPD-2 structure axis shows good reliability as well as concurrent and discriminant validity and can be recommended for clinical use and research purposes. © 2013 S. Karger AG, Basel.
Soleimani, Mohammad Ali; Bahrami, Nasim; Yaghoobzadeh, Ameneh; Banihashemi, Hedieh; Nia, Hamid Sharif; Haghdoost, Ali Akbar
2016-01-01
Due to increasing recognition of the importance of death anxiety for understanding human nature, it is important that researchers who investigate death anxiety have reliable and valid methodology to measure. The purpose of this study was to evaluate the validity and reliability of the Persian version of Templer Death Anxiety Scale (TDAS) in family caregivers of cancer patients. A sample of 326 caregivers of cancer patients completed a 15-item questionnaire. Principal components analysis (PCA) followed by a varimax rotation was used to assess factor structure of the DAS. The construct validity of the scale was assessed using exploratory and confirmatory factor analyses. Convergent and discriminant validity were also examined. Reliability was assessed with Cronbach's alpha coefficients and construction reliability. Based on the results of the PCA and consideration of the meaning of our items, a three-factor solution, explaining 60.38% of the variance, was identified. A confirmatory factor analysis (CFA) then supported the adequacy of the three-domain structure of the DAS. Goodness-of-fit indices showed an acceptable fit overall with the full model {χ(2)(df) = 262.32 (61), χ(2)/df = 2.04 [adjusted goodness of fit index (AGFI) = 0.922, parsimonious comparative fit index (PCFI) = 0.703, normed fit Index (NFI) = 0.912, CMIN/DF = 2.048, root mean square error of approximation (RMSEA) = 0.055]}. Convergent and discriminant validity were shown with construct fulfilled. The Cronbach's alpha and construct reliability were greater than 0.70. The findings show that the Persian version of the TDAS has a three-factor structure and acceptable validity and reliability.
A new statistical framework to assess structural alignment quality using information compression
Collier, James H.; Allison, Lloyd; Lesk, Arthur M.; Garcia de la Banda, Maria; Konagurthu, Arun S.
2014-01-01
Motivation: Progress in protein biology depends on the reliability of results from a handful of computational techniques, structural alignments being one. Recent reviews have highlighted substantial inconsistencies and differences between alignment results generated by the ever-growing stock of structural alignment programs. The lack of consensus on how the quality of structural alignments must be assessed has been identified as the main cause for the observed differences. Current methods assess structural alignment quality by constructing a scoring function that attempts to balance conflicting criteria, mainly alignment coverage and fidelity of structures under superposition. This traditional approach to measuring alignment quality, the subject of considerable literature, has failed to solve the problem. Further development along the same lines is unlikely to rectify the current deficiencies in the field. Results: This paper proposes a new statistical framework to assess structural alignment quality and significance based on lossless information compression. This is a radical departure from the traditional approach of formulating scoring functions. It links the structural alignment problem to the general class of statistical inductive inference problems, solved using the information-theoretic criterion of minimum message length. Based on this, we developed an efficient and reliable measure of structural alignment quality, I-value. The performance of I-value is demonstrated in comparison with a number of popular scoring functions, on a large collection of competing alignments. Our analysis shows that I-value provides a rigorous and reliable quantification of structural alignment quality, addressing a major gap in the field. Availability: http://lcb.infotech.monash.edu.au/I-value Contact: arun.konagurthu@monash.edu Supplementary information: Online supplementary data are available at http://lcb.infotech.monash.edu.au/I-value/suppl.html PMID:25161241
A method for evaluating competency in assessment and management of suicide risk.
Hung, Erick K; Binder, Renée L; Fordwood, Samantha R; Hall, Stephen E; Cramer, Robert J; McNiel, Dale E
2012-01-01
Although health professionals increasingly are expected to be able to assess and manage patients' risk for suicide, few methods are available to evaluate this competency. This report describes development of a competency-assessment instrument for suicide risk-assessment (CAI-S), and evaluates its use in an objective structured clinical examination (OSCE). The authors developed the CAI-S on the basis of the literature on suicide risk-assessment and management, and consultation with faculty focus groups from three sites in a large academic psychiatry department. The CAI-S structures faculty ratings regarding interviewing and data collection, case formulation and presentation, treatment-planning, and documentation. To evaluate the CAI-S, 31 faculty members used it to rate the performance of 31 learners (26 psychiatric residents and 5 clinical psychology interns) who participated in an OSCE. After interviewing a standardized patient, learners presented their risk-assessment findings and treatment plans. Faculty used the CAI-S to structure feedback to the learners. In a subsidiary study of interrater reliability, six faculty members rated video-recorded suicide risk-assessments. The CAI-S showed good internal consistency, reliability, and interrater reliability. Concurrent validity was supported by the finding that CAI-S ratings were higher for senior learners than junior learners, and were higher for learners with more clinical experience with suicidal patients than learners with less clinical experience. Faculty and learners rated the method as helpful for structuring feedback and supervision. The findings support the usefulness of the CAI-S for evaluating competency in suicide risk-assessment and management.
Validity and reliability of the robotic objective structured assessment of technical skills
Siddiqui, Nazema Y.; Galloway, Michael L.; Geller, Elizabeth J.; Green, Isabel C.; Hur, Hye-Chun; Langston, Kyle; Pitter, Michael C.; Tarr, Megan E.; Martino, Martin A.
2015-01-01
Objective Objective structured assessments of technical skills (OSATS) have been developed to measure the skill of surgical trainees. Our aim was to develop an OSATS specifically for trainees learning robotic surgery. Study Design This is a multi-institutional study in eight academic training programs. We created an assessment form to evaluate robotic surgical skill through five inanimate exercises. Obstetrics/gynecology, general surgery, and urology residents, fellows, and faculty completed five robotic exercises on a standard training model. Study sessions were recorded and randomly assigned to three blinded judges who scored performance using the assessment form. Construct validity was evaluated by comparing scores between participants with different levels of surgical experience; inter- and intra-rater reliability were also assessed. Results We evaluated 83 residents, 9 fellows, and 13 faculty, totaling 105 participants; 88 (84%) were from obstetrics/gynecology. Our assessment form demonstrated construct validity, with faculty and fellows performing significantly better than residents (mean scores: 89 ± 8 faculty; 74 ± 17 fellows; 59 ± 22 residents, p<0.01). In addition, participants with more robotic console experience scored significantly higher than those with fewer prior console surgeries (p<0.01). R-OSATS demonstrated good inter-rater reliability across all five drills (mean Cronbach's α: 0.79 ± 0.02). Intra-rater reliability was also high (mean Spearman's correlation: 0.91 ± 0.11). Conclusions We developed an assessment form for robotic surgical skill that demonstrates construct validity, inter- and intra-rater reliability. When paired with standardized robotic skill drills this form may be useful to distinguish between levels of trainee performance. PMID:24807319
Tabrizi, Yousef Moghadas; Zangiabadi, Nasser; Mazhari, Shahrzad; Zolala, Farzaneh
2013-01-01
Objective Motor imagery (MI) has been recently considered as an adjunct to physical rehabilitation in patients with multiple sclerosis (MS). It is necessary to assess MI abilities and benefits in patients with MS by using a reliable tool. The Kinesthetic and Visual Imagery Questionnaire (KVIQ) was recently developed to assess MI ability in patients with stroke and other disabilities. Considering the different underlying pathologies, the present study aimed to examine the validity and reliability of the KVIQ in MS patients. Method Fifteen MS patients were assessed using the KVIQ in 2 sessions (5-14days apart) by the same examiner. In the second session, the participants also completed a revised MI questionnaire (MIQ-R) as the gold standard. Intra-class correlation coefficients (ICCs) were measured to determine test-retest reliability. Spearman's correlation analysis was performed to assess concurrent validity with the MIQ-R. Furthermore, the internal consistency (Cronbach's alpha) and factorial structure of the KVIQ were studied. Results The test-retest reliability for the KVIQ was good (ICCs: total KVIQ=0.89, visual KVIQ=0.85, and kinesthetic KVIQ=0.93), and the concurrent validity between the KVIQ and MIQ-R was good (r=0.79). The KVIQ had good internal consistency, with high Cronbach's alpha (alpha=0.84). Factorial analysis showed the bi-factorial structure of the KVIQ, which was explained by visual=57.6% and kinesthetic=32.4%. Conclusions The results of the present study revealed that the KVIQ is a valid and reliable tool for assessing MI in MS patients. PMID:24271091
Tabrizi, Yousef Moghadas; Zangiabadi, Nasser; Mazhari, Shahrzad; Zolala, Farzaneh
2013-01-01
Motor imagery (MI) has been recently considered as an adjunct to physical rehabilitation in patients with multiple sclerosis (MS). It is necessary to assess MI abilities and benefits in patients with MS by using a reliable tool. The Kinesthetic and Visual Imagery Questionnaire (KVIQ) was recently developed to assess MI ability in patients with stroke and other disabilities. Considering the different underlying pathologies, the present study aimed to examine the validity and reliability of the KVIQ in MS patients. Fifteen MS patients were assessed using the KVIQ in 2 sessions (5-14 days apart) by the same examiner. In the second session, the participants also completed a revised MI questionnaire (MIQ-R) as the gold standard. Intra-class correlation coefficients (ICCs) were measured to determine test-retest reliability. Spearman's correlation analysis was performed to assess concurrent validity with the MIQ-R. Furthermore, the internal consistency (Cronbach's alpha) and factorial structure of the KVIQ were studied. The test-retest reliability for the KVIQ was good (ICCs: total KVIQ=0.89, visual KVIQ=0.85, and kinesthetic KVIQ=0.93), and the concurrent validity between the KVIQ and MIQ-R was good (r=0.79). The KVIQ had good internal consistency, with high Cronbach's alpha (alpha=0.84). Factorial analysis showed the bi-factorial structure of the KVIQ, which was explained by visual=57.6% and kinesthetic=32.4%. The results of the present study revealed that the KVIQ is a valid and reliable tool for assessing MI in MS patients.
ERIC Educational Resources Information Center
Meyer, Ilan H.; And Others
1996-01-01
Structured clinical interviews concerning childhood histories of physical and sexual abuse with 70 mentally ill women at 2 times found test-retest reliability of .63 for physical abuse and .82 for sexual abuse. Validity, assessed as consistency with an independent clinical assessment, showed 75% agreement for physical abuse and 93% agreement for…
ERIC Educational Resources Information Center
Dedrick, Robert F.; Shaunessy-Dedrick, Elizabeth; Suldo, Shannon M.; Ferron, John M.
2015-01-01
In two studies (ns = 312 and 1,149) with 9- to 12-grade students in pre-International Baccalaureate (IB) and IB Diploma programs, we evaluated the reliability, factor structure, measurement invariance, and criterion-related validity of the scores from the School Attitude Assessment Survey-Revised (SAAS-R). Reliabilities of the five SAAS-R subscale…
Mercier, Catherine; Roche, Sylvain; Gaillard, Ségolène; Kassai, Behrouz; Arzimanoglou, Alexis; Herbillon, Vania; Roy, Pascal; Rheims, Sylvain
2016-05-01
Attention deficit hyperactivity disorder (ADHD) is a well-known comorbidity in children with epilepsy. In English-speaking countries, the scores of the original ADHD-rating scale IV are currently used as main outcomes in various clinical trials in children with epilepsy. In French-speaking countries, several French versions are in use though none has been fully validated yet. We sought here for a partial validation of a French version of the ADHD-RS IV regarding construct validity, internal consistency (i.e., scale reliability), item reliability, and responsiveness in a group of French children with ADHD and epilepsy. The study involved 167 children aged 6-15years in 10 French neuropediatric units. The factorial structure and item reliability were assessed with a confirmatory factorial analysis for ordered categorical variables. The dimensions' internal consistency was assessed with Guttman's lambda 6 coefficient. The responsiveness was assessed by the change in score under methylphenidate and in comparison with a control group. The results confirmed the original two-dimensional factorial structure (inattention, hyperactivity/impulsivity) and showed a satisfactory reliability of most items, a good dimension internal consistency, and a good responsiveness of the total score and the two subscores. The studied French version of the ADHD-RS IV is thus validated regarding construct validity, reliability, and responsiveness. It can now be used in French-speaking countries in clinical trials of treatments involving children with ADHD and epilepsy. The full validation requires further investigations. Copyright © 2016 Elsevier Inc. All rights reserved.
Assessment of concrete damage and strength degradation caused by reinforcement corrosion
NASA Astrophysics Data System (ADS)
Nepal, Jaya; Chen, Hua-Peng
2015-07-01
Structural performance deterioration of reinforced concrete structures has been extensively investigated, but very limited studies have been carried out to investigate the effect of reinforcement corrosion on time-dependent reliability with consideration of the influence of mechanical characteristics of the bond interface due to corrosion. This paper deals with how corrosion in reinforcement creates different types of defects in concrete structure and how they are responsible for the structural capacity deterioration of corrosion affected reinforced concrete structures during their service life. Cracking in cover concrete due to reinforcement corrosion is investigated by using rebar-concrete model and realistic concrete properties. The flexural strength deterioration is analytically predicted on the basis of bond strength evolution due to reinforcement corrosion, which is examined by the experimental data available. The time-dependent reliability analysis is undertaken to calculate the life time structural reliability of corrosion damaged concrete structures by stochastic deterioration modelling of reinforced concrete. The results from the numerical example show that the proposed approach is capable of evaluating the damage caused by reinforcement corrosion and also predicting the structural reliability of concrete structures during their lifecycle.
A 2-year study of Gram stain competency assessment in 40 clinical laboratories.
Goodyear, Nancy; Kim, Sara; Reeves, Mary; Astion, Michael L
2006-01-01
We used a computer-based competency assessment tool for Gram stain interpretation to assess the performance of 278 laboratory staff from 40 laboratories on 40 multiple-choice questions. We report test reliability, mean scores, median, item difficulty, discrimination, and analysis of the highest- and lowest-scoring questions. The questions were reliable (KR-20 coefficient, 0.80). Overall mean score was 88% (range, 63%-98%). When categorized by cell type, the means were host cells, 93%; other cells (eg, yeast), 92%; gram-positive, 90%; and gram-negative, 88%. When categorized by type of interpretation, the means were other (eg, underdecolorization), 92%; identify by structure (eg, bacterial morphologic features), 91%; and identify by name (eg, genus and species), 87%. Of the 6 highest-scoring questions (mean scores, > or = 99%) 5 were identify by structure and 1 was identify by name. Of the 6 lowest-scoring questions (mean scores, < 75%) 5 were gram-negative and 1 was host cells. By type of interpretation, 2 were identify by structure and 4 were identify by name. Computer-based Gram stain competency assessment examinations are reliable. Our analysis helps laboratories identify areas for continuing education in Gram stain interpretation and will direct future revisions of the tests.
Probabilistic sizing of laminates with uncertainties
NASA Technical Reports Server (NTRS)
Shah, A. R.; Liaw, D. G.; Chamis, C. C.
1993-01-01
A reliability based design methodology for laminate sizing and configuration for a special case of composite structures is described. The methodology combines probabilistic composite mechanics with probabilistic structural analysis. The uncertainties of constituent materials (fiber and matrix) to predict macroscopic behavior are simulated using probabilistic theory. Uncertainties in the degradation of composite material properties are included in this design methodology. A multi-factor interaction equation is used to evaluate load and environment dependent degradation of the composite material properties at the micromechanics level. The methodology is integrated into a computer code IPACS (Integrated Probabilistic Assessment of Composite Structures). Versatility of this design approach is demonstrated by performing a multi-level probabilistic analysis to size the laminates for design structural reliability of random type structures. The results show that laminate configurations can be selected to improve the structural reliability from three failures in 1000, to no failures in one million. Results also show that the laminates with the highest reliability are the least sensitive to the loading conditions.
Krejsa, Martin; Janas, Petr; Yilmaz, Işık; Marschalko, Marian; Bouchal, Tomas
2013-01-01
The load-carrying system of each construction should fulfill several conditions which represent reliable criteria in the assessment procedure. It is the theory of structural reliability which determines probability of keeping required properties of constructions. Using this theory, it is possible to apply probabilistic computations based on the probability theory and mathematic statistics. Development of those methods has become more and more popular; it is used, in particular, in designs of load-carrying structures with the required level or reliability when at least some input variables in the design are random. The objective of this paper is to indicate the current scope which might be covered by the new method—Direct Optimized Probabilistic Calculation (DOProC) in assessments of reliability of load-carrying structures. DOProC uses a purely numerical approach without any simulation techniques. This provides more accurate solutions to probabilistic tasks, and, in some cases, such approach results in considerably faster completion of computations. DOProC can be used to solve efficiently a number of probabilistic computations. A very good sphere of application for DOProC is the assessment of the bolt reinforcement in the underground and mining workings. For the purposes above, a special software application—“Anchor”—has been developed. PMID:23935412
Wei, Meifen; Russell, Daniel W; Mallinckrodt, Brent; Vogel, David L
2007-04-01
We developed a 12-item, short form of the Experiences in Close Relationship Scale (ECR; Brennan, Clark, & Shaver, 1998) across 6 studies. In Study 1, we examined the reliability and factor structure of the measure. In Studies 2 and 3, we cross-validated the reliability, factor structure, and validity of the short form measure; whereas in Study 4, we examined test-retest reliability over a 1-month period. In Studies 5 and 6, we further assessed the reliability, factor structure, and validity of the short version of the ECR when administered as a stand-alone instrument. Confirmatory factor analyses indicated that 2 factors, labeled Anxiety and Avoidance, provided a good fit to the data after removing the influence of response sets. We found validity to be equivalent for the short and the original versions of the ECR across studies. Finally, the results were comparable when we embedded the short form within the original version of the ECR and when we administered it as a stand-alone measure.
Pérez de los Cobos, José; Trujols, Joan; Siñol, Núria; Vasconcelos e Rego, Lisiane; Iraurgi, Ioseba; Batlle, Francesca
2014-09-01
Reliable and valid assessment of cocaine withdrawal is relevant for treating cocaine-dependent patients. This study examined the psychometric properties of the Spanish version of the Cocaine Selective Severity Assessment (CSSA), an instrument that measures cocaine withdrawal. Participants were 170 cocaine-dependent inpatients receiving detoxification treatment. Principal component analysis revealed a 4-factor structure for CSSA that included the following components: 'Cocaine Craving and Psychological Distress', 'Lethargy', 'Carbohydrate Craving and Irritability', and 'Somatic Depressive Symptoms'. These 4 components accounted for 56.0% of total variance. Internal reliability for these components ranged from unacceptable to good (Chronbach's alpha: 0.87, 0.65, 0.55, and 0.22, respectively). All components except Somatic Depressive Symptoms presented concurrent validity with cocaine use. In summary, while some properties of the Spanish version of the CSSA are satisfactory, such as interpretability of factor structure and test-retest reliability, other properties, such as internal reliability and concurrent validity of some factors, are inadequate. Copyright © 2014 Elsevier Inc. All rights reserved.
Moran, Galia S; Zisman-Ilani, Yaara; Garber-Epstein, Paula; Roe, David
2014-03-01
Recovery is supported by relationships that are characterized by human centeredness, empowerment and a hopeful approach. The Recovery Promoting Relationships Scale (RPRS; Russinova, Rogers, & Ellison, 2006) assesses consumer-provider relationships from the consumer perspective. Here we present the adaptation and psychometric assessment of a Hebrew version of the RPRS. The RPRS was translated to Hebrew (RPRS-Heb) using multiple strategies to assure conceptual soundness. Then 216 mental health consumers were administered the RPRS-Heb as part of a larger project initiative implementing illness management and recovery intervention (IMR) in community settings. Psychometric testing included assessment of the factor structure, reliability, and validity using the Hope Scale, the Working Alliance Inventory, and the Recovery Assessment Scale. The RPRS-Heb factor structure replicated the two factor structures found in the original scale with minor exceptions. Reliability estimates were good: Cronbach's alpha for the total scale was 0.94. An estimate of 0.93 for the Recovery-Promoting Strategies factor, and 0.86 for the Core Relationship. Concurrent validity was confirmed using the Working Alliance Scale (rp = .51, p < .001) and the Hope Scale (rp = .43, p < .001). Criterion validity was examined using the Recovery Assessment Scale (rp = .355, p < .05). The study yielded a 23-item RPRS-Heb version with a psychometrically sound factor structure, satisfactory reliability, and concurrent validity tested against the Hope, Alliance, and Recovery Assessment scales. Outcomes are discussed in the context of the original scale properties and a similar Dutch initiative. The RPRS-Heb can serve as a valuable tool for studying recovery promoting relationships with Hebrew speaking population.
Reliability-based structural optimization: A proposed analytical-experimental study
NASA Technical Reports Server (NTRS)
Stroud, W. Jefferson; Nikolaidis, Efstratios
1993-01-01
An analytical and experimental study for assessing the potential of reliability-based structural optimization is proposed and described. In the study, competing designs obtained by deterministic and reliability-based optimization are compared. The experimental portion of the study is practical because the structure selected is a modular, actively and passively controlled truss that consists of many identical members, and because the competing designs are compared in terms of their dynamic performance and are not destroyed if failure occurs. The analytical portion of this study is illustrated on a 10-bar truss example. In the illustrative example, it is shown that reliability-based optimization can yield a design that is superior to an alternative design obtained by deterministic optimization. These analytical results provide motivation for the proposed study, which is underway.
Assessment of Technogenic Accident Risk of Industrial Building Structures
NASA Astrophysics Data System (ADS)
Baiburin, D. A.; Baiburin, A. Kh
2017-11-01
A methodology for assessing the risk of an industrial building accident was developed taking into account the damage caused by various localization of collapse. Before the beginning of the survey of a facility technical condition, groups including the same type of building structures are selected. Further, assessment is made for the reduction in their load-carrying capacity from the strength and stability conditions taking into account defects. The characteristics of the influence of defects and structural damage on a building safety is the degree of compliance with the standards expressed by the reliability level. Reliability levels assignment is carried out on the basis of calculations, operating experience and inspection of a particular type of structure according to the formalized rules. The risk of collapse according to a separate scenario is calculated for structures that are capable and incapable of causing a progressive ossification. The results of the technique application are based on the analysis of the accident risk at the welding shop “Vysota (Height) 239” of the Chelyabinsk Pipe Rolling Plant.
Factor structure, validity and reliability of the Cambridge Worry Scale in a pregnant population.
Green, Josephine M; Kafetsios, Konstantinos; Statham, Helen E; Snowdon, Claire M
2003-11-01
This article presents the Cambridge Worry Scale (CWS), a content-based measure for assessing worries, and discusses its psychometric properties based on a longitudinal study of 1,207 pregnant women. Principal components analysis revealed a four-factor structure of women's concerns during pregnancy: socio-medical, own health, socio-economic and relational. The measure demonstrated good reliability and validity. Total CWS scores were strongly associated with state and trait anxiety (convergent validity) but also had significant and unique predictive value for mood outcomes (discriminant validity). The CWS discriminated better between women with different reproductive histories than measures of state and trait anxiety. We conclude that the CWS is a reliable and valid tool for assessing the extent and content of worries in specific situations.
Assessment of mesh simplification algorithm quality
NASA Astrophysics Data System (ADS)
Roy, Michael; Nicolier, Frederic; Foufou, S.; Truchetet, Frederic; Koschan, Andreas; Abidi, Mongi A.
2002-03-01
Traditionally, medical geneticists have employed visual inspection (anthroposcopy) to clinically evaluate dysmorphology. In the last 20 years, there has been an increasing trend towards quantitative assessment to render diagnosis of anomalies more objective and reliable. These methods have focused on direct anthropometry, using a combination of classical physical anthropology tools and new instruments tailor-made to describe craniofacial morphometry. These methods are painstaking and require that the patient remain still for extended periods of time. Most recently, semiautomated techniques (e.g., structured light scanning) have been developed to capture the geometry of the face in a matter of seconds. In this paper, we establish that direct anthropometry and structured light scanning yield reliable measurements, with remarkably high levels of inter-rater and intra-rater reliability, as well as validity (contrasting the two methods).
DOE Office of Scientific and Technical Information (OSTI.GOV)
Johnson, D.R.; McClung, R.W.; Janney, M.A.
1987-08-01
A needs assessment was performed for nondestructive testing and materials characterization to achieve improved reliability in ceramic materials for heat engine applications. Raw materials, green state bodies, and sintered ceramics were considered. The overall approach taken to improve reliability of structural ceramics requires key inspections throughout the fabrication flowsheet, including raw materials, greed state, and dense parts. The applications of nondestructive inspection and characterization techniques to ceramic powders and other raw materials, green ceramics, and sintered ceramics are discussed. The current state of inspection technology is reviewed for all identified attributes and stages of a generalized flowsheet for advanced structuralmore » ceramics, and research and development requirements are identified and listed in priority order. 164 refs., 3 figs.« less
Falloon, I R H; Mizuno, M; Murakami, M; Roncone, R; Unoka, Z; Harangozo, J; Pullman, J; Gedye, R; Held, T; Hager, B; Erickson, D; Burnett, K
2005-01-01
To develop a reliable standardized assessment of psychiatric symptoms for use in clinical practice. A 50-item interview, the Current Psychiatric State 50 (CPS-50), was used to assess 237 patients with a range of psychiatric diagnoses. Ratings were made by interviewers after a 2-day training. Comparisons of inter-rater reliability on each item and on eight clinical subscales were made across four international centres and between psychiatrists and non-psychiatrists. A principal components analysis was used to validate these clinical scales. Acceptable inter-rater reliability (intra-class coefficient > 0.80) was found for 46 of the 50 items, and for all eight subscales. There was no difference between centres or between psychiatrists and non-psychiatrists. The principal components analysis factors were similar to the clinical scales. The CPS-50 is a reliable standardized assessment of current mental status that can be used in clinical practice by all mental health professionals after brief training. Blackwell Munksgaard 2004
Sharif Nia, Hamid; Pahlevan Sharif, Saeed; Koocher, Gerald P; Yaghoobzadeh, Ameneh; Haghdoost, Ali Akbar; Mar Win, Ma Thin; Soleimani, Mohammad Ali
2017-01-01
This study aimed to evaluate the validity and reliability of the Persian version of Death Anxiety Scale-Extended (DAS-E). A total of 507 patients with end-stage renal disease completed the DAS-E. The factor structure of the scale was evaluated using exploratory factor analysis with an oblique rotation and confirmatory factor analysis. The content and construct validity of the DAS-E were assessed. Average variance extracted, maximum shared squared variance, and average shared squared variance were estimated to assess discriminant and convergent validity. Reliability was assessed using Cronbach's alpha coefficient (α = .839 and .831), composite reliability (CR = .845 and .832), Theta (θ = .893 and .867), and McDonald Omega (Ω = .796 and .743). The analysis indicated a two-factor solution. Reliability and discriminant validity of the factors was established. Findings revealed that the present scale was a valid and reliable instrument that can be used in assessment of death anxiety in Iranian patients with end-stage renal disease.
NASA Astrophysics Data System (ADS)
Flanigan, Katherine A.; Johnson, Nephi R.; Hou, Rui; Ettouney, Mohammed; Lynch, Jerome P.
2017-04-01
The ability to quantitatively assess the condition of railroad bridges facilitates objective evaluation of their robustness in the face of hazard events. Of particular importance is the need to assess the condition of railroad bridges in networks that are exposed to multiple hazards. Data collected from structural health monitoring (SHM) can be used to better maintain a structure by prompting preventative (rather than reactive) maintenance strategies and supplying quantitative information to aid in recovery. To that end, a wireless monitoring system is validated and installed on the Harahan Bridge which is a hundred-year-old long-span railroad truss bridge that crosses the Mississippi River near Memphis, TN. This bridge is exposed to multiple hazards including scour, vehicle/barge impact, seismic activity, and aging. The instrumented sensing system targets non-redundant structural components and areas of the truss and floor system that bridge managers are most concerned about based on previous inspections and structural analysis. This paper details the monitoring system and the analytical method for the assessment of bridge condition based on automated data-driven analyses. Two primary objectives of monitoring the system performance are discussed: 1) monitoring fatigue accumulation in critical tensile truss elements; and 2) monitoring the reliability index values associated with sub-system limit states of these members. Moreover, since the reliability index is a scalar indicator of the safety of components, quantifiable condition assessment can be used as an objective metric so that bridge owners can make informed damage mitigation strategies and optimize resource management on single bridge or network levels.
NASA Astrophysics Data System (ADS)
Sil, Arjun; Longmailai, Thaihamdau
2017-09-01
The lateral displacement of Reinforced Concrete (RC) frame building during an earthquake has an important impact on the structural stability and integrity. However, seismic analysis and design of RC building needs more concern due to its complex behavior as the performance of the structure links to the features of the system having many influencing parameters and other inherent uncertainties. The reliability approach takes into account the factors and uncertainty in design influencing the performance or response of the structure in which the safety level or the probability of failure could be ascertained. This present study, aims to assess the reliability of seismic performance of a four storey residential RC building seismically located in Zone-V as per the code provisions given in the Indian Standards IS: 1893-2002. The reliability assessment performed by deriving an explicit expression for maximum roof-lateral displacement as a failure function by regression method. A total of 319, four storey RC buildings were analyzed by linear static method using SAP2000. However, the change in the lateral-roof displacement with the variation of the parameters (column dimension, beam dimension, grade of concrete, floor height and total weight of the structure) was observed. A generalized relation established by regression method which could be used to estimate the expected lateral displacement owing to those selected parameters. A comparison made between the displacements obtained from analysis with that of the equation so formed. However, it shows that the proposed relation could be used directly to determine the expected maximum lateral displacement. The data obtained from the statistical computations was then used to obtain the probability of failure and the reliability.
Integrated performance and reliability specification for digital avionics systems
NASA Technical Reports Server (NTRS)
Brehm, Eric W.; Goettge, Robert T.
1995-01-01
This paper describes an automated tool for performance and reliability assessment of digital avionics systems, called the Automated Design Tool Set (ADTS). ADTS is based on an integrated approach to design assessment that unifies traditional performance and reliability views of system designs, and that addresses interdependencies between performance and reliability behavior via exchange of parameters and result between mathematical models of each type. A multi-layer tool set architecture has been developed for ADTS that separates the concerns of system specification, model generation, and model solution. Performance and reliability models are generated automatically as a function of candidate system designs, and model results are expressed within the system specification. The layered approach helps deal with the inherent complexity of the design assessment process, and preserves long-term flexibility to accommodate a wide range of models and solution techniques within the tool set structure. ADTS research and development to date has focused on development of a language for specification of system designs as a basis for performance and reliability evaluation. A model generation and solution framework has also been developed for ADTS, that will ultimately encompass an integrated set of analytic and simulated based techniques for performance, reliability, and combined design assessment.
Structural Test Laboratory | Water Power | NREL
Structural Test Laboratory Structural Test Laboratory NREL engineers design and configure structural components can validate models, demonstrate system reliability, inform design margins, and assess , including mass and center of gravity, to ensure compliance with design goals Dynamic Characterization Use
Beard, J D; Marriott, J; Purdie, H; Crossley, J
2011-01-01
To compare user satisfaction and acceptability, reliability and validity of three different methods of assessing the surgical skills of trainees by direct observation in the operating theatre across a range of different surgical specialties and index procedures. A 2-year prospective, observational study in the operating theatres of three teaching hospitals in Sheffield. The assessment methods were procedure-based assessment (PBA), Objective Structured Assessment of Technical Skills (OSATS) and Non-technical Skills for Surgeons (NOTSS). The specialties were obstetrics and gynaecology (O&G) and upper gastrointestinal, colorectal, cardiac, vascular and orthopaedic surgery. Two to four typical index procedures were selected from each specialty. Surgical trainees were directly observed performing typical index procedures and assessed using a combination of two of the three methods (OSATS or PBA and NOTSS for O&G, PBA and NOTSS for the other specialties) by the consultant clinical supervisor for the case and the anaesthetist and/or scrub nurse, as well as one or more independent assessors from the research team. Information on user satisfaction and acceptability of each assessment method from both assessor and trainee perspectives was obtained from structured questionnaires. The reliability of each method was measured using generalisability theory. Aspects of validity included the internal structure of each tool and correlation between tools, construct validity, predictive validity, interprocedural differences, the effect of assessor designation and the effect of assessment on performance. Of the 558 patients who were consented, a total of 437 (78%) cases were included in the study: 51 consultant clinical supervisors, 56 anaesthetists, 39 nurses, 2 surgical care practitioners and 4 independent assessors provided 1635 assessments on 85 trainees undertaking the 437 cases. A total of 749 PBAs, 695 NOTSS and 191 OSATSs were performed. Non-O&G clinical supervisors and trainees provided mixed, but predominantly positive, responses about a range of applications of PBA. Most felt that PBA was important in surgical education, and would use it again in the future and did not feel that it added time to the operating list. The overall satisfaction of O&G clinical supervisors and trainees with OSATS was not as high, and a majority of those who used both preferred PBA. A majority of anaesthetists and nurses felt that NOTSS allowed them to rate interpersonal skills (communication, teamwork and leadership) more easily than cognitive skills (situation awareness and decision-making), that it had formative value and that it was a valuable adjunct to the assessment of technical skills. PBA demonstrated high reliability (G > 0.8 for only three assessor judgements on the same index procedure). OSATS had lower reliability (G > 0.8 for five assessor judgements on the same index procedure). Both were less reliable on a mix of procedures because of strong procedure-specific factors. A direct comparison of PBA between O&G and non-O&G cases showed a striking difference in reliability. Within O&G, a good level of reliability (G > 0.8) could not be obtained using a feasible number of assessments. Conversely, the reliability within non-O&G cases was exceptionally high, with only two assessor judgements being required. The reasons for this difference probably include the more summative purpose of assessment in O&G and the much higher proportion of O&G trainees in this study with training concerns (42% vs 4%). The reliability of NOTSS was lower than that for PBA. Reliability for the same procedure (G > 0.8) required six assessor judgements. However, as procedure-specific factors exerted a lesser influence on NOTSS, reliability on a mix of procedures could be achieved using only eight assessor judgements. NOTSS also demonstrated a valid internal structure. The strongest correlations between NOTSS and PBA or OSATS were in the 'decision-making' domain. PBA and NOTSS showed better construct validity than OSATS, the year of training and the number of recent index procedures performed being significant independent predictors of performance. There was little variation in scoring between different procedures or different designations of assessor. The results suggest that PBA is a reliable and acceptable method of assessing surgical skills, with good construct validity. Specialties that use OSATS may wish to consider changing the design or switching to PBA. Whatever workplace-based assessment method is used, the purpose, timing and frequency of assessment require detailed guidance. NOTSS is a promising tool for the assessment of non-technical skills, and surgical specialties may wish to consider its inclusion in their assessment framework. Further research is required into the use of health-care professionals other than consultant surgeons to assess trainees, the relationship between performance and experience, the educational impact of assessment and the additional value of video recording.
Estimates Of The Orbiter RSI Thermal Protection System Thermal Reliability
NASA Technical Reports Server (NTRS)
Kolodziej, P.; Rasky, D. J.
2002-01-01
In support of the Space Shuttle Orbiter post-flight inspection, structure temperatures are recorded at selected positions on the windward, leeward, starboard and port surfaces. Statistical analysis of this flight data and a non-dimensional load interference (NDLI) method are used to estimate the thermal reliability at positions were reusable surface insulation (RSI) is installed. In this analysis, structure temperatures that exceed the design limit define the critical failure mode. At thirty-three positions the RSI thermal reliability is greater than 0.999999 for the missions studied. This is not the overall system level reliability of the thermal protection system installed on an Orbiter. The results from two Orbiters, OV-102 and OV-105, are in good agreement. The original RSI designs on the OV-102 Orbital Maneuvering System pods, which had low reliability, were significantly improved on OV-105. The NDLI method was also used to estimate thermal reliability from an assessment of TPS uncertainties that was completed shortly before the first Orbiter flight. Results fiom the flight data analysis and the pre-flight assessment agree at several positions near each other. The NDLI method is also effective for optimizing RSI designs to provide uniform thermal reliability on the acreage surface of reusable launch vehicles.
Validation of cryo-EM structure of IP₃R1 channel.
Murray, Stephen C; Flanagan, John; Popova, Olga B; Chiu, Wah; Ludtke, Steven J; Serysheva, Irina I
2013-06-04
About a decade ago, three electron cryomicroscopy (cryo-EM) single-particle reconstructions of IP3R1 were reported at low resolution. It was disturbing that these structures bore little similarity to one another, even at the level of quaternary structure. Recently, we published an improved structure of IP3R1 at ∼1 nm resolution. However, this structure did not bear any resemblance to any of the three previously published structures, leading to the question of why the structure should be considered more reliable than the original three. Here, we apply several methods, including class-average/map comparisons, tilt-pair validation, and use of multiple refinement software packages, to give strong evidence for the reliability of our recent structure. The map resolution and feature resolvability are assessed with the gold standard criterion. This approach is generally applicable to assessing the validity of cryo-EM maps of other molecular machines. Copyright © 2013 Elsevier Ltd. All rights reserved.
Gao, L; Mao, C; Yu, G Y; Peng, X
2016-10-09
Objective: To translate the adult comorbidity evaluation-27(ACE-27) index authored by professor JF Piccirillo into Chinese and for the purpose of assessing the possible impact of comorbidity on survival of oral cancer patients and improving cancer staging. Methods: The translation included the following steps, obtaining permission from professor Piccirillo, translation, back translation, language modification, adjusted by the advice from the professors of oral and maxillofacial surgery. The test population included 154 patients who were admitted to Peking University of Stomatology during March 2011. Questionnaire survey was conducted on these patients. Retest of reliability, internal consistency reliability, content validity, and structure validity were performed. Results: The simplified Chinese ACE-27 index was established. The Cronbach's α was 0.821 in the internal consistency reliability test. The Kaiser-Meyer-Olkin (KMO) value of 8 items was 0.859 in the structure validity test. Conclusions: The simplified Chinese ACE-27 index has good feasibility and reliability. It is useful to assess the comorbidity of oral cancer patients.
Elaboration and Validation of the Medication Prescription Safety Checklist 1
Pires, Aline de Oliveira Meireles; Ferreira, Maria Beatriz Guimarães; do Nascimento, Kleiton Gonçalves; Felix, Márcia Marques dos Santos; Pires, Patrícia da Silva; Barbosa, Maria Helena
2017-01-01
ABSTRACT Objective: to elaborate and validate a checklist to identify compliance with the recommendations for the structure of medication prescriptions, based on the Protocol of the Ministry of Health and the Brazilian Health Surveillance Agency. Method: methodological research, conducted through the validation and reliability analysis process, using a sample of 27 electronic prescriptions. Results: the analyses confirmed the content validity and reliability of the tool. The content validity, obtained by expert assessment, was considered satisfactory as it covered items that represent the compliance with the recommendations regarding the structure of the medication prescriptions. The reliability, assessed through interrater agreement, was excellent (ICC=1.00) and showed perfect agreement (K=1.00). Conclusion: the Medication Prescription Safety Checklist showed to be a valid and reliable tool for the group studied. We hope that this study can contribute to the prevention of adverse events, as well as to the improvement of care quality and safety in medication use. PMID:28793128
NASA Technical Reports Server (NTRS)
Lee, Alice T.; Gunn, Todd; Pham, Tuan; Ricaldi, Ron
1994-01-01
This handbook documents the three software analysis processes the Space Station Software Analysis team uses to assess space station software, including their backgrounds, theories, tools, and analysis procedures. Potential applications of these analysis results are also presented. The first section describes how software complexity analysis provides quantitative information on code, such as code structure and risk areas, throughout the software life cycle. Software complexity analysis allows an analyst to understand the software structure, identify critical software components, assess risk areas within a software system, identify testing deficiencies, and recommend program improvements. Performing this type of analysis during the early design phases of software development can positively affect the process, and may prevent later, much larger, difficulties. The second section describes how software reliability estimation and prediction analysis, or software reliability, provides a quantitative means to measure the probability of failure-free operation of a computer program, and describes the two tools used by JSC to determine failure rates and design tradeoffs between reliability, costs, performance, and schedule.
Stewart, Regan W; Tuerk, Peter W; Metzger, Isha W; Davidson, Tatiana M; Young, John
2016-02-01
Structured diagnostic interviews are widely considered to be the optimal method of assessing symptoms of posttraumatic stress; however, few clinicians report using structured assessments to guide clinical practice. One commonly cited impediment to these assessment approaches is the amount of time required for test administration and interpretation. Empirically keyed methods to reduce the administration time of structured assessments may be a viable solution to increase the use of standardized and reliable diagnostic tools. Thus, the present research conducted an initial feasibility study using a sample of treatment-seeking military veterans (N = 1,517) to develop a truncated assessment protocol based on the Clinician-Administered Posttraumatic Stress Disorder (PTSD) Scale (CAPS). Decision-tree analysis was utilized to identify a subset of predictor variables among the CAPS items that were most predictive of a diagnosis of PTSD. The algorithm-driven, atheoretical sequence of questions reduced the number of items administered by more than 75% and classified the validation sample at 92% accuracy. These results demonstrated the feasibility of developing a protocol to assess PTSD in a way that imposes little assessment burden while still providing a reliable categorization. (c) 2016 APA, all rights reserved).
Uncertainty and Intelligence in Computational Stochastic Mechanics
NASA Technical Reports Server (NTRS)
Ayyub, Bilal M.
1996-01-01
Classical structural reliability assessment techniques are based on precise and crisp (sharp) definitions of failure and non-failure (survival) of a structure in meeting a set of strength, function and serviceability criteria. These definitions are provided in the form of performance functions and limit state equations. Thus, the criteria provide a dichotomous definition of what real physical situations represent, in the form of abrupt change from structural survival to failure. However, based on observing the failure and survival of real structures according to the serviceability and strength criteria, the transition from a survival state to a failure state and from serviceability criteria to strength criteria are continuous and gradual rather than crisp and abrupt. That is, an entire spectrum of damage or failure levels (grades) is observed during the transition to total collapse. In the process, serviceability criteria are gradually violated with monotonically increasing level of violation, and progressively lead into the strength criteria violation. Classical structural reliability methods correctly and adequately include the ambiguity sources of uncertainty (physical randomness, statistical and modeling uncertainty) by varying amounts. However, they are unable to adequately incorporate the presence of a damage spectrum, and do not consider in their mathematical framework any sources of uncertainty of the vagueness type. Vagueness can be attributed to sources of fuzziness, unclearness, indistinctiveness, sharplessness and grayness; whereas ambiguity can be attributed to nonspecificity, one-to-many relations, variety, generality, diversity and divergence. Using the nomenclature of structural reliability, vagueness and ambiguity can be accounted for in the form of realistic delineation of structural damage based on subjective judgment of engineers. For situations that require decisions under uncertainty with cost/benefit objectives, the risk of failure should depend on the underlying level of damage and the uncertainties associated with its definition. A mathematical model for structural reliability assessment that includes both ambiguity and vagueness types of uncertainty was suggested to result in the likelihood of failure over a damage spectrum. The resulting structural reliability estimates properly represent the continuous transition from serviceability to strength limit states over the ultimate time exposure of the structure. In this section, a structural reliability assessment method based on a fuzzy definition of failure is suggested to meet these practical needs. A failure definition can be developed to indicate the relationship between failure level and structural response. In this fuzzy model, a subjective index is introduced to represent all levels of damage (or failure). This index can be interpreted as either a measure of failure level or a measure of a degree of belief in the occurrence of some performance condition (e.g., failure). The index allows expressing the transition state between complete survival and complete failure for some structural response based on subjective evaluation and judgment.
Sharif Nia, Hamid; Pahlevan Sharif, Saeed; Boyle, Christopher; Yaghoobzadeh, Ameneh; Tahmasbi, Bahram; Rassool, G Hussein; Taebei, Mozhgan; Soleimani, Mohammad Ali
2018-04-01
This study aimed to determine the factor structure of the spiritual well-being among a sample of the Iranian veterans. In this methodological research, 211 male veterans of Iran-Iraq warfare completed the Paloutzian and Ellison spiritual well-being scale. Maximum likelihood (ML) with oblique rotation was used to assess domain structure of the spiritual well-being. The construct validity of the scale was assessed using confirmatory factor analysis (CFA), convergent validity, and discriminant validity. Reliability was evaluated with Cronbach's alpha, Theta (θ), and McDonald Omega (Ω) coefficients, intra-class correlation coefficient (ICC), and construct reliability (CR). Results of ML and CFA suggested three factors which were labeled "relationship with God," "belief in fate and destiny," and "life optimism." The ICC, coefficients of the internal consistency, and CR were >.7 for the factors of the scale. Convergent validity and discriminant validity did not fulfill the requirements. The Persian version of spiritual well-being scale demonstrated suitable validity and reliability among the veterans of Iran-Iraq warfare.
A two-factor theory for concussion assessment using ImPACT: memory and speed.
Schatz, Philip; Maerlender, Arthur
2013-12-01
We present the initial validation of a two-factor structure of Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) using ImPACT composite scores and document the reliability and validity of this factor structure. Factor analyses were conducted for baseline (N = 21,537) and post-concussion (N = 560) data, yielding "Memory" (Verbal and Visual) and "Speed" (Visual Motor Speed and Reaction Time) Factors; inclusion of Total Symptom Scores resulted in a third discrete factor. Speed and Memory z-scores were calculated, and test-retest reliability (using intra-class correlation coefficients) at 1 month (0.88/0.81), 1 year (0.85/0.75), and 2 years (0.76/0.74) were higher than published data using Composite scores. Speed and Memory scores yielded 89% sensitivity and 70% specificity, which was higher than composites (80%/62%) and comparable with subscales (91%/69%). This emergent two-factor structure has improved test-retest reliability with no loss of sensitivity/specificity and may improve understanding and interpretability of ImPACT test results.
NASA Technical Reports Server (NTRS)
Sobel, Larry; Buttitta, Claudio; Suarez, James
1993-01-01
Probabilistic predictions based on the Integrated Probabilistic Assessment of Composite Structures (IPACS) code are presented for the material and structural response of unnotched and notched, 1M6/3501-6 Gr/Ep laminates. Comparisons of predicted and measured modulus and strength distributions are given for unnotched unidirectional, cross-ply, and quasi-isotropic laminates. The predicted modulus distributions were found to correlate well with the test results for all three unnotched laminates. Correlations of strength distributions for the unnotched laminates are judged good for the unidirectional laminate and fair for the cross-ply laminate, whereas the strength correlation for the quasi-isotropic laminate is deficient because IPACS did not yet have a progressive failure capability. The paper also presents probabilistic and structural reliability analysis predictions for the strain concentration factor (SCF) for an open-hole, quasi-isotropic laminate subjected to longitudinal tension. A special procedure was developed to adapt IPACS for the structural reliability analysis. The reliability results show the importance of identifying the most significant random variables upon which the SCF depends, and of having accurate scatter values for these variables.
Görtelmeyer, Roman; Schmidt, Jürgen; Suckfüll, Markus; Jastreboff, Pawel; Gebauer, Alexander; Krüger, Hagen; Wittmann, Werner
2011-08-01
To evaluate the reliability, dimensionality, predictive validity, construct validity, and sensitivity to change of the THI-12 total and sub-scales as diagnostic aids to describe and quantify tinnitus-evoked reactions and evaluate treatment efficacy. Explorative analysis of the German tinnitus handicap inventory (THI-12) to assess potential sensitivity to tinnitus therapy in placebo-controlled randomized studies. Correlation analysis, including Cronbach's coefficient α and explorative common factor analysis (EFA), was conducted within and between assessments to demonstrate the construct validity, dimensionality, and factorial structure of the THI-12. N = 618 patients suffering from subjective tinnitus who were to be screened to participate in a randomized, placebo-controlled, 16-week, longitudinal study. The THI-12 can reliably diagnose tinnitus-related impairments and disabilities and assess changes over time. The test-retest coefficient for neighboured visits was r > 0.69, the internal consistency of the THI-12 total score was α ≤ 0.79 and α ≤ 0.89 at subsequent visits. Predictability of THI-12 total score and overall variance increased with successive measurements. The three-factorial structure allowed for evaluation of factors that affect aspects of patients' health-related quality of life. The THI-12, with its three-factorial structure, is a simple, reliable, and valid instrument for the diagnosis and assessment of tinnitus and associated impairment over time.
ERIC Educational Resources Information Center
Davison, Mark L.; Semmes, Robert; Huang, Lan; Close, Catherine N.
2012-01-01
Data from 181 college students were used to assess whether math reasoning item response times in computerized testing can provide valid and reliable measures of a speed dimension. The alternate forms reliability of the speed dimension was .85. A two-dimensional structural equation model suggests that the speed dimension is related to the accuracy…
Advancing implementation science through measure development and evaluation: a study protocol.
Lewis, Cara C; Weiner, Bryan J; Stanick, Cameo; Fischer, Sarah M
2015-07-22
Significant gaps related to measurement issues are among the most critical barriers to advancing implementation science. Three issues motivated the study aims: (a) the lack of stakeholder involvement in defining pragmatic measure qualities; (b) the dearth of measures, particularly for implementation outcomes; and (c) unknown psychometric and pragmatic strength of existing measures. Aim 1: Establish a stakeholder-driven operationalization of pragmatic measures and develop reliable, valid rating criteria for assessing the construct. Aim 2: Develop reliable, valid, and pragmatic measures of three critical implementation outcomes, acceptability, appropriateness, and feasibility. Aim 3: Identify Consolidated Framework for Implementation Research and Implementation Outcome Framework-linked measures that demonstrate both psychometric and pragmatic strength. For Aim 1, we will conduct (a) interviews with stakeholder panelists (N = 7) and complete a literature review to populate pragmatic measure construct criteria, (b) Q-sort activities (N = 20) to clarify the internal structure of the definition, (c) Delphi activities (N = 20) to achieve consensus on the dimension priorities, (d) test-retest and inter-rater reliability assessments of the emergent rating system, and (e) known-groups validity testing of the top three prioritized pragmatic criteria. For Aim 2, our systematic development process involves domain delineation, item generation, substantive validity assessment, structural validity assessment, reliability assessment, and predictive validity assessment. We will also assess discriminant validity, known-groups validity, structural invariance, sensitivity to change, and other pragmatic features. For Aim 3, we will refine our established evidence-based assessment (EBA) criteria, extract the relevant data from the literature, rate each measure using the EBA criteria, and summarize the data. The study outputs of each aim are expected to have a positive impact as they will establish and guide a comprehensive measurement-focused research agenda for implementation science and provide empirically supported measures, tools, and methods for accomplishing this work.
ERIC Educational Resources Information Center
Reiter, Harold I.; Rosenfeld, Jack; Nandagopal, Kiruthiga; Eva, Kevin W.
2004-01-01
Context: Various research studies have examined the question of whether expert or non-expert raters, faculty or students, evaluators or standardized patients, give more reliable and valid summative assessments of performance on Objective Structured Clinical Examinations (OSCEs). Less studied has been the question of whether or not non-faculty…
ERIC Educational Resources Information Center
Abraham, Reem Rachel; Raghavendra, Rao; Surekha, Kamath; Asha, Kamath
2009-01-01
A single examination does not fulfill all the functions of assessment. The present study was undertaken to determine the reliability and student satisfaction regarding the objective structured practical examination (OSPE) as a method of assessment of laboratory exercises in physiology before implementing it in the forthcoming university…
DOE Office of Scientific and Technical Information (OSTI.GOV)
None
2015-08-01
Since 1990, the National Renewable Energy Laboratory’s (NREL's) National Wind Technology Center (NWTC) has tested more than 150 wind turbine blades. NWTC researchers can test full-scale and subcomponent articles, conduct data analyses, and provide engineering expertise on best design practices. Structural testing of wind turbine blades enables designers, manufacturers, and owners to validate designs and assess structural performance to specific load conditions. Rigorous structural testing can reveal design and manufacturing problems at an early stage of development that can lead to overall improvements in design and increase system reliability.
Dedy, Nicolas J; Szasz, Peter; Louridas, Marisa; Bonrath, Esther M; Husslein, Heinrich; Grantcharov, Teodor P
2015-06-01
Nontechnical skills are critical for patient safety in the operating room (OR). As a result, regulatory bodies for accreditation and certification have mandated the integration of these competencies into postgraduate education. A generally accepted approach to the in-training assessment of nontechnical skills, however, is lacking. The goal of the present study was to develop an evidence-based and reliable tool for the in-training assessment of residents' nontechnical performance in the OR. The Objective Structured Assessment of Nontechnical Skills tool was designed as a 5-point global rating scale with descriptive anchors for each item, based on existing evidence-based frameworks of nontechnical skills, as well as resident training requirements. The tool was piloted on scripted videos and refined in an iterative process. The final version was used to rate residents' performance in recorded OR crisis simulations and during live observations in the OR. A total of 37 simulations and 10 live procedures were rated. Interrater agreement was good for total mean scores, both in simulation and in the real OR, with intraclass correlation coefficients >0.90 in all settings for average and single measures. Internal consistency of the scale was high (Cronbach's alpha = 0.80). The Objective Structured Assessment of Nontechnical Skills global rating scale was developed as an evidence-based tool for the in-training assessment of residents' nontechnical performance in the OR. Unique descriptive anchors allow for a criterion-referenced assessment of performance. Good reliability was demonstrated in different settings, supporting applications in research and education. Copyright © 2015 Elsevier Inc. All rights reserved.
The Shutdown Dissociation Scale (Shut-D)
Schalinski, Inga; Schauer, Maggie; Elbert, Thomas
2015-01-01
The evolutionary model of the defense cascade by Schauer and Elbert (2010) provides a theoretical frame for a short interview to assess problems underlying and leading to the dissociative subtype of posttraumatic stress disorder. Based on known characteristics of the defense stages “fright,” “flag,” and “faint,” we designed a structured interview to assess the vulnerability for the respective types of dissociation. Most of the scales that assess dissociative phenomena are designed as self-report questionnaires. Their items are usually selected based on more heuristic considerations rather than a theoretical model and thus include anything from minor dissociative experiences to major pathological dissociation. The shutdown dissociation scale (Shut-D) was applied in several studies in patients with a history of multiple traumatic events and different disorders that have been shown previously to be prone to symptoms of dissociation. The goal of the present investigation was to obtain psychometric characteristics of the Shut-D (including factor structure, internal consistency, retest reliability, predictive, convergent and criterion-related concurrent validity). A total population of 225 patients and 68 healthy controls were accessed. Shut-D appears to have sufficient internal reliability, excellent retest reliability, high convergent validity, and satisfactory predictive validity, while the summed score of the scale reliably separates patients with exposure to trauma (in different diagnostic groups) from healthy controls. The Shut-D is a brief structured interview for assessing the vulnerability to dissociate as a consequence of exposure to traumatic stressors. The scale demonstrates high-quality psychometric properties and may be useful for researchers and clinicians in assessing shutdown dissociation as well as in predicting the risk of dissociative responding. PMID:25976478
Saito, Rintaro; Suzuki, Harukazu; Hayashizaki, Yoshihide
2003-04-12
Recent screening techniques have made large amounts of protein-protein interaction data available, from which biologically important information such as the function of uncharacterized proteins, the existence of novel protein complexes, and novel signal-transduction pathways can be discovered. However, experimental data on protein interactions contain many false positives, making these discoveries difficult. Therefore computational methods of assessing the reliability of each candidate protein-protein interaction are urgently needed. We developed a new 'interaction generality' measure (IG2) to assess the reliability of protein-protein interactions using only the topological properties of their interaction-network structure. Using yeast protein-protein interaction data, we showed that reliable protein-protein interactions had significantly lower IG2 values than less-reliable interactions, suggesting that IG2 values can be used to evaluate and filter interaction data to enable the construction of reliable protein-protein interaction networks.
Zhang, Dengke; Pang, Yanxia; Cai, Weixiong; Fazio, Rachel L; Ge, Jianrong; Su, Qiaorong; Xu, Shuiqin; Pan, Yinan; Chen, Sanmei; Zhang, Hongwei
2016-08-01
Impairment of theory of mind (ToM) is a common phenomenon following traumatic brain injury (TBI) that has clear effects on patients' social functioning. A growing body of research has focused on this area, and several methods have been developed to assess ToM deficiency. Although an informant assessment scale would be useful for examining individuals with TBI, very few studies have adopted this approach. The purpose of the present study was to develop an informant assessment scale of ToM for adults with traumatic brain injury (IASToM-aTBI) and to test its reliability and validity with 196 adults with TBI and 80 normal adults. A 44-item scale was developed following a literature review, interviews with patient informants, consultations with experts, item analysis, and exploratory factor analysis (EFA). The following three common factors were extracted: social interaction, understanding of beliefs, and understanding of emotions. The psychometric analyses indicate that the scale has good internal consistency reliability, split-half reliability, test-retest reliability, inter-rater reliability, structural validity, discriminate validity and criterion validity. These results provide preliminary evidence that supports the reliability and validity of the IASToM-aTBI as a ToM assessment tool for adults with TBI.
Cropp, Carola; Salzer, Simone; Häusser, Leonard F; Streeck-Fischer, Annette
2013-01-01
The axis structure of the Operationalized Psychodynamic Diagnostics in childhood and adolescence (OPD-CA) has proven to be a reliable and valid diagnostic tool under research conditions. However, corresponding data regarding the integration of OPD-CA axis structure into clinical practice is still lacking. Hence, this aspect was examined as part of a randomized controlled clinical trial realized at Asklepios Fachklinikum Tiefenbrunn. Here, the OPD-CA axis structure has been applied to assess the structural level of 42 adolescent patients (15-19 years). In contrast to previous studies, the assessment was not carried out by independent raters using a videotaped OPD-CA interview, but the rating was part of clinical routine procedures. Also under these conditions, inter-rater reliability was high, in particular regarding the four subscales of the OPD-CA axis structure. With respect to construct validity, the results of our study supported a two-factor solution, which is in accordance with the findings of two previous works. One factor corresponded to the dimension "self-regulation" while the other factor included both the dimension "self-perception and object perception" as well as the dimension "communication skills". Implications of the findings for research and practice are discussed.
Boileau, C; Martel-Pelletier, J; Abram, F; Raynauld, J-P; Troncy, E; D'Anjou, M-A; Moreau, M; Pelletier, J-P
2008-07-01
Osteoarthritis (OA) structural changes take place over decades in humans. MRI can provide precise and reliable information on the joint structure and changes over time. In this study, we investigated the reliability of quantitative MRI in assessing knee OA structural changes in the experimental anterior cruciate ligament (ACL) dog model of OA. OA was surgically induced by transection of the ACL of the right knee in five dogs. High resolution three dimensional MRI using a 1.5 T magnet was performed at baseline, 4, 8 and 26 weeks post surgery. Cartilage volume/thickness, cartilage defects, trochlear osteophyte formation and subchondral bone lesion (hypersignal) were assessed on MRI images. Animals were killed 26 weeks post surgery and macroscopic evaluation was performed. There was a progressive and significant increase over time in the loss of knee cartilage volume, the cartilage defect and subchondral bone hypersignal. The trochlear osteophyte size also progressed over time. The greatest cartilage loss at 26 weeks was found on the tibial plateaus and in the medial compartment. There was a highly significant correlation between total knee cartilage volume loss or defect and subchondral bone hypersignal, and also a good correlation between the macroscopic and the MRI findings. This study demonstrated that MRI is a useful technology to provide a non-invasive and reliable assessment of the joint structural changes during the development of OA in the ACL dog model. The combination of this OA model with MRI evaluation provides a promising tool for the evaluation of new disease-modifying osteoarthritis drugs (DMOADs).
System reliability of randomly vibrating structures: Computational modeling and laboratory testing
NASA Astrophysics Data System (ADS)
Sundar, V. S.; Ammanagi, S.; Manohar, C. S.
2015-09-01
The problem of determination of system reliability of randomly vibrating structures arises in many application areas of engineering. We discuss in this paper approaches based on Monte Carlo simulations and laboratory testing to tackle problems of time variant system reliability estimation. The strategy we adopt is based on the application of Girsanov's transformation to the governing stochastic differential equations which enables estimation of probability of failure with significantly reduced number of samples than what is needed in a direct simulation study. Notably, we show that the ideas from Girsanov's transformation based Monte Carlo simulations can be extended to conduct laboratory testing to assess system reliability of engineering structures with reduced number of samples and hence with reduced testing times. Illustrative examples include computational studies on a 10-degree of freedom nonlinear system model and laboratory/computational investigations on road load response of an automotive system tested on a four-post test rig.
Extending the Concept and Assessment of Teacher Efficacy.
ERIC Educational Resources Information Center
Rich, Yisrael; And Others
1996-01-01
Two teacher efficacy subscales developed by S. Gibson and M. Dembo (1984) translated into Hebrew and administered to Israeli teachers retained their factor structures and adequate reliability. A subscale developed for the study to measure teacher efficacy in enhancing student social relations specifically also had adequate reliability. (SLD)
Item Analysis to Improve Reliability for an Internal Medicine Undergraduate OSCE
ERIC Educational Resources Information Center
Auewarakul, Chirayu; Downing, Steven M.; Praditsuwan, Rungnirand; Jaturatamrong, Uapong
2005-01-01
Utilization of objective structured clinical examinations (OSCEs) for final assessment of medical students in Internal Medicine requires a representative sample of OSCE stations. The reliability and generalizability of OSCE scores provides validity evidence for OSCE scores and supports its contribution to the final clinical grade of medical…
Validation of a method for assessing resident physicians' quality improvement proposals.
Leenstra, James L; Beckman, Thomas J; Reed, Darcy A; Mundell, William C; Thomas, Kris G; Krajicek, Bryan J; Cha, Stephen S; Kolars, Joseph C; McDonald, Furman S
2007-09-01
Residency programs involve trainees in quality improvement (QI) projects to evaluate competency in systems-based practice and practice-based learning and improvement. Valid approaches to assess QI proposals are lacking. We developed an instrument for assessing resident QI proposals--the Quality Improvement Proposal Assessment Tool (QIPAT-7)-and determined its validity and reliability. QIPAT-7 content was initially obtained from a national panel of QI experts. Through an iterative process, the instrument was refined, pilot-tested, and revised. Seven raters used the instrument to assess 45 resident QI proposals. Principal factor analysis was used to explore the dimensionality of instrument scores. Cronbach's alpha and intraclass correlations were calculated to determine internal consistency and interrater reliability, respectively. QIPAT-7 items comprised a single factor (eigenvalue = 3.4) suggesting a single assessment dimension. Interrater reliability for each item (range 0.79 to 0.93) and internal consistency reliability among the items (Cronbach's alpha = 0.87) were high. This method for assessing resident physician QI proposals is supported by content and internal structure validity evidence. QIPAT-7 is a useful tool for assessing resident QI proposals. Future research should determine the reliability of QIPAT-7 scores in other residency and fellowship training programs. Correlations should also be made between assessment scores and criteria for QI proposal success such as implementation of QI proposals, resident scholarly productivity, and improved patient outcomes.
Das, Rebekah; Buckley, Jonathan; Williams, Marie
2017-03-01
To develop and assess structure, test-retest reliability, and discriminative validity of a self-report questionnaire (University of South Australia Urinary Sensation Assessment: USA 2 ) to assess multiple dimensions of urgency sensation. The USA 2 was designed and tested over two prospective, observational studies (2013-2014). Participants were English speaking Australians aged 50 or more with and without overactive bladder (OAB; determined by OAB awareness tool), recruited via health and recreation centers. In Study 1, exploratory factor analysis determined USA 2 structure and subscales. In Study 2, confirmatory factor analysis reassessed structure; Mann-Whitney U-tests determined discriminative validity (OAB vs. non-OAB for subscale and total scores) with Cohen's d effect sizes. Thirty-three individuals completed the USA 2 twice; intraclass correlation coefficients (ICCs) and Wilcoxon signed rank tests assessed test-retest reliability. Questionnaires were returned by 189 eligible participants in Study 1 and 211 in Study 2. Exploratory factor analysis revealed three subscales: "urgency," "affective," "fullness." Confirmatory factor analysis supported these subscales. Subscale and total scores were significantly different between groups with and without OAB (P < 0.001). Cohen's d effect sizes (95%CI) were total score 1.8 (0.5-3.1), "urgency" subscale 1.8 (1.3-2.3), "affective" 1.7 (0.95-2.4), and "fullness" 0.75 (0.42-1.09). Total and subscales scores demonstrated test-retest reliability; ICCs (95%CIs) of 0.95 (0.9-0.98), 0.96 (0.92-0.98), 0.94 (0.88-0.97), and 0.78 (0.56-0.89). The USA 2 assesses multiple dimensions of urgency sensation, is reliable over a 2-week period, and discriminates between older adults with and without OAB. Further validation is required in conditions other than overactive bladder. Neurourol. Urodynam. 36:667-672, 2017. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Sharif Nia, Hamid; Pahlevan Sharif, Saeed; Lehto, Rebecca H; Allen, Kelly A; Goudarzian, Amir Hossein; Yaghoobzadeh, Ameneh; Soleimani, Mohammad Ali
2017-07-01
Objective: Limited research has examined the psychometric properties of death depression scales in Persian populations with cardiac disease despite the need for valid assessment tools for evaluating depressive symptoms in patients with life-limiting chronic conditions. The present study aimed at evaluating the reliability and validity of the Persian Version of Death Depression Scale - Revised (DDS-R) in Iranian patients who had recent acute myocardial infarction (AMI). Method: This psychometric study was conducted with a convenience sample of 407 patients with AMI diagnosis who completed the Persian version of the DDS-R. The face, content, and construct validity of the scale were ascertained. Internal consistency, test-retest, and construct reliability (CR) were used to assess reliability of the Persian Version of DDS-R. Results: Based on maximum likelihood exploratory factor analysis and consideration of conceptual meaning, a 4-factor solution was identified, explaining 75.89% of the total variance. Goodness-of-fit indices (GFI), Comparative Fit Index (CFI), Normed Fit Index (NFI), Incremental Fit Index (IFI), and Root Mean Square Error of Approximation (RMSEA) in the final DDS-R structure demonstrated the adequacy of the 4-domain structure. The internal consistency, construct reliability, and Intra-class Correlation Coefficients (ICC) were greater than .70. Conclusion: The DDS-R was found to be a valid and reliable assessment tool for evaluating death depression symptoms in Iranian patients with AMI.
Ishman, Stacey L; Benke, James R; Johnson, Kaalan Erik; Zur, Karen B; Jacobs, Ian N; Thorne, Marc C; Brown, David J; Lin, Sandra Y; Bhatti, Nasir; Deutsch, Ellen S
2012-10-01
OBJECTIVES To confirm interrater reliability using blinded evaluation of a skills-assessment instrument to assess the surgical performance of resident and fellow trainees performing pediatric direct laryngoscopy and rigid bronchoscopy in simulated models. DESIGN Prospective, paired, blinded observational validation study. SUBJECTS Paired observers from multiple institutions simultaneously evaluated residents and fellows who were performing surgery in an animal laboratory or using high-fidelity manikins. The evaluators had no previous affiliation with the residents and fellows and did not know their year of training. INTERVENTIONS One- and 2-page versions of an objective structured assessment of technical skills (OSATS) assessment instrument composed of global and a task-specific surgical items were used to evaluate surgical performance. RESULTS Fifty-two evaluations were completed by 17 attending evaluators. The instrument agreement for the 2-page assessment was 71.4% when measured as a binary variable (ie, competent vs not competent) (κ = 0.38; P = .08). Evaluation as a continuous variable revealed a 42.9% percentage agreement (κ = 0.18; P = .14). The intraclass correlation was 0.53, considered substantial/good interrater reliability (69% reliable). For the 1-page instrument, agreement was 77.4% when measured as a binary variable (κ = 0.53, P = .0015). Agreement when evaluated as a continuous measure was 71.0% (κ = 0.54, P < .001). The intraclass correlation was 0.73, considered high interrater reliability (85% reliable). CONCLUSIONS The OSATS assessment instrument is an effective tool for evaluating surgical performance among trainees with acceptable interrater reliability in a simulator setting. Reliability was good for both the 1- and 2-page OSATS checklists, and both serve as excellent tools to provide immediate formative feedback on operational competency.
Harlan, E; Clark, L A
1999-06-01
Researchers and clinicians alike increasingly seek brief, reliable, and valid measures to obtain personality trait ratings from both selves and peers. We report the development of a paragraph-descriptor short form of a full-length personality assessment instrument, the Schedule for Nonadaptive and Adaptive Personality (SNAP) with both self- and other versions. Reliability and validity data were collected on a sample of 294 college students, from 90 of whom we also obtained parental ratings of their personality. Internal consistency reliability was good in both self- and parent data. The factorial structures of the self-report short and long forms were very similar. Convergence between parental ratings was moderately high. Self-parent convergence was variable, with lower agreement on scales assessing subjective distress than those assessing more observable behaviors; it also was stronger for higher order factors than for scales.
Xiao, Ting; Stamatakis, Katherine A; McVay, Allese B
Local health departments (LHDs) have an important function in controlling the growing epidemic of obesity in the United States. Data are needed to gain insight into the existence of routine functions and structures of LHDs that support and sustain obesity prevention efforts. The purpose of this study was to develop and examine the reliability of measures to assess foundational LHD organizational processes and functions specific to obesity prevention. Survey measures were developed using a stratified, random sample of US LHDs to assess supportive organizational processes and infrastructure for obesity prevention representing different domains. Data were analyzed using weighted κ and intraclass correlation coefficient for assessing test-retest reliability. Most items and summary indices in the majority of survey domains had moderate/substantial or almost perfect reliability. The overall findings support this survey instrument to be a reliable measurement tool for a large number of processes and functions that comprise obesity prevention-related capacity in LHDs.
Piqueras, Jose A; Martín-Vivar, María; Sandin, Bonifacio; San Luis, Concepción; Pineda, David
2017-08-15
Anxiety and depression are among the most common mental disorders during childhood and adolescence. Among the instruments for the brief screening assessment of symptoms of anxiety and depression, the Revised Child Anxiety and Depression Scale (RCADS) is one of the more widely used. Previous studies have demonstrated the reliability of the RCADS for different assessment settings and different versions. The aims of this study were to examine the mean reliability of the RCADS and the influence of the moderators on the RCADS reliability. We searched in EBSCO, PsycINFO, Google Scholar, Web of Science, and NCBI databases and other articles manually from lists of references of extracted articles. A total of 146 studies were included in our meta-analysis. The RCADS showed robust internal consistency reliability in different assessment settings, countries, and languages. We only found that reliability of the RCADS was significantly moderated by the version of RCADS. However, these differences in reliability between different versions of the RCADS were slight and can be due to the number of items. We did not examine factor structure, factorial invariance across gender, age, or country, and test-retest reliability of the RCADS. The RCADS is a reliable instrument for cross-cultural use, with the advantage of providing more information with a low number of items in the assessment of both anxiety and depression symptoms in children and adolescents. Copyright © 2017. Published by Elsevier B.V.
Reliability Technology to Achieve Insertion of Advanced Packaging (RELTECH) program
NASA Astrophysics Data System (ADS)
Fayette, Daniel F.; Speicher, Patricia; Stoklosa, Mark J.; Evans, Jillian V.; Evans, John W.; Gentile, Mike; Pagel, Chuck A.; Hakim, Edward
1993-08-01
A joint military-commercial effort to evaluate multichip module (MCM) structures is discussed. The program, Reliability Technology to Achieve Insertion of Advanced Packaging (RELTECH), has been designed to identify the failure mechanisms that are possible in MCM structures. The RELTECH test vehicles, technical assessment task, product evaluation plan, reliability modeling task, accelerated and environmental testing, and post-test physical analysis and failure analysis are described. The information obtained through RELTECH can be used to address standardization issues, through development of cost effective qualification and appropriate screening criteria, for inclusion into a commercial specification and the MIL-H-38534 general specification for hybrid microcircuits.
Reliability Technology to Achieve Insertion of Advanced Packaging (RELTECH) program
NASA Technical Reports Server (NTRS)
Fayette, Daniel F.; Speicher, Patricia; Stoklosa, Mark J.; Evans, Jillian V.; Evans, John W.; Gentile, Mike; Pagel, Chuck A.; Hakim, Edward
1993-01-01
A joint military-commercial effort to evaluate multichip module (MCM) structures is discussed. The program, Reliability Technology to Achieve Insertion of Advanced Packaging (RELTECH), has been designed to identify the failure mechanisms that are possible in MCM structures. The RELTECH test vehicles, technical assessment task, product evaluation plan, reliability modeling task, accelerated and environmental testing, and post-test physical analysis and failure analysis are described. The information obtained through RELTECH can be used to address standardization issues, through development of cost effective qualification and appropriate screening criteria, for inclusion into a commercial specification and the MIL-H-38534 general specification for hybrid microcircuits.
A systematic review of the factor structure and reliability of the Spence Children's Anxiety Scale.
Orgilés, Mireia; Fernández-Martínez, Iván; Guillén-Riquelme, Alejandro; Espada, José P; Essau, Cecilia A
2016-01-15
The Spence Children's Anxiety Scale (SCAS) is a widely used instrument for assessing symptoms of anxiety disorders among children and adolescents. Previous studies have demonstrated its good reliability for children and adolescents from different backgrounds. However, remarkable variability in the reliability of the SCAS across studies and inconsistent results regarding its factor structure has been found. The present study aims to examine the SCAS factor structure by means of a systematic review with narrative synthesis, the mean reliability of the SCAS by means of a meta-analysis, and the influence of the moderators on the SCAS reliability. Databases employed to collect the studies included Scholar Google, PsycARTICLES, PsycINFO, Web of Science, and Scopus since 1997. Twenty-nine and 32 studies, which examined the factor structure and the internal consistency of the SCAS, respectively, were included. The SCAS was found to have strong internal consistency, influenced by different moderators. The systematic review demonstrated that the original six-factor model was supported by most studies. Factorial invariance studies (across age, gender, country) and test-retest reliability of the SCAS were not examined in this study. It is concluded that the SCAS is a reliable instrument for cross-cultural use, and it is suggested that the original six-factor model is appropriate for cross-cultural application. Copyright © 2015 Elsevier B.V. All rights reserved.
Hand assessment in older adults with musculoskeletal hand problems: a reliability study.
Myers, Helen L; Thomas, Elaine; Hay, Elaine M; Dziedzic, Krysia S
2011-01-07
Musculoskeletal hand pain is common in the general population. This study aims to investigate the inter- and intra-observer reliability of two trained observers conducting a simple clinical interview and physical examination for hand problems in older adults. The reliability of applying the American College of Rheumatology (ACR) criteria for hand osteoarthritis to community-dwelling older adults will also be investigated. Fifty-five participants aged 50 years and over with a current self-reported hand problem and registered with one general practice were recruited from a previous health questionnaire study. Participants underwent a standardised, structured clinical interview and physical examination by two independent trained observers and again by one of these observers a month later. Agreement beyond chance was summarised using Kappa statistics and intra-class correlation coefficients. Median values for inter- and intra-observer reliability for clinical interview questions were found to be "substantial" and "moderate" respectively [median agreement beyond chance (Kappa) was 0.75 (range: -0.03, 0.93) for inter-observer ratings and 0.57 (range: -0.02, 1.00) for intra-observer ratings]. Inter- and intra-observer reliability for physical examination items was variable, with good reliability observed for some items, such as grip and pinch strength, and poor reliability observed for others, notably assessment of altered sensation, pain on resisted movement and judgements based on observation and palpation of individual features at single joints, such as bony enlargement, nodes and swelling. Moderate agreement was observed both between and within observers when applying the ACR criteria for hand osteoarthritis. Standardised, structured clinical interview is reliable for taking a history in community-dwelling older adults with self reported hand problems. Agreement between and within observers for physical examination items is variable. Low Kappa values may have resulted, in part, from a low prevalence of clinical signs and symptoms in the study participants. The decision to use clinical interview and hand assessment variables in clinical practice or further research in primary care should include consideration of clinical applicability and training alongside reliability. Further investigation is required to determine the relationship between these clinical questions and assessments and the clinical course of hand pain and hand problems in community-dwelling older adults.
NASA Astrophysics Data System (ADS)
Buczyński, P.
2018-05-01
This article presents a new approach to reliability assessment of the road structure in which the base layer will be constructed in the process of cold deep recycling with foamed bitumen. In order to properly assess the reliability of the structure with the recycled base, it is necessary to determine the distribution of stress and strain in typical pavement layer systems. The true stress and strain values were established for particular structural layers using the complex modulus (E*) determined based on the master curves. The complex modulus was determined by the direct tension-compression test on cylindrical specimens (DTC-CY) at five temperatures (-7°C, 5°C, 13°C, 25°C, 40°C) and six loading times (0.1 Hz, 0.3 Hz, 1 Hz, 3 Hz, 10 Hz, 20 Hz) in accordance with EN 12697-26 in the linear viscoelasticity (LVE) range for small strains ranging from 25 to 50 με. The master curves of the complex modulus were constructed using the Richards model for the mixtures typically incorporated in structural layers, i.e., SMA11, AC16W, AC22P and MCAS. The values of the modulus characterizing particular layers were determined with temperature distribution in the structure taken into account, when the surface temperature was 40°C. The stress distribution was established for those calculation models. The stress values were used to evaluate the fatigue life under controlled stress conditions (IT-FT). This evaluation, with the controlled stress corresponding to that in the structure, facilitated the quality assessment of the rehabilitated recycled base course. Results showed that the recycled base mixtures having the indirect tensile strength (ITSDRY) similar to the stress in the structure under analysis needed an additional fatigue life evaluation in the indirect tensile test ITT. This approach to the recycled base quality assessment will allow eliminating the damage induced by overloading.
Thylstrup, Birgitte; Simonsen, Sebastian; Nemery, Caroline; Simonsen, Erik; Noll, Jane Fjernestad; Myatt, Mikkel Wanting; Hesse, Morten
2016-08-25
The personality disorder categories in the Diagnostic and Statistical Manual of Mental Disorders IV have been extensively criticized, and there is a growing consensus that personality pathology should be represented dimensionally rather than categorically. The aim of this pilot study was to test the Clinical Assessment of the Level of Personality Functioning Scale, a semi-structured clinical interview, designed to assess the Level of Personality Functioning Scale of the DSM-5 (Section III) by applying strategies similar to what characterizes assessments in clinical practice. The inter-rater reliability of the assessment of the four domains and the total impairment in the Level of Personality Functioning Scale were measured in a patient sample that varied in terms of severity and type of pathology. Ratings were done independently by the interviewer and two experts who watched a videotaped Clinical Assessment of the Level of Personality Functioning Scale interview. Inter-rater reliability coefficients varied between domains and were not sufficient for clinical practice, but may support the use of the interview to assess the dimensions of personality functioning for research purposes. While designed to measure the Level of Personality Functioning Scale with a high degree of similarity to clinical practice, the Clinical Assessment of the Level of Personality Functioning Scale had weak reliabilities and a rating based on a single interview should not be considered a stand-alone assessment of areas of functioning for a given patient.
Probabilistic simulation of the human factor in structural reliability
NASA Astrophysics Data System (ADS)
Chamis, Christos C.; Singhal, Surendra N.
1994-09-01
The formal approach described herein computationally simulates the probable ranges of uncertainties for the human factor in probabilistic assessments of structural reliability. Human factors such as marital status, professional status, home life, job satisfaction, work load, and health are studied by using a multifactor interaction equation (MFIE) model to demonstrate the approach. Parametric studies in conjunction with judgment are used to select reasonable values for the participating factors (primitive variables). Subsequently performed probabilistic sensitivity studies assess the suitability of the MFIE as well as the validity of the whole approach. Results show that uncertainties range from 5 to 30 percent for the most optimistic case, assuming 100 percent for no error (perfect performance).
Probabilistic Simulation of the Human Factor in Structural Reliability
NASA Technical Reports Server (NTRS)
Chamis, Christos C.; Singhal, Surendra N.
1994-01-01
The formal approach described herein computationally simulates the probable ranges of uncertainties for the human factor in probabilistic assessments of structural reliability. Human factors such as marital status, professional status, home life, job satisfaction, work load, and health are studied by using a multifactor interaction equation (MFIE) model to demonstrate the approach. Parametric studies in conjunction with judgment are used to select reasonable values for the participating factors (primitive variables). Subsequently performed probabilistic sensitivity studies assess the suitability of the MFIE as well as the validity of the whole approach. Results show that uncertainties range from 5 to 30 percent for the most optimistic case, assuming 100 percent for no error (perfect performance).
Assessing the competences associated with a nursing Bachelor thesis by means of rubrics.
Llaurado-Serra, M; Rodríguez, E; Gallart, A; Fuster, P; Monforte-Royo, C; De Juan, M Á
2018-07-01
Writing a Bachelor thesis is the last step in obtaining a university degree. The thesis may be job- or research-orientated, but it must demonstrate certain degree-level competences. Rubrics are a useful way of unifying the assessment criteria. To design a system of rubrics for assessing the competences associated with the Bachelor thesis of a nursing degree, to examine the system's reliability and validity and to analyse results in relation to the final thesis mark. Cross-sectional and psychometric study conducted between 2012 and 2014. Nursing degree at a Spanish university. Twelve tutors who designed the system of rubrics. Students (n = 76) who wrote their Bachelor thesis during the 2013-2014 academic year. After deciding which aspects would be assessed, who would assess them and when, the tutors developed seven rubrics (drafting process, assessment of the written thesis by the supervisor and by a panel, student self-assessment, peer assessment, tutor evaluation of the peer assessment and panel assessment of the viva). We analysed the reliability (inter-rater and internal consistency) and validity (convergent and discriminant) of the rubrics, and also the relationship between the competences assessed and the final thesis mark. All the rubrics had internal consistency coefficients >0.80. The rubric for oral communication skills (viva) yielded inter-rater reliability of 0.95. Factor analysis indicated a unidimensional structure for all but one of the rubrics, the exception being the rubric for peer assessment, which had a two-factor structure. The main competences associated with a good quality Bachelor thesis were written communication skills and the ability to work independently. The assessment system based on seven rubrics is shown to be valid and reliable. Writing a Bachelor thesis requires a range of degree-level competences and it offers nursing students the opportunity to develop their evidence-based practice skills. Copyright © 2018 Elsevier Ltd. All rights reserved.
An Assessment of Reliability and Validity of a Rubric for Grading APA-Style Introductions
ERIC Educational Resources Information Center
Stellmack, Mark A.; Konheim-Kalkstein, Yasmine L.; Manor, Julia E.; Massey, Abigail R.; Schmitz, Julie Ann P.
2009-01-01
This article describes the empirical evaluation of the reliability and validity of a grading rubric for grading APA-style introductions of undergraduate students. Levels of interrater agreement and intrarater agreement were not extremely high but were similar to values reported in the literature for comparably structured rubrics. Rank-order…
Ehrhart, Mark G.; Torres, Elisa M.; Finn, Natalie K.; Roesch, Scott C.
2016-01-01
There have been recent calls for pragmatic measures to assess factors that influence evidence-based practice (EBP) implementation processes and outcomes. The Implementation Leadership Scale (ILS) is a brief and efficient measure that can be used for research or organizational development purposes to assess leader behaviors and actions that actively support effective EBP implementation. The ILS was developed and validated in mental health settings. This study validates the ILS factor structure with providers in alcohol and other drug (AOD) use treatment agencies. Participants were 323 service providers working in 72 workgroups from three AOD use treatment agencies. Confirmatory factor analyses and reliability analyses were conducted to examine the psychometric properties of the ILS. Convergent and discriminant validity were also assessed. Confirmatory factor analyses demonstrated good fit to the hypothesized first and second order factor structure. Internal consistency reliability was excellent. Convergent and discriminant validity was supported. The ILS psychometric characteristics, reliability, and validity were supported in AOD use treatment agencies. The ILS is a brief and pragmatic measure that can be used for research and practice to assess leadership for EBP implementation in AOD use treatment agencies. PMID:27431044
Aarons, Gregory A; Ehrhart, Mark G; Torres, Elisa M; Finn, Natalie K; Roesch, Scott C
2016-09-01
There have been recent calls for pragmatic measures to assess factors that influence evidence-based practice (EBP) implementation processes and outcomes. The Implementation Leadership Scale (ILS) is a brief and efficient measure that can be used for research or organizational development purposes to assess leader behaviors and actions that actively support effective EBP implementation. The ILS was developed and validated in mental health settings. This study validates the ILS factor structure with providers in alcohol and other drug (AOD) use treatment agencies. Participants were 323 service providers working in 72 workgroups from three AOD use treatment agencies. Confirmatory factor analyses and reliability analyses were conducted to examine the psychometric properties of the ILS. Convergent and discriminant validity were also assessed. Confirmatory factor analyses demonstrated good fit to the hypothesized first and second order factor structure. Internal consistency reliability was excellent. Convergent and discriminant validity was supported. The ILS psychometric characteristics, reliability, and validity were supported in AOD use treatment agencies. The ILS is a brief and pragmatic measure that can be used for research and practice to assess leadership for EBP implementation in AOD use treatment agencies. Copyright © 2016 Elsevier Inc. All rights reserved.
Lindström, Eva; Jedenius, Erik; Levander, Sten
2009-01-01
The objective of the study was to validate a self-administrated symptom rating scale for use in patients with schizophrenia spectrum disorders by item analysis, exploration of factor structure, and analyses of reliability and validity. Data on 151 patients, initially treated by risperidone, obtained within the framework of a naturalistic Phase IV longitudinal study, were analysed by comparing patient and clinician ratings of symptoms, side-effects and global indices of illness. The Symptom Self-rating Scale for Schizophrenia (4S) is psychometrically adequate (item analysis, internal consistency, factor structure). Side-effect ratings were reliable. Symptom ratings displayed consistent associations with clinicians' ratings of corresponding symptom dimensions, suggesting construct validity. Patients had most difficulties assessing negative symptom items. Patients were well able to assess their own symptoms and drug side-effects. The factor structure of symptom ratings differs between patients and clinicians as well as how they construe global indices of illness. Clinicians focus on psychotic, patients on affective symptoms. Use of symptom self-ratings is one way to improve communication and thereby strengthen the therapeutic alliance and increase treatment adherence.
Assessing the factor structure of the Chinese conformity to masculine norms inventory.
Rochelle, Tina L; Yim, K H
2015-01-01
The purpose of the present study was to examine the factor structure and assess the reliability of the Chinese Conformity to Masculine Norms Inventory-46 (CCMNI-46). Using a cohort of 254 Hong Kong-born Chinese males, scale reliability determination involved the internal consistencies of the entire instrument. Ages of respondents ranged from 18 to 81 years (M = 38.05; SD = 17.3). Confirmatory factor analysis provided support for the psychometric properties of the CCMNI-46, thus confirming the multidimensional structure of the CMNI-46 and the replicability of the CMNI using a Hong Kong Chinese sample. All items loaded onto the corresponding factor with the exception of one item from the emotional control subscale. The overall reliability of the CCMNI-46 was lower than previous Western studies and may well reflect the subtle diversity of masculinity across cultures. The findings offered psychometric support for use of the CCMNI-46 in research and practice regarding Hong Kong Chinese masculinity. The CCMNI-46 provides a useful template for the operationalization of masculine norms in Chinese society.
Vasconcelos-Raposo, José; Fernandes, Helder Miguel; Teixeira, Carla M
2013-01-01
The purpose of the present study was to assess the factor structure and reliability of the Depression, Anxiety and Stress Scales (DASS-21) in a large Portuguese community sample. Participants were 1020 adults (585 women and 435 men), with a mean age of 36.74 (SD = 11.90) years. All scales revealed good reliability, with Cronbach's alpha values between .80 (anxiety) and .84 (depression). The internal consistency of the total score was .92. Confirmatory factor analysis revealed that the best-fitting model (*CFI = .940, *RMSEA = .038) consisted of a latent component of general psychological distress (or negative affectivity) plus orthogonal depression, anxiety and stress factors. The Portuguese version of the DASS-21 showed good psychometric properties (factorial validity and reliability) and thus can be used as a reliable and valid instrument for measuring depression, anxiety and stress symptoms.
Development and validity of a scale to measure workplace culture of health.
Kwon, Youngbum; Marzec, Mary L; Edington, Dee W
2015-05-01
To describe the development of and test the validity and reliability of the Workplace Culture of Health (COH) scale. Exploratory factor analysis and confirmatory factor analysis were performed on data from a health care organization (N = 627). To verify the factor structure, confirmatory factor analysis was performed on a second data set from a medical equipment manufacturer (N = 226). The COH scale included a structure of five orthogonal factors: senior leadership and polices, programs and rewards, quality assurance, supervisor support, and coworker support. With regard to construct validity (convergent and discriminant) and reliability, two different US companies showed the same factorial structure, satisfactory fit statistics, and suitable internal and external consistency. The COH scale represents a reliable and valid scale to assess the workplace environment and culture for supporting health.
Measuring Maladaptive Cognitions in Complicated Grief: Introducing the Typical Beliefs Questionnaire
Skritskaya, Natalia A.; Mauro, Christine; Olonoff, Matthew; Qiu, Xin; Duncan, Sarah; Wang, Yuanjia; Duan, Naihua; Lebowitz, Barry; Reynolds, Charles F.; Simon, Naomi M.; Zisook, Sidney; Shear, M. Katherine
2016-01-01
Objectives Maladaptive cognitions related to loss are thought to contribute to development of complicated grief and are crucial to address in treatment, but tools available to assess them are limited. This paper introduces the Typical Beliefs Questionnaire (TBQ), a 25-item self-report instrument to assess cognitions that interfere with adaptation to loss. Design Study participants completed an assessment battery during their initial evaluation and again after completing treatment at 20 weeks. Test-retest reliability was assessed on a subsample of the participants who did not show change in complicated grief severity after the first four weeks of treatment. To examine latent structure of the TBQ, an exploratory factor analysis (EFA) was performed. Setting Academic medical centers in Boston, New York, Pittsburgh and San Diego from 2010–2014. Participants 394 bereaved adults who met criteria for complicated grief. Measurements The TBQ along with assessments of complicated grief symptoms and related avoidance, depression symptoms, functional impairment, and perceived social support. Results The TBQ exhibited good internal consistency (α= .82) and test-retest reliability (n=105; ICC= .74). EFA indicated a five-factor structure: “Protesting the Death,” “Negative Thoughts About the World,” “Needing the Person,” “Less Grief is Wrong” and “Grieving Too Much.” The total score and all factors showed sensitivity to change with treatment. Conclusions This new tool allows a clinician to quickly and reliably ascertain presence of specific maladaptive cognitions related to complicated grief, and subsequently, to use the information to aid a diagnostic assessment, to structure the treatment, and to measure treatment outcomes. PMID:27793576
Development and Evaluation of the Telephone Crisis Support Skills Scale.
Kitchingman, Taneile A; Wilson, Coralie J; Caputi, Peter; Woodward, Alan; Hunt, Tara
2015-01-01
Although telephone services continue to play an important role in the delivery of front-line crisis support, published evidence of the standardized assessment of such services does not exist to date. To describe the development of the Telephone Crisis Support Skills Scale (TCSSS), an instrument to assess workers' intentions to use recommended skills with callers, and to evaluate its factor structure and reliability. TCSSS items were mapped to a national telephone crisis support practice model. A national sample of workers (n = 210) completed the TCSSS as part of a larger online survey. Principal axis factoring was used to evaluate the structure of the instrument. Internal consistency was assessed by Cronbach's α values. A single factor accounted for more than 40% of the variance within TCSSS ratings, indicating unidimensional structure. Cronbach's α coefficients suggested adequate internal consistency. Results indicate that the TCSSS is an internally consistent, unidimensional scale, sufficiently sensitive to detect workers' skill priorities for different caller problem types. Further study is required to confirm the factor structure and reliability of the TCSSS using workers from different organizations. Following further evaluation, the TCSSS may be applied to assessing readiness for and quality of service delivery.
Probabilistic assessment of uncertain adaptive hybrid composites
NASA Technical Reports Server (NTRS)
Shiao, Michael C.; Singhal, Surendra N.; Chamis, Christos C.
1994-01-01
Adaptive composite structures using actuation materials, such as piezoelectric fibers, were assessed probabilistically utilizing intraply hybrid composite mechanics in conjunction with probabilistic composite structural analysis. Uncertainties associated with the actuation material as well as the uncertainties in the regular (traditional) composite material properties were quantified and considered in the assessment. Static and buckling analyses were performed for rectangular panels with various boundary conditions and different control arrangements. The probability density functions of the structural behavior, such as maximum displacement and critical buckling load, were computationally simulated. The results of the assessment indicate that improved design and reliability can be achieved with actuation material.
Apollo experience report: Reliability and quality assurance
NASA Technical Reports Server (NTRS)
Sperber, K. P.
1973-01-01
The reliability of the Apollo spacecraft resulted from the application of proven reliability and quality techniques and from sound management, engineering, and manufacturing practices. Continual assessment of these techniques and practices was made during the program, and, when deficiencies were detected, adjustments were made and the deficiencies were effectively corrected. The most significant practices, deficiencies, adjustments, and experiences during the Apollo Program are described in this report. These experiences can be helpful in establishing an effective base on which to structure an efficient reliability and quality assurance effort for future space-flight programs.
A proposed method to investigate reliability throughout a questionnaire.
Wentzel-Larsen, Tore; Norekvål, Tone M; Ulvik, Bjørg; Nygård, Ottar; Pripp, Are H
2011-10-05
Questionnaires are used extensively in medical and health care research and depend on validity and reliability. However, participants may differ in interest and awareness throughout long questionnaires, which can affect reliability of their answers. A method is proposed for "screening" of systematic change in random error, which could assess changed reliability of answers. A simulation study was conducted to explore whether systematic change in reliability, expressed as changed random error, could be assessed using unsupervised classification of subjects by cluster analysis (CA) and estimation of intraclass correlation coefficient (ICC). The method was also applied on a clinical dataset from 753 cardiac patients using the Jalowiec Coping Scale. The simulation study showed a relationship between the systematic change in random error throughout a questionnaire and the slope between the estimated ICC for subjects classified by CA and successive items in a questionnaire. This slope was proposed as an awareness measure--to assessing if respondents provide only a random answer or one based on a substantial cognitive effort. Scales from different factor structures of Jalowiec Coping Scale had different effect on this awareness measure. Even though assumptions in the simulation study might be limited compared to real datasets, the approach is promising for assessing systematic change in reliability throughout long questionnaires. Results from a clinical dataset indicated that the awareness measure differed between scales.
Roley, Susanne Smith; Mailloux, Zoe; Parham, L. Diane; Koomar, Jane; Schaaf, Roseann C.; Van Jaarsveld, Annamarie; Cohn, Ellen
2014-01-01
This study examined the reliability and validity of the structural section of the Ayres Sensory Integration® Fidelity Measure© (ASIFM), which provides a method for monitoring the extent to which an intervention was implemented as conceptualized in studies of occupational therapy using sensory integration intervention methods (OT–SI). We examined the structural elements of the measure, including content of assessment reports, availability of specific equipment and adequate space, safety monitoring, and integration of communication with parents and other team members, such as collaborative goal setting with parents or family and teacher education, into the intervention program. Analysis of self-report ratings by 259 occupational therapists from 185 different facilities indicated that the structural section of the ASIFM has acceptable interrater reliability (r ≥ .82) and significantly differentiates between settings in which therapists reportedly do and do not practice OT–SI (p < .001). PMID:25184462
Reliability of sonographic assessment of tendinopathy in tennis elbow.
Poltawski, Leon; Ali, Syed; Jayaram, Vijay; Watson, Tim
2012-01-01
To assess the reliability and compute the minimum detectable change using sonographic scales to quantify the extent of pathology and hyperaemia in the common extensor tendon in people with tennis elbow. The lateral elbows of 19 people with tennis elbow were assessed sonographically twice, 1-2 weeks apart. Greyscale and power Doppler images were recorded for subsequent rating of abnormalities. Tendon thickening, hypoechogenicity, fibrillar disruption and calcification were each rated on four-point scales, and scores were summed to provide an overall rating of structural abnormality; hyperaemia was scored on a five point scale. Inter-rater reliability was established using the intraclass correlation coefficient (ICC) to compare scores assigned independently to the same set of images by a radiologist and a physiotherapist with training in musculoskeletal imaging. Test-retest reliability was assessed by comparing scores assigned by the physiotherapist to images recorded at the two sessions. The minimum detectable change (MDC) was calculated from the test-retest reliability data. ICC values for inter-rater reliability ranged from 0.35 (95% CI: 0.05, 0.60) for fibrillar disruption to 0.77 (0.55, 0.88) for overall greyscale score, and 0.89 (0.79, 0.95) for hyperaemia. Test-retest reliability ranged from 0.70 (0.48, 0.84) for tendon thickening to 0.82 (0.66, 0.90) for overall greyscale score and 0.86 (0.73, 0.93) for calcification. The MDC for the greyscale total score was 2.0/12 and for the hyperaemia score was 1.1/5. The sonographic scoring system used in this study may be used reliably to quantify tendon abnormalities and change over time. A relatively inexperienced imager can conduct the assessment and use the rating scales reliably.
The reliability of the Glasgow Coma Scale: a systematic review.
Reith, Florence C M; Van den Brande, Ruben; Synnot, Anneliese; Gruen, Russell; Maas, Andrew I R
2016-01-01
The Glasgow Coma Scale (GCS) provides a structured method for assessment of the level of consciousness. Its derived sum score is applied in research and adopted in intensive care unit scoring systems. Controversy exists on the reliability of the GCS. The aim of this systematic review was to summarize evidence on the reliability of the GCS. A literature search was undertaken in MEDLINE, EMBASE and CINAHL. Observational studies that assessed the reliability of the GCS, expressed by a statistical measure, were included. Methodological quality was evaluated with the consensus-based standards for the selection of health measurement instruments checklist and its influence on results considered. Reliability estimates were synthesized narratively. We identified 52 relevant studies that showed significant heterogeneity in the type of reliability estimates used, patients studied, setting and characteristics of observers. Methodological quality was good (n = 7), fair (n = 18) or poor (n = 27). In good quality studies, kappa values were ≥0.6 in 85%, and all intraclass correlation coefficients indicated excellent reliability. Poor quality studies showed lower reliability estimates. Reliability for the GCS components was higher than for the sum score. Factors that may influence reliability include education and training, the level of consciousness and type of stimuli used. Only 13% of studies were of good quality and inconsistency in reported reliability estimates was found. Although the reliability was adequate in good quality studies, further improvement is desirable. From a methodological perspective, the quality of reliability studies needs to be improved. From a clinical perspective, a renewed focus on training/education and standardization of assessment is required.
Advances and trends in computational structural mechanics
NASA Technical Reports Server (NTRS)
Noor, A. K.
1986-01-01
Recent developments in computational structural mechanics are reviewed with reference to computational needs for future structures technology, advances in computational models for material behavior, discrete element technology, assessment and control of numerical simulations of structural response, hybrid analysis, and techniques for large-scale optimization. Research areas in computational structural mechanics which have high potential for meeting future technological needs are identified. These include prediction and analysis of the failure of structural components made of new materials, development of computational strategies and solution methodologies for large-scale structural calculations, and assessment of reliability and adaptive improvement of response predictions.
ERIC Educational Resources Information Center
Cook, David A.; Zendejas, Benjamin; Hamstra, Stanley J.; Hatala, Rose; Brydges, Ryan
2014-01-01
Ongoing transformations in health professions education underscore the need for valid and reliable assessment. The current standard for assessment validation requires evidence from five sources: content, response process, internal structure, relations with other variables, and consequences. However, researchers remain uncertain regarding the types…
Assessor Training: Its Effects on Criterion-Based Assessment in a Medical Context
ERIC Educational Resources Information Center
Pell, Godfrey; Homer, Matthew S.; Roberts, Trudie E.
2008-01-01
Increasingly, academic institutions are being required to improve the validity of the assessment process; unfortunately, often this is at the expense of reliability. In medical schools (such as Leeds), standardized tests of clinical skills, such as "Objective Structured Clinical Examinations" (OSCEs) are widely used to assess clinical…
Home Lighting Assessment for Clients With Low Vision
Bhorade, Anjali; Gordon, Mae; Hollingsworth, Holly; Engsberg, Jack E.; Baum, M. Carolyn
2013-01-01
OBJECTIVE. The goal was to develop an objective, comprehensive, near-task home lighting assessment for older adults with low vision. METHOD. A home lighting assessment was developed and tested with older adults with low vision. Interrater and test–retest reliability studies were conducted. Clinical utility was assessed by occupational therapists with expertise in low vision rehabilitation. RESULTS. Interrater reliability was high (intraclass correlation coefficient [ICC] = .83–1.0). Test–retest reliability was moderate (ICC = .67). Responses to a Clinical Utility Feedback Form developed for this study indicated that the Home Environment Lighting Assessment (HELA) has strong clinical utility. CONCLUSION. The HELA provides a structured tool to describe the quantitative and qualitative aspects of home lighting environments where near tasks are performed and can be used to plan lighting interventions. The HELA has the potential to affect assessment and intervention practices of rehabilitation professionals in the area of low vision and improve near-task performance of people with low vision. PMID:24195901
Axis IV--psychosocial and environmental problems--in the DSM-IV.
Ramirez, A; Ekselius, L; Ramklint, M
2013-11-01
The aim of this study was to further explore the properties of axis IV in the Diagnostic and statistical manual of mental disorders, 4th edition (DSM-IV). In a naturalistic cross-sectional design, a group (n = 163) of young (18-25 years old) Swedish psychiatric outpatients was assessed according to DSM-IV. Psychosocial and environmental problems/axis IV were evaluated through structured interviewing by a social worker and by self-assessment on a questionnaire. Reliability between professional assessment and self-assessment of axis IV was examined. Concurrent validity of axis IV was also examined. Reliability between professional and self-assessed axis IV was fair to almost perfect, 0.31-0.83, according to prevalence and bias-adjusted kappa. Categories of psychosocial stress and environmental problems were related to the presence of axis I disorders, co-morbidity, personality disorders and decreasing Global Assessment of Functioning (GAF) values. The revised axis IV according to DSM-IV seems to have concurrent validity, but is still hampered by limited reliability. © 2013 John Wiley & Sons Ltd.
Reliability analysis of the objective structured clinical examination using generalizability theory.
Trejo-Mejía, Juan Andrés; Sánchez-Mendiola, Melchor; Méndez-Ramírez, Ignacio; Martínez-González, Adrián
2016-01-01
The objective structured clinical examination (OSCE) is a widely used method for assessing clinical competence in health sciences education. Studies using this method have shown evidence of validity and reliability. There are no published studies of OSCE reliability measurement with generalizability theory (G-theory) in Latin America. The aims of this study were to assess the reliability of an OSCE in medical students using G-theory and explore its usefulness for quality improvement. An observational cross-sectional study was conducted at National Autonomous University of Mexico (UNAM) Faculty of Medicine in Mexico City. A total of 278 fifth-year medical students were assessed with an 18-station OSCE in a summative end-of-career final examination. There were four exam versions. G-theory with a crossover random effects design was used to identify the main sources of variance. Examiners, standardized patients, and cases were considered as a single facet of analysis. The exam was applied to 278 medical students. The OSCE had a generalizability coefficient of 0.93. The major components of variance were stations, students, and residual error. The sites and the versions of the tests had minimum variance. Our study achieved a G coefficient similar to that found in other reports, which is acceptable for summative tests. G-theory allows the estimation of the magnitude of multiple sources of error and helps decision makers to determine the number of stations, test versions, and examiners needed to obtain reliable measurements.
Reliability analysis of the objective structured clinical examination using generalizability theory.
Trejo-Mejía, Juan Andrés; Sánchez-Mendiola, Melchor; Méndez-Ramírez, Ignacio; Martínez-González, Adrián
2016-01-01
Background The objective structured clinical examination (OSCE) is a widely used method for assessing clinical competence in health sciences education. Studies using this method have shown evidence of validity and reliability. There are no published studies of OSCE reliability measurement with generalizability theory (G-theory) in Latin America. The aims of this study were to assess the reliability of an OSCE in medical students using G-theory and explore its usefulness for quality improvement. Methods An observational cross-sectional study was conducted at National Autonomous University of Mexico (UNAM) Faculty of Medicine in Mexico City. A total of 278 fifth-year medical students were assessed with an 18-station OSCE in a summative end-of-career final examination. There were four exam versions. G-theory with a crossover random effects design was used to identify the main sources of variance. Examiners, standardized patients, and cases were considered as a single facet of analysis. Results The exam was applied to 278 medical students. The OSCE had a generalizability coefficient of 0.93. The major components of variance were stations, students, and residual error. The sites and the versions of the tests had minimum variance. Conclusions Our study achieved a G coefficient similar to that found in other reports, which is acceptable for summative tests. G-theory allows the estimation of the magnitude of multiple sources of error and helps decision makers to determine the number of stations, test versions, and examiners needed to obtain reliable measurements.
O'Grady, Michael G; Dusing, Stacey C
2015-01-01
Play is vital for development. Infants and children learn through play. Traditional standardized developmental tests measure whether a child performs individual skills within controlled environments. Play-based assessments can measure skill performance during natural, child-driven play. The purpose of this study was to systematically review reliability, validity, and responsiveness of all play-based assessments that quantify motor and cognitive skills in children from birth to 36 months of age. Studies were identified from a literature search using PubMed, ERIC, CINAHL, and PsycINFO databases and the reference lists of included papers. Included studies investigated reliability, validity, or responsiveness of play-based assessments that measured motor and cognitive skills for children to 36 months of age. Two reviewers independently screened 40 studies for eligibility and inclusion. The reviewers independently extracted reliability, validity, and responsiveness data. They examined measurement properties and methodological quality of the included studies. Four current play-based assessment tools were identified in 8 included studies. Each play-based assessment tool measured motor and cognitive skills in a different way during play. Interrater reliability correlations ranged from .86 to .98 for motor development and from .23 to .90 for cognitive development. Test-retest reliability correlations ranged from .88 to .95 for motor development and from .45 to .91 for cognitive development. Structural validity correlations ranged from .62 to .90 for motor development and from .42 to .93 for cognitive development. One study assessed responsiveness to change in motor development. Most studies had small and poorly described samples. Lack of transparency in data management and statistical analysis was common. Play-based assessments have potential to be reliable and valid tools to assess cognitive and motor skills, but higher-quality research is needed. Psychometric properties should be considered for each play-based assessment before it is used in clinical and research practice. © 2015 American Physical Therapy Association.
Structural vulnerability assessment using reliability of slabs in avalanche area
NASA Astrophysics Data System (ADS)
Favier, Philomène; Bertrand, David; Eckert, Nicolas; Naaim, Mohamed
2013-04-01
Improvement of risk assessment or hazard zoning requires a better understanding of the physical vulnerability of structures. To consider natural hazard issue such as snow avalanches, once the flow is characterized, highlight on the mechanical behaviour of the structure is a decisive step. A challenging approach is to quantify the physical vulnerability of impacted structures according to various avalanche loadings. The main objective of this presentation is to introduce methodology and outcomes regarding the assessment of vulnerability of reinforced concrete buildings using reliability methods. Reinforced concrete has been chosen as it is one of the usual material used to build structures exposed to potential avalanche loadings. In avalanche blue zones, structures have to resist to a pressure up to 30kPa. Thus, by providing systematic fragility relations linked to the global failure of the structure, this method may serve the avalanche risk assessment. To do so, a slab was numerically designed. It represented the avalanche facing wall of a house. Different configuration cases of the element in stake have been treated to quantify numerical aspects of the problem, such as the boundary conditions or the mechanical behaviour of the structure. The structure is analysed according to four different limit states, semi-local and global failures are considered to describe the slab behaviour. The first state is attained when cracks appear in the tensile zone, then the two next states are described consistent with the Eurocode, the final state is the total collapse of the structure characterized by the yield line theory. Failure probability is estimated in accordance to the reliability framework. Monte Carlo simulations were conducted to quantify the fragility to different loadings. Sensitivity of models in terms of input distributions were defined with statistical tools such as confidence intervals and Sobol's indexes. Conclusion and discussion of this work are established to well determine contributions, limits and future needs or developments of the research. First of all, this study provides spectrum of fragility curves of reinforced concrete structures which could be used to improve risk assessment. Second, the influence of the failure criterion picked up in this survey are discussed. Then, the weight of the statistical distribution choice is analysed. Finally, the limit between vulnerability and fragility relations is set up to establish the boundary use of our approach.
2008-10-01
provide adequate means for thermal heat dissipation and cooling. Thus electronic packaging has four main functions [1]: • Signal distribution which... dissipation , involving structural and materials consideration. • Mechanical, chemical and electromagnetic protection of components and... nature when compared to phenomenological models. Microelectronic packaging industry spends typically several months building and reliability
Weyers, Simone; Jemi, Iman; Karger, André; Raski, Bianca; Rotthoff, Thomas; Pentzek, Michael; Mortsiefer, Achim
2016-01-01
Background: Imparting communication skills has been given great importance in medical curricula. In addition to standardized assessments, students should communicate with real patients in actual clinical situations during workplace-based assessments and receive structured feedback on their performance. The aim of this project was to pilot a formative testing method for workplace-based assessment. Our investigation centered in particular on whether or not physicians view the method as feasible and how high acceptance is among students. In addition, we assessed the reliability of the method. Method: As part of the project, 16 students held two consultations each with chronically ill patients at the medical practice where they were completing GP training. These consultations were video-recorded. The trained mentoring physician rated the student’s performance and provided feedback immediately following the consultations using the Berlin Global Rating scale (BGR). Two impartial, trained raters also evaluated the videos using BGR. For qualitative and quantitative analysis, information on how physicians and students viewed feasibility and their levels of acceptance was collected in written form in a partially standardized manner. To test for reliability, the test-retest reliability was calculated for both of the overall evaluations given by each rater. The inter-rater reliability was determined for the three evaluations of each individual consultation. Results: The formative assessment method was rated positively by both physicians and students. It is relatively easy to integrate into daily routines. Its significant value lies in the personal, structured and recurring feedback. The two overall scores for each patient consultation given by the two impartial raters correlate moderately. The degree of uniformity among the three raters in respect to the individual consultations is low. Discussion: Within the scope of this pilot project, only a small sample of physicians and students could be surveyed to a limited extent. There are indications that the assessment can be improved by integrating more information on medical context and student self-assessments. Despite the current limitations regarding test criteria, it is clear that workplace-based assessment of communication skills in the clinical setting is a valuable addition to the communication curricula of medical schools. PMID:27990466
Weyers, Simone; Jemi, Iman; Karger, André; Raski, Bianca; Rotthoff, Thomas; Pentzek, Michael; Mortsiefer, Achim
2016-01-01
Background: Imparting communication skills has been given great importance in medical curricula. In addition to standardized assessments, students should communicate with real patients in actual clinical situations during workplace-based assessments and receive structured feedback on their performance. The aim of this project was to pilot a formative testing method for workplace-based assessment. Our investigation centered in particular on whether or not physicians view the method as feasible and how high acceptance is among students. In addition, we assessed the reliability of the method. Method: As part of the project, 16 students held two consultations each with chronically ill patients at the medical practice where they were completing GP training. These consultations were video-recorded. The trained mentoring physician rated the student's performance and provided feedback immediately following the consultations using the Berlin Global Rating scale (BGR). Two impartial, trained raters also evaluated the videos using BGR. For qualitative and quantitative analysis, information on how physicians and students viewed feasibility and their levels of acceptance was collected in written form in a partially standardized manner. To test for reliability, the test-retest reliability was calculated for both of the overall evaluations given by each rater. The inter-rater reliability was determined for the three evaluations of each individual consultation. Results: The formative assessment method was rated positively by both physicians and students. It is relatively easy to integrate into daily routines. Its significant value lies in the personal, structured and recurring feedback. The two overall scores for each patient consultation given by the two impartial raters correlate moderately. The degree of uniformity among the three raters in respect to the individual consultations is low. Discussion: Within the scope of this pilot project, only a small sample of physicians and students could be surveyed to a limited extent. There are indications that the assessment can be improved by integrating more information on medical context and student self-assessments. Despite the current limitations regarding test criteria, it is clear that workplace-based assessment of communication skills in the clinical setting is a valuable addition to the communication curricula of medical schools.
Inter-Observer Reliability of DSM-5 Substance Use Disorders*
Denis, Cécile M.; Gelernter, Joel; Hart, Amy B.; Kranzler, Henry R.
2015-01-01
Aims Although studies have examined the impact of changes made in DSM-5 on the estimated prevalence of substance use disorder (SUD) diagnoses, there is limited evidence of the reliability of DSM-5 SUDs. We evaluated the inter-observer reliability of four DSM-5 SUDs in a sample in which we had previously evaluated the reliability of DSM-IV diagnoses, allowing us to compare the two systems. Methods Two different interviewers each assessed 173 subjects over a 2-week period using the Semi-Structured Assessment for Drug Dependence and Alcoholism (SSADDA). Using the percent agreement and kappa (κ) coefficient, we examined the reliability of DSM-5 lifetime alcohol, opioid, cocaine, and cannabis use disorders, which we compared to that of SSADDA-derived DSM-IV SUD diagnoses. We also assessed the effect of additional lifetime SUD and lifetime mood or anxiety disorder diagnoses on the reliability of the DSM-5 SUD diagnoses. Results Reliability was good to excellent for the four disorders, with κ values ranging from 0.65 to 0.94. Agreement was consistently lower for SUDs of mild severity than for moderate or severe disorders. DSM-5 SUD diagnoses showed greater reliability than DSM-IV diagnoses of abuse or dependence or dependence only. Co-occurring SUD and lifetime mood or anxiety disorders exerted a modest effect on the reliability of the DSM-5 SUD diagnoses. Conclusions For alcohol, opioid, cocaine and cannabis use disorders, DSM-5 criteria and diagnoses are at least as reliable as those of DSM-IV. PMID:26048641
Angst, Ueli M.; Boschmann, Carolina; Wagner, Matthias; Elsener, Bernhard
2017-01-01
The aging of reinforced concrete infrastructure in developed countries imposes an urgent need for methods to reliably assess the condition of these structures. Corrosion of the embedded reinforcing steel is the most frequent cause for degradation. While it is well known that the ability of a structure to withstand corrosion depends strongly on factors such as the materials used or the age, it is common practice to rely on threshold values stipulated in standards or textbooks. These threshold values for corrosion initiation (Ccrit) are independent of the actual properties of a certain structure, which clearly limits the accuracy of condition assessments and service life predictions. The practice of using tabulated values can be traced to the lack of reliable methods to determine Ccrit on-site and in the laboratory. Here, an experimental protocol to determine Ccrit for individual engineering structures or structural members is presented. A number of reinforced concrete samples are taken from structures and laboratory corrosion testing is performed. The main advantage of this method is that it ensures real conditions concerning parameters that are well known to greatly influence Ccrit, such as the steel-concrete interface, which cannot be representatively mimicked in laboratory-produced samples. At the same time, the accelerated corrosion test in the laboratory permits the reliable determination of Ccrit prior to corrosion initiation on the tested structure; this is a major advantage over all common condition assessment methods that only permit estimating the conditions for corrosion after initiation, i.e., when the structure is already damaged. The protocol yields the statistical distribution of Ccrit for the tested structure. This serves as a basis for probabilistic prediction models for the remaining time to corrosion, which is needed for maintenance planning. This method can potentially be used in material testing of civil infrastructures, similar to established methods used for mechanical testing. PMID:28892023
Angst, Ueli M; Boschmann, Carolina; Wagner, Matthias; Elsener, Bernhard
2017-08-31
The aging of reinforced concrete infrastructure in developed countries imposes an urgent need for methods to reliably assess the condition of these structures. Corrosion of the embedded reinforcing steel is the most frequent cause for degradation. While it is well known that the ability of a structure to withstand corrosion depends strongly on factors such as the materials used or the age, it is common practice to rely on threshold values stipulated in standards or textbooks. These threshold values for corrosion initiation (Ccrit) are independent of the actual properties of a certain structure, which clearly limits the accuracy of condition assessments and service life predictions. The practice of using tabulated values can be traced to the lack of reliable methods to determine Ccrit on-site and in the laboratory. Here, an experimental protocol to determine Ccrit for individual engineering structures or structural members is presented. A number of reinforced concrete samples are taken from structures and laboratory corrosion testing is performed. The main advantage of this method is that it ensures real conditions concerning parameters that are well known to greatly influence Ccrit, such as the steel-concrete interface, which cannot be representatively mimicked in laboratory-produced samples. At the same time, the accelerated corrosion test in the laboratory permits the reliable determination of Ccrit prior to corrosion initiation on the tested structure; this is a major advantage over all common condition assessment methods that only permit estimating the conditions for corrosion after initiation, i.e., when the structure is already damaged. The protocol yields the statistical distribution of Ccrit for the tested structure. This serves as a basis for probabilistic prediction models for the remaining time to corrosion, which is needed for maintenance planning. This method can potentially be used in material testing of civil infrastructures, similar to established methods used for mechanical testing.
Methodology Series Module 9: Designing Questionnaires and Clinical Record Forms - Part II.
Setia, Maninder Singh
2017-01-01
This article is a continuation of the previous module on designing questionnaires and clinical record form in which we have discussed some basic points about designing the questionnaire and clinical record forms. In this section, we will discuss the reliability and validity of questionnaires. The different types of validity are face validity, content validity, criterion validity, and construct validity. The different types of reliability are test-retest reliability, inter-rater reliability, and intra-rater reliability. Some of these parameters are assessed by subject area experts. However, statistical tests should be used for evaluation of other parameters. Once the questionnaire has been designed, the researcher should pilot test the questionnaire. The items in the questionnaire should be changed based on the feedback from the pilot study participants and the researcher's experience. After the basic structure of the questionnaire has been finalized, the researcher should assess the validity and reliability of the questionnaire or the scale. If an existing standard questionnaire is translated in the local language, the researcher should assess the reliability and validity of the translated questionnaire, and these values should be presented in the manuscript. The decision to use a self- or interviewer-administered, paper- or computer-based questionnaire depends on the nature of the questions, literacy levels of the target population, and resources.
Methodology Series Module 9: Designing Questionnaires and Clinical Record Forms – Part II
Setia, Maninder Singh
2017-01-01
This article is a continuation of the previous module on designing questionnaires and clinical record form in which we have discussed some basic points about designing the questionnaire and clinical record forms. In this section, we will discuss the reliability and validity of questionnaires. The different types of validity are face validity, content validity, criterion validity, and construct validity. The different types of reliability are test-retest reliability, inter-rater reliability, and intra-rater reliability. Some of these parameters are assessed by subject area experts. However, statistical tests should be used for evaluation of other parameters. Once the questionnaire has been designed, the researcher should pilot test the questionnaire. The items in the questionnaire should be changed based on the feedback from the pilot study participants and the researcher's experience. After the basic structure of the questionnaire has been finalized, the researcher should assess the validity and reliability of the questionnaire or the scale. If an existing standard questionnaire is translated in the local language, the researcher should assess the reliability and validity of the translated questionnaire, and these values should be presented in the manuscript. The decision to use a self- or interviewer-administered, paper- or computer-based questionnaire depends on the nature of the questions, literacy levels of the target population, and resources. PMID:28584367
Structured implicit review: a new method for monitoring nursing care quality.
Pearson, M L; Lee, J L; Chang, B L; Elliott, M; Kahn, K L; Rubenstein, L V
2000-11-01
Nurses' independent decisions about assessment, treatment, and nursing interventions for hospitalized patients are important determinants of quality of care. Physician peer implicit review of medical records has been central to Medicare quality management and is considered the gold standard for reviewing physician care, but peer implicit review of nursing processes of care has not received similar attention. The objective of this study was to develop and evaluate nurse structured implicit review (SIR) methods. We developed SIR instruments for rating the quality of inpatient nursing care for congestive heart failure (CHF) and cerebrovascular accident (CVA). Nurse reviewers used the SIR form to rate a nationally representative sample of randomly selected medical records for each disease from 297 acute care hospitals in 5 states (collected by the RAND-HCFA Prospective Payment System study). The study subjects were elderly Medicare inpatients with CHF (n = 291) or CVA (n = 283). We developed and tested scales reflecting domains of nursing process, evaluated interrater and interitem reliability, and assessed the extent to which items and scales predicted overall ratings of the quality of nursing care. Interrater reliability for 14 of 16 scales (CHF) or 10 of 16 scales (CVA) was > or = 0.40. Interitem reliability was > 0.80 for all but 1 scale (both diseases). Functional Assessment, Physical Assessment, and Medication Tracking ratings were the strongest predictors of overall nursing quality ratings (P < 0.001 for each). Nurse peer review with SIR has adequate interrater and excellent scale reliabilities and can be a valuable tool for assessing nurse performance.
de Montbrun, Sandra; Roberts, Patricia L; Satterthwaite, Lisa; MacRae, Helen
2016-07-01
To implement the Colorectal Objective Structured Assessment of Technical skill (COSATS) into American Board of Colon and Rectal Surgery (ABCRS) certification and build evidence of validity for the interpretation of the scores of this high stakes assessment tool. Currently, technical skill assessment is not a formal component of board certification. With the technical demands of surgical specialties, documenting competence in technical skill at the time of certification with a valid tool is ideal. In September 2014, the COSATS was a mandatory component of ABCRS certification. Seventy candidates took the examination, with their performance evaluated by expert colorectal surgeons using a task-specific checklist, global rating scale, and overall performance scale. Passing scores were set and compared using 2 standard setting methodologies, using a compensatory and conjunctive model. Inter-rater reliability and the reliability of the pass/fail decision were calculated using Cronbach alpha and Subkoviak methodology, respectively. Overall COSATS scores and pass/fail status were compared with results on the ABCRS oral examination. The pass rate ranged from 85.7% to 90%. Inter-rater reliability (0.85) and reliability of the pass/fail decision (0.87 and 0.84) were high. A low positive correlation (r= 0.25) was seen between the COSATS and oral examination. All individuals who failed the COSATS passed the ABCRS oral examination. COSATS is the first technical skill examination used in national surgical board certification. This study suggests that the current certification process may be failing to identify individuals who have demonstrated technical deficiencies on this standardized assessment tool.
The Structure of Women's Mood in the Early Postpartum
ERIC Educational Resources Information Center
Buttner, Melissa M.; O'Hara, Michael W.; Watson, David
2012-01-01
The "postpartum blues" is a mild, predictable mood disturbance occurring within the first several days following childbirth. Previous analyses of the "blues" symptom structure yielded inconclusive findings, making reliable assessment a significant methodological limitation. The current study aimed to explicate the symptom…
Nadkarni, Lindsay D; Roskind, Cindy G; Auerbach, Marc A; Calhoun, Aaron W; Adler, Mark D; Kessler, David O
2018-04-01
The aim of this study was to assess the validity of a formative feedback instrument for leaders of simulated resuscitations. This is a prospective validation study with a fully crossed (person × scenario × rater) study design. The Concise Assessment of Leader Management (CALM) instrument was designed by pediatric emergency medicine and graduate medical education experts to be used off the shelf to evaluate and provide formative feedback to resuscitation leaders. Four experts reviewed 16 videos of in situ simulated pediatric resuscitations and scored resuscitation leader performance using the CALM instrument. The videos consisted of 4 pediatric emergency department resuscitation teams each performing in 4 pediatric resuscitation scenarios (cardiac arrest, respiratory arrest, seizure, and sepsis). We report on content and internal structure (reliability) validity of the CALM instrument. Content validity was supported by the instrument development process that involved professional experience, expert consensus, focused literature review, and pilot testing. Internal structure validity (reliability) was supported by the generalizability analysis. The main component that contributed to score variability was the person (33%), meaning that individual leaders performed differently. The rater component had almost zero (0%) contribution to variance, which implies that raters were in agreement and argues for high interrater reliability. These results provide initial evidence to support the validity of the CALM instrument as a reliable assessment instrument that can facilitate formative feedback to leaders of pediatric simulated resuscitations.
North Carolina Family Assessment Scale: Measurement Properties for Youth Mental Health Services
ERIC Educational Resources Information Center
Lee, Bethany R.; Lindsey, Michael A.
2010-01-01
Objective: The purpose of this study is to assess the reliability and validity of the North Carolina Family Assessment Scale (NCFAS) among families involved with youth mental health services. Methods: Using NCFAS data collected by child mental health intake workers with 158 families, factor analysis was conducted to assess factor structure, and…
Lalanne, Christophe; Chassany, Olivier; Carrieri, Patrizia; Marcellin, Fabienne; Armstrong, Andrew R; Lert, France; Spire, Bruno; Dray-Spira, Rosemary; Duracinsky, Martin
2016-04-01
To identify a simplified factor structure for the PROQOL-human immunodeficiency virus (HIV) questionnaire to improve the measurement of the health-related quality of life (HRQL) of HIV-positive patients in clinical care and research settings. HRQL data were collected using the eight-dimension PROQOL-HIV questionnaire from 2,537 patients (VESPA2 study). Exploratory factor analysis (EFA) and confirmatory factor analysis (CFA) validated a simpler four-factor structure and assessed measurement invariance (MI). Multigroup analysis assessed the effect of sex, age, and antiretroviral therapy (ART) on the resulting factor scores. Correlations with symptom and Short Form (SF)-12 self-reports assessed convergent validity. Item analysis, EFA, and CFAs confirmed the validity [comparative fit index (CFI), 0.948; root mean square error of approximation, 0.064] and reliability (α's ≥ 0.8) of four dimensions: physical health and symptoms, health concerns and mental distress, social and intimate relationships, and treatment-related impact. Strong MI was demonstrated across sex and age (decrease in CFI <0.01). A multiple-cause multiple-indicator model indicated that HRQL correlated as expected with sex, age, and the ART status. Correlations of HRQL, symptom reports, and SF-12 scores evidenced convergent validity criterion. The simplified factor structure and scoring scheme for PROQOL-HIV will allow clinicians to monitor with greater reliability the HRQL of patients in clinical care and research settings. Copyright © 2016 Elsevier Inc. All rights reserved.
Griffiths, A; Cox, T; Karanika, M; Khan, S; Tomás, J M
2006-10-01
To examine the factor structure, reliability, and validity of a new context-specific questionnaire for the assessment of work and organisational factors. The Work Organisation Assessment Questionnaire (WOAQ) was developed as part of a risk assessment and risk reduction methodology for hazards inherent in the design and management of work in the manufacturing sector. Two studies were conducted. Data were collected from 524 white- and blue-collar employees from a range of manufacturing companies. Exploratory factor analysis was carried out on 28 items that described the most commonly reported failures of work design and management in companies in the manufacturing sector. Concurrent validity data were also collected. A reliability study was conducted with a further 156 employees. Principal component analysis, with varimax rotation, revealed a strong 28-item, five factor structure. The factors were named: quality of relationships with management, reward and recognition, workload, quality of relationships with colleagues, and quality of physical environment. Analyses also revealed a more general summative factor. Results indicated that the questionnaire has good internal consistency and test-retest reliability and validity. Being associated with poor employee health and changes in health related behaviour, the WOAQ factors are possible hazards. It is argued that the strength of those associations offers some estimation of risk. Feedback from the organisations involved indicated that the WOAQ was easy to use and meaningful for them as part of their risk assessment procedures. The studies reported here describe a model of the hazards to employee health and health related behaviour inherent in the design and management of work in the manufacturing sector. It offers an instrument for their assessment. The scales derived which form the WOAQ were shown to be reliable, valid, and meaningful to the user population.
Thimm, Jens C
2017-12-01
The Computerized Adaptive Test of Personality Disorder-Static Form (CAT-PD-SF) is a self-report inventory developed to assess pathological personality traits. The current study explored the reliability and higher order factor structure of the Norwegian version of the CAT-PD-SF and the relationships between the CAT-PD traits and domains of personality functioning in an undergraduate student sample ( N = 375). In addition to the CAT-PD-SF, the short form of the Severity Indices of Personality Problems and the Brief Symptom Inventory were administered. The results showed that the Norwegian CAT-PD-SF has good score reliability. Factor analysis of the CAT-PD-SF scales indicated five superordinate factors that correspond to the trait domains of the alternative DSM-5 model for personality disorders. The CAT-PD traits were highly predictive of impaired personality functioning after controlling for psychological distress. It is concluded that the CAT-PD-SF is a promising tool for the assessment of personality disorder traits.
Wilby, K J; Black, E K; Austin, Z; Mukhalalati, B; Aboulsoud, S; Khalifa, S I
2016-07-10
This study aimed to evaluate the feasibility and psychometric defensibility of implementing a comprehensive objective structured clinical examination (OSCE) on the complete pharmacy programme for pharmacy students in a Middle Eastern context, and to identify facilitators and barriers to implementation within new settings. Eight cases were developed, validated, and had standards set according to a blueprint, and were assessed with graduating pharmacy students. Assessor reliability was evaluated using inter-class coefficients (ICCs). Concurrent validity was evaluated by comparing OSCE results to professional skills course grades. Field notes were maintained to generate recommendations for implementation in other contexts. The examination pass mark was 424 points out of 700 (60.6%). All 23 participants passed. Mean performance was 74.6%. Low to moderate inter-rater reliability was obtained for analytical and global components (average ICC 0.77 and 0.48, respectively). In conclusion, OSCE was feasible in Qatar but context-related validity and reliability concerns must be addressed prior to future iterations in Qatar and elsewhere.
NASA Astrophysics Data System (ADS)
Abramov, Ivan
2018-03-01
Development of design documentation for a future construction project gives rise to a number of issues with the main one being selection of manpower for structural units of the project's overall implementation system. Well planned and competently staffed integrated structural construction units will help achieve a high level of reliability and labor productivity and avoid negative (extraordinary) situations during the construction period eventually ensuring improved project performance. Research priorities include the development of theoretical recommendations for enhancing reliability of a structural unit staffed as an integrated construction crew. The author focuses on identification of destabilizing factors affecting formation of an integrated construction crew; assessment of these destabilizing factors; based on the developed mathematical model, highlighting the impact of these factors on the integration criterion with subsequent identification of an efficiency and reliability criterion for the structural unit in general. The purpose of this article is to develop theoretical recommendations and scientific and methodological provisions of an organizational and technological nature in order to identify a reliability criterion for a structural unit based on manpower integration and productivity criteria. With this purpose in mind, complex scientific tasks have been defined requiring special research, development of corresponding provisions and recommendations based on the system analysis findings presented herein.
2012-01-01
Background Preventive child health care is well suited for the early detection of parenting and developmental problems. However, as far as the younger age group is concerned, there are no validated early detection instruments which cover both the child and its environment. Therefore, we have developed a broad-scope structured interview which assesses parents’ concerns and their need for support, using both the parental perspective and the experience of the child health care nurse: the Structured Problem Analysis of Raising Kids (SPARK). This study reports the psychometric characteristics of the SPARK. Method A cross-sectional study of 2012 18-month-old children, living in Zeeland, a province of the Netherlands. Inter-rater reliability was assessed in 67 children. Convergent validity was assessed by comparing SPARK-domains with domains in self-report questionnaires on child development and parenting stress. Discriminative validity was assessed by comparing different outcomes of the SPARK between groups with different levels of socio-economic status and by performing an extreme-groups comparison. The user experience of both parents and nurses was assessed with the aid of an online survey. Results The response rate was 92.1% for the SPARK. Self-report questionnaires were returned in the case of 66.9% of the remaining 1721 children. There was selective non-reporting: 33.1% of the questionnaires were not returned, covering 65.2% of the children with a high-risk label according to the SPARK (p < 0.001). Inter-rater reliability was good to excellent with intraclass correlations between 0.85 and 1.0 for physical topics; between 0.61 and 0.8 for social-emotional topics and 0.92 for the overall risk assessment. Convergent validity was unexpectedly low (all correlations ≤0.3) although the pattern was as expected. Discriminative validity was good. Users were satisfied with the SPARK and identified some topics for improvement. Conclusion The SPARK discriminates between children with a high, increased and low risk of parenting and developmental problems. It does so in a reliable way, but more research is needed on aspects of validity and in other populations. PMID:22697218
Development and validation of an instrument to assess perceived social influence on health behaviors
HOLT, CHERYL L.; CLARK, EDDIE M.; ROTH, DAVID L.; CROWTHER, MARTHA; KOHLER, CONNIE; FOUAD, MONA; FOUSHEE, RUSTY; LEE, PATRICIA A.; SOUTHWARD, PENNY L.
2012-01-01
Assessment of social influence on health behavior is often approached through a situational context. The current study adapted an existing, theory-based instrument from another content domain to assess Perceived Social Influence on Health Behavior (PSI-HB) among African Americans, using an individual difference approach. The adapted instrument was found to have high internal reliability (α = .81–.84) and acceptable testretest reliability (r = .68–.85). A measurement model revealed a three-factor structure and supported the theoretical underpinnings. Scores were predictive of health behaviors, particularly among women. Future research using the new instrument may have applied value assessing social influence in the context of health interventions. PMID:20522506
Skritskaya, Natalia A; Mauro, Christine; Olonoff, Matthew; Qiu, Xin; Duncan, Sarah; Wang, Yuanjia; Duan, Naihua; Lebowitz, Barry; Reynolds, Charles F; Simon, Naomi M; Zisook, Sidney; Shear, M Katherine
2017-05-01
Maladaptive cognitions related to loss are thought to contribute to development of complicated grief and are crucial to address in treatment, but tools available to assess them are limited. This paper introduces the Typical Beliefs Questionnaire (TBQ), a 25-item self-report instrument to assess cognitions that interfere with adaptation to loss. Study participants completed an assessment battery during their initial evaluation and again after completing treatment at 20 weeks. Test-retest reliability was assessed on a subsample of the participants who did not show change in complicated grief severity after the first 4 weeks of treatment. To examine latent structure of the TBQ, an exploratory factor analysis (EFA) was performed. Academic medical centers in Boston, New York, Pittsburgh, and San Diego from 2010-2014. 394 bereaved adults who met criteria for complicated grief. The TBQ along with assessments of complicated grief symptoms and related avoidance, depression symptoms, functional impairment, and perceived social support. The TBQ exhibited good internal consistency (α = 0.82) and test-retest reliability (N = 105; intraclass correlation coefficient = 0.74). EFA indicated a five-factor structure: "Protesting the Death," "Negative Thoughts About the World," "Needing the Person," "Less Grief is Wrong" and "Grieving Too Much." The total score and all factors showed sensitivity to change with treatment. This new tool allows a clinician to quickly and reliably ascertain presence of specific maladaptive cognitions related to complicated grief, and subsequently, to use the information to aid a diagnostic assessment, to structure the treatment, and to measure treatment outcomes. Copyright © 2016 American Association for Geriatric Psychiatry. Published by Elsevier Inc. All rights reserved.
Internet addiction assessment tools: dimensional structure and methodological status.
Lortie, Catherine L; Guitton, Matthieu J
2013-07-01
Excessive internet use is becoming a concern, and some have proposed that it may involve addiction. We evaluated the dimensions assessed by, and psychometric properties of, a range of questionnaires purporting to assess internet addiction. Fourteen questionnaires were identified purporting to assess internet addiction among adolescents and adults published between January 1993 and October 2011. Their reported dimensional structure, construct, discriminant and convergent validity and reliability were assessed, as well as the methods used to derive these. Methods used to evaluate internet addiction questionnaires varied considerably. Three dimensions of addiction predominated: compulsive use (79%), negative outcomes (86%) and salience (71%). Less common were escapism (21%), withdrawal symptoms (36%) and other dimensions. Measures of validity and reliability were found to be within normally acceptable limits. There is a broad convergence of questionnaires purporting to assess internet addiction suggesting that compulsive use, negative outcome and salience should be covered and the questionnaires show adequate psychometric properties. However, the methods used to evaluate the questionnaires vary widely and possible factors contributing to excessive use such as social motivation do not appear to be covered. © 2013 Society for the Study of Addiction.
Stirling engine - Approach for long-term durability assessment
NASA Technical Reports Server (NTRS)
Tong, Michael T.; Bartolotta, Paul A.; Halford, Gary R.; Freed, Alan D.
1992-01-01
The approach employed by NASA Lewis for the long-term durability assessment of the Stirling engine hot-section components is summarized. The approach consists of: preliminary structural assessment; development of a viscoplastic constitutive model to accurately determine material behavior under high-temperature thermomechanical loads; an experimental program to characterize material constants for the viscoplastic constitutive model; finite-element thermal analysis and structural analysis using a viscoplastic constitutive model to obtain stress/strain/temperature at the critical location of the hot-section components for life assessment; and development of a life prediction model applicable for long-term durability assessment at high temperatures. The approach should aid in the provision of long-term structural durability and reliability of Stirling engines.
Ferris, M; Cohen, S; Haberman, C; Javalkar, K; Massengill, S; Mahan, J D; Kim, S; Bickford, K; Cantu, G; Medeiros, M; Phillips, A; Ferris, M T; Hooper, S R
2015-01-01
The Self-Management and Transition to Adulthood with Rx=Treatment (STARx) Questionnaire was developed to collect information on self-management and health care transition (HCT) skills, via self-report, in a broad population of adolescents and young adults (AYAs) with chronic conditions. Over several iterations, the STARx questionnaire was created with AYA, family, and health provider input. The development and pilot testing of the STARx Questionnaire took place with the assistance of 1219 AYAs with different chronic health conditions, in multiple institutions and settings over three phases: item development, pilot testing, reliability and factor structuring. The three development phases resulted in a final version of the STARx Questionnaire. The exploratory factor analysis of the third version of the 18-item STARx identified six factors that accounted for about 65% of the variance: Medication management, Provider communication, Engagement during appointments, Disease knowledge, Adult health responsibilities, and Resource utilization. Reliability estimates revealed good internal consistency and temporal stability, with the alpha coefficient for the overall scale being .80. The STARx was developmentally sensitive, with older patients scoring significantly higher on nearly every factor than younger patients. The STARx Questionnaire is a reliable, self-report tool with adequate internal consistency, temporal stability, and a strong, multidimensional factor structure. It provides another assessment strategy to measure self-management and transition skills in AYAs with chronic conditions. Copyright © 2015 Elsevier Inc. All rights reserved.
Assessing the applicability of template-based protein docking in the twilight zone.
Negroni, Jacopo; Mosca, Roberto; Aloy, Patrick
2014-09-02
The structural modeling of protein interactions in the absence of close homologous templates is a challenging task. Recently, template-based docking methods have emerged to exploit local structural similarities to help ab-initio protocols provide reliable 3D models for protein interactions. In this work, we critically assess the performance of template-based docking in the twilight zone. Our results show that, while it is possible to find templates for nearly all known interactions, the quality of the obtained models is rather limited. We can increase the precision of the models at expenses of coverage, but it drastically reduces the potential applicability of the method, as illustrated by the whole-interactome modeling of nine organisms. Template-based docking is likely to play an important role in the structural characterization of the interaction space, but we still need to improve the repertoire of structural templates onto which we can reliably model protein complexes. Copyright © 2014 Elsevier Ltd. All rights reserved.
Validation of the Dutch Eating Behaviour Questionnaire (DEBQ) among Maltese women.
Dutton, Elaine; Dovey, Terence M
2016-12-01
The main aim of this study was to assess the dimensional structure of the Maltese version of the Dutch Eating Behaviour Questionnaire (DEBQ) and evaluate the instrument's validity and reliability among Maltese women (N = 586). Exploratory factor analysis reflected the theoretical structure of three factors; emotional, restrained and external eating which was supported by a Confirmatory Factor analysis. Minor issues with specific items in the Emotional and External eating scale were identified and discussed. Criterion-related validity was ascertained through correlations with the EAT-26. The study also assessed the DEBQ's predictive value in differentiating between BMI groups and between dieters and weight maintainers. The results suggest that the Maltese DEBQ is a psychometrically valid and reliable instrument for assessing eating behaviours with women in the Maltese community. The study also highlights the critical role of Emotional and Restrained eating in dieting and overweight Maltese women. Copyright © 2016 Elsevier Ltd. All rights reserved.
Innes, Ev; Straker, Leon
2003-01-01
The purpose of this study was to understand the current beliefs of therapists in Australia, and the strategies they use to address the issues of credibility, reliability, consistency, trustworthiness, validity, generalisability and quality in conducting work-related assessments. In-depth semi-structured interviews were conducted with 26 occupational therapists and physiotherapists from around Australia. Participants expressed the belief that the therapist was the assessment instrument and was central to the credibility of an assessment. Conflict was reported when participants modified standardised assessments in an attempt to focus on context relevant activities and tasks. Participants were aware of the issues of reliability and validity but believed it was not practical to establish these aspects formally in most work-related assessments. The strategies used to achieve credibility, reliability, consistency, trustworthiness, validity, generalisability and quality were similar to those recommended for use in qualitative research. The strategies identified in this study can provide the basis for therapists to examine how they conduct work-related assessments and consider whether they currently use these strategies or have the opportunity to implement others.
A Protocol for Advanced Psychometric Assessment of Surveys
Squires, Janet E.; Hayduk, Leslie; Hutchinson, Alison M.; Cranley, Lisa A.; Gierl, Mark; Cummings, Greta G.; Norton, Peter G.; Estabrooks, Carole A.
2013-01-01
Background and Purpose. In this paper, we present a protocol for advanced psychometric assessments of surveys based on the Standards for Educational and Psychological Testing. We use the Alberta Context Tool (ACT) as an exemplar survey to which this protocol can be applied. Methods. Data mapping, acceptability, reliability, and validity are addressed. Acceptability is assessed with missing data frequencies and the time required to complete the survey. Reliability is assessed with internal consistency coefficients and information functions. A unitary approach to validity consisting of accumulating evidence based on instrument content, response processes, internal structure, and relations to other variables is taken. We also address assessing performance of survey data when aggregated to higher levels (e.g., nursing unit). Discussion. In this paper we present a protocol for advanced psychometric assessment of survey data using the Alberta Context Tool (ACT) as an exemplar survey; application of the protocol to the ACT survey is underway. Psychometric assessment of any survey is essential to obtaining reliable and valid research findings. This protocol can be adapted for use with any nursing survey. PMID:23401759
Probabilistic Assessment of High-Throughput Wireless Sensor Networks
Kim, Robin E.; Mechitov, Kirill; Sim, Sung-Han; Spencer, Billie F.; Song, Junho
2016-01-01
Structural health monitoring (SHM) using wireless smart sensors (WSS) has the potential to provide rich information on the state of a structure. However, because of their distributed nature, maintaining highly robust and reliable networks can be challenging. Assessing WSS network communication quality before and after finalizing a deployment is critical to achieve a successful WSS network for SHM purposes. Early studies on WSS network reliability mostly used temporal signal indicators, composed of a smaller number of packets, to assess the network reliability. However, because the WSS networks for SHM purpose often require high data throughput, i.e., a larger number of packets are delivered within the communication, such an approach is not sufficient. Instead, in this study, a model that can assess, probabilistically, the long-term performance of the network is proposed. The proposed model is based on readily-available measured data sets that represent communication quality during high-throughput data transfer. Then, an empirical limit-state function is determined, which is further used to estimate the probability of network communication failure. Monte Carlo simulation is adopted in this paper and applied to a small and a full-bridge wireless networks. By performing the proposed analysis in complex sensor networks, an optimized sensor topology can be achieved. PMID:27258270
A proposed method to investigate reliability throughout a questionnaire
2011-01-01
Background Questionnaires are used extensively in medical and health care research and depend on validity and reliability. However, participants may differ in interest and awareness throughout long questionnaires, which can affect reliability of their answers. A method is proposed for "screening" of systematic change in random error, which could assess changed reliability of answers. Methods A simulation study was conducted to explore whether systematic change in reliability, expressed as changed random error, could be assessed using unsupervised classification of subjects by cluster analysis (CA) and estimation of intraclass correlation coefficient (ICC). The method was also applied on a clinical dataset from 753 cardiac patients using the Jalowiec Coping Scale. Results The simulation study showed a relationship between the systematic change in random error throughout a questionnaire and the slope between the estimated ICC for subjects classified by CA and successive items in a questionnaire. This slope was proposed as an awareness measure - to assessing if respondents provide only a random answer or one based on a substantial cognitive effort. Scales from different factor structures of Jalowiec Coping Scale had different effect on this awareness measure. Conclusions Even though assumptions in the simulation study might be limited compared to real datasets, the approach is promising for assessing systematic change in reliability throughout long questionnaires. Results from a clinical dataset indicated that the awareness measure differed between scales. PMID:21974842
ERIC Educational Resources Information Center
Witwer, Andrea N.; Lecavalier, Luc; Norris, Megan
2012-01-01
The "Children's Interview for Psychiatric Syndromes-Parent Version" (P-ChIPS) is a structured psychiatric interview designed to assess the presence of psychiatric disorders in children and adolescents. This study examined the reliability and validity of the P-ChIPS in 61 youngsters (6- to 17-years-old) with Autism Spectrum Disorders. Reliability…
ERIC Educational Resources Information Center
Martinkova, Patricia; Goldhaber, Dan
2015-01-01
Inter-rater reliability, commonly assessed by intra-class correlation coefficient ICC, is an important index for describing the extent to which there is consistency amongst two or more raters in assigned measures. In organizational research, the data structure is often hierarchical and designs deviate substantially from the ideal of a balanced…
Reliable and valid assessment of point-of-care ultrasonography.
Todsen, Tobias; Tolsgaard, Martin Grønnebæk; Olsen, Beth Härstedt; Henriksen, Birthe Merete; Hillingsø, Jens Georg; Konge, Lars; Jensen, Morten Lind; Ringsted, Charlotte
2015-02-01
To explore the reliability and validity of the Objective Structured Assessment of Ultrasound Skills (OSAUS) scale for point-of-care ultrasonography (POC US) performance. POC US is increasingly used by clinicians and is an essential part of the management of acute surgical conditions. However, the quality of performance is highly operator-dependent. Therefore, reliable and valid assessment of trainees' ultrasonography competence is needed to ensure patient safety. Twenty-four physicians, representing novices, intermediates, and experts in POC US, scanned 4 different surgical patient cases in a controlled set-up. All ultrasound examinations were video-recorded and assessed by 2 blinded radiologists using OSAUS. Reliability was examined using generalizability theory. Construct validity was examined by comparing performance scores between the groups and by correlating physicians' OSAUS scores with diagnostic accuracy. The generalizability coefficient was high (0.81) and a D-study demonstrated that 1 assessor and 5 cases would result in similar reliability. The construct validity of the OSAUS scale was supported by a significant difference in the mean scores between the novice group (17.0; SD 8.4) and the intermediate group (30.0; SD 10.1), P = 0.007, as well as between the intermediate group and the expert group (72.9; SD 4.4), P = 0.04, and by a high correlation between OSAUS scores and diagnostic accuracy (Spearman ρ correlation coefficient = 0.76; P < 0.001). This study demonstrates high reliability as well as evidence of construct validity of the OSAUS scale for assessment of POC US competence. Hence, the OSAUS scale may be suitable for both in-training as well as end-of-training assessment.
Gorlin, Eugenia I; Dalrymple, Kristy; Chelminski, Iwona; Zimmerman, Mark
2016-08-30
Despite growing recognition that the symptoms and functional impairments of Attention Deficit/Hyperactivity Disorder (ADHD) persist into adulthood, only a few psychometrically sound diagnostic measures have been developed for the assessment of ADHD in adults, and none have been validated for use in a broad treatment-seeking psychiatric sample. The current study presents the reliability and validity of a semi-structured DSM-based diagnostic interview module for ADHD, which was administered to 1194 adults presenting to an outpatient psychiatric practice. The module showed excellent internal consistency and interrater reliability, good convergent and discriminant validity (as indexed by relatively high correlations with self-report measures of ADHD and ADHD-related constructs and little or no correlation with other, non-ADHD symptom domains), and good construct validity (as indexed by significantly higher rates of psychosocial impairment and self-reported family history of ADHD in individuals who meet criteria for an ADHD diagnosis). This instrument is thus a reliable and valid diagnostic tool for the detection of ADHD in adults presenting for psychiatric evaluation and treatment. Published by Elsevier Ireland Ltd.
Grant, Jon E; Kim, Suck Won; McCabe, James S
2006-06-01
Kleptomania presents difficulties in diagnosis for clinicians. This study aimed to develop and test a DSM-IV-based diagnostic instrument for kleptomania. To assess for current kleptomania the Structured Clinical Interview for Kleptomania (SCI-K) was administered to 112 consecutive subjects requesting psychiatric outpatient treatment for a variety of disorders. Reliability and validity were determined. Classification accuracy was examined using the longitudinal course of illness. The SCI-K demonstrated excellent test-retest (Phi coefficient = 0.956 (95% CI = 0.937, 0.970)) and inter-rater reliability (phi coefficient = 0.718 (95% CI = 0.506, 0.848)) in the diagnosis of kleptomania. Concurrent validity was observed with a self-report measure using DSM-IV kleptomania criteria (phi coefficient = 0.769 (95% CI = 0.653, 0.850)). Discriminant validity was observed with a measure of depression (point biserial coefficient = -0.020 (95% CI = -0.205, 0.166)). The SCI-K demonstrated both high sensitivity and specificity based on longitudinal assessment. The SCI-K demonstrated excellent reliability and validity in diagnosing kleptomania in subjects presenting with various psychiatric problems. These findings require replication in larger groups, including non-psychiatric populations, to examine their generalizability. Copyright (c) 2006 John Wiley & Sons, Ltd.
Olt, Helen; Jirwe, Maria; Gustavsson, Petter; Emami, Azita
2010-01-01
The purpose of this study was to describe the translation, adaption, and psychometric evaluation process in relation to validity and reliability of the Swedish version of the instrument, Inventory for Assessing The Process of Cultural Competence Among Healthcare Professionals-Revised (IAPCC-R) following the translation, adaptation, and psychometric evaluation process. Validity tests were conducted on the response processes (N = 15), the content (N = 7), and the internal structure of the instrument (N = 334). Reliability (alpha = .65 for the total scale varying between -.01 and .65 for the different subscales) was evaluated in terms of internal consistency. Results indicated weak validity and reliability though it is difficult to conclude whether this is related to adaptation issues or the original construction.The testing of the response process identified problems in relation to respondents' conceptualization of cultural competence. The test of the content identified a weak correspondence between the items and the underlying model. In addition, a confirmatory factor analysis did not confirm the proposed structure of the instrument. This study concludes that this instrument is not valid and reliable for use with a Swedish population of practicing nurses or nursing students.
Carvalho, Teresa; Cunha, Marina; Pinto-Gouveia, José; Duarte, Joana
2015-03-30
The PTSD Checklist-Military Version (PCL-M) is a brief self-report instrument widely used to assess Post-traumatic Stress Disorder (PTSD) symptomatology in war Veterans, according to DSM-IV. This study sought out to explore the factor structure and reliability of the Portuguese version of the PCL-M. A sample of 660 Portuguese Colonial War Veterans completed the PCL-M. Several Confirmatory Factor Analyses were conducted to test different structures for PCL-M PTSD symptoms. Although the respecified first-order four-factor model based on King et al.'s model showed the best fit to the data, the respecified first and second-order models based on the DSM-IV symptom clusters also presented an acceptable fit. In addition, the PCL-M showed adequate reliability. The Portuguese version of the PCL-M is thus a valid and reliable measure to assess the severity of PTSD symptoms as described in DSM-IV. Its use with Portuguese Colonial War Veterans may ease screening of possible PTSD cases, promote more suitable treatment planning, and enable monitoring of therapeutic outcomes. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Xu, Jun; Kong, Fan
2018-05-01
Extreme value distribution (EVD) evaluation is a critical topic in reliability analysis of nonlinear structural dynamic systems. In this paper, a new method is proposed to obtain the EVD. The maximum entropy method (MEM) with fractional moments as constraints is employed to derive the entire range of EVD. Then, an adaptive cubature formula is proposed for fractional moments assessment involved in MEM, which is closely related to the efficiency and accuracy for reliability analysis. Three point sets, which include a total of 2d2 + 1 integration points in the dimension d, are generated in the proposed formula. In this regard, the efficiency of the proposed formula is ensured. Besides, a "free" parameter is introduced, which makes the proposed formula adaptive with the dimension. The "free" parameter is determined by arranging one point set adjacent to the boundary of the hyper-sphere which contains the bulk of total probability. In this regard, the tail distribution may be better reproduced and the fractional moments could be evaluated with accuracy. Finally, the proposed method is applied to a ten-storey shear frame structure under seismic excitations, which exhibits strong nonlinearity. The numerical results demonstrate the efficacy of the proposed method.
The Work-Health-Check (WHC): a brief new tool for assessing psychosocial stress in the workplace.
Gadinger, M C; Schilling, O; Litaker, D; Fischer, J E
2012-01-01
Brief, psychometrically robust questionnaires assessing work-related psychosocial stressors are lacking. The purpose of the study is to evaluate the psychometric properties of a brief new questionnaire for assessing sources of work-related psychosocial stress. Managers, blue- and white-collar workers (n= 628 at measurement point one, n=459 at measurement point two), sampled from an online panel of a German marketing research institute. We either developed or identified appropriate items from existing questionnaires for ten scales, which are conceptually based in work stress models and reflected either work-related demands or resources. Factorial structure was evaluated by confirmatory factor analyses (CFA). Scale reliability was assessed by Cronbach's Alpha, and test-retest; correlations with work-related efforts demonstrated convergent and discriminant validity for the demand and resource scales, respectively. Scale correlations with health indicators tested criterion validity. All scales had satisfactory reliability (Cronbach's Alpha: 0.74-0.93, retest reliabilities: 0.66-0.81). CFA supported the anticipated factorial structure. Significant correlations between job-related efforts and demand scales (mean r=0.44) and non-significant correlations with the resource scales (mean r=0.07) suggested good convergent and discriminant validity, respectively. Scale correlations with health indicators demonstrated good criterion validity. The WHC appears to be a brief, psychometrically robust instrument for assessing work-related psychosocial stressors.
Management of the aging of critical safety-related concrete structures in light-water reactor plants
DOE Office of Scientific and Technical Information (OSTI.GOV)
Naus, D.J.; Oland, C.B.; Arndt, E.G.
1990-01-01
The Structural Aging Program has the overall objective of providing the USNRC with an improved basis for evaluating nuclear power plant safety-related structures for continued service. The program consists of a management task and three technical tasks: materials property data base, structural component assessment/repair technology, and quantitative methodology for continued-service determinations. Objectives, accomplishments, and planned activities under each of these tasks are presented. Major program accomplishments include development of a materials property data base for structural materials as well as an aging assessment methodology for concrete structures in nuclear power plants. Furthermore, a review and assessment of inservice inspection techniquesmore » for concrete materials and structures has been complete, and work on development of a methodology which can be used for performing current as well as reliability-based future condition assessment of concrete structures is well under way. 43 refs., 3 tabs.« less
Fatigue Reliability of Gas Turbine Engine Structures
NASA Technical Reports Server (NTRS)
Cruse, Thomas A.; Mahadevan, Sankaran; Tryon, Robert G.
1997-01-01
The results of an investigation are described for fatigue reliability in engine structures. The description consists of two parts. Part 1 is for method development. Part 2 is a specific case study. In Part 1, the essential concepts and practical approaches to damage tolerance design in the gas turbine industry are summarized. These have evolved over the years in response to flight safety certification requirements. The effect of Non-Destructive Evaluation (NDE) methods on these methods is also reviewed. Assessment methods based on probabilistic fracture mechanics, with regard to both crack initiation and crack growth, are outlined. Limit state modeling techniques from structural reliability theory are shown to be appropriate for application to this problem, for both individual failure mode and system-level assessment. In Part 2, the results of a case study for the high pressure turbine of a turboprop engine are described. The response surface approach is used to construct a fatigue performance function. This performance function is used with the First Order Reliability Method (FORM) to determine the probability of failure and the sensitivity of the fatigue life to the engine parameters for the first stage disk rim of the two stage turbine. A hybrid combination of regression and Monte Carlo simulation is to use incorporate time dependent random variables. System reliability is used to determine the system probability of failure, and the sensitivity of the system fatigue life to the engine parameters of the high pressure turbine. 'ne variation in the primary hot gas and secondary cooling air, the uncertainty of the complex mission loading, and the scatter in the material data are considered.
Dreessen, L; Arntz, A
1998-01-01
The short-interval test-retest interrater reliability of the Structured Clinical Interview for DSM-III-R personality disorders (SCID-II) was studied in a psychotherapy outpatient group whose main complaint was mostly an Axis I anxiety disorder. Using a test-retest approach to assess interrater reliability, three sources of variance were taken into account (rater variance in the elicitation and interpretation of information and patient variance across interviews). Base rate requirements were established before calculating reliability coefficients. On the whole, interrater agreement on the SCID-II was found to be satisfactory, except for the histrionic personality traits. This is the first study that has estimated short-interval test-retest interrater reliability of the SCID-II in outpatients, and also the first that has studied single SCID-II traits and dimensional diagnoses. The results found support the use of the SCID-II as a diagnostic instrument for clinical and research purposes.
Choosing a reliability inspection plan for interval censored data
Lu, Lu; Anderson-Cook, Christine Michaela
2017-04-19
Reliability test plans are important for producing precise and accurate assessment of reliability characteristics. This paper explores different strategies for choosing between possible inspection plans for interval censored data given a fixed testing timeframe and budget. A new general cost structure is proposed for guiding precise quantification of total cost in inspection test plan. Multiple summaries of reliability are considered and compared as the criteria for choosing the best plans using an easily adapted method. Different cost structures and representative true underlying reliability curves demonstrate how to assess different strategies given the logistical constraints and nature of the problem. Resultsmore » show several general patterns exist across a wide variety of scenarios. Given the fixed total cost, plans that inspect more units with less frequency based on equally spaced time points are favored due to the ease of implementation and consistent good performance across a large number of case study scenarios. Plans with inspection times chosen based on equally spaced probabilities offer improved reliability estimates for the shape of the distribution, mean lifetime, and failure time for a small fraction of population only for applications with high infant mortality rates. The paper uses a Monte Carlo simulation based approach in addition to the common evaluation based on the asymptotic variance and offers comparison and recommendation for different applications with different objectives. Additionally, the paper outlines a variety of different reliability metrics to use as criteria for optimization, presents a general method for evaluating different alternatives, as well as provides case study results for different common scenarios.« less
Choosing a reliability inspection plan for interval censored data
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lu, Lu; Anderson-Cook, Christine Michaela
Reliability test plans are important for producing precise and accurate assessment of reliability characteristics. This paper explores different strategies for choosing between possible inspection plans for interval censored data given a fixed testing timeframe and budget. A new general cost structure is proposed for guiding precise quantification of total cost in inspection test plan. Multiple summaries of reliability are considered and compared as the criteria for choosing the best plans using an easily adapted method. Different cost structures and representative true underlying reliability curves demonstrate how to assess different strategies given the logistical constraints and nature of the problem. Resultsmore » show several general patterns exist across a wide variety of scenarios. Given the fixed total cost, plans that inspect more units with less frequency based on equally spaced time points are favored due to the ease of implementation and consistent good performance across a large number of case study scenarios. Plans with inspection times chosen based on equally spaced probabilities offer improved reliability estimates for the shape of the distribution, mean lifetime, and failure time for a small fraction of population only for applications with high infant mortality rates. The paper uses a Monte Carlo simulation based approach in addition to the common evaluation based on the asymptotic variance and offers comparison and recommendation for different applications with different objectives. Additionally, the paper outlines a variety of different reliability metrics to use as criteria for optimization, presents a general method for evaluating different alternatives, as well as provides case study results for different common scenarios.« less
Wood, Lisa; Burke, Eilish; Byrne, Rory; Enache, Gabriela; Morrison, Anthony P
2016-10-01
Stigma is a significant difficulty for people who experience psychosis. To date, there have been no outcome measures developed to examine stigma exclusively in people with psychosis. The aim of this study was develop and validate a semi-structured interview measure of stigma (SIMS) in psychosis. The SIMS is an eleven item measure of stigma developed in consultation with service users who have experienced psychosis. 79 participants with experience of psychosis were recruited for the purposes of this study. They were administered the SIMS alongside a battery of other relevant outcome measures to examine reliability and validity. A one-factor solution was identified for the SIMS which encompassed all ten rateable items. The measure met all reliability and validity criteria and illustrated good internal consistency, inter-rater reliability, test retest reliability, criterion validity, construct validity, sensitivity to change and had no floor or ceiling effects. The SIMS is a reliable and valid measure of stigma in psychosis. It may be more engaging and acceptable than other stigma measures due to its semi-structured interview format. Crown Copyright © 2016. Published by Elsevier B.V. All rights reserved.
Inter-observer reliability of DSM-5 substance use disorders.
Denis, Cécile M; Gelernter, Joel; Hart, Amy B; Kranzler, Henry R
2015-08-01
Although studies have examined the impact of changes made in DSM-5 on the estimated prevalence of substance use disorder (SUD) diagnoses, there is limited evidence concerning the reliability of DSM-5 SUDs. We evaluated the inter-observer reliability of four DSM-5 SUDs in a sample in which we had previously evaluated the reliability of DSM-IV diagnoses, allowing us to compare the two systems. Two different interviewers each assessed 173 subjects over a 2-week period using the Semi-Structured Assessment for Drug Dependence and Alcoholism (SSADDA). Using the percent agreement and kappa (κ) coefficient, we examined the reliability of DSM-5 lifetime alcohol, opioid, cocaine, and cannabis use disorders, which we compared to that of SSADDA-derived DSM-IV SUD diagnoses. We also assessed the effect of additional lifetime SUD and lifetime mood or anxiety disorder diagnoses on the reliability of the DSM-5 SUD diagnoses. Reliability was good to excellent for the four disorders, with κ values ranging from 0.65 to 0.94. Agreement was consistently lower for SUDs of mild severity than for moderate or severe disorders. DSM-5 SUD diagnoses showed greater reliability than DSM-IV diagnoses of abuse or dependence or dependence only. Co-occurring SUD and lifetime mood or anxiety disorders exerted a modest effect on the reliability of the DSM-5 SUD diagnoses. For alcohol, opioid, cocaine and cannabis use disorders, DSM-5 criteria and diagnoses are at least as reliable as those of DSM-IV. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Jackson, Howard F; Tunstall, Victoria; Hague, Gemma; Daniels, Leanne; Crompton, Stacey; Taplin, Kimberly
2014-01-01
Jackson et al. (this edition) argue that structure is an important component in reducing the handicaps caused by cognitive impairments following acquired brain injury and that post-acute neuropsychological brain injury rehabilitation programmes should not only endeavour to provide structure but also aim to develop self-structuring. However, at present there is no standardized device for assessing self-structuring. To provide preliminary analysis of the psychometric properties of the Behavioural Assessment of Self-Structuring (BASS) staff rating scale (a 26 item informant five point rating scale based on the degree of support client requires to achieve self-structuring item). BASS data was utilised for clients attending residential rehabilitation. Reliability (inter-rarer and intra-rater), validity (construct, concurrent and discriminate) and sensitivity to change were investigated. Initial results indicate that the BASS has reasonably good reliability, good construct validity (via principal components analysis), good discriminant validity, and good concurrent validity correlating well with a number of other outcome measures (HoNOS; NPDS, Supervision Rating Scale, MPAI, FIM and FAM). The BASS did not correlate well with the NPCNA. Finally, the BASS was shown to demonstrate sensitivity to change. Although some caution is required in drawing firm conclusions at the present time and further exploration of the psychometric properties of the BASS is required, initial results are encouraging for the use of the BASS in assessing rehabilitation progress. These findings are discussed in terms of the value of the concept of self-structuring to the rehabilitation process for individuals with neuropsychological impairments consequent on acquired brain injury.
First Order Reliability Application and Verification Methods for Semistatic Structures
NASA Technical Reports Server (NTRS)
Verderaime, Vincent
1994-01-01
Escalating risks of aerostructures stimulated by increasing size, complexity, and cost should no longer be ignored by conventional deterministic safety design methods. The deterministic pass-fail concept is incompatible with probability and risk assessments, its stress audits are shown to be arbitrary and incomplete, and it compromises high strength materials performance. A reliability method is proposed which combines first order reliability principles with deterministic design variables and conventional test technique to surmount current deterministic stress design and audit deficiencies. Accumulative and propagation design uncertainty errors are defined and appropriately implemented into the classical safety index expression. The application is reduced to solving for a factor that satisfies the specified reliability and compensates for uncertainty errors, and then using this factor as, and instead of, the conventional safety factor in stress analyses. The resulting method is consistent with current analytical skills and verification practices, the culture of most designers, and with the pace of semistatic structural designs.
[Reliability and construct validity of the OPD-CA axes structure and prerequisites for treatment].
Weitkamp, Katharina; Wiegand-Grefe, Silke; Romer, Georg
2013-01-01
As an instrument to assess specific psychodynamic dimensions, the Operationalized Psychodynamic Diagnostics in Childhood and Adolescence (OPD-CA) is widely used in clinical care and psychotherapeutic training. However, the psychometric validation of its axes is partly still missing. The aim of this study was to test the reliability and construct validity of the axes structure and prerequisites of treatment. 171 children and adolescents (aged 4 to 21 years) with a diagnosed psychiatric disorder who began an analytic psychotherapy were additionally assessed with the OPD-CA by their therapists (n = 25) in the context of naturalistic care in private practice. Therapists were all qualified as analytic child and adolescent psychotherapists and underwent a standardized OPD-CA training. Results indicated conceptually meaningful factor structures for both axes tested. These factor structures predominantly followed the conceptually defined dimensions. Internal consistency was high for the axis structure, modest to low fort he axis prerequisites of treatment. Implications and recommendations for a future revision of the OPD-CA with particular respect of single items and their operationalization are discussed.
Probabilistic structural mechanics research for parallel processing computers
NASA Technical Reports Server (NTRS)
Sues, Robert H.; Chen, Heh-Chyun; Twisdale, Lawrence A.; Martin, William R.
1991-01-01
Aerospace structures and spacecraft are a complex assemblage of structural components that are subjected to a variety of complex, cyclic, and transient loading conditions. Significant modeling uncertainties are present in these structures, in addition to the inherent randomness of material properties and loads. To properly account for these uncertainties in evaluating and assessing the reliability of these components and structures, probabilistic structural mechanics (PSM) procedures must be used. Much research has focused on basic theory development and the development of approximate analytic solution methods in random vibrations and structural reliability. Practical application of PSM methods was hampered by their computationally intense nature. Solution of PSM problems requires repeated analyses of structures that are often large, and exhibit nonlinear and/or dynamic response behavior. These methods are all inherently parallel and ideally suited to implementation on parallel processing computers. New hardware architectures and innovative control software and solution methodologies are needed to make solution of large scale PSM problems practical.
Probabilistic confidence for decisions based on uncertain reliability estimates
NASA Astrophysics Data System (ADS)
Reid, Stuart G.
2013-05-01
Reliability assessments are commonly carried out to provide a rational basis for risk-informed decisions concerning the design or maintenance of engineering systems and structures. However, calculated reliabilities and associated probabilities of failure often have significant uncertainties associated with the possible estimation errors relative to the 'true' failure probabilities. For uncertain probabilities of failure, a measure of 'probabilistic confidence' has been proposed to reflect the concern that uncertainty about the true probability of failure could result in a system or structure that is unsafe and could subsequently fail. The paper describes how the concept of probabilistic confidence can be applied to evaluate and appropriately limit the probabilities of failure attributable to particular uncertainties such as design errors that may critically affect the dependability of risk-acceptance decisions. This approach is illustrated with regard to the dependability of structural design processes based on prototype testing with uncertainties attributable to sampling variability.
Balaguier, Romain; Madeleine, Pascal; Vuillerme, Nicolas
2016-01-01
The assessment of pressure pain threshold (PPT) provides a quantitative value related to the mechanical sensitivity to pain of deep structures. Although excellent reliability of PPT has been reported in numerous anatomical locations, its absolute and relative reliability in the lower back region remains to be determined. Because of the high prevalence of low back pain in the general population and because low back pain is one of the leading causes of disability in industrialized countries, assessing pressure pain thresholds over the low back is particularly of interest. The purpose of this study study was (1) to evaluate the intra- and inter- absolute and relative reliability of PPT within 14 locations covering the low back region of asymptomatic individuals and (2) to determine the number of trial required to ensure reliable PPT measurements. Fifteen asymptomatic subjects were included in this study. PPTs were assessed among 14 anatomical locations in the low back region over two sessions separated by one hour interval. For the two sessions, three PPT assessments were performed on each location. Reliability was assessed computing intraclass correlation coefficients (ICC), standard error of measurement (SEM) and minimum detectable change (MDC) for all possible combinations between trials and sessions. Bland-Altman plots were also generated to assess potential bias in the dataset. Relative reliability for both intra- and inter- session was almost perfect with ICC ranged from 0.85 to 0.99. With respect to the intra-session, no statistical difference was reported for ICCs and SEM regardless of the conducted comparisons between trials. Conversely, for inter-session, ICCs and SEM values were significantly larger when two consecutive PPT measurements were used for data analysis. No significant difference was observed for the comparison between two consecutive measurements and three measurements. Excellent relative and absolute reliabilities were reported for both intra- and inter-session. Reliable measurements can be equally achieved when using the mean of two or three consecutive PPT measurements, as usually proposed in the literature, or with only the first one. Although reliability was almost perfect regardless of the conducted comparison between PPT assessments, our results suggest using two consecutive measurements to obtain higher short term absolute reliability.
Greeven, Anja; Spinhoven, Philip; van Balkom, Anton J L M
2009-01-01
This study investigated the psychometric properties of the first clinician-administered semi-structured interview for assessing the severity of hypochondriacal symptoms. The Hypochondriasis Yale-Brown Obsessive-Compulsive Scale (H-YBOCS) consisted of three a priori dimensions: hypochondriacal obsessions, compulsions and avoidance. The 16-item interview was conducted with 112 participants with Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, hypochondriasis. We analysed factor analytic structure, reliability, construct validity and sensitivity to change. Factor analysis supported a three-factor model similar to the a priori dimensions. Internal consistency ranged from satisfactory to good. Inter-rater reliability was excellent. The construct validity was low to moderate. The H-YBOCS was sensitive for measuring changes in symptom severity. The H-YBOCS is a (factorially) valid and coherent interview with a high level of agreement across different raters. The relatively low discriminant validity could be due to co-morbid anxiety and depressive disorders. Overall, the H-YBOCS seems to be a promising contribution to the assessment of hypochondriasis. *The hypochondriasis Y-BOCS is a feasible clinician rated interview to assess the severity of hypochondriacal complaints.
Cheong, Sau Kuan; Lang, Cathryne P; Hemphill, Sheryl A; Johnston, Leanne M
2017-06-01
To evaluate the preliminary validity and reliability of the myTREEHOUSE Self-Concept Assessment for children with cerebral palsy (CP) aged 8 to 12 years. The myTREEHOUSE Self-Concept Assessment includes 26 items divided into eight domains, assessed across three Performance Perspectives (Personal, Social, and Perceived) and an additional Importance Rating. Face and content validity was assessed by semi-structured interviews with seven expert professionals regarding the assessment construct, content, and clinical utility. Reliability was assessed with 50 children aged 8 to 12 years with CP (29 males, 21 females; mean age 10y 2mo; Gross Motor Function Classification System [GMFCS] level I=35, II=8, III=5, IV=1; mean Wechsler Intelligence Scale for Children - Fourth Edition [WISC-IV]=104), whose data was used to calculate internal consistency of the scale, and a subset of 35 children (20 males, 15 females; mean age 10y 5mo; GMFCS level I=26, II=4, III=4, IV=1; mean WISC-IV=103) who participated in test-retest reliability within 14 to 28 days. Face and content validity was supported by positive expert feedback, with only minor adjustments suggested to clarify the wording of some items. After these amendments, strong internal consistency (Cronbach's α 0.84-0.91) and moderate to good test-retest reliability (intraclass correlation coefficient 0.64-0.75) was found for each component. The myTREEHOUSE Self-Concept Assessment is a valid and reliable assessment of self-concept for children with CP aged 8 to 12 years. © 2017 Mac Keith Press.
A Measure of Burnout for Business Students
ERIC Educational Resources Information Center
Law, Daniel W.
2010-01-01
The author surveyed 163 business students representing all business majors from a major state university. Participants completed a questionnaire utilizing a modified version of the Maslach Burnout Inventory. The data were factor analyzed to assess its basic underlying structure, and each burnout component was assessed for reliability. Results…
Reliability and validity of the Incontinence Quiz-Turkish version.
Kara, Kerime C; Çıtak Karakaya, İlkim; Tunalı, Nur; Karakaya, Mehmet G
2018-01-01
The aim of this study was to investigate the reliability and validity of the Turkish version of the Incontinence Quiz, which was developed by Branch et al. (1994), to assess women's knowledge of and attitudes toward urinary incontinence. Comprehensibility of the Turkish version of the 14-item Incontinence Quiz, which was prepared following translation-back translation procedures, was tested on a pilot group of eight women, and its internal reliability, test-retest reliability and construct validity were assessed in 150 women who attended the gynecology clinics of three hospitals in İçel, Turkey. Physical and sociodemographic characteristics and presence of incontinence complaints were also recorded. Data were analyzed at the 0.05 alpha level, using SPSS version 22. The scale had good reliability and validity. The internal reliability coefficient (Cronbach α) was 0.80, test-retest correlation coefficients were 0.83-0.94; and with regard to construct validity, Kaiser-Meyer-Olkin coefficient was 0.76 and Barlett sphericity test was 562.777 (P = 0.000). Turkish version of the Incontinence Quiz had a four-factor structure, with Eigenvalues ranging from 1.17 to 4.08. The Incontinence Quiz-Turkish version is a highly comprehensible, reliable and valid scale, which may be used to assess Turkish-speaking women's knowledge of and attitudes toward urinary incontinence. © 2017 Japan Society of Obstetrics and Gynecology.
Reliability and validity of the adapted Resistance Training Skills Battery for Children.
Furzer, Bonnie J; Bebich-Philip, Marc D; Wright, Kemi E; Reid, Siobhan L; Thornton, Ashleigh L
2017-12-29
Resistance training (RT) is emerging as a training modality to improve motor function and facilitate physical activity participation in children across the motor proficiency spectrum. Although RT competency assessments have been established and validated among adolescent cohorts, the extent to which these methods are suitable for assessing children's RT skills is unknown. This project aimed to assess the psychometric properties of the adapted Resistance Training Skills Battery for Children (RTSBc), in children with varying motor proficiency. Repeated measures design with 40 participants (M age=8.2±1.7years) displaying varying levels of motor proficiency. Participants performed the adapted RTSBc on two occasions, receiving a score for their execution of each component, in addition to an overall RT skill quotient child (RTSQc). Cronbach's alpha, intra-class correlation (ICC), Bland-Altman analysis, and typical error were used to assess test-retest reliability. To examine construct validity, exploratory factor analysis was performed alongside computing correlations between participants' muscle strength, motor proficiency, age, lean muscle mass, and RTSQc. The RTSBc displayed an acceptable level of internal consistency (alpha=0.86) and test-retest reliability (ICC range=0.86-0.99). Exploratory factor analysis supported internal test structure, with all six RT skills loading strongly on a single factor (range 0.56-0.89). Analyses of structural validity revealed positive correlations for RTSQc in relation to motor proficiency (r=0.52, p<0.001) and strength scores (r=0.61, p<0.001). Analyses revealed support for the construct validity and test-retest reliability of the RTSBc, providing preliminary evidence that the RTSBc is appropriate for use in the assessment of children's RT competency. Copyright © 2018 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Hendrie, H. C.; Lane, K. A.; Ogunniyi, A.; Baiyewu, O.; Gureje, O.; Evans, R.; Smith-Gamble, V.; Pettaway, M.; Unverzagt, F. W.; Gao, S.; Hall, K. S.
2010-01-01
Background Assessing function is a crucial element in the diagnosis of dementia. This information is usually obtained from key informants. However, reliable informants are not always available. Methods A 10-item semi-structured home interview (the CHIF, or Clinician Home-based Interview to assess Function) to assess function primarily by measuring instrumental activities of daily living directly was developed and tested for inter-rater reliability and validity as part of the Indianapolis–Ibadan dementia project. The primary validity measurements were correlations between scores on the CHIF and independently gathered scores on the Blessed Dementia Scale (from informants) and the Mini-mental State Examination (MMSE). Sensitivities and specificities of scores on the CHIF and receiver operator characteristic (ROC) curves were constructed with dementia as the dependent variable. Results Inter-rater reliability for the CHIF was high (Pearson’s correlation coefficient 0.99 in Indianapolis and 0.87 in Ibadan). Internal consistency, in both samples, was good (Cronbach’s α 0.95 in Indianapolis and 0.83 in Ibadan). Scores on the CHIF correlated well with the Blessed Dementia scores at both sites (−0.71, p < 0.0001 for Indianapolis and −0.56, p < 0.0001 for Ibadan) and with the MMSE (0.75, p < 0.0001 for Indianapolis and 0.44, p < 0.0001 for Ibadan). For all items at both sites, the subjects without dementia performed significantly better than those with dementia. The area under the ROC curve for dementia diagnosis was 0.965 for Indianapolis and 0.925 for Ibadan. Conclusion The CHIF is a useful instrument to assess function directly in elderly participants in international studies, particularly in the absence of reliable informants. PMID:16640794
[Santa Claus is perceived as reliable and friendly: results of the Danish Christmas 2013 survey].
Amin, Faisal Mohammad; West, Anders Sode; Jørgensen, Carina Sleiborg; Simonsen, Sofie Amalie; Lindberg, Ulrich; Tranum-Jensen, Jørgen; Hougaard, Anders
2013-12-02
Several studies have indicated that the population in general perceives doctors as reliable. In the present study perceptions of reliability and kindness attributed to another socially significant archetype, Santa Claus, have been comparatively examined in relation to the doctor. In all, 52 randomly chosen participants were shown a film, where a narrator dressed either as Santa Claus or as a doctor tells an identical story. Structured interviews were then used to assess the subjects' perceptions of reliability and kindness in relation to the narrator's appearance. We found a strong inclination for Santa Claus being perceived as friendlier than the doctor (p = 0.053). However, there was no significant difference in the perception of reliability between Santa Claus and the doctor (p = 0.524). The positive associations attributed to Santa Claus probably cause that he is perceived friendlier than the doctor who may be associated with more serious and unpleasant memories of illness and suffering. Surprisingly, and despite him being an imaginary person, Santa Claus was assessed as being as reliable as the doctor.
ERIC Educational Resources Information Center
Siu, Andrew M. H.; Shek, Daniel T. L.
2005-01-01
This paper reports evidence on the factor structure, reliability, and validity of the Chinese Family Assessment Instrument (C-FAI), an instrument developed to assess family functioning in Chinese populations. A convenience sample of 1,462 adolescents from junior secondary schools completed the C-FAI and measures of parent-adolescent conflict.…
Decision-theoretic methodology for reliability and risk allocation in nuclear power plants
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cho, N.Z.; Papazoglou, I.A.; Bari, R.A.
1985-01-01
This paper describes a methodology for allocating reliability and risk to various reactor systems, subsystems, components, operations, and structures in a consistent manner, based on a set of global safety criteria which are not rigid. The problem is formulated as a multiattribute decision analysis paradigm; the multiobjective optimization, which is performed on a PRA model and reliability cost functions, serves as the guiding principle for reliability and risk allocation. The concept of noninferiority is used in the multiobjective optimization problem. Finding the noninferior solution set is the main theme of the current approach. The assessment of the decision maker's preferencesmore » could then be performed more easily on the noninferior solution set. Some results of the methodology applications to a nontrivial risk model are provided and several outstanding issues such as generic allocation and preference assessment are discussed.« less
Reliability tests and guidelines for B-mode ultrasound assessment of central adiposity.
Stoner, Lee; Chinn, Victoria; Cornwall, Jon; Meikle, Grant; Page, Rachel; Lambrick, Danielle; Faulkner, James
2015-11-01
Ultrasound represents a validated and relatively inexpensive diagnostic device for assessing central adiposity; however, widespread adoption has been impeded by the lack of reliable standard operating procedures. To examine the reliability of, and describe guidelines for, ultrasound-derived recording of intra-abdominal fat thickness (IAT) and maximal preperitoneal fat thickness (PFT). Ultrasound scans were obtained from 20 adults (50% female, 26 ± 7 years, 24·5 kg/m(2) ) on three different mornings. IAT was assessed 2 cm above the umbilicus (transverse plane) measuring from linea alba to: (i) anterior aorta, (ii) posterior aorta and (iii) anterior aspect of the vertebral column. PFT was measured from linea alba to visceral peritoneum in (i) sagittal and (ii) transverse planes, immediately over and inferior to the xiphi-sternum, respectively. For IAT, the criterion intraclass correlation coefficient (ICC) of 0·75 was exceeded for measurements to anterior aorta (0·95), posterior aorta (0·94) and vertebra (0·96). The reliability coefficient expressed as a percentage of the mean (RC%) was lowest (better) for measurement to vertebrae (9·8%). For PFT, mean thickness was comparable for sagittal (1·74 cm) and transverse (1·76 cm) planes; ICC values were also comparable for both planes (0·98 vs. 0·98, respectively), as were RC% (7·5% vs. 7·1%, respectively). IAT assessments to the vertebra were marginally more reliable than those to other structures. While PFT assessments were equally reliable for both measurements planes, precise probe placement was easier for the sagittal plane. Based on these findings, guidelines for the reliable measurement of central adiposity using ultrasound are presented. © 2015 Stichting European Society for Clinical Investigation Journal Foundation.
Dwyer, Tim; Takahashi, Susan Glover; Hynes, Melissa Kennedy; Herold, Jodi; Wasserstein, David; Nousiainen, Markku; Ferguson, Peter; Wadey, Veronica; Murnaghan, M. Lucas; Leroux, Tim; Semple, John; Hodges, Brian; Ogilvie-Harris, Darrell
2014-01-01
Background Assessing residents’ understanding and application of the 6 intrinsic CanMEDS roles (communicator, professional, manager, collaborator, health advocate, scholar) is challenging for postgraduate medical educators. We hypothesized that an objective structured clinical examination (OSCE) designed to assess multiple intrinsic CanMEDS roles would be sufficiently reliable and valid. Methods The OSCE comprised 6 10-minute stations, each testing 2 intrinsic roles using case-based scenarios (with or without the use of standardized patients). Residents were evaluated using 5-point scales and an overall performance rating at each station. Concurrent validity was sought by correlation with in-training evaluation reports (ITERs) from the last 12 months and an ordinal ranking created by program directors (PDs). Results Twenty-five residents from postgraduate years (PGY) 0, 3 and 5 participated. The interstation reliability for total test scores (percent) was 0.87, while reliability for each of the communicator, collaborator, manager and professional roles was greater than 0.8. Total test scores, individual station scores and individual CanMEDS role scores all showed a significant effect by PGY level. Analysis of the PD rankings of intrinsic roles demonstrated a high correlation with the OSCE role scores. A correlation was seen between ITER and OSCE for the communicator role, while the ITER medical expert and total scores highly correlated with the communicator, manager and professional OSCE scores. Conclusion An OSCE designed to assess the intrinsic CanMEDS roles was sufficiently valid and reliable for regular use in an orthopedic residency program. PMID:25078926
A Study of the Validity and Reliability of a Mathematics Lesson Attitude Scale and Student Attitudes
ERIC Educational Resources Information Center
Tezer, Murat; Ozcan, Deniz
2015-01-01
Attitudes of the students towards mathematics lessons are very important in terms of their success and motivation. The purpose of this study is to develop a scale for the assessment of primary school students' attitudes towards mathematics courses in the 2nd and 3rd grades, to analyse its validity-reliability structure and to determine the…
Assessing child and adolescent pragmatic language competencies: toward evidence-based assessments.
Russell, Robert L; Grizzle, Kenneth L
2008-06-01
Using language appropriately and effectively in social contexts requires pragmatic language competencies (PLCs). Increasingly, deficits in PLCs are linked to child and adolescent disorders, including autism spectrum, externalizing, and internalizing disorders. As the role of PLCs expands in diagnosis and treatment of developmental psychopathology, psychologists and educators will need to appraise and select clinical and research PLC instruments for use in assessments and/or studies. To assist in this appraisal, 24 PLC instruments, containing 1,082 items, are assessed by addressing four questions: (1) Can PLC domains targeted by assessment items be reliably identified?, (2) What are the core PLC domains that emerge across the 24 instruments?, (3) Do PLC questionnaires and tests assess similar PLC domains?, and (4) Do the instruments achieve content, structural, diagnostic, and ecological validity? Results indicate that test and questionnaire items can be reliably categorized into PLC domains, that PLC domains featured in questionnaires and tests significantly differ, and that PLC instruments need empirical confirmation of their dimensional structure, content validity across all developmental age bands, and ecological validity. Progress in building a better evidence base for PLC assessments should be a priority in future research.
Structured assessment of microsurgery skills in the clinical setting.
Chan, WoanYi; Niranjan, Niri; Ramakrishnan, Venkat
2010-08-01
Microsurgery is an essential component in plastic surgery training. Competence has become an important issue in current surgical practice and training. The complexity of microsurgery requires detailed assessment and feedback on skills components. This article proposes a method of Structured Assessment of Microsurgery Skills (SAMS) in a clinical setting. Three types of assessment (i.e., modified Global Rating Score, errors list and summative rating) were incorporated to develop the SAMS method. Clinical anastomoses were recorded on videos using a digital microscope system and were rated by three consultants independently and in a blinded fashion. Fifteen clinical cases of microvascular anastomoses performed by trainees and a consultant microsurgeon were assessed using SAMS. The consultant had consistently the highest scores. Construct validity was also demonstrated by improvement of SAMS scores of microsurgery trainees. The overall inter-rater reliability was strong (alpha=0.78). The SAMS method provides both formative and summative assessment of microsurgery skills. It is demonstrated to be a valid, reliable and feasible assessment tool of operating room performance to provide systematic and comprehensive feedback as part of the learning cycle. Copyright 2009 British Association of Plastic, Reconstructive and Aesthetic Surgeons. Published by Elsevier Ltd. All rights reserved.
Moeini, Babak; Zamanian, Hadi; Taheri-Kharameh, Zahra; Ramezani, Tahereh; Saati-Asr, Mohamadhasan; Hajrahimian, Mohamadhasan; Amini-Tehrani, Mohammadali
2018-01-01
Spirituality plays an important role in coping with chronic diseases for patients and they often report unmet spiritual and existential needs, which should be considered for a holistic view of their health. Studying spiritual needs in this generation requires culturally appropriate and valid instruments. The aim of this study was to determine the psychometric properties, such as validity, reliability, and factor structure of the Persian version of Spiritual Needs Questionnaire (SpNQ). The aim of this study was to determine the psychometric properties, such as validity, reliability, and factor structure of the Persian version of Spiritual Needs Questionnaire (SpNQ). The "forward-backward" procedure was applied to translate the SpNQ from English into Persian. The SpNQ-Persian Version (SpNQ-PV) was checked in terms of validity and reliability with a convenience sample of 100 elders with chronic diseases who were recruited from the inpatient wards at two university hospitals in Qom, Iran. The validity was assessed using content, face, and construct validity. The Cronbach alpha and test-retest were used to assess the reliability of the questionnaire. The results of the exploratory factor analysis indicated a five-factor solution for the questionnaire, which included religious needs, existential needs, forgiveness/generativity needs, need for inner peace, and emotional needs. These accounted for 60.1% of the total observed variance. One item was removed (factor loading <0.4). Convergent validity was supported mostly by the pattern of association between SpNQ-PV and the Spiritual Well-being Scale. Cronbach alpha of the subscales ranged from 0.56 to 0.78 and the test-retest reliability ranged from 0.72 to 0.91, which indicated an acceptable range of reliability. The SpNQ-PV showed a minor difference in structuring and indicated good psychometric properties, which can be used to assess the spiritual needs of Iranian elders suffering from chronic diseases. Copyright © 2017 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.
The reasons for betel-quid chewing scale: assessment of factor structure, reliability, and validity
2014-01-01
Background Despite the fact that betel-quid is one of the most commonly used psychoactive substances worldwide and a major risk-factor for head-and-neck cancer incidence and mortality globally, currently no standardized instrument is available to assess the reasons why individuals chew betel-quid. A measure to assess reasons for chewing betel-quid could help researchers and clinicians develop prevention and treatment strategies. In the current study, we sought to develop and evaluate a self-report instrument for assessing the reasons for chewing betel quid which contributes toward the goal of developing effective interventions to reduce betel quid chewing in vulnerable populations. Methods The current study assessed the factor structure, reliability and convergent validity of the Reasons for Betel-quid Chewing Scale (RBCS), a newly developed 10 item measure adapted from several existing “reasons for smoking” scales. The measure was administered to 351 adult betel-quid chewers in Guam. Results Confirmatory factor analysis of this measure revealed a three factor structure: reinforcement, social/cultural, and stimulation. Further tests revealed strong support for the internal consistency and convergent validity of this three factor measure. Conclusion The goal of designing an intervention to reduce betel-quid chewing necessitates an understanding of why chewers chew; the current study makes considerable contributions towards that objective. PMID:24889863
The reasons for betel-quid chewing scale: assessment of factor structure, reliability, and validity.
Little, Melissa A; Pokhrel, Pallav; Murphy, Kelle L; Kawamoto, Crissy T; Suguitan, Gil S; Herzog, Thaddeus A
2014-06-03
Despite the fact that betel-quid is one of the most commonly used psychoactive substances worldwide and a major risk-factor for head-and-neck cancer incidence and mortality globally, currently no standardized instrument is available to assess the reasons why individuals chew betel-quid. A measure to assess reasons for chewing betel-quid could help researchers and clinicians develop prevention and treatment strategies. In the current study, we sought to develop and evaluate a self-report instrument for assessing the reasons for chewing betel quid which contributes toward the goal of developing effective interventions to reduce betel quid chewing in vulnerable populations. The current study assessed the factor structure, reliability and convergent validity of the Reasons for Betel-quid Chewing Scale (RBCS), a newly developed 10 item measure adapted from several existing "reasons for smoking" scales. The measure was administered to 351 adult betel-quid chewers in Guam. Confirmatory factor analysis of this measure revealed a three factor structure: reinforcement, social/cultural, and stimulation. Further tests revealed strong support for the internal consistency and convergent validity of this three factor measure. The goal of designing an intervention to reduce betel-quid chewing necessitates an understanding of why chewers chew; the current study makes considerable contributions towards that objective.
Commercialization of NESSUS: Status
NASA Technical Reports Server (NTRS)
Thacker, Ben H.; Millwater, Harry R.
1991-01-01
A plan was initiated in 1988 to commercialize the Numerical Evaluation of Stochastic Structures Under Stress (NESSUS) probabilistic structural analysis software. The goal of the on-going commercialization effort is to begin the transfer of Probabilistic Structural Analysis Method (PSAM) developed technology into industry and to develop additional funding resources in the general area of structural reliability. The commercialization effort is summarized. The SwRI NESSUS Software System is a general purpose probabilistic finite element computer program using state of the art methods for predicting stochastic structural response due to random loads, material properties, part geometry, and boundary conditions. NESSUS can be used to assess structural reliability, to compute probability of failure, to rank the input random variables by importance, and to provide a more cost effective design than traditional methods. The goal is to develop a general probabilistic structural analysis methodology to assist in the certification of critical components in the next generation Space Shuttle Main Engine.
NASA Technical Reports Server (NTRS)
Halford, Gary R.; Shah, Ashwin; Arya, Vinod K.; Krause, David L.; Bartolotta, Paul A.
2002-01-01
Deep-space missions require onboard electric power systems with reliable design lifetimes of up to 10 yr and beyond. A high-efficiency Stirling radioisotope power system is a likely candidate for future deep-space missions and Mars rover applications. To ensure ample durability, the structurally critical heater head of the Stirling power convertor has undergone extensive computational analyses of operating temperatures (up to 650 C), stresses, and creep resistance of the thin-walled Inconel 718 bill of material. Durability predictions are presented in terms of the probability of survival. A benchmark structural testing program has commenced to support the analyses. This report presents the current status of durability assessments.
Remelhe, Mafalda; Teixeira, Pedro M; Lopes, Irene; Silva, Luís; Correia de Sousa, Jaime
2017-01-12
Enabling patients with asthma to obtain the knowledge, confidence and skills they need in order to assume a major role in the management of their disease is cost effective. It should be an integral part of any plan for long-term control of asthma. The modified Patient Enablement Instrument (mPEI) is an easily administered questionnaire that was adapted in the United Kingdom to measure patient enablement in asthma, but its applicability in Portugal is not known. Validity and reliability of questionnaires should be tested before use in settings different from those of the original version. The purpose of this study was to test the applicability of the mPEI to Portuguese asthma patients after translation and cross-cultural adaptation, and to verify the structural validity, internal consistency and reproducibility of the instrument. The mPEI was translated to Portuguese and back translated to English. Its content validity was assessed by a debriefing interview with 10 asthma patients. The translated instrument was then administered to a random sample of 142 patients with persistent asthma. Structural validity and internal consistency were assessed. For reproducibility analysis, 86 patients completed the instrument again 7 days later. Item-scale correlations and exploratory factor analysis were used to assess structural validity. Cronbach's alpha was used to test internal consistency, and the intra-class correlation coefficient was used for the analysis of reproducibility. All items of the Portuguese version of the mPEI were found to be equivalent to the original English version. There were strong item-scale correlations that confirmed construct validity, with a one component structure and good internal consistency (Cronbach's alpha >0.8) as well as high test-retest reliability (ICC=0.85). The mPEI showed sound psychometric properties for the evaluation of enablement in patients with asthma making it a reliable instrument for use in research and clinical practice in Portugal. Further studies are needed to confirm its responsiveness.
Chahoud, M; Chahine, R; Salameh, P; Sauleau, E A
2017-06-01
Our goal is to validate and to verify the reliability of the French and English versions of the Insomnia Severity Index (ISI) in Lebanese adolescents. A cross-sectional study was implemented. 104 Lebanese students aged between 14 and 19 years participated in the study. The English version of the questionnaire was distributed to English-speaking students and the French version was administered to French-speaking students. A scale (1 to 7 with 1 = very well understood and 7 = not at all) was used to identify the level of the students' understanding of each instruction, question and answer of the ISI. The scale's structural validity was assessed. The factor structure of ISI was evaluated by principal component analysis. The internal consistency of this scale was evaluated by Cronbach's alpha. To assess test-retest reliability the intraclass correlation coefficient (ICC) was used. The principal component analysis confirmed the presence of a two-component factor structure in the English version and a three-component factor structure in the French version with eigenvalues > 1. The English version of the ISI had an excellent internal consistency (α = 0.90), while the French version had a good internal consistency (α = 0.70). The ICC presented an excellent agreement in the French version (ICC = 0.914, CI = 0.856-0.949) and a good agreement in the English one (ICC = 0.762, CI = 0.481-890). The Bland-Altman plots of the two versions of the ISI showed that the responses over two weeks' were comparable and very few outliers were detected. The results of our analyses reveal that both English and French versions of the ISI scale have good internal consistency and are reproducible and reliable. Therefore, it can be used to assess the prevalence of insomnia in Lebanese adolescents.
Assessing Reliability of Medical Record Reviews for the Detection of Hospital Adverse Events.
Ock, Minsu; Lee, Sang-il; Jo, Min-Woo; Lee, Jin Yong; Kim, Seon-Ha
2015-09-01
The purpose of this study was to assess the inter-rater reliability and intra-rater reliability of medical record review for the detection of hospital adverse events. We conducted two stages retrospective medical records review of a random sample of 96 patients from one acute-care general hospital. The first stage was an explicit patient record review by two nurses to detect the presence of 41 screening criteria (SC). The second stage was an implicit structured review by two physicians to identify the occurrence of adverse events from the positive cases on the SC. The inter-rater reliability of two nurses and that of two physicians were assessed. The intra-rater reliability was also evaluated by using test-retest method at approximately two weeks later. In 84.2% of the patient medical records, the nurses agreed as to the necessity for the second stage review (kappa, 0.68; 95% confidence interval [CI], 0.54 to 0.83). In 93.0% of the patient medical records screened by nurses, the physicians agreed about the absence or presence of adverse events (kappa, 0.71; 95% CI, 0.44 to 0.97). When assessing intra-rater reliability, the kappa indices of two nurses were 0.54 (95% CI, 0.31 to 0.77) and 0.67 (95% CI, 0.47 to 0.87), whereas those of two physicians were 0.87 (95% CI, 0.62 to 1.00) and 0.37 (95% CI, -0.16 to 0.89). In this study, the medical record review for detecting adverse events showed intermediate to good level of inter-rater and intra-rater reliability. Well organized training program for reviewers and clearly defining SC are required to get more reliable results in the hospital adverse event study.
Reliability and Validity of Assessing User Satisfaction With Web-Based Health Interventions
Lehr, Dirk; Reis, Dorota; Vis, Christiaan; Riper, Heleen; Berking, Matthias; Ebert, David Daniel
2016-01-01
Background The perspective of users should be taken into account in the evaluation of Web-based health interventions. Assessing the users’ satisfaction with the intervention they receive could enhance the evidence for the intervention effects. Thus, there is a need for valid and reliable measures to assess satisfaction with Web-based health interventions. Objective The objective of this study was to analyze the reliability, factorial structure, and construct validity of the Client Satisfaction Questionnaire adapted to Internet-based interventions (CSQ-I). Methods The psychometric quality of the CSQ-I was analyzed in user samples from 2 separate randomized controlled trials evaluating Web-based health interventions, one from a depression prevention intervention (sample 1, N=174) and the other from a stress management intervention (sample 2, N=111). At first, the underlying measurement model of the CSQ-I was analyzed to determine the internal consistency. The factorial structure of the scale and the measurement invariance across groups were tested by multigroup confirmatory factor analyses. Additionally, the construct validity of the scale was examined by comparing satisfaction scores with the primary clinical outcome. Results Multigroup confirmatory analyses on the scale yielded a one-factorial structure with a good fit (root-mean-square error of approximation =.09, comparative fit index =.96, standardized root-mean-square residual =.05) that showed partial strong invariance across the 2 samples. The scale showed very good reliability, indicated by McDonald omegas of .95 in sample 1 and .93 in sample 2. Significant correlations with change in depressive symptoms (r=−.35, P<.001) and perceived stress (r=−.48, P<.001) demonstrated the construct validity of the scale. Conclusions The proven internal consistency, factorial structure, and construct validity of the CSQ-I indicate a good overall psychometric quality of the measure to assess the user’s general satisfaction with Web-based interventions for depression and stress management. Multigroup analyses indicate its robustness across different samples. Thus, the CSQ-I seems to be a suitable measure to consider the user’s perspective in the overall evaluation of Web-based health interventions. PMID:27582341
Reliability and Validity of Assessing User Satisfaction With Web-Based Health Interventions.
Boß, Leif; Lehr, Dirk; Reis, Dorota; Vis, Christiaan; Riper, Heleen; Berking, Matthias; Ebert, David Daniel
2016-08-31
The perspective of users should be taken into account in the evaluation of Web-based health interventions. Assessing the users' satisfaction with the intervention they receive could enhance the evidence for the intervention effects. Thus, there is a need for valid and reliable measures to assess satisfaction with Web-based health interventions. The objective of this study was to analyze the reliability, factorial structure, and construct validity of the Client Satisfaction Questionnaire adapted to Internet-based interventions (CSQ-I). The psychometric quality of the CSQ-I was analyzed in user samples from 2 separate randomized controlled trials evaluating Web-based health interventions, one from a depression prevention intervention (sample 1, N=174) and the other from a stress management intervention (sample 2, N=111). At first, the underlying measurement model of the CSQ-I was analyzed to determine the internal consistency. The factorial structure of the scale and the measurement invariance across groups were tested by multigroup confirmatory factor analyses. Additionally, the construct validity of the scale was examined by comparing satisfaction scores with the primary clinical outcome. Multigroup confirmatory analyses on the scale yielded a one-factorial structure with a good fit (root-mean-square error of approximation =.09, comparative fit index =.96, standardized root-mean-square residual =.05) that showed partial strong invariance across the 2 samples. The scale showed very good reliability, indicated by McDonald omegas of .95 in sample 1 and .93 in sample 2. Significant correlations with change in depressive symptoms (r=-.35, P<.001) and perceived stress (r=-.48, P<.001) demonstrated the construct validity of the scale. The proven internal consistency, factorial structure, and construct validity of the CSQ-I indicate a good overall psychometric quality of the measure to assess the user's general satisfaction with Web-based interventions for depression and stress management. Multigroup analyses indicate its robustness across different samples. Thus, the CSQ-I seems to be a suitable measure to consider the user's perspective in the overall evaluation of Web-based health interventions.
Hussein, Ahmed A; Sexton, Kevin J; May, Paul R; Meng, Maxwell V; Hosseini, Abolfazl; Eun, Daniel D; Daneshmand, Siamak; Bochner, Bernard H; Peabody, James O; Abaza, Ronney; Skinner, Eila C; Hautmann, Richard E; Guru, Khurshid A
2018-04-13
We aimed to develop a structured scoring tool: cystectomy assessment and surgical evaluation (CASE) that objectively measures and quantifies performance during robot-assisted radical cystectomy (RARC) for men. A multinational 10-surgeon expert panel collaborated towards development and validation of CASE. The critical steps of RARC in men were deconstructed into nine key domains, each assessed by five anchors. Content validation was done utilizing the Delphi methodology. Each anchor was assessed in terms of context, score concordance, and clarity. The content validity index (CVI) was calculated for each aspect. A CVI ≥ 0.75 represented consensus, and this statement was removed from the next round. This process was repeated until consensus was achieved for all statements. CASE was used to assess de-identified videos of RARC to determine reliability and construct validity. Linearly weighted percent agreement was used to assess inter-rater reliability (IRR). A logit model for odds ratio (OR) was used to assess construct validation. The expert panel reached consensus on CASE after four rounds. The final eight domains of the CASE included: pelvic lymph node dissection, development of the peri-ureteral space, lateral pelvic space, anterior rectal space, control of the vascular pedicle, anterior vesical space, control of the dorsal venous complex, and apical dissection. IRR > 0.6 was achieved for all eight domains. Experts outperformed trainees across all domains. We developed and validated a reliable structured, procedure-specific tool for objective evaluation of surgical performance during RARC. CASE may help differentiate novice from expert performances.
Jordán, Carlos M; Díaz, Marta I; Comeche, María I; Ortega, José
2007-01-01
Background Internet psychology services are rapidly increasing and that implies online assessment. To guarantee the results of these new online evaluation procedures, it is necessary to have reliable and valid assessment tools. Objective In this work we analyzed the online versions of two popular psychopathology screening questionnaires: the General Health Questionnaire-28 (GHQ-28) and the Symptoms Check-List-90-Revised (SCL-90-R). Methods A total of 185 psychology students were recruited from two universities in Madrid, Spain. All of them had Internet access at home. A test-retest situation and factorial analysis were used to generate reliability and validity data. Both paper-and-pencil questionnaires (test) and their online versions (retest) were completed by 100 participants (median gap = 17 days). Results Results suggest that both online questionnaires were fairly equivalent to their paper-and-pencil versions, with higher reliability values for the SCL-90-R. Factorial analysis tended to reproduce the structure shown in former investigations of both questionnaires, replicating the four-factor structure of the GHQ-28 but failing to do so with the nine-factor structure of the SCL-90-R. Instead, a large unrotated factor appeared. Conclusions Further research should be carried out to confirm these data, but our work supports the online use of both assessment tools. The psychometric properties of the online version of GHQ-28 is similar to the paper-and-pencil and we can recommend its utilization in a Web environment. In contrast, SCL-90-R can only be recommended as a global index for psychological distress, using the Global Severity Index (GSI), not necessarily its subscales; and it should be considered that the online scores were lower than the ones with the paper-and-pencil version. PMID:17478411
Vallejo, Miguel A; Jordán, Carlos M; Díaz, Marta I; Comeche, María I; Ortega, José
2007-01-31
Internet psychology services are rapidly increasing and that implies online assessment. To guarantee the results of these new online evaluation procedures, it is necessary to have reliable and valid assessment tools. In this work we analyzed the online versions of two popular psychopathology screening questionnaires: the General Health Questionnaire-28 (GHQ-28) and the Symptoms Check-List-90-Revised (SCL-90-R). A total of 185 psychology students were recruited from two universities in Madrid, Spain. All of them had Internet access at home. A test-retest situation and factorial analysis were used to generate reliability and validity data. Both paper-and-pencil questionnaires (test) and their online versions (retest) were completed by 100 participants (median gap = 17 days). Results suggest that both online questionnaires were fairly equivalent to their paper-and-pencil versions, with higher reliability values for the SCL-90-R. Factorial analysis tended to reproduce the structure shown in former investigations of both questionnaires, replicating the four-factor structure of the GHQ-28 but failing to do so with the nine-factor structure of the SCL-90-R. Instead, a large unrotated factor appeared. Further research should be carried out to confirm these data, but our work supports the online use of both assessment tools. The psychometric properties of the online version of GHQ-28 is similar to the paper-and-pencil and we can recommend its utilization in a Web environment. In contrast, SCL-90-R can only be recommended as a global index for psychological distress, using the Global Severity Index (GSI), not necessarily its subscales; and it should be considered that the online scores were lower than the ones with the paper-and-pencil version.
Reliability and Validity of 3 Methods of Assessing Orthopedic Resident Skill in Shoulder Surgery.
Bernard, Johnathan A; Dattilo, Jonathan R; Srikumaran, Uma; Zikria, Bashir A; Jain, Amit; LaPorte, Dawn M
Traditional measures for evaluating resident surgical technical skills (e.g., case logs) assess operative volume but not level of surgical proficiency. Our goal was to compare the reliability and validity of 3 tools for measuring surgical skill among orthopedic residents when performing 3 open surgical approaches to the shoulder. A total of 23 residents at different stages of their surgical training were tested for technical skill pertaining to 3 shoulder surgical approaches using the following measures: Objective Structured Assessment of Technical Skills (OSATS) checklists, the Global Rating Scale (GRS), and a final pass/fail assessment determined by 3 upper extremity surgeons. Adverse events were recorded. The Cronbach α coefficient was used to assess reliability of the OSATS checklists and GRS scores. Interrater reliability was calculated with intraclass correlation coefficients. Correlations among OSATS checklist scores, GRS scores, and pass/fail assessment were calculated with Spearman ρ. Validity of OSATS checklists was determined using analysis of variance with postgraduate year (PGY) as a between-subjects factor. Significance was set at p < 0.05 for all tests. Criterion validity was shown between the OSATS checklists and GRS for the 3 open shoulder approaches. Checklist scores showed superior interrater reliability compared with GRS and subjective pass/fail measurements. GRS scores were positively correlated across training years. The incidence of adverse events was significantly higher among PGY-1 and PGY-2 residents compared with more experienced residents. OSATS checklists are a valid and reliable assessment of technical skills across 3 surgical shoulder approaches. However, checklist scores do not measure quality of technique. Documenting adverse events is necessary to assess quality of technique and ultimate pass/fail status. Multiple methods of assessing surgical skill should be considered when evaluating orthopedic resident surgical performance. Copyright © 2016 Association of Program Directors in Surgery. Published by Elsevier Inc. All rights reserved.
de Montbrun, Sandra L; Roberts, Patricia L; Lowry, Ann C; Ault, Glenn T; Burnstein, Marcus J; Cataldo, Peter A; Dozois, Eric J; Dunn, Gary D; Fleshman, James; Isenberg, Gerald A; Mahmoud, Najjia N; Reznick, Richard K; Satterthwaite, Lisa; Schoetz, David; Trudel, Judith L; Weiss, Eric G; Wexner, Steven D; MacRae, Helen
2013-12-01
To develop and evaluate an objective method of technical skills assessment for graduating subspecialists in colorectal (CR) surgery-the Colorectal Objective Structured Assessment of Technical Skill (COSATS). It may be reasonable for the public to assume that surgeons certified as competent have had their technical skills assessed. However, technical skill, despite being the hallmark of a surgeon, is not directly assessed at the time of certification by surgical boards. A procedure-based, multistation technical skills examination was developed to reflect a sample of the range of skills necessary for CR surgical practice. These consisted of bench, virtual reality, and cadaveric models. Reliability and construct validity were evaluated by comparing 10 graduating CR residents with 10 graduating general surgery (GS) residents from across North America. Expert CR surgeons, blinded to level of training, evaluated performance using a task-specific checklist and a global rating scale. The mean global rating score was used as the overall examination score and a passing score was set at "borderline competent for CR practice." The global rating scale demonstrated acceptable interstation reliability (0.69) for a homogeneous group of examinees. Both the overall checklist and global rating scores effectively discriminated between CR and GS residents (P < 0.01), with 27% of the variance attributed to level of training. Nine CR residents but only 3 GS residents were deemed competent. The Colorectal Objective Structured Assessment of Technical Skill effectively discriminated between CR and GS residents. With further validation, the Colorectal Objective Structured Assessment of Technical Skill could be incorporated into the colorectal board examination where it would be the first attempt of a surgical specialty to formally assess technical skill at the time of certification.
Psychometric Properties of the Canadian Nurse Informatics Competency Assessment Scale.
Kleib, Manal; Nagle, Lynn
2018-04-10
Assessment of nursing informatics competencies has gained momentum in the scholarly literature in response to the increased need for resources available to support informatics capacity in nursing. The purpose of this study was to examine the factor structure and internal consistency reliability of the Canadian Nurse Informatics Competency Assessment Scale, a newly developed 21-item measure based on published entry-to-practice informatics competencies for RNs. For this study, 2844 nurses completed the Canadian Nurse Informatics Competency Assessment Scale through a cross-sectional survey. Exploratory principal component analysis with oblique promax rotation revealed a four-component/factor structure for the 21-item Canadian Nurse Informatics Competency Assessment Scale, explaining 61.04% of the variance. Item loading per each component reflected the original Canadian Association of Schools of Nursing grouping of nursing informatics competency indicators, as per three key domains of competency: information and knowledge management (α = .85); professional and regulatory accountability (α = .81); and use of information and communication technology in the delivery of patient care (α = .87) with the exception of one item (Indicator 3), which loaded into the category of foundational information and communication technology skills (α = .81). This study provided preliminary evidence for the construct validity of the entry-to-practice competency domains and the factor structure and reliability of the Canadian Nurse Informatics Competency Assessment Scale among practicing nurses. Further testing among nurses in other settings and among nursing students is recommended.
Ceramic component reliability with the restructured NASA/CARES computer program
NASA Technical Reports Server (NTRS)
Powers, Lynn M.; Starlinger, Alois; Gyekenyesi, John P.
1992-01-01
The Ceramics Analysis and Reliability Evaluation of Structures (CARES) integrated design program on statistical fast fracture reliability and monolithic ceramic components is enhanced to include the use of a neutral data base, two-dimensional modeling, and variable problem size. The data base allows for the efficient transfer of element stresses, temperatures, and volumes/areas from the finite element output to the reliability analysis program. Elements are divided to insure a direct correspondence between the subelements and the Gaussian integration points. Two-dimensional modeling is accomplished by assessing the volume flaw reliability with shell elements. To demonstrate the improvements in the algorithm, example problems are selected from a round-robin conducted by WELFEP (WEakest Link failure probability prediction by Finite Element Postprocessors).
Pisarnturakit, Pagaporn P; Shaw, Bret R; Tanasukarn, Chanuantong; Vatanasomboon, Paranee
2012-09-01
Primary caregivers' child oral health care beliefs and practices are major factors in the prevention of Early Childhood Caries (ECC). This study assessed the validity and reliability of a newly-developed scale--the Early Childhood Caries Perceptions Scale (ECCPS)--used to measure beliefs regarding ECC preventive practices among primary caregivers of young children. The ECCPS was developed based on the Health Belief Model. The construct validity and reliability of the ECCPS were examined among 254 low-socioeconomic status primary caregivers with children under five years old, recruifed from 4 Bangkok Metropolitan Administration Health Centers and a kindergarten school. Exploratory factor analysis (EFA) revealed a four-factor structure. The four factors were labeled as Perceived Susceptibility, Perceived Severity, Perceived Benefits and Perceived Barriers. Internal consistency measured by the Cronbach's coefficient alpha for those four factors were 0.897, 0.971, 0.975 and 0.789, respectively. The ECCPS demonstrated satisfactory levels of reliability and validity for assessing the health beliefs related to ECC prevention among low-socioeconomic primary caregivers.
Evaluation Criteria for Micro-CAI: A Psychometric Approach
Wallace, Douglas; Slichter, Mark; Bolwell, Christine
1985-01-01
The increased use of microcomputer-based instructional programs has resulted in a greater need for third-party evaluation of the software. This in turn has prompted the development of micro-CAI evaluation tools. The present project sought to develop a prototype instrument to assess the impact of CAI program presentation characteristics on students. Data analysis and scale construction was conducted using standard item reliability analyses and factor analytic techniques. Adequate subscale reliabilities and factor structures were found, suggesting that a psychometric approach to CAI evaluation may possess some merit. Efforts to assess the utility of the resultant instrument are currently underway.
Reliability and safety, and the risk of construction damage in mining areas
NASA Astrophysics Data System (ADS)
Skrzypczak, Izabela; Kogut, Janusz P.; Kokoszka, Wanda; Oleniacz, Grzegorz
2018-04-01
This article concerns the reliability and safety of building structures in mining areas, with a particular emphasis on the quantitative risk analysis of buildings. The issues of threat assessment and risk estimation, in the design of facilities in mining exploitation areas, are presented here, indicating the difficulties and ambiguities associated with their quantification and quantitative analysis. This article presents the concept of quantitative risk assessment of the impact of mining exploitation, in accordance with ISO 13824 [1]. The risk analysis is illustrated through an example of a construction located within an area affected by mining exploitation.
Mitchell, John D; Amir, Rabia; Montealegre-Gallegos, Mario; Mahmood, Feroze; Shnider, Marc; Mashari, Azad; Yeh, Lu; Bose, Ruma; Wong, Vanessa; Hess, Philip; Amador, Yannis; Jeganathan, Jelliffe; Jones, Stephanie B; Matyal, Robina
2018-06-01
While standardized examinations and data from simulators and phantom models can assess knowledge and manual skills for ultrasound, an Objective Structured Clinical Examination (OSCE) could assess workflow understanding. We recruited 8 experts to develop an OSCE to assess workflow understanding in perioperative ultrasound. The experts used a binary grading system to score 19 graduating anesthesia residents at 6 stations. Overall average performance was 86.2%, and 3 stations had an acceptable internal reliability (Kuder-Richardson formula 20 coefficient >0.5). After refinement, this OSCE can be combined with standardized examinations and data from simulators and phantom models to assess proficiency in ultrasound.
Turkish Version of Kolcaba's Immobilization Comfort Questionnaire: A Validity and Reliability Study.
Tosun, Betül; Aslan, Özlem; Tunay, Servet; Akyüz, Aygül; Özkan, Hüseyin; Bek, Doğan; Açıksöz, Semra
2015-12-01
The purpose of this study was to determine the validity and reliability of the Turkish version of the Immobilization Comfort Questionnaire (ICQ). The sample used in this methodological study consisted of 121 patients undergoing lower extremity arthroscopy in a training and research hospital. The validity study of the questionnaire assessed language validity, structural validity and criterion validity. Structural validity was evaluated via exploratory factor analysis. Criterion validity was evaluated by assessing the correlation between the visual analog scale (VAS) scores (i.e., the comfort and pain VAS scores) and the ICQ scores using Spearman's correlation test. The Kaiser-Meyer-Olkin coefficient and Bartlett's test of sphericity were used to determine the suitability of the data for factor analysis. Internal consistency was evaluated to determine reliability. The data were analyzed with SPSS version 15.00 for Windows. Descriptive statistics were presented as frequencies, percentages, means and standard deviations. A p value ≤ .05 was considered statistically significant. A moderate positive correlation was found between the ICQ scores and the VAS comfort scores; a moderate negative correlation was found between the ICQ and the VAS pain measures in the criterion validity analysis. Cronbach α values of .75 and .82 were found for the first and second measurements, respectively. The findings of this study reveal that the ICQ is a valid and reliable tool for assessing the comfort of patients in Turkey who are immobilized because of lower extremity orthopedic problems. Copyright © 2015. Published by Elsevier B.V.
Hsu, L-F; Hung, C-L; Kuo, L-J; Tsai, P-S
2017-09-01
No instrument is available to assess the impact of faecal incontinence (FI) of quality of life for Chinese-speaking population. The purpose of the study was to adapt the Faecal Incontinence Quality of Life Scale (FIQL) for patients with colorectal cancer, assess the factor structure and reduce the items for brevity. A sample of 120 participants were enrolled. Internal consistency, test-retest reliability, and convergent and contrasted-groups validity were assessed. Construct validity was analysed using an exploratory and confirmatory factor analyses (CFA). The internal consistency (Cronbach's α of the total scale and four subscales = 0.98 and 0.97, 0.96, 0.92, 0.82 respectively), test-retest reliability (intraclass correlation coefficients ≥.98 for all scales with p < .001) and significant correlations of all scales with selected subscales of the Medical Outcomes Study 36-Item Short-Form Health Survey and the Wexner scale suggested satisfactory reliability and validity. The severe FI group (with a Wexner score ≥9) scored significantly lower on the scale than the less severe FI group (with a Wexner score <9) did (p < .001). The CFA supported a two-factor structure and demonstrated an excellent model fit of the 15-item abbreviated version of the FIQL-Chinese. The FIQL-Chinese has satisfactory validity and reliability and the abbreviated version may be more practical and applicable. © 2016 John Wiley & Sons Ltd.
Bajwa, Nadia M; Yudkowsky, Rachel; Belli, Dominique; Vu, Nu Viet; Park, Yoon Soo
2017-03-01
The purpose of this study was to provide validity and feasibility evidence in measuring professionalism using the Professionalism Mini-Evaluation Exercise (P-MEX) scores as part of a residency admissions process. In 2012 and 2013, three standardized-patient-based P-MEX encounters were administered to applicants invited for an interview at the University of Geneva Pediatrics Residency Program. Validity evidence was gathered for P-MEX content (item analysis); response process (qualitative feedback); internal structure (inter-rater reliability with intraclass correlation and Generalizability); relations to other variables (correlations); and consequences (logistic regression to predict admission). To improve reliability, Kane's formula was used to create an applicant composite score using P-MEX, structured letter of recommendation (SLR), and structured interview (SI) scores. Applicant rank lists using composite scores versus faculty global ratings were compared using the Wilcoxon signed-rank test. Seventy applicants were assessed. Moderate associations were found between pairwise correlations of P-MEX scores and SLR (r = 0.25, P = .036), SI (r = 0.34, P = .004), and global ratings (r = 0.48, P < .001). Generalizability of the P-MEX using three cases was moderate (G-coefficient = 0.45). P-MEX scores had the greatest correlation with acceptance (r = 0.56, P < .001), were the strongest predictor of acceptance (OR 4.37, P < .001), and increased pseudo R-squared by 0.20 points. Including P-MEX scores increased composite score reliability from 0.51 to 0.74. Rank lists of applicants using composite score versus global rating differed significantly (z = 5.41, P < .001). Validity evidence supports the use of P-MEX scores to improve the reliability of the residency admissions process by improving applicant composite score reliability.
ERIC Educational Resources Information Center
Parent, Mike C.; Moradi, Bonnie
2011-01-01
The Conformity to Feminine Norms Inventory-45 (CFNI-45; Parent & Moradi, 2010) is an important tool for assessing level of conformity to feminine gender norms and for investigating the implications of such norms for women's functioning. The authors of the present study assessed the factor structure, measurement invariance, reliability, and…
The Brazilian version of the effort-reward imbalance questionnaire to assess job stress.
Chor, Dóra; Werneck, Guilherme Loureiro; Faerstein, Eduardo; Alves, Márcia Guimarães de Mello; Rotenberg, Lúcia
2008-01-01
The effort-reward imbalance (ERI) model has been used to assess the health impact of job stress. We aimed at describing the cross-cultural adaptation of the ERI questionnaire into Portuguese and some psychometric properties, in particular internal consistency, test-retest reliability, and factorial structure. We developed a Brazilian version of the ERI using a back-translation method and tested its reliability. The test-retest reliability study was conducted with 111 health workers and University staff. The current analyses are based on 89 participants, after exclusion of those with missing data. Reproducibility (interclass correlation coefficients) for the "effort", "'reward", and "'overcommitment"' dimensions of the scale was estimated at 0.76, 0.86, and 0.78, respectively. Internal consistency (Cronbach's alpha) estimates for these same dimensions were 0.68, 0.78, and 0.78, respectively. The exploratory factorial structure was fairly consistent with the model's theoretical components. We conclude that the results of this study represent the first evidence in favor of the application of the Brazilian Portuguese version of the ERI scale in health research in populations with similar socioeconomic characteristics.
Reliability assessment of an OVH HV power line truss transmission tower subjected to seismic loading
NASA Astrophysics Data System (ADS)
Winkelmann, Karol; Jakubowska, Patrycja; Soltysik, Barbara
2017-03-01
The study focuses on the reliability of a transmission tower OS24 ON150 + 10, an element of an OVH HV power line, under seismic loading. In order to describe the seismic force, the real-life recording of the horizontal component of the El Centro earthquake was adopted. The amplitude and the period of this excitation are assumed random, their variation is described by Weibull distribution. The possible space state of the phenomenon is given in the form of a structural response surface (RSM methodology), approximated by an ANOVA table with directional sampling (DS) points. Four design limit states are considered: stress limit criterion for a natural load combination, criterion for an accidental combination (one-sided cable snap), vertical and horizontal translation criteria. According to these cases the HLRF reliability index β is used for structural safety assessment. The RSM approach is well suited for the analysis - it is numerically efficient, not excessively time consuming, indicating a high confidence level. Given the problem conditions, the seismic excitation is shown the sufficient trigger to the loss of load-bearing capacity or stability of the tower.
Psychometric Evaluation of the Wang Pregnancy Stress Scale: Revised for Taiwanese Women.
Wang, Janet F; Billings, Anthony A
2015-01-01
Develop and assess psychometric properties of the Wang Pregnancy Stress Scale for measuring stress among pregnant women in Taiwan. Data were collected in 3 obstetric and gynecological clinics in Taiwan; 485 pregnant women participated in this study. We used exploratory factor analysis and internal consistency reliability was measured using Cronbach's alpha. A 4-factor structure emerged for the Wang Pregnancy Stress Scale. The internal reliability of the scale as measured by Cronbach's alpha was .898, with standardized alpha .905. The Wang Pregnancy Stress Scale has high reliability and validity in measuring pregnancy stress that would allow nurses or health care workers to measure women's stress levels during pregnancy. Nurses can use the assessed pregnancy stress to alter intervention of care for their pregnant clients.
Lee, Soo Cheng; Moy, Foong Ming; Hairi, Noran Naqiah
2017-01-01
The multidimensional scale of perceived social support (MSPSS) was developed to measure perceived social support. It has been translated and culturally adapted among natives literate in the Malay language. However, its psychometric properties for teachers who are majority females and married have not been assessed. This was a cross-sectional study conducted among the public secondary school teachers in the central region of Peninsular Malaysia from May to July 2013. A total of 150 and 203 teachers were recruited to perform exploratory factor analysis and confirmatory factor analysis (CFA), respectively. Reliability testing was evaluated on 141 teachers via internal consistency and two-week interval test-retest. The 12-item three-factor structure of MSPSS-M was revised to 8-item two-factor structure. The revised MSPSS-M demonstrated excellent fit in CFA with adequate divergent and convergent validity and good factor loadings (0.80-0.90). The revised MSPSS-M also displayed good internal consistency with Cronbach's alpha of 0.91, 0.93 and 0.92 and good test-retest reliability with intraclass correlation of 0.89, 0.88 and 0.88 in the total scale, family and friends factors, respectively. The revised 8-item MSPSS-M is a reliable and valid tool for assessment of perceived social support among teachers.
[An instrument in Spanish to evaluate the performance of clinical teachers by students].
Bitran, Marcela; Mena, Beltrán; Riquelme, Arnoldo; Padilla, Oslando; Sánchez, Ignacio; Moreno, Rodrigo
2010-06-01
The modernization of clinical teaching has called for the creation of faculty development programs, and the design of suitable instruments to evaluate clinical teachers' performance. To report the development and validation of an instrument in Spanish designed to measure the students' perceptions of their clinical teachers' performance and to provide them with feedback to improve their teaching practices. In a process that included the active participation of authorities, professors in charge of courses and internships, clinical teachers, students and medical education experts, we developed a 30-item questionnaire called MEDUC30 to evaluate the performance of clinical teachers by their students. The internal validity was assessed by factor analysis of 5214 evaluations of 265 teachers, gathered from 2004 to 2007. The reliability was measured with the Cronbach's alpha coefficient and the generalizability coefficient (g). MEDUC30 had good content and construct validity. Its internal structure was compatible with four factors: patient-centered teaching, teaching skills, assessment skills and learning climate, and it proved to be consistent with the structure anticipated by the theory. The scores were highly reliable (Cronbach's alpha: 0.97); five evaluations per teacher were sufficient to reach a reliability coefficient (g) of 0.8. MEDUC30 is a valid, reliable and useful instrument to evaluate the performance of clinical teachers. To our knowledge, this is the first instrument in Spanish for which solid validity and reliability evidences have been reported. We hope that MEDUC30 will be used to improve medical education in Spanish-speaking medical schools, providing teachers a specific feedback upon which to improve their pedagogical practice, and authorities with valuable information for the assessment of their faculty.
De Wilde, Katrien Sophie; Tency, Inge; Boudrez, Hedwig; Temmerman, Marleen; Maes, Lea; Clays, Els
2016-06-01
Smoking during pregnancy can cause several maternal and neonatal health risks, yet a considerable number of pregnant women continue to smoke. The objectives of this study were to test the factorial structure, validity and reliability of the Dutch version of the Modified Reasons for Smoking Scale (MRSS) in a sample of smoking pregnant women and to understand reasons for continued smoking during pregnancy. A longitudinal design was performed. Data of 97 pregnant smokers were collected during prenatal consultation. Structural equation modelling was performed to assess the construct validity of the MRSS: an exploratory factor analysis was conducted, followed by a confirmatory factor analysis.Test-retest reliability (<16 weeks and 32-34 weeks pregnancy) and internal consistency were assessed using the intraclass correlation coefficient and the Cronbach's alpha, respectively. To verify concurrent validity, Mann-Whitney U-tests were performed examining associations between the MRSS subscales and nicotine dependence, daily consumption, depressive symptoms and intention to quit. We found a factorial structure for the MRSS of 11 items within five subscales in order of importance: tension reduction, addiction, pleasure, habit and social function. Results for internal consistency and test-retest reliability were good to acceptable. There were significant associations of nicotine dependence with tension reduction and addiction and of daily consumption with addiction and habit. Validity and reliability of the MRSS were shown in a sample of pregnant smokers. Tension reduction was the most important reason for continued smoking, followed by pleasure and addiction. Although the score for nicotine dependence was low, addiction was an important reason for continued smoking during pregnancy; therefore, nicotine replacement therapy could be considered. Half of the respondents experienced depressive symptoms. Hence, it is important to identify those women who need more specialized care, which can include not only smoking cessation counselling but also treatment for depression. © 2016 John Wiley & Sons, Ltd.
Chin, Weng Yee; Choi, Edmond P H; Chan, Kit T Y; Wong, Carlos K H
2015-01-01
The Center for Epidemiologic Studies Depression Scale (CES-D) is a commonly used instrument to measure depressive symptomatology. Despite this, the evidence for its psychometric properties remains poorly established in Chinese populations. The aim of this study was to validate the use of the CES-D in Chinese primary care patients by examining factor structure, construct validity, reliability, sensitivity and responsiveness. The psychometric properties were assessed amongst a sample of 3686 Chinese adult primary care patients in Hong Kong. Three competing factor structure models were examined using confirmatory factor analysis. The original CES-D four-structure model had adequate fit, however the data was better fit into a bi-factor model. For the internal construct validity, corrected item-total correlations were 0.4 for most items. The convergent validity was assessed by examining the correlations between the CES-D, the Patient Health Questionnaire 9 (PHQ-9) and the Short Form-12 Health Survey (version 2) Mental Component Summary (SF-12 v2 MCS). The CES-D had a strong correlation with the PHQ-9 (coefficient: 0.78) and SF-12 v2 MCS (coefficient: -0.75). Internal consistency was assessed by McDonald's omega hierarchical (ωH). The ωH value for the general depression factor was 0.855. The ωH values for "somatic", "depressed affect", "positive affect" and "interpersonal problems" were 0.434, 0.038, 0.738 and 0.730, respectively. For the two-week test-retest reliability, the intraclass correlation coefficient was 0.91. The CES-D was sensitive in detecting differences between known groups, with the AUC >0.7. Internal responsiveness of the CES-D to detect positive and negative changes was satisfactory (with p value <0.01 and all effect size statistics >0.2). The CES-D was externally responsive, with the AUC>0.7. The CES-D appears to be a valid, reliable, sensitive and responsive instrument for screening and monitoring depressive symptoms in adult Chinese primary care patients. In its original four-factor and bi-factor structure, the CES-D is supported for cross-cultural comparisons of depression in multi-center studies.
ERIC Educational Resources Information Center
Wiley, Edward W.; Shavelson, Richard J.; Kurpius, Amy A.
2014-01-01
The name "SAT" has become synonymous with college admissions testing; it has been dubbed "the gold standard." Numerous studies on its reliability and predictive validity show that the SAT predicts college performance beyond high school grade point average. Surprisingly, studies of the factorial structure of the current version…
Finn, Natalie K; Torres, Elisa M; Ehrhart, Mark G; Roesch, Scott C; Aarons, Gregory A
2016-08-01
The Implementation Leadership Scale (ILS) is a brief, pragmatic, and efficient measure that can be used for research or organizational development to assess leader behaviors and actions that actively support effective implementation of evidence-based practices (EBPs). The ILS was originally validated with mental health clinicians. This study validates the ILS factor structure with providers in community-based organizations (CBOs) providing child welfare services. Participants were 214 service providers working in 12 CBOs that provide child welfare services. All participants completed the ILS, reporting on their immediate supervisor. Confirmatory factor analyses were conducted to examine the factor structure of the ILS. Internal consistency reliability and measurement invariance were also examined. Confirmatory factor analyses showed acceptable fit to the hypothesized first- and second-order factor structure. Internal consistency reliability was strong and there was partial measurement invariance for the first-order factor structure when comparing child welfare and mental health samples. The results support the use of the ILS to assess leadership for implementation of EBPs in child welfare organizations. © The Author(s) 2016.
Wagner, Flávia; Martel, Michelle M; Cogo-Moreira, Hugo; Maia, Carlos Renato Moreira; Pan, Pedro Mario; Rohde, Luis Augusto; Salum, Giovanni Abrahão
2016-01-01
The best structural model for attention-deficit/hyperactivity disorder (ADHD) symptoms remains a matter of debate. The objective of this study is to test the fit and factor reliability of competing models of the dimensional structure of ADHD symptoms in a sample of randomly selected and high-risk children and pre-adolescents from Brazil. Our sample comprised 2512 children aged 6-12 years from 57 schools in Brazil. The ADHD symptoms were assessed using parent report on the development and well-being assessment (DAWBA). Fit indexes from confirmatory factor analysis were used to test unidimensional, correlated, and bifactor models of ADHD, the latter including "g" ADHD and "s" symptom domain factors. Reliability of all models was measured with omega coefficients. A bifactor model with one general factor and three specific factors (inattention, hyperactivity, impulsivity) exhibited the best fit to the data, according to fit indices, as well as the most consistent factor loadings. However, based on omega reliability statistics, the specific inattention, hyperactivity, and impulsivity dimensions provided very little reliable information after accounting for the reliable general ADHD factor. Our study presents some psychometric evidence that ADHD specific ("s") factors might be unreliable after taking common ("g" factor) variance into account. These results are in accordance with the lack of longitudinal stability among subtypes, the absence of dimension-specific molecular genetic findings and non-specific effects of treatment strategies. Therefore, researchers and clinicians might most effectively rely on the "g" ADHD to characterize ADHD dimensional phenotype, based on currently available symptom items.
[Evaluation of an educational website on First Aid].
Mori, Satomi; Whitaker, Iveth Yamaguchi; Marin, Heimar de Fátima
2013-08-01
The aim of this study was to evaluate the structure, quality of information and usability of a website on First Aid. The evaluation was performed by information technology (IT) and health care professionals and by students, using specific and validated instruments. The kappa method was used to evaluate the agreement of the answers, and Cronbach's α coefficient was used to assess the reliability of the instrument. There was no agreement (0.047) among the answers obtained from the IT professionals, indicating that the structure of the website must be reviewed. There was also no agreement in the evaluation by the health care professionals (-0.062); however, the overall positive scores suggest that the quality of the information of the website is adequate. The assessment of reliability of the instrument to evaluate the navigability rendered a value of α=0.974. Although improvement of the website structure is recommended, the quality of the information is good, and its use has contributed to the apprenticeship of students.
Wu, Xi Vivien; Enskär, Karin; Pua, Lay Hoon; Heng, Doreen Gek Noi; Wang, Wenru
2016-09-22
A major focus in nursing education is on the judgement of clinical performance, and it is a complex process due to the diverse nature of nursing practice. A holistic approach in assessment of competency is advocated. Difficulties in the development of valid and reliable assessment measures in nursing competency have resulted in the development of assessment instruments with an increase in face and content validity, but few studies have tested these instruments psychometrically. It is essential to develop a holistic assessment tool to meet the needs of the clinical education. The study aims to develop a Holistic Clinical Assessment Tool (HCAT) and test its psychometric properties. The HCAT was developed based on the systematic literature review and the findings of qualitative studies. An expert panel was invited to evaluate the content validity of the tool. A total of 130 final-year nursing undergraduate students were recruited to evaluate the psychometric properties (i.e. factor structure, internal consistency and test-retest reliability) of the tool. The HCAT has good content validity with content validity index of .979. The exploratory factor analysis reveals a four-factor structure of the tool. The internal consistency and test-retest reliability of the HCAT are satisfactory with Cronbach alpha ranging from .789 to .965 and Intraclass Correlation Coefficient ranging from .881 to .979 for the four subscales and total scale. HCAT has the potential to be used as a valid measure to evaluate clinical competence in nursing students, and provide specific and ongoing feedback to enhance the holistic clinical learning experience. In addition, HCAT functions as a tool for self-reflection, peer-assessment and guides preceptors in clinical teaching and assessment.
Leckelt, Marius; Wetzel, Eunike; Gerlach, Tanja M; Ackerman, Robert A; Miller, Joshua D; Chopik, William J; Penke, Lars; Geukes, Katharina; Küfner, Albrecht C P; Hutteman, Roos; Richter, David; Renner, Karl-Heinz; Allroggen, Marc; Brecheen, Courtney; Campbell, W Keith; Grossmann, Igor; Back, Mitja D
2018-01-01
Due to increased empirical interest in narcissism across the social sciences, there is a need for inventories that can be administered quickly while also reliably measuring both the agentic and antagonistic aspects of grandiose narcissism. In this study, we sought to validate the factor structure, provide representative descriptive data and reliability estimates, assess the reliability across the trait spectrum, and examine the nomological network of the short version of the Narcissistic Admiration and Rivalry Questionnaire (NARQ-S; Back et al., 2013). We used data from a large convenience sample (total N = 11,937) as well as data from a large representative sample (total N = 4,433) that included responses to other narcissism measures as well as related constructs, including the other Dark Triad traits, Big Five personality traits, and self-esteem. Confirmatory factor analysis and item response theory were used to validate the factor structure and estimate the reliability across the latent trait spectrum, respectively. Results suggest that the NARQ-S shows a robust factor structure and is a reliable and valid short measure of the agentic and antagonistic aspects of grandiose narcissism. We also discuss future directions and applications of the NARQ-S as a short and comprehensive measure of grandiose narcissism. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Bem Sex Role Inventory Validation in the International Mobility in Aging Study.
Ahmed, Tamer; Vafaei, Afshin; Belanger, Emmanuelle; Phillips, Susan P; Zunzunegui, Maria-Victoria
2016-09-01
This study investigated the measurement structure of the Bem Sex Role Inventory (BSRI) with different factor analysis methods. Most previous studies on validity applied exploratory factor analysis (EFA) to examine the BSRI. We aimed to assess the psychometric properties and construct validity of the 12-item short-form BSRI in a sample administered to 1,995 older adults from wave 1 of the International Mobility in Aging Study (IMIAS). We used Cronbach's alpha to assess internal consistency reliability and confirmatory factor analysis (CFA) to assess psychometric properties. EFA revealed a three-factor model, further confirmed by CFA and compared with the original two-factor structure model. Results revealed that a two-factor solution (instrumentality-expressiveness) has satisfactory construct validity and superior fit to data compared to the three-factor solution. The two-factor solution confirms expected gender differences in older adults. The 12-item BSRI provides a brief, psychometrically sound, and reliable instrument in international samples of older adults.
The long case and its modifications: a literature review.
Ponnamperuma, Gominda G; Karunathilake, Indika M; McAleer, Sean; Davis, Margery H
2009-10-01
This review provides a summary of the published literature on the suitability of the long case and its modifications for high-stakes assessment. Databases related to medicine were searched for articles published from 2000 to 2008, using the keywords 'long case', 'clinical examinations' and 'clinical assessment'. Reference lists of review articles were hand-searched. Articles related to the objective structured clinical examination were eliminated. Research-based articles with hard data were given more emphasis in this review than those based on opinion. Eighteen articles were identified. The main disadvantage of the long case is its inability to sample the curriculum widely, resulting in low reliability. The main advantage of the long case is its ability to assess the candidate's overall (holistic) approach to the patient. Modifications to the long case attempt to: structure the format and the marking scheme; increase the number of examiners; observe the candidate's behaviour, and increase the number of cases. The long case is a traditional clinical examination format for the assessment of clinical competence and assessment at this level is important. The starting point for the majority of recent research on the long case has been an acceptance of its low reliability and modifications to the format have been proposed. Further evidence of the efficacy of these modifications is required, however, before they can be recommended for summative assessment. If further research is to be undertaken on the long case, it should focus on finding practicable ways of sampling the curriculum widely to increase reliability while maintaining the holistic approach towards the patient, which represents the attraction of the long case.
Klein, Britt; Meyer, Denny; Austin, David William; Abbott, Jo-Anne M
2015-01-01
Background Internet-based assessment has the potential to assist with the diagnosis of mental health disorders and overcome the barriers associated with traditional services (eg, cost, stigma, distance). Further to existing online screening programs available, there is an opportunity to deliver more comprehensive and accurate diagnostic tools to supplement the assessment and treatment of mental health disorders. Objective The aim was to evaluate the diagnostic criterion validity and test-retest reliability of the electronic Psychological Assessment System (e-PASS), an online, self-report, multidisorder, clinical assessment and referral system. Methods Participants were 616 adults residing in Australia, recruited online, and representing prospective e-PASS users. Following e-PASS completion, 158 participants underwent a telephone-administered structured clinical interview and 39 participants repeated the e-PASS within 25 days of initial completion. Results With structured clinical interview results serving as the gold standard, diagnostic agreement with the e-PASS varied considerably from fair (eg, generalized anxiety disorder: κ=.37) to strong (eg, panic disorder: κ=.62). Although the e-PASS’ sensitivity also varied (0.43-0.86) the specificity was generally high (0.68-1.00). The e-PASS sensitivity generally improved when reducing the e-PASS threshold to a subclinical result. Test-retest reliability ranged from moderate (eg, specific phobia: κ=.54) to substantial (eg, bulimia nervosa: κ=.87). Conclusions The e-PASS produces reliable diagnostic results and performs generally well in excluding mental disorders, although at the expense of sensitivity. For screening purposes, the e-PASS subclinical result generally appears better than a clinical result as a diagnostic indicator. Further development and evaluation is needed to support the use of online diagnostic assessment programs for mental disorders. Trial Registration Australian and New Zealand Clinical Trials Registry ACTRN121611000704998; http://www.anzctr.org.au/trial_view.aspx?ID=336143 (Archived by WebCite at http://www.webcitation.org/618r3wvOG). PMID:26392066
Griffiths, A; Cox, T; Karanika, M; Khan, S; Tomás, J‐M
2006-01-01
Objectives To examine the factor structure, reliability, and validity of a new context‐specific questionnaire for the assessment of work and organisational factors. The Work Organisation Assessment Questionnaire (WOAQ) was developed as part of a risk assessment and risk reduction methodology for hazards inherent in the design and management of work in the manufacturing sector. Method Two studies were conducted. Data were collected from 524 white‐ and blue‐collar employees from a range of manufacturing companies. Exploratory factor analysis was carried out on 28 items that described the most commonly reported failures of work design and management in companies in the manufacturing sector. Concurrent validity data were also collected. A reliability study was conducted with a further 156 employees. Results Principal component analysis, with varimax rotation, revealed a strong 28‐item, five factor structure. The factors were named: quality of relationships with management, reward and recognition, workload, quality of relationships with colleagues, and quality of physical environment. Analyses also revealed a more general summative factor. Results indicated that the questionnaire has good internal consistency and test‐retest reliability and validity. Being associated with poor employee health and changes in health related behaviour, the WOAQ factors are possible hazards. It is argued that the strength of those associations offers some estimation of risk. Feedback from the organisations involved indicated that the WOAQ was easy to use and meaningful for them as part of their risk assessment procedures. Conclusions The studies reported here describe a model of the hazards to employee health and health related behaviour inherent in the design and management of work in the manufacturing sector. It offers an instrument for their assessment. The scales derived which form the WOAQ were shown to be reliable, valid, and meaningful to the user population. PMID:16858081
Nateghi, Roshanak; Guikema, Seth D; Wu, Yue Grace; Bruss, C Bayan
2016-01-01
The U.S. federal government regulates the reliability of bulk power systems, while the reliability of power distribution systems is regulated at a state level. In this article, we review the history of regulating electric service reliability and study the existing reliability metrics, indices, and standards for power transmission and distribution networks. We assess the foundations of the reliability standards and metrics, discuss how they are applied to outages caused by large exogenous disturbances such as natural disasters, and investigate whether the standards adequately internalize the impacts of these events. Our reflections shed light on how existing standards conceptualize reliability, question the basis for treating large-scale hazard-induced outages differently from normal daily outages, and discuss whether this conceptualization maps well onto customer expectations. We show that the risk indices for transmission systems used in regulating power system reliability do not adequately capture the risks that transmission systems are prone to, particularly when it comes to low-probability high-impact events. We also point out several shortcomings associated with the way in which regulators require utilities to calculate and report distribution system reliability indices. We offer several recommendations for improving the conceptualization of reliability metrics and standards. We conclude that while the approaches taken in reliability standards have made considerable advances in enhancing the reliability of power systems and may be logical from a utility perspective during normal operation, existing standards do not provide a sufficient incentive structure for the utilities to adequately ensure high levels of reliability for end-users, particularly during large-scale events. © 2015 Society for Risk Analysis.
TENI: A comprehensive battery for cognitive assessment based on games and technology.
Delgado, Marcela Tenorio; Uribe, Paulina Arango; Alonso, Andrés Aparicio; Díaz, Ricardo Rosas
2016-01-01
TENI (Test de Evaluación Neuropsicológica Infantil) is an instrument developed to assess cognitive abilities in children between 3 and 9 years of age. It is based on a model that incorporates games and technology as tools to improve the assessment of children's capacities. The test was standardized with two Chilean samples of 524 and 82 children living in urban zones. Evidence of reliability and validity based on current standards is presented. Data show good levels of reliability for all subtests. Some evidence of validity in terms of content, test structure, and association with other variables is presented. This instrument represents a novel approach and a new frontier in cognitive assessment. Further studies with clinical, rural, and cross-cultural populations are required.
The Beck Cognitive Insight Scale (BCIS): translation and validation of the Taiwanese version.
Kao, Yu-Chen; Liu, Yia-Ping
2010-04-09
Over the last few decades, research concerning the insight of patients with schizophrenia and its relationships with other clinical variables has been given much attention in the clinical setting. Since that time, a series of instruments assessing insight have been developed. The purpose of this study was to examine the reliability and validity of the Taiwanese version of the Beck Cognitive Insight Scale (BCIS). The BCIS is a self-administered instrument designed to evaluate cognitive processes that involves reevaluating patients' anomalous experiences and specific misinterpretations. The English language version of the BCIS was translated into Taiwanese for use in this study. A total of 180 subjects with and without psychosis completed the Taiwanese version of the BCIS and additional evaluations to assess researcher-rated insight scales and psychopathology. Psychometric properties (factor structures and various types of reliability and validity) were assessed for this translated questionnaire. Overall, the Taiwanese version of the BCIS showed good reliability and stability over time. This translated scale comprised a two-factor solution corresponding to reflective attitude and certain attitude subscales. Following the validation of the internal structure of the scale, we obtained an R-C (reflective attitude minus certain attitude) index of the translated BCIS, representing the measurement of cognitive insight by subtracting the score of the certain attitude subscale from that of the reflective attitude subscale. As predicted, the differences in mean reflective attitude, certain attitude and R-C index between subjects with and without psychosis were significant. Our data also demonstrated that psychotic patients were significantly less reflective, more confident in their beliefs, and had less cognitive insight compared with nonpsychotic control groups. In light of these findings, we believe that the Taiwanese version of BCIS is a valid and reliable instrument for the assessment of cognitive insight in psychotic patients.
Ogden, C A; Akobeng, A K; Abbott, J; Aggett, P; Sood, M R; Thomas, A G
2011-09-01
To validate IMPACT-III (UK), a health-related quality of life (HRQoL) instrument, in British children with inflammatory bowel disease (IBD). One hundred six children and parents were invited to participate. IMPACT-III (UK) was validated by inspection by health professionals and children to assess face and content validity, factor analysis to determine optimum domain structure, use of Cronbach alpha coefficients to test internal reliability, ANOVA to assess discriminant validity, correlation with the Child Health Questionnaire to assess concurrent validity, and use of intraclass correlation coefficients to assess test-retest reliability. The independent samples t test was used to measure differences between sexes and age groups, and between paper and computerised versions of IMPACT-III (UK). IMPACT-III (UK) had good face and content validity. The most robust factor solution was a 5-domain structure: body image, embarrassment, energy, IBD symptoms, and worries/concerns about IBD, all of which demonstrated good internal reliability (α = 0.74-0.88). Discriminant validity was demonstrated by significant (P < 0.05, P < 0.01) differences in HRQoL scores between the severe, moderate, and inactive/mild symptom severity groups for the embarrassment scale (63.7 vs 81.0 vs 81.2), IBD symptom scale (45.0 vs 64.2 vs 80.6), and the energy scale (46.4 vs 62.1 vs 77.7). Concurrent validity of IMPACT-III (UK) with comparable domains of the Child Health Questionnaire was confirmed. Test-retest reliability was confirmed with good intraclass correlation coefficients of 0.66 to 0.84. Paper and computer versions of IMPACT-III (UK) collected comparable scores, and there were no differences between the sexes and age groups. IMPACT-III (UK) appears to be a useful tool to measure HRQoL in British children with IBD.
Tang, D Y Y; Liu, A C Y; Leung, M H T; Siu, B W M
2013-06-01
OBJECTIVE. Antisocial personality disorder (ASPD) is a risk factor for violence and is associated with poor treatment response when it is a co-morbid condition with substance abuse. It is an under-recognised clinical entity in the local Hong Kong setting, for which there are only a few available Chinese-language diagnostic instruments. None has been tested for its psychometric properties in the Cantonese-speaking population in Hong Kong. This study therefore aimed to assess the reliability and validity of the Chinese version of the ASPD subscale of the Structured Clinical Interview for the DSM-IV Axis II Disorders (SCID-II) in Hong Kong Chinese. METHODS. This assessment tool was modified according to dialectal differences between Mainland China and Hong Kong. Inpatients in Castle Peak Hospital, Hong Kong, who were designated for priority follow-up based on their assessed propensity for violence and who fulfilled the inclusion criteria for the study, were recruited. To assess the level of agreement, best-estimate diagnosis made by a multidisciplinary team was compared with diagnostic status determined by the SCID-II ASPD subscale. The internal consistency, sensitivity, and specificity of the subscale were also calculated. RESULTS. The internal consistency of the subscale was acceptable at 0.79, whereas the test-retest reliability and inter-rater reliability showed an excellent and good agreement of 0.90 and 0.86, respectively. Best-estimate clinical diagnosis-SCID diagnosis agreement was acceptable at 0.76. The sensitivity, specificity, positive and negative predictive values were 0.91, 0.86, 0.83, and 0.93, respectively. CONCLUSION. The Chinese version of the SCID-II ASPD subscale is reliable and valid for diagnosing ASPD in a Cantonese-speaking clinical population.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1987-03-01
An assessment of needs was completed, and a five-year project plan was developed with extensive input from private industry. Objective is to develop the industrial technology base required for reliable ceramics for application in advanced automotive heat engines. The project approach includes determining the mechanisms controlling reliability, improving processes for fabricating existing ceramics, developing new materials with increased reliability, and testing these materials in simulated engine environments to confirm reliability. Although this is a generic materials project, the focus is on structural ceramics for advanced gas turbine and diesel engines, ceramic bearings and attachments, and ceramic coatings for thermal barriermore » and wear applications in these engines.« less
Assessment of Semi-Structured Clinical Interview for Mobile Phone Addiction Disorder.
Alavi, Seyyed Salman; Mohammadi, Mohammad Reza; Jannatifard, Fereshteh; Mohammadi Kalhori, Soroush; Sepahbodi, Ghazal; BabaReisi, Mohammad; Sajedi, Sahar; Farshchi, Mojtaba; KhodaKarami, Rasul; Hatami Kasvaee, Vahid
2016-04-01
The Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision (DSM-IV-TR) classified mobile phone addiction disorder under "impulse control disorder not elsewhere classified". This study surveyed the diagnostic criteria of DSM-IV-TR for the diagnosis of mobile phone addiction in correspondence with Iranian society and culture. Two hundred fifty students of Tehran universities were entered into this descriptive-analytical and cross-sectional study. Quota sampling method was used. At first, semi- structured clinical interview (based on DSM-IV-TR) was performed for all the cases, and another specialist reevaluated the interviews. Data were analyzed using content validity, inter-scorer reliability (Kappa coefficient) and test-retest via SPSS18 software. The content validity of the semi- structured clinical interview matched the DSM-IV-TR criteria for behavioral addiction. Moreover, their content was appropriate, and two items, including "SMS pathological use" and "High monthly cost of using the mobile phone" were added to promote its validity. Internal reliability (Kappa) and test-retest reliability were 0.55 and r = 0.4 (p<0. 01) respectively. The results of this study revealed that semi- structured diagnostic criteria of DSM-IV-TR are valid and reliable for diagnosing mobile phone addiction, and this instrument is an effective tool to diagnose this disorder.
Improved reliability of wind turbine towers with active tuned mass dampers (ATMDs)
NASA Astrophysics Data System (ADS)
Fitzgerald, Breiffni; Sarkar, Saptarshi; Staino, Andrea
2018-04-01
Modern multi-megawatt wind turbines are composed of slender, flexible, and lightly damped blades and towers. These components exhibit high susceptibility to wind-induced vibrations. As the size, flexibility and cost of the towers have increased in recent years, the need to protect these structures against damage induced by turbulent aerodynamic loading has become apparent. This paper combines structural dynamic models and probabilistic assessment tools to demonstrate improvements in structural reliability when modern wind turbine towers are equipped with active tuned mass dampers (ATMDs). This study proposes a multi-modal wind turbine model for wind turbine control design and analysis. This study incorporates an ATMD into the tower of this model. The model is subjected to stochastically generated wind loads of varying speeds to develop wind-induced probabilistic demand models for towers of modern multi-megawatt wind turbines under structural uncertainty. Numerical simulations have been carried out to ascertain the effectiveness of the active control system to improve the structural performance of the wind turbine and its reliability. The study constructs fragility curves, which illustrate reductions in the vulnerability of towers to wind loading owing to the inclusion of the damper. Results show that the active controller is successful in increasing the reliability of the tower responses. According to the analysis carried out in this paper, a strong reduction of the probability of exceeding a given displacement at the rated wind speed has been observed.
Ahmed, Ashraf; Qayed, Khalil Ibrahim; Abdulrahman, Mahera; Tavares, Walter; Rosenfeld, Jack
2014-08-01
Numerous studies have shown that multiple mini-interviews (MMI) provides a standard, fair, and more reliable method for assessing applicants. This article presents the first MMI experience for selection of medical residents in the Middle East culture and an Arab country. In 2012, we started using the MMI in interviewing applicants to the residency program of Dubai Health Authority. This interview process consisted of eight, eight-minute structured interview scenarios. Applicants rotated through the stations, each with its own interviewer and scenario. They read the scenario and were requested to discuss the issues with the interviewers. Sociodemographic and station assessment data provided for each applicant were analyzed to determine whether the MMI was a reliable assessment of the non-clinical attributes in the present setting of an Arab country. One hundred and eighty-seven candidates from 27 different countries were interviewed for Dubai Residency Training Program using MMI. They were graduates of 5 medical universities within United Arab Emirates (UAE) and 60 different universities outside UAE. With this applicant's pool, a MMI with eight stations, produced absolute and relative reliability of 0.8 and 0.81, respectively. The person × station interaction contributed 63% of the variance components, the person contributed 34% of the variance components, and the station contributed 2% of the variance components. The MMI has been used in numerous universities in English speaking countries. The MMI evaluates non-clinical attributes and this study provides further evidence for its reliability but in a different country and culture. The MMI offers a fair and more reliable assessment of applicants to medical residency programs. The present data show that this assessment technique applied in a non-western country and Arab culture still produced reliable results.
Implementing the undergraduate mini-CEX: a tailored approach at Southampton University.
Hill, Faith; Kendall, Kathleen; Galbraith, Kevin; Crossley, Jim
2009-04-01
The mini-clinical evaluation exercise (mini-CEX) is widely used in the UK to assess clinical competence, but there is little evidence regarding its implementation in the undergraduate setting. This study aimed to estimate the validity and reliability of the undergraduate mini-CEX and discuss the challenges involved in its implementation. A total of 3499 mini-CEX forms were completed. Validity was assessed by estimating associations between mini-CEX score and a number of external variables, examining the internal structure of the instrument, checking competency domain response rates and profiles against expectations, and by qualitative evaluation of stakeholder interviews. Reliability was evaluated by overall reliability coefficient (R), estimation of the standard error of measurement (SEM), and from stakeholders' perceptions. Variance component analysis examined the contribution of relevant factors to students' scores. Validity was threatened by various confounding variables, including: examiner status; case complexity; attachment specialty; patient gender, and case focus. Factor analysis suggested that competency domains reflect a single latent variable. Maximum reliability can be achieved by aggregating scores over 15 encounters (R = 0.73; 95% confidence interval [CI] +/- 0.28 based on a 6-point assessment scale). Examiner stringency contributed 29% of score variation and student attachment aptitude 13%. Stakeholder interviews revealed staff development needs but the majority perceived the mini-CEX as more reliable and valid than the previous long case. The mini-CEX has good overall utility for assessing aspects of the clinical encounter in an undergraduate setting. Strengths include fidelity, wide sampling, perceived validity, and formative observation and feedback. Reliability is limited by variable examiner stringency, and validity by confounding variables, but these should be viewed within the context of overall assessment strategies.
Baker, Elizabeth A; Ledford, Cynthia H; Fogg, Louis; Way, David P; Park, Yoon Soo
2015-01-01
Construct: Clinical skills are used in the care of patients, including reporting, diagnostic reasoning, and decision-making skills. Written comprehensive new patient admission notes (H&Ps) are a ubiquitous part of student education but are underutilized in the assessment of clinical skills. The interpretive summary, differential diagnosis, explanation of reasoning, and alternatives (IDEA) assessment tool was developed to assess students' clinical skills using written comprehensive new patient admission notes. The validity evidence for assessment of clinical skills using clinical documentation following authentic patient encounters has not been well documented. Diagnostic justification tools and postencounter notes are described in the literature (1,2) but are based on standardized patient encounters. To our knowledge, the IDEA assessment tool is the first published tool that uses medical students' H&Ps to rate students' clinical skills. The IDEA assessment tool is a 15-item instrument that asks evaluators to rate students' reporting, diagnostic reasoning, and decision-making skills based on medical students' new patient admission notes. This study presents validity evidence in support of the IDEA assessment tool using Messick's unified framework, including content (theoretical framework), response process (interrater reliability), internal structure (factor analysis and internal-consistency reliability), and relationship to other variables. Validity evidence is based on results from four studies conducted between 2010 and 2013. First, the factor analysis (2010, n = 216) yielded a three-factor solution, measuring patient story, IDEA, and completeness, with reliabilities of .79, .88, and .79, respectively. Second, an initial interrater reliability study (2010) involving two raters demonstrated fair to moderate consensus (κ = .21-.56, ρ =.42-.79). Third, a second interrater reliability study (2011) with 22 trained raters also demonstrated fair to moderate agreement (intraclass correlations [ICCs] = .29-.67). There was moderate reliability for all three skill domains, including reporting skills (ICC = .53), diagnostic reasoning skills (ICC = .64), and decision-making skills (ICC = .63). Fourth, there was a significant correlation between IDEA rating scores (2010-2013) and final Internal Medicine clerkship grades (r = .24), 95% confidence interval (CI) [.15, .33]. The IDEA assessment tool is a novel tool with validity evidence to support its use in the assessment of students' reporting, diagnostic reasoning, and decision-making skills. The moderate reliability achieved supports formative or lower stakes summative uses rather than high-stakes summative judgments.
Intersession reliability of fMRI activation for heat pain and motor tasks
Quiton, Raimi L.; Keaser, Michael L.; Zhuo, Jiachen; Gullapalli, Rao P.; Greenspan, Joel D.
2014-01-01
As the practice of conducting longitudinal fMRI studies to assess mechanisms of pain-reducing interventions becomes more common, there is a great need to assess the test–retest reliability of the pain-related BOLD fMRI signal across repeated sessions. This study quantitatively evaluated the reliability of heat pain-related BOLD fMRI brain responses in healthy volunteers across 3 sessions conducted on separate days using two measures: (1) intraclass correlation coefficients (ICC) calculated based on signal amplitude and (2) spatial overlap. The ICC analysis of pain-related BOLD fMRI responses showed fair-to-moderate intersession reliability in brain areas regarded as part of the cortical pain network. Areas with the highest intersession reliability based on the ICC analysis included the anterior midcingulate cortex, anterior insula, and second somatosensory cortex. Areas with the lowest intersession reliability based on the ICC analysis also showed low spatial reliability; these regions included pregenual anterior cingulate cortex, primary somatosensory cortex, and posterior insula. Thus, this study found regional differences in pain-related BOLD fMRI response reliability, which may provide useful information to guide longitudinal pain studies. A simple motor task (finger-thumb opposition) was performed by the same subjects in the same sessions as the painful heat stimuli were delivered. Intersession reliability of fMRI activation in cortical motor areas was comparable to previously published findings for both spatial overlap and ICC measures, providing support for the validity of the analytical approach used to assess intersession reliability of pain-related fMRI activation. A secondary finding of this study is that the use of standard ICC alone as a measure of reliability may not be sufficient, as the underlying variance structure of an fMRI dataset can result in inappropriately high ICC values; a method to eliminate these false positive results was used in this study and is recommended for future studies of test–retest reliability. PMID:25161897
Cavelti, M; Wirtz, M; Corrigan, P; Vauth, R
2017-03-01
The recovery framework has found its way into local and national mental health services and policies around the world, especially in English speaking countries. To promote this process, it is necessary to assess personal recovery validly and reliably. The Recovery Assessment Scale (RAS) is the most established measure in recovery research. The aim of the current study is to examine the factor structure of the German version of the RAS (RAS-G). One hundred and fifty-six German-speaking clients with schizophrenia or schizoaffective disorder from a community mental health service completed the RAS-G plus measures of recovery attitudes, self-stigma, psychotic symptoms, depression, and functioning. A confirmatory factor analysis of the original 24-item RAS version was conducted to examine its factor structure, followed by reliability and validity testing of the extracted factors. The CFA yielded five factors capturing 14 items which showed a substantial overlap with the original subscales Personal Confidence and Hope, Goal and Success Orientation, Willingness to Ask for Help, Reliance on Others, and No Domination by Symptoms. The factors demonstrated mean to excellent reliability (0.59-0.89) and satisfactory criterial validity by positive correlations with measures of recovery attitudes and functioning, and negative correlations with measures of self-stigma, and psychotic and depressive symptoms. The study results are discussed in the light of other studies examining the factor structure of the RAS. Overall, they support the use of the RAS-G as a means to promote recovery oriented services, policies, and research in German-speaking countries. Copyright © 2016 Elsevier Masson SAS. All rights reserved.
Chan, Kin Sun
2018-01-01
Objectives This study aimed to evaluate the internal consistency, reliability, convergent validity, known-group comparisons, and structural validity of the Chinese version of Fear of Intimacy with Helping Professionals (C–FIS–HP) scale in Macau. Methods A cross-sectional design was used on a sample of 593 older people in 6 health centers. We used Chinese version of Exercise of Self-Care Agency Scale (C-ESCAS) and Morisky 4-item medication adherence scale to evaluate self-care actions and medication adherence. The internal consistency and reliability of C–FIS–HP were analyzed using the Spearman-Brown split-half reliability, Cronbach’s alpha, and test–retest reliability. Convergent validity was tested the construct of C–FIS–HP and self-care actions. Known-group comparisons differentiated predefined groups in an expected direction. Two separated samples were used to test the structural validity. An exploratory factor analysis (EFA) tested the factor structure of C–FISHP using the principal axis factoring. A confirmatory factor analysis (CFA) was further conducted to confirm the factor structure constructed in the prior EFA. Results The C–FIS–HP had a Spearman-Brown split-half coefficient, Cronbach’s alpha, and intraclass correlation coefficient of 0.96, 0.93, and 0.96, respectively. Convergent validity was satisfactory with significantly correlations between the C-FIS-HP and C-ESCAS. C–FIS–HP to differentiate the differences between high-, moderate-, and low- medication adherence groups. EFA demonstrated a two-factor structure among 297 older people. A first-order CFA was performed to confirm the construct dimensionality of C–FIS–HP with satisfactory fit indices (NFI = 0.92; IFI = 0.95; TLI = 0.94; CFI = 0.95 and RMSEA = 0.07) among 296 older people. Conclusions C–FIS–HP is a reliable and valid test for assessing helping relationships in older Chinese people. Health professionals can use C–FIS–HP as a clinical tool to assess the comfort level of patients in a helping relationship, and use this information to develop culturally sensitive therapeutic interventions and treatment plans. Further studies need to be conducted concerning the different psychometric properties, as well as the application of C–FIS–HP in various regions. PMID:29795563
First-order reliability application and verification methods for semistatic structures
NASA Astrophysics Data System (ADS)
Verderaime, V.
1994-11-01
Escalating risks of aerostructures stimulated by increasing size, complexity, and cost should no longer be ignored in conventional deterministic safety design methods. The deterministic pass-fail concept is incompatible with probability and risk assessments; stress audits are shown to be arbitrary and incomplete, and the concept compromises the performance of high-strength materials. A reliability method is proposed that combines first-order reliability principles with deterministic design variables and conventional test techniques to surmount current deterministic stress design and audit deficiencies. Accumulative and propagation design uncertainty errors are defined and appropriately implemented into the classical safety-index expression. The application is reduced to solving for a design factor that satisfies the specified reliability and compensates for uncertainty errors, and then using this design factor as, and instead of, the conventional safety factor in stress analyses. The resulting method is consistent with current analytical skills and verification practices, the culture of most designers, and the development of semistatic structural designs.
ERIC Educational Resources Information Center
Desmarais, Sarah L.; Nicholls, Tonia L.; Wilson, Catherine M.; Brink, Johann
2012-01-01
The Short-Term Assessment of Risk and Treatability (START; C. D. Webster, M. L. Martin, J. Brink, T. L. Nicholls, & S. L. Desmarais, 2009; C. D. Webster, M. L. Martin, J. Brink, T. L. Nicholls, & C. Middleton, 2004) is a relatively new structured professional judgment guide for the assessment and management of short-term risks associated…
ERIC Educational Resources Information Center
Petway, Kevin T., II; Rikoon, Samuel H.; Brenneman, Meghan W.; Burrus, Jeremy; Roberts, Richard D.
2016-01-01
The Mission Skills Assessment (MSA) is an online assessment that targets 6 noncognitive constructs: creativity, curiosity, ethics, resilience, teamwork, and time management. Each construct is measured by means of a student self-report scale, a student alternative scale (e.g., situational judgment test), and a teacher report scale. Use of the MSA…
Vives-Vergara, Alejandra; González-López, Francisca; Solar, Orielle; Bernales-Baksai, Pamela; González, María José; Benach, Joan
2017-04-20
The purpose of this study is to perform a psychometric analysis (acceptability, reliability and factor structure) of the Chilean version of the new Employment Precariousness Scale (EPRES). The data is drawn from a sample of 4,248 private salaried workers with a formal contract from the first Chilean Employment Conditions, Work, Health and Quality of Life (ENETS) survey, applied to a nationally representative sample of the Chilean workforce in 2010. Item and scale-level statistics were performed to assess scaling properties, acceptability and reliability. The six-dimensional factor structure was examined with confirmatory factor analysis. The scale exhibited high acceptability (roughly 80%) and reliability (Cronbach's alpha 0.83) and the factor structure was confirmed. One subscale (rights) demonstrated poorer metric properties without compromising the overall scale. The Chilean version of the Employment Precariousness Scale (EPRES-Ch) demonstrated good metric properties, pointing to its suitability for use in epidemiologic and public health research.
NASA Astrophysics Data System (ADS)
Lauer, Eric A.; Corner, Brian D.; Li, Peng; Beecher, Robert M.; Deutsch, Curtis
2002-03-01
Traditionally, medical geneticists have employed visual inspection (anthroposcopy) to clinically evaluate dysmorphology. In the last 20 years, there has been an increasing trend towards quantitative assessment to render diagnosis of anomalies more objective and reliable. These methods have focused on direct anthropometry, using a combination of classical physical anthropology tools and new instruments tailor-made to describe craniofacial morphometry. These methods are painstaking and require that the patient remain still for extended periods of time. Most recently, semiautomated techniques (e.g., structured light scanning) have been developed to capture the geometry of the face in a matter of seconds. In this paper, we establish that direct anthropometry and structured light scanning yield reliable measurements, with remarkably high levels of inter-rater and intra-rater reliability, as well as validity (contrasting the two methods).
Bartels, Meike; Cath, Danielle C.; Boomsma, Dorret I.
2008-01-01
The factor structure of the Dutch translation of the Autism-Spectrum Quotient (AQ; a continuous, quantitative measure of autistic traits) was evaluated with confirmatory factor analyses in a large general population and student sample. The criterion validity of the AQ was examined in three matched patient groups (autism spectrum conditions (ASC), social anxiety disorder, and obsessive–compulsive disorder). A two factor model, consisting of a “Social interaction” factor and “Attention to detail” factor could be identified. The internal consistency and test–retest reliability of the AQ were satisfactory. High total AQ and factor scores were specific to ASC patients. Men scored higher than women and science students higher than non-science students. The Dutch translation of the AQ is a reliable instrument to assess autism spectrum conditions. PMID:18302013
Reliability-Based Life Assessment of Stirling Convertor Heater Head
NASA Technical Reports Server (NTRS)
Shah, Ashwin R.; Halford, Gary R.; Korovaichuk, Igor
2004-01-01
Onboard radioisotope power systems being developed and planned for NASA's deep-space missions require reliable design lifetimes of up to 14 yr. The structurally critical heater head of the high-efficiency Stirling power convertor has undergone extensive computational analysis of operating temperatures, stresses, and creep resistance of the thin-walled Inconel 718 bill of material. A preliminary assessment of the effect of uncertainties in the material behavior was also performed. Creep failure resistance of the thin-walled heater head could show variation due to small deviations in the manufactured thickness and in uncertainties in operating temperature and pressure. Durability prediction and reliability of the heater head are affected by these deviations from nominal design conditions. Therefore, it is important to include the effects of these uncertainties in predicting the probability of survival of the heater head under mission loads. Furthermore, it may be possible for the heater head to experience rare incidences of small temperature excursions of short duration. These rare incidences would affect the creep strain rate and, therefore, the life. This paper addresses the effects of such rare incidences on the reliability. In addition, the sensitivities of variables affecting the reliability are quantified, and guidelines developed to improve the reliability are outlined. Heater head reliability is being quantified with data from NASA Glenn Research Center's accelerated benchmark testing program.
The reliability and validity of the Maryland Assessment of Recovery in Serious Mental Illness Scale.
Drapalski, Amy L; Medoff, Deborah; Dixon, Lisa; Bellack, Alan
2016-05-30
The current study aims to further evaluate the psychometric properties of the Maryland Assessment of Recovery in Serious Mental Illness (MARS), a relatively new instrument designed to assess personal recovery status in individuals with serious mental illness. Two hundred and fifty individuals with serious mental illness receiving outpatient mental health treatment completed a baseline assessment which included the MARS and measures to assess recovery-related constructs, clinical outcomes, and social and community functioning. The MARS demonstrated excellent internal consistency and test-retest reliability. Good construct validity was evidenced by strong positive relationships between the MARS and recovery-related constructs (e.g. hope, empowerment, self-efficacy, and personal agency) and a strong negative relationship with self-stigma. Divergent validity was demonstrated by weaker relationships with cognitive and social functioning. The confirmatory factor analysis did not confirm the unitary factor structure found in previous research. Given the equivocal result of the CFA, additional exploratory work is needed to determine if a more complex factor structure is present. This study provides addition support for the psychometric soundness of the MARS and subsequently, its potential use as a measure of personal recovery status in people with serious mental illness. Published by Elsevier Ireland Ltd.
Assessing a Norwegian translation of the Organizational Climate Measure.
Bernstrøm, Vilde Hoff; Lone, Jon Anders; Bjørkli, Cato A; Ulleberg, Pål; Hoff, Thomas
2013-04-01
This study investigated the Norwegian translation of the Organizational Climate Measure developed by Patterson and colleagues. The Organizational Climate Measure is a global measure of organizational climate based on Quinn and Rohrbaugh's competing values model. The survey was administered to a Norwegian branch of an international service sector company (N = 555). The results revealed satisfactory internal reliability and interrater agreement for the 17 scales, and confirmatory factor analysis supported the original factor structure. The findings gave preliminary support for the Organizational Climate Measure as a reliable measure with a stable factor structure, and indicated that it is potentially useful in the Norwegian context.
Universal first-order reliability concept applied to semistatic structures
NASA Technical Reports Server (NTRS)
Verderaime, V.
1994-01-01
A reliability design concept was developed for semistatic structures which combines the prevailing deterministic method with the first-order reliability method. The proposed method surmounts deterministic deficiencies in providing uniformly reliable structures and improved safety audits. It supports risk analyses and reliability selection criterion. The method provides a reliability design factor derived from the reliability criterion which is analogous to the current safety factor for sizing structures and verifying reliability response. The universal first-order reliability method should also be applicable for air and surface vehicles semistatic structures.
Universal first-order reliability concept applied to semistatic structures
NASA Astrophysics Data System (ADS)
Verderaime, V.
1994-07-01
A reliability design concept was developed for semistatic structures which combines the prevailing deterministic method with the first-order reliability method. The proposed method surmounts deterministic deficiencies in providing uniformly reliable structures and improved safety audits. It supports risk analyses and reliability selection criterion. The method provides a reliability design factor derived from the reliability criterion which is analogous to the current safety factor for sizing structures and verifying reliability response. The universal first-order reliability method should also be applicable for air and surface vehicles semistatic structures.
Chang, Olivia H; King, Louise P; Modest, Anna M; Hur, Hye-Chun
2016-01-01
To develop a teaching and assessment tool for laparoscopic suturing and intracorporeal knot tying. We designed an Objective Structured Assessment of Technical Skills (OSATS) tool that includes a procedure-specific checklist (PSC) and global rating scale (GRS) to assess laparoscopic suturing and intracorporeal knot-tying performance. Obstetrics and Gynecology residents at our institution were videotaped while performing a laparoscopic suturing and intracorporeal knot-tying task at a surgical simulation workshop. A total of 2 expert reviewers assessed resident performance using the OSATS tool during live performance and 1 month later using the videotaped recordings. OSATS scores were analyzed using the Wilcoxon rank-sum test. Data are presented as median scores (interquartile range [IQR]). Intrarater and interrater reliabilities were assessed using a Spearman correlation and are presented as an r correlation coefficient and p value. An r ≥ 0.8 was considered as a high correlation. After testing, we received feedback from residents and faculty to improve the OSATS tool as part of an iterative design process. In all, 14 of 21 residents (66.7%) completed the study, with 9 junior residents and 5 senior residents. Junior residents had a lower score on the PSC than senior residents did; however, this was not statistically significant (median = 6.0 [IQR: 4.0-10.0] and median = 13.0 [IQR: 10.0-13.0]; p = 0.09). There was excellent intrarater reliability with our OSATS tool (for PSC component, r = 0.88 for Rater 1 and 0.93 for Rater 2, both p < 0.0001; for GRS component, r = 0.85 for Rater 1 and 0.88 for Rater 2, both p ≤ 0.0002). The PSC also has high interrater reliability during live evaluation (r = 0.92; p < 0.0001), and during the videotape scoring with r = 0.77 (p = 0.001). Our OSATS tool may be a useful assessment and teaching tool for laparoscopic suturing and intracorporeal knot-tying skills. Overall, good intrarater reliability was demonstrated, suggesting that this tool may be useful for longitudinal assessment of surgical skills. Copyright © 2015 Association of Program Directors in Surgery. Published by Elsevier Inc. All rights reserved.
Structural composite panel performance under long-term load
Theodore L. Laufenberg
1988-01-01
Information on the performance of wood-based structural composite panels under long-term load is currently needed to permit their use in engineered assemblies and systems. A broad assessment of the time-dependent properties of panels is critical for creating databases and models of the creep-rupture phenomenon that lead to reliability-based design procedures. This...
Hoben, Matthias; Estabrooks, Carole A.; Squires, Janet E.; Behrens, Johann
2016-01-01
We translated the Canadian residential long term care versions of the Alberta Context Tool (ACT) and the Conceptual Research Utilization (CRU) Scale into German, to study the association between organizational context factors and research utilization in German nursing homes. The rigorous translation process was based on best practice guidelines for tool translation, and we previously published methods and results of this process in two papers. Both instruments are self-report questionnaires used with care providers working in nursing homes. The aim of this study was to assess the factor structure, reliability, and measurement invariance (MI) between care provider groups responding to these instruments. In a stratified random sample of 38 nursing homes in one German region (Metropolregion Rhein-Neckar), we collected questionnaires from 273 care aides, 196 regulated nurses, 152 allied health providers, 6 quality improvement specialists, 129 clinical leaders, and 65 nursing students. The factor structure was assessed using confirmatory factor models. The first model included all 10 ACT concepts. We also decided a priori to run two separate models for the scale-based and the count-based ACT concepts as suggested by the instrument developers. The fourth model included the five CRU Scale items. Reliability scores were calculated based on the parameters of the best-fitting factor models. Multiple-group confirmatory factor models were used to assess MI between provider groups. Rather than the hypothesized ten-factor structure of the ACT, confirmatory factor models suggested 13 factors. The one-factor solution of the CRU Scale was confirmed. The reliability was acceptable (>0.7 in the entire sample and in all provider groups) for 10 of 13 ACT concepts, and high (0.90–0.96) for the CRU Scale. We could demonstrate partial strong MI for both ACT models and partial strict MI for the CRU Scale. Our results suggest that the scores of the German ACT and the CRU Scale for nursing homes are acceptably reliable and valid. However, as the ACT lacked strict MI, observed variables (or scale scores based on them) cannot be compared between provider groups. Rather, group comparisons should be based on latent variable models, which consider the different residual variances of each group. PMID:27656156
Tam, Wilson; Keung, Vera; Lee, Albert; Lo, Kenneth; Cheung, Calvin
2014-11-21
Childhood obesity is a major public health issue in many countries, including China. The importance of parenting relative to the healthy development of children requires the development of instruments for assessing parental influence on child dietary pattern. This study aimed to confirm the internal reliability and validity of a self-report measure on parental feeding styles, including emotional feeding, instrumental feeding, prompting or encouragement to eat, and control over eating. A 27-item parental feeding style questionnaire (PFSQ) was translated into Chinese and then translated back into English to verify consistency. The questionnaire was then used to conduct a cross-sectional survey on the parents of Hong Kong preschoolers. The internal reliability and validity of the questionnaire were examined by Cronbach's alpha and exploratory factor analysis, respectively. 4,553 completed questionnaires were received. Cronbach's alpha of subscales ranged from 0.63 to 0.81, and the overall reliability was good (alpha = 0.75). The factor structure of this questionnaire was similar to that of the original and Turkish versions. One-factor structure was identified for emotional feeding, instrumental feeding (four items), and prompting or encouragement to eat, whereas a two-factor structure was revealed for control over eating. The Chinese version of the PFSQ has good reliability and validity in assessing parental feeding styles in Hong Kong. Researchers can use this instrument to improve their understanding on how parental feeding styles may affect the dietary patterns and ultimately the weight statuses of children among Chinese-speaking populations across different countries.
Quantification of uncertainties in the performance of smart composite structures
NASA Technical Reports Server (NTRS)
Shiao, Michael C.; Chamis, Christos C.
1993-01-01
A composite wing with spars, bulkheads, and built-in control devices is evaluated using a method for the probabilistic assessment of smart composite structures. Structural responses (such as change in angle of attack, vertical displacements, and stresses in regular plies with traditional materials and in control plies with mixed traditional and actuation materials) are probabilistically assessed to quantify their respective scatter. Probabilistic sensitivity factors are computed to identify those parameters that have a significant influence on a specific structural response. Results show that the uncertainties in the responses of smart composite structures can be quantified. Responses such as structural deformation, ply stresses, frequencies, and buckling loads in the presence of defects can be reliably controlled to satisfy specified design requirements.
Aeropropulsion 1979. [conferences
NASA Technical Reports Server (NTRS)
1979-01-01
State of the art technology in aeronautical propulsion is assessed. Noise and air pollution control techniques, advances in supersonic propulsion for transport aircraft, and composite materials and structures for reliable engine components are covered along with engine design for improved fuel consumption.
Extending the validity of the Feeding Practices and Structure Questionnaire.
Jansen, Elena; Mallan, Kimberley M; Daniels, Lynne A
2015-06-30
Feeding practices are commonly examined as potentially modifiable determinants of children's eating behaviours and weight status. Although a variety of questionnaires exist to assess different feeding aspects, many lack thorough reliability and validity testing. The Feeding Practices and Structure Questionnaire (FPSQ) is a tool designed to measure early feeding practices related to non-responsive feeding and structure of the meal environment. Face validity, factorial validity, internal reliability and cross-sectional correlations with children's eating behaviours have been established in mothers with 2-year-old children. The aim of the present study was to further extend the validity of the FPSQ by examining factorial, construct and predictive validity, and stability. Participants were from the NOURISH randomised controlled trial which evaluated an intervention with first-time mothers designed to promote protective feeding practices. Maternal feeding practices (FP) and child eating behaviours were assessed when children were aged 2 years and 3.7 years (n = 388). Confirmatory Factor analysis, group differences, predictive relationships, and stability were tested. The original 9-factor structure was confirmed when children were aged 3.7 ± 0.3 years. Cronbach's alpha was above the recommended 0.70 cut-off for all factors except Structured Meal Timing, Over Restriction and Distrust in Appetite which were 0.58, 0.67 and 0.66 respectively. Allocated group differences reflected behaviour consistent with intervention content and all feeding practices were stable across both time points (range of r = 0.45-0.70). There was some evidence for the predictive validity of factors with 2 FP showing expected relationships, 2 FP showing expected and unexpected relationships and 5 FP showing no relationship. Reliability and validity was demonstrated for most subscales of the FPSQ. Future validation is warranted with culturally diverse samples and with fathers and other caregivers. The use of additional outcomes to further explore predictive validity is recommended as well as testing test-retest reliability of the questionnaire.
Goossens, Joline; Verhaeghe, Sofie; Van Hecke, Ann; Barrett, Geraldine; Delbaere, Ilse; Beeckman, Dimitri
2018-01-01
To evaluate the psychometric properties of the Dutch version of the London Measure of Unplanned Pregnancy in women with pregnancies ending in birth. A two-phase psychometric evaluation design was set-up. Phase I comprised the translation from English into Dutch and pretesting with 6 women using cognitive interviews. In phase II, the reliability and validity of the Dutch version of the LMUP was assessed in 517 women giving birth recently. Reliability (internal consistency) was assessed using Cronbach's alpha, inter-item correlations, and corrected item-total correlations. Construct validity was assessed using principal components analysis and hypothesis testing. Exploratory Mokken scale analysis was carried out. 517 women aged 15-45 completed the Dutch version of the LMUP. Reliability testing showed acceptable internal consistency (alpha = 0.74, positive inter-item correlations between all items, all corrected item-total correlations >0.20). Validity testing confirmed the unidimensional structure of the scale and all hypotheses were confirmed. The overall Loevinger's H coefficient was 0.57, representing a 'strong' scale. The Dutch version of the LMUP is a reliable and valid measure that can be used in the Dutch-speaking population in Belgium to assess pregnancy planning. Future research is necessary to assess the stability of the Dutch version of the LMUP, and to evaluate its psychometric properties in women with abortions.
Brunton, Laura K; Bartlett, Doreen J
2017-07-01
The Fatigue Impact and Severity Self-Assessment (FISSA) was created to assess the impact, severity, and self-management of fatigue for individuals with cerebral palsy (CP) aged 14-31 years. Items were generated from a review of measures and interviews with individuals with CP. Focus groups with health-care professionals were used for item reduction. A mailed survey was conducted (n=163/367) to assess the factor structure, known-groups validity, and test-retest reliability. The final measure contained 31 items in two factors and discriminated between individuals expected to have different levels of fatigue. Individuals with more functional abilities reported less fatigue (p < 0.002) and those with higher pain reported higher fatigue (p < 0.001). The FISSA was shown to have adequate test-retest reliability, intraclass correlation coefficient (ICC)(3,1)=0.74 (95% confidence interval [CI] 0.53-0.87). The FISSA valid and reliable for individuals with CP. It allows for identification of the activities that may be compromised by fatigue to enhance collaborative goal setting and intervention planning.
Amaral, Anna Beatriz C N; Rider, Elizabeth A; Lajolo, Paula P; Tone, Luiz G; Pinto, Rogerio M C; Lajolo, Marisa P; Calhoun, Aaron W
2016-12-11
The goal of this study was to translate, adapt and validate the items of the Gap-Kalamazoo Communication Skills Assessment Form for use in the Brazilian cultural setting. The Gap-Kalamazoo Communication Skills Assessment Form was translated into Portuguese by two independent bilingual Brazilian translators and was reconciled by a third bilingual healthcare professional. The translated text was then assessed for content using a modified Delphi technique and adjusted as needed to assure content validity. A total of nine phrases in the completed tool were adjusted. The final tool was then used to assess videotaped simulations as a means of validation. Response process was assessed using exploratory factor analysis and internal structure was assessed via Cronbach's Alpha (internal consistency) and Intraclass Correlation (test-retest reliability and inter-rater reliability). One hundred and four (104) videotaped communication skills simulations were assessed by 38 subjects (6 staff physicians, 4 faculty physicians, 8 resident physicians, 4 professional actors with experience in simulation, and 16 other allied healthcare professionals). Measures of Internal consistency (Cronbach's alpha = 0.818) and test-retest reliability (intra-class correlation coefficient = 0.942) were high. Exploratory factor analysis confirmed the uni-dimensionality of the instrument. Our results support the validity and reliability of the Brazilian Gap-Kalamazoo Communication Skills Assessment Form when used among Brazilian medical residents. The Brazilian version of Gap-Kalamazoo Communication Skills Assessment Form was found to be adequate both in the linguistic and technical aspects. The use of this instrument in Brazilian medical education can enhance the assessment of physician-patient-team relationships on an ongoing basis.
Beehler, Sarah; Ahern, Jennifer; Balmer, Brandi; Kuhlman, Jennifer
2017-01-01
This pilot study evaluated the validity and reliability of an Experience of Neighborhood (EON) measure developed to assess neighborhood characteristics that shape reintegration opportunities for returning service members and their families. A total of 91 post-9/11 veterans and spouses completed a survey administered at the Minnesota State Fair. Participants self-reported on their reintegration status (veterans), social functioning (spouses), social support, and mental health. EON factor structure, internal consistency reliability, and validity (discriminant, content, criterion) were analyzed. The EON measure showed adequate reliability, discriminant validity, and content validity. More work is needed to assess criterion validity because EON scores were not correlated with scores on a Census-based index used to measure quality of military neighborhoods. The EON may be useful in assessing broad local factors influencing health among returning veterans and spouses. More research is needed to understand geographic variation in neighborhood conditions and how those affect reintegration and mental health for military families.
Nunes, Andreia; Limpo, Teresa; Lima, César F.; Castro, São Luís
2018-01-01
The importance of quickly assessing personality traits in many studies prompted the development of brief scales such as the Ten-Item Personality Inventory (TIPI), a measure of five personality traits (extraversion, agreeableness, conscientiousness, emotional stability, and openness). In the current study, we present the Portuguese version of TIPI and examine its psychometric properties, based on a sample of 333 Portuguese adults aged 18 to 65 years. The results revealed reliability coefficients similar to the original version (α = 0.39–0.72), very good 4-week test–retest reliability (n = 81, rs > 0.71), expected factorial structure, high convergent validity with the Big-Five Inventory (rs > 0.60), and correlations with self-esteem, affect, and aggressiveness similar to those found with standard measures of personality traits. Overall, our findings suggest that the Portuguese TIPI is a reliable and valid alternative to longer measures: it offers a promising tool for research contexts in which the available time for personality assessment is highly limited. PMID:29674989
Nunes, Andreia; Limpo, Teresa; Lima, César F; Castro, São Luís
2018-01-01
The importance of quickly assessing personality traits in many studies prompted the development of brief scales such as the Ten-Item Personality Inventory (TIPI), a measure of five personality traits (extraversion, agreeableness, conscientiousness, emotional stability, and openness). In the current study, we present the Portuguese version of TIPI and examine its psychometric properties, based on a sample of 333 Portuguese adults aged 18 to 65 years. The results revealed reliability coefficients similar to the original version (α = 0.39-0.72), very good 4-week test-retest reliability ( n = 81, r s > 0.71), expected factorial structure, high convergent validity with the Big-Five Inventory ( r s > 0.60), and correlations with self-esteem, affect, and aggressiveness similar to those found with standard measures of personality traits. Overall, our findings suggest that the Portuguese TIPI is a reliable and valid alternative to longer measures: it offers a promising tool for research contexts in which the available time for personality assessment is highly limited.
Beehler, Sarah; Ahern, Jennifer; Balmer, Brandi; Kuhlman, Jennifer
2017-01-01
This pilot study evaluated the validity and reliability of an Experience of Neighborhood (EON) measure developed to assess neighborhood characteristics that shape reintegration opportunities for returning service members and their families. A total of 91 post-9/11 veterans and spouses completed a survey administered at the Minnesota State Fair. Participants self-reported on their reintegration status (veterans), social functioning (spouses), social support, and mental health. EON factor structure, internal consistency reliability, and validity (discriminant, content, criterion) were analyzed. The EON measure showed adequate reliability, discriminant validity, and content validity. More work is needed to assess criterion validity because EON scores were not correlated with scores on a Census-based index used to measure quality of military neighborhoods. The EON may be useful in assessing broad local factors influencing health among returning veterans and spouses. More research is needed to understand geographic variation in neighborhood conditions and how those affect reintegration and mental health for military families. PMID:28936370
The Reliability and Predictive Validity of the Stalking Risk Profile.
McEwan, Troy E; Shea, Daniel E; Daffern, Michael; MacKenzie, Rachel D; Ogloff, James R P; Mullen, Paul E
2018-03-01
This study assessed the reliability and validity of the Stalking Risk Profile (SRP), a structured measure for assessing stalking risks. The SRP was administered at the point of assessment or retrospectively from file review for 241 adult stalkers (91% male) referred to a community-based forensic mental health service. Interrater reliability was high for stalker type, and moderate-to-substantial for risk judgments and domain scores. Evidence for predictive validity and discrimination between stalking recidivists and nonrecidivists for risk judgments depended on follow-up duration. Discrimination was moderate (area under the curve = 0.66-0.68) and positive and negative predictive values good over the full follow-up period ( Mdn = 170.43 weeks). At 6 months, discrimination was better than chance only for judgments related to stalking of new victims (area under the curve = 0.75); however, high-risk stalkers still reoffended against their original victim(s) 2 to 4 times as often as low-risk stalkers. Implications for the clinical utility and refinement of the SRP are discussed.
Development and validation of a Malawian version of the primary care assessment tool.
Dullie, Luckson; Meland, Eivind; Hetlevik, Øystein; Mildestvedt, Thomas; Gjesdal, Sturla
2018-05-16
Malawi does not have validated tools for assessing primary care performance from patients' experience. The aim of this study was to develop a Malawian version of Primary Care Assessment Tool (PCAT-Mw) and to evaluate its reliability and validity in the assessment of the core primary care dimensions from adult patients' perspective in Malawi. A team of experts assessed the South African version of the primary care assessment tool (ZA-PCAT) for face and content validity. The adapted questionnaire underwent forward and backward translation and a pilot study. The tool was then used in an interviewer administered cross-sectional survey in Neno district, Malawi, to test validity and reliability. Exploratory factor analysis was performed on a random half of the sample to evaluate internal consistency, reliability and construct validity of items and scales. The identified constructs were then tested with confirmatory factor analysis. Likert scale assumption testing and descriptive statistics were done on the final factor structure. The PCAT-Mw was further tested for intra-rater and inter-rater reliability. From the responses of 631 patients, a 29-item PCAT-Mw was constructed comprising seven multi-item scales, representing five primary care dimensions (first contact, continuity, comprehensiveness, coordination and community orientation). All the seven scales achieved good internal consistency, item-total correlations and construct validity. Cronbach's alpha coefficient ranged from 0.66 to 0.91. A satisfactory goodness of fit model was achieved (GFI = 0.90, CFI = 0.91, RMSEA = 0.05, PCLOSE = 0.65). The full range of possible scores was observed for all scales. Scaling assumptions tests were achieved for all except the two comprehensiveness scales. Intra-class correlation coefficient (ICC) was 0.90 (n = 44, 95% CI 0.81-0.94, p < 0.001) for intra-rater reliability and 0.84 (n = 42, 95% CI 0.71-0.96, p < 0.001) for inter-rater reliability. Comprehensive metric analyses supported the reliability and validity of PCAT-Mw in assessing the core concepts of primary care from adult patients' experience. This tool could be used for health service research in primary care in Malawi.
ERIC Educational Resources Information Center
Gerber, F.; Carminati, G. Galli
2013-01-01
Background: The lack of psychometric measures of psychopathology especially in intellectual disabilities (ID) population was addressed by creation of the Psychiatric Assessment Schedule for Adult with Developmental Disability (PAS-ADD-10) in Moss et?al. This schedule is a structured interview designed for professionals in psychopathology. The…
ERIC Educational Resources Information Center
Gensthaler, A.; Mohler, E.; Resch, F.; Paulus, F.; Schwenck, C.; Freitag, C. M.; Goth, K.
2013-01-01
A behaviorally inhibited temperament in early childhood has been identified as a potential risk factor for anxiety disorders in children and adolescents. The purpose of our investigation was the development and evaluation of the factor structure, reliability and validity of the first retrospective parent report measure to assess behavioral…
ERIC Educational Resources Information Center
Rikoon, Samuel H.; Liebtag, Travis; Olivera-Aguilar, Margarita; Steinberg, Jonathan; Robbins, Steven B.
2015-01-01
In this report, we describe the development of an extension of the "SuccessNavigator"® assessment for late high school settings. We discuss the assessment's conceptualization and support its application with psychometric studies detailing scale development in terms of structural analyses, reliability, and several other aspects of…
COCOA: A New Validated Instrument to Assess Medical Students' Attitudes towards Older Adults
ERIC Educational Resources Information Center
Hollar, David; Roberts, Ellen; Busby-Whitehead, Jan
2011-01-01
This study tested the reliability and validity of the Carolina Opinions on Care of Older Adults (COCOA) survey compared with the Geriatric Assessment Survey (GAS). Participants were first year medical students (n = 160). A Linear Structural Relations (LISREL) measurement model for COCOA had a moderately strong fit that was significantly better…
Detrital carbon pools in temperate forests: magnitude and potential for landscape-scale assessment
John B. Bradford; Peter Weishampel; Marie-Louise Smith; Randall Kolka; Richard A. Birdsey; Scott V. Ollinger; Michael G. Ryan
2009-01-01
Reliably estimating carbon storage and cycling in detrital biomass is an obstacle to carbon accounting. We examined carbon pools and fluxes in three small temperate forest landscapes to assess the magnitude of carbon stored in detrital biomass and determine whether detrital carbon storage is related to stand structural properties (leaf area, aboveground biomass,...
Lai, Claudia K Y
2014-01-01
The Neuropsychiatric Inventory (NPI) is one of the most commonly used assessment scales for assessing symptoms in people with dementia and other neurological disorders. This paper analyzes its conceptual framework, measurement mode, psychometric properties, and merits and problems. All articles discussing the psychometric properties and factor structure of the NPI were searched for in Medline via Ovid. The abstracts of these papers were read to determine their relevance to the purpose of this paper. If deemed appropriate, a full paper was then obtained and read. The NPI has reasonably good content validity and internal consistency, and good test-retest and interrater reliability. There is limited information about its sensitivity, specificity, positive and negative predictive values, and, in particular, responsiveness. Merits of the NPI include being comprehensive, avoiding symptom overlap, ease of use, and flexibility. It has problems in scoring (no multiples of 5, 7, and 11) and, therefore, analysis using parametric tests may not be appropriate. The use of individual subscales also warrants further investigation. In terms of its content and concurrent validity, intra- and interrater reliability, test-retest reliability, and internal consistency, the NPI can be considered as valid and reliable, and can be used across different ethnic groups. The tool is most likely unable to deliver as good a performance in terms of discriminating between different disorders. More studies are required to further evaluate its psychometric properties, particularly in the areas of factor structure and responsiveness. The clinical utility of the NPI also needs to be further explored.
Wang, Chang-Hwai; Lee, Jin-Chuan; Yuan, Yu-Hsi
2014-01-01
The purpose of this research is to establish and verify the psychometric and structural properties of the self-report Chinese Sexual Assault Symptom Scale (C-SASS) to assess the trauma experienced by Chinese victims of sexual assault. An earlier version of the C-SASS was constructed using a modified list of the same trauma symptoms administered to an American sample and used to develop and validate the Sexual Assault Symptom Scale II (SASS II). The rationale of this study is to revise the earlier version of the C-SASS, using a larger and more representative sample and more robust statistical analysis than in earlier research, to permit a more thorough examination of the instrument and further confirm the dimensions of sexual assault trauma in Chinese victims of rape. In this study, a sample of 418 victims from northern Taiwan was collected to confirm the reliability and validity of the C-SASS. Exploratory factor analysis yielded five common factors: Safety Fears, Self-Blame, Health Fears, Anger and Emotional Lability, and Fears About the Criminal Justice System. Further tests of the validity and composite reliability of the C-SASS were provided by the structural equation modeling (SEM). The results indicated that the C-SASS was a brief, valid, and reliable instrument for assessing sexual assault trauma among Chinese victims in Taiwan. The scale can be used to evaluate victims in sexual assault treatment centers around Taiwan, as well as to capture the characteristics of sexual assault trauma among Chinese victims.
Heeren, Alexandre; Ceschi, Grazia; Valentiner, David P; Dethier, Vincent; Philippot, Pierre
2013-01-01
The main aim of this study was to assess the reliability and structural validity of the French version of the 12-item version of the Personal Report of Confidence as Speaker (PRCS), one of the most promising measurements of public speaking fear. A total of 611 French-speaking volunteers were administered the French versions of the short PRCS, the Liebowitz Social Anxiety Scale, the Fear of Negative Evaluation scale, as well as the Trait version of the Spielberger State-Trait Anxiety Inventory and the Beck Depression Inventory-II, which assess the level of anxious and depressive symptoms, respectively. Regarding its structural validity, confirmatory factor analyses indicated a single-factor solution, as implied by the original version. Good scale reliability (Cronbach's alpha = 0.86) was observed. The item discrimination analysis suggested that all the items contribute to the overall scale score reliability. The French version of the short PRCS showed significant correlations with the Liebowitz Social Anxiety Scale (r = 0.522), the Fear of Negative Evaluation scale (r = 0.414), the Spielberger State-Trait Anxiety Inventory (r = 0.516), and the Beck Depression Inventory-II (r = 0.361). The French version of the short PRCS is a reliable and valid measure for the evaluation of the fear of public speaking among a French-speaking sample. These findings have critical consequences for the measurement of psychological and pharmacological treatment effectiveness in public speaking fear among a French-speaking sample.
Heeren, Alexandre; Ceschi, Grazia; Valentiner, David P; Dethier, Vincent; Philippot, Pierre
2013-01-01
Background: The main aim of this study was to assess the reliability and structural validity of the French version of the 12-item version of the Personal Report of Confidence as Speaker (PRCS), one of the most promising measurements of public speaking fear. Methods: A total of 611 French-speaking volunteers were administered the French versions of the short PRCS, the Liebowitz Social Anxiety Scale, the Fear of Negative Evaluation scale, as well as the Trait version of the Spielberger State-Trait Anxiety Inventory and the Beck Depression Inventory-II, which assess the level of anxious and depressive symptoms, respectively. Results: Regarding its structural validity, confirmatory factor analyses indicated a single-factor solution, as implied by the original version. Good scale reliability (Cronbach’s alpha = 0.86) was observed. The item discrimination analysis suggested that all the items contribute to the overall scale score reliability. The French version of the short PRCS showed significant correlations with the Liebowitz Social Anxiety Scale (r = 0.522), the Fear of Negative Evaluation scale (r = 0.414), the Spielberger State-Trait Anxiety Inventory (r = 0.516), and the Beck Depression Inventory-II (r = 0.361). Conclusion: The French version of the short PRCS is a reliable and valid measure for the evaluation of the fear of public speaking among a French-speaking sample. These findings have critical consequences for the measurement of psychological and pharmacological treatment effectiveness in public speaking fear among a French-speaking sample. PMID:23662060
Morphology delimits more species than molecular genetic clusters of invasive Pilosella.
Moffat, Chandra E; Ensing, David J; Gaskin, John F; De Clerck-Floate, Rosemarie A; Pither, Jason
2015-07-01
• Accurate assessments of biodiversity are paramount for understanding ecosystem processes and adaptation to change. Invasive species often contribute substantially to local biodiversity; correctly identifying and distinguishing invaders is thus necessary to assess their potential impacts. We compared the reliability of morphology and molecular sequences to discriminate six putative species of invasive Pilosella hawkweeds (syn. Hieracium, Asteraceae), known for unreliable identifications and historical introgression. We asked (1) which morphological traits dependably discriminate putative species, (2) if genetic clusters supported morphological species, and (3) if novel hybridizations occur in the invaded range.• We assessed 33 morphometric characters for their discriminatory power using the randomForest classifier and, using AFLPs, evaluated genetic clustering with the program structure and subsequently with an AMOVA. The strength of the association between morphological and genotypic dissimilarity was assessed with a Mantel test.• Morphometric analyses delimited six species while genetic analyses defined only four clusters. Specifically, we found (1) eight morphological traits could reliably distinguish species, (2) structure suggested strong genetic differentiation but for only four putative species clusters, and (3) genetic data suggest both novel hybridizations and multiple introductions have occurred.• (1) Traditional floristic techniques may resolve more species than molecular analyses in taxonomic groups subject to introgression. (2) Even within complexes of closely related species, relatively few but highly discerning morphological characters can reliably discriminate species. (3) By clarifying patterns of morphological and genotypic variation of invasive Pilosella, we lay foundations for further ecological study and mitigation. © 2015 Botanical Society of America, Inc.
Questionnaire to assess patient satisfaction with pharmaceutical care in Spanish language.
Traverso, María Luz; Salamano, Mercedes; Botta, Carina; Colautti, Marisel; Palchik, Valeria; Pérez, Beatriz
2007-08-01
To develop and validate a questionnaire, in Spanish, for assessing patient satisfaction with pharmaceutical care received in community pharmacies. Selection and translation of questionnaire's items; definition of response scale and demographic questions. Evaluation of face and content validity, feasibility, factor structure, reliability and construct validity. Forty-one community pharmacies of the province of Santa Fe. Argentina. Questionnaire administered to patients receiving pharmaceutical care or traditional pharmacy services. Pilot test to assess feasibility. Factor analysis used principal components and varimax rotation. Reliability established using internal consistency with Cronbach's alpha. Construct validity determined with extreme group method. A self-administered questionnaire with 27 items, 5-point Likert response scale and demographic questions was designed considering multidimensional structure of patient satisfaction. Questionnaire evaluates cumulative experience of patients with comprehensive pharmaceutical care practice in community pharmacies. Two hundred and seventy-four complete questionnaires were obtained. Factor analysis resulted in three factors: Managing therapy, Interpersonal relationship and General satisfaction, with a cumulative variance of 62.51%. Cronbach's alpha for the whole questionnaire was 0.96, and 0.95, 0.88 and 0.76 for the three factors, respectively. Mann-Whitney test for construct validity did not showed significant differences between pharmacies that provide pharmaceutical care and those that do not, however, 23 items showed significant differences between the two groups of pharmacies. The questionnaire developed can be a reliable and valid instrument to assess patient satisfaction with pharmaceutical care in community pharmacies in Spanish. Further research is needed to deepen the validation process.
Confirmatory factor analysis of the Chinese Breast Cancer Screening Beliefs Questionnaire.
Kwok, Cannas; Fethney, Judith; White, Kate
2012-01-01
Chinese women have been consistently reported as having low breast cancer screening practices. The Chinese Breast Cancer Screening Beliefs Questionnaire (CBCSB) was designed to assess Chinese Australian women's beliefs, knowledge, and attitudes toward breast cancer and screening practices. The objectives of the study were to confirm the factor structure of the CBCSB with a new, larger sample of immigrant Chinese Australian women and to report its clinical validity. A convenience sample of 785 Chinese Australian women was recruited from Chinese community organizations and shopping malls. Cronbach α was used to assess internal consistency reliability, and Amos v18 was used for confirmatory factor analysis. Clinical validity was assessed through linear regression using SPSS v18. The 3-factor structure of the CBCSB was confirmed, although the model required respecification to arrive at a suitable model fit as measured by the goodness-of-fit index (0.98), adjusted goodness-of-fit index (0.97), normed fit index (0.95), and root mean square error of approximation (0.031). Internal consistency reliability coefficients were satisfactory (>.6). Women who engaged in all 3 types of screening had more proactive attitudes to health checkups and perceived less barriers to mammographic screening. The CBCSB is a valid and reliable tool for assessing Chinese women's beliefs, knowledge, and attitudes about breast cancer and breast cancer screening practices. The CBCSB can be used for providing practicing nurses with insights into the provision of culturally sensitive breast health education.
Brennan, Peter A; Croke, David T; Reed, Malcolm; Smith, Lee; Munro, Euan; Foulkes, John; Arnett, Richard
2016-01-01
Objective structured clinical examinations (OSCE) are widely used for summative assessment in surgery. Despite standardizing these as much as possible, variation, including examiner scoring, can occur which may affect reliability. In study of a high-stakes UK postgraduate surgical OSCE, we investigated whether examiners changing stations once during a long examining day affected marking, reliability, and overall candidates' scores compared with examiners who examined the same scenario all day. An observational study of 18,262 examiner-candidate interactions from the UK Membership of the Royal College of Surgeons examination was carried at 3 Surgical Colleges across the United Kingdom. Scores between examiners were compared using analysis of variance. Examination reliability was assessed with Cronbach's alpha, and the comparative distribution of total candidates' scores for each day was evaluated using t-tests of unit-weighted z scores. A significant difference was found in absolute scores differences awarded in the morning and afternoon sessions between examiners who changed stations at lunchtime and those who did not (p < 0.001). No significant differences were found for the main effects of either broad content area (p = 0.290) or station content area (p = 0.450). The reliability of each day was not affected by examiner switching (p = 0.280). Overall, no difference was found in z-score distribution of total candidate scores and categories of examiner switching. This large study has found that although the range of marks awarded varied when examiners change OSCE stations, examination reliability and the likely candidate outcome were not affected. These results may have implications for examination design and examiner experience in surgical OSCEs and beyond. Copyright © 2016 Association of Program Directors in Surgery. Published by Elsevier Inc. All rights reserved.
Tsuno, Kanami; Yoshimasu, Kouichi; Hayashi, Takashi; Tatsuta, Nozomi; Ito, Yuki; Kamijima, Michihiro; Nakai, Kunihiko
2018-01-01
Nowadays, attention deficit hyperactivity (ADH) problems are observed commonly among school-age children. However, questionnaires specific to ADH behaviors among preschool children are very few. The aim of this study was to investigate the reliability and validity of the 25-item Behavioral Check List (BCL), which was developed from interviews of parents with children who were diagnosed as having Attention-deficit/hyperactivity disorder (ADHD) and measures ADH behaviors in preschool age. We recruited 22 teachers from 10 nurseries/kindergartens in Miyagi Prefecture, Japan. A total of 138 preschool children were assessed using the BCL. To investigate inter-rater reliability, two teachers from each facility assess seven to twenty children in their class, and intraclass correlation coefficients (ICCs) were calculated. The teachers additionally answered questions in the 1/5-5 Caregiver-Teacher Report Form (C-TRF) to investigate the criterion validity of the BCL. To investigate structural validity, exploratory factor analysis with promax rotation and confirmatory factor analysis were performed. The internal consistency reliability of the BCL was good (α = 0.92) and correlation analyses also confirmed its excellent criterion validity. Although exploratory factor analysis for the BCL yielded a five-factor model that consisted of a factor structure different from that of the original one, the results were similar to the original six factors. The ICCs of the BCL were 0.38-0.99 and it was not high enough for inter-rater reliability in some facilities. However, there is a possibility to improve it by giving raters adequate explanations when using BCL. The present study showed acceptable levels of reliability and validity of the BCL among Japanese preschool children.
Validation of the Practice Environment Scale to the Brazilian culture.
Gasparino, Renata C; Guirardello, Edinêis de B
2017-07-01
To validate the Brazilian version of the Practice Environment Scale. The Practice Environment Scale is a tool that evaluates the presence of characteristics that are favourable for professional nursing practice because a better work environment contributes to positive results for patients, professionals and institutions. Methodological study including 209 nurses. Validity was assessed via a confirmatory factor analysis using structural equation modelling, in which the correlations between the instrument and the following variables were tested: burnout, job satisfaction, safety climate, perception of quality of care and intention to leave the job. Subgroups were compared and the reliability was assessed using Cronbach's alpha and the composite reliability. Factor analysis resulted in exclusion of seven items. Significant correlations were obtained between the subscales and all variables in the study. The reliability was considered acceptable. The Brazilian version of the Practice Environment Scale is a valid and reliable tool used to assess the characteristics that promote professional nursing practice. Use of this tool in Brazilian culture should allow managers to implement changes that contribute to the achievement of better results, in addition to identifying and comparing the environments of health institutions. © 2017 John Wiley & Sons Ltd.
Barnett, Lisa M; Ridgers, Nicola D; Zask, Avigdor; Salmon, Jo
2015-01-01
To determine reliability and face validity of an instrument to assess young children's perceived fundamental movement skill competence. Validation and reliability study. A pictorial instrument based on the Test Gross Motor Development-2 assessed perceived locomotor (six skills) and object control (six skills) competence using the format and item structure from the physical competence subscale of the Pictorial Scale of Perceived Competence and Acceptance for Young Children. Sample 1 completed object control items in May (n=32) and locomotor items in October 2012 (n=23) at two time points seven days apart. Children were asked at the end of the test-retest their understanding of what was happening in each picture to determine face validity. Sample 2 (n=58) completed 12 items in November 2012 on a single occasion to test internal reliability only. Sample 1 children were aged 5-7 years (M=6.0, SD=0.8) at object control assessment and 5-8 years at locomotor assessment (M=6.5, SD=0.9). Sample 2 children were aged 6-8 years (M=7.2, SD=0.73). Intra-class correlations assessed in Sample 1 children were excellent for object control (intra-class correlation=0.78), locomotor (intra-class correlation=0.82) and all 12 skills (intra-class correlations=0.83). Face validity was acceptable. Internal consistency was adequate in both samples for each subscale and all 12 skills (alpha range 0.60-0.81). This study has provided preliminary evidence for instrument reliability and face validity. This enables future alignment between the measurement of perceived and actual fundamental movement skill competence in young children. Crown Copyright © 2014. Published by Elsevier Ltd. All rights reserved.
Artemiou, E; Adams, C L; Hecker, K G; Vallevand, A; Violato, C; Coe, J B
2014-11-22
In human medicine, standardised patients (SP) have been shown to reliably and accurately assess learners' communication performance in high-stakes certification Objective Structured Clinical Examinations (OSCE), offering a feasible way to reduce the need for recruitment, time commitment and coordination of faculty assessors. In this study, we evaluated the use of standardised clients (SC) as a viable option for assessing veterinary students' communication performance. We designed a four-station, two-track communication skills OSCE. SC assessors used an adapted nine-item Liverpool Undergraduate Communication Assessment Scale (LUCAS). Faculty used a 21-item checklist derived from the Calgary-Cambridge Guide (CCG) and a five-point global rating scale. Participants were second year veterinary students (n=96). For the four stations, intrastation reliability (α) ranged from 0.63 to 0.82 for the LUCAS, and 0.73 to 0.87 for the CCG. The interstation reliability coefficients were 0.85 for the LUCAS and 0.89 for the CGG. The calculated Generalisability (G) coefficients were 0.62 for the LUCAS and 0.60 for the CGG. Supporting construct validity, SC and faculty assessors showed a significant correlation between the LUCAS and CCG total percent scores (r=0.45, P<0.001), and likewise between the LUCAS and global rating scores (r=0.49, P<0.001).Study results support that SC assessors offer a reliable and valid approach for assessing veterinary communication OSCE. British Veterinary Association.
Patient-perceived hospital service quality: an empirical assessment.
Pai, Yogesh P; Chary, Satyanarayana T; Pai, Rashmi Yogesh
2018-02-12
Purpose The purpose of this paper is to appraise Pai and Chary's (2016) conceptual framework for measuring patient-perceived hospital service quality (HSQ). Design/methodology/approach A structured questionnaire was used to obtain data from teaching, public and corporate hospital patients. Several tests were conducted to assess the instrument's reliability and validity. Pai and Chary's (2016) nine dimensions for measuring HSQ were examined in this paper. Findings The tests confirm that Pai and Chary's (2016) conceptual framework is reliable and valid. The study also establishes that the nine dimensions measure HSQ. Practical implications The framework empowers managers to assess service quality in any hospital settings, corporate, public and teaching, using an approach that is superior to the existing HSQ scales. Originality/value This paper helps researchers and practitioners to assess HSQ from patient perspectives in any hospital setting.
Petschonek, Sarah; Burlison, Jonathan; Cross, Carl; Martin, Kathy; Laver, Joseph; Landis, Ronald S; Hoffman, James M
2013-12-01
Given the growing support for establishing a just patient safety culture in health-care settings, a valid tool is needed to assess and improve just patient safety culture. The purpose of this study was to develop a measure of individual perceptions of just culture for a hospital setting. The 27-item survey was administered to 998 members of a health-care staff in a pediatric research hospital as part of the hospital's ongoing patient safety culture assessment process. Subscales included balancing a blame-free approach with accountability, feedback and communication, openness of communication, quality of the event reporting process, continuous improvement, and trust. The final sample of 404 participants (40% response rate) included nurses, physicians, pharmacists, and other hospital staff members involved in patient care. Confirmatory factor analysis was used to test the internal structure of the measure and reliability analyses were conducted on the subscales. Moderate support for the factor structure was established with confirmatory factor analysis. After modifications were made to improve statistical fit, the final version of the measure included 6 subscales loading onto one higher-order dimension. Additionally, Cronbach α reliability scores for the subscales were positive, with each dimension being above 0.7 with the exception of one. The instrument designed and tested in this study demonstrated adequate structure and reliability. Given the uniqueness of the current sample, further verification of the JCAT is needed from hospitals that serve broader populations. A validated tool could also be used to evaluate the relation between just culture and patient safety outcomes.
Petschonek, Sarah; Burlison, Jonathan; Cross, Carl; Martin, Kathy; Laver, Joseph; Landis, Ronald S.; Hoffman, James M.
2014-01-01
Objectives Given the growing support for establishing a just patient safety culture in healthcare settings, a valid tool is needed to assess and improve just patient safety culture. The purpose of this study was to develop a measure of individual perceptions of just culture for a hospital setting. Methods The 27 item survey was administered to 998 members of a healthcare staff in a pediatric research hospital as part of the hospital's ongoing patient safety culture assessment process. Subscales included balancing a blame-free approach with accountability, feedback and communication, openness of communication, quality of the event reporting process, continuous improvement, and trust. The final sample of 404 participants (40% response rate) included nurses, physicians, pharmacists and other hospital staff members involved in patient care. Confirmatory factor analysis was used to test the internal structure of the measure and reliability analyses were conducted on the subscales. Results Moderate support for the factor structure was established with confirmatory factor analysis. After modifications were made to improve statistical fit, the final version of the measure included six subscales loading onto one higher-order dimension. Additionally, Cronbach's alpha reliability scores for the subscales were positive, with each dimension being above 0.7 with the exception of one. Conclusions The instrument designed and tested in this study demonstrated adequate structure and reliability. Given the uniqueness of the current sample, further verification of the JCAT is needed from hospitals that serve broader populations. A validated tool could also be used to evaluate the relation between just culture and patient safety outcomes. PMID:24263549
Dima, Alexandra Lelia; Schulz, Peter Johannes
2017-01-01
Background The eHealth Literacy Scale (eHEALS) is a tool to assess consumers’ comfort and skills in using information technologies for health. Although evidence exists of reliability and construct validity of the scale, less agreement exists on structural validity. Objective The aim of this study was to validate the Italian version of the eHealth Literacy Scale (I-eHEALS) in a community sample with a focus on its structural validity, by applying psychometric techniques that account for item difficulty. Methods Two Web-based surveys were conducted among a total of 296 people living in the Italian-speaking region of Switzerland (Ticino). After examining the latent variables underlying the observed variables of the Italian scale via principal component analysis (PCA), fit indices for two alternative models were calculated using confirmatory factor analysis (CFA). The scale structure was examined via parametric and nonparametric item response theory (IRT) analyses accounting for differences between items regarding the proportion of answers indicating high ability. Convergent validity was assessed by correlations with theoretically related constructs. Results CFA showed a suboptimal model fit for both models. IRT analyses confirmed all items measure a single dimension as intended. Reliability and construct validity of the final scale were also confirmed. The contrasting results of factor analysis (FA) and IRT analyses highlight the importance of considering differences in item difficulty when examining health literacy scales. Conclusions The findings support the reliability and validity of the translated scale and its use for assessing Italian-speaking consumers’ eHealth literacy. PMID:28400356
Development of a direct observation Measure of Environmental Qualities of Activity Settings.
King, Gillian; Rigby, Patty; Batorowicz, Beata; McMain-Klein, Margot; Petrenchik, Theresa; Thompson, Laura; Gibson, Michelle
2014-08-01
The aim of this study was to develop an observer-rated measure of aesthetic, physical, social, and opportunity-related qualities of leisure activity settings for young people (with or without disabilities). Eighty questionnaires were completed by sets of raters who independently rated 22 community/home activity settings. The scales of the 32-item Measure of Environmental Qualities of Activity Settings (MEQAS; Opportunities for Social Activities, Opportunities for Physical Activities, Pleasant Physical Environment, Opportunities for Choice, Opportunities for Personal Growth, and Opportunities to Interact with Adults) were determined using principal components analyses. Test-retest reliability was determined for eight activity settings, rated twice (4-6wk interval) by a trained rater. The factor structure accounted for 80% of the variance. The Kaiser-Meyer-Olkin Measure of Sampling Adequacy was 0.73. Cronbach's alphas for the scales ranged from 0.76 to 0.96, and interrater reliabilities (ICCs) ranged from 0.60 to 0.93. Test-retest reliabilities ranged from 0.70 to 0.90. Results suggest that the MEQAS has a sound factor structure and preliminary evidence of internal consistency, interrater, and test-retest reliability. The MEQAS is the first observer-completed measure of environmental qualities of activity settings. The MEQAS allows researchers to assess comprehensively qualities and affordances of activity settings, and can be used to design and assess environmental qualities of programs for young people. © 2014 Mac Keith Press.
Ho, Chester H; Cheung, Amanda; Southern, Danielle; Ocampo, Wrechelle; Kaufman, Jaime; Hogan, David B; Baylis, Barry; Conly, John M; Stelfox, Henry T; Ghali, William A
2016-12-01
Research regarding the reliability of the Braden Scale and nurses' perspectives on the instrument for predicting pressure ulcer (PU) risk in acute care settings is limited. A mixed-methods study was conducted in a tertiary acute care facility to examine interrater reliability (IRR) of the Braden Scale and its subscales, and a qualitative survey using semi-structured interviews was conducted among nurses caring for patients in acute care units to gain nurse perspective regarding scale usability. Data were extracted from a previous retrospective, randomized, controlled trial involving adult patients with compromised mobility receiving care in a tertiary acute care hospital in Canada. One-way, intraclass correlation coefficients (ICCs) were calculated on item and total scores, and kappa statistics were used to determine reliability of categorizing patients on their risk. Interview results were categorized by common themes. Reliability was assessed on 64 patients, where nurses and research staff independently assessed enrolled participants at baseline and after 72 hours using the Braden Scale as it appeared on an electronic medical record. IRR for the total score was high (ICC = 0.807). The friction and shear item had the lowest reliability (ICC = 0.266). Reliability of categorizing patients' level of risk had moderate agreement (κ = 0.408). Three (3) major and 12 subthemes emerged from the 14 nurse interviews; nurses were aware of the scale's purpose but were uncertain of its effectiveness, some items were difficult to rate, and questions were raised as to whether using the scale enhanced patient care. Aspects identified by nurses to enhance usability included: 1) changes to the electronic version (incorporating the scale into daily assessment documents with readily available item descriptions), 2) additional training, and 3) easily available resource material to improve reliability and usability of scale. These findings need to be considered when using the Braden Scale in clinical practice. Further study of the value of the total Braden Scale and its subscales is warranted.
Dijkstra, Boukje; Golbach, Milou; De Jong, Cor; Schellekens, Arnt
2016-01-01
Background Addiction, or substance dependence, is nowadays considered a chronic relapsing condition. However, perceptions of addiction vary widely, also among healthcare professionals. Perceptions of addiction are thought to contribute to attitude and stigma towards patients with addiction. However, studies into perceptions of addiction among healthcare professionals are limited and instruments for reliable assessment of their perceptions are lacking. The Illness Perception Questionnaire (IPQ) is widely used to evaluate perceptions of illness. The aim of this study was to evaluate the psychometric properties of the IPQ: factor structure, internal consistency, and discriminant validity, when applied to evaluate healthcare professionals’ perceptions of addiction. Methods Participants were 1072 healthcare professionals in training and master students from the Netherlands and Indonesia, recruited from various addiction-training programs. The revised version of the IPQ was adapted to measure perceptions of addiction (IPQ-A). Maximum likelihood method was used to explore the best-fit IPQ factor structure. Internal consistency was evaluated for the final factors. The final factor structure was used to assess discriminant validity of the IPQ, by comparing illness perceptions of addiction between 1) medical students from the Netherlands and Indonesia, 2) medical students psychology students and educational science students from the Netherlands, and 3) participants with different training levels: medical students versus medical doctors. Results Factor analysis revealed an eight-factor structure for the perception subscale (demoralization, timeline chronic, consequences, personal control, treatment control, illness coherence, timeline cyclical emotional representations) and a four-factor structure for the attribution subscale (psychological attributions, risk factors, smoking/alcohol, overwork). Internal reliability was acceptable to good. The IPQ-A was able to detect differences in perceptions between healthcare professionals from different cultural and educational background and level of training. Conclusions The IPQ-A is a valid and reliable instrument to assess healthcare professionals’ perceptions of addiction. PMID:27824872
Ayu, Astri Parawita; Dijkstra, Boukje; Golbach, Milou; De Jong, Cor; Schellekens, Arnt
2016-01-01
Addiction, or substance dependence, is nowadays considered a chronic relapsing condition. However, perceptions of addiction vary widely, also among healthcare professionals. Perceptions of addiction are thought to contribute to attitude and stigma towards patients with addiction. However, studies into perceptions of addiction among healthcare professionals are limited and instruments for reliable assessment of their perceptions are lacking. The Illness Perception Questionnaire (IPQ) is widely used to evaluate perceptions of illness. The aim of this study was to evaluate the psychometric properties of the IPQ: factor structure, internal consistency, and discriminant validity, when applied to evaluate healthcare professionals' perceptions of addiction. Participants were 1072 healthcare professionals in training and master students from the Netherlands and Indonesia, recruited from various addiction-training programs. The revised version of the IPQ was adapted to measure perceptions of addiction (IPQ-A). Maximum likelihood method was used to explore the best-fit IPQ factor structure. Internal consistency was evaluated for the final factors. The final factor structure was used to assess discriminant validity of the IPQ, by comparing illness perceptions of addiction between 1) medical students from the Netherlands and Indonesia, 2) medical students psychology students and educational science students from the Netherlands, and 3) participants with different training levels: medical students versus medical doctors. Factor analysis revealed an eight-factor structure for the perception subscale (demoralization, timeline chronic, consequences, personal control, treatment control, illness coherence, timeline cyclical emotional representations) and a four-factor structure for the attribution subscale (psychological attributions, risk factors, smoking/alcohol, overwork). Internal reliability was acceptable to good. The IPQ-A was able to detect differences in perceptions between healthcare professionals from different cultural and educational background and level of training. The IPQ-A is a valid and reliable instrument to assess healthcare professionals' perceptions of addiction.
Zackoff, Matthew; Jerardi, Karen; Unaka, Ndidi; Sucharew, Heidi; Klein, Melissa
2015-06-01
Residents play a critical role in the education of peers and medical students, yet attainment of teaching skills is not routinely assessed. The primary aim of this study was to develop a novel, skill-based Observed Structured Teaching Evaluation (OSTE) and self-assessment survey to measure the impact of a resident-as-teacher curriculum on teaching competency. The secondary aim was to determine interrater reliability of the OSTE. A prospective study quantitatively assessed intern teaching competency via videotaped teaching encounters (videos) before and after a month-long hospital medicine rotation and self-assessment surveys over a 5-month period. The intervention group received the resident-as-teacher curriculum. Videos were evaluated by 2 blinded faculty via an OSTE covering 9 skills within 3 core components: preparation, teaching, and reflection. Pre- to post-HM rotation month differences were evaluated within and between groups using the Wilcoxon signed rank test and Wilcoxon rank-sum test, respectively. Twenty-two of 25 (88%) control and 27 of 28 (96%) intervention interns participated; 100% of participants completed the study. The intervention group's pre-post difference for the total OSTE score and the average self-assessed competence statistically improved; however, no significant difference was seen between groups. The difference in preparation scores was significant for the intervention compared with the control. The OSTE's interrater reliability demonstrated good agreement with weighted kappas of 0.86 for preparation, 0.71 for teaching, and 0.93 for reflection. Implementation of an objective, skill-based OSTE detected observable changes in interns' teaching competency after implementation of a brief resident-as-teacher curriculum. The OSTE's good interrater reliability may allow standardized assessment of skill attainment over time. Copyright © 2015 by the American Academy of Pediatrics.
Grilo, C M
2004-01-01
To examine the factor structure of DSM-IV criteria for obsessive compulsive personality disorder (OCPD) in patients with binge eating disorder (BED). Two hundred and eleven consecutive out-patients with axis I diagnoses of BED were reliably assessed with semi-structured diagnostic interviews. The eight criteria for the OCPD diagnosis were examined with reliability and correlational analyses. Exploratory factor analysis was performed to identify potential components. Cronbach's coefficient alpha for the OCPD criteria was 0.77. Principal components factor analysis with varimax rotation revealed a three-factor solution (rigidity, perfectionism, and miserliness), which accounted for 65% of variance. The DSM-IV criteria for OCPD showed good internal consistency. Exploratory factor analysis, however, revealed three components that may reflect distinct interpersonal, intrapersonal (cognitive), and behavioral features.
Iversen, J V; Bartels, E M; Jørgensen, J E; Nielsen, T G; Ginnerup, C; Lind, M C; Langberg, H
2016-12-01
The VISA-A questionnaire has proven to be a valid and reliable tool for assessing severity of Achilles tendinopathy (AT). The aim was to translate and cross-culturally adapt the VISA-A questionnaire for a Danish-speaking AT population, and subsequently perform validity and reliability tests. Translation and following cross-cultural adaptation was performed as translation, synthesis, reverse translation, expert review, and pretesting. The final Danish version (VISA-A-DK) was tested for reliability on healthy controls (n = 75) and patients (n = 36). Tests for internal consistency, validity, and structure were performed on 71 patients. VISA-A-DK showed good reliability for patients (r = 0.80 ICC = 0.79) and healthy individuals (r = 0.98 ICC = 0.97). Internal consistency was 0.73 (Cronbach's alpha). The mean VISA-A-DK score in AT patients was 51 [47-55]. This was significantly lower than healthy controls with a score of 93 (90-95). Criterion validity was considered good when comparing the scores of the Danish version with the original version in both healthy individuals and patients. VISA-A-DK is a valid and reliable instrument and has shown compatible to the original version in assessment of AT patients. VISA-A-DK is a useful tool in the assessment of AT, both in research and in a clinical setting. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Abdul Khaiyom, Jamilah Hanum; Mukhtar, Firdaus; Ibrahim, Normala; Mohd Sidik, Sherina; Oei, Tian Po Sumantri
2016-12-01
The Catastrophic Cognitions Questionnaire-Modified (CCQ-M) is a common instrument for measuring catastrophic thoughts. In some countries, however, CCQ-M still poses concerns following the lack of appropriate validation among their populations. The current study aimed to examine the factor structure of the CCQ-M, the reliability, and the validity in community samples in Malaysia. The Malay version of CCQ-M and additional measures assessing the symptoms and cognitions relevant to anxiety disorders were completed by 682 university students and general community. Exploratory factor analysis revealed a two-factor structure accounting for 62.2% of the total variance. Confirmatory factor analysis confirmed the two-factor model by deleting four items. The Cronbach's alpha coefficients for the total and the two subscales were .94, .90, and .92, respectively. Test-retest reliability analysis was conducted on 82 university students in the interval period of 14 days, and the result was r = .58. Evidence supported the concurrent, convergent, and discriminant validity. In conclusion, the 17-item CCQ-M-Malaysia is a valid and reliable instrument for assessing catastrophic cognitions among Malaysian populations. Copyright © 2015 John Wiley & Sons, Ltd. Copyright © 2015 John Wiley & Sons, Ltd.
[The Basel Interview for Psychosis (BIP): structure, reliability and validity].
Riecher-Rössler, A; Ackermann, T; Uttinger, M; Ittig, S; Koranyi, S; Rapp, C; Bugra, H; Studerus, E
2015-02-01
Although several instruments have been developed to identify patients with an at-risk mental state (ARMS) for psychosis and first episode of psychosis (FEP), up to now there were no instruments for a detailed assessment of risk factors and indicators of emerging psychosis and the temporal development of psychiatric symptoms over the whole life span in these patients. We therefore developed the Basle Interview for Psychosis (BIP). The aim of this study is to describe the development of the BIP and to report about its psychometric properties. The BIP is a comprehensive semi-structured interview that was developed for the Basel early detection of psychoses (FePsy) study. Its items were derived from the most important risk factors and indicators of psychosis described in the literature and from several existing instruments. It contains the following six sections: 1) social and physical development and family, 2) signs and symptoms, 3) vulnerability, 4) help-seeking behavior, 5) illness insight, 6) evaluation of the interview. To estimate the inter-rater reliabilities of the items of sections 2 and 3, 20 interviews were conducted and rated by 8 well-trained raters. The factorial structure of the BIP section "signs and symptoms" was explored in a sample of 120 ARMS and 77 FEP patients. On the basis of the discovered factorial structure, we created new subscales and assessed their reliabilities and validities. Of the 153 studied items of sections 2 and 3, 150 (98 %) were rated with sufficiently high agreement (inter-rater reliability > 0.4). The items of section "signs and symptoms" could be grouped into 5 subscales with predominantly good to very good internal consistencies, homogeneities, and discriminant and convergent validities. Predictive validities could be demonstrated for the subscales "Positive Psychotic Symptoms", "Disturbance of Thinking" and the total score. The BIP is the first interview for comprehensively assessing risk factors and indicators of emerging psychosis and the temporal development of psychiatric symptoms over the whole life span, which has been validated in ARMS and FEP patients. We could show that the BIP has excellent psychometric properties. © Georg Thieme Verlag KG Stuttgart · New York.
Space flight risk data collection and analysis project: Risk and reliability database
NASA Technical Reports Server (NTRS)
1994-01-01
The focus of the NASA 'Space Flight Risk Data Collection and Analysis' project was to acquire and evaluate space flight data with the express purpose of establishing a database containing measurements of specific risk assessment - reliability - availability - maintainability - supportability (RRAMS) parameters. The developed comprehensive RRAMS database will support the performance of future NASA and aerospace industry risk and reliability studies. One of the primary goals has been to acquire unprocessed information relating to the reliability and availability of launch vehicles and the subsystems and components thereof from the 45th Space Wing (formerly Eastern Space and Missile Command -ESMC) at Patrick Air Force Base. After evaluating and analyzing this information, it was encoded in terms of parameters pertinent to ascertaining reliability and availability statistics, and then assembled into an appropriate database structure.
Scale of attitudes toward alcohol - Spanish version: evidences of validity and reliability 1
Ramírez, Erika Gisseth León; de Vargas, Divane
2017-01-01
ABSTRACT Objective: validate the Scale of attitudes toward alcohol, alcoholism and individuals with alcohol use disorders in its Spanish version. Method: methodological study, involving 300 Colombian nurses. Adopting the classical theory, confirmatory factor analysis was applied without prior examination, based on the strong historical evidence of the factorial structure of the original scale to determine the construct validity of this Spanish version. To assess the reliability, Cronbach’s Alpha and Mc Donalid’s Omega coefficients were used. Results: the confirmatory factor analysis indicated the good fit of the scale model in a four-factor distribution, with a cut-off point at 3.2, demonstrating 66.7% of sensitivity. Conclusions: the Scale of attitudes toward alcohol, alcoholism and individuals with alcohol use disorders in Spanish presented robust psychometric qualities, affirming that the instrument possesses a solid factorial structure and reliability and is capable of precisely measuring the nurses’ atittudes towards the phenomenon proposed. PMID:28793126
Reliability and Structural Validity of The Teacher Rating Scales of Early Academic Competence
ERIC Educational Resources Information Center
Reid, Erin E.; Diperna, James C.; Missall, Kristen; Volpe, Robert J.
2014-01-01
Currently, there are few strengths-based preschool rating scales that sample a wide array of behaviors believed to be essential for early academic success. The purpose of this study was to assess the factor structure of a new measure of early academic competence for at-risk preschool populations. The Teacher Rating Scales of Early Academic…
Assessment of Semi-Structured Clinical Interview for Mobile Phone Addiction Disorder
Alavi, Seyyed Salman; Jannatifard, Fereshteh; Mohammadi Kalhori, Soroush; Sepahbodi, Ghazal; BabaReisi, Mohammad; Sajedi, Sahar; Farshchi, Mojtaba; KhodaKarami, Rasul; Hatami Kasvaee, Vahid
2016-01-01
Objective: The Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision (DSM-IV-TR) classified mobile phone addiction disorder under “impulse control disorder not elsewhere classified”. This study surveyed the diagnostic criteria of DSM-IV-TR for the diagnosis of mobile phone addiction in correspondence with Iranian society and culture. Method: Two hundred fifty students of Tehran universities were entered into this descriptive-analytical and cross-sectional study. Quota sampling method was used. At first, semi- structured clinical interview (based on DSM-IV-TR) was performed for all the cases, and another specialist reevaluated the interviews. Data were analyzed using content validity, inter-scorer reliability (Kappa coefficient) and test-retest via SPSS18 software. Results: The content validity of the semi- structured clinical interview matched the DSM–IV-TR criteria for behavioral addiction. Moreover, their content was appropriate, and two items, including “SMS pathological use” and “High monthly cost of using the mobile phone” were added to promote its validity. Internal reliability (Kappa) and test–retest reliability were 0.55 and r = 0.4 (p<0. 01) respectively. Conclusion: The results of this study revealed that semi- structured diagnostic criteria of DSM-IV-TR are valid and reliable for diagnosing mobile phone addiction, and this instrument is an effective tool to diagnose this disorder. PMID:27437008
Matulis, Simone; Loos, Laura; Langguth, Nadine; Schreiber, Franziska; Gutermann, Jana; Gawrilow, Caterina; Steil, Regina
2015-01-01
Background The Trauma Symptom Checklist for Children (TSC-C) is the most widely used self-report scale to assess trauma-related symptoms in children and adolescents on six clinical scales. The purpose of the present study was to develop a German version of the TSC-C and to investigate its psychometric properties, such as factor structure, reliability, and validity, in a sample of German adolescents. Method A normative sample of N=583 and a clinical sample of N=41 adolescents with a history of physical or sexual abuse aged between 13 and 21 years participated in the study. Results The Confirmatory Factor Analysis on the six-factor model (anger, anxiety, depression, dissociation, posttraumatic stress, and sexual concerns with the subdimensions preoccupation and distress) revealed acceptable to good fit statistics in the normative sample. One item had to be excluded from the German version of the TSC-C because the factor loading was too low. All clinical scales presented acceptable to good reliability, with Cronbach's α's ranging from .80 to .86 in the normative sample and from .72 to .87 in the clinical sample. Concurrent validity was also demonstrated by the high correlations between the TSC-C scales and instruments measuring similar psychopathology. TSC-C scores reliably differentiated between adolescents with trauma history and those without trauma history, indicating discriminative validity. Conclusions In conclusion, the German version of the TSC-C is a reliable and valid instrument for assessing trauma-related symptoms on six different scales in adolescents aged between 13 and 21 years. PMID:26498182
Matulis, Simone; Loos, Laura; Langguth, Nadine; Schreiber, Franziska; Gutermann, Jana; Gawrilow, Caterina; Steil, Regina
2015-01-01
The Trauma Symptom Checklist for Children (TSC-C) is the most widely used self-report scale to assess trauma-related symptoms in children and adolescents on six clinical scales. The purpose of the present study was to develop a German version of the TSC-C and to investigate its psychometric properties, such as factor structure, reliability, and validity, in a sample of German adolescents. A normative sample of N=583 and a clinical sample of N=41 adolescents with a history of physical or sexual abuse aged between 13 and 21 years participated in the study. The Confirmatory Factor Analysis on the six-factor model (anger, anxiety, depression, dissociation, posttraumatic stress, and sexual concerns with the subdimensions preoccupation and distress) revealed acceptable to good fit statistics in the normative sample. One item had to be excluded from the German version of the TSC-C because the factor loading was too low. All clinical scales presented acceptable to good reliability, with Cronbach's α's ranging from .80 to .86 in the normative sample and from .72 to .87 in the clinical sample. Concurrent validity was also demonstrated by the high correlations between the TSC-C scales and instruments measuring similar psychopathology. TSC-C scores reliably differentiated between adolescents with trauma history and those without trauma history, indicating discriminative validity. In conclusion, the German version of the TSC-C is a reliable and valid instrument for assessing trauma-related symptoms on six different scales in adolescents aged between 13 and 21 years.
Statistical Analysis on the Mechanical Properties of Magnesium Alloys
Liu, Ruoyu; Jiang, Xianquan; Zhang, Hongju; Zhang, Dingfei; Wang, Jingfeng; Pan, Fusheng
2017-01-01
Knowledge of statistical characteristics of mechanical properties is very important for the practical application of structural materials. Unfortunately, the scatter characteristics of magnesium alloys for mechanical performance remain poorly understood until now. In this study, the mechanical reliability of magnesium alloys is systematically estimated using Weibull statistical analysis. Interestingly, the Weibull modulus, m, of strength for magnesium alloys is as high as that for aluminum and steels, confirming the very high reliability of magnesium alloys. The high predictability in the tensile strength of magnesium alloys represents the capability of preventing catastrophic premature failure during service, which is essential for safety and reliability assessment. PMID:29113116
Factor structure and reliability of the Spanish version of the Dissociative Ability Scale.
Pérez-Fabello, María José; Campos, Alfredo
2017-01-01
Everybody has dissociative ability to some extent, though this may vary from one individual to another. Several tests have been designed to measure dissociative ability, such as the Dissociative Ability Scale (Fisher, Johnson, & Elkins, 2013). Thus, the aim of this study was to assess the reliability and validity of the Spanish version of this test in a sample of 204 undergraduates seeking a fine arts degree at the University of Vigo (Spain). The reliability and validity of the Dissociative Ability Scale was found to be satisfactory for measuring dissociative ability. The results are discussed and innovative lines of research are proposed.
Reliability and validity of the Symptoms of Depression Questionnaire (SDQ)
Pedrelli, Paola; Blais, Mark A.; Alpert, Jonathan E.; Shelton, Richard C.; Walker, Rosemary S. W.; Fava, Maurizio
2015-01-01
Current measures for major depressive disorder focus primarily on the assessment of depressive symptoms, while often omitting other common features. However, the presence of comorbid features in the anxiety spectrum influences outcome and may effect treatment. More comprehensive measures of depression are needed that include the assessment of symptoms in the anxiety–depression spectrum. This study examines the reliability and validity of the Symptoms of Depression Questionnaire (SDQ), which assesses irritability, anger attacks, and anxiety symptoms together with the commonly considered symptoms of depression. Analysis of the factor structure of the SDQ identified 5 subscales, including one in the anxiety–depression spectrum, with adequate internal consistency and concurrent validity. The SDQ may be a valuable new tool to better characterize depression and identify and administer more targeted interventions. PMID:25275853
Villota, Orlando; Diaz, Mario; Ceron, Carmen; Moller, Ingrid; Naredo, Esperanza; Saaibi, Diego Luis
2017-07-28
To assess the intra- and inter-observer reliability of ultrasound (US) in scoring B-mode, Doppler synovitis and combined B-mode and Doppler synovitis scores in different peripheral joints of rheumatoid arthritis (RA) patients. Four rheumatologists with a formal training in musculoskeletal US (MSKUS) particularly focus on definitions and scoring synovitis on B-mode and Doppler mode participated in a patient-based reliability exercise on 16 active RA patients. The four rheumatologists independently and consecutively performed a B-mode and power Doppler (PD) US assessment of 7 joints of each patient in two rounds in a blinded fashion. Each joint was semi quantitatively scored from 0 to 3 for B-mode synovitis (BS), Doppler synovitis (DS), and combined B-mode/Doppler synovitis (CS). Intraobserver reliability was assessed by Cohen's κ. Interobserver reliability was assessed by unweight Light's κ. The mean prevalence of synovitis on B-mode was 83% of joints; scores ranging from grade 1 in 18% of joints, to grade 3 in 33%. In 55% of joints synovial PD signal was detected and the distribution of scores range from 14% of joints for grade 3, to 26% for grade 2. After a total of 448 joints scanned with 896 adquired images our intraobserver and interobserver reliability was good to excellent for most of the joints. Formal, structured and continuous training in musculoskeletal ultrasound would bring a good to excellent reproducibility in rheumatological hands with a high reliability in real time acquisition BS, DS and CS modalities for scoring synovitis in patients with active rheumatoid arthritis. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Palmer, Kara K.
2017-01-01
Assessing children’s perceptions of their movement abilities (i.e., perceived competence) is traditionally done using picture scales—Pictorial Scale of Perceived Competence and Acceptance for Young Children or Pictorial Scale of Perceived Movement Skill Competence. Pictures fail to capture the temporal components of movement. To address this limitation, we created a digital-based instrument to assess perceived motor competence: the Digital Scale of Perceived Motor Competence. The purpose of this study was to determine the validity, reliability, and internal consistency of the Digital-based Scale of Perceived Motor Skill Competence. The Digital-based Scale of Perceived Motor Skill Competence is based on the twelve fundamental motor skills from the Test of Gross Motor Development-2nd Edition with a similar layout and item structure as the Pictorial Scale of Perceived Movement Skill Competence. Face Validity of the instrument was examined in Phase I (n = 56; Mage = 8.6 ± 0.7 years, 26 girls). Test-retest reliability and internal consistency were assessed in Phase II (n = 54, Mage = 8.7 years ± 0.5 years, 26 girls). Intra-class correlations (ICC) and Cronbach’s alpha were conducted to determine test-retest reliability and internal consistency for all twelve skills along with locomotor and object control subscales. The Digital Scale of Perceived Motor Competence demonstrates excellent test-retest reliability (ICC = 0.83, total; ICC = 0.77, locomotor; ICC = 0.79, object control) and acceptable/good internal consistency (α = 0.62, total; α = 0.57, locomotor; α = 0.49, object control). Findings provide evidence of the reliability of the three level digital-based instrument of perceived motor competence for older children. PMID:29910408
Gao, Wenjun; Yuan, Changrong; Wang, Jichuan; Du, Jiarui; Wu, Huiqiao; Qian, Xiaojie; Hinds, Pamela S
2013-01-01
The City of Hope Quality of Life-Ostomy Questionnaire is a widely accepted scale to assess quality of life in ostomy patients. However, the validity and reliability of the Chinese version (C-COH) have not been studied. The objective of the study was to assess the validity and reliability of the C-COH among ostomy patients sampled from Shanghai from August 2010 to June 2011. Content validity was examined based on the reviews of a panel of 10 experts; test-retest was conducted to assess the item reliabilities of the scale; a pilot sample (n = 274) was selected to explore the factorial structure of the C-COH using exploratory factor analysis; a validation sample (n = 370) was selected to confirm the findings from the exploratory study using confirmatory factor analysis (CFA). Statistical package SPSS version 16.0 was used for the exploratory factor analysis, and Amos 17.0 was used for the CFA. The C-COH was developed by modifying 1 item and excluding 11 items from the original scale. Four factors/subscales (physical well-being, psychological well-being, social well-being, and spiritual well-being) were identified and confirmed in the C-COH The scale reliabilities estimated from the CFA results for the 4 subscales were 0.860, 0.885, 0.864, and 0.686, respectively. Findings support the reliability and validity of the C-COH. The C-COH could be a useful measure of the level of quality of life among Chinese patients with a stoma and may provide important intervention implications for healthcare providers to help improve the life quality of patients with a stoma.
Nia, Hamid Sharif; Sharif, Saeed Pahlevan; Froelicher, Erika Sivarajan; Boyle, Christopher; Goudarzian, Amir Hossein; Yaghoobzadeh, Ameneh; Oskouie, Fatemeh
2018-04-01
The aim of this study was to validate a Persian version of the Cardiac Depression Scale (CDS) in Iranian patients with acute myocardial infarction (AMI). The CDS was forward translated from English into Persian and back-translated to English. Validity was assessed using face, content, and construct validity. Also Cronbach's alpha (α), theta (), and McDonald's omega coefficient were used to evaluate the reliability. Construct validity of the scale showed two factors with eigenvalues greater than one. The Cronbach's α, , McDonald's omega, and construct reliability were greater than .70. The Persian version of the CDS has a two-factor structure (i.e., death anxiety and life satisfaction) and has acceptable reliability and validity. Therefore, the validated instrument can be used in future studies to assess depression in patients with AMI in Iranians.
A scale for measuring hygiene behavior: development, reliability and validity.
Stevenson, Richard J; Case, Trevor I; Hodgson, Deborah; Porzig-Drummond, Renata; Barouei, Javad; Oaten, Megan J
2009-09-01
There is currently no general self-report measure for assessing hygiene behavior. This article details the development and testing of such a measure. In studies 1 to 4, a total of 855 participants were used for scale and subscale development and for reliability and validity testing. The latter involved establishing the relationships between self-reported hygiene behavior and existing measures, hand hygiene behavior, illness rates, and a physiological marker of immune function. In study 5, a total of 507 participants were used to assess the psychometric properties of the final revised version of the scale. The final 23-item scale comprised 5 subscales: general, household, food-related, handwashing technique, and personal hygiene. Studies 1 to 4 confirmed the scale's reliability and validity, and study 5 confirmed the scale's 5-factor structure. The scale is potentially suitable for multiple uses, in various settings, and for experimental and correlational approaches.
Kadioglu, Hasibe; Erol, Saime; Ergun, Ayse
2015-01-01
The purpose of this research was to examine the psychometric properties of the Turkish version of the situational self-efficacy scale for vegetable and fruit consumption in adolescents. This was a methodological study. The study was conducted in four public secondary schools in Istanbul, Turkey. Subjects were 1586 adolescents. Content and construct validity were assessed to test the validity of the scale. The reliability was assessed in terms of internal consistency and test-retest reliability. For confirmatory factor analysis, χ(2) statistics plus other fit indices were used, including the goodness-of-fit index, the adjusted goodness-of-fit index, the nonnormed fit index, the comparative fit index, the standardized root mean residual, and the root mean square error of approximation. Pearson's correlation was used for test-retest reliability and item total correlation. The internal consistency was assessed by using Cronbach α. Confirmatory factor analysis strongly supported the three-component structure representing positive social situations (α = .81), negative effect situations (α = .93), and difficult situations (α = .78). Psychometric analyses of the Turkish version of the situational self-efficacy scale indicate high reliability and good content and construct validity. Researchers and health professionals will find it useful to employ the Turkish situational self-efficacy scale in evaluating situational self-efficacy for fruit and vegetable consumption in Turkish adolescents.
ERIC Educational Resources Information Center
Chakhssi, Farid; de Ruiter, Corine; Bernstein, David
2010-01-01
The Behavioural Status Index (BEST-Index) has been introduced into Dutch forensic psychiatry to measure change in risk level of future violence. The BEST-Index is a structured observational measure that assesses aggressive behavior, degree of insight, social skills, self-care, and work and leisure skills during inpatient treatment. Thus far,…
2011-09-01
a quality evaluation with limited data, a model -based assessment must be...that affect system performance, a multistage approach to system validation, a modeling and experimental methodology for efficiently addressing a ...affect system performance, a multistage approach to system validation, a modeling and experimental methodology for efficiently addressing a wide range
ERIC Educational Resources Information Center
Mejia, Anilena; Filus, Ania; Calam, Rachel; Morawska, Alina; Sanders, Matthew R.
2016-01-01
In the present study, we explored the factor structure as well as validity and reliability of the Spanish version of the Child Adjustment and Parent Efficacy Scale (CAPES) suitable for assessing child behavioural and emotional difficulties (Intensity Scale) and parental self-efficacy (Self-Efficacy Scale) among Spanish-speaking parents from the…
ERIC Educational Resources Information Center
Reid, Robert J.; Brown, Tiffany L.; Peterson, N. Andrew; Snowden, Lonnie; Hines, Alice
2009-01-01
Research has pointed to the important role that acculturation plays in understanding a range of physical health behaviors as well as psychological functioning, but only a few studies have attempted to establish reliable and valid measures of African American acculturation. The scale developed by Snowden and Hines (1999) to assess African American…
The development of a structured rating schedule (the BAS) to assess skills in breaking bad news
Miller, S J; Hope, T; Talbot, D C
1999-01-01
There has been considerable interest in how doctors break bad news, with calls from within the profession and from patients for doctors to improve their communication skills. In order to aid clinical training and assessment of the skills used in breaking bad news there is a need for a reliable, practical and valid, structured rating schedule. Such a rating schedule was compiled from agreed criteria in the literature. Video-taped recordings of simulated consultations breaking bad news were independently assessed by three raters using the schedule and compared to three experts who gave global ratings. The primary outcome measures were internal consistency of the schedule and level of agreement between raters. The internal consistency was high with a Cronbach's alpha of 0.93. Agreement between raters using the schedule was moderate to good. The majority of the variation in scores was due to the differences in skills demonstrated in the interviews. The agreement between raters not using the schedule was poor. The BAS provides a simple to use, reliable, and consistent rating schedule for assessing skills used in breaking bad news. It could be a valuable aid to teaching this difficult task. © 1999 Cancer Research Campaign PMID:10360657
Kwok, Cannas; Endrawes, Gihane; Lee, Chun Fan
2016-02-01
The aim of the study was to report the psychometric properties of the Arabic version of the Breast Cancer Screening Beliefs Questionnaire (BCSBQ). A convenience sample of 251 Arabic-Australian women was recruited from a number of Arabic community organizations. Construct validity was examined by Cuzick's non-parametric test while Cronbach α was used to assess internal consistency reliability. Explanatory factor analysis was conducted to study the factor structure. The results indicated that the Arabic version of the BCSBQ had satisfactory validity and internal consistency. The Cronbach's alpha of the three subscales ranged between 0.810 and 0.93. The frequency of breast cancer screening practices (breast awareness, clinical breast-examination and mammography) were significantly associated with attitudes towards general health check-up and perceived barriers to mammographic screening. Exploratory factor analysis showed a similar fit for the hypothesized three-factor structure with our data set. The Arabic version of the BCBSQ is a culturally appropriate, valid and reliable instrument for assessing the beliefs, knowledge and attitudes to breast cancer and breast cancer screening practices among Arabic-Australian women. Copyright © 2015 Elsevier Ltd. All rights reserved.
Risk management for the Space Exploration Initiative
NASA Technical Reports Server (NTRS)
Buchbinder, Ben
1993-01-01
Probabilistic Risk Assessment (PRA) is a quantitative engineering process that provides the analytic structure and decision-making framework for total programmatic risk management. Ideally, it is initiated in the conceptual design phase and used throughout the program life cycle. Although PRA was developed for assessment of safety, reliability, and availability risk, it has far greater application. Throughout the design phase, PRA can guide trade-off studies among system performance, safety, reliability, cost, and schedule. These studies are based on the assessment of the risk of meeting each parameter goal, with full consideration of the uncertainties. Quantitative trade-off studies are essential, but without full identification, propagation, and display of uncertainties, poor decisions may result. PRA also can focus attention on risk drivers in situations where risk is too high. For example, if safety risk is unacceptable, the PRA prioritizes the risk contributors to guide the use of resources for risk mitigation. PRA is used in the Space Exploration Initiative (SEI) Program. To meet the stringent requirements of the SEI mission, within strict budgetary constraints, the PRA structure supports informed and traceable decision-making. This paper briefly describes the SEI PRA process.
Brett, Benjamin L; Solomon, Gary S; Hill, Jennifer; Schatz, Philip
2018-03-01
This study examined the test-retest reliability of the four- and two-factor structures (i.e., Memory and Speed) of ImPACT over a 2-year interval across multiple groups with premorbid conditions, including those with a history of special education or learning disorders (LD; n = 114), treatment history for headache/migraine (n = 81), and a control group (n = 792). Nine hundred and eighty seven high school athletes completed baseline testing using online ImPACT across a 2-year interval. Paired-samples t-tests documented improvement from initial to follow-up assessments. Test stability was examined using Regression-based measures (RBM) and Reliable change indices (RCI). Reliability was examined using intraclass correlation coefficients (ICC). Significant improvement on all four composites were observed for the control group over a 2-year interval; whereas significant differences were observed only on Visual Motor Speed for the LD and headache/migraine treatment history groups. ICCs ranges were similar across groups and greater or comparable reliability was observed for the two-factor structure on Memory (0.67-0.73) and Speed (0.76-0.78) composites. RCIs and RBMs demonstrated stability for the four- and two-factor structures, with few cases falling outside the range of expected change within a healthy sample at the 90% and 95% CIs. Typical practices of obtaining new baselines every 2 years in the high school population can be applied to athletes with a history of special education or LD and headache/migraine treatment. The two-factor structure has potential to increase test-retest reliability. Further research regarding clinical utility is needed. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Tabuse, Hideaki; Kalali, Amir; Azuma, Hideki; Ozaki, Norio; Iwata, Nakao; Naitoh, Hiroshi; Higuchi, Teruhiko; Kanba, Shigenobu; Shioe, Kunihiko; Akechi, Tatsuo; Furukawa, Toshi A
2007-09-30
The Hamilton Rating Scale for Depression (HAMD) is the de facto international gold standard for the assessment of depression. There are some criticisms, however, especially with regard to its inter-rater reliability, due to the lack of standardized questions or explicit scoring procedures. The GRID-HAMD was developed to provide standardized explicit scoring conventions and a structured interview guide for administration and scoring of the HAMD. We developed the Japanese version of the GRID-HAMD and examined its inter-rater reliability among experienced and inexperienced clinicians (n=70), how rater characteristics may affect it, and how training can improve it in the course of a model training program using videotaped interviews. The results showed that the inter-rater reliability of the GRID-HAMD total score was excellent to almost perfect and those of most individual items were also satisfactory to excellent, both with experienced and inexperienced raters, and both before and after the training. With its standardized definitions, questions and detailed scoring conventions, the GRID-HAMD appears to be the best achievable set of interview guides for the HAMD and can provide a solid tool for highly reliable assessment of depression severity.
2015-01-01
Background The Center for Epidemiologic Studies Depression Scale (CES-D) is a commonly used instrument to measure depressive symptomatology. Despite this, the evidence for its psychometric properties remains poorly established in Chinese populations. The aim of this study was to validate the use of the CES-D in Chinese primary care patients by examining factor structure, construct validity, reliability, sensitivity and responsiveness. Methods and Results The psychometric properties were assessed amongst a sample of 3686 Chinese adult primary care patients in Hong Kong. Three competing factor structure models were examined using confirmatory factor analysis. The original CES-D four-structure model had adequate fit, however the data was better fit into a bi-factor model. For the internal construct validity, corrected item-total correlations were 0.4 for most items. The convergent validity was assessed by examining the correlations between the CES-D, the Patient Health Questionnaire 9 (PHQ-9) and the Short Form-12 Health Survey (version 2) Mental Component Summary (SF-12 v2 MCS). The CES-D had a strong correlation with the PHQ-9 (coefficient: 0.78) and SF-12 v2 MCS (coefficient: -0.75). Internal consistency was assessed by McDonald’s omega hierarchical (ωH). The ωH value for the general depression factor was 0.855. The ωH values for “somatic”, “depressed affect”, “positive affect” and “interpersonal problems” were 0.434, 0.038, 0.738 and 0.730, respectively. For the two-week test-retest reliability, the intraclass correlation coefficient was 0.91. The CES-D was sensitive in detecting differences between known groups, with the AUC >0.7. Internal responsiveness of the CES-D to detect positive and negative changes was satisfactory (with p value <0.01 and all effect size statistics >0.2). The CES-D was externally responsive, with the AUC>0.7. Conclusions The CES-D appears to be a valid, reliable, sensitive and responsive instrument for screening and monitoring depressive symptoms in adult Chinese primary care patients. In its original four-factor and bi-factor structure, the CES-D is supported for cross-cultural comparisons of depression in multi-center studies. PMID:26252739
Predicting kinetics of polymorphic transformations from structure mapping and coordination analysis
NASA Astrophysics Data System (ADS)
Stevanović, Vladan; Trottier, Ryan; Musgrave, Charles; Therrien, Félix; Holder, Aaron; Graf, Peter
2018-03-01
To extend materials design and discovery into the space of metastable polymorphs, rapid and reliable assessment of transformation kinetics to lower energy structures is essential. Herein we focus on diffusionless polymorphic transformations and investigate routes to assess their kinetics using solely crystallographic arguments. As part of this investigation we developed a general algorithm to map crystal structures onto each other, and ascertain the low-energy (fast-kinetics) transformation pathways between them. Pathways with minimal dissociation of chemical bonds, along which the number of bonds (in ionic systems the first-shell coordination) does not decrease below that in the end structures, are shown to always be the fast-kinetics pathways. These findings enable the rapid assessment of the kinetics of polymorphic transformation and the identification of long-lived metastable structures. The utility is demonstrated on a number of transformations including those between high-pressure SnO2 phases, which lack a detailed atomic-level understanding.
Study on application of aerospace technology to improve surgical implants
NASA Technical Reports Server (NTRS)
Johnson, R. E.; Youngblood, J. L.
1982-01-01
The areas where aerospace technology could be used to improve the reliability and performance of metallic, orthopedic implants was assessed. Specifically, comparisons were made of material controls, design approaches, analytical methods and inspection approaches being used in the implant industry with hardware for the aerospace industries. Several areas for possible improvement were noted such as increased use of finite element stress analysis and fracture control programs on devices where the needs exist for maximum reliability and high structural performance.
A Probabilistic Design Method Applied to Smart Composite Structures
NASA Technical Reports Server (NTRS)
Shiao, Michael C.; Chamis, Christos C.
1995-01-01
A probabilistic design method is described and demonstrated using a smart composite wing. Probabilistic structural design incorporates naturally occurring uncertainties including those in constituent (fiber/matrix) material properties, fabrication variables, structure geometry and control-related parameters. Probabilistic sensitivity factors are computed to identify those parameters that have a great influence on a specific structural reliability. Two performance criteria are used to demonstrate this design methodology. The first criterion requires that the actuated angle at the wing tip be bounded by upper and lower limits at a specified reliability. The second criterion requires that the probability of ply damage due to random impact load be smaller than an assigned value. When the relationship between reliability improvement and the sensitivity factors is assessed, the results show that a reduction in the scatter of the random variable with the largest sensitivity factor (absolute value) provides the lowest failure probability. An increase in the mean of the random variable with a negative sensitivity factor will reduce the failure probability. Therefore, the design can be improved by controlling or selecting distribution parameters associated with random variables. This can be implemented during the manufacturing process to obtain maximum benefit with minimum alterations.
Assessment of and standardization for quantitative nondestructive test
NASA Technical Reports Server (NTRS)
Neuschaefer, R. W.; Beal, J. B.
1972-01-01
Present capabilities and limitations of nondestructive testing (NDT) as applied to aerospace structures during design, development, production, and operational phases are assessed. It will help determine what useful structural quantitative and qualitative data may be provided from raw materials to vehicle refurbishment. This assessment considers metal alloys systems and bonded composites presently applied in active NASA programs or strong contenders for future use. Quantitative and qualitative data has been summarized from recent literature, and in-house information, and presented along with a description of those structures or standards where the information was obtained. Examples, in tabular form, of NDT technique capabilities and limitations have been provided. NDT techniques discussed and assessed were radiography, ultrasonics, penetrants, thermal, acoustic, and electromagnetic. Quantitative data is sparse; therefore, obtaining statistically reliable flaw detection data must be strongly emphasized. The new requirements for reusable space vehicles have resulted in highly efficient design concepts operating in severe environments. This increases the need for quantitative NDT evaluation of selected structural components, the end item structure, and during refurbishment operations.
Damage Tolerance Assessment Branch
NASA Technical Reports Server (NTRS)
Walker, James L.
2013-01-01
The Damage Tolerance Assessment Branch evaluates the ability of a structure to perform reliably throughout its service life in the presence of a defect, crack, or other form of damage. Such assessment is fundamental to the use of structural materials and requires an integral blend of materials engineering, fracture testing and analysis, and nondestructive evaluation. The vision of the Branch is to increase the safety of manned space flight by improving the fracture control and the associated nondestructive evaluation processes through development and application of standards, guidelines, advanced test and analytical methods. The Branch also strives to assist and solve non-aerospace related NDE and damage tolerance problems, providing consultation, prototyping and inspection services.
Ying, Yu-Wen; Lee, Peter Allen; Tsai, Jeanne L
2004-11-01
The Inventory of College Challenges for Ethnic Minority Students (ICCEMS) is a newly developed instrument that assesses challenges faced by ethnic minority college students across a range of cultural, academic, social, and practical domains. The present study tested the ICCEMS among Chinese American students in an attempt to identify its factor structure and assess its psychometric properties. A total of 13 factor domains emerged. The Cronbach's alpha and 1-month test-retest reliability of the subscales and the overall scale supported their reliability. Both criterion and construct validities were also demonstrated. Chinese American college students faced the greatest challenges in terms of unclear career direction and academic demands. 2004 APA
Assessing Perceptions AbouT Hazardous Substances (PATHS): The PATHS questionnaire
Amlôt, Richard; Page, Lisa; Pearce, Julia; Wessely, Simon
2013-01-01
How people perceive the nature of a hazardous substance may determine how they respond when potentially exposed to it. We tested a new Perceptions AbouT Hazardous Substances (PATHS) questionnaire. In Study 1 (N = 21), we assessed the face validity of items concerning perceptions about eight properties of a hazardous substance. In Study 2 (N = 2030), we tested the factor structure, reliability and validity of the PATHS questionnaire across four qualitatively different substances. In Study 3 (N = 760), we tested the impact of information provision on Perceptions AbouT Hazardous Substances scores. Our results showed that our eight measures demonstrated good reliability and validity when used for non-contagious hazards. PMID:23104995
Validation of the Perceived Stigmatization Questionnaire for Brazilian adult burn patients.
Freitas, Noélle de Oliveira; Forero, Carlos García; Caltran, Marina Paes; Alonso, Jordi; Dantas, Rosana A Spadoti; Piccolo, Monica Sarto; Farina, Jayme Adriano; Lawrence, John W; Rossi, Lidia A
2018-01-01
Currently, there is no questionnaire to assess perceived stigmatization among people with visible differences in Brazil. The Perceived Stigmatization Questionnaire (PSQ), developed in the United States, is a valid instrument to assess the perception of stigmatizing behaviours among burn survivors. The objective of this cross-sectional and multicentre study was to assess the factor structure, reliability and validity of the Brazilian Portuguese version of the PSQ in burn patients. A Brazilian version of the 21-item PSQ was answered by 240 adult burn patients, undergoing rehabilitation in two burns units in Brazil. We tested its construct validity by correlating PSQ scores with depression (Beck Depression Index-BDI) and self-esteem (Rosenberg Self-Esteem Scale-RSE), as well as with two domains of the Revised Burn Specific Health Scale-BSHS-R: affect and body image, and interpersonal relationships. We used Confirmatory Item Factor Analysis (CIFA) to test whether the data fit a measurement model involving a three-factor structure (absence of friendly behaviour; confusing/staring behaviour; and hostile behaviour). We conducted Exploratory Factor Analyses (EFA) of the subscale in a 50% random sample of individuals (training split), treating items as ordinal categorical using unweighted least squares estimation. To assess discriminant validity of the Brazilian version of the PSQ we correlated PSQ scores with known groups (sex, total body surface area burned, and visibility of the scars) and assessed its reliability by means of Cronbach's alpha and using test-retest. Goodness-of-fit indices for confirmatory factor analysis were satisfactory for the PSQ, but not for the hostile behaviour subscale, which was modified to improve fit by eliminating 3 items. Cronbach's alphas for the PSQ refined version (PSQ-R) ranged from 0.65 to 0.88, with test-retest reliability 0.87 for the total score. The PSQ-R scores correlated strongly with depression (0.63; p < 0.001), self-esteem (-0.57; p < 0.001), body image (-0.63; p < 0.001), and interpersonal relationships (-0.55; p < 0.001). PSQ-R total scores were significantly lower for patients with visible scars (effect size = 0.51, p = 0.029). The PSQ-R showed reliability and validity comparable to the original version. However, the cross-cultural structure of the subscale "hostile behaviour" and sensitivity to change of the PSQ should be further evaluated.
Validation of the Perceived Stigmatization Questionnaire for Brazilian adult burn patients
Forero, Carlos García; Caltran, Marina Paes; Alonso, Jordi; Dantas, Rosana A. Spadoti; Piccolo, Monica Sarto; Farina, Jayme Adriano; Lawrence, John W.; Rossi, Lidia A.
2018-01-01
Currently, there is no questionnaire to assess perceived stigmatization among people with visible differences in Brazil. The Perceived Stigmatization Questionnaire (PSQ), developed in the United States, is a valid instrument to assess the perception of stigmatizing behaviours among burn survivors. The objective of this cross-sectional and multicentre study was to assess the factor structure, reliability and validity of the Brazilian Portuguese version of the PSQ in burn patients. A Brazilian version of the 21-item PSQ was answered by 240 adult burn patients, undergoing rehabilitation in two burns units in Brazil. We tested its construct validity by correlating PSQ scores with depression (Beck Depression Index-BDI) and self-esteem (Rosenberg Self-Esteem Scale-RSE), as well as with two domains of the Revised Burn Specific Health Scale—BSHS-R: affect and body image, and interpersonal relationships. We used Confirmatory Item Factor Analysis (CIFA) to test whether the data fit a measurement model involving a three-factor structure (absence of friendly behaviour; confusing/staring behaviour; and hostile behaviour). We conducted Exploratory Factor Analyses (EFA) of the subscale in a 50% random sample of individuals (training split), treating items as ordinal categorical using unweighted least squares estimation. To assess discriminant validity of the Brazilian version of the PSQ we correlated PSQ scores with known groups (sex, total body surface area burned, and visibility of the scars) and assessed its reliability by means of Cronbach's alpha and using test-retest. Goodness-of-fit indices for confirmatory factor analysis were satisfactory for the PSQ, but not for the hostile behaviour subscale, which was modified to improve fit by eliminating 3 items. Cronbach’s alphas for the PSQ refined version (PSQ-R) ranged from 0.65 to 0.88, with test-retest reliability 0.87 for the total score. The PSQ-R scores correlated strongly with depression (0.63; p < 0.001), self-esteem (-0.57; p < 0.001), body image (-0.63; p < 0.001), and interpersonal relationships (-0.55; p < 0.001). PSQ-R total scores were significantly lower for patients with visible scars (effect size = 0.51, p = 0.029). The PSQ-R showed reliability and validity comparable to the original version. However, the cross-cultural structure of the subscale “hostile behaviour” and sensitivity to change of the PSQ should be further evaluated. PMID:29381711
NASA Technical Reports Server (NTRS)
Neam, Douglas C.; Gerber, John D.
1992-01-01
The stringent stability requirements of the Corrective Optics Space Telescope Axial Replacement (COSTAR) necessitates a Deployable Optical Bench (DOB) with both a low CTE and high resonant frequency. The DOB design consists of a monocoque thin shell structure which marries metallic machined parts with graphite epoxy formed structure. Structural analysis of the DOB has been integrated into the laminate design and optimization process. Also, the structural analytical results are compared with vibration and thermal test data to assess the reliability of the analysis.
Sauter, Floor M; Heyne, David; Blöte, Anke W; van Widenfelt, Brigit M; Westenberg, P Michiel
2010-05-01
The effectiveness of cognitive-behaviour therapy with young people may be influenced by a young person's capacity for self-reflection and insight. Clinicians who assess clients' proficiencies in these cognitive capacities can better tailor cognitive and behavioural techniques to the client, facilitating engagement and enhancing treatment outcome. It is therefore important that sound instruments for assessing self-reflection and insight in young people are available. The aim of the current study was to translate and adapt the Self-Reflection and Insight Scale (SRIS) for use with a child and adolescent population (Study 1), and to evaluate the psychometric properties of the resulting measure, the Self-Reflection and Insight Scale for Youth (SRIS-Y; Study 2). In Study 1 (n=145), the comprehensibility of the SRIS-Y was assessed in a community sample of children and adolescents. Study 2 (n=215) then explored the reliability and structural, convergent, and divergent validity of the SRIS-Y. The SRIS-Y was found to be comprehensible to young people, and had good reliability and structural validity. It appears that the SRIS-Y is a sound instrument for assessing therapy-relevant cognitive capacities in young people, of potential benefit in both research and clinical contexts. Future research foci include the predictive validity of the instrument.
Kramp, Kelvin H; van Det, Marc J; Hoff, Christiaan; Lamme, Bas; Veeger, Nic J G M; Pierie, Jean-Pierre E N
2015-01-01
Global Operative Assessment of Laparoscopic Skills (GOALS) assessment has been designed to evaluate skills in laparoscopic surgery. A longitudinal blinded study of randomized video fragments was conducted to estimate the validity and reliability of GOALS in novice trainees. In total, 10 trainees each performed 6 consecutive laparoscopic cholecystectomies. Sixty procedures were recorded on video. Video fragments of (1) opening of the peritoneum; (2) dissection of Calot's triangle and achievement of critical view of safety; and (3) dissection of the gallbladder from the liver bed were blinded, randomized, and rated by 2 consultant surgeons using GOALS. Also, a grade was given for overall competence. The correlation of GOALS with live observation Objective Structured Assessment of Technical Skills (OSATS) scores was calculated. Construct validity was estimated using the Friedman 2-way analysis of variance by ranks and the Wilcoxon signed-rank test. The interrater reliability was calculated using the absolute and consistency agreement 2-way random-effects model intraclass correlation coefficient. A high correlation was found between mean GOALS score (r = 0.879, p = 0.021) and mean OSATS score. The GOALS score increased significantly across the 6 procedures (p = 0.002). The trainees performed significantly better on their sixth when compared with their first cholecystectomy (p = 0.004). The consistency agreement interrater reliability was 0.37 for the mean GOALS score (p = 0.002) and 0.55 for overall competence (p < 0.001) of the 3 video fragments. The validity observed in this randomized blinded longitudinal study supports the existing evidence that GOALS is a valid tool for assessment of novice trainees. A relatively low reliability was found in this study. Copyright © 2014 Association of Program Directors in Surgery. Published by Elsevier Inc. All rights reserved.
Dueñas, María; Mendonça, Liliane; Sampaio, Rute; Gouvinhas, Cláudia; Oliveira, Daniela; Castro-Lopes, José Manuel; Azevedo, Luís Filipe
2017-03-01
The Bowel Function Index (BFI) is a simple and sound bowel function and opioid-induced constipation (OIC) screening tool. We aimed to develop the translation and cultural adaptation of this measure (BFI-P) and to assess its reliability and validity for the Portuguese language and a chronic pain population. The BFI-P was created after a process including translation, back translation and cultural adaptation. Participants (n = 226) were recruited in a chronic pain clinic and were assessed at baseline and after one week. Internal consistency, test-retest reliability, responsiveness, construct (convergent and known groups) and factorial validity were assessed. Test-retest reliability had an intra-class correlation of 0.605 for BFI mean score. Internal consistency of BFI had Cronbach's alpha of 0.865. The construct validity of BFI-P was shown to be excellent and the exploratory factor analysis confirmed its unidimensional structure. The responsiveness of BFI-P was excellent, with a suggested 17-19 point and 8-12 point change in score constituting a clinically relevant change in constipation for patients with and without previous constipation, respectively. This study had some limitations, namely, the criterion validity of BFI-P was not directly assessed; and the absence of a direct criterion for OIC precluded the assessment of the criterion based responsiveness of BFI-P. Nevertheless, BFI may importantly contribute to better OIC screening and its Portuguese version (BFI-P) has been shown to have excellent reliability, internal consistency, validity and responsiveness. Further suggestions regarding statistically and clinically important change cut-offs for this instrument are presented.
Development of the CarMen-Q Questionnaire for mental workload assessment.
Rubio-Valdehita, Susana; López-Núñez, María I; López-Higes, Ramón; Díaz-Ramiro, Eva M
2017-11-01
Mental workload has emerged as one of the most important occupational risk factors present in most psychological and physical diseases caused by work. In view of the lack of specific tools to assess mental workload, the objective of this research was to assess the construct validity and reliability of a new questionnaire for mental workload assessment (CarMen-Q). The sample was composed of 884 workers from several professional sectors, between 18 and 65 years old, 53.4% men and 46.6% women. To evaluate the validity based on relationships with other measures, the NASA-TLX scale was also administered. Confirmatory factor analysis showed an internal structure made up of four dimensions: cognitive, temporal and emotional demands and performance requirement. The results show satisfactory evidence of validity based on relationships with NASA-TLX and good reliability. The questionnaire has good psychometric properties and can be an easy, brief, useful tool for mental workload diagnosis and prevention.
Shayan, Zahra; Pourmovahed, Zahra; Najafipour, Fatemeh; Abdoli, Ali Mohammad; Mohebpour, Fatemeh; Najafipour, Sedighe
2015-12-01
Nowadays, infertility problems have become a social concern, and are associated with multiple psychological and social problems. Also, it affects the interpersonal communication between the individual, familial, and social characteristics. Since women are exposed to stressors of physical, mental, social factors, and treatment of infertility, providing a psychometric screening tool is necessary for disorders of this group. The aim of this study was to determine the factor structure of the general health questionnaire-28 to discover mental disorders in infertile women. In this study, 220 infertile women undergoing treatment of infertility were selected from the Yazd Research and Clinical Center for Infertility with convenience sampling in 2011. After completing the general health questionnaire by the project manager, validity and, reliability of the questionnaire were calculated by confirmatory factor structure and Cronbach's alpha, respectively. Four factors, including anxiety and insomnia, social dysfunction, depression, and physical symptoms were extracted from the factor structure. 50.12% of the total variance was explained by four factors. The reliability coefficient of the questionnaire was obtained 0.90. Analysis of the factor structure and reliability of General Health Questionnaire-28 showed that it is suitable as a screening instrument for assessing general health of infertile women.
Mak, Kwok-Kei; Nam, JeeEun Karin; Kim, Dongil; Aum, Narae; Choi, Jung-Seok; Cheng, Cecilia; Ko, Huei-Chen; Watanabe, Hiroko
2017-03-01
The Korean Scale for Internet Addiction (K-Scale) was developed in Korea for assessing addictive internet behaviors. This study aims to adopt K-Scale and examine its psychometric properties in Japanese adolescents. In 2014, 589 (36.0% boys) high school students (Grade 10-12) from Japan completed a survey, including items of Japanese versions of K-Scale and Smartphone Scale for Smartphone Addiction (S-Scale). Model fit indices of the original four-factor structure, three-factor structure obtained from exploratory factor analysis, and improved two-factor structure of K-Scale were computed using confirmatory factor analysis, with internal reliability of included items reported. The convergent validity of K-Scale was tested against self-rated internet addiction, and S-Scale using multiple regression models. The results showed that a second-order two-factor 13-item structure was the most parsimonious model (NFI=0.919, NNFI=0.935, CFI=0.949, and RMSEA=0.05) with good internal reliability (Cronbach's alpha=0.87). The two factors revealed were "Disturbance of Adaptation and Life Orientation" and "Withdrawal and Tolerance". Moreover, the correlation between internet user classifications defined by K-Scale and self-rating was significant. K-Scale total score was significantly and positively associated with S-Scale total (adjusted R 2 =0.440) and subscale scores (adjusted R 2 =0.439). In conclusion, K-Scale is a valid and reliable assessment scale of internet addiction for Japanese high school students after modifications. Copyright © 2017. Published by Elsevier B.V.
Keowmani, Thamron; Lee, Lily Wong Lee
2016-01-01
To study the validity and reliability of the Malay version of the Specific Thalassemia Quality of Life Instrument (STQOLI) in Sabah's adult thalassemia patients. This cross-sectional study was done at Thalassemia Treatment Centre, Queen Elizabeth Hospital in Sabah, Malaysia. Eighty-two adult thalassemia patients who fulfilled the inclusion and exclusion criteria were conveniently selected for participation in the study. The English version of STQOLI was translated into Malay by using forward and back translations. The content of the questionnaire was validated by the chief hematologist of the hospital. The construct validity of the 40-item questionnaire was assessed by principal component analysis with varimax rotation and the scale reliability was assessed by Cronbach's alpha. The study failed to replicate the internal structure of the Greek STQOLI. Instead, 12 factors have been identified from the exploratory factor analysis, which accounted for 72.2% of the variance. However, only eight factors were interpretable. The factors were iron chelation pump impact, transfusion impact, time spent on treatment and its impact on work and social life, sex life, side effects of treatment, cardiovascular problems, psychology, and iron chelation pill impact. The overall scale reliability was 0.913. This study was unable to replicate the internal structure of the Greek STQOLI in Sabah's adult thalassemia patients. Instead, a new structure has emerged that can be used as a guide to develop a questionnaire specific for adult thalassemia patients in Sabah. Future research should focus on the eight factors identified from this study.
Koloski, N A; Jones, M; Hammer, J; von Wulffen, M; Shah, A; Hoelz, H; Kutyla, M; Burger, D; Martin, N; Gurusamy, S R; Talley, N J; Holtmann, G
2017-08-01
The clinical assessments of patients with gastrointestinal symptoms can be time-consuming, and the symptoms captured during the consultation may be influenced by a variety of patient and non-patient factors. To facilitate standardized symptom assessment in the routine clinical setting, we developed the Structured Assessment of Gastrointestinal Symptom (SAGIS) instrument to precisely characterize symptoms in a routine clinical setting. We aimed to validate SAGIS including its reliability, construct and discriminant validity, and utility in the clinical setting. Development of the SAGIS consisted of initial interviews with patients referred for the diagnostic work-up of digestive symptoms and relevant complaints identified. The final instrument consisted of 22 items as well as questions on extra intestinal symptoms and was given to 1120 consecutive patients attending a gastroenterology clinic randomly split into derivation (n = 596) and validation datasets (n = 551). Discriminant validity along with test-retest reliability was assessed. The time taken to perform a clinical assessment with and without the SAGIS was recorded along with doctor satisfaction with this tool. Exploratory factor analysis conducted on the derivation sample suggested five symptom constructs labeled as abdominal pain/discomfort (seven items), gastroesophageal reflux disease/regurgitation symptoms (four items), nausea/vomiting (three items), diarrhea/incontinence (five items), and difficult defecation and constipation (2 items). Confirmatory factor analysis conducted on the validation sample supported the initially developed five-factor measurement model ([Formula: see text], p < 0.0001, χ 2 /df = 4.6, CFI = 0.90, TLI = 0.88, RMSEA = 0.08). All symptom groups demonstrated differentiation between disease groups. The SAGIS was shown to be reliable over time and resulted in a 38% reduction of the time required for clinical assessment. The SAGIS instrument has excellent psychometric properties and supports the clinical assessment of and symptom-based categorization of patients with a wide spectrum of gastrointestinal symptoms.
Filippou, Georgios; Scirè, Carlo A; Damjanov, Nemanja; Adinolfi, Antonella; Carrara, Greta; Picerno, Valentina; Toscano, Carmela; Bruyn, George A; D'Agostino, Maria Antonietta; Delle Sedie, Andrea; Filippucci, Emilio; Gutierrez, Marwin; Micu, Mihaela; Möller, Ingrid; Naredo, Esperanza; Pineda, Carlos; Porta, Francesco; Schmidt, Wolfgang A; Terslev, Lene; Vlad, Violeta; Zufferey, Pascal; Iagnocco, Annamaria
2017-11-01
To define the ultrasonographic characteristics of calcium pyrophosphate crystal (CPP) deposits in joints and periarticular tissues and to evaluate the intra- and interobserver reliability of expert ultrasonographers in the assessment of CPP deposition disease (CPPD) according to the new definitions. After a systematic literature review, a Delphi survey was circulated among a group of expert ultrasonographers, who were members of the CPPD Ultrasound (US) Outcome Measures in Rheumatology (OMERACT) subtask force, to obtain definitions of the US characteristics of CPPD at the level of fibrocartilage (FC), hyaline cartilage (HC), tendon, and synovial fluid (SF). Subsequently, the reliability of US in assessing CPPD at knee and wrist levels according to the agreed definitions was tested in static images and in patients with CPPD. Cohen's κ was used for statistical analysis. HC and FC of the knee yielded the highest interobserver κ values among all the structures examined, in both the Web-based (0.73 for HC and 0.58 for FC) and patient-based exercises (0.55 for the HC and 0.64 for the FC). Kappa values for the other structures were lower, ranging from 0.28 in tendons to 0.50 in SF in the static exercise and from 0.09 (proximal patellar tendon) to 0.27 (triangular FC of the wrist) in the patient-based exercise. The new OMERACT definitions for the US identification of CPPD proved to be reliable at the level of the HC and FC of the knee. Further studies are needed to better define the US characteristics of CPPD and optimize the scanning technique in other anatomical sites.
Savoia, Elena; Biddinger, Paul D; Burstein, Jon; Stoto, Michael A
2010-01-01
As proxies for actual emergencies, drills and exercises can raise awareness, stimulate improvements in planning and training, and provide an opportunity to examine how different components of the public health system would combine to respond to a challenge. Despite these benefits, there remains a substantial need for widely accepted and prospectively validated tools to evaluate agencies' and hospitals' performance during such events. Unfortunately, to date, few studies have focused on addressing this need. The purpose of this study was to assess the validity and reliability of a qualitative performance assessment tool designed to measure hospitals' communication and operational capabilities during a functional exercise. The study population included 154 hospital personnel representing nine hospitals that participated in a functional exercise in Massachusetts in June 2008. A 25-item questionnaire was developed to assess the following three hospital functional capabilities: (1) inter-agency communication; (2) communication with the public; and (3) disaster operations. Analyses were conducted to examine internal consistency, associations among scales, the empirical structure of the items, and inter-rater agreement. Twenty-two questions were retained in the final instrument, which demonstrated reliability with alpha coefficients of 0.83 or higher for all scales. A three-factor solution from the principal components analysis accounted for 57% of the total variance, and the factor structure was consistent with the original hypothesized domains. Inter-rater agreement between participants' self reported scores and external evaluators' scores ranged from moderate to good. The resulting 22-item performance measurement tool reliably measured hospital capabilities in a functional exercise setting, with preliminary evidence of concurrent and criterion-related validity.
Gagné, Myriam; Boulet, Louis-Philippe; Pérez, Norma; Moisan, Jocelyne
2018-04-30
To systematically identify the measurement properties of patient-reported outcome instruments (PROs) that evaluate adherence to inhaled maintenance medication in adults with asthma. We conducted a systematic review of six databases. Two reviewers independently included studies on the measurement properties of PROs that evaluated adherence in asthmatic participants aged ≥18 years. Based on the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN), the reviewers (1) extracted data on internal consistency, reliability, measurement error, content validity, structural validity, hypotheses testing, cross-cultural validity, criterion validity, and responsiveness; (2) assessed the methodological quality of the included studies; (3) assessed the quality of the measurement properties (positive or negative); and (4) summarised the level of evidence (limited, moderate, or strong). We screened 6,068 records and included 15 studies (14 PROs). No studies evaluated measurement error or responsiveness. Based on methodological and measurement property quality assessments, we found limited positive evidence of: (a) internal consistency of the Adherence Questionnaire, Refined Medication Adherence Reason Scale (MAR-Scale), Medication Adherence Report Scale for Asthma (MARS-A), and Test of the Adherence to Inhalers (TAI); (b) reliability of the TAI; and (c) structural validity of the Adherence Questionnaire, MAR-Scale, MARS-A, and TAI. We also found limited negative evidence of: (d) hypotheses testing of Adherence Questionnaire; (e) reliability of the MARS-A; and (f) criterion validity of the MARS-A and TAI. Our results highlighted the need to conduct further high-quality studies that will positively evaluate the reliability, validity, and responsiveness of the available PROs. This article is protected by copyright. All rights reserved.
2013-01-01
Background The assessment of personality organization and its observable behavioral manifestations, i.e. personality functioning, has a long tradition in psychodynamic psychiatry. Recently, the DSM-5 Levels of Personality Functioning Scale has moved it into the focus of psychiatric diagnostics. Based on Kernberg’s concept of personality organization the Structured Interview of Personality Organization (STIPO) was developed for diagnosing personality functioning. The STIPO covers seven dimensions: (1) identity, (2) object relations, (3) primitive defenses, (4) coping/rigidity, (5) aggression, (6) moral values, and (7) reality testing and perceptual distortions. The English version of the STIPO has previously revealed satisfying psychometric properties. Methods Validity and reliability of the German version of the 100-item instrument have been evaluated in 122 psychiatric patients. All patients were diagnosed according to the Diagnostic and Statistical Manual for Mental Disorders (DSM-IV) and were assessed by means of the STIPO. Moreover, all patients completed eight questionnaires that served as criteria for external validity of the STIPO. Results Interrater reliability varied between intraclass correlations of .89 and 1.0, Crohnbach’s α for the seven dimensions was .69 to .93. All a priori selected questionnaire scales correlated significantly with the corresponding STIPO dimensions. Patients with personality disorder (PD) revealed significantly higher STIPO scores (i.e. worse personality functioning) than patients without PD; patients cluster B PD showed significantly higher STIPO scores than patients with cluster C PD. Conclusions Interrater reliability, Crohnbach’s α, concurrent validity, and differential validity of the STIPO are satisfying. The STIPO represents an appropriate instrument for the assessment of personality functioning in clinical and research settings. PMID:23941404
Grover, S; Chakrabarti, S; Ghormode, D; Dutt, A; Kate, N; Kulhara, P
2011-12-01
The Involvement Evaluation Questionnaire (IEQ) is a comprehensive, conceptually valid and reliable means of assessing caregiver burden. However, its psychometric properties have rarely been examined in non-European settings. The aim of the present study was to evaluate the psychometric properties of an Indian translation of the IEQ (Hindi-IEQ). The European Union (English) version of IEQ was translated into Hindi and reviewed by a group of experts and caregivers for translation accuracy, cultural appropriateness, and for relevance and acceptability of items and constructs. The Hindi-IEQ was then administered to 162 primary caregivers of patients with severe mental illnesses. Eighteen caregivers completed both the English and Hindi versions to check the level of agreement between them. Another 27 completed the Hindi-IEQ twice, a week apart, to evaluate its test-retest reliability. Factor structure of the Hindi-IEQ was examined using an exploratory, principal components and factor analysis. Pearson's correlation coefficients were significant for 24 items, while intraclass correlation coefficients were significant for 28 of the 31 items (P < 0.05), indicating a satisfactory level of agreement between the Hindi and English versions. Test-retest reliability for all items of the Hindi-IEQ was adequate, with kappa values ranging from 0.46 to 0.95 and intraclass correlation coefficients from 0.76 to 1.00. Internal consistency (Cronbach's alpha = 0.89) and the split-half reliability (Spearman-Brown coefficient = 0.68) of the Hindi-IEQ were also satisfactory. However, several differences were noted in the factor structure and distribution of scores of the Hindi-IEQ, which were quite unlike that of the European Union version. The similarities and differences between the 2 versions of the IEQ indicated that sociocultural factors could influence assessment of caregiver burden across different cultures.
Evaluating the care of general medicine inpatients: how good is implicit review?
Hayward, R A; McMahon, L F; Bernard, A M
1993-04-01
Peer review often consists of implicit evaluations by physician reviewers of the quality and appropriateness of care. This study evaluated the ability of implicit review to measure reliably various aspects of care on a general medicine inpatient service. Retrospective review of patients' charts, using structured implicit review, of a stratified random sample of consecutive admissions to a general medicine ward. A university teaching hospital. Twelve internists were trained in structured implicit review and reviewed 675 patient admissions (with 20% duplicate reviews for a total of 846 reviews). Although inter-rater reliabilities for assessments of overall quality of care and preventable deaths (kappa = 0.5) were adequate for aggregate comparisons (for example, comparing mean ratings on two hospital wards), they were inadequate for reliable evaluations of single patients using one or two reviewers. Reviewers' agreement about most focused quality problems (for example, timeliness of diagnostic evaluation and clinical readiness at time of discharge) and about the appropriateness of hospital ancillary resource use was poor (kappa < or = 0.2). For most focused implicit measures, bias due to specific reviewers who were systematically more harsh or lenient (particularly for evaluation of resource-use appropriateness) accounted for much of the variation in reviewers' assessments, but this was not a substantial problem for the measure of overall quality. Reviewers rarely reported being unable to evaluate the quality of care because of deficiencies in documentation in the patient's chart. For assessment of overall quality and preventable deaths of general medicine inpatients, implicit review by peers had moderate degrees of reliability, but for most other specific aspects of care, physician reviewers could not agree. Implicit review was particularly unreliable at evaluating the appropriateness of hospital resource use and the patient's readiness for discharge, two areas where this type of review is often used.
Reliability Assessment of GaN Power Switches
2015-04-17
Possibilities for single event burnout testing were examined as well. Device simulation under the conditions of some of the testing was performed on...reverse-bias (HTRB) and single electron burnout (SEE) tests. 8. Refine test structures, circuits, and procedures, and, if possible, develop
Peer Evaluation Can Reliably Measure Local Knowledge
ERIC Educational Resources Information Center
Reyes-García, Victoria; Díaz-Reviriego, Isabel; Duda, Romain; Fernández-Llamazares, Álvaro; Gallois, Sandrine; Guèze, Maximilien; Napitupulu, Lucentezza; Pyhälä, Aili
2016-01-01
We assess the consistency of measures of individual local ecological knowledge obtained through peer evaluation against three standard measures: identification tasks, structured questionnaires, and self-reported skills questionnaires. We collected ethnographic information among the Baka (Congo), the Punan (Borneo), and the Tsimane' (Amazon) to…
ERIC Educational Resources Information Center
LaBelle, Sara; Johnson, Zac D.
2018-01-01
Three studies were conducted to generate a valid and reliable instrument to measure student-to-student confirmation. Study One (N = 396) sought to establish a factor structure based on previous research. Study Two (N = 396) sought to confirm this factor structure and assess criterion-related validity. Study Three (N = 283) sought to assess…
Andreu, Yolanda; Galdon, Maria J; Durá, Estrella; Ferrando, Maite; Pascual, Juan; Turk, Dennis C; Jiménez, Yolanda; Poveda, Rafael
2006-01-01
Background This paper seeks to analyse the psychometric and structural properties of the Multidimensional Pain Inventory (MPI) in a sample of temporomandibular disorder patients. Methods The internal consistency of the scales was obtained. Confirmatory Factor Analysis was carried out to test the MPI structure section by section in a sample of 114 temporomandibular disorder patients. Results Nearly all scales obtained good reliability indexes. The original structure could not be totally confirmed. However, with a few adjustments we obtained a satisfactory structural model of the MPI which was slightly different from the original: certain items and the Self control scale were eliminated; in two cases, two original scales were grouped in one factor, Solicitous and Distracting responses on the one hand, and Social activities and Away from home activities, on the other. Conclusion The MPI has been demonstrated to be a reliable tool for the assessment of pain in temporomandibular disorder patients. Some divergences to be taken into account have been clarified. PMID:17169143
In-service health monitoring of composite structures
NASA Technical Reports Server (NTRS)
Pinto, Gino A.; Ventres, C. S.; Ginty, Carol A.; Chamis, Christos C.
1990-01-01
The aerospace industry is witnessing a vast utilization of composites in critical structural applications and anticipates even more use of them in future aircraft. Therefore, a definite need exists for a composite health monitoring expert system to meet today's current needs and tomorrow's future demands. The primary goal for this conceptual health monitoring system is functional reliably for in-service operation in the environments of various composite structures. The underlying philosophy of this system is to utilize proven vibration techniques to assess the structural integrity of a fibrous composite. Statistical methods are used to determine if the variances in the measured data are acceptable for making a reliable decision on the health status of the composite. The flexible system allows for algorithms describing any composite fatigue or damage behavior characteristic to be provided as an input to the system. Alert thresholds and variances can also be provided as an input to this system and may be updated to allow for future changes/refinements in the composite's structural integrity behavior.
Moniz, Tracy; Arntfield, Shannon; Miller, Kristina; Lingard, Lorelei; Watling, Chris; Regehr, Glenn
2015-09-01
Reflective writing is a popular tool to support the growth of reflective capacity in undergraduate medical learners. Its popularity stems from research suggesting that reflective capacity may lead to improvements in skills such as empathy, communication, collaboration and professionalism. This has led to assumptions that reflective writing can also serve as a tool for student assessment. However, evidence to support the reliability and validity of reflective writing as a meaningful assessment strategy is lacking. Using a published instrument for measuring 'reflective capacity' (the Reflection Evaluation for Learners' Enhanced Competencies Tool [REFLECT]), four trained raters independently scored four samples of writing from each of 107 undergraduate medical students to determine the reliability of reflective writing scores. REFLECT scores were then correlated with scores on a Year 4 objective structured clinical examination (OSCE) and Year 2 multiple-choice question (MCQ) examinations to examine, respectively, convergent and divergent validity. Across four writing samples, four-rater Cronbach's α-values ranged from 0.72 to 0.82, demonstrating reasonable inter-rater reliability with four raters using the REFLECT rubric. However, inter-sample reliability was fairly low (four-sample Cronbach's α = 0.54, single-sample intraclass correlation coefficient: 0.23), which suggests that performance on one reflective writing sample was not strongly indicative of performance on the next. Approximately 14 writing samples are required to achieve reasonable inter-sample reliability. The study found weak, non-significant correlations between reflective writing scores and both OSCE global scores (r = 0.13) and MCQ examination scores (r = 0.10), demonstrating a lack of relationship between reflective writing and these measures of performance. Our findings suggest that to draw meaningful conclusions about reflective capacity as a stable construct in individuals requires 14 writing samples per student, each assessed by four or five raters. This calls into question the feasibility and utility of using reflective writing rigorously as an assessment tool in undergraduate medical education. © 2015 John Wiley & Sons Ltd.
Imura, Tomoya; Takamura, Masahiro; Okazaki, Yoshihiro; Tokunaga, Satoko
2016-10-01
We developed a scale to measure time management and assessed its reliability and validity. We then used this scale to examine the impact of time management on psychological stress response. In Study 1-1, we developed the scale and assessed its internal consistency and criterion-related validity. Findings from a factor analysis revealed three elements of time management, “time estimation,” “time utilization,” and “taking each moment as it comes.” In Study 1-2, we assessed the scale’s test-retest reliability. In Study 1-3, we assessed the validity of the constructed scale. The results indicate that the time management scale has good reliability and validity. In Study 2, we performed a covariance structural analysis to verify our model that hypothesized that time management influences perceived control of time and psychological stress response, and perceived control of time influences psychological stress response. The results showed that time estimation increases the perceived control of time, which in turn decreases stress response. However, we also found that taking each moment as it comes reduces perceived control of time, which in turn increases stress response.
van der Heijde, Désirée; Braun, Jürgen; Deodhar, Atul; Baraliakos, Xenofon; Landewé, Robert; Richards, Hanno B; Porter, Brian; Readie, Aimee
2018-05-30
In ankylosing spondylitis (AS), structural damage that occurs as a result of syndesmophyte formation and ankylosis of the vertebral column is irreversible. Structural damage is currently assessed by conventional radiography and scoring systems that reliably assess radiographic structural damage are needed to capture the differential effects of drugs on structural damage progression. The validity of the modified Stoke Ankylosing Spondylitis Spinal Score (mSASSS) as a primary outcome measure in evaluating the effect of AS treatments on radiographic progression rates was assessed in this review. The mSASSS has not been used, to date, as a primary outcome measure in a prospective randomized controlled clinical trial of biologic therapy in AS. This review of the medical literature confirmed that the mSASSS is the most validated and widely used method for assessing radiographic progression in AS, correlating with worsening measures of disease signs and symptoms, spinal mobility and physical function, with a 2-year interval being required to ensure sufficient sensitivity to change.
Diviani, Nicola; Dima, Alexandra Lelia; Schulz, Peter Johannes
2017-04-11
The eHealth Literacy Scale (eHEALS) is a tool to assess consumers' comfort and skills in using information technologies for health. Although evidence exists of reliability and construct validity of the scale, less agreement exists on structural validity. The aim of this study was to validate the Italian version of the eHealth Literacy Scale (I-eHEALS) in a community sample with a focus on its structural validity, by applying psychometric techniques that account for item difficulty. Two Web-based surveys were conducted among a total of 296 people living in the Italian-speaking region of Switzerland (Ticino). After examining the latent variables underlying the observed variables of the Italian scale via principal component analysis (PCA), fit indices for two alternative models were calculated using confirmatory factor analysis (CFA). The scale structure was examined via parametric and nonparametric item response theory (IRT) analyses accounting for differences between items regarding the proportion of answers indicating high ability. Convergent validity was assessed by correlations with theoretically related constructs. CFA showed a suboptimal model fit for both models. IRT analyses confirmed all items measure a single dimension as intended. Reliability and construct validity of the final scale were also confirmed. The contrasting results of factor analysis (FA) and IRT analyses highlight the importance of considering differences in item difficulty when examining health literacy scales. The findings support the reliability and validity of the translated scale and its use for assessing Italian-speaking consumers' eHealth literacy. ©Nicola Diviani, Alexandra Lelia Dima, Peter Johannes Schulz. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 11.04.2017.
A Second-Order Confirmatory Factor Analysis of the Moral Distress Scale-Revised for Nurses.
Sharif Nia, Hamid; Shafipour, Vida; Allen, Kelly-Ann; Heidari, Mohammad Reza; Yazdani-Charati, Jamshid; Zareiyan, Armin
2017-01-01
Moral distress is a growing problem for healthcare professionals that may lead to dissatisfaction, resignation, or occupational burnout if left unattended, and nurses experience different levels of this phenomenon. This study aims to investigate the factor structure of the Persian version of the Moral Distress Scale-Revised in intensive care and general nurses. This methodological research was conducted with 771 nurses from eight hospitals in the Mazandaran Province of Iran in 2017. Participants completed the Moral Distress Scale-Revised, data collected, and factor structure assessed using the construct, convergent, and divergent validity methods. The reliability of the scale was assessed using internal consistency (Cronbach's alpha, Theta, and McDonald's omega coefficients) and construct reliability. Ethical considerations: This study was approved by the Ethics Committee of Mazandaran University of Medical Sciences. The exploratory factor analysis ( N = 380) showed that the Moral Distress Scale-Revised has five factors: lack of professional competence at work, ignoring ethical issues and patient conditions, futile care, carrying out the physician's orders without question and unsafe care, and providing care under personal and organizational pressures, which explained 56.62% of the overall variance. The confirmatory factor analysis ( N = 391) supported the five-factor solution and the second-order latent factor model. The first-order model did not show a favorable convergent and divergent validity. Ultimately, the Moral Distress Scale-Revised was found to have a favorable internal consistency and construct reliability. The Moral Distress Scale-Revised was found to be a multidimensional construct. The data obtained confirmed the hypothesis of the factor structure model with a latent second-order variable. Since the convergent and divergent validity of the scale were not confirmed in this study, further assessment is necessary in future studies.
Bayesian Chance-Constrained Hydraulic Barrier Design under Geological Structure Uncertainty.
Chitsazan, Nima; Pham, Hai V; Tsai, Frank T-C
2015-01-01
The groundwater community has widely recognized geological structure uncertainty as a major source of model structure uncertainty. Previous studies in aquifer remediation design, however, rarely discuss the impact of geological structure uncertainty. This study combines chance-constrained (CC) programming with Bayesian model averaging (BMA) as a BMA-CC framework to assess the impact of geological structure uncertainty in remediation design. To pursue this goal, the BMA-CC method is compared with traditional CC programming that only considers model parameter uncertainty. The BMA-CC method is employed to design a hydraulic barrier to protect public supply wells of the Government St. pump station from salt water intrusion in the "1500-foot" sand and the "1700-foot" sand of the Baton Rouge area, southeastern Louisiana. To address geological structure uncertainty, three groundwater models based on three different hydrostratigraphic architectures are developed. The results show that using traditional CC programming overestimates design reliability. The results also show that at least five additional connector wells are needed to achieve more than 90% design reliability level. The total amount of injected water from the connector wells is higher than the total pumpage of the protected public supply wells. While reducing the injection rate can be achieved by reducing the reliability level, the study finds that the hydraulic barrier design to protect the Government St. pump station may not be economically attractive. © 2014, National Ground Water Association.
Bulk electric system reliability evaluation incorporating wind power and demand side management
NASA Astrophysics Data System (ADS)
Huang, Dange
Electric power systems are experiencing dramatic changes with respect to structure, operation and regulation and are facing increasing pressure due to environmental and societal constraints. Bulk electric system reliability is an important consideration in power system planning, design and operation particularly in the new competitive environment. A wide range of methods have been developed to perform bulk electric system reliability evaluation. Theoretically, sequential Monte Carlo simulation can include all aspects and contingencies in a power system and can be used to produce an informative set of reliability indices. It has become a practical and viable tool for large system reliability assessment technique due to the development of computing power and is used in the studies described in this thesis. The well-being approach used in this research provides the opportunity to integrate an accepted deterministic criterion into a probabilistic framework. This research work includes the investigation of important factors that impact bulk electric system adequacy evaluation and security constrained adequacy assessment using the well-being analysis framework. Load forecast uncertainty is an important consideration in an electrical power system. This research includes load forecast uncertainty considerations in bulk electric system reliability assessment and the effects on system, load point and well-being indices and reliability index probability distributions are examined. There has been increasing worldwide interest in the utilization of wind power as a renewable energy source over the last two decades due to enhanced public awareness of the environment. Increasing penetration of wind power has significant impacts on power system reliability, and security analyses become more uncertain due to the unpredictable nature of wind power. The effects of wind power additions in generating and bulk electric system reliability assessment considering site wind speed correlations and the interactive effects of wind power and load forecast uncertainty on system reliability are examined. The concept of the security cost associated with operating in the marginal state in the well-being framework is incorporated in the economic analyses associated with system expansion planning including wind power and load forecast uncertainty. Overall reliability cost/worth analyses including security cost concepts are applied to select an optimal wind power injection strategy in a bulk electric system. The effects of the various demand side management measures on system reliability are illustrated using the system, load point, and well-being indices, and the reliability index probability distributions. The reliability effects of demand side management procedures in a bulk electric system including wind power and load forecast uncertainty considerations are also investigated. The system reliability effects due to specific demand side management programs are quantified and examined in terms of their reliability benefits.
HDMR methods to assess reliability in slope stability analyses
NASA Astrophysics Data System (ADS)
Kozubal, Janusz; Pula, Wojciech; Vessia, Giovanna
2014-05-01
Stability analyses of complex rock-soil deposits shall be tackled considering the complex structure of discontinuities within rock mass and embedded soil layers. These materials are characterized by a high variability in physical and mechanical properties. Thus, to calculate the slope safety factor in stability analyses two issues must be taken into account: 1) the uncertainties related to structural setting of the rock-slope mass and 2) the variability in mechanical properties of soils and rocks. High Dimensional Model Representation (HDMR) (Chowdhury et al. 2009; Chowdhury and Rao 2010) can be used to carry out the reliability index within complex rock-soil slopes when numerous random variables with high coefficient of variations are considered. HDMR implements the inverse reliability analysis, meaning that the unknown design parameters are sought provided that prescribed reliability index values are attained. Such approach uses implicit response functions according to the Response Surface Method (RSM). The simple RSM can be efficiently applied when less than four random variables are considered; as the number of variables increases, the efficiency in reliability index estimation decreases due to the great amount of calculations. Therefore, HDMR method is used to improve the computational accuracy. In this study, the sliding mechanism in Polish Flysch Carpathian Mountains have been studied by means of HDMR. The Southern part of Poland where Carpathian Mountains are placed is characterized by a rather complicated sedimentary pattern of flysh rocky-soil deposits that can be simplified into three main categories: (1) normal flysch, consisting of adjacent sandstone and shale beds of approximately equal thickness, (2) shale flysch, where shale beds are thicker than adjacent sandstone beds, and (3) sandstone flysch, where the opposite holds. Landslides occur in all flysch deposit types thus some configurations of possible unstable settings (within fractured rocky-soil masses) resulting in sliding mechanisms have been investigated in this study. The reliability indices values drawn from the HDRM method have been compared with conventional approaches as neural networks: the efficiency of HDRM is shown in the case studied. References Chowdhury R., Rao B.N. and Prasad A.M. 2009. High-dimensional model representation for structural reliability analysis. Commun. Numer. Meth. Engng, 25: 301-337. Chowdhury R. and Rao B. 2010. Probabilistic Stability Assessment of Slopes Using High Dimensional Model Representation. Computers and Geotechnics, 37: 876-884.
NASA Applications and Lessons Learned in Reliability Engineering
NASA Technical Reports Server (NTRS)
Safie, Fayssal M.; Fuller, Raymond P.
2011-01-01
Since the Shuttle Challenger accident in 1986, communities across NASA have been developing and extensively using quantitative reliability and risk assessment methods in their decision making process. This paper discusses several reliability engineering applications that NASA has used over the year to support the design, development, and operation of critical space flight hardware. Specifically, the paper discusses several reliability engineering applications used by NASA in areas such as risk management, inspection policies, components upgrades, reliability growth, integrated failure analysis, and physics based probabilistic engineering analysis. In each of these areas, the paper provides a brief discussion of a case study to demonstrate the value added and the criticality of reliability engineering in supporting NASA project and program decisions to fly safely. Examples of these case studies discussed are reliability based life limit extension of Shuttle Space Main Engine (SSME) hardware, Reliability based inspection policies for Auxiliary Power Unit (APU) turbine disc, probabilistic structural engineering analysis for reliability prediction of the SSME alternate turbo-pump development, impact of ET foam reliability on the Space Shuttle System risk, and reliability based Space Shuttle upgrade for safety. Special attention is given in this paper to the physics based probabilistic engineering analysis applications and their critical role in evaluating the reliability of NASA development hardware including their potential use in a research and technology development environment.
Ball, Sarah C; Benjamin, Sara E; Ward, Dianne S
2007-04-01
To our knowledge, a direct observation protocol for assessing dietary intake among young children in child care has not been published. This article reviews the development and testing of a diet observation system for child care facilities that occurred during a larger intervention trial. Development of this system was divided into five phases, done in conjunction with a larger intervention study; (a) protocol development, (b) training of field staff, (c) certification of field staff in a laboratory setting, (d) implementation in a child-care setting, and (e) certification of field staff in a child-care setting. During the certification phases, methods were used to assess the accuracy and reliability of all observers at estimating types and amounts of food and beverages commonly served in child care. Tests of agreement show strong agreement among five observers, as well as strong accuracy between the observers and 20 measured portions of foods and beverages with a mean intraclass correlation coefficient value of 0.99. This structured observation system shows promise as a valid and reliable approach for assessing dietary intake of children in child care and makes a valuable contribution to the growing body of literature on the dietary assessment of young children.
Development and Initial Psychometrics of Counseling Supervisor's Behavior Questionnaire
ERIC Educational Resources Information Center
Lee, Ahram; Park, Eun Hye; Byeon, Eunji; Lee, Sang Min
2016-01-01
This study describes the development and psychometric properties of the Counseling Supervisor's Behavior Questionnaire, designed to assess the specific behaviors of supervisors, which can be observed by supervisees during supervision sessions. Factor structure, construct and concurrent validity, and internal consistency reliability of the…
Steenson, Sharalyn; Özcebe, Hilal; Arslan, Umut; Konşuk Ünlü, Hande; Araz, Özgür M; Yardim, Mahmut; Üner, Sarp; Bilir, Nazmi; Huang, Terry T-K
2018-01-01
Childhood obesity rates have been rising rapidly in developing countries. A better understanding of the risk factors and social context is necessary to inform public health interventions and policies. This paper describes the validation of several measurement scales for use in Turkey, which relate to child and parent perceptions of physical activity (PA) and enablers and barriers of physical activity in the home environment. The aim of this study was to assess the validity and reliability of several measurement scales in Turkey using a population sample across three socio-economic strata in the Turkish capital, Ankara. Surveys were conducted in Grade 4 children (mean age = 9.7 years for boys; 9.9 years for girls), and their parents, across 6 randomly selected schools, stratified by SES (n = 641 students, 483 parents). Construct validity of the scales was evaluated through exploratory and confirmatory factor analysis. Internal consistency of scales and test-retest reliability were assessed by Cronbach's alpha and intra-class correlation. The scales as a whole were found to have acceptable-to-good model fit statistics (PA Barriers: RMSEA = 0.076, SRMR = 0.0577, AGFI = 0.901; PA Outcome Expectancies: RMSEA = 0.054, SRMR = 0.0545, AGFI = 0.916, and PA Home Environment: RMSEA = 0.038, SRMR = 0.0233, AGFI = 0.976). The PA Barriers subscales showed good internal consistency and poor to fair test-retest reliability (personal α = 0.79, ICC = 0.29, environmental α = 0.73, ICC = 0.59). The PA Outcome Expectancies subscales showed good internal consistency and test-retest reliability (negative α = 0.77, ICC = 0.56; positive α = 0.74, ICC = 0.49). Only the PA Home Environment subscale on support for PA was validated in the final confirmatory model; it showed moderate internal consistency and test-retest reliability (α = 0.61, ICC = 0.48). This study is the first to validate measures of perceptions of physical activity and the physical activity home environment in Turkey. Our results support the originally hypothesized two-factor structures for Physical Activity Barriers and Physical Activity Outcome Expectancies. However, we found the one-factor rather than two-factor structure for Physical Activity Home Environment had the best model fit. This study provides general support for the use of these scales in Turkey in terms of validity, but test-retest reliability warrants further research.
Preliminary development of the adolescent students' basic psychological needs at school scale.
Tian, Lili; Han, Mengmeng; Huebner, E Scott
2014-04-01
The aim of the present study was to develop and provide evidence for the validity of a new measure of adolescent students' psychological need satisfaction at school, using a sample of Chinese students. We conducted four studies with four independent samples (total n = 1872). The first study aimed to develop items for the new instrument and to ascertain its factorial structure using exploratory factor analysis procedures. The second study aimed to examine the instrument's factorial structure using confirmatory factor analysis procedures as well as to assess its internal consistency reliability, convergent and divergent validity. The third study aimed to assess its measurement invariance across gender and age. The fourth study aimed to test its test-retest reliability over time and predictive validity. These preliminary results showed that the new instrument has promising psychometric properties. The potential contributions of the new instrument for future research and educational practices were discussed. Copyright © 2014 The Foundation for Professionals in Services for Adolescents. Published by Elsevier Ltd. All rights reserved.
Factor structure and psychometric properties of the Fertility Problem Inventory–Short Form
Zurlo, Maria Clelia; Cattaneo Della Volta, Maria Franscesca; Vallone, Federica
2017-01-01
The study analyses factor structure and psychometric properties of the Italian version of the Fertility Problem Inventory–Short Form. A sample of 206 infertile couples completed the Italian version of Fertility Problem Inventory (46 items) with demographics, State Anxiety Scale of State-Trait Anxiety Inventory (Form Y), Edinburgh Depression Scale and Dyadic Adjustment Scale, used to assess convergent and discriminant validity. Confirmatory factor analysis was unsatisfactory (comparative fit index = 0.87; Tucker-Lewis Index = 0.83; root mean square error of approximation = 0.17), and Cronbach’s α (0.95) revealed a redundancy of items. Exploratory factor analysis was carried out deleting cross-loading items, and Mokken scale analysis was applied to verify the items homogeneity within the reduced subscales of the questionnaire. The Fertility Problem Inventory–Short Form consists of 27 items, tapping four meaningful and reliable factors. Convergent and discriminant validity were confirmed. Findings indicated that the Fertility Problem Inventory–Short Form is a valid and reliable measure to assess infertility-related stress dimensions. PMID:29379625
Ardestani, M S; Niknami, S; Hidarnia, A; Hajizadeh, E
2016-08-18
This research examined the validity and reliability of a researcher-developed questionnaire based on Social Cognitive Theory (SCT) to assess the physical activity behaviour of Iranian adolescent girls (SCT-PAIAGS). Psychometric properties of the SCT-PAIAGS were assessed by determining its face validity, content and construct validity as well as its reliability. In order to evaluate factor structure, cross-sectional research was conducted on 400 high-school girls in Tehran. Content validity index, content validity ratio and impact score for the SCT-PAIAGS varied between 0.97-1, 0.91-1 and 4.6-4.9 respectively. Confirmatory factor analysis approved a six-factor structure comprising self-efficacy, self-regulation, family support, friend support, outcome expectancy and self-efficacy to overcoming impediments. Factor loadings, t-values and fit indices showed that the SCT model was fitted to the data. Cronbach's α-coefficient ranged from 0.78 to 0.85 and intraclass correlation coefficient from 0.73 to 0.90.
Ko, Junsu; Park, Hahnbeom; Seok, Chaok
2012-08-10
Protein structures can be reliably predicted by template-based modeling (TBM) when experimental structures of homologous proteins are available. However, it is challenging to obtain structures more accurate than the single best templates by either combining information from multiple templates or by modeling regions that vary among templates or are not covered by any templates. We introduce GalaxyTBM, a new TBM method in which the more reliable core region is modeled first from multiple templates and less reliable, variable local regions, such as loops or termini, are then detected and re-modeled by an ab initio method. This TBM method is based on "Seok-server," which was tested in CASP9 and assessed to be amongst the top TBM servers. The accuracy of the initial core modeling is enhanced by focusing on more conserved regions in the multiple-template selection and multiple sequence alignment stages. Additional improvement is achieved by ab initio modeling of up to 3 unreliable local regions in the fixed framework of the core structure. Overall, GalaxyTBM reproduced the performance of Seok-server, with GalaxyTBM and Seok-server resulting in average GDT-TS of 68.1 and 68.4, respectively, when tested on 68 single-domain CASP9 TBM targets. For application to multi-domain proteins, GalaxyTBM must be combined with domain-splitting methods. Application of GalaxyTBM to CASP9 targets demonstrates that accurate protein structure prediction is possible by use of a multiple-template-based approach, and ab initio modeling of variable regions can further enhance the model quality.
The relationship between cost estimates reliability and BIM adoption: SEM analysis
NASA Astrophysics Data System (ADS)
Ismail, N. A. A.; Idris, N. H.; Ramli, H.; Rooshdi, R. R. Raja Muhammad; Sahamir, S. R.
2018-02-01
This paper presents the usage of Structural Equation Modelling (SEM) approach in analysing the effects of Building Information Modelling (BIM) technology adoption in improving the reliability of cost estimates. Based on the questionnaire survey results, SEM analysis using SPSS-AMOS application examined the relationships between BIM-improved information and cost estimates reliability factors, leading to BIM technology adoption. Six hypotheses were established prior to SEM analysis employing two types of SEM models, namely the Confirmatory Factor Analysis (CFA) model and full structural model. The SEM models were then validated through the assessment on their uni-dimensionality, validity, reliability, and fitness index, in line with the hypotheses tested. The final SEM model fit measures are: P-value=0.000, RMSEA=0.079<0.08, GFI=0.824, CFI=0.962>0.90, TLI=0.956>0.90, NFI=0.935>0.90 and ChiSq/df=2.259; indicating that the overall index values achieved the required level of model fitness. The model supports all the hypotheses evaluated, confirming that all relationship exists amongst the constructs are positive and significant. Ultimately, the analysis verified that most of the respondents foresee better understanding of project input information through BIM visualization, its reliable database and coordinated data, in developing more reliable cost estimates. They also perceive to accelerate their cost estimating task through BIM adoption.
Evren, Cuneyt; Dalbudak, Ercan; Topcu, Merve; Kutlu, Nilay; Evren, Bilge; Pontes, Halley M
2018-07-01
The main aims of the current study were to test the factor structure, reliability and validity of the nine-item Internet Gaming Disorder Scale-Short Form (IGDS9-SF), a standardized measure to assess symptoms and prevalence of Internet Gaming Disorder (IGD). In the present study participants were assessed with the IGDS9-SF, nine-item Internet Gaming Disorder Scale (IGDS) and the Young's Internet Addiction Test-Short Form (YIAT-SF). Confirmatory factor analyzes demonstrated that the factor structure (i.e., the dimensional structure) of the IGDS9-SF was satisfactory. The scale was also reliable (i.e., internally consistent with a Cronbach's alpha of 0.89) and showed adequate convergent and criterion-related validity, as indicated by statistically significant positive correlations between average time daily spent playing games during last year, IGDS and YIAT-SF scores. By applying the Diagnostic and Statistical Manual of Mental Disorders (DSM-5) threshold for diagnosing IGD (e.g., endorsing at least five criteria), it was found that the prevalence of disordered gamers ranged from 0.96% (whole sample) to 2.57% (e-sports players). These findings support the Turkish version of the IGDS9-SF as a valid and reliable tool for determining the extent of IGD-related problems among young adults and for the purposes of early IGD diagnosis in clinical settings and similar research. Copyright © 2018 Elsevier B.V. All rights reserved.
Development of a scale to assess cancer stigma in the non-patient population.
Marlow, Laura A V; Wardle, Jane
2014-04-23
Illness-related stigma has attracted considerable research interest, but few studies have specifically examined stigmatisation of cancer in the non-patient population. The present study developed and validated a Cancer Stigma Scale (CASS) for use in the general population. An item pool was developed on the basis of previous research into illness-related stigma in the general population and patients with cancer. Two studies were carried out. The first study used Exploratory factor analysis to explore the structure of items in a sample of 462 postgraduate students recruited through a London university. The second study used Confirmatory factor analysis to confirm the structure among 238 adults recruited through an online market research panel. Internal reliability, test-retest reliability and construct validity were also assessed. Exploratory factor analysis suggested six subscales, representing: Awkwardness, Severity, Avoidance, Policy Opposition, Personal Responsibility and Financial Discrimination. Confirmatory factor analysis confirmed this structure with a 25-item scale. All subscales showed adequate to good internal and test-retest reliability in both samples. Construct validity was also good, with mean scores for each subscale varying in the expected directions by age, gender, experience of cancer, awareness of lifestyle risk factors for cancer, and social desirability. Means for the subscales were consistent across the two samples. These findings highlight the complexity of cancer stigma and provide the Cancer Stigma Scale (CASS) which can be used to compare populations, types of cancer and evaluate the effects of interventions designed to reduce cancer stigma in non-patient populations.
Al-Eidan, Fahad; Baig, Lubna Ansari; Magzoub, Mohi-Eldin; Omair, Aamir
2016-04-01
To assess reliability and validity of evaluation tool using Haematology course as an example. The cross-sectional study was conducted at King Saud Bin Abdul Aziz University of Health Sciences, Riyadh, Saudi Arabia, in 2012, while data analysis was completed in 2013. The 27-item block evaluation instrument was developed by a multidisciplinary faculty after a comprehensive literature review. Validity of the questionnaire was confirmed using principal component analysis with varimax rotation and Kaiser normalisation. Identified factors were combined to get the internal consistency reliability of each factor. Student's t-test was used to compare mean ratings between male and female students for the faculty and block evaluation. Of the 116 subjects in the study, 80(69%) were males and 36(31%) were females. Reliability of the questionnaire was Cronbach's alpha 0.91. Factor analysis yielded a logically coherent 7 factor solution that explained 75% of the variation in the data. The factors were group dynamics in problem-based learning (alpha0.92), block administration (alpha 0.89), quality of objective structured clinical examination (alpha 0.86), block coordination (alpha 0.81), structure of problem-based learning (alpha 0.84), quality of written exam (alpha 0.91), and difficulty of exams (alpha0.41). Female students' opinion on depth of analysis and critical thinking was significantly higher than that of the males (p=0.03). The faculty evaluation tool used was found to be reliable, but its validity, as assessed through factor analysis, has to be interpreted with caution as the responders were less than the minimum required for factor analysis.
NASA Astrophysics Data System (ADS)
Martowicz, Adam; Uhl, Tadeusz
2012-10-01
The paper discusses the applicability of a reliability- and performance-based multi-criteria robust design optimization technique for micro-electromechanical systems, considering their technological uncertainties. Nowadays, micro-devices are commonly applied systems, especially in the automotive industry, taking advantage of utilizing both the mechanical structure and electronic control circuit on one board. Their frequent use motivates the elaboration of virtual prototyping tools that can be applied in design optimization with the introduction of technological uncertainties and reliability. The authors present a procedure for the optimization of micro-devices, which is based on the theory of reliability-based robust design optimization. This takes into consideration the performance of a micro-device and its reliability assessed by means of uncertainty analysis. The procedure assumes that, for each checked design configuration, the assessment of uncertainty propagation is performed with the meta-modeling technique. The described procedure is illustrated with an example of the optimization carried out for a finite element model of a micro-mirror. The multi-physics approach allowed the introduction of several physical phenomena to correctly model the electrostatic actuation and the squeezing effect present between electrodes. The optimization was preceded by sensitivity analysis to establish the design and uncertain domains. The genetic algorithms fulfilled the defined optimization task effectively. The best discovered individuals are characterized by a minimized value of the multi-criteria objective function, simultaneously satisfying the constraint on material strength. The restriction of the maximum equivalent stresses was introduced with the conditionally formulated objective function with a penalty component. The yielded results were successfully verified with a global uniform search through the input design domain.
Jacobsberg, L; Perry, S; Frances, A
1995-12-01
Instruments to assess personality disorders offer reliability, but at the cost of large amounts of a skilled clinician's time to make assessments. The Structured Clinical Interview for DSM-III Axis II (SCID-II; Spitzer, Williams, Gibbon, & First, 1990), incorporates a self-report screening questionnaire, reducing the number of items needing evaluation by the interviewer. However, false negative responses may cause clinically important areas to be overlooked. To establish the rate of false negative responses, we compared participant self-report on the SCID-II with Axis II diagnostic assessment done by clinicians using the Personality Disorder Examination (Loranger, Susman, Oldham, & Russakoff, 1987). The false negative rate was low for every diagnosis, supporting validity of following up with clinician questioning only those diagnostic elements endorsed in the self-report. Avoidant and dependent personality disorders were accurately self-reported. This, an efficient assessment instrument for personality disorders might combine self-report of those disorders where self-report is reliable, with clinician assessment where needed.
The Addiction Severity Index: reliability and validity in a Dutch alcoholic population.
DeJong, C A; Willems, J C; Schippers, G M; Hendriks, V M
1995-04-01
The Addiction Severity Index (ASI) was evaluated for its psychometric qualities in a Dutch alcoholic population admitted to an addiction treatment center in The Netherlands. Its factorial structure in this population was found to be consistent with the established six factor structure of the ASI. Reliability analysis revealed that the homogeneity of the subscales was acceptable with the exception of the Alcohol Scale. The six subscales were not highly intercorrelated. The results of this study indicate that the ASI is a useful instrument for the assessment of several problems associated with alcoholism. However, the Alcohol Scale appears to be limited as a diagnostic and research instrument in the field of inpatient treatment of alcohol dependence in The Netherlands.
[Psychometric examination of the School Social Climate Questionnaire in Chileans students].
Guerra Vio, Cristóbal; Castro Arancibia, Lorena; Vargas Castro, Judith
2011-02-01
The School Social Climate Questionnaire (CECSCE) was adapted and applied. Subsequently, its psychometric proprieties were analyzed. The 1075 Chilean students who participated were assessed with the CECSCE and the School Violence Scale. The results showed that the CECSCE has a bifactorial structure, although there was also the possibility of a unifactorial structure. The CECSCE achieved satisfactory reliability and homogeneity indexes. The CECSCES scores were inversely related to the school violence rate. Lastly, differences by gender and educational level were analyzed. Given that there are differences in school climate perceptions in favor of girls, Chilean standards are presented in percentiles by gender. It can therefore be concluded that the CECSCE is sufficiently valid and reliable to be applied in Chile.
Vikström, Anna; Skånér, Ylva; Strender, Lars-Erik; Nilsson, Gunnar H
2007-01-01
Background Terminologies and classifications are used for different purposes and have different structures and content. Linking or mapping terminologies and classifications has been pointed out as a possible way to achieve various aims as well as to attain additional advantages in describing and documenting health care data. The objectives of this study were: • to explore and develop rules to be used in a mapping process • to evaluate intercoder reliability and the assessed degree of concordance when the 'Swedish primary health care version of the International Classification of Diseases version 10' (ICD-10) is matched to the Systematized Nomenclature of Medicine, Clinical Terms (SNOMED CT) • to describe characteristics in the coding systems that are related to obstacles to high quality mapping. Methods Mapping (interpretation, matching, assessment and rule development) was done by two coders. The Swedish primary health care version of ICD-10 with 972 codes was randomly divided into an allotment of three sets of categories, used in three mapping sequences, A, B and C. Mapping was done independently by the coders and new rules were developed between the sequences. Intercoder reliability was measured by comparing the results after each set. The extent of matching was assessed as either 'partly' or 'completely concordant' Results General principles for mapping were outlined before the first sequence, A. New mapping rules had significant impact on the results between sequences A - B (p < 0.01) and A - C (p < 0.001). The intercoder reliability in our study reached 83%. Obstacles to high quality mapping were mainly a lack of agreement by the coders due to structural and content factors in SNOMED CT and in the current ICD-10 version. The predominant reasons for this were difficulties in interpreting the meaning of the categories in the current ICD-10 version, and the presence of many related concepts in SNOMED CT. Conclusion Mapping from ICD-10-categories to SNOMED CT needs clear and extensive rules. It is possible to reach high intercoder reliability in mapping from ICD-10-categories to SNOMED CT. However, several obstacles to high quality mapping remain due to structure and content characteristics in both coding systems. PMID:17472757
Reliability Quantification of Advanced Stirling Convertor (ASC) Components
NASA Technical Reports Server (NTRS)
Shah, Ashwin R.; Korovaichuk, Igor; Zampino, Edward
2010-01-01
The Advanced Stirling Convertor, is intended to provide power for an unmanned planetary spacecraft and has an operational life requirement of 17 years. Over this 17 year mission, the ASC must provide power with desired performance and efficiency and require no corrective maintenance. Reliability demonstration testing for the ASC was found to be very limited due to schedule and resource constraints. Reliability demonstration must involve the application of analysis, system and component level testing, and simulation models, taken collectively. Therefore, computer simulation with limited test data verification is a viable approach to assess the reliability of ASC components. This approach is based on physics-of-failure mechanisms and involves the relationship among the design variables based on physics, mechanics, material behavior models, interaction of different components and their respective disciplines such as structures, materials, fluid, thermal, mechanical, electrical, etc. In addition, these models are based on the available test data, which can be updated, and analysis refined as more data and information becomes available. The failure mechanisms and causes of failure are included in the analysis, especially in light of the new information, in order to develop guidelines to improve design reliability and better operating controls to reduce the probability of failure. Quantified reliability assessment based on fundamental physical behavior of components and their relationship with other components has demonstrated itself to be a superior technique to conventional reliability approaches based on utilizing failure rates derived from similar equipment or simply expert judgment.
Nanoscale deformation measurements for reliability assessment of material interfaces
NASA Astrophysics Data System (ADS)
Keller, Jürgen; Gollhardt, Astrid; Vogel, Dietmar; Michel, Bernd
2006-03-01
With the development and application of micro/nano electronic mechanical systems (MEMS, NEMS) for a variety of market segments new reliability issues will arise. The understanding of material interfaces is the key for a successful design for reliability of MEMS/NEMS and sensor systems. Furthermore in the field of BIOMEMS newly developed advanced materials and well known engineering materials are combined despite of fully developed reliability concepts for such devices and components. In addition the increasing interface-to volume ratio in highly integrated systems and nanoparticle filled materials are challenges for experimental reliability evaluation. New strategies for reliability assessment on the submicron scale are essential to fulfil the needs of future devices. In this paper a nanoscale resolution experimental method for the measurement of thermo-mechanical deformation at material interfaces is introduced. The determination of displacement fields is based on scanning probe microscopy (SPM) data. In-situ SPM scans of the analyzed object (i.e. material interface) are carried out at different thermo-mechanical load states. The obtained images are compared by grayscale cross correlation algorithms. This allows the tracking of local image patterns of the analyzed surface structure. The measurement results are full-field displacement fields with nanometer resolution. With the obtained data the mixed mode type of loading at material interfaces can be analyzed with highest resolution for future needs in micro system and nanotechnology.
PEP solar array definition study
NASA Technical Reports Server (NTRS)
1979-01-01
The conceptual design of a large, flexible, lightweight solar array is presented focusing on a solar array overview assessment, solar array blanket definition, structural-mechanical systems definition, and launch/reentry blanket protection features. The overview assessment includes a requirements and constraints review, the thermal environment assessment on the design selection, an evaluation of blanket integration sequence, a conceptual blanket/harness design, and a hot spot analysis considering the effects of shadowing and cell failures on overall array reliability. The solar array blanket definition includes the substrate design, hinge designs and blanket/harness flexibility assessment. The structural/mechanical systems definition includes an overall loads and deflection assessment, a frequency analysis of the deployed assembly, a components weights estimate, design of the blanket housing and tensioning mechanism. The launch/reentry blanket protection task includes assessment of solar cell/cover glass cushioning concepts during ascent and reentry flight condition.
Lui, P Priscilla; Fernando, Gaithri A
2018-02-01
Numerous scales currently exist that assess well-being, but research on measures of well-being is still advancing. Conceptualization and measurement of subjective well-being have emphasized intrapsychic over psychosocial domains of optimal functioning, and disparate research on hedonic, eudaimonic, and psychological well-being lacks a unifying theoretical model. Lack of systematic investigations on the impact of culture on subjective well-being has also limited advancement of this field. The goals of this investigation were to (1) develop and validate a self-report measure, the Well-Being Scale (WeBS), that simultaneously assesses overall well-being and physical, financial, social, hedonic, and eudaimonic domains of this construct; (2) evaluate factor structures that underlie subjective well-being; and (3) examine the measure's psychometric properties. Three empirical studies were conducted to develop and validate the 29-item scale. The WeBS demonstrated an adequate five-factor structure in an exploratory structural equation model in Study 1. Confirmatory factor analyses showed that a bifactor structure best fit the WeBS data in Study 2 and Study 3. Overall WeBS scores and five domain-specific subscale scores demonstrated adequate to excellent internal consistency reliability and construct validity. Mean differences in overall well-being and its five subdomains are presented for different ethnic groups. The WeBS is a reliable and valid measure of multiple aspects of well-being that are considered important to different ethnocultural groups.
The Clinical Threat Assessment of the Lone-Actor Terrorist.
Meloy, J Reid; Genzman, Jacqueline
2016-12-01
The Terrorist Radicalization Assessment Protocol (TRAP-18) is a structured professional judgment instrument for the assessment of individuals who present a concern for lone-actor terrorism. It consists of eight proximal warning behaviors and 10 distal characteristics. Previous research has demonstrated its interrater reliability and some concurrent and postdictive validity. In this article, TRAP-18 is retrospectively applied to the case of US Army psychiatrist and jihadist Malik Nidal Hasan, who committed a mass murder at Fort Hood, Texas in 2009. The strengths and limitations of TRAP-18 as a structured professional judgment instrument for mental health clinicians are discussed, and clinical risk management suggestions are made. Copyright © 2016 Elsevier Inc. All rights reserved.
Development and Validation of the Caring Loneliness Scale.
Karhe, Liisa; Kaunonen, Marja; Koivisto, Anna-Maija
2016-12-01
The Caring Loneliness Scale (CARLOS) includes 5 categories derived from earlier qualitative research. This article assesses the reliability and construct validity of a scale designed to measure patient experiences of loneliness in a professional caring relationship. Statistical analysis with 4 different sample sizes included Cronbach's alpha and exploratory factor analysis with principal axis factoring extraction. The sample size of 250 gave the most useful and comprehensible structure, but all 4 samples yielded underlying content of loneliness experiences. The initial 5 categories were reduced to 4 factors with 24 items and Cronbach's alpha ranging from .77 to .90. The findings support the reliability and validity of CARLOS for the assessment of Finnish breast cancer and heart surgery patients' experiences but as all instruments, further validation is needed.
System Analysis by Mapping a Fault-tree into a Bayesian-network
NASA Astrophysics Data System (ADS)
Sheng, B.; Deng, C.; Wang, Y. H.; Tang, L. H.
2018-05-01
In view of the limitations of fault tree analysis in reliability assessment, Bayesian Network (BN) has been studied as an alternative technology. After a brief introduction to the method for mapping a Fault Tree (FT) into an equivalent BN, equations used to calculate the structure importance degree, the probability importance degree and the critical importance degree are presented. Furthermore, the correctness of these equations is proved mathematically. Combining with an aircraft landing gear’s FT, an equivalent BN is developed and analysed. The results show that richer and more accurate information have been achieved through the BN method than the FT, which demonstrates that the BN is a superior technique in both reliability assessment and fault diagnosis.
Le, Minh Thi Hong; Tran, Thach Duc; Holton, Sara; Nguyen, Huong Thanh; Wolfe, Rory; Fisher, Jane
2017-01-01
To assess the internal consistency, latent structure and convergent validity of the Depression, Anxiety and Stress Scale-21 (DASS-21) among adolescents in Vietnam. An anonymous, self-completed questionnaire was conducted among 1,745 high school students in Hanoi, Vietnam between October, 2013 and January, 2014. Confirmatory factor analyses were performed to assess the latent structure of the DASS-21. Factorial invariance between girls and boys was examined. Cronbach alphas and correlation coefficients between DASS-21 factor scores and the domain scores of the Duke Health Profile Adolescent Vietnamese validated version (ADHP-V) were calculated to assess DASS-21 internal consistency and convergent validity. A total of 1,606/ 1,745 (92.6%) students returned the questionnaire. Of those, 1,387 students provided complete DASS-21 data. The scale demonstrated adequate internal consistency (Cronbach α: 0.761 to 0.906). A four-factor model showed the best fit to the data. Items loaded significantly on a common general distress factor, the depression, and the anxiety factors, but few on the stress factor (p<0.05). DASS-21 convergent validity was confirmed with moderate correlation coefficients (-0.47 to -0.66) between its factor scores and the ADHP-V mental health related domains. The DASS-21 is reliable and suitable for use to assess symptoms of common mental health problems, especially depression and anxiety among Vietnamese adolescents. However, its ability in detecting stress among these adolescents may be limited. Further research is warrant to explore these results.
The Aftercare and School Observation System (ASOS): Reliability and Component Structure.
Ingoldsby, Erin M; Shelleby, Elizabeth C; Lane, Tonya; Shaw, Daniel S; Dishion, Thomas J; Wilson, Melvin N
2013-10-01
This study examines the psychometric properties and component structure of a newly developed observational system, the Aftercare and School Observation System (ASOS). Participants included 468 children drawn from a larger longitudinal intervention study. The system was utilized to assess participant children in school lunchrooms and recess and various afterschool environments. Exploratory factor analyses examined whether a core set of component constructs assessing qualities of children's relationships, caregiver involvement and monitoring, and experiences in school and aftercare contexts that have been linked to children's behavior problems would emerge. Construct validity was assessed by examining associations between ASOS constructs and questionnaire measures assessing children's behavior problems and relationship qualities in school and aftercare settings. Across both settings, two factors showed very similar empirical structures and item loadings, reflecting the constructs of a negative/aggressive context and caregiver positive involvement, with one additional unique factor from the school setting reflecting the extent to which caregiver methods used resulted in less negative behavior and two additional unique factors from the aftercare setting reflecting positivity in the child's interactions and general environment and negativity in the child's interactions and setting. Modest correlations between ASOS factors and aftercare provider and teacher ratings of behavior problems, adult-child relationships, and a rating of school climate contributed to our interpretation that the ASOS scores capture meaningful features of children's experiences in these settings. This study represents the first step of establishing that the ASOS reliably and validly captures risk and protective relationships and experiences in extra-familial settings.
Quality assessment of protein model-structures using evolutionary conservation.
Kalman, Matan; Ben-Tal, Nir
2010-05-15
Programs that evaluate the quality of a protein structural model are important both for validating the structure determination procedure and for guiding the model-building process. Such programs are based on properties of native structures that are generally not expected for faulty models. One such property, which is rarely used for automatic structure quality assessment, is the tendency for conserved residues to be located at the structural core and for variable residues to be located at the surface. We present ConQuass, a novel quality assessment program based on the consistency between the model structure and the protein's conservation pattern. We show that it can identify problematic structural models, and that the scores it assigns to the server models in CASP8 correlate with the similarity of the models to the native structure. We also show that when the conservation information is reliable, the method's performance is comparable and complementary to that of the other single-structure quality assessment methods that participated in CASP8 and that do not use additional structural information from homologs. A perl implementation of the method, as well as the various perl and R scripts used for the analysis are available at http://bental.tau.ac.il/ConQuass/. nirb@tauex.tau.ac.il Supplementary data are available at Bioinformatics online.
Homaifar, Beeta; Matarazzo, Bridget; Wortzel, Hal S
2013-09-01
This column is the second in a series presenting a model for therapeutic risk management of the suicidal patient. As discussed in the first part of the series, the model involves several elements including augmenting clinical risk assessment with structured instruments, stratifying risk in terms of both severity and temporality, and developing and documenting a safety plan. This column explores in more detail how to augment clinical risk assessment with structured instruments. Unstructured clinical interviews have the potential to miss important aspects of suicide risk assessment. By augmenting the free-form clinical interview with structured instruments that demonstrate reliability and validity, a more nuanced and multifaceted approach to suicide risk assessment is achieved. Incorporating structured instruments into practice also serves a medicolegal function, since these instruments may become a living part of the medical record, establishing baseline levels of suicidal thoughts and behaviors and facilitating future clinical determinations regarding safety needs. We describe several instruments used in a multidisciplinary suicide consultation service, each of which has demonstrated relevance to suicide risk assessment and screening, ease of administration, and strong psychometric properties. In addition, we emphasize the importance of viewing suicide risk assessment as an ongoing process rather than as a singular event. Finally, we discuss special considerations in the evolving practice of risk assessment.
Assessment of physical server reliability in multi cloud computing system
NASA Astrophysics Data System (ADS)
Kalyani, B. J. D.; Rao, Kolasani Ramchand H.
2018-04-01
Business organizations nowadays functioning with more than one cloud provider. By spreading cloud deployment across multiple service providers, it creates space for competitive prices that minimize the burden on enterprises spending budget. To assess the software reliability of multi cloud application layered software reliability assessment paradigm is considered with three levels of abstractions application layer, virtualization layer, and server layer. The reliability of each layer is assessed separately and is combined to get the reliability of multi-cloud computing application. In this paper, we focused on how to assess the reliability of server layer with required algorithms and explore the steps in the assessment of server reliability.
Probabilistic design of fibre concrete structures
NASA Astrophysics Data System (ADS)
Pukl, R.; Novák, D.; Sajdlová, T.; Lehký, D.; Červenka, J.; Červenka, V.
2017-09-01
Advanced computer simulation is recently well-established methodology for evaluation of resistance of concrete engineering structures. The nonlinear finite element analysis enables to realistically predict structural damage, peak load, failure, post-peak response, development of cracks in concrete, yielding of reinforcement, concrete crushing or shear failure. The nonlinear material models can cover various types of concrete and reinforced concrete: ordinary concrete, plain or reinforced, without or with prestressing, fibre concrete, (ultra) high performance concrete, lightweight concrete, etc. Advanced material models taking into account fibre concrete properties such as shape of tensile softening branch, high toughness and ductility are described in the paper. Since the variability of the fibre concrete material properties is rather high, the probabilistic analysis seems to be the most appropriate format for structural design and evaluation of structural performance, reliability and safety. The presented combination of the nonlinear analysis with advanced probabilistic methods allows evaluation of structural safety characterized by failure probability or by reliability index respectively. Authors offer a methodology and computer tools for realistic safety assessment of concrete structures; the utilized approach is based on randomization of the nonlinear finite element analysis of the structural model. Uncertainty of the material properties or their randomness obtained from material tests are accounted in the random distribution. Furthermore, degradation of the reinforced concrete materials such as carbonation of concrete, corrosion of reinforcement, etc. can be accounted in order to analyze life-cycle structural performance and to enable prediction of the structural reliability and safety in time development. The results can serve as a rational basis for design of fibre concrete engineering structures based on advanced nonlinear computer analysis. The presented methodology is illustrated on results from two probabilistic studies with different types of concrete structures related to practical applications and made from various materials (with the parameters obtained from real material tests).
NASA Technical Reports Server (NTRS)
Hardrath, H. F.
1974-01-01
Fracture mechanics is a rapidly emerging discipline for assessing the residual strength of structures containing flaws due to fatigue, corrosion or accidental damage and for anticipating the rate of which such flaws will propagate if not repaired. The discipline is also applicable in the design of structures with improved resistance to such flaws. The present state of the design art is reviewed using this technology to choose materials, to configure safe and efficient structures, to specify inspection procedures, to predict lives of flawed structures and to develop reliability of current and future airframes.
Ku-band signal design study. [space shuttle orbiter data processing network
NASA Technical Reports Server (NTRS)
Rubin, I.
1978-01-01
Analytical tools, methods and techniques for assessing the design and performance of the space shuttle orbiter data processing system (DPS) are provided. The computer data processing network is evaluated in the key areas of queueing behavior synchronization and network reliability. The structure of the data processing network is described as well as the system operation principles and the network configuration. The characteristics of the computer systems are indicated. System reliability measures are defined and studied. System and network invulnerability measures are computed. Communication path and network failure analysis techniques are included.
Van der Elst, Wim; Molenberghs, Geert; Hilgers, Ralf-Dieter; Verbeke, Geert; Heussen, Nicole
2016-11-01
There are various settings in which researchers are interested in the assessment of the correlation between repeated measurements that are taken within the same subject (i.e., reliability). For example, the same rating scale may be used to assess the symptom severity of the same patients by multiple physicians, or the same outcome may be measured repeatedly over time in the same patients. Reliability can be estimated in various ways, for example, using the classical Pearson correlation or the intra-class correlation in clustered data. However, contemporary data often have a complex structure that goes well beyond the restrictive assumptions that are needed with the more conventional methods to estimate reliability. In the current paper, we propose a general and flexible modeling approach that allows for the derivation of reliability estimates, standard errors, and confidence intervals - appropriately taking hierarchies and covariates in the data into account. Our methodology is developed for continuous outcomes together with covariates of an arbitrary type. The methodology is illustrated in a case study, and a Web Appendix is provided which details the computations using the R package CorrMixed and the SAS software. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
Validation of the MISSCARE-BRASIL survey - A tool to assess missed nursing care.
Siqueira, Lillian Dias Castilho; Caliri, Maria Helena Larcher; Haas, Vanderlei José; Kalisch, Beatrice; Dantas, Rosana Aparecida Spadoti
2017-12-21
to analyze the metric validity and reliability properties of the MISSCARE-BRASIL survey. methodological research conducted by assessing construct validity and reliability via confirmatory factor analysis, known-groups validation, convergent construct validation, analysis of internal consistency and test-retest reliability. The sample consisted of 330 nursing professionals, of whom 86 participated in the retest phase. of the 330 participants, 39.7% were aides, 33% technicians, 20.9% nurses, and 6.4% nurses with administrative roles. Confirmatory factorial analysis demonstrated that the Brazilian Portuguese version of the instrument is adequately adjusted to the dimensional structure the scale authors originally proposed. The correlation between "satisfaction with position/role" and "satisfaction with teamwork" and the survey's missed care variables was moderate (Spearman's coefficient =0.35; p<0.001). The results of the Student's t-test indicated known-group validity. Professionals from closed units reported lower levels of missed care in comparison with the other units. The reliability showed a strong correlation, with the exception of "institutional management/leadership style" (intraclass correlation coefficient (ICC)=0.15; p=0.04). The internal consistency was adequate (Cronbach's alpha was greater than 0.70). the MISSCARE-BRASIL was valid and reliable in the group studied. The application of the MISSCARE-BRASIL can contribute to identifying solutions for missed nursing care.
De Silva Weliange, Shreenika H; Fernando, Dulitha; Gunatilake, Jagath
2014-05-03
Environmental characteristics are known to be associated with patterns of physical activity (PA). Although several validated tools exist, to measure the environment characteristics, these instruments are not necessarily suitable for application in all settings especially in a developing country. This study was carried out to develop and validate an instrument named the "Physical And Social Environment Scale--PASES" to assess the physical and social environmental factors associated with PA. This will enable identification of various physical and social environmental factors affecting PA in Sri Lanka, which will help in the development of more tailored intervention strategies for promoting higher PA levels in Sri Lanka. The PASES was developed using a scientific approach of defining the construct, item generation, analysis of content of items and item reduction. Both qualitative and quantitative methods of key informant interviews, in-depth interviews and rating of the items generated by experts were conducted. A cross sectional survey among 180 adults was carried out to assess the factor structure through principal component analysis. Another cross sectional survey among a different group of 180 adults was carried out to assess the construct validity through confirmatory factor analysis. Reliability was assessed with test re-test reliability and internal consistency using Spearman r and Cronbach's alpha respectively. Thirty six items were selected after the expert ratings and were developed into interviewer administered questions. Exploration of factor structure of the 34 items which were factorable through principal component analysis with Quartimax rotation extracted 8 factors. The 34 item instrument was assessed for construct validity with confirmatory factor analysis which confirmed an 8 factor model (x2 = 339.9, GFI = 0.90). The identified factors were infrastructure for walking, aesthetics and facilities for cycling, vehicular traffic safety, access and connectivity, recreational facilities for PA, safety, social cohesion and social acceptance of PA with the two non-factorable factors, residential density and land use mix. The PASES also showed good test re-test reliability and a moderate level of internal consistency. The PASES is a valid and reliable tool which could be used to assess the physical and social environment associated with PA in Sri Lanka.
Validation of the Implementation Leadership Scale (ILS) with Supervisors' Self-Ratings.
Torres, Elisa M; Ehrhart, Mark G; Beidas, Rinad S; Farahnak, Lauren R; Finn, Natalie K; Aarons, Gregory A
2018-01-01
Although often discussed, there is a lack of empirical research on the role of leadership in the management and delivery of health services. The implementation leadership scale (ILS) assesses the degree to which leaders are knowledgeable, proactive, perseverant, and supportive during evidence-based practice (EBP) implementation. The purpose of this study was to examine the psychometric properties of the ILS for leaders' self-ratings using a sample of mental health clinic supervisors (N = 119). Supervisors (i.e., leaders) completed surveys including self-ratings of their implementation leadership. Confirmatory factor analysis, reliability, and validity of the ILS were evaluated. The ILS factor structure was supported in the sample of supervisors. Results demonstrated internal consistency reliability and validity. Cronbach alpha's ranged from 0.92 to 0.96 for the ILS subscales and 0.95 for the ILS overall scale. The factor structure replication and reliability of the ILS in a sample of supervisors demonstrates its applicability with employees across organizational levels.
Reliability assessment of slender concrete columns at the stability failure
NASA Astrophysics Data System (ADS)
Valašík, Adrián; Benko, Vladimír; Strauss, Alfred; Täubling, Benjamin
2018-01-01
The European Standard for designing concrete columns within the use of non-linear methods shows deficiencies in terms of global reliability, in case that the concrete columns fail by the loss of stability. The buckling failure is a brittle failure which occurs without warning and the probability of its formation depends on the columns slenderness. Experiments with slender concrete columns were carried out in cooperation with STRABAG Bratislava LTD in Central Laboratory of Faculty of Civil Engineering SUT in Bratislava. The following article aims to compare the global reliability of slender concrete columns with slenderness of 90 and higher. The columns were designed according to methods offered by EN 1992-1-1 [1]. The mentioned experiments were used as basis for deterministic nonlinear modelling of the columns and subsequent the probabilistic evaluation of structural response variability. Final results may be utilized as thresholds for loading of produced structural elements and they aim to present probabilistic design as less conservative compared to classic partial safety factor based design and alternative ECOV method.
Beyond the Factor of Safety: Developing Fragility Curves to Characterize System Reliability
2010-07-01
increasingly common compo- nents of flood risk assessments. This report introduces the concept of the fragility curve and shows how fragility curves are...curves are identified in the literature on structures and risk assessment to identify what methods have been used to develop fragility curves in...and disadvantages of the various approaches are considered. DISCLAIMER: The contents of this report are not to be used for advertising
ERIC Educational Resources Information Center
Sandilos, Lia E.
2012-01-01
The purpose of the current study was to evaluate the structural validity and stability of scores on a measure of global classroom quality, the Classroom Assessment Scoring System, Kindergarten-Third Grade (CLASS K-3; Pianta, La Paro, & Hamre, 2008). Using data from a sample of 417 kindergarten classrooms in the rural Southern and Mid-Atlantic…
Static test induced loads verification beyond elastic limit
NASA Technical Reports Server (NTRS)
Verderaime, V.; Harrington, F.
1996-01-01
Increasing demands for reliable and least-cost high-performance aerostructures are pressing design analyses, materials, and manufacturing processes to new and narrowly experienced performance and verification technologies. This study assessed the adequacy of current experimental verification of the traditional binding ultimate safety factor which covers rare events in which no statistical design data exist. Because large high-performance structures are inherently very flexible, boundary rotations and deflections under externally applied loads approaching fracture may distort their transmission and unknowingly accept submarginal structures or prematurely fracturing reliable ones. A technique was developed, using measured strains from back-to-back surface mounted gauges, to analyze, define, and monitor induced moments and plane forces through progressive material changes from total-elastic to total-inelastic zones within the structural element cross section. Deviations from specified test loads are identified by the consecutively changing ratios of moment-to-axial load.
Static test induced loads verification beyond elastic limit
NASA Technical Reports Server (NTRS)
Verderaime, V.; Harrington, F.
1996-01-01
Increasing demands for reliable and least-cost high performance aerostructures are pressing design analyses, materials, and manufacturing processes to new and narrowly experienced performance and verification technologies. This study assessed the adequacy of current experimental verification of the traditional binding ultimate safety factor which covers rare events in which no statistical design data exist. Because large, high-performance structures are inherently very flexible, boundary rotations and deflections under externally applied loads approaching fracture may distort their transmission and unknowingly accept submarginal structures or prematurely fracturing reliable ones. A technique was developed, using measured strains from back-to-back surface mounted gauges, to analyze, define, and monitor induced moments and plane forces through progressive material changes from total-elastic to total inelastic zones within the structural element cross section. Deviations from specified test loads are identified by the consecutively changing ratios of moment-to-axial load.
Student assessment by objective structured examination in a neurology clerkship
Adesoye, Taiwo; Smith, Sandy; Blood, Angela; Brorson, James R.
2012-01-01
Objectives: We evaluated the reliability and predictive ability of an objective structured clinical examination (OSCE) in the assessment of medical students at the completion of a neurology clerkship. Methods: We analyzed data from 195 third-year medical students who took the OSCE. For each student, the OSCE consisted of 2 standardized patient encounters. The scores obtained from each encounter were compared. Faculty clinical evaluations of each student for 2 clinical inpatient rotations were also compared. Hierarchical regression analysis was applied to test the ability of the averaged OSCE scores to predict standardized written examination scores and composite clinical scores. Results: Students' OSCE scores from the 2 standardized patient encounters were significantly correlated with each other (r = 0.347, p < 0.001), and the scores for all students were normally distributed. In contrast, students' faculty clinical evaluation scores from 2 different clinical inpatient rotations were uncorrelated, and scores were skewed toward the highest ratings. After accounting for clerkship order, better OSCE scores were predictive of better National Board of Medical Examiners standardized examination scores (R2Δ = 0.131, p < 0.001) and of better faculty clinical scores (R2Δ = 0.078, p < 0.001). Conclusions: Student assessment by an OSCE provides a reliable and predictive objective assessment of clinical performance in a neurology clerkship. PMID:22855865
Martínez-González, Agustín E.; Rodríguez-Jiménez, Tíscar; Piqueras, José A.; Vera-Villarroel, Pablo; Godoy, Antonio
2015-01-01
In recent years, there has been a considerable increase in the development of assessment tools for obsessive-compulsive symptomatology in children and adolescents. The Obsessive Compulsive Inventory-Child Version (OCI-CV) is a well-established assessment self-report, with special interest for the assessment of dimensions of Obsessive Compulsive Disorder (OCD). This instrument has shown to be useful for clinical and non-clinical populations in two languages (English and European Spanish). Thus, the aim of this study was to analyze the psychometric properties of the OCI-CV in a Chilean community sample. The sample consisted of 816 children and adolescents with a mean age of 14.54 years (SD = 2.21; range = 10–18 years). Factor structure, internal consistency, test-retest reliability, convergent/divergent validity, and gender/age differences were examined. Confirmatory factor analysis showed a 6-factor structure (Doubting/Checking, Obsessing, Hoarding, Washing, Ordering, and Neutralizing) with one second-order factor. Good estimates of reliability (including internal consistency and test-retest), evidence supporting the validity, and small age and gender differences (higher levels of OCD symptomatology among older participants and women, respectively) are found. The OCI-CV is also an adequate scale for the assessment of obsessions and compulsions in a general population of Chilean children and adolescents. PMID:26317404
Koumpouros, Yiannis; Papageorgiou, Effie; Karavasili, Alexandra; Alexopoulou, Despoina
2017-07-01
To examine the Assistive Technology Device Predisposition Assessment scale and provide evidence of validity and reliability of the Greek version. We translated and adapted the original instrument in Greek according to the most well-known guidelines recommendations. Field test studies were conducted in a rehabilitation hospital to validate the appropriateness of the final results. Ratings of the different items were statistically analyzed. We recruited 115 subjects who were administered the Form E of the original questionnaire. The experimental analysis conducted revealed a three subscales structure: (i) Adaptability, (ii) Fit to Use, and (iii) Socializing. According to the results of our study the three subscales measure different constructs. Reliability measures (ICC = 0.981, Pearson's correlation = 0.963, Cronbach's α = 0.701) yielded high values. Test-retest outcome showed great stability. This is the first study, at least to the knowledge of the authors, which focuses merely on measuring the satisfaction of the users from the used assistive device, while exploring the Assistive Technology Device Predisposition Assessment - Device Form in such depth. According to the results, it is a stable, valid and reliable instrument and applicable to the Greek population. Thus, it can be used to measure the satisfaction of patients with assistive devices. Implications for Rehabilitation The paper explores the cultural adaptability and applicability of ATD PA - Device Form. ATD PA - Device Form can be used to assess user satisfaction by the selected assistive device. ATD PA - Device Form is a valid and reliable instrument in measuring users' satisfaction in Greekreality.
Tander, Berna; Ulus, Yasemin; Terzi, Yüksel; Zahiroğlu, Yeliz; Kesmen, Hakan; Farisoğullari, Bayram; Akyol, Yeşim; Bilgici, Ayhan; Kuru, Ömer
2016-12-01
This study aims to evaluate the reliability and validity of the Turkish language version of VITACORA-19 (psoriatic arthritis quality of life questionnaire) in patients with psoriatic arthritis. The Turkish version of VITACORA-19 questionnaire was obtained after a translation and back translation process. The study sample included 61 PsA patients (22 males, 39 females; mean age 46.5±12.2 years; range 19 to 71 years). To assess the test-retest reliability of the Turkish VITACORA-19, the questionnaire was reapplied 10 to 15 days after the first interview (interclass correlation coefficient). Cronbach's alpha (a) was used to evaluate the internal consistency. VITACORA-19 was compared with visual analog scale for physician and patient global assessments, the Health Assessment Questionnaire, and Nottingham Health Profile for construct validity. The internal structure of VITACORA-19 was examined by factor analysis. The individual item intraclass correlation coefficient ranged from 0.77 to 0.98 and Cronbach's alpha ranged from 0.77 to 0.98. The Cronbach's alpha value for whole scale was determined as 0.96. The Kaiser-Meyer-Olkin measure of sampling adequacy was 0.90, and Bartlett's test of sphericity had a p<0.001. Turkish VITACORA-19 total scores were correlated negatively with Health Assessment Questionnaire, visual analog scale for pain, and Nottingham Health Profile subgroups, and positively with physician and patient global assessments (p<0.01). Turkish version of VITACORA-19 questionnaire is a reliable and valid measure for health-related quality of life in Turkish patients with psoriatic arthritis.
ULUS, Yasemin; TERZİ, Yüksel; ZAHİROĞLU, Yeliz; KESMEN, Hakan; FARİSOĞULLARI, Bayram; AKYOL, Yeşim; BİLGİCİ, Ayhan; KURU, Ömer
2016-01-01
Objectives This study aims to evaluate the reliability and validity of the Turkish language version of VITACORA-19 (psoriatic arthritis quality of life questionnaire) in patients with psoriatic arthritis. Patients and methods The Turkish version of VITACORA-19 questionnaire was obtained after a translation and back translation process. The study sample included 61 PsA patients (22 males, 39 females; mean age 46.5±12.2 years; range 19 to 71 years). To assess the test-retest reliability of the Turkish VITACORA-19, the questionnaire was reapplied 10 to 15 days after the first interview (interclass correlation coefficient). Cronbach’s alpha (a) was used to evaluate the internal consistency. VITACORA-19 was compared with visual analog scale for physician and patient global assessments, the Health Assessment Questionnaire, and Nottingham Health Profile for construct validity. The internal structure of VITACORA-19 was examined by factor analysis. Results The individual item intraclass correlation coefficient ranged from 0.77 to 0.98 and Cronbach's alpha ranged from 0.77 to 0.98. The Cronbach's alpha value for whole scale was determined as 0.96. The Kaiser-Meyer-Olkin measure of sampling adequacy was 0.90, and Bartlett's test of sphericity had a p<0.001. Turkish VITACORA-19 total scores were correlated negatively with Health Assessment Questionnaire, visual analog scale for pain, and Nottingham Health Profile subgroups, and positively with physician and patient global assessments (p<0.01). Conclusion Turkish version of VITACORA-19 questionnaire is a reliable and valid measure for health-related quality of life in Turkish patients with psoriatic arthritis. PMID:29900999
De Vet, Emely; De Ridder, Denise; Stok, Marijn; Brunso, Karen; Baban, Adriana; Gaspar, Tania
2014-09-02
Applying self-regulation strategies have proven important in eating behaviors, but it remains subject to investigation what strategies adolescents report to use to ensure healthy eating, and adequate measures are lacking. Therefore, we developed and validated a self-regulation questionnaire applied to eating (TESQ-E) for adolescents. Study 1 reports a four-step approach to develop the TESQ-E questionnaire (n = 1097). Study 2 was a cross-sectional survey among adolescents from nine European countries (n = 11,392) that assessed the TESQ-E, eating-related behaviors, dietary intake and background characteristics. In study 3, the TESQ-E was administered twice within four weeks to evaluate test-retest reliability (n = 140). Study 4 was a cross-sectional survey (n = 93) that assessed the TESQ-E and related psychological constructs (e.g., motivation, autonomy, self-control). All participants were aged between 10 and 17 years. Study 1 resulted in a 24-item questionnaire assessing adolescent-reported use of six specific strategies for healthy eating that represent three general self-regulation approaches. Study 2 showed that the easy-to-administer theory-based TESQ-E has a clear factor structure and good subscale reliabilities. The questionnaire was related to eating-related behaviors and dietary intake, indicating predictive validity. Study 3 showed good test-retest reliabilities for the TESQ-E. Study 4 indicated that TESQ-E was related to but also distinguishable from general self-regulation and motivation measures. The TESQ-E provides a reliable and valid measure to assess six theory-based self-regulation strategies that adolescents may use to ensure their healthy eating.
The Cardiff Acne Disability Index (CADI): linguistic and cultural validation in Serbian.
Jankovic, Slavenka; Vukicevic, Jelica; Djordjevic, Sanja; Jankovic, Janko; Marinkovic, Jelena; Basra, Mohammad K A
2013-02-01
The aims of this study were to translate the Cardiff Acne Disability Index (CADI) into Serbian and to assess its validity and reliability in Serbian acne patients. The CADI was translated and linguistically validated into Serbian according to published guidelines. This version of CADI, along with the Serbian version of Children's Dermatology Life Quality Index (CDLQI) and a short demographic questionnaire, was administrated to a cohort of secondary school pupils. The Global Acne Grading Score was used to measure the clinical severity of acne. The internal consistency reliability of the Serbian version of CADI was assessed by Cronbach's alpha coefficient while its concurrent validity was assessed by Spearman's correlation coefficient. Construct validity was examined by factor analysis. A total of 465 pupils completed questionnaires. Self-reported acne was present in 76% of pupils (353/465). The Serbian version of CADI showed high internal consistency reliability (Cronbach's alpha coefficient = 0.79). The mean item-total correlation coefficient was 0.74 with a range of 0.53-0.81. The concurrent validity of the scale was supported by a moderate but highly significant correlation with the CDLQI (Spearman's rho = 0.66; P < 0.001). Factor analysis revealed the presence of two dimensions underlying the factor structure of the scale. The Serbian version of the CADI is a reliable, valid, and valuable tool for assessing the impact of acne on the quality of life of Serbian-speaking patients.
NASA Astrophysics Data System (ADS)
Castellarin, A.; Montanari, A.; Brath, A.
2002-12-01
The study derives Regional Depth-Duration-Frequency (RDDF) equations for a wide region of northern-central Italy (37,200 km 2) by following an adaptation of the approach originally proposed by Alila [WRR, 36(7), 2000]. The proposed RDDF equations have a rather simple structure and allow an estimation of the design storm, defined as the rainfall depth expected for a given storm duration and recurrence interval, in any location of the study area for storm durations from 1 to 24 hours and for recurrence intervals up to 100 years. The reliability of the proposed RDDF equations represents the main concern of the study and it is assessed at two different levels. The first level considers the gauged sites and compares estimates of the design storm obtained with the RDDF equations with at-site estimates based upon the observed annual maximum series of rainfall depth and with design storm estimates resulting from a regional estimator recently developed for the study area through a Hierarchical Regional Approach (HRA) [Gabriele and Arnell, WRR, 27(6), 1991]. The second level performs a reliability assessment of the RDDF equations for ungauged sites by means of a jack-knife procedure. Using the HRA estimator as a reference term, the jack-knife procedure assesses the reliability of design storm estimates provided by the RDDF equations for a given location when dealing with the complete absence of pluviometric information. The results of the analysis show that the proposed RDDF equations represent practical and effective computational means for producing a first guess of the design storm at the available raingauges and reliable design storm estimates for ungauged locations. The first author gratefully acknowledges D.H. Burn for sponsoring the submission of the present abstract.
Mouthon, L; Rannou, F; Bérezné, A; Pagnoux, C; Arène, J‐P; Foïs, E; Cabane, J; Guillevin, L; Revel, M; Fermanian, J; Poiraudeau, S
2007-01-01
Objective To develop and assess the reliability and construct validity of a scale assessing disability involving the mouth in systemic sclerosis (SSc). Methods We generated a 34‐item provisional scale from mailed responses of patients (n = 74), expert consensus (n = 10) and literature analysis. A total of 71 other SSc patients were recruited. The test–retest reliability was assessed using the intraclass coefficient correlation and divergent validity using the Spearman correlation coefficient. Factor analysis followed by varimax rotation was performed to assess the factorial structure of the scale. Results The item reduction process retained 12 items with 5 levels of answers (total score range 0–48). The mean total score of the scale was 20.3 (SD 9.7). The test–retest reliability was 0.96. Divergent validity was confirmed for global disability (Health Assessment Questionnaire (HAQ), r = 0.33), hand function (Cochin Hand Function Scale, r = 0.37), inter‐incisor distance (r = −0.34), handicap (McMaster‐Toronto Arthritis questionnaire (MACTAR), r = 0.24), depression (Hospital Anxiety and Depression (HAD); HADd, r = 0.26) and anxiety (HADa, r = 0.17). Factor analysis extracted 3 factors with eigenvalues of 4.26, 1.76 and 1.47, explaining 63% of the variance. These 3 factors could be clinically characterised. The first factor (5 items) represents handicap induced by the reduction in mouth opening, the second (5 items) handicap induced by sicca syndrome and the third (2 items) aesthetic concerns. Conclusion We propose a new scale, the Mouth Handicap in Systemic Sclerosis (MHISS) scale, which has excellent reliability and good construct validity, and assesses specifically disability involving the mouth in patients with SSc. PMID:17502364
Bidell, Markus P
2017-01-01
These three studies provide initial evidence for the development, factor structure, reliability, and validity of the Lesbian, Gay, Bisexual, and Transgender Development of Clinical Skills Scale (LGBT-DOCSS), a new interdisciplinary LGBT clinical self-assessment for health and mental health providers. Research participants were voluntarily recruited in the United States and United Kingdom and included trainees, clinicians, and educators from applied psychology, counseling, psychotherapy, and primary care medicine. Study 1 (N = 602) used exploratory and confirmatory factor analytic techniques, revealing an 18-item three-factor structure (Clinical Preparedness, Attitudinal Awareness, and Basic Knowledge). Study 2 established internal consistency for the overall LGBT-DOCSS (α = .86) and for each of the three subscales (Clinical Preparedness = .88, Attitudinal Awareness = .80, and Basic Knowledge = .83) and 2-week test-retest reliability (.87). In study 3 (N = 564), participant criteria (sexual orientation and education level) and four established scales that measured LGBT prejudice, assessment skills, and social desirability were used to support initial content and discriminant validity. Psychometric properties, limitations, and recommendations are discussed.
Tomography reconstruction methods for damage diagnosis of wood structure in construction field
NASA Astrophysics Data System (ADS)
Qiu, Qiwen; Lau, Denvid
2018-03-01
The structural integrity of wood building element plays a critical role in the public safety, which requires effective methods for diagnosis of internal damage inside the wood body. Conventionally, the non-destructive testing (NDT) methods such as X-ray computed tomography, thermography, radar imaging reconstruction method, ultrasonic tomography, nuclear magnetic imaging techniques, and sonic tomography have been used to obtain the information about the internal structure of wood. In this paper, the applications, advantages and disadvantages of these traditional tomography methods are reviewed. Additionally, the present article gives an overview of recently developed tomography approach that relies on the use of mechanical and electromagnetic waves for assessing the structural integrity of wood buildings. This developed tomography reconstruction method is believed to provide a more accurate, reliable, and comprehensive assessment of wood structural integrity
Blanchin, Myriam; Dauchy, Sarah; Cano, Alejandra; Brédart, Anne; Aaronson, Neil K; Hardouin, Jean-Benoit
2015-07-29
The Impact of Cancer version 2 (IOCv2) was designed to assess the physical and psychosocial health experience of cancer survivors through its positive and negative impacts. Although the IOCv2 is available in English and Dutch, it has not yet been validated for use in French-speaking populations. The current study was undertaken to provide a comprehensive assessment of the reliability and validity of the French language version of the IOCv2 in a sample of breast cancer survivors. An adapted French version of the IOCv2 as well as demographic and medical information were completed by 243 women to validate the factor structure divergent/divergent validities and reliability. Concurrent validity was assessed by correlating the IOCv2 scales with measures from the SF-12, PostTraumatic Growth Inventory and Fear of Cancer Recurrence Inventory. The French version of the IOCv2 supports the structure of the original version, with four positive impact dimensions and four negative impact dimensions. This result was suggested by the good fit of the confirmatory factor analysis and the adequate reliability revealed by Cronbach's alpha coefficients and other psychometric indices. The concurrent validity analysis revealed patterns of association between IOCv2 scale scores and other measures. Unlike the original version, a structure with a Positive Impact domain consisting in the IOCv2 positive dimensions and a Negative Impact domain consisting in the negative ones has not been clearly evidenced in this study. The limited practical use of the conditional dimensions Employment Concerns and Relationship Concerns, whether the patient is partnered or not, did not make possible to provide evidence of validity and reliability of these dimensions as the subsets of sample to work with were not large enough. The scores of these conditional dimensions have to be used with full knowledge of the facts of this limitation of the study. Integrating IOCv2 into studies will contribute to evaluate the psychosocial health experience of the growing population of cancer survivors, enabling better understanding of the multi-dimensional impact of cancer.
Viljoen, Jodi L.; Gray, Andrew L.; Shaffer, Catherine; Latzman, Natasha E.; Scalora, Mario J.; Ullman, Daniel
2018-01-01
Although the Juvenile Sex Offender Assessment Protocol–II (J-SOAP-II) and the Structured Assessment of Violence Risk in Youth (SAVRY) include an emphasis on dynamic, or modifiable factors, there has been little research on dynamic changes on these tools. To help address this gap, we compared admission and discharge scores of 163 adolescents who attended a residential, cognitive-behavioral treatment program for sexual offending. Based on reliable change indices, one half of youth showed a reliable decrease on the J-SOAP-II Dynamic Risk Total Score and one third of youth showed a reliable decrease on the SAVRY Dynamic Risk Total Score. Contrary to expectations, decreases in risk factors and increases in protective factors did not predict reduced sexual, violent nonsexual, or any reoffending. In addition, no associations were found between scores on the Psychopathy Checklist:Youth Version and levels of change. Overall, the J-SOAP-II and the SAVRY hold promise in measuring change, but further research is needed. PMID:26199271
Viljoen, Jodi L; Gray, Andrew L; Shaffer, Catherine; Latzman, Natasha E; Scalora, Mario J; Ullman, Daniel
2017-06-01
Although the Juvenile Sex Offender Assessment Protocol-II (J-SOAP-II) and the Structured Assessment of Violence Risk in Youth (SAVRY) include an emphasis on dynamic, or modifiable factors, there has been little research on dynamic changes on these tools. To help address this gap, we compared admission and discharge scores of 163 adolescents who attended a residential, cognitive-behavioral treatment program for sexual offending. Based on reliable change indices, one half of youth showed a reliable decrease on the J-SOAP-II Dynamic Risk Total Score and one third of youth showed a reliable decrease on the SAVRY Dynamic Risk Total Score. Contrary to expectations, decreases in risk factors and increases in protective factors did not predict reduced sexual, violent nonsexual, or any reoffending. In addition, no associations were found between scores on the Psychopathy Checklist:Youth Version and levels of change. Overall, the J-SOAP-II and the SAVRY hold promise in measuring change, but further research is needed.
Lee, Andrew G; Boldt, H Culver; Golnik, Karl C; Arnold, Anthony C; Oetting, Thomas A; Beaver, Hilary A; Olson, Richard J; Zimmerman, M Bridget; Carter, Keith
2006-03-01
To describe the use of the journal club as a tool to teach and assess competency in practice-based learning (PBL) and improvement among residents in ophthalmology. Interventional case series. Ophthalmology residents. Three academic ophthalmology residency programs in the United States. A survey was performed of self-assessed skills in PBL among residents in ophthalmology training before and after the implementation of a structured review checklist during a traditional resident journal club. The survey had 5 domains, including (A) appraise and assimilate evidence, (B) read a journal article critically, (C) use a systematic and standardized checklist, (D) apply knowledge of study designs and statistical methods, and (E) maintain a self-documented written record of compliance. The respondents scored their ability (range, 1-5). The use of a structured journal club tool was associated with a statistically significant improvement in self-assessed ability in all 5 domains. Although validity, reliability, and long-term efficacy studies are necessary, the structured journal club is one method of teaching and assessing resident competency in PBL and improvement.
O'Connor, Teresia M; Cerin, Ester; Hughes, Sheryl O; Robles, Jessica; Thompson, Deborah I; Mendoza, Jason A; Baranowski, Tom; Lee, Rebecca E
2014-01-15
Latino preschoolers (3-5 year old children) have among the highest rates of obesity. Low levels of physical activity (PA) are a risk factor for obesity. Characterizing what Latino parents do to encourage or discourage their preschooler to be physically active can help inform interventions to increase their PA. The objective was therefore to develop and assess the psychometrics of a new instrument: the Preschooler Physical Activity Parenting Practices (PPAPP) among a Latino sample, to assess parenting practices used to encourage or discourage PA among preschool-aged children. Cross-sectional study of 240 Latino parents who reported the frequency of using PA parenting practices. 95% of respondents were mothers; 42% had more than a high school education. Child mean age was 4.5 (±0.9) years (52% male). Test-retest reliability was assessed in 20%, 2 weeks later. We assessed the fit of a priori models using Confirmatory factor analyses (CFA). In a separate sub-sample (35%), preschool-aged children wore accelerometers to assess associations with their PA and PPAPP subscales. The a-priori models showed poor fit to the data. A modified factor structure for encouraging PPAPP had one multiple-item scale: engagement (15 items), and two single-items (have outdoor toys; not enroll in sport-reverse coded). The final factor structure for discouraging PPAPP had 4 subscales: promote inactive transport (3 items), promote screen time (3 items), psychological control (4 items) and restricting for safety (4 items). Test-retest reliability (ICC) for the two scales ranged from 0.56-0.85. Cronbach's alphas ranged from 0.5-0.9. Several sub-factors correlated in the expected direction with children's objectively measured PA. The final models for encouraging and discouraging PPAPP had moderate to good fit, with moderate to excellent test-retest reliabilities. The PPAPP should be further evaluated to better assess its associations with children's PA and offers a new tool for measuring PPAPP among Latino families with preschool-aged children.
Schnyer, Rosa N; Conboy, Lisa A; Jacobson, Eric; McKnight, Patrick; Goddard, Thomas; Moscatelli, Francesca; Legedza, Anna T R; Kerr, Catherine; Kaptchuk, Ted J; Wayne, Peter M
2005-12-01
The diagnostic framework and clinical reasoning process in Chinese medicine emphasizes the contextual and qualitative nature of a patient's illness. Chinese medicine assessment data may help interpret clinical outcomes. As part of a study aimed at assessing the validity and improving the inter-rater reliability of the Chinese diagnostic process, a structured assessment instrument was developed for use in clinical trials of acupuncture and other Chinese medical therapies. To foster collaboration and maximize resources and information, an interdisciplinary advisory team was assembled. Under the guidance of two group process facilitators, and in order to establish whether the assessment instrument was consistent with accepted Chinese medicine diagnostic categories (face validity) and included the full range of each concept's meaning (content validity), a panel of Traditional Chinese Medicine (TCM) expert clinicians was convened and their responses were organized using the Delphi process, an iterative, anonymous, idea-generating and consensus-building process. An aggregate rating measure was obtained by taking the mean of mean ratings for each question across all 10 experts. Over three rounds, the overall rating increased from 7.4 (SD = 1.3) in Round 1 to 9.1 (SD = 0.5) in Round 3. The level of agreement among clinicians was measured by a decrease in SD. The final instrument TEAMSI-TCM (Traditional East Asian Medicine Structured Interview, TCM version) uses the pattern differentiation model characteristic of TCM. This modular, dynamic version was specifically designed to assess women, with a focus on gynecologic conditions; with modifications it can be adapted for use with other populations and conditions. TEAMSI-TCM is a prescriptive instrument that guides clinicians to use the proper indicators, combine them in a systematic manner, and generate conclusions. In conjunction with treatment manualization and training it may serve to increase inter-rater reliability and inter-trial reproducibility in Chinese medicine clinical trials. Testing of the validity and reliability of this instrument currently is underway.
The German Version of the Herth Hope Index (HHI-D): Development and Psychometric Properties.
Geiser, Franziska; Zajackowski, Katharina; Conrad, Rupert; Imbierowicz, Katrin; Wegener, Ingo; Herth, Kaye A; Urbach, Anne Sarah
2015-01-01
The importance of hope is evident in clinical oncological care. Hope is associated with psychological and also physical functioning. However, there is still a dearth of empirical research on hope as a multidimensional concept. The Herth Hope Index is a reliable and valid instrument for the measurement of hope and is available in many languages. Until now no authorized German translation has been published and validated. After translation, the questionnaire was completed by 192 patients with different tumor entities in radiation therapy. Reliability, concurrent validity, and factor structure of the questionnaire were determined. Correlations were high with depression and anxiety as well as optimism and pessimism. As expected, correlations with coping styles were moderate. Internal consistency and test-retest reliability were satisfactory. We could not replicate the original 3-factor model. Application of the scree plot criterion in an exploratory factor analysis resulted in a single-factor structure. The Herth Hope Index - German Version (HHI-D) is a short, reliable, and valid instrument for the assessment of hope in patient populations. We recommend using only the HHI-D total score until further research gives more insights into possible factorial solutions and subscales. © 2015 S. Karger GmbH, Freiburg.
DOT National Transportation Integrated Search
2009-09-01
Post-tensioned (PT) bridges are major structures that carry significant traffic. PT bridges are economical for spanning long distances. : In Texas, there are several signature PT bridges. In the late 1990s and early 2000s, several state highway agenc...
Teacher Well-Being: Exploring Its Components and a Practice-Oriented Scale
ERIC Educational Resources Information Center
Collie, Rebecca J.; Shapka, Jennifer D.; Perry, Nancy E.; Martin, Andrew J.
2015-01-01
This study examined the psychometric properties of the Teacher Well-Being Scale, which assesses three factors of teachers' work-related well-being: workload, organizational, and student interaction well-being. With a sample of Canadian teachers, results confirmed the reliability, approximate normality, and factor structure of the scale; provided…
Validity and Reliability of the Teamwork Scale for Youth
ERIC Educational Resources Information Center
Lower, Leeann M.; Newman, Tarkington J.; Anderson-Butcher, Dawn
2017-01-01
Purpose: This study examines the psychometric properties of the Teamwork Scale for Youth, an assessment designed to measure youths' perceptions of their teamwork competency. Methods: The Teamwork Scale for Youth was administered to a sample of 460 youths. Confirmatory factor analyses examined the factor structure and measurement invariance of the…
Commentary on Coefficient Alpha: A Cautionary Tale
ERIC Educational Resources Information Center
Green, Samuel B.; Yang, Yanyun
2009-01-01
The general use of coefficient alpha to assess reliability should be discouraged on a number of grounds. The assumptions underlying coefficient alpha are unlikely to hold in practice, and violation of these assumptions can result in nontrivial negative or positive bias. Structural equation modeling was discussed as an informative process both to…
A Psychometric Evaluation of the Core Bereavement Items
ERIC Educational Resources Information Center
Holland, Jason M.; Nam, Ilsung; Neimeyer, Robert A.
2013-01-01
Despite being a routinely administered assessment of grieving, few studies have empirically examined the psychometric properties of the Core Bereavement Items (CBI). The present study investigated the factor structure, internal reliability, and concurrent validity of the CBI in a large, diverse sample of bereaved young adults (N = 1,366).…
Development and Initial Psychometrics of the Counselor Burnout Inventory
ERIC Educational Resources Information Center
Lee, Sang Min; Baker, Crystal R.; Cho, Seong Ho; Heckathorn, Danette E.; Holland, Michael W.; Newgent, Rebecca A.; Ogle, Nick T.; Powell, Michael L.; Quinn, James J.; Wallace, Sam L.; Yu, Kumlan
2007-01-01
This article describes the development and psychometric properties of the Counselor Burnout Inventory (CBI), which is designed to meet the needs of the counseling profession by assessing burnout in counselors. Factor structure, concurrent validity, internal consistency, and test-retest reliability of the CBI scores are reported. Implications for…
Giezen, Hilde; Stevens, Martin; van den Akker-Scheek, Inge; Reininga, Inge H F
2017-01-01
The Copenhagen Hip And Groin Outcome Score (HAGOS) was developed to assess disease-specific consequences in young to middle-aged, physically active hip and/or groin patients. The study aimed to determine validity and reliability of the Dutch version of the HAGOS (HAGOS-NL) for middle-aged patients with hip complaints. To assess validity, 117 participants completed five questionnaires: HAGOS-NL, international Hip Outcome Tool (iHOT-12NL), Hip disability and Osteoarthritis Outcome Score (HOOS), RAND-36 Health Survey and Tegner activity scale. Structural validity was determined by conducting confirmatory factor analysis. Construct validity was analyzed by formulating predefined hypotheses regarding relationships between the HAGOS-NL and subscales of the iHOT-12NL, HOOS, RAND-36 and Tegner activity scale. The HAGOS-NL was filled out again by 67 patients to explore test-retest reliability. Reliability was assessed in terms of Cronbach's alpha, Intraclass Correlation Coefficient (ICC), Standard Error of Measurement (SEM) and Minimal Detectable Change (MDC). The Bland and Altman method was used to explore absolute agreement. Factor analysis confirmed that the HAGOS-NL consists of six subscales. All hypotheses were confirmed, indicating good construct validity. Internal consistency was good, with Cronbach's alpha values ranging from 0.89 to 0.98. Test-retest reliability was considered good, with ICC values of 0.80 and higher. The SEM ranged from 6.6 to 12.3, and MDC at individual level from 18.3 to 34.1 and at group level from 2.3 to 4.4. Bland and Altman analyses showed no bias. The HAGOS-NL is a reliable and valid instrument for measuring pain, physical functioning and quality of life in middle-aged patients with hip complaints.
Cross-cultural Adaption and Validation of the Danish Voice Handicap Index.
Sorensen, Jesper Roed; Printz, Trine; Mehlum, Camilla Slot; Heidemann, Christian Hamilton; Groentved, Aagot Moeller; Godballe, Christian
2018-02-02
We aimed to assess psychometric properties, including internal consistency, reliability, and clinical validity of the Danish version of the Voice Handicap Index (VHI). A cross-sectional survey study was carried out. For validation, the existing nonvalidated Danish version of the VHI was used. Data from 208 patients with voice disorders of different etiology (neurogenic, functional, and structural) and a control group of 85 vocally healthy individuals were included. A test-retest reliability analysis of 42 patients and 45 control persons was performed. The internal consistency, test-retest reliability, and clinical validity of the questionnaire were assessed. Internal consistency was high with a Cronbach α >0.90 for both the patient and control group. Test-retest reliability measured as intraclass correlation coefficient was good with 0.93 (95% confidence interval [95% confidence interval]: 0.87-0.96) for patients and 0.78 (95% confidence interval: 0.63-0.87) for the control group which indicates sufficient reliability of the questionnaire. The Danish VHI has good clinical validity as it has a strong correlation between patient's perception of the severity of their voice disorder and the VHI score from the Spearman correlation of 0.69. The existing Danish version of the VHI has been thoroughly validated and found to be in line with the original VHI from Jacobsen et al. It showed good internal consistency, test-retest reliability, and clinical validity. It is suitable for use in daily practice and in research projects as it is able to assess patients' perception of their voice disorder severity. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
de Almeida Vieira Monteiro, Ana Paula Teixeira; Fernandes, Alexandre Bastos
2016-05-17
Cultural competence is an essential component in rendering effective and culturally responsive services to culturally and ethnically diverse clients. Still, great difficulty exists in assessing the cultural competence of mental health nurses. There are no Portuguese validated measurement instruments to assess cultural competence in mental health nurses. This paper reports a study testing the reliability and validity of the Portuguese version of the Multicultural Mental Health Awareness Scale-MMHAS in a sample of Portuguese nurses. Following a standard forward/backward translation into Portuguese, the adapted version of MMHAS, along with a sociodemographic questionnaire, were applied to a sample of 306 Portuguese nurses (299 males, 77 females; ages 21-68 years, M = 35.43, SD = 9.85 years). A psychometric research design was used with content and construct validity and reliability. Reliability was assessed using internal consistency and item-total correlations. Construct validity was determined using factor analysis. The factor analysis confirmed that the Portuguese version of MMHAS has a three-factor structure of multicultural competencies (Awareness, Knowledge, and Skills) explaining 59.51% of the total variance. Strong content validity and reliability correlations were demonstrated. The Portuguese version of MMHAS has a strong internal consistency, with a Cronbach's alpha of 0.958 for the total scale. The results supported the construct validity and reliability of the Portuguese version of MMHAS, proving that is a reliable and valid measure of multicultural counselling competencies in mental health nursing. The MMHAS Portuguese version can be used to evaluate the effectiveness of multicultural competency training programs in Portuguese-speaking mental health nurses. The scale can also be a useful in future studies of multicultural competencies in Portuguese-speaking nurses.
Fang, Jin-Bo; Zhou, Chun-Fen; Huang, Jing; Qiu, Chang-Jian
2018-06-01
The Occupational Fatigue Exhaustion/Recovery Scale (OFER) was designed to assess occupational fatigue in nurses. Although the original English version of this instrument has shown high degrees of reliability and validity, a Chinese version of this scale has yet to be verified. The aim of this study was to evaluate the psychometric properties of the OFER in a population of Chinese nurses. The scale was translated using translation and back-translation. The validities and reliabilities were evaluated on 923 qualified participants using content validity index, concurrent validity, factorial validity, internal consistency reliability, and test-retest reliability. The content validity index for the OFER was .92. The correlation coefficients between the scores of the OFER subscales and the criteria in this study (varying from -.498 to .705) verified that the OFER has acceptable concurrent validity. Principal component analysis and confirmatory factor analysis revealed that three factors correspond to the structure of the original instrument and that recovery mediates the relationship between acute and chronic fatigue. The Cronbach's alpha for the chronic fatigue, acute fatigue, and intershift recovery subscales were .83, .85, and .86, respectively. Test-retest reliabilities with correlation coefficients from .61 to .78 were found in the three subscales. OFER is a reliable and valid instrument for assessing work-related fatigue in Chinese nurses. However, further improvement of the acute fatigue subscale is recommended. The OFER has the potential to elicit information that is useful for assessing fatigue in nurses in China. Furthermore, as it differentiates between acute and chronic fatigue, OFER may be an effective tool for guiding the development and implementation of various, related intervention measures.
Zangger, Graziella; Zwisler, Ann-Dorthe; Kikkenborg Berg, Selina; Kristensen, Marie S; Grønset, Charlotte N; Uddin, Jamal; Pedersen, Susanne S; Oldridge, Neil B; Thygesen, Lau C
2018-01-01
Background Patient-reported health-related quality of life is increasingly used as an outcome measure in clinical trials and as a performance measure to evaluate quality of care. The objective of this study was to assess the psychometric properties of the Danish HeartQoL questionnaire, a core heart disease-specific health-related quality of life questionnaire, in implantable cardioverter defibrillator recipients. Design This study involved cross-sectional and test-retest study designs. Method Implantable cardioverter defibrillator recipients in the cross-sectional study completed the HeartQoL, the Short-Form 36 Health Survey, and the Hospital Anxiety and Depression Scale. The HeartQoL structure, construct-related validity (convergent and discriminative) and reliability (internal consistency) were assessed. HeartQoL reproducibility (test-retest) was assessed in an independent sample of implantable cardioverter defibrillator recipients. Results Mokken scale analysis supported the bi-dimensional structure of HeartQoL among 358 implantable cardioverter defibrillator recipients. Convergent ( r > 0.72) and discriminative validity were confirmed. The HeartQoL scales demonstrated satisfactory internal consistency (Cronbach's alpha > 0.90). Test-retest reliability (two weeks interval) was assessed in 89 implantable cardioverter defibrillator recipients and found to be acceptable for each scale (intra-class correlation > 0.90). Conclusion The Danish HeartQoL questionnaire demonstrated satisfactory key psychometric attributes of validity and reliability in this implantable cardioverter defibrillator population. This study adds support for the HeartQoL as a core heart-specific health-related quality of life questionnaire in a broad group of patients with heart disease including implantable cardioverter defibrillator recipients.
Pinna, Federica; Diana, Enrica; Sanna, Lucia; Deiana, Valeria; Manchia, Mirko; Nicotra, Eraldo; Fiorillo, Andrea; Albert, Umberto; Nivoli, Alessandra; Volpe, Umberto; Atti, Anna Rita; Ferrari, Silvia; Medda, Federica; Atzeni, Maria Gloria; Manca, Daniela; Mascia, Elisa; Farci, Fernando; Ghiani, Mariangela; Cau, Rossella; Tuveri, Marta; Cossu, Efisio; Loy, Elena; Mereu, Alessandra; Mariotti, Stefano; Carpiniello, Bernardo
2017-07-19
The purpose of the study was to evaluate in a sample of insulin-treated diabetic patients, with type 1 or type 2 diabetes, the psychometric characteristics of the Italian version of the DEPS-R scale, a diabetes-specific self-report questionnaire used to analyze disordered eating behaviors. The study was performed on 211 consecutive insulin-treated diabetic patients attending two specialist centers. Lifetime prevalence of eating disorders (EDs) according to DSM-IV and DSM-5 criteria were assessed by means of the Module H of the Structured Clinical Interview for DSM IV Axis I Disorder and the Module H modified, according to DSM-5 criteria. The following questionnaires were administered: DEPS-R and the Eating Disorder Inventory - 3 (EDI-3). Test/retest reproducibility was assessed on a subgroup of 70 patients. The factorial structure, internal consistency, test-retest reliability and concurrent validity of DEPS-R were assessed. Overall, 21.8% of the sample met criteria for at least one DSM-5 diagnosis of ED. A "clinical risk" of ED was observed in 13.3% of the sample. Females displayed higher scores at DEPS-R, a higher percentage of at least one diagnosis of ED and a higher clinical risk for ED. A high level of reproducibility and homogeneity of the scale were revealed. A significant correlation was detected between DEPS-R and the 3 ED risk scales of EDI-3. The data confirmed the overall reliability and validity of the scale. In view of the significance and implications of EDs in diabetic patients, it should be conducted a more extensive investigation of the phenomenon by means of evaluation instruments of demonstrated validity and reliability.
[Development of a Japanese version of the TALE scale].
Ochiai, Tsutomu; Oguchi, Takashi
2013-12-01
The Thinking About Life Experiences (TALE) Scale (Bluck & Alea, 2011) has three subscales that assess the self, social, and directive functions of autobiographical memory. This study constructs a Japanese version of the TALE Scale and examines its reliability and validity. Fifteen items that assess the three functions of autobiographical memory were translated into Japanese. We conducted an online investigation with 600 men and women between 20-59 years of age. In Study 1, exploratory and confirmatory factor analysis identified that the three-factor structure of the Japanese version of the TALE Scale was the same as the original TALE Scale. Sufficient internal consistency of the scale was found, and the construct validity of the scale was supported by correlation analysis. Study 2 confirmed that the test-retest reliabilities of the three subscales were sufficient. Thus, this Japanese version of the TALE Scale is useful to assess autobiographical memory functions in Japan.
Seismic and Restoration Assessment of Monumental Masonry Structures
Asteris, Panagiotis G.; Douvika, Maria G.; Apostolopoulou, Maria; Moropoulou, Antonia
2017-01-01
Masonry structures are complex systems that require detailed knowledge and information regarding their response under seismic excitations. Appropriate modelling of a masonry structure is a prerequisite for a reliable earthquake-resistant design and/or assessment. However, modelling a real structure with a robust quantitative (mathematical) representation is a very difficult, complex and computationally-demanding task. The paper herein presents a new stochastic computational framework for earthquake-resistant design of masonry structural systems. The proposed framework is based on the probabilistic behavior of crucial parameters, such as material strength and seismic characteristics, and utilizes fragility analysis based on different failure criteria for the masonry material. The application of the proposed methodology is illustrated in the case of a historical and monumental masonry structure, namely the assessment of the seismic vulnerability of the Kaisariani Monastery, a byzantine church that was built in Athens, Greece, at the end of the 11th to the beginning of the 12th century. Useful conclusions are drawn regarding the effectiveness of the intervention techniques used for the reduction of the vulnerability of the case-study structure, by means of comparison of the results obtained. PMID:28767073
Seismic and Restoration Assessment of Monumental Masonry Structures.
Asteris, Panagiotis G; Douvika, Maria G; Apostolopoulou, Maria; Moropoulou, Antonia
2017-08-02
Masonry structures are complex systems that require detailed knowledge and information regarding their response under seismic excitations. Appropriate modelling of a masonry structure is a prerequisite for a reliable earthquake-resistant design and/or assessment. However, modelling a real structure with a robust quantitative (mathematical) representation is a very difficult, complex and computationally-demanding task. The paper herein presents a new stochastic computational framework for earthquake-resistant design of masonry structural systems. The proposed framework is based on the probabilistic behavior of crucial parameters, such as material strength and seismic characteristics, and utilizes fragility analysis based on different failure criteria for the masonry material. The application of the proposed methodology is illustrated in the case of a historical and monumental masonry structure, namely the assessment of the seismic vulnerability of the Kaisariani Monastery, a byzantine church that was built in Athens, Greece, at the end of the 11th to the beginning of the 12th century. Useful conclusions are drawn regarding the effectiveness of the intervention techniques used for the reduction of the vulnerability of the case-study structure, by means of comparison of the results obtained.
Krüger-Gottschalk, Antje; Knaevelsrud, Christine; Rau, Heinrich; Dyer, Anne; Schäfer, Ingo; Schellong, Julia; Ehring, Thomas
2017-11-28
The Posttraumatic Stress Disorder (PTSD) Checklist (PCL, now PCL-5) has recently been revised to reflect the new diagnostic criteria of the disorder. A clinical sample of trauma-exposed individuals (N = 352) was assessed with the Clinician Administered PTSD Scale for DSM-5 (CAPS-5) and the PCL-5. Internal consistencies and test-retest reliability were computed. To investigate diagnostic accuracy, we calculated receiver operating curves. Confirmatory factor analyses (CFA) were performed to analyze the structural validity. Results showed high internal consistency (α = .95), high test-retest reliability (r = .91) and a high correlation with the total severity score of the CAPS-5, r = .77. In addition, the recommended cutoff of 33 on the PCL-5 showed high diagnostic accuracy when compared to the diagnosis established by the CAPS-5. CFAs comparing the DSM-5 model with alternative models (the three-factor solution, the dysphoria, anhedonia, externalizing behavior and hybrid model) to account for the structural validity of the PCL-5 remained inconclusive. Overall, the findings show that the German PCL-5 is a reliable instrument with good diagnostic accuracy. However, more research evaluating the underlying factor structure is needed.
Feelings about culture scales: development, factor structure, reliability, and validity.
Maffini, Cara S; Wong, Y Joel
2015-04-01
Although measures of cultural identity, values, and behavior exist in the multicultural psychological literature, there is currently no measure that explicitly assesses ethnic minority individuals' positive and negative affect toward culture. Therefore, we developed 2 new measures called the Feelings About Culture Scale--Ethnic Culture and Feelings About Culture Scale--Mainstream American Culture and tested their psychometric properties. In 6 studies, we piloted the measures, conducted factor analyses to clarify their factor structure, and examined reliability and validity. The factor structure revealed 2 dimensions reflecting positive and negative affect for each measure. Results provided evidence for convergent, discriminant, criterion-related, and incremental validity as well as the reliability of the scales. The Feelings About Culture Scales are the first known measures to examine both positive and negative affect toward an individual's ethnic culture and mainstream American culture. The focus on affect captures dimensions of psychological experiences that differ from cognitive and behavioral constructs often used to measure cultural orientation. These measures can serve as a valuable contribution to both research and counseling by providing insight into the nuanced affective experiences ethnic minority individuals have toward culture. (c) 2015 APA, all rights reserved).
Li, Guanghui; Wei, Jianhua; Wang, Xi; Wu, Guofeng; Ma, Dandan; Wang, Bo; Liu, Yanpu; Feng, Xinghua
2013-08-01
Cleft lip in the presence or absence of a cleft palate is a major public health problem. However, few studies have been published concerning the soft-tissue morphology of cleft lip infants. Currently, obtaining reliable three-dimensional (3D) surface models of infants remains a challenge. The aim of this study was to investigate a new way of capturing 3D images of cleft lip infants using a structured light scanning system. In addition, the accuracy and precision of the acquired facial 3D data were validated and compared with direct measurements. Ten unilateral cleft lip patients were enrolled in the study. Briefly, 3D facial images of the patients were acquired using a 3D scanner device before and after the surgery. Fourteen items were measured by direct anthropometry and 3D image software. The accuracy and precision of the 3D system were assessed by comparative analysis. The anthropometric data obtained using the 3D method were in agreement with the direct anthropometry measurements. All data calculated by the software were 'highly reliable' or 'reliable', as defined in the literature. The localisation of four landmarks was not consistent in repeated experiments of inter-observer reliability in preoperative images (P<0.05), while the intra-observer reliability in both pre- and postoperative images was good (P>0.05). The structured light scanning system is proven to be a non-invasive, accurate and precise method in cleft lip anthropometry. Copyright © 2013 British Association of Plastic, Reconstructive and Aesthetic Surgeons. Published by Elsevier Ltd. All rights reserved.
Reading PDB: perception of molecules from 3D atomic coordinates.
Urbaczek, Sascha; Kolodzik, Adrian; Groth, Inken; Heuser, Stefan; Rarey, Matthias
2013-01-28
The analysis of small molecule crystal structures is a common way to gather valuable information for drug development. The necessary structural data is usually provided in specific file formats containing only element identities and three-dimensional atomic coordinates as reliable chemical information. Consequently, the automated perception of molecular structures from atomic coordinates has become a standard task in cheminformatics. The molecules generated by such methods must be both chemically valid and reasonable to provide a reliable basis for subsequent calculations. This can be a difficult task since the provided coordinates may deviate from ideal molecular geometries due to experimental uncertainties or low resolution. Additionally, the quality of the input data often differs significantly thus making it difficult to distinguish between actual structural features and mere geometric distortions. We present a method for the generation of molecular structures from atomic coordinates based on the recently published NAOMI model. By making use of this consistent chemical description, our method is able to generate reliable results even with input data of low quality. Molecules from 363 Protein Data Bank (PDB) entries could be perceived with a success rate of 98%, a result which could not be achieved with previously described methods. The robustness of our approach has been assessed by processing all small molecules from the PDB and comparing them to reference structures. The complete data set can be processed in less than 3 min, thus showing that our approach is suitable for large scale applications.
Reliability assessments in qualitative health promotion research.
Cook, Kay E
2012-03-01
This article contributes to the debate about the use of reliability assessments in qualitative research in general, and health promotion research in particular. In this article, I examine the use of reliability assessments in qualitative health promotion research in response to health promotion researchers' commonly held misconception that reliability assessments improve the rigor of qualitative research. All qualitative articles published in the journal Health Promotion International from 2003 to 2009 employing reliability assessments were examined. In total, 31.3% (20/64) articles employed some form of reliability assessment. The use of reliability assessments increased over the study period, ranging from <20% in 2003/2004 to 50% and above in 2008/2009, while at the same time the total number of qualitative articles decreased. The articles were then classified into four types of reliability assessments, including the verification of thematic codes, the use of inter-rater reliability statistics, congruence in team coding and congruence in coding across sites. The merits of each type were discussed, with the subsequent discussion focusing on the deductive nature of reliable thematic coding, the limited depth of immediately verifiable data and the usefulness of such studies to health promotion and the advancement of the qualitative paradigm.
Setyonugroho, Winny; Kropmans, Thomas; Murphy, Ruth; Hayes, Peter; van Dalen, Jan; Kennedy, Kieran M
2018-01-01
Comparing outcome of clinical skills assessment is challenging. This study proposes reliable and valid comparison of communication skills (1) assessment as practiced in Objective Structured Clinical Examinations (2). The aim of the present study is to compare CS assessment, as standardized according to the MAAS Global, between stations in a single undergraduate medical year. An OSCE delivered in an Irish undergraduate curriculum was studied. We chose the MAAS-Global as an internationally recognized and validated instrument to calibrate the OSCE station items. The MAAS-Global proportion is the percentage of station checklist items that can be considered as 'true' CS. The reliability of the OSCE was calculated with G-Theory analysis and nested ANOVA was used to compare mean scores of all years. MAAS-Global scores in psychiatry stations were significantly higher than those in other disciplines (p<0.03) and above the initial pass mark of 50%. The higher students' scores in psychiatry stations were related to higher MAAS-Global proportions when compared to the general practice stations. Comparison of outcome measurements, using the MAAS Global as a standardization instrument, between interdisciplinary station checklists was valid and reliable. The MAAS-Global was used as a single validated instrument and is suggested as gold standard. Copyright © 2017. Published by Elsevier B.V.
NASA Astrophysics Data System (ADS)
Fisher, W. P., Jr.; Elbaum, B.; Coulter, A.
2010-07-01
Reliability coefficients indicate the proportion of total variance attributable to differences among measures separated along a quantitative continuum by a testing, survey, or assessment instrument. Reliability is usually considered to be influenced by both the internal consistency of a data set and the number of items, though textbooks and research papers rarely evaluate the extent to which these factors independently affect the data in question. Probabilistic formulations of the requirements for unidimensional measurement separate consistency from error by modelling individual response processes instead of group-level variation. The utility of this separation is illustrated via analyses of small sets of simulated data, and of subsets of data from a 78-item survey of over 2,500 parents of children with disabilities. Measurement reliability ultimately concerns the structural invariance specified in models requiring sufficient statistics, parameter separation, unidimensionality, and other qualities that historically have made quantification simple, practical, and convenient for end users. The paper concludes with suggestions for a research program aimed at focusing measurement research more on the calibration and wide dissemination of tools applicable to individuals, and less on the statistical study of inter-variable relations in large data sets.
Kolokotroni, Philippa; Anagnostopoulos, Fotios; Missitzis, Ioannis
2017-07-01
The study and measurement of psychosocial adjustment is important for evaluating patients' well-being, and assessing the illness's course, treatment's success, and patients' recovery. In this study, internal consistency reliability and construct validity of the Greek version of the Psychosocial Adjustment to Illness Scale-Self-Report (PAIS-SR) were examined. Demographic and psychosocial data were collected from a sample of 243 women with breast cancer, recruited from September 2011 to December 2012. With some exceptions in specific items, the original conceptually-derived PAIS-SR subscales emerged in a seven-factor solution. Social Environment, Job and Household Duties, and Psychological Distress accounted for more of the total variance than other subscales. PAIS-SR showed good internal consistency reliability, with Cronbach's alpha coefficients >0.62. Correlations of PAIS-SR domains with measures of quality of life and posttraumatic stress symptoms supported the convergent validity of the PAIS-SR and its significance for cancer research. The Greek version of the PAIS-SR has acceptable internal consistency reliability and construct validity, as well as satisfactory convergent validity. Results provide some suggestions for the development of programs to evaluate adjustment status and implement psychosocial interventions among breast cancer survivors.
Benitez-Rosario, Miguel Angel; Caceres-Miranda, Raquel; Aguirre-Jaime, Armando
2016-03-01
A reliable and valid measure of the structure and process of end-of-life care is important for improving the outcomes of care. This study evaluated the validity and reliability of the Spanish adaptation of a satisfaction tool of the Care Evaluation Scale (CES), which was developed in Japan to evaluate palliative care structure and process from the perspective of family members. Standard forward-backward translation and a pilot test were conducted. A multicenter survey was conducted with the relatives of patients admitted to palliative care units for symptom control. The dimensional structure was assessed using confirmatory factor analyses. Concurrent and discriminant validity were tested by correlation with the SERQVHOS, a Spanish hospital care satisfaction scale and with an 11-point rating scale on satisfaction with care. The reliability of the CES was tested by Cronbach α and by test-retest correlation. A total of 284 primary caregivers completed the CES, with low missing response rates. The results of the factor analysis suggested a six-factor solution explaining 69% of the total variance. The CES moderately correlated with the SERQVHOS and with the overall satisfaction scale (intraclass correlation coefficients of 0.66 and 0.44, respectively; P = 0.001). Cronbach α was 0.90 overall and ranged from 0.85 to 0.89 for subdomains. Intraclass correlation coefficient was 0.88 (P = 0.001) for test-retest analysis. The Spanish CES was found to be a reliable and valid measure of the satisfaction with end-of-life care structure and process from family members' perspectives. Copyright © 2016 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.
Oberjé, Edwin J M; Dima, Alexandra L; Pijnappel, Frank J; Prins, Jan M; de Bruin, Marijn
2015-01-01
Reporting guidelines call for descriptions of control group support in equal detail as for interventions. However, how to assess the active content (behaviour change techniques (BCTs)) of treatment-as-usual (TAU) delivered to control groups in trials remains unclear. The objective of this study is to pre-test a method of assessing TAU in a multicentre cost-effectiveness trial of an HIV-treatment adherence intervention. HIV-nurses (N = 21) completed a semi-structured open-ended questionnaire enquiring about TAU adherence counselling. Two coders independently coded BCTs. Completeness and clarity of nurse responses, inter-coder reliabilities and the type of BCTs reported were examined. The clarity and completeness of nurse responses were adequate. Twenty-three of the 26 identified BCTs could be reliably coded (mean κ = .79; mean agreement rate = 96%) and three BCTs scored below κ = .60. Total number of BCTs reported per nurse ranged between 7 and 19 (M = 13.86, SD = 3.35). This study suggests that the TAU open-ended questionnaire is a feasible and reliable tool to capture active content of support provided to control participants in a multicentre adherence intervention trial. Considerable variability in the number of BCTs provided to control patients was observed, illustrating the importance of reliably collecting and accurately reporting control group support.
Areia, Neide P; Major, Sofia; Relvas, Ana P
2017-10-01
The aim of this study was to validate the Portuguese version of the Family Inventory of Needs (FIN). The FIN aims to measure important family needs and their fulfilment by a healthcare team. This cross-sectional study involved a sample of 364 family members of cancer patients, recruited from three medical institutions and through online recruitment. Three instruments were used: a socio-demographic questionnaire, the FIN and the Brief Symptom Inventory - 18 (BSI-18). Construct validity and reliability were considered regarding the FIN's psychometric properties. The method used to determine construct validity was factor structure analysis (confirmatory factor analysis), inter-factor correlations (Spearman's rank correlation) and convergent validity (Spearman's rank correlation). To assess scale reliability, the FIN's internal consistency was evaluated (Cronbach's alpha coefficient). Descriptive and frequency statistics and tests to compare means were used to assess important needs and to what extent they were met. The four-factor structure of the FIN was confirmed. Thus, the FIN has four domains: Basic Information, Information on treatment and care, Support and Patient Comfort. Convergent validity with the BSI-18 was verified. Both subscales of the FIN and each domain exceeded the minimum reliability standard of 0.70. Family members also reported important needs that were not adequately met by healthcare professionals. The Portuguese version of the FIN seems to be a reliable and valid tool for identifying cancer patients' important family needs and to what extent these are met. Copyright © 2017 Elsevier Ltd. All rights reserved.
Pontes, Halley M.; Macur, Mirna; Griffiths, Mark D.
2016-01-01
Background and aims Since the inclusion of Internet Gaming Disorder (IGD) in the latest (fifth) edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM-5) as a tentative disorder, a few psychometric screening instruments have been developed to assess IGD, including the 9-item Internet Gaming Disorder Scale – Short-Form (IGDS9-SF) – a short, valid, and reliable instrument. Methods Due to the lack of research on IGD in Slovenia, this study aimed to examine the psychometric properties of the IGDS9-SF in addition to investigating the prevalence rates of IGD in a nationally representative sample of eighth graders from Slovenia (N = 1,071). Results The IGDS9-SF underwent rigorous psychometric scrutiny in terms of validity and reliability. Construct validation was investigated with confirmatory factor analysis to examine the factorial structure of the IGDS9-SF and a unidimensional structure appeared to fit the data well. Concurrent and criterion validation were also investigated by examining the association between IGD and relevant psychosocial and game-related measures, which warranted these forms of validity. In terms of reliability, the Slovenian version IGDS9-SF obtained excellent results regarding its internal consistency at different levels, and the test appears to be a valid and reliable instrument to assess IGD among Slovenian youth. Finally, the prevalence rates of IGD were found to be around 2.5% in the whole sample and 3.1% among gamers. Discussion and conclusion Taken together, these results illustrate the suitability of the IGDS9-SF and warrants further research on IGD in Slovenia. PMID:27363464
Reid, Matthew W; Hannemann, Nathan P; York, Gerald E; Ritter, John L; Kini, Jonathan A; Lewis, Jeffrey D; Sherman, Paul M; Velez, Carmen S; Drennon, Ann Marie; Bolzenius, Jacob D; Tate, David F
2017-07-01
To compare volumetric results from NeuroQuant® and FreeSurfer in a service member setting. Since the advent of medical imaging, quantification of brain anatomy has been a major research and clinical effort. Rapid advancement of methods to automate quantification and to deploy this information into clinical practice has surfaced in recent years. NeuroQuant® is one such tool that has recently been used in clinical settings. Accurate volumetric data are useful in many clinical indications; therefore, it is important to assess the intermethod reliability and concurrent validity of similar volume quantifying tools. Volumetric data from 148 U.S. service members across three different experimental groups participating in a study of mild traumatic brain injury (mTBI) were examined. Groups included mTBI (n = 71), posttraumatic stress disorder (n = 22), or a noncranial orthopedic injury (n = 55). Correlation coefficients and nonparametric group mean comparisons were used to assess reliability and concurrent validity, respectively. Comparison of these methods across our entire sample demonstrates generally fair to excellent reliability as evidenced by large intraclass correlation coefficients (ICC = .4 to .99), but little concurrent validity as evidenced by significantly different Mann-Whitney U comparisons for 26 of 30 brain structures measured. While reliability between the two segmenting tools is fair to excellent, volumetric outcomes are statistically different between the two methods. As suggested by both developers, structure segmentation should be visually verified prior to clinical use and rigor should be used when interpreting results generated by either method. Copyright © 2017 by the American Society of Neuroimaging.
Training and quality assurance with the Structured Clinical Interview for DSM-IV (SCID-I/P).
Ventura, J; Liberman, R P; Green, M F; Shaner, A; Mintz, J
1998-06-15
Accuracy in psychiatric diagnosis is critical for evaluating the suitability of the subjects for entry into research protocols and for establishing comparability of findings across study sites. However, training programs in the use of diagnostic instruments for research projects are not well systematized. Furthermore, little information has been published on the maintenance of interrater reliability of diagnostic assessments. At the UCLA Research Center for Major Mental Illnesses, a Training and Quality Assurance Program for SCID interviewers was used to evaluate interrater reliability and diagnostic accuracy. Although clinically experienced interviewers achieved better interrater reliability and overall diagnostic accuracy than neophyte interviewers, both groups were able to achieve and maintain high levels of interrater reliability, diagnostic accuracy, and interviewer skill. At the first quality assurance check after training, there were no significant differences between experienced and neophyte interviewers in interrater reliability or diagnostic accuracy. Standardization of training and quality assurance procedures within and across research projects may make research findings from study sites more comparable.
Cubaka, Vincent Kalumire; Schriver, Michael; Vedsted, Peter; Makoul, Gregory; Kallestrup, Per
2018-04-23
To identify, adapt and validate a measure for providers' communication and interpersonal skills in Rwanda. After selection, translation and piloting of the measure, structural validity, test-retest reliability, and differential item functioning were assessed. Identification and adaptation: The 14-item Communication Assessment Tool (CAT) was selected and adapted. Content validation found all items highly relevant in the local context except two, which were retained upon understanding the reasoning applied by patients. Eleven providers and 291 patients were involved in the field-testing. Confirmatory factor analysis showed a good fit for the original one factor model. Test-retest reliability assessment revealed a mean quadratic weighted Kappa = 0.81 (range: 0.69-0.89, N = 57). The average proportion of excellent scores was 15.7% (SD: 24.7, range: 9.9-21.8%, N = 180). Differential item functioning was not observed except for item 1, which focuses on greetings, for age groups (p = 0.02, N = 180). The Kinyarwanda version of CAT (K-CAT) is a reliable and valid patient-reported measure of providers' communication and interpersonal skills. K-CAT was validated on nurses and its use on other types of providers may require further validation. K-CAT is expected to be a valuable feedback tool for providers in practice and in training. Copyright © 2018 Elsevier B.V. All rights reserved.
Ecologically relevant outcome measure for post-inpatient rehabilitation.
Marquez de la Plata, Carlos; Qualls, Devin; Plenger, Patrick; Malec, James F; Hayden, Mary Ellen
2017-01-01
Transfer of skills learned within the clinic environment to patients' home or community is important in post-inpatient brain injury rehabilitation (PBIR). Outcome measures used in PBIR assess level of independence during functional tasks; however, available functional instruments do not quantitate the environment in which the behaviors occur. To examine the reliability and validity of an instrument used to assess patients' functional abilities while quantifying the amount of structure and distractions in the environment. 2501 patients who sustained a traumatic brain injury (TBI) or cerebrovascular accident (CVA) and participated in a multidisciplinary PBIR program between 2006 and 2014 were identified retrospectively for this study. The PERPOS and MPAI-4 were used to assess functional abilities at admission and at discharge. Construct validity was assessed using a bivariate Spearman rho analysis A subsample of 56 consecutive admissions during 2014 were examined to determine inter-rater reliability. Intra-class correlation coefficient (ICC) and Kappa coefficients assessed inter-rater agreement of the total PERPOS and PERPOS subscales respectively. The PERPOS and MPAI-4 demonstrated a strong negative association among both TBI and CVA patients. Kappa scores for the three PERPOS scales each demonstrated good to excellent inter-rater agreement. The ICC for overall PERPOS scores fell in the good agreement range. The PERPOS can be used reliably in PBIR to quantify patients' functional abilities within the context of environmental demands.
A prospective study assessing agreement and reliability of a geriatric evaluation.
Locatelli, Isabella; Monod, Stéfanie; Cornuz, Jacques; Büla, Christophe J; Senn, Nicolas
2017-07-19
The present study takes place within a geriatric program, aiming at improving the diagnosis and management of geriatric syndromes in primary care. Within this program it was of prime importance to be able to rely on a robust and reproducible geriatric consultation to use as a gold standard for evaluating a primary care brief assessment tool. The specific objective of the present study was thus assessing the agreement and reliability of a comprehensive geriatric consultation. The study was conducted at the outpatient clinic of the Service of Geriatric Medicine, University of Lausanne, Switzerland. All community-dwelling older persons aged 70 years and above were eligible. Patients were excluded if they hadn't a primary care physician, they were unable to speak French, or they were already assessed by a geriatrician within the last 12 months. A set of 9 geriatricians evaluated 20 patients. Each patient was assessed twice within a 2-month delay. Geriatric consultations were based on a structured evaluation process, leading to rating the following geriatric conditions: functional, cognitive, visual, and hearing impairment, mood disorders, risk of fall, osteoporosis, malnutrition, and urinary incontinence. Reliability and agreement estimates on each of these items were obtained using a three-way Intraclass Correlation and a three-way Observed Disagreement index. The latter allowed a decomposition of overall disagreement into disagreements due to each source of error variability (visit, rater and random). Agreement ranged between 0.62 and 0.85. For most domains, geriatrician-related error variability explained an important proportion of disagreement. Reliability ranged between 0 and 0.8. It was poor/moderate for visual impairment, malnutrition and risk of fall, and good/excellent for functional/cognitive/hearing impairment, osteoporosis, incontinence and mood disorders. Six out of nine items of the geriatric consultation described in this study (functional/cognitive/hearing impairment, osteoporosis, incontinence and mood disorders) present a good to excellent reliability and can safely be used as a reference (gold standard) to evaluate the diagnostic performance of a primary care brief assessment tool. More objective/significant measures are needed to improve reliability of malnutrition, visual impairment, and risk of fall assessment before they can serve as a safe gold standard of a primary care tool.
Parts and Components Reliability Assessment: A Cost Effective Approach
NASA Technical Reports Server (NTRS)
Lee, Lydia
2009-01-01
System reliability assessment is a methodology which incorporates reliability analyses performed at parts and components level such as Reliability Prediction, Failure Modes and Effects Analysis (FMEA) and Fault Tree Analysis (FTA) to assess risks, perform design tradeoffs, and therefore, to ensure effective productivity and/or mission success. The system reliability is used to optimize the product design to accommodate today?s mandated budget, manpower, and schedule constraints. Stand ard based reliability assessment is an effective approach consisting of reliability predictions together with other reliability analyses for electronic, electrical, and electro-mechanical (EEE) complex parts and components of large systems based on failure rate estimates published by the United States (U.S.) military or commercial standards and handbooks. Many of these standards are globally accepted and recognized. The reliability assessment is especially useful during the initial stages when the system design is still in the development and hard failure data is not yet available or manufacturers are not contractually obliged by their customers to publish the reliability estimates/predictions for their parts and components. This paper presents a methodology to assess system reliability using parts and components reliability estimates to ensure effective productivity and/or mission success in an efficient manner, low cost, and tight schedule.
Revised scoring and improved reliability for the Communication Patterns Questionnaire.
Crenshaw, Alexander O; Christensen, Andrew; Baucom, Donald H; Epstein, Norman B; Baucom, Brian R W
2017-07-01
The Communication Patterns Questionnaire (CPQ; Christensen, 1987) is a widely used self-report measure of couple communication behavior and is well validated for assessing the demand/withdraw interaction pattern, which is a robust predictor of poor relationship and individual outcomes (Schrodt, Witt, & Shimkowski, 2014). However, no studies have examined the CPQ's factor structure using analytic techniques sufficient by modern standards, nor have any studies replicated the factor structure using additional samples. Further, the current scoring system uses fewer than half of the total items for its 4 subscales, despite the existence of unused items that have content conceptually consistent with those subscales. These characteristics of the CPQ have likely contributed to findings that subscale scores are often troubled by suboptimal psychometric properties such as low internal reliability (e.g., Christensen, Eldridge, Catta-Preta, Lim, & Santagata, 2006). The present study uses exploratory and confirmatory factor analyses on 4 samples to reexamine the factor structure of the CPQ to improve scale score reliability and to determine if including more items in the subscales is warranted. Results indicate that a 3-factor solution (constructive communication and 2 demand/withdraw scales) provides the best fit for the data. That factor structure was confirmed in the replication samples. Compared with the original scales, the revised scales include additional items that expand the conceptual range of the constructs, substantially improve reliability of scale scores, and demonstrate stronger associations with relationship satisfaction and sensitivity to change in therapy. Implications for research and treatment are discussed. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Damschroder, Laura J; Goodrich, David E; Kim, Hyungjin Myra; Holleman, Robert; Gillon, Leah; Kirsh, Susan; Richardson, Caroline R; Lutes, Lesley D
2016-09-01
Practical and valid instruments are needed to assess fidelity of coaching for weight loss. The purpose of this study was to develop and validate the ASPIRE Coaching Fidelity Checklist (ACFC). Classical test theory guided ACFC development. Principal component analyses were used to determine item groupings. Psychometric properties, internal consistency, and inter-rater reliability were evaluated for each subscale. Criterion validity was tested by predicting weight loss as a function of coaching fidelity. The final 19-item ACFC consists of two domains (session process and session structure) and five subscales (sets goals and monitor progress, assess and personalize self-regulatory content, manages the session, creates a supportive and empathetic climate, and stays on track). Four of five subscales showed high internal consistency (Cronbach alphas > 0.70) for group-based coaching; only two of five subscales had high internal reliability for phone-based coaching. All five sub-scales were positively and significantly associated with weight loss for group- but not for phone-based coaching. The ACFC is a reliable and valid instrument that can be used to assess fidelity and guide skill-building for weight management interventionists.
Mohd Din, F H; Hoe, Victor C W; Chan, C K; Muslan, M A
2015-05-01
The Pain Catastrophizing Scale (PCS) is designed to assess negative thoughts in response to pain. It is composed of three domains: helplessness, rumination, and magnification. We report on the translation, adaptation, and validation of scores on a Malay-speaking version of the PCS, the PCS-MY. Guidelines for the process of cross-cultural adaptations of assessment measures were implemented. A sample of 303 young military recruits participated in the study. Factor structure, reliability, and validity of scores on the PCS-MY were examined. Convergent validity was investigated with the Positive and Negative Affect Scale, Short-form 12 version 2, and Ryff's Psychological Well-being Scale. Most participants were men, ranging in age from 19 to 26. The reliability of the PCS-MY scores was adequate (α = 0.90; mean inter-item correlation = 0.43). Confirmatory factor analysis showed that a modified version of the PCS-MY provided best fit estimates to the sample data. The PCS-MY total score was negatively correlated with mental well-being and positively correlated with negative affect (all ps < 0.001). The PCS-MY was demonstrated to have adequate reliability and validity estimates in the study sample.
Development of a scale to assess cancer stigma in the non-patient population
2014-01-01
Background Illness-related stigma has attracted considerable research interest, but few studies have specifically examined stigmatisation of cancer in the non-patient population. The present study developed and validated a Cancer Stigma Scale (CASS) for use in the general population. Methods An item pool was developed on the basis of previous research into illness-related stigma in the general population and patients with cancer. Two studies were carried out. The first study used Exploratory factor analysis to explore the structure of items in a sample of 462 postgraduate students recruited through a London university. The second study used Confirmatory factor analysis to confirm the structure among 238 adults recruited through an online market research panel. Internal reliability, test-retest reliability and construct validity were also assessed. Results Exploratory factor analysis suggested six subscales, representing: Awkwardness, Severity, Avoidance, Policy Opposition, Personal Responsibility and Financial Discrimination. Confirmatory factor analysis confirmed this structure with a 25-item scale. All subscales showed adequate to good internal and test-retest reliability in both samples. Construct validity was also good, with mean scores for each subscale varying in the expected directions by age, gender, experience of cancer, awareness of lifestyle risk factors for cancer, and social desirability. Means for the subscales were consistent across the two samples. Conclusions These findings highlight the complexity of cancer stigma and provide the Cancer Stigma Scale (CASS) which can be used to compare populations, types of cancer and evaluate the effects of interventions designed to reduce cancer stigma in non-patient populations. PMID:24758482
Validation of the VISA-A questionnaire for Turkish language: the VISA-A-Tr study.
Dogramaci, Yunus; Kalaci, Aydiner; Kücükkübas, Nigar; Inandi, Taceddin; Esen, Erdinc; Yanat, A Nedim
2011-04-01
To evaluate the validity and reliability of the Turkish version of the Victorian Institute of Sports Assessment-Achilles (VISA-A) questionnaire for patients with Achilles tendinopathy. Fifty-five patients with a diagnosis of Achilles tendinopathy and 55 healthy subjects were included in the study. VISA-A questionnaires were translated and culturally adapted into Turkish. The final Turkish version (VISA-A-Tr) was tested for reliability on healthy individuals and patients. Tests for internal consistency, validity and structure were performed on 55 patients. The VISA-A-Tr showed good test-retest reliability (Pearson's r=0.99, p<0.001). The patients with Achilles tendinopathy had a significantly lower score (p<0.001) than the healthy individuals. The VISA-A-Tr score correlated significantly with the Stanish tendon grading system (Spearman's r=-0.86; p<0.001). The VISA-A-Tr is a valid and reliable tool for evaluating the severity of Achilles tendinopathy.
Assessment of the psychometric properties of the Family Management Measure.
Knafl, Kathleen; Deatrick, Janet A; Gallo, Agatha; Dixon, Jane; Grey, Margaret; Knafl, George; O'Malley, Jean
2011-06-01
This paper reports development of the Family Management Measure (FaMM) of parental perceptions of family management of chronic conditions. By telephone interview, 579 parents of children age 3 to 19 with a chronic condition (349 partnered mothers, 165 partners, 65 single mothers) completed the FaMM and measures of child functional status and behavioral problems and family functioning. Analyses addressed reliability, factor structure, and construct validity. Exploratory factor analysis yielded six scales: Child's Daily Life, Condition Management Ability, Condition Management Effort, Family Life Difficulty, Parental Mutuality, and View of Condition Impact. Internal consistency reliability ranged from .72 to .91, and test-retest reliability from .71 to .94. Construct validity was supported by significant correlations in hypothesized directions between FaMM scales and established measures. Results support FaMM's; reliability and validity, indicating it performs in a theoretically meaningful way and taps distinct aspects of family response to childhood chronic conditions.
Hwang, Huei-Lih; Lin, Huey-Shyan; Wang, Hsiu-Hung
2010-12-01
Death education involves acquiring knowledge, changing behavior, and developing proper views of life in both the affective and the value domains. Critical thinking that is honed through reflecting on life-and-death issues represents a way to reach these goals. Designing assessments able to measure college student content and critical thinking skills related to life-and-death issues is thus important. The Test of Critical Thinking Skills for Life-And-Death content (TCTS-LD) instrument requires the administration of additional tests to assess reliability and validity for future use in the assessment of perceptions on life and death. The purpose of this study was to refine the TCTS-LD. A cross-sectional, descriptive design was used to recruit 715 college students in southern Taiwan. Three structured scales were administered in class to the participants. Data were collected in 2004 and 2006. Confirmatory factor analysis was applied to validate the structure of scales. Examination of the reliability of the three-factor and 15-item scale revealed a Kuder-Richardson coefficient of internal consistency of .54. The split-half reliability coefficients were .47 in the Spearman-Brown correlation and .40 in the intraclass correlation coefficient (ICC). The test-retest reliability coefficients (n = 22) were .58 in Pearson correlation and .56 in ICC. In addition to content validity verification by experts and face validity by students, the validity of this test was assessed using three methods, including (a) a comparable validity rating between this test and the TCTS-A (r = .34, p < .001; (b) a contrast-group technique with different responses to the instrument between those in education and nursing majors (t = 2.71, p < .01), with scores of 10.98 (SD = 2.42) and 9.82 (SD = 2.25), respectively; and (c) a confirmatory factor analysis confirming that TCTS-LD is related to the three dimensions of assumption, evaluation, and induction (χ = 81.800, p = .158, normed chi-square χ/df = 1.169, comparative fit index [CFI] = .976, Tucker-Lewis index = .984, root mean square error of approximation [RMSEA] = 0.015). Three factors explained 31.19% of total variance for the revised TCTS-LD. The revised TCTS-LD scale improved performance and effectiveness to a certain degree. However, reliability and construct validity must be further tested to permit its use as an evaluation tool.
[Reliability and validity of Driving Anger Scale in professional drivers in China].
Li, Z; Yang, Y M; Zhang, C; Li, Y; Hu, J; Gao, L W; Zhou, Y X; Zhang, X J
2017-11-10
Objective: To assess the reliability and validity of the Chinese version of Driving Anger Scale (DAS) in professional drivers in China and provide a scientific basis for the application of the scale in drivers in China. Methods: Professional drivers, including taxi drivers, bus drivers, truck drivers and school bus drivers, were selected to complete the questionnaire. Cronbach's α and split-half reliability were calculated to evaluate the reliability of DAS, and content, contract, discriminant and convergent validity were performed to measure the validity of the scale. Results: The overall Cronbach's α of DAS was 0.934 and the split-half reliability was 0.874. The correlation coefficient of each subscale with the total scale was 0.639-0.922. The simplified version of DAS supported a presupposed six-factor structure, explaining 56.371% of the total variance revealed by exploratory factor analysis. The DAS had good convergent and discriminant validity, with the success rate of calibration experiment of 100%. Conclusion: DAS has a good reliability and validity in professional drivers in China, and the use of DAS is worth promoting in divers.