Reliability generalization: a viable key for establishing validity generalization
NASA Technical Reports Server (NTRS)
Kennedy, R. S.; Turnage, J. J.
1991-01-01
Even with radical restriction of range, reliability coefficients from 10 studies gave an average interstudy value of .74, suggesting constancy of reliability over diverse experiments. A value from a new test can help index reliability of tests not previously studied.
Reliability Generalization of the Alcohol Use Disorder Identification Test.
ERIC Educational Resources Information Center
Shields, Alan L.; Caruso, John C.
2002-01-01
Evaluated the reliability of scores from the Alcohol Use Disorders Identification Test (AUDIT; J. Sounders and others, 1993) in a reliability generalization study based on 17 empirical journal articles. Results show AUDIT scores to be generally reliable for basic assessment. (SLD)
The Balanced Inventory of Desirable Responding (BIDR): A Reliability Generalization Study
ERIC Educational Resources Information Center
Li, Andrew; Bagger, Jessica
2007-01-01
The Balanced Inventory of Desirable Responding (BIDR) is one of the most widely used social desirability scales. The authors conducted a reliability generalization study to examine the typical reliability coefficients of BIDR scores and explored factors that explained the variability of reliability estimates across studies. The results indicated…
Score Reliability: A Retrospective Look Back at 12 Years of Reliability Generalization Studies
ERIC Educational Resources Information Center
Vacha-Haase, Tammi; Thompson, Bruce
2011-01-01
The present study was conducted to characterize (a) the features of the thousands of primary reports synthesized in 47 reliability generalization (RG) measurement meta-analysis studies and (b) typical methodological practice within the RG literature to date. With respect to the treatment of score reliability in the literature, in an astounding…
A Reliability Generalization Study of the Marlowe-Crowne Social Desirability Scale.
ERIC Educational Resources Information Center
Beretvas, S, Natasha; Meyers, Jason L.; Leite, Walter L.
2002-01-01
Conducted a reliability generalization study of the Marlowe-Crowne Social Desirability Scale (D. Crowne and D. Marlowe, 1960). Analysis of 93 studies show that the predicted score reliability for male adolescents was 0.53, and reliability for men's responses was lower than for women's. Discusses the need for further analysis of the scale. (SLD)
General Aviation Aircraft Reliability Study
NASA Technical Reports Server (NTRS)
Pettit, Duane; Turnbull, Andrew; Roelant, Henk A. (Technical Monitor)
2001-01-01
This reliability study was performed in order to provide the aviation community with an estimate of Complex General Aviation (GA) Aircraft System reliability. To successfully improve the safety and reliability for the next generation of GA aircraft, a study of current GA aircraft attributes was prudent. This was accomplished by benchmarking the reliability of operational Complex GA Aircraft Systems. Specifically, Complex GA Aircraft System reliability was estimated using data obtained from the logbooks of a random sample of the Complex GA Aircraft population.
The Typical General Aviation Aircraft
NASA Technical Reports Server (NTRS)
Turnbull, Andrew
1999-01-01
The reliability of General Aviation aircraft is unknown. In order to "assist the development of future GA reliability and safety requirements", a reliability study needs to be performed. Before any studies on General Aviation aircraft reliability begins, a definition of a typical aircraft that encompasses most of the general aviation characteristics needs to be defined. In this report, not only is the typical general aviation aircraft defined for the purpose of the follow-on reliability study, but it is also separated, or "sifted" into several different categories where individual analysis can be performed on the reasonably independent systems. In this study, the typical General Aviation aircraft is a four-place, single engine piston, all aluminum fixed-wing certified aircraft with a fixed tricycle landing gear and a cable operated flight control system. The system breakdown of a GA aircraft "sifts" the aircraft systems and components into five categories: Powerplant, Airframe, Aircraft Control Systems, Cockpit Instrumentation Systems, and the Electrical Systems. This breakdown was performed along the lines of a failure of the system. Any component that caused a system to fail was considered a part of that system.
Reliability Generalization of Scores on the Spielberger State-Trait Anxiety Inventory.
ERIC Educational Resources Information Center
Barnes, Laura L. B.; Harp, Diane; Jung, Woo Sik
2002-01-01
Conducted a reliability generalization study for the State-Trait Anxiety Inventory (C. Spielberger, 1983) by reviewing and classifying 816 research articles. Average reliability coefficients were acceptable for both internal consistency and test-retest reliability, but variation was present among the estimates. Other differences are discussed.…
Du, Han; Wang, Lijuan
2018-04-23
Intraindividual variability can be measured by the intraindividual standard deviation ([Formula: see text]), intraindividual variance ([Formula: see text]), estimated hth-order autocorrelation coefficient ([Formula: see text]), and mean square successive difference ([Formula: see text]). Unresolved issues exist in the research on reliabilities of intraindividual variability indicators: (1) previous research only studied conditions with 0 autocorrelations in the longitudinal responses; (2) the reliabilities of [Formula: see text] and [Formula: see text] have not been studied. The current study investigates reliabilities of [Formula: see text], [Formula: see text], [Formula: see text], [Formula: see text], and the intraindividual mean, with autocorrelated longitudinal data. Reliability estimates of the indicators were obtained through Monte Carlo simulations. The impact of influential factors on reliabilities of the intraindividual variability indicators is summarized, and the reliabilities are compared across the indicators. Generally, all the studied indicators of intraindividual variability were more reliable with a more reliable measurement scale and more assessments. The reliabilities of [Formula: see text] were generally lower than those of [Formula: see text] and [Formula: see text], the reliabilities of [Formula: see text] were usually between those of [Formula: see text] and [Formula: see text] unless the scale reliability was large and/or the interindividual standard deviation in autocorrelation coefficients was large, and the reliabilities of the intraindividual mean were generally the highest. An R function is provided for planning longitudinal studies to ensure sufficient reliabilities of the intraindividual indicators are achieved.
López-Pina, José Antonio; Sánchez-Meca, Julio; López-López, José Antonio; Marín-Martínez, Fulgencio; Núñez-Núñez, Rosa Ma; Rosa-Alcázar, Ana I; Gómez-Conesa, Antonia; Ferrer-Requena, Josefa
2015-01-01
The Yale-Brown Obsessive-Compulsive Scale for children and adolescents (CY-BOCS) is a frequently applied test to assess obsessive-compulsive symptoms. We conducted a reliability generalization meta-analysis on the CY-BOCS to estimate the average reliability, search for reliability moderators, and propose a predictive model that researchers and clinicians can use to estimate the expected reliability of the CY-BOCS scores. A total of 47 studies reporting a reliability coefficient with the data at hand were included in the meta-analysis. The results showed good reliability and a large variability associated to the standard deviation of total scores and sample size.
ERIC Educational Resources Information Center
Vacha-Haase, Tammi; Kogan, Lori R.; Tani, Crystal R.; Woodall, Renee A.
2001-01-01
Used reliability generalization to explore the variance of scores on 10 Minnesota Multiphasic Personality Inventory (MMPI) clinical scales drawing on 1,972 articles in the literature on the MMPI. Results highlight the premise that scores, not tests, are reliable or unreliable, and they show that study characteristics do influence scores on the…
A Latent Class Approach to Estimating Test-Score Reliability
ERIC Educational Resources Information Center
van der Ark, L. Andries; van der Palm, Daniel W.; Sijtsma, Klaas
2011-01-01
This study presents a general framework for single-administration reliability methods, such as Cronbach's alpha, Guttman's lambda-2, and method MS. This general framework was used to derive a new approach to estimating test-score reliability by means of the unrestricted latent class model. This new approach is the latent class reliability…
NEPP DDR Device Reliability FY13 Report
NASA Technical Reports Server (NTRS)
Guertin, Steven M.; Armbar, Mehran
2014-01-01
This document reports the status of the NEPP Double Data Rate (DDR) Device Reliability effort for FY2013. The task targeted general reliability of > 100 DDR2 devices from Hynix, Samsung, and Micron. Detailed characterization of some devices when stressed by several data storage patterns was studied, targeting ability of the data cells to store the different data patterns without refresh, highlighting the weakest bits. DDR2, Reliability, Data Retention, Temperature Stress, Test System Evaluation, General Reliability, IDD measurements, electronic parts, parts testing, microcircuits
Meta-Analysis of Coefficient Alpha
ERIC Educational Resources Information Center
Rodriguez, Michael C.; Maeda, Yukiko
2006-01-01
The meta-analysis of coefficient alpha across many studies is becoming more common in psychology by a methodology labeled reliability generalization. Existing reliability generalization studies have not used the sampling distribution of coefficient alpha for precision weighting and other common meta-analytic procedures. A framework is provided for…
Psychometrics Matter in Health Behavior: A Long-term Reliability Generalization Study.
Pickett, Andrew C; Valdez, Danny; Barry, Adam E
2017-09-01
Despite numerous calls for increased understanding and reporting of reliability estimates, social science research, including the field of health behavior, has been slow to respond and adopt such practices. Therefore, we offer a brief overview of reliability and common reporting errors; we then perform analyses to examine and demonstrate the variability of reliability estimates by sample and over time. Using meta-analytic reliability generalization, we examined the variability of coefficient alpha scores for a well-designed, consistent, nationwide health study, covering a span of nearly 40 years. For each year and sample, reliability varied. Furthermore, reliability was predicted by a sample characteristic that differed among age groups within each administration. We demonstrated that reliability is influenced by the methods and individuals from which a given sample is drawn. Our work echoes previous calls that psychometric properties, particularly reliability of scores, are important and must be considered and reported before drawing statistical conclusions.
ERIC Educational Resources Information Center
Lane, Ginny G.; White, Amy E.; Henson, Robin K.
2002-01-01
Conducted a reliability generalizability study on the Coopersmith Self-Esteem Inventory (CSEI; S. Coopersmith, 1967) to examine the variability of reliability estimates across studies and to identify study characteristics that may predict this variability. Results show that reliability for CSEI scores can vary considerably, especially at the…
Taghipour, Morteza; Mohseni-Bandpei, Mohammad Ali; Behtash, Hamid; Abdollahi, Iraj; Rajabzadeh, Fatemeh; Pourahmadi, Mohammad Reza; Emami, Mahnaz
2018-04-24
Rehabilitative ultrasound (US) imaging is one of the popular methods for investigating muscle morphologic characteristics and dimensions in recent years. The reliability of this method has been investigated in different studies. As studies have been performed with different designs and quality, reported values of rehabilitative US have a wide range. The objective of this study was to systematically review the literature conducted on the reliability of rehabilitative US imaging for the assessment of deep abdominal and lumbar trunk muscle dimensions. The PubMed/MEDLINE, Scopus, Google Scholar, Science Direct, Embase, Physiotherapy Evidence, Ovid, and CINAHL databases were searched to identify original research articles conducted on the reliability of rehabilitative US imaging published from June 2007 to August 2017. The articles were qualitatively assessed; reliability data were extracted; and the methodological quality was evaluated by 2 independent reviewers. Of the 26 included studies, 16 were considered of high methodological quality. Except for 2 studies, all high-quality studies reported intraclass correlation coefficients (ICCs) for intra-rater reliability of 0.70 or greater. Also, ICCs reported for inter-rater reliability in high-quality studies were generally greater than 0.70. Among low-quality studies, reported ICCs ranged from 0.26 to 0.99 and 0.68 to 0.97 for intra- and inter-rater reliability, respectively. Also, the reported standard error of measurement and minimal detectable change for rehabilitative US were generally in an acceptable range. Generally, the results of the reviewed studies indicate that rehabilitative US imaging has good levels of both inter- and intra-rater reliability. © 2018 by the American Institute of Ultrasound in Medicine.
Validation of general job satisfaction in the Korean Labor and Income Panel Study.
Park, Shin Goo; Hwang, Sang Hee
2017-01-01
The purpose of this study is to assess the validity and reliability of general job satisfaction (JS) in the Korean Labor and Income Panel Study (KLIPS). We used the data from the 17th wave (2014) of the nationwide KLIPS, which selected a representative panel sample of Korean households and individuals aged 15 or older residing in urban areas. We included in this study 7679 employed subjects (4529 males and 3150 females). The general JS instrument consisted of five items rated on a scale from 1 (strongly disagree) to 5 (strongly agree). The general JS reliability was assessed using the corrected item-total correlation and Cronbach's alpha coefficient. The validity of general JS was assessed using confirmatory factor analysis (CFA) and Pearson's correlation. The corrected item-total correlations ranged from 0.736 to 0.837. Therefore, no items were removed. Cronbach's alpha for general JS was 0.925, indicating excellent internal consistency. The CFA of the general JS model showed a good fit. Pearson's correlation coefficients for convergent validity showed moderate or strong correlations. The results obtained in our study confirm the validity and reliability of general JS.
Measuring eating disorder attitudes and behaviors: a reliability generalization study
2014-01-01
Background Although score reliability is a sample-dependent characteristic, researchers often only report reliability estimates from previous studies as justification for employing particular questionnaires in their research. The present study followed reliability generalization procedures to determine the mean score reliability of the Eating Disorder Inventory and its most commonly employed subscales (Drive for Thinness, Bulimia, and Body Dissatisfaction) and the Eating Attitudes Test as a way to better identify those characteristics that might impact score reliability. Methods Published studies that used these measures were coded based on their reporting of reliability information and additional study characteristics that might influence score reliability. Results Score reliability estimates were included in 26.15% of studies using the EDI and 36.28% of studies using the EAT. Mean Cronbach’s alphas for the EDI (total score = .91; subscales = .75 to .89), EAT-40 (total score = .81) and EAT-26 (total score = .86; subscales = .56 to .80) suggested variability in estimated internal consistency. Whereas some EDI subscales exhibited higher score reliability in clinical eating disorder samples than in nonclinical samples, other subscales did not exhibit these differences. Score reliability information for the EAT was primarily reported for nonclinical samples, making it difficult to characterize the effect of type of sample on these measures. However, there was a tendency for mean score reliability to be higher in the adult (vs. adolescent) samples and in female (vs. male) samples. Conclusions Overall, this study highlights the importance of assessing and reporting internal consistency during every test administration because reliability is affected by characteristics of the participants being examined. PMID:24764530
ERIC Educational Resources Information Center
Henson, Robin K.; Thompson, Bruce
Given the potential value of reliability generalization (RG) studies in the development of cumulative psychometric knowledge, the purpose of this paper is to provide a tutorial on how to conduct such studies and to serve as a guide for researchers wishing to use this methodology. After some brief comments on classical test theory, the paper…
Yalın Sapmaz, Şermin; Özek Erkuran, Handan; Ergin, Dilek; Öztürk, Masum; Şen Celasin, Nesrin; Karaarslan, Duygu; Aydemir, Ömer
2018-02-23
Background/aim: This study aimed to assess the validity and reliability of the Turkish version of the DSM-5 Generalized Anxiety Disorder Severity Scale - Child Form. Materials and methods: The study sample consisted of 32 patients treated in a child psychiatry unit and diagnosed with generalized anxiety disorder and 98 healthy volunteers who were attending middle or high school during the study period. For the assessment, the Screen for Child Anxiety and Related Emotional Disorders (SCARED) was also used along with the DSM-5 Generalized Anxiety Disorder Severity Scale - Child Form. Results: Regarding reliability analyses, the Cronbach alpha internal consistency coefficient was calculated as 0.932. The test-retest correlation coefficient was calculated as r = 0.707. As for construct validity, one factor that could explain 62.6% of the variance was obtained and this was consistent with the original construct of the scale. As for concurrent validity, the scale showed a high correlation with SCARED. Conclusion: It was concluded that Turkish version of the DSM-5 Generalized Anxiety Disorder Severity Scale - Child Form could be utilized as a valid and reliable tool both in clinical practice and for research purposes.
The Quest for Reliable Epidemiological Data on Suicide: The Padua Sample.
ERIC Educational Resources Information Center
De Leo, Diego; And Others
This study was a preliminary step in gathering reliable data on suicides and suicide attempts in Padua, Italy. Data were collected from the first aid department of the Padua general hospital, 67 general practitioners in the city, staff of a night-time and holiday home-call medical service, the reanimation department of the Padua general hospital,…
Retest Reliability of the Rosenzweig Picture-Frustration Study and Similar Semiprojective Techniques
ERIC Educational Resources Information Center
Rosenzweig, Saul; And Others
1975-01-01
The research dealing with the reliability of the Rosenzweig Picture-Frustration Study is surveyed. Analysis of various split-half, and retest procedures are reviewed and their relative effectiveness evaluated. Reliability measures as applied to projective techniques in general are discussed. (Author/DEP)
Reliability Generalization (RG) Analysis: The Test Is Not Reliable
ERIC Educational Resources Information Center
Warne, Russell
2008-01-01
Literature shows that most researchers are unaware of some of the characteristics of reliability. This paper clarifies some misconceptions by describing the procedures, benefits, and limitations of reliability generalization while using it to illustrate the nature of score reliability. Reliability generalization (RG) is a meta-analytic method…
ERIC Educational Resources Information Center
Yildiz, Mehmet Ali
2017-01-01
The current research aims to adapt the General Belongingness Scale (GBS), developed by Malone, Pillow, and Osman (2012), into Turkish for adolescents and to conduct the validity and reliability studies for it. Ages of the participants, a total of 567 adolescents including 274 males (48.3%) and 293 females (51.7%) ranged between 14 and 18 (average…
NASA Astrophysics Data System (ADS)
Saini, K. K.; Sehgal, R. K.; Sethi, B. L.
2008-10-01
In this paper major reliability estimators are analyzed and there comparatively result are discussed. There strengths and weaknesses are evaluated in this case study. Each of the reliability estimators has certain advantages and disadvantages. Inter-rater reliability is one of the best ways to estimate reliability when your measure is an observation. However, it requires multiple raters or observers. As an alternative, you could look at the correlation of ratings of the same single observer repeated on two different occasions. Each of the reliability estimators will give a different value for reliability. In general, the test-retest and inter-rater reliability estimates will be lower in value than the parallel forms and internal consistency ones because they involve measuring at different times or with different raters. Since reliability estimates are often used in statistical analyses of quasi-experimental designs.
The Yale-Brown Obsessive Compulsive Scale: A Reliability Generalization Meta-Analysis.
López-Pina, José Antonio; Sánchez-Meca, Julio; López-López, José Antonio; Marín-Martínez, Fulgencio; Núñez-Núñez, Rosa Maria; Rosa-Alcázar, Ana I; Gómez-Conesa, Antonia; Ferrer-Requena, Josefa
2015-10-01
The Yale-Brown Obsessive Compulsive Scale (Y-BOCS) is the most frequently applied test to assess obsessive compulsive symptoms. We conducted a reliability generalization meta-analysis on the Y-BOCS to estimate the average reliability, examine the variability among the reliability estimates, search for moderators, and propose a predictive model that researchers and clinicians can use to estimate the expected reliability of the Y-BOCS. We included studies where the Y-BOCS was applied to a sample of adults and reliability estimate was reported. Out of the 11,490 references located, 144 studies met the selection criteria. For the total scale, the mean reliability was 0.866 for coefficients alpha, 0.848 for test-retest correlations, and 0.922 for intraclass correlations. The moderator analyses led to a predictive model where the standard deviation of the total test and the target population (clinical vs. nonclinical) explained 38.6% of the total variability among coefficients alpha. Finally, clinical implications of the results are discussed. © The Author(s) 2014.
Reliability Generalization of the Psychopathy Checklist Applied in Youthful Samples
ERIC Educational Resources Information Center
Campbell, Justin S.; Pulos, Steven; Hogan, Mike; Murry, Francie
2005-01-01
This study examines the average reliability of Hare Psychopathy Checklists (PCLs) adapted for use in samples of youthful offenders (aged 12 to 21 years). Two forms of reliability are examined: 18 alpha estimates of internal consistency and 18 intraclass correlation (two or more raters) estimates of interrater reliability. The results, an average…
ERIC Educational Resources Information Center
Hellman, Chan M.; Fuqua, Dale R.; Worley, Jody
2006-01-01
The Survey of Perceived Organizational Support (SPOS) is a unidimensional measure of the general belief held by an employee that the organization is committed to him or her, values his or her continued membership, and is generally concerned about the employee's well-being. In the interest of efficiency, researchers are often compelled to use a…
ERIC Educational Resources Information Center
Boonstra, Anne M.; Reneman, Michiel F.; Stewart, Roy E.; Balk, Gerlof A.
2012-01-01
The aim of this study was to determine the reliability and discriminant validity of the Dutch version of the life satisfaction questionnaire (Lisat-9 DV) to assess patients with an acquired brain injury. The reliability study used a test-retest design, and the validity study used a cross-sectional design. The setting was the general rehabilitation…
A Reliability Generalization Meta-Analysis of Coefficient Alpha for the Maslach Burnout Inventory
ERIC Educational Resources Information Center
Wheeler, Denna L.; Vassar, Matt; Worley, Jody A.; Barnes, Laura L. B.
2011-01-01
The purpose of this study was to synthesize internal consistency reliability for the subscale scores on the Maslach Burnout Inventory (MBI). The authors addressed three research questions: (a) What is the mean subscale score reliability for the MBI across studies? (b) What factors are associated with observed variance in MBI subscale score…
Validity and Reliability of a General Nutrition Knowledge Questionnaire for Japanese Adults.
Matsumoto, Mai; Tanaka, Rie; Ikemoto, Shinji
2017-01-01
Nutrition knowledge is necessary for individuals to adopt appropriate dietary habits, and needs to be evaluated before nutrition education is provided. However, there is no tool to assess general nutrition knowledge of adults in Japan. Our aims were to determine the validity and reliability of a general nutrition knowledge questionnaire for Japanese adults. We developed the pilot version of the Japanese general nutrition knowledge questionnaire (JGNKQ) and administered the pilot study to assess content validity and internal reliability to 1,182 Japanese adults aged 18-64 y. The JGNKQ was further modified based on the pilot study and the final version consisted of 5 sections and 147 items. The JGNKQ was administered to female undergraduate Japanese students in their senior year twice in 2015 to assess construct validity and test-retest reliability. Ninety-six students majoring in nutrition and 44 students in other majors who studied at the same university completed the first questionnaire. Seventy-five students completed the questionnaire twice. The responses from the first questionnaire and both questionnaires were used to assess construct validity and test-retest reliability, respectively. The students in nutrition major had significantly higher scores than the students in other majors on all sections of the questionnaire (p=0.000); therefore, the questionnaire had good construct validity. The test-retest reliability correlation coefficient value of overall and each section except "The use of dietary information to make dietary choices" were 0.75, 0.67, 0.67, 0.68 and 0.61, respectively. We suggest that the JGNKQ is an effective tool to assess the nutrition knowledge level of Japanese adults.
Psychometric Inferences from a Meta-Analysis of Reliability and Internal Consistency Coefficients
ERIC Educational Resources Information Center
Botella, Juan; Suero, Manuel; Gambara, Hilda
2010-01-01
A meta-analysis of the reliability of the scores from a specific test, also called reliability generalization, allows the quantitative synthesis of its properties from a set of studies. It is usually assumed that part of the variation in the reliability coefficients is due to some unknown and implicit mechanism that restricts and biases the…
Reliability Generalization of the Patterns of Adaptive Learning Survey Goal Orientation Scales
ERIC Educational Resources Information Center
Ross, Margaret E.; Blackburn, Marcy; Forbes, Sean
2005-01-01
A reliability generalization study was completed on the Patterns of Adaptive Learning Survey achievement goal orientation scales to assess the prediction of (a) the different orientation scales, (b) the adaptation of items to meet research needs, (c) the number of respondents completing the instrument, and (d) the publication date cited for the…
ERIC Educational Resources Information Center
Usher, Wayne
2009-01-01
This study was undertaken to determine the level of understanding of Gold Coast general practitioners (GPs) pertaining to such criteria as reliability, interactive and usability components associated with health websites. These are important considerations due to the increased levels of computer and World Wide Web (WWW)/Internet use and health…
Skinner, Ian W; Hübscher, Markus; Moseley, G Lorimer; Lee, Hopin; Wand, Benedict M; Traeger, Adrian C; Gustin, Sylvia M; McAuley, James H
2017-08-15
Eyetracking is commonly used to investigate attentional bias. Although some studies have investigated the internal consistency of eyetracking, data are scarce on the test-retest reliability and agreement of eyetracking to investigate attentional bias. This study reports the test-retest reliability, measurement error, and internal consistency of 12 commonly used outcome measures thought to reflect the different components of attentional bias: overall attention, early attention, and late attention. Healthy participants completed a preferential-looking eyetracking task that involved the presentation of threatening (sensory words, general threat words, and affective words) and nonthreatening words. We used intraclass correlation coefficients (ICCs) to measure test-retest reliability (ICC > .70 indicates adequate reliability). The ICCs(2, 1) ranged from -.31 to .71. Reliability varied according to the outcome measure and threat word category. Sensory words had a lower mean ICC (.08) than either affective words (.32) or general threat words (.29). A longer exposure time was associated with higher test-retest reliability. All of the outcome measures, except second-run dwell time, demonstrated low measurement error (<6%). Most of the outcome measures reported high internal consistency (α > .93). Recommendations are discussed for improving the reliability of eyetracking tasks in future research.
16 CFR 260.5 - Interpretation and substantiation of environmental marketing claims.
Code of Federal Regulations, 2011 CFR
2011-01-01
... reasonable basis substantiating the claim. A reasonable basis consists of competent and reliable evidence. In... reliable scientific evidence, defined as tests, analyses, research, studies or other evidence based on the... qualified to do so, using procedures generally accepted in the profession to yield accurate and reliable...
Salyers, M P; McHugo, G J; Cook, J A; Razzano, L A; Drake, R E; Mueser, K T
2001-09-01
Reliability of well-known instruments was examined in 202 people with severe mental illness participating in a multisite vocational study. We examined interrater reliability of the Positive and Negative Syndrome Scale (PANSS) and the internal consistency and test-retest reliability of the PANSS, the Rosenberg Self-Esteem Scale, the Medical Outcomes Study Short Form-36 (SF-36), and the Quality of Life Interview. Most scales had good levels of reliability, with intraclass correlation coefficients (ICCs) and coefficient alphas above .70. However, the SF-36 scales were generally less stable over time, particularly Social Functioning (ICC = .55). Test-retest reliability was lower among less educated respondents and among ethnic minorities. We recommend close monitoring of psychometric issues in future multisite studies.
Predicting Cost/Reliability/Maintainability of Advanced General Aviation Avionics Equipment
NASA Technical Reports Server (NTRS)
Davis, M. R.; Kamins, M.; Mooz, W. E.
1978-01-01
A methodology is provided for assisting NASA in estimating the cost, reliability, and maintenance (CRM) requirements for general avionics equipment operating in the 1980's. Practical problems of predicting these factors are examined. The usefulness and short comings of different approaches for modeling coast and reliability estimates are discussed together with special problems caused by the lack of historical data on the cost of maintaining general aviation avionics. Suggestions are offered on how NASA might proceed in assessing cost reliability CRM implications in the absence of reliable generalized predictive models.
ERIC Educational Resources Information Center
Setzer, J. Carl; He, Yi
2009-01-01
Reliability Analysis for the Internationally Administered 2002 Series GED (General Educational Development) Tests Reliability refers to the consistency, or stability, of test scores when the authors administer the measurement procedure repeatedly to groups of examinees (American Educational Research Association [AERA], American Psychological…
The Reliability and Validity of the Social Responsiveness Scale in a UK General Child Population
ERIC Educational Resources Information Center
Wigham, Sarah; McConachie, Helen; Tandos, Jonathan; Le Couteur, Ann S.
2012-01-01
This is the first UK study to report the reliability, validity, and factor structure of the Social Responsiveness Scale (SRS) in a general population sample. Parents of 500 children (aged 5-8 years) in North East England completed the SRS. Profiles of scores were similar to USA norms, and a single factor structure was identified. Good construct…
A Bayesian approach to reliability and confidence
NASA Technical Reports Server (NTRS)
Barnes, Ron
1989-01-01
The historical evolution of NASA's interest in quantitative measures of reliability assessment is outlined. The introduction of some quantitative methodologies into the Vehicle Reliability Branch of the Safety, Reliability and Quality Assurance (SR and QA) Division at Johnson Space Center (JSC) was noted along with the development of the Extended Orbiter Duration--Weakest Link study which will utilize quantitative tools for a Bayesian statistical analysis. Extending the earlier work of NASA sponsor, Richard Heydorn, researchers were able to produce a consistent Bayesian estimate for the reliability of a component and hence by a simple extension for a system of components in some cases where the rate of failure is not constant but varies over time. Mechanical systems in general have this property since the reliability usually decreases markedly as the parts degrade over time. While they have been able to reduce the Bayesian estimator to a simple closed form for a large class of such systems, the form for the most general case needs to be attacked by the computer. Once a table is generated for this form, researchers will have a numerical form for the general solution. With this, the corresponding probability statements about the reliability of a system can be made in the most general setting. Note that the utilization of uniform Bayesian priors represents a worst case scenario in the sense that as researchers incorporate more expert opinion into the model, they will be able to improve the strength of the probability calculations.
49 CFR Appendix E to Part 238 - General Principles of Reliability-Based Maintenance Programs
Code of Federal Regulations, 2010 CFR
2010-10-01
... 49 Transportation 4 2010-10-01 2010-10-01 false General Principles of Reliability-Based... STANDARDS Pt. 238, App. E Appendix E to Part 238—General Principles of Reliability-Based Maintenance... maintenance programs are based on the following general principles. A failure is an unsatisfactory condition...
Marshall, Andrew J; Evanovich, Emma K; David, Sarah Jo; Mumma, Gregory H
2018-01-17
High comorbidity rates among emotional disorders have led researchers to examine transdiagnostic factors that may contribute to shared psychopathology. Bifactor models provide a unique method for examining transdiagnostic variables by modelling the common and unique factors within measures. Previous findings suggest that the bifactor model of the Depression Anxiety and Stress Scale (DASS) may provide a method for examining transdiagnostic factors within emotional disorders. This study aimed to replicate the bifactor model of the DASS, a multidimensional measure of psychological distress, within a US adult sample and provide initial estimates of the reliability of the general and domain-specific factors. Furthermore, this study hypothesized that Worry, a theorized transdiagnostic variable, would show stronger relations to general emotional distress than domain-specific subscales. Confirmatory factor analysis was used to evaluate the bifactor model structure of the DASS in 456 US adult participants (279 females and 177 males, mean age 35.9 years) recruited online. The DASS bifactor model fitted well (CFI = 0.98; RMSEA = 0.05). The General Emotional Distress factor accounted for most of the reliable variance in item scores. Domain-specific subscales accounted for modest portions of reliable variance in items after accounting for the general scale. Finally, structural equation modelling indicated that Worry was strongly predicted by the General Emotional Distress factor. The DASS bifactor model is generalizable to a US community sample and General Emotional Distress, but not domain-specific factors, strongly predict the transdiagnostic variable Worry.
Riskind, J H; Beck, A T; Berchick, R J; Brown, G; Steer, R A
1987-09-01
This study examined the interrater reliability of generalized anxiety disorder (GAD) and major depressive disorder (MDD) diagnoses derived from the Structured Clinical Interview for DSM-III (SCID). Using videotaped interviews, paired raters made independent diagnoses of 75 psychiatric outpatients. The percent agreement of the raters was 82% for MDD and 86% for GAD; the respective kappa values were .72 and .79. The results indicated that the SCID can be employed reliably to differentiate MDD from GAD. The SCID is recommended for further research with these disorders.
The revised Generalized Expectancy for Success Scale: a validity and reliability study.
Hale, W D; Fiedler, L R; Cochran, C D
1992-07-01
The Generalized Expectancy for Success Scale (GESS; Fibel & Hale, 1978) was revised and assessed for reliability and validity. The revised version was administered to 199 college students along with other conceptually related measures, including the Rosenberg Self-Esteem Scale, the Life Orientation Test, and Rotter's Internal-External Locus of Control Scale. One subsample of students also completed the Eysenck Personality Inventory, while another subsample performed a criterion-related task that involved risk taking. Item analysis yielded 25 items with correlations of .45 or higher with the total score. Results indicated high internal consistency and test-retest reliability.
California condor plumage and molt as field study aids
Wilbur, S.R.
1975-01-01
An analysis is made of the reliability of plumage and molt characteristics of the California condor for estimating age and identifying individual birds. Neither character seems sufficiently reliable to use in more than a general way.
Measuring the Nonviolent Tendencies of College Students.
ERIC Educational Resources Information Center
Mayton, Daniel M., II; Richel, Timothy W.; Susnjic, Silvia; Majdanac, Maja
The Teenage Nonviolence Test (TNT) has previously been established as a generally reliable and valid measure of nonviolence in adolescents. This study examined the extent to which the TNT's reliability and validity could be extended to college students aged 18-22 years of age. Five of the six subscales of the TNT were found to be reliable. The…
Choosing a reliability inspection plan for interval censored data
Lu, Lu; Anderson-Cook, Christine Michaela
2017-04-19
Reliability test plans are important for producing precise and accurate assessment of reliability characteristics. This paper explores different strategies for choosing between possible inspection plans for interval censored data given a fixed testing timeframe and budget. A new general cost structure is proposed for guiding precise quantification of total cost in inspection test plan. Multiple summaries of reliability are considered and compared as the criteria for choosing the best plans using an easily adapted method. Different cost structures and representative true underlying reliability curves demonstrate how to assess different strategies given the logistical constraints and nature of the problem. Resultsmore » show several general patterns exist across a wide variety of scenarios. Given the fixed total cost, plans that inspect more units with less frequency based on equally spaced time points are favored due to the ease of implementation and consistent good performance across a large number of case study scenarios. Plans with inspection times chosen based on equally spaced probabilities offer improved reliability estimates for the shape of the distribution, mean lifetime, and failure time for a small fraction of population only for applications with high infant mortality rates. The paper uses a Monte Carlo simulation based approach in addition to the common evaluation based on the asymptotic variance and offers comparison and recommendation for different applications with different objectives. Additionally, the paper outlines a variety of different reliability metrics to use as criteria for optimization, presents a general method for evaluating different alternatives, as well as provides case study results for different common scenarios.« less
Choosing a reliability inspection plan for interval censored data
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lu, Lu; Anderson-Cook, Christine Michaela
Reliability test plans are important for producing precise and accurate assessment of reliability characteristics. This paper explores different strategies for choosing between possible inspection plans for interval censored data given a fixed testing timeframe and budget. A new general cost structure is proposed for guiding precise quantification of total cost in inspection test plan. Multiple summaries of reliability are considered and compared as the criteria for choosing the best plans using an easily adapted method. Different cost structures and representative true underlying reliability curves demonstrate how to assess different strategies given the logistical constraints and nature of the problem. Resultsmore » show several general patterns exist across a wide variety of scenarios. Given the fixed total cost, plans that inspect more units with less frequency based on equally spaced time points are favored due to the ease of implementation and consistent good performance across a large number of case study scenarios. Plans with inspection times chosen based on equally spaced probabilities offer improved reliability estimates for the shape of the distribution, mean lifetime, and failure time for a small fraction of population only for applications with high infant mortality rates. The paper uses a Monte Carlo simulation based approach in addition to the common evaluation based on the asymptotic variance and offers comparison and recommendation for different applications with different objectives. Additionally, the paper outlines a variety of different reliability metrics to use as criteria for optimization, presents a general method for evaluating different alternatives, as well as provides case study results for different common scenarios.« less
Software reliability models for critical applications
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pham, H.; Pham, M.
This report presents the results of the first phase of the ongoing EG G Idaho, Inc. Software Reliability Research Program. The program is studying the existing software reliability models and proposes a state-of-the-art software reliability model that is relevant to the nuclear reactor control environment. This report consists of three parts: (1) summaries of the literature review of existing software reliability and fault tolerant software reliability models and their related issues, (2) proposed technique for software reliability enhancement, and (3) general discussion and future research. The development of this proposed state-of-the-art software reliability model will be performed in the secondmore » place. 407 refs., 4 figs., 2 tabs.« less
Software reliability models for critical applications
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pham, H.; Pham, M.
This report presents the results of the first phase of the ongoing EG&G Idaho, Inc. Software Reliability Research Program. The program is studying the existing software reliability models and proposes a state-of-the-art software reliability model that is relevant to the nuclear reactor control environment. This report consists of three parts: (1) summaries of the literature review of existing software reliability and fault tolerant software reliability models and their related issues, (2) proposed technique for software reliability enhancement, and (3) general discussion and future research. The development of this proposed state-of-the-art software reliability model will be performed in the second place.more » 407 refs., 4 figs., 2 tabs.« less
A scale for measuring hygiene behavior: development, reliability and validity.
Stevenson, Richard J; Case, Trevor I; Hodgson, Deborah; Porzig-Drummond, Renata; Barouei, Javad; Oaten, Megan J
2009-09-01
There is currently no general self-report measure for assessing hygiene behavior. This article details the development and testing of such a measure. In studies 1 to 4, a total of 855 participants were used for scale and subscale development and for reliability and validity testing. The latter involved establishing the relationships between self-reported hygiene behavior and existing measures, hand hygiene behavior, illness rates, and a physiological marker of immune function. In study 5, a total of 507 participants were used to assess the psychometric properties of the final revised version of the scale. The final 23-item scale comprised 5 subscales: general, household, food-related, handwashing technique, and personal hygiene. Studies 1 to 4 confirmed the scale's reliability and validity, and study 5 confirmed the scale's 5-factor structure. The scale is potentially suitable for multiple uses, in various settings, and for experimental and correlational approaches.
Cross-Cultural Perspectives of Service Quality and Risk in Air Transportation
NASA Technical Reports Server (NTRS)
Cunningham, Lawrence F.; Young, Clifford E.; Lee, Moonkyu
2002-01-01
This study compares US and Korean customers in terms of their perceptions of airline service quality based on SERVPERF and industry-based measures, as well as their perceptions of risks involved in the airline choice. SERVPERF is a set of multi-dimensional measures of customer evaluations of service quality. The results indicate that: (1) US passengers are generally more satisfied with their airline service than Korean customers on most of the SERVPERF dimensions; (2) Koreans are generally more satisfied with the bumping procedures whereas US participants feel more satisfied with the airline's baggage handling, operations/safety, and connections; and (3) US participants perceive higher levels of performance and financial risks whereas Koreans feel greater social risk in choosing an airline. This study also examines the SERVPERF, industry-based measure, and perceived risk in predicting customer satisfaction with, and intention to repatronize the airline. The results suggest that US customers consider service reliability, in-flight comfort, and connections as the key factors determining satisfaction with airline service whereas Korean passengers generally regard reliability, assurance, and risk factors as predictors of satisfaction. The determining factors of customer intention to repatronize the airline are reliability and empathy for US, and reliability and overall risk for Korean customers. The study demonstrates the applicability of SERVPERF as a cross-cultural tool and indicates the importance of perceived risk in cross-cultural studies.
2016-10-01
Reports an error in "Reliability Generalization of the Multigroup Ethnic Identity Measure-Revised (MEIM-R)" by Hayley M. Herrington, Timothy B. Smith, Erika Feinauer and Derek Griner ( Journal of Counseling Psychology , Advanced Online Publication, Mar 17, 2016, np). The name of author Erika Feinauer was misspelled as Erika Feinhauer. All versions of this article have been corrected. (The following abstract of the original article appeared in record 2016-13160-001.) Individuals' strength of ethnic identity has been linked with multiple positive indicators, including academic achievement and overall psychological well-being. The measure researchers use most often to assess ethnic identity, the Multigroup Ethnic Identity Measure (MEIM), underwent substantial revision in 2007. To inform scholars investigating ethnic identity, we performed a reliability generalization analysis on data from the revised version (MEIM-R) and compared it with data from the original MEIM. Random-effects weighted models evaluated internal consistency coefficients (Cronbach's alpha). Reliability coefficients for the MEIM-R averaged α = .88 across 37 samples, a statistically significant increase over the average of α = .84 for the MEIM across 75 studies. Reliability coefficients for the MEIM-R did not differ across study and participant characteristics such as sample gender and ethnic composition. However, consistently lower reliability coefficients averaging α = .81 were found among participants with low levels of education, suggesting that greater attention to data reliability is warranted when evaluating the ethnic identity of individuals such as middle-school students. Future research will be needed to ascertain whether data with other measures of aspects of personal identity (e.g., racial identity, gender identity) also differ as a function of participant level of education and associated cognitive or maturation processes. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Use of Internal Consistency Coefficients for Estimating Reliability of Experimental Tasks Scores
Green, Samuel B.; Yang, Yanyun; Alt, Mary; Brinkley, Shara; Gray, Shelley; Hogan, Tiffany; Cowan, Nelson
2017-01-01
Reliabilities of scores for experimental tasks are likely to differ from one study to another to the extent that the task stimuli change, the number of trials varies, the type of individuals taking the task changes, the administration conditions are altered, or the focal task variable differs. Given reliabilities vary as a function of the design of these tasks and the characteristics of the individuals taking them, making inferences about the reliability of scores in an ongoing study based on reliability estimates from prior studies is precarious. Thus, it would be advantageous to estimate reliability based on data from the ongoing study. We argue that internal consistency estimates of reliability are underutilized for experimental task data and in many applications could provide this information using a single administration of a task. We discuss different methods for computing internal consistency estimates with a generalized coefficient alpha and the conditions under which these estimates are accurate. We illustrate use of these coefficients using data for three different tasks. PMID:26546100
Loeding, B L; Greenan, J P
1998-12-01
The study examined the validity and reliability of four assessments, with three instruments per domain. Domains included generalizable mathematics, communication, interpersonal relations, and reasoning skills. Participants were deaf, legally blind, or visually impaired students enrolled in vocational classes at residential secondary schools. The researchers estimated the internal consistency reliability, test-retest reliability, and construct validity correlations of three subinstruments: student self-ratings, teacher ratings, and performance assessments. The data suggest that these instruments are highly internally consistent measures of generalizable vocational skills. Four performance assessments have high-to-moderate test-retest reliability estimates, and were generally considered to possess acceptable validity and reliability.
Photomultiplier tube reliability study for the HEAO program
NASA Technical Reports Server (NTRS)
Richardson, C.
1974-01-01
Results concerning the research on photomultiplier tubes required for the HEAO program are reported. The general specifications are discussed for providing a series of tests for helping the operational reliability of its application, and for permitting comparison of performance of similar types, from various manufacturers.
Open and Distance Education Accreditation Standards Scale: Validity and Reliability Studies
ERIC Educational Resources Information Center
Can, Ertug
2016-01-01
The purpose of this study is to develop, and test the validity and reliability of a scale for the use of researchers to determine the accreditation standards of open and distance education based on the views of administrators, teachers, staff and students. This research was designed according to the general descriptive survey model since it aims…
Insightful practice: a reliable measure for medical revalidation
Guthrie, Bruce; Sullivan, Frank M; Mercer, Stewart W; Russell, Andrew; Bruce, David A
2012-01-01
Background Medical revalidation decisions need to be reliable if they are to reassure on the quality and safety of professional practice. This study tested an innovative method in which general practitioners (GPs) were assessed on their reflection and response to a set of externally specified feedback. Setting and participants 60 GPs and 12 GP appraisers in the Tayside region of Scotland, UK. Methods A feedback dataset was specified as (1) GP-specific data collected by GPs themselves (patient and colleague opinion; open book self-evaluated knowledge test; complaints) and (2) Externally collected practice-level data provided to GPs (clinical quality and prescribing safety). GPs' perceptions of whether the feedback covered UK General Medical Council specified attributes of a ‘good doctor’ were examined using a mapping exercise. GPs' professionalism was examined in terms of appraiser assessment of GPs' level of insightful practice, defined as: engagement with, insight into and appropriate action on feedback data. The reliability of assessment of insightful practice and subsequent recommendations on GPs' revalidation by face-to-face and anonymous assessors were investigated using Generalisability G-theory. Main outcome measures Coverage of General Medical Council attributes by specified feedback and reliability of assessor recommendations on doctors' suitability for revalidation. Results Face-to-face assessment proved unreliable. Anonymous global assessment by three appraisers of insightful practice was highly reliable (G=0.85), as were revalidation decisions using four anonymous assessors (G=0.83). Conclusions Unlike face-to-face appraisal, anonymous assessment of insightful practice offers a valid and reliable method to decide GP revalidation. Further validity studies are needed. PMID:22653078
Shayan, Zahra; Pourmovahed, Zahra; Najafipour, Fatemeh; Abdoli, Ali Mohammad; Mohebpour, Fatemeh; Najafipour, Sedighe
2015-12-01
Nowadays, infertility problems have become a social concern, and are associated with multiple psychological and social problems. Also, it affects the interpersonal communication between the individual, familial, and social characteristics. Since women are exposed to stressors of physical, mental, social factors, and treatment of infertility, providing a psychometric screening tool is necessary for disorders of this group. The aim of this study was to determine the factor structure of the general health questionnaire-28 to discover mental disorders in infertile women. In this study, 220 infertile women undergoing treatment of infertility were selected from the Yazd Research and Clinical Center for Infertility with convenience sampling in 2011. After completing the general health questionnaire by the project manager, validity and, reliability of the questionnaire were calculated by confirmatory factor structure and Cronbach's alpha, respectively. Four factors, including anxiety and insomnia, social dysfunction, depression, and physical symptoms were extracted from the factor structure. 50.12% of the total variance was explained by four factors. The reliability coefficient of the questionnaire was obtained 0.90. Analysis of the factor structure and reliability of General Health Questionnaire-28 showed that it is suitable as a screening instrument for assessing general health of infertile women.
A Monte Carlo Simulation Study of the Reliability of Intraindividual Variability
Estabrook, Ryne; Grimm, Kevin J.; Bowles, Ryan P.
2012-01-01
Recent research has seen intraindividual variability (IIV) become a useful technique to incorporate trial-to-trial variability into many types of psychological studies. IIV as measured by individual standard deviations (ISDs) has shown unique prediction to several types of positive and negative outcomes (Ram, Rabbit, Stollery, & Nesselroade, 2005). One unanswered question regarding measuring intraindividual variability is its reliability and the conditions under which optimal reliability is achieved. Monte Carlo simulation studies were conducted to determine the reliability of the ISD compared to the intraindividual mean. The results indicate that ISDs generally have poor reliability and are sensitive to insufficient measurement occasions, poor test reliability, and unfavorable amounts and distributions of variability in the population. Secondary analysis of psychological data shows that use of individual standard deviations in unfavorable conditions leads to a marked reduction in statistical power, although careful adherence to underlying statistical assumptions allows their use as a basic research tool. PMID:22268793
Reliability reporting across studies using the Buss Durkee Hostility Inventory.
Vassar, Matt; Hale, William
2009-01-01
Empirical research on anger and hostility has pervaded the academic literature for more than 50 years. Accurate measurement of anger/hostility and subsequent interpretation of results requires that the instruments yield strong psychometric properties. For consistent measurement, reliability estimates must be calculated with each administration, because changes in sample characteristics may alter the scale's ability to generate reliable scores. Therefore, the present study was designed to address reliability reporting practices for a widely used anger assessment, the Buss Durkee Hostility Inventory (BDHI). Of the 250 published articles reviewed, 11.2% calculated and presented reliability estimates for the data at hand, 6.8% cited estimates from a previous study, and 77.1% made no mention of score reliability. Mean alpha estimates of scores for BDHI subscales generally fell below acceptable standards. Additionally, no detectable pattern was found between reporting practices and publication year or journal prestige. Areas for future research are also discussed.
Ratter, Julia; Radlinger, Lorenz; Lucas, Cees
2014-09-01
Are submaximal and maximal exercise tests reliable, valid and acceptable in people with chronic pain, fibromyalgia and fatigue disorders? Systematic review of studies of the psychometric properties of exercise tests. People older than 18 years with chronic pain, fibromyalgia and chronic fatigue disorders. Studies of the measurement properties of tests of physical capacity in people with chronic pain, fibromyalgia or chronic fatigue disorders were included. Studies were required to report: reliability coefficients (intraclass correlation coefficient, alpha reliability coefficient, limits of agreements and Bland-Altman plots); validity coefficients (intraclass correlation coefficient, Spearman's correlation, Kendal T coefficient, Pearson's correlation); or dropout rates. Fourteen studies were eligible: none had low risk of bias, 10 had unclear risk of bias and four had high risk of bias. The included studies evaluated: Åstrand test; modified Åstrand test; Lean body mass-based Åstrand test; submaximal bicycle ergometer test following another protocol other than Åstrand test; 2-km walk test; 5-minute, 6-minute and 10-minute walk tests; shuttle walk test; and modified symptom-limited Bruce treadmill test. None of the studies assessed maximal exercise tests. Where they had been tested, reliability and validity were generally high. Dropout rates were generally acceptable. The 2-km walk test was not recommended in fibromyalgia. Moderate evidence was found for reliability, validity and acceptability of submaximal exercise tests in patients with chronic pain, fibromyalgia or chronic fatigue. There is no evidence about maximal exercise tests in patients with chronic pain, fibromyalgia and chronic fatigue. Copyright © 2014. Published by Elsevier B.V.
ERIC Educational Resources Information Center
La Monte, Michelle Evonne
2012-01-01
This study focused on developing a valid and reliable instrument that can not only identify successful co-teaching, but also the professional development needs of co-teachers and their administrators in public schools. Two general questions about the quality of co-teaching were addressed in this study: (a) How well did descriptors within each of…
ERIC Educational Resources Information Center
Thombs, Brett D.; Bernstein, David P.; Lobbestael, Jill; Arntz, Arnoud
2009-01-01
Objective: The 28-item Childhood Trauma Questionnaire-Short Form (CTQ-SF) has been translated into at least 10 different languages. The validity of translated versions of the CTQ-SF, however, has generally not been examined. The objective of this study was to investigate the factor structure, internal consistency reliability, and known-groups…
Reliability of an interactive computer program for advance care planning.
Schubart, Jane R; Levi, Benjamin H; Camacho, Fabian; Whitehead, Megan; Farace, Elana; Green, Michael J
2012-06-01
Despite widespread efforts to promote advance directives (ADs), completion rates remain low. Making Your Wishes Known: Planning Your Medical Future (MYWK) is an interactive computer program that guides individuals through the process of advance care planning, explaining health conditions and interventions that commonly involve life or death decisions, helps them articulate their values/goals, and translates users' preferences into a detailed AD document. The purpose of this study was to demonstrate that (in the absence of major life changes) the AD generated by MYWK reliably reflects an individual's values/preferences. English speakers ≥30 years old completed MYWK twice, 4 to 6 weeks apart. Reliability indices were assessed for three AD components: General Wishes; Specific Wishes for treatment; and Quality-of-Life values (QoL). Twenty-four participants completed the study. Both the Specific Wishes and QoL scales had high internal consistency in both time periods (Knuder Richardson formula 20 [KR-20]=0.83-0.95, and 0.86-0.89). Test-retest reliability was perfect for General Wishes (κ=1), high for QoL (Pearson's correlation coefficient=0.83), but lower for Specific Wishes (Pearson's correlation coefficient=0.57). MYWK generates an AD where General Wishes and QoL (but not Specific Wishes) statements remain consistent over time.
Reliability of an Interactive Computer Program for Advance Care Planning
Levi, Benjamin H.; Camacho, Fabian; Whitehead, Megan; Farace, Elana; Green, Michael J
2012-01-01
Abstract Despite widespread efforts to promote advance directives (ADs), completion rates remain low. Making Your Wishes Known: Planning Your Medical Future (MYWK) is an interactive computer program that guides individuals through the process of advance care planning, explaining health conditions and interventions that commonly involve life or death decisions, helps them articulate their values/goals, and translates users' preferences into a detailed AD document. The purpose of this study was to demonstrate that (in the absence of major life changes) the AD generated by MYWK reliably reflects an individual's values/preferences. English speakers ≥30 years old completed MYWK twice, 4 to 6 weeks apart. Reliability indices were assessed for three AD components: General Wishes; Specific Wishes for treatment; and Quality-of-Life values (QoL). Twenty-four participants completed the study. Both the Specific Wishes and QoL scales had high internal consistency in both time periods (Knuder Richardson formula 20 [KR-20]=0.83–0.95, and 0.86–0.89). Test-retest reliability was perfect for General Wishes (κ=1), high for QoL (Pearson's correlation coefficient=0.83), but lower for Specific Wishes (Pearson's correlation coefficient=0.57). MYWK generates an AD where General Wishes and QoL (but not Specific Wishes) statements remain consistent over time. PMID:22512830
PREDICTION OF RELIABILITY IN BIOGRAPHICAL QUESTIONNAIRES.
ERIC Educational Resources Information Center
STARRY, ALLAN R.
THE OBJECTIVES OF THIS STUDY WERE (1) TO DEVELOP A GENERAL CLASSIFICATION SYSTEM FOR LIFE HISTORY ITEMS, (2) TO DETERMINE TEST-RETEST RELIABILITY ESTIMATES, AND (3) TO ESTIMATE RESISTANCE TO EXAMINEE FAKING, FOR REPRESENTATIVE BIOGRAPHICAL QUESTIONNAIRES. TWO 100-ITEM QUESTIONNAIRES WERE CONSTRUCTED THROUGH RANDOM ASSIGNMENT BY CONTENT AREA OF 200…
Richler, Jennifer J.; Floyd, R. Jackie; Gauthier, Isabel
2014-01-01
Efforts to understand individual differences in high-level vision necessitate the development of measures that have sufficient reliability, which is generally not a concern in group studies. Holistic processing is central to research on face recognition and, more recently, to the study of individual differences in this area. However, recent work has shown that the most popular measure of holistic processing, the composite task, has low reliability. This is particularly problematic for the recent surge in interest in studying individual differences in face recognition. Here, we developed and validated a new measure of holistic face processing specifically for use in individual-differences studies. It avoids some of the pitfalls of the standard composite design and capitalizes on the idea that trial variability allows for better traction on reliability. Across four experiments, we refine this test and demonstrate its reliability. PMID:25228629
The interrater reliability of DSM III in children.
Werry, J S; Methven, R J; Fitzpatrick, J; Dixon, H
1983-09-01
A total of 195 admissions to a child psychiatric inpatient unit were diagnosed independently by two to four clinicians on the basis of case presentations at the first ward-round after admission. The DSM III as a whole and the major categories were of high or acceptable reliability, though a few were clearly unreliable. The results are generally consistent with other studies. Unlike other studies, the subcategories were examined and found to vary widely in reliability both as a whole across the system and within parent major categories, throwing considerable doubt upon their utility. The results indicate the need both for improved diagnostic data-gathering techniques in child psychiatry and for more better-designed studies of reliability and, most necessarily, of validity.
The reliability of a quality appraisal tool for studies of diagnostic reliability (QAREL).
Lucas, Nicholas; Macaskill, Petra; Irwig, Les; Moran, Robert; Rickards, Luke; Turner, Robin; Bogduk, Nikolai
2013-09-09
The aim of this project was to investigate the reliability of a new 11-item quality appraisal tool for studies of diagnostic reliability (QAREL). The tool was tested on studies reporting the reliability of any physical examination procedure. The reliability of physical examination is a challenging area to study given the complex testing procedures, the range of tests, and lack of procedural standardisation. Three reviewers used QAREL to independently rate 29 articles, comprising 30 studies, published during 2007. The articles were identified from a search of relevant databases using the following string: "Reproducibility of results (MeSH) OR reliability (t.w.) AND Physical examination (MeSH) OR physical examination (t.w.)." A total of 415 articles were retrieved and screened for inclusion. The reviewers undertook an independent trial assessment prior to data collection, followed by a general discussion about how to score each item. At no time did the reviewers discuss individual papers. Reliability was assessed for each item using multi-rater kappa (κ). Multi-rater reliability estimates ranged from κ = 0.27 to 0.92 across all items. Six items were recorded with good reliability (κ > 0.60), three with moderate reliability (κ = 0.41 - 0.60), and two with fair reliability (κ = 0.21 - 0.40). Raters found it difficult to agree about the spectrum of patients included in a study (Item 1) and the correct application and interpretation of the test (Item 10). In this study, we found that QAREL was a reliable assessment tool for studies of diagnostic reliability when raters agreed upon criteria for the interpretation of each item. Nine out of 11 items had good or moderate reliability, and two items achieved fair reliability. The heterogeneity in the tests included in this study may have resulted in an underestimation of the reliability of these two items. We discuss these and other factors that could affect our results and make recommendations for the use of QAREL.
Dimensional indicators of generalized anxiety disorder severity for DSM-V.
Niles, Andrea N; Lebeau, Richard T; Liao, Betty; Glenn, Daniel E; Craske, Michelle G
2012-03-01
For DSM-V, simple dimensional measures of disorder severity will accompany diagnostic criteria. The current studies examine convergent validity and test-retest reliability of two potential dimensional indicators of worry severity for generalized anxiety disorder (GAD): percent of the day worried and number of worry domains. In study 1, archival data from diagnostic interviews from a community sample of individuals diagnosed with one or more anxiety disorders (n = 233) were used to assess correlations between percent of the day worried and number of worry domains with other measures of worry severity (clinical severity rating (CSR), age of onset, number of comorbid disorders, Penn state worry questionnaire (PSWQ)) and DSM-IV criteria (excessiveness, uncontrollability and number of physical symptoms). Both measures were significantly correlated with CSR and number of comorbid disorders, and with all three DSM-IV criteria. In study 2, test-retest reliability of percent of the day worried and number of worry domains were compared to test-retest reliability of DSM-IV diagnostic criteria in a non-clinical sample of undergraduate students (n = 97) at a large west coast university. All measures had low test-retest reliability except percent of the day worried, which had moderate test-retest reliability. Findings suggest that these two indicators capture worry severity, and percent of the day worried may be the most reliable existing indicator. These measures may be useful as dimensional measures for DSM-V. Copyright © 2012 Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Li, Lin; Zeng, Li; Lin, Zi-Jing; Cazzell, Mary; Liu, Hanli
2015-05-01
Test-retest reliability of neuroimaging measurements is an important concern in the investigation of cognitive functions in the human brain. To date, intraclass correlation coefficients (ICCs), originally used in inter-rater reliability studies in behavioral sciences, have become commonly used metrics in reliability studies on neuroimaging and functional near-infrared spectroscopy (fNIRS). However, as there are six popular forms of ICC, the adequateness of the comprehensive understanding of ICCs will affect how one may appropriately select, use, and interpret ICCs toward a reliability study. We first offer a brief review and tutorial on the statistical rationale of ICCs, including their underlying analysis of variance models and technical definitions, in the context of assessment on intertest reliability. Second, we provide general guidelines on the selection and interpretation of ICCs. Third, we illustrate the proposed approach by using an actual research study to assess intertest reliability of fNIRS-based, volumetric diffuse optical tomography of brain activities stimulated by a risk decision-making protocol. Last, special issues that may arise in reliability assessment using ICCs are discussed and solutions are suggested.
Perceived experiences of atheist discrimination: Instrument development and evaluation.
Brewster, Melanie E; Hammer, Joseph; Sawyer, Jacob S; Eklund, Austin; Palamar, Joseph
2016-10-01
The present 2 studies describe the development and initial psychometric evaluation of a new instrument, the Measure of Atheist Discrimination Experiences (MADE), which may be used to examine the minority stress experiences of atheist people. Items were created from prior literature, revised by a panel of expert researchers, and assessed psychometrically. In Study 1 (N = 1,341 atheist-identified people), an exploratory factor analysis with 665 participants suggested the presence of 5 related dimensions of perceived discrimination. However, bifactor modeling via confirmatory factor analysis and model-based reliability estimates with data from the remaining 676 participants affirmed the presence of a strong "general" factor of discrimination and mixed to poor support for substantive subdimensions. In Study 2 (N = 1,057 atheist-identified people), another confirmatory factor analysis and model-based reliability estimates strongly supported the bifactor model from Study 1 (i.e., 1 strong "general" discrimination factor) and poor support for subdimensions. Across both studies, the MADE general factor score demonstrated evidence of good reliability (i.e., Cronbach's alphas of .94 and .95; omega hierarchical coefficients of .90 and .92), convergent validity (i.e., with stigma consciousness, β = .56; with awareness of public devaluation, β = .37), and preliminary evidence for concurrent validity (i.e., with loneliness β = .18; with psychological distress β = .27). Reliability and validity evidence for the MADE subscale scores was not sufficient to warrant future use of the subscales. Limitations and implications for future research and clinical work with atheist individuals are discussed. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
The Role of Temperament in Children's Reliance on Others as Sources of Information
ERIC Educational Resources Information Center
Canfield, Caitlin F.; Saudino, Kimberly J.; Ganea, Patricia A.
2015-01-01
By 3?years of age, children generally have a firm understanding of others' reliability, but there is considerable variation among individual children. Little attention has been paid to factors that influence such individual differences. This study addressed this by assessing the relation between reliability understanding and temperament in…
Spanager, Lene; Beier-Holgersen, Randi; Dieckmann, Peter; Konge, Lars; Rosenberg, Jacob; Oestergaard, Doris
2013-11-01
Nontechnical skills are essential for safe and efficient surgery. The aim of this study was to evaluate the reliability of an assessment tool for surgeons' nontechnical skills, Non-Technical Skills for Surgeons dk (NOTSSdk), and the effect of rater training. A 1-day course was conducted for 15 general surgeons in which they rated surgeons' nontechnical skills in 9 video recordings of scenarios simulating real intraoperative situations. Data were gathered from 2 sessions separated by a 4-hour training session. Interrater reliability was high for both pretraining ratings (Cronbach's α = .97) and posttraining ratings (Cronbach's α = .98). There was no statistically significant development in assessment skills. The D study showed that 2 untrained raters or 1 trained rater was needed to obtain generalizability coefficients >.80. The high pretraining interrater reliability indicates that videos were easy to rate and Non-Technical Skills for Surgeons dk easy to use. This implies that Non-Technical Skills for Surgeons dk (NOTSSdk) could be an important tool in surgical training, potentially improving safety and quality for surgical patients. Copyright © 2013 Elsevier Inc. All rights reserved.
Relating design and environmental variables to reliability
NASA Astrophysics Data System (ADS)
Kolarik, William J.; Landers, Thomas L.
The combination of space application and nuclear power source demands high reliability hardware. The possibilities of failure, either an inability to provide power or a catastrophic accident, must be minimized. Nuclear power experiences on the ground have led to highly sophisticated probabilistic risk assessment procedures, most of which require quantitative information to adequately assess such risks. In the area of hardware risk analysis, reliability information plays a key role. One of the lessons learned from the Three Mile Island experience is that thorough analyses of critical components are essential. Nuclear grade equipment shows some reliability advantages over commercial. However, no statistically significant difference has been found. A recent study pertaining to spacecraft electronics reliability, examined some 2500 malfunctions on more than 300 aircraft. The study classified the equipment failures into seven general categories. Design deficiencies and lack of environmental protection accounted for about half of all failures. Within each class, limited reliability modeling was performed using a Weibull failure model.
Komal
2018-05-01
Nowadays power consumption is increasing day-by-day. To fulfill failure free power requirement, planning and implementation of an effective and reliable power management system is essential. Phasor measurement unit(PMU) is one of the key device in wide area measurement and control systems. The reliable performance of PMU assures failure free power supply for any power system. So, the purpose of the present study is to analyse the reliability of a PMU used for controllability and observability of power systems utilizing available uncertain data. In this paper, a generalized fuzzy lambda-tau (GFLT) technique has been proposed for this purpose. In GFLT, system components' uncertain failure and repair rates are fuzzified using fuzzy numbers having different shapes such as triangular, normal, cauchy, sharp gamma and trapezoidal. To select a suitable fuzzy number for quantifying data uncertainty, system experts' opinion have been considered. The GFLT technique applies fault tree, lambda-tau method, fuzzified data using different membership functions, alpha-cut based fuzzy arithmetic operations to compute some important reliability indices. Furthermore, in this study ranking of critical components of the system using RAM-Index and sensitivity analysis have also been performed. The developed technique may be helpful to improve system performance significantly and can be applied to analyse fuzzy reliability of other engineering systems. Copyright © 2018 ISA. Published by Elsevier Ltd. All rights reserved.
Lower Bounds to the Reliabilities of Factor Score Estimators.
Hessen, David J
2016-10-06
Under the general common factor model, the reliabilities of factor score estimators might be of more interest than the reliability of the total score (the unweighted sum of item scores). In this paper, lower bounds to the reliabilities of Thurstone's factor score estimators, Bartlett's factor score estimators, and McDonald's factor score estimators are derived and conditions are given under which these lower bounds are equal. The relative performance of the derived lower bounds is studied using classic example data sets. The results show that estimates of the lower bounds to the reliabilities of Thurstone's factor score estimators are greater than or equal to the estimates of the lower bounds to the reliabilities of Bartlett's and McDonald's factor score estimators.
A reliability study of the new sensors for movement analysis (SHARIF-HMIS).
Abedi, Mohen; Manshadi, Farideh Dehghan; Zavieh, Minoo Khalkhali; Ashouri, Sajad; Azimi, Hadi; Parnanpour, Mohamad
2016-04-01
SHARIF-HMIS is a new inertial sensor designed for movement analysis. The aim of the present study was to assess the inter-tester and intra-tester reliability of some kinematic parameters in different lumbar motions making use of this sensor. 24 healthy persons and 28 patients with low back pain participated in the current reliability study. The test was performed in five different lumbar motions consisting of lumbar flexion in 0, 15, and 30° in the right and left directions. For measuring inter-tester reliability, all the tests were carried out twice on the same day separately by two physiotherapists. Intra-tester reliability was assessed by reproducing the tests after 3 days by the same physiotherapist. The present study revealed satisfactory inter- and intra-tester reliability indices in different positions. ICCs for intra-tester reliability ranged from 0.65 to 0.98 and 0.59 to 0.81 for healthy and patient participants, respectively. Also, ICCs for inter-tester reliability ranged from 0.65 to 0.92 for the healthy and 0.65 to 0.87 for patient participants. In general, it can be inferred from the results that measuring the kinematic parameters in lumbar movements using inertial sensors enjoys acceptable reliability. Copyright © 2015 Elsevier Ltd. All rights reserved.
Daniel-Filho, Durval Anibal; Pires, Elda Maria Stafuzza Gonçalves; Paes, Angela Tavares; Troster, Eduardo Juan; Silva, Simone Cristina Azevedo B S; Granato, Mariana Fachini; Couto, Thomaz Bittencourt; Barreto, Joyce Kelly Silva; Campos, Alexandre Holthausen; Monte, Julio C Martins; Schvartsman, Claudio
2017-10-01
Evaluation of non-cognitive skills never has been used in Brazil. This study aims to evaluate Multiple Mini Interviews (MMI) in the admission process of a School of Medicine in São Paulo, Brazil. The population of the study comprised 240 applicants summoned for the interviews, and 96 raters. MMI contributed to 25% of the applicants' final grade. Eight scenarios were created with the aim of evaluating different non-cognitive skills, each one had two raters. At the end of the interviews, the applicants and raters described their impressions about MMI. The reliability of the MMI was analyzed using the Theory of Generalization and Many-Facet Rasch Model (MFRM). The G-study showed that the general reliability of the process was satisfactory (coefficient G = 0.743). The MMI grades were not affected by the raters' profile, time of interview (p = 0.715), and randomization group (p = 0.353). The Rasch analysis showed that there was no misfitting effects or inconsistent stations or raters. A significant majority of the applicants (98%) and all the raters believed MMIs were important in selecting students with a more adequate profile to study medicine. The general reliability of the selection process was excellent, and it was fully accepted by the applicants and raters.
An Investigation of the Impact of Guessing on Coefficient α and Reliability
2014-01-01
Guessing is known to influence the test reliability of multiple-choice tests. Although there are many studies that have examined the impact of guessing, they used rather restrictive assumptions (e.g., parallel test assumptions, homogeneous inter-item correlations, homogeneous item difficulty, and homogeneous guessing levels across items) to evaluate the relation between guessing and test reliability. Based on the item response theory (IRT) framework, this study investigated the extent of the impact of guessing on reliability under more realistic conditions where item difficulty, item discrimination, and guessing levels actually vary across items with three different test lengths (TL). By accommodating multiple item characteristics simultaneously, this study also focused on examining interaction effects between guessing and other variables entered in the simulation to be more realistic. The simulation of the more realistic conditions and calculations of reliability and classical test theory (CTT) item statistics were facilitated by expressing CTT item statistics, coefficient α, and reliability in terms of IRT model parameters. In addition to the general negative impact of guessing on reliability, results showed interaction effects between TL and guessing and between guessing and test difficulty.
The validation of Huffaz Intelligence Test (HIT)
NASA Astrophysics Data System (ADS)
Rahim, Mohd Azrin Mohammad; Ahmad, Tahir; Awang, Siti Rahmah; Safar, Ajmain
2017-08-01
In general, a hafiz who can memorize the Quran has many specialties especially in respect to their academic performances. In this study, the theory of multiple intelligences introduced by Howard Gardner is embedded in a developed psychometric instrument, namely Huffaz Intelligence Test (HIT). This paper presents the validation and the reliability of HIT of some tahfiz students in Malaysia Islamic schools. A pilot study was conducted involving 87 huffaz who were randomly selected to answer the items in HIT. The analysis method used includes Partial Least Square (PLS) on reliability, convergence and discriminant validation. The study has validated nine intelligences. The findings also indicated that the composite reliabilities for the nine types of intelligences are greater than 0.8. Thus, the HIT is a valid and reliable instrument to measure the multiple intelligences among huffaz.
The Children's Play Therapy Instrument (CPTI). Description, development, and reliability studies.
Kernberg, P F; Chazan, S E; Normandin, L
1998-01-01
The Children's Play Therapy Instrument (CPTI), its development, and reliability studies are described. The CPTI is a new instrument to examine a child's play activity in individual psychotherapy. Three independent raters used the CPTI to rate eight videotaped play therapy vignettes. Results were compared with the authors' consensual scores from a preliminary study. Generally good to excellent levels of interrater reliability were obtained for the independent raters on intraclass correlation coefficients for ordinal categories of the CPTI. Likewise, kappa levels were acceptable to excellent for nominal categories of the scale. The CPTI holds promise to become a reliable measure of play activity in child psychotherapy. Further research is needed to assess discriminant validity of the CPTI for use as a diagnostic tool and as a measure of process and outcome.
Are general surgeons able to accurately self-assess their level of technical skills?
Rizan, C; Ansell, J; Tilston, T W; Warren, N; Torkington, J
2015-11-01
Self-assessment is a way of improving technical capabilities without the need for trainer feedback. It can identify areas for improvement and promote professional medical development. The aim of this review was to identify whether self-assessment is an accurate form of technical skills appraisal in general surgery. The PubMed, MEDLINE(®), Embase(™) and Cochrane databases were searched for studies assessing the reliability of self-assessment of technical skills in general surgery. For each study, we recorded the skills assessed and the evaluation methods used. Common endpoints between studies were compared to provide recommendations based on the levels of evidence. Twelve studies met the inclusion criteria from 22,292 initial papers. There was no level 1 evidence published. All papers compared the correlation between self-appraisal versus an expert score but differed in the technical skills assessment and the evaluation tools used. The accuracy of self-assessment improved with increasing experience (level 2 recommendation), age (level 3 recommendation) and the use of video playback (level 3 recommendation). Accuracy was reduced by stressful learning environments (level 2 recommendation), lack of familiarity with assessment tools (level 3 recommendation) and in advanced surgical procedures (level 3 recommendation). Evidence exists to support the reliability of self-assessment of technical skills in general surgery. Several variables have been shown to affect the accuracy of self-assessment of technical skills. Future work should focus on evaluating the reliability of self-assessment during live operating procedures.
General Dissociation Scale and Hypnotizability with African American College Students.
ERIC Educational Resources Information Center
Sapp, Marty; Hitchcock, Kim
The purpose of this study was to assess the reliability of the General Dissociation Scale with African American college students, and provide additional data on how to assess hypnotizability with these students. Two-hundred and two undergraduate African American college students participated in this study. Students completed the HGSHS:A, a measure…
Generalized Trust and Trust in Institutions in Confucian Asia
ERIC Educational Resources Information Center
Tan, Soo Jiuan; Tambyah, Siok Kuan
2011-01-01
This study examines generalized trust and trust in institutions in Confucian Asia, covering six countries namely, China, Japan, Singapore, South Korea, Taiwan and Vietnam, and one dependent region, Hong Kong. Using data from the 2006 AsiaBarometer Survey, our study affirms the reliability and validity of using a two-item scale to measure…
Murphy, Douglas J; Bruce, David A; Mercer, Stewart W; Eva, Kevin W
2009-05-01
To investigate the reliability and feasibility of six potential workplace-based assessment methods in general practice training: criterion audit, multi-source feedback from clinical and non-clinical colleagues, patient feedback (the CARE Measure), referral letters, significant event analysis, and video analysis of consultations. Performance of GP registrars (trainees) was evaluated with each tool to assess the reliabilities of the tools and feasibility, given raters and number of assessments needed. Participant experience of process determined by questionnaire. 171 GP registrars and their trainers, drawn from nine deaneries (representing all four countries in the UK), participated. The ability of each tool to differentiate between doctors (reliability) was assessed using generalisability theory. Decision studies were then conducted to determine the number of observations required to achieve an acceptably high reliability for "high-stakes assessment" using each instrument. Finally, descriptive statistics were used to summarise participants' ratings of their experience using these tools. Multi-source feedback from colleagues and patient feedback on consultations emerged as the two methods most likely to offer a reliable and feasible opinion of workplace performance. Reliability co-efficients of 0.8 were attainable with 41 CARE Measure patient questionnaires and six clinical and/or five non-clinical colleagues per doctor when assessed on two occasions. For the other four methods tested, 10 or more assessors were required per doctor in order to achieve a reliable assessment, making the feasibility of their use in high-stakes assessment extremely low. Participant feedback did not raise any major concerns regarding the acceptability, feasibility, or educational impact of the tools. The combination of patient and colleague views of doctors' performance, coupled with reliable competence measures, may offer a suitable evidence-base on which to monitor progress and completion of doctors' training in general practice.
An instrument for assessment of videotapes of general practitioners' performance.
Cox, J; Mulholland, H
1993-01-01
OBJECTIVES--To identify those important characteristics of doctors' and patients' behaviour that distinguish between "good" and "bad" consultations when viewed on videotape; to use these characteristics to develop a reliable instrument for assessing general practitioners' performance in their own consultations. DESIGN--Questionnaires completed by patients, general practitioner trainers, and general practitioner trainees. Reliability of draft instrument tested by general practitioner trainers. SETTING--All vocational training schemes for general practice in the Northern region of England. SUBJECTS--First stage: 76 patients in seven groups, 108 general practice trainers in 12 groups, and 122 general practice trainees in 10 groups. Second stage: 85 general practice trainers in 12 groups. MAIN OUTCOME MEASURES--Trainers' ratings of importance; alpha coefficients of draft instrument by trainee, group, and consultation. RESULTS--6890 characteristics of good and bad consultations were consolidated into a draft assessment instrument consisting of 46 pairs of definitions separated by six point bipolar scales. Nine statement pairs given low importance ratings by trainers were eliminated, reducing the instrument to 37 statement pairs. To test reliability, general practitioner trainers used the instrument to assess three consultations. With the exception of one group of trainers, all alpha coefficients exceeded the acceptable level of 0.80. CONCLUSION--The instrument produced is reliable for assessing general practitioners' performance in their own consultations. PMID:8490501
Software reliability models for fault-tolerant avionics computers and related topics
NASA Technical Reports Server (NTRS)
Miller, Douglas R.
1987-01-01
Software reliability research is briefly described. General research topics are reliability growth models, quality of software reliability prediction, the complete monotonicity property of reliability growth, conceptual modelling of software failure behavior, assurance of ultrahigh reliability, and analysis techniques for fault-tolerant systems.
Standardization of the Gordon Primary Measures of Music Audiation in Greece
ERIC Educational Resources Information Center
Stamou, Lelouda; Schmidt, Charles P.; Humphreys, Jere T.
2010-01-01
The purpose of this study was to standardize the Primary Measures of Music Audiation in Greece ( N = 1,188). Split-halves reliability was acceptable across grade levels (K through 3) for the Tonal and Rhythm subtests, but test-retest reliability was generally unacceptable, especially for the Rhythm subtest. Concurrent validity was mixed, with…
Combinatorial Reliability and Repair
1992-07-01
Press, Oxford, 1987. [2] G. Gordon and L. Traldi, Generalized activities and the Tutte polynomial, Discrete Math . 85 (1990), 167-176. [3] A. B. Huseby, A...Chromatic polynomials and network reliability, Discrete Math . 67 (1987), 57-79. [7] A. Satayanarayana and R. K. Wood, A linear-time algorithm for comput- ing...K-terminal reliability in series-parallel networks, SIAM J. Comput. 14 (1985), 818-832. [8] L. Traldi, Generalized activities and K-terminal reliability, Discrete Math . 96 (1991), 131-149. 4
Reliability assessments in qualitative health promotion research.
Cook, Kay E
2012-03-01
This article contributes to the debate about the use of reliability assessments in qualitative research in general, and health promotion research in particular. In this article, I examine the use of reliability assessments in qualitative health promotion research in response to health promotion researchers' commonly held misconception that reliability assessments improve the rigor of qualitative research. All qualitative articles published in the journal Health Promotion International from 2003 to 2009 employing reliability assessments were examined. In total, 31.3% (20/64) articles employed some form of reliability assessment. The use of reliability assessments increased over the study period, ranging from <20% in 2003/2004 to 50% and above in 2008/2009, while at the same time the total number of qualitative articles decreased. The articles were then classified into four types of reliability assessments, including the verification of thematic codes, the use of inter-rater reliability statistics, congruence in team coding and congruence in coding across sites. The merits of each type were discussed, with the subsequent discussion focusing on the deductive nature of reliable thematic coding, the limited depth of immediately verifiable data and the usefulness of such studies to health promotion and the advancement of the qualitative paradigm.
Nimon, Kim; Zientek, Linda Reichwein; Henson, Robin K.
2012-01-01
The purpose of this article is to help researchers avoid common pitfalls associated with reliability including incorrectly assuming that (a) measurement error always attenuates observed score correlations, (b) different sources of measurement error originate from the same source, and (c) reliability is a function of instrumentation. To accomplish our purpose, we first describe what reliability is and why researchers should care about it with focus on its impact on effect sizes. Second, we review how reliability is assessed with comment on the consequences of cumulative measurement error. Third, we consider how researchers can use reliability generalization as a prescriptive method when designing their research studies to form hypotheses about whether or not reliability estimates will be acceptable given their sample and testing conditions. Finally, we discuss options that researchers may consider when faced with analyzing unreliable data. PMID:22518107
Reliability Generalization: An Examination of the Positive Affect and Negative Affect Schedule
ERIC Educational Resources Information Center
Leue, Anja; Lange, Sebastian
2011-01-01
The assessment of positive affect (PA) and negative affect (NA) by means of the Positive Affect and Negative Affect Schedule has received a remarkable popularity in the social sciences. Using a meta-analytic tool--namely, reliability generalization (RG)--population reliability scores of both scales have been investigated on the basis of a random…
Reliability and agreement in student ratings of the class environment.
Nelson, Peter M; Christ, Theodore J
2016-09-01
The current study estimated the reliability and agreement of student ratings of the classroom environment obtained using the Responsive Environmental Assessment for Classroom Teaching (REACT; Christ, Nelson, & Demers, 2012; Nelson, Demers, & Christ, 2014). Coefficient alpha, class-level reliability, and class agreement indices were evaluated as each index provides important information for different interpretations and uses of student rating scale data. Data for 84 classes across 29 teachers in a suburban middle school were sampled to derive reliability and agreement indices for the REACT subscales across 4 class sizes: 25, 20, 15, and 10. All participating teachers were White and a larger number of 6th-grade classes were included (42%) relative to 7th- (33%) or 8th- (23%) grade classes. Teachers were responsible for a variety of content areas, including language arts (26%), science (26%), math (20%), social studies (19%), communications (6%), and Spanish (3%). Coefficient alpha estimates were generally high across all subscales and class sizes (α = .70-.95); class-mean estimates were greatly impacted by the number of students sampled from each class, with class-level reliability values generally falling below .70 when class size was reduced from 25 to 20. Further, within-class student agreement varied widely across the REACT subscales (mean agreement = .41-.80). Although coefficient alpha and test-retest reliability are commonly reported in research with student rating scales, class-level reliability and agreement are not. The observed differences across coefficient alpha, class-level reliability, and agreement indices provide evidence for evaluating students' ratings of the class environment according to their intended use (e.g., differentiating between classes, class-level instructional decisions). (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Valle, Susanne Collier; Støen, Ragnhild; Sæther, Rannei; Jensenius, Alexander Refsum; Adde, Lars
2015-10-01
A computer-based video analysis has recently been presented for quantitative assessment of general movements (GMs). This method's test-retest reliability, however, has not yet been evaluated. The aim of the current study was to evaluate the test-retest reliability of computer-based video analysis of GMs, and to explore the association between computer-based video analysis and the temporal organization of fidgety movements (FMs). Test-retest reliability study. 75 healthy, term-born infants were recorded twice the same day during the FMs period using a standardized video set-up. The computer-based movement variables "quantity of motion mean" (Qmean), "quantity of motion standard deviation" (QSD) and "centroid of motion standard deviation" (CSD) were analyzed, reflecting the amount of motion and the variability of the spatial center of motion of the infant, respectively. In addition, the association between the variable CSD and the temporal organization of FMs was explored. Intraclass correlation coefficients (ICC 1.1 and ICC 3.1) were calculated to assess test-retest reliability. The ICC values for the variables CSD, Qmean and QSD were 0.80, 0.80 and 0.86 for ICC (1.1), respectively; and 0.80, 0.86 and 0.90 for ICC (3.1), respectively. There were significantly lower CSD values in the recordings with continual FMs compared to the recordings with intermittent FMs (p<0.05). This study showed high test-retest reliability of computer-based video analysis of GMs, and a significant association between our computer-based video analysis and the temporal organization of FMs. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Neziraj, M; Sarac Kart, N; Samuelson, Karin
2011-08-01
The view of delirium has changed considerably over the last decade, and delirium is now a very topical issue within the intensive care unit (ICU) setting. Delirium has proved to be common in critically ill patients and is manifested as acute changes in mental status with reduced cognitive ability, incoherent thought patterns, impaired consciousness, agitation and acute confusion. In order to be able to prevent, identify and alleviate problems related to delirium it is important that validated instruments for delirium screening are implemented and evaluated. The aim of this study was to translate the Intensive Care Delirium Screening Checklist (ICDSC) into Swedish and test the inter-rater reliability in a Swedish general ICU setting. The study was carried out during 2009 in a general Swedish ICU. A translation of the scale from English into Swedish was made, including back-translation, critical review and pilot testing. A total of 49 paired ratings were carried out using the Swedish version of the ICDSC scale. The inter-rater reliability was tested using weighted kappa (κ) statistics (linear weighting). The ICDSC scale was successfully translated into Swedish and the inter-rater reliability testing of the Swedish version resulted in a weighted k value of 0.92. The result of this study indicates that the Swedish version of the ICDSC scale has a very good inter-rater reliability. The high inter-rater reliability and the ease of administration make the ICDSC scale applicable for delirium screening in a Swedish ICU setting. © 2011 The Authors. Acta Anaesthesiologica Scandinavica © 2011 The Acta Anaesthesiologica Scandinavica Foundation.
Terluin, Berend; Smits, Niels; Brouwers, Evelien P M; de Vet, Henrica C W
2016-09-15
The Four-Dimensional Symptom Questionnaire (4DSQ) is a self-report questionnaire measuring distress, depression, anxiety and somatization with separate scales. The 4DSQ has extensively been validated in clinical samples, especially from primary care settings. Information about measurement properties and normative data in the general population was lacking. In a Dutch general population sample we examined the 4DSQ scales' structure, the scales' reliability and measurement invariance with respect to gender, age and education, the scales' score distributions across demographic categories, and normative data. 4DSQ data were collected in a representative Dutch Internet panel. Confirmatory factor analysis was used to examine the scales' structure. Reliability was examined by Cronbach's alpha, and coefficients omega-total and omega-hierarchical. Differential item functioning (DIF) analysis was used to evaluate measurement invariance across gender, age and education. The total response rate was 82.4 % (n = 5273/6399). The depression scale proved to be unidimensional. The other scales were best represented as bifactor models consisting of a large general factor and one or more smaller specific factors. The general factors accounted for more than 95 % of the reliable variance of the scales. Reliability was high (≥0.85) by all estimates. The distress-, depression- and anxiety scales were invariant across gender, age and education. The somatization scale demonstrated some lack of measurement invariance as a result of decreased thresholds for some of the items in young people (16-24 years) and increased thresholds in elderly people (65+ years). The somatization scale was invariant regarding gender and education. The 4DSQ scores varied significantly across demographic categories, but the explained variance was small (<6 %). Normative data were generated for gender and age categories. Approximately 17 % of the participants scored above average on de distress scale, whereas 12 % scored above average on de somatization scale. Percentages of people scoring high enough on depression or anxiety as to suspect the presence of depressive or anxiety disorder were 4.1 and 2.5 respectively. Evidence supports reliability and measurement invariance of the 4DSQ in the general Dutch population. The normative data provided in this study can be used to compare a subject's 4DSQ scores with a general population reference group.
Abma, Femke I; van der Klink, Jac J L; Bültmann, Ute
2013-03-01
The promotion of a sustainable, healthy and productive working life attracts more and more attention. Recently the Work Role Functioning Questionnaire (WRFQ) has been cross-culturally translated and adapted to Dutch. This questionnaire aims to measure the health-related work functioning of workers with health problems. The aim of this study is to evaluate the reliability, validity (including five new items) and responsiveness of the WRFQ 2.0 in the working population. A longitudinal study was conducted among workers. The reliability (internal consistency, test-retest reliability, measurement error), validity (structural validity-factor analysis, construct validity by means of hypotheses testing) and responsiveness of the WRFQ 2.0 were evaluated. A total of N = 553 workers completed the survey. The final WRFQ 2.0 has four subscales and showed very good internal consistency, moderate test-retest reliability, good construct validity and moderate responsiveness in the working population. The WRFQ was able to distinguish between groups with different levels of mental health, physical health, fatigue and need for recovery. A moderate correlation was found between WRFQ and related constructs respectively work ability and work productivity. A weak relationship was found with general self-rated health, work engagement and work involvement. The WRFQ 2.0 is a reliable and valid instrument to measure health-related work functioning in the working population. Further validation in larger samples is recommended, especially for test-retest reliability, responsiveness and the questionnaire's ability to predict the future course of health-related work functioning.
Hand assessment in older adults with musculoskeletal hand problems: a reliability study.
Myers, Helen L; Thomas, Elaine; Hay, Elaine M; Dziedzic, Krysia S
2011-01-07
Musculoskeletal hand pain is common in the general population. This study aims to investigate the inter- and intra-observer reliability of two trained observers conducting a simple clinical interview and physical examination for hand problems in older adults. The reliability of applying the American College of Rheumatology (ACR) criteria for hand osteoarthritis to community-dwelling older adults will also be investigated. Fifty-five participants aged 50 years and over with a current self-reported hand problem and registered with one general practice were recruited from a previous health questionnaire study. Participants underwent a standardised, structured clinical interview and physical examination by two independent trained observers and again by one of these observers a month later. Agreement beyond chance was summarised using Kappa statistics and intra-class correlation coefficients. Median values for inter- and intra-observer reliability for clinical interview questions were found to be "substantial" and "moderate" respectively [median agreement beyond chance (Kappa) was 0.75 (range: -0.03, 0.93) for inter-observer ratings and 0.57 (range: -0.02, 1.00) for intra-observer ratings]. Inter- and intra-observer reliability for physical examination items was variable, with good reliability observed for some items, such as grip and pinch strength, and poor reliability observed for others, notably assessment of altered sensation, pain on resisted movement and judgements based on observation and palpation of individual features at single joints, such as bony enlargement, nodes and swelling. Moderate agreement was observed both between and within observers when applying the ACR criteria for hand osteoarthritis. Standardised, structured clinical interview is reliable for taking a history in community-dwelling older adults with self reported hand problems. Agreement between and within observers for physical examination items is variable. Low Kappa values may have resulted, in part, from a low prevalence of clinical signs and symptoms in the study participants. The decision to use clinical interview and hand assessment variables in clinical practice or further research in primary care should include consideration of clinical applicability and training alongside reliability. Further investigation is required to determine the relationship between these clinical questions and assessments and the clinical course of hand pain and hand problems in community-dwelling older adults.
NASA Technical Reports Server (NTRS)
English, Thomas
2005-01-01
A standard tool of reliability analysis used at NASA-JSC is the event tree. An event tree is simply a probability tree, with the probabilities determining the next step through the tree specified at each node. The nodal probabilities are determined by a reliability study of the physical system at work for a particular node. The reliability study performed at a node is typically referred to as a fault tree analysis, with the potential of a fault tree existing.for each node on the event tree. When examining an event tree it is obvious why the event tree/fault tree approach has been adopted. Typical event trees are quite complex in nature, and the event tree/fault tree approach provides a systematic and organized approach to reliability analysis. The purpose of this study was two fold. Firstly, we wanted to explore the possibility that a semi-Markov process can create dependencies between sojourn times (the times it takes to transition from one state to the next) that can decrease the uncertainty when estimating time to failures. Using a generalized semi-Markov model, we studied a four element reliability model and were able to demonstrate such sojourn time dependencies. Secondly, we wanted to study the use of semi-Markov processes to introduce a time variable into the event tree diagrams that are commonly developed in PRA (Probabilistic Risk Assessment) analyses. Event tree end states which change with time are more representative of failure scenarios than are the usual static probability-derived end states.
Lucas, Nicholas; Macaskill, Petra; Irwig, Les; Moran, Robert; Bogduk, Nikolai
2009-01-01
Trigger points are promoted as an important cause of musculoskeletal pain. There is no accepted reference standard for the diagnosis of trigger points, and data on the reliability of physical examination for trigger points are conflicting. To systematically review the literature on the reliability of physical examination for the diagnosis of trigger points. MEDLINE, EMBASE, and other sources were searched for articles reporting the reliability of physical examination for trigger points. Included studies were evaluated for their quality and applicability, and reliability estimates were extracted and reported. Nine studies were eligible for inclusion. None satisfied all quality and applicability criteria. No study specifically reported reliability for the identification of the location of active trigger points in the muscles of symptomatic participants. Reliability estimates varied widely for each diagnostic sign, for each muscle, and across each study. Reliability estimates were generally higher for subjective signs such as tenderness (kappa range, 0.22-1.0) and pain reproduction (kappa range, 0.57-1.00), and lower for objective signs such as the taut band (kappa range, -0.08-0.75) and local twitch response (kappa range, -0.05-0.57). No study to date has reported the reliability of trigger point diagnosis according to the currently proposed criteria. On the basis of the limited number of studies available, and significant problems with their design, reporting, statistical integrity, and clinical applicability, physical examination cannot currently be recommended as a reliable test for the diagnosis of trigger points. The reliability of trigger point diagnosis needs to be further investigated with studies of high quality that use current diagnostic criteria in clinically relevant patients.
Park, Ji Eun; Han, Kyunghwa; Sung, Yu Sub; Chung, Mi Sun; Koo, Hyun Jung; Yoon, Hee Mang; Choi, Young Jun; Lee, Seung Soo; Kim, Kyung Won; Shin, Youngbin; An, Suah; Cho, Hyo-Min
2017-01-01
Objective To evaluate the frequency and adequacy of statistical analyses in a general radiology journal when reporting a reliability analysis for a diagnostic test. Materials and Methods Sixty-three studies of diagnostic test accuracy (DTA) and 36 studies reporting reliability analyses published in the Korean Journal of Radiology between 2012 and 2016 were analyzed. Studies were judged using the methodological guidelines of the Radiological Society of North America-Quantitative Imaging Biomarkers Alliance (RSNA-QIBA), and COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) initiative. DTA studies were evaluated by nine editorial board members of the journal. Reliability studies were evaluated by study reviewers experienced with reliability analysis. Results Thirty-one (49.2%) of the 63 DTA studies did not include a reliability analysis when deemed necessary. Among the 36 reliability studies, proper statistical methods were used in all (5/5) studies dealing with dichotomous/nominal data, 46.7% (7/15) of studies dealing with ordinal data, and 95.2% (20/21) of studies dealing with continuous data. Statistical methods were described in sufficient detail regarding weighted kappa in 28.6% (2/7) of studies and regarding the model and assumptions of intraclass correlation coefficient in 35.3% (6/17) and 29.4% (5/17) of studies, respectively. Reliability parameters were used as if they were agreement parameters in 23.1% (3/13) of studies. Reproducibility and repeatability were used incorrectly in 20% (3/15) of studies. Conclusion Greater attention to the importance of reporting reliability, thorough description of the related statistical methods, efforts not to neglect agreement parameters, and better use of relevant terminology is necessary. PMID:29089821
Bidi, Fatemeh; Namdari-Pejman, Mahdi; Kareshki, Hossein; Ahmadnia, Hadi
2012-01-01
Internet addiction is one of the harmful effects of the Internet. The findings of several studies have indicated a relationship between general health and Internet addiction. Metacognition, which includes the knowledge, processes, and strategies to evaluate, and monitor or control the cognition, can play a significant role in this regard. The present research aimed to assess the mediating role of metacognitive variables in the relationship between Internet addiction and general health. This correlational study included 94 male and female users with different nationalities at Internet cafés in Abu Dhabi (the United Arab Emirates). All subjects aged at least 18 years and were proficient in English. The research tools included the General Health Questionnaire (with a reliability of 0.89), Metacognition Questionnaire (with a reliability of 0.82), and Kimberly Young's Internet Addiction Test (with a reliability of 0.88). The hypothesis was tested applying SPSS18 and Amos18. The results indicated a significant positive relationship between all aspects of metacognition and Internet addiction (r = 0.30; P < 0.01). A significant positive relationship was also observed between Internet addiction and general health (r = 0.47; P < 0.01). Path analysis revealed the mediating role of metacognition in the relationship between low general health and Internet addiction. Among the metacognitive variables, the mind control had the highest correlation coefficient (r = 0.80). The internet and digital technologies have caused unwanted and negative effects which are classified as emerging damages. The relationship between Internet addiction and general health has been confirmed in this research. In addition, metacognitive processes can have a positive and mediating role on this relationship.
Lee, Chin-Pang; Chiu, Yu-Wen; Chu, Chun-Lin; Chen, Yu; Jiang, Kun-Hao; Chen, Jiun-Liang; Chen, Ching-Yen
2016-12-01
The aging males' symptoms (AMS) scale is an instrument used to determine the health-related quality of life in adult and elderly men. The purpose of this study was to synthesize internal consistency (Cronbach's alpha) and test-retest reliability for the AMS scale and its three subscales. Of the 123 studies reviewed, 12 provided alpha coefficients which were then used in the meta-analyses of internal consistency. Seven of the 12 included studies provided test-retest coefficients, and these were used in the meta-analyses of test-retest reliability. The AMS scale had excellent internal consistency [α = 0.89 (95% CI 0.88-0.90)]; the mean alpha estimates across the AMS subscales ranged from 0.79 to 0.82. The AMS scale also had good test-retest reliability [r = 0.85 (95% CI 0.82-0.88]; the test-retest reliability coefficients of the AMS subscales ranged from 0.76 to 0.83. There was significant heterogeneity among the included studies. The AMS scale and the three subscales had fairly good internal consistency and test-retest reliability. Future psychometric studies of the AMS scale should report important characteristics of the participants, details of item scores, and test-retest reliability.
Moore, Amy Lawson; Miller, Terissa M
2018-01-01
The purpose of the current study is to evaluate the validity and reliability of the revised Gibson Test of Cognitive Skills, a computer-based battery of tests measuring short-term memory, long-term memory, processing speed, logic and reasoning, visual processing, as well as auditory processing and word attack skills. This study included 2,737 participants aged 5-85 years. A series of studies was conducted to examine the validity and reliability using the test performance of the entire norming group and several subgroups. The evaluation of the technical properties of the test battery included content validation by subject matter experts, item analysis and coefficient alpha, test-retest reliability, split-half reliability, and analysis of concurrent validity with the Woodcock Johnson III Tests of Cognitive Abilities and Tests of Achievement. Results indicated strong sources of evidence of validity and reliability for the test, including internal consistency reliability coefficients ranging from 0.87 to 0.98, test-retest reliability coefficients ranging from 0.69 to 0.91, split-half reliability coefficients ranging from 0.87 to 0.91, and concurrent validity coefficients ranging from 0.53 to 0.93. The Gibson Test of Cognitive Skills-2 is a reliable and valid tool for assessing cognition in the general population across the lifespan.
Models for evaluating the performability of degradable computing systems
NASA Technical Reports Server (NTRS)
Wu, L. T.
1982-01-01
Recent advances in multiprocessor technology established the need for unified methods to evaluate computing systems performance and reliability. In response to this modeling need, a general modeling framework that permits the modeling, analysis and evaluation of degradable computing systems is considered. Within this framework, several user oriented performance variables are identified and shown to be proper generalizations of the traditional notions of system performance and reliability. Furthermore, a time varying version of the model is developed to generalize the traditional fault tree reliability evaluation methods of phased missions.
Holm, Søren; Hofmann, Bjørn
2017-10-01
A precondition for reducing scientific misconduct is evidence about scientists' attitudes. We need reliable survey instruments, and this study investigates the reliability of Kalichman's "Survey 2: research misconduct" questionnaire. The study is a post hoc analysis of data from three surveys among biomedical doctoral students in Scandinavia (2010-2015). We perform reliability analysis, and exploratory and confirmatory factor analysis using a split-sample design as a partial validation. The results indicate that a reliable 13-item scale can be formed (Cronbach's α = .705), and factor analysis indicates that there are four reliable subscales each tapping a different construct: (a) general attitude to misconduct (α = .768), (b) attitude to personal misconduct (α = .784), (c) attitude to whistleblowing (α = .841), and (d) attitude to blameworthiness/punishment (α = .877). A full validation of the questionnaire requires further research. We, nevertheless, hope that the results will facilitate the increased use of the questionnaire in research.
Electrical service reliability: the customer perspective
DOE Office of Scientific and Technical Information (OSTI.GOV)
Samsa, M.E.; Hub, K.A.; Krohm, G.C.
1978-09-01
Electric-utility-system reliability criteria have traditionally been established as a matter of utility policy or through long-term engineering practice, generally with no supportive customer cost/benefit analysis as justification. This report presents results of an initial study of the customer perspective toward electric-utility-system reliability, based on critical review of over 20 previous and ongoing efforts to quantify the customer's value of reliable electric service. A possible structure of customer classifications is suggested as a reasonable level of disaggregation for further investigation of customer value, and these groups are characterized in terms of their electricity use patterns. The values that customers assign tomore » reliability are discussed in terms of internal and external cost components. A list of options for effecting changes in customer service reliability is set forth, and some of the many policy issues that could alter customer-service reliability are identified.« less
The factorial reliability of the Middlesex Hospital Questionnaire in normal subjects.
Bagley, C
1980-03-01
The internal reliability of the Middlesex Hospital Questionnaire and its component subscales has been checked by means of principal components analyses of data on 256 normal subjects. The subscales (with the possible exception of Hysteria) were found to contribute to the general underlying factor of psychoneurosis. In general, the principal components analysis points to the reliability of the subscales, despite some item overlap.
The inventory for déjà vu experiences assessment. Development, utility, reliability, and validity.
Sno, H N; Schalken, H F; de Jonghe, F; Koeter, M W
1994-01-01
In this article the development, utility, reliability, and validity of the Inventory for Déjà vu Experiences Assessment (IDEA) are described. The IDEA is a 23-item self-administered questionnaire consisting of a general section of nine questions and qualitative section of 14 questions. The latter questions comprise 48 topics. The questionnaire appeared to be a user-friendly instrument with satisfactory to good reliability and validity. The IDEA permits the study of quantitative and qualitative characteristics of déjà vu experiences.
Lindskog, Marcus; Winman, Anders; Juslin, Peter; Poom, Leo
2013-01-01
Two studies investigated the reliability and predictive validity of commonly used measures and models of Approximate Number System acuity (ANS). Study 1 investigated reliability by both an empirical approach and a simulation of maximum obtainable reliability under ideal conditions. Results showed that common measures of the Weber fraction (w) are reliable only when using a substantial number of trials, even under ideal conditions. Study 2 compared different purported measures of ANS acuity as for convergent and predictive validity in a within-subjects design and evaluated an adaptive test using the ZEST algorithm. Results showed that the adaptive measure can reduce the number of trials needed to reach acceptable reliability. Only direct tests with non-symbolic numerosity discriminations of stimuli presented simultaneously were related to arithmetic fluency. This correlation remained when controlling for general cognitive ability and perceptual speed. Further, the purported indirect measure of ANS acuity in terms of the Numeric Distance Effect (NDE) was not reliable and showed no sign of predictive validity. The non-symbolic NDE for reaction time was significantly related to direct w estimates in a direction contrary to the expected. Easier stimuli were found to be more reliable, but only harder (7:8 ratio) stimuli contributed to predictive validity. PMID:23964256
Guillén-Riquelme, Alejandro; Buela-Casal, Gualberto
2014-01-01
Since its creation the STAI has been cited in more than 14,000 documents, with more than 60 adaptations in different countries. In some adaptations this instrument has no clinical scores. The aim of this work is to determine if the State-Trait Anxiety Inventory (STAI) has higher scores in patients diagnosed with anxiety than in general population. In addition, we want to examine if the internal consistency is adequate in anxious patient samples. We performed a literature search in Tripdatabase, Cochrane, Web of Knowledge, Scopus, PyscINFO and Scholar Google, for documents published between 2008 y 2012. We selected 131 scientific articles to compare between patients diagnosed with anxiety and general population, and 25 for the generalization of reliability. For the analysis we used Cohen's d for means comparisons (random-effects method) and Cronbach's alpha for the reliability generalization (fixed-effects method). In the groups comparision the differences in state anxiety (d=1.39; CI95%: 1.22-1.56) and in the trait anxiety (d=1.74; CI95%:1.56-1.91) were significants. The reliability for patients of some anxiety disorder was between 0.87 and 0.93. So it seems that the STAI is sensitive to the level of anxiety of the individual and reliable for patients with diagnosis of panic attack, specific phobia, social phobia, generalized social phobia, generalized anxiety disorder, post-traumatic stress disorder, obsessive compulsive disorder or acute Stress disorder.
ERIC Educational Resources Information Center
Alzu'bi, Mohammad Akram
2014-01-01
The study aimed at analyzing English questions of the Jordanian Secondary Certificate Examinations via Blooms' cognitive levels. An analysis sheet was prepared by the researcher for the purpose of the study, which was ensured to be valid and reliable. The whole questions of the general secondary examinations for English course in both levels…
ERIC Educational Resources Information Center
Stefanic, Nicholas; Randles, Clint
2015-01-01
The purpose of this study was to explore the reliability of measures of both individual and group creative work using the consensual assessment technique (CAT). CAT was used to measure individual and group creativity among a population of pre-service music teachers enrolled in a secondary general music class (n = 23) and was evaluated from…
Evaluation of Environmental Profiles for Reliability Demonstration
1975-09-01
the increase in the ram air flow rate. As a result, one cannot generalize in advance about the effect of velocity increase on air-conditioner turbine ...152 6.2.6.3 Forced Cooling Air Temperature/ Flow Schedule. 152 Sample Test Provile ....... .............. 154 6.2.8 Profiles for Multi...Profiles for Reliability Demonstration Study Flow ....... . ....... 7 2 Typical MIL-STD-781 Profile ................ 23 3 Test Cycle A - Ambient Cooled
Study samples are too small to produce sufficiently precise reliability coefficients.
Charter, Richard A
2003-04-01
In a survey of journal articles, test manuals, and test critique books, the author found that a mean sample size (N) of 260 participants had been used for reliability studies on 742 tests. The distribution was skewed because the median sample size for the total sample was only 90. The median sample sizes for the internal consistency, retest, and interjudge reliabilities were 182, 64, and 36, respectively. The author presented sample size statistics for the various internal consistency methods and types of tests. In general, the author found that the sample sizes that were used in the internal consistency studies were too small to produce sufficiently precise reliability coefficients, which in turn could cause imprecise estimates of examinee true-score confidence intervals. The results also suggest that larger sample sizes have been used in the last decade compared with those that were used in earlier decades.
The Children's Play Therapy Instrument (CPTI): Description, Development, and Reliability Studies
Kernberg, Paulina F.; Chazan, Saralea E.; Normandin, Lina
1998-01-01
The Children's Play Therapy Instrument (CPTI), its development, and reliability studies are described. The CPTI is a new instrument to examine a child's play activity in individual psychotherapy. Three independent raters used the CPTI to rate eight videotaped play therapy vignettes. Results were compared with the authors' consensual scores from a preliminary study. Generally good to excellent levels of interrater reliability were obtained for the independent raters on intraclass correlation coefficients for ordinal categories of the CPTI. Likewise, kappa levels were acceptable to excellent for nominal categories of the scale. The CPTI holds promise to become a reliable measure of play activity in child psychotherapy. Further research is needed to assess discriminant validity of the CPTI for use as a diagnostic tool and as a measure of process and outcome.(The Journal of Psychotherapy Practice and Research 1998; 7:196–207) PMID:9631341
Keller, Carmen; Siegrist, Michael
2015-09-01
In an obesogenic environment, people have to adopt effective weight management strategies to successfully gain or maintain normal body weight. Little is known about the strategies used by the general population in daily life. Due to the lack of a comprehensive measurement instrument to assess conceptually different strategies with various scales, we developed the weight management strategies inventory (WMSI). In study 1, we collected 19 weight management strategies from research on self-regulation of food intake and successful weight loss and maintenance, as well as from expert interviews. We classified them under the five main categories of health self-regulation strategies - goal setting and monitoring, prospection and planning, automating behavior, construal, and inhibition. We formulated 93 items. In study 2, we developed the WMSI in a random sample from the general population (N = 658), using reliability and exploratory factor analysis. This resulted in 19 factors with 63 items, representing the 19 strategies. In study 3, we tested the 19-factor structure in a quota (age, gender) sample from the general population (N = 616), using confirmatory factor analysis. A good model fit (CFI = .918; RMSEA = .043) was revealed. Reliabilities and construct validity were high. Positive correlations of most strategies with dieting success and negative correlations of some strategies with body mass index were found among dieters (N = 292). Study 4 (N = 162) revealed a good test-retest reliability. The WMSI assesses theoretically derived, evidence-based, and conceptually different weight management strategies with different scales that have good psychometric characteristics. The scales can also be used for pre- and post measures in intervention studies. The scales provide insights into the general population's weight management strategies and facilitate tailoring and evaluating health communication. Copyright © 2015 Elsevier Ltd. All rights reserved.
Validity and Reliability of General Nutrition Knowledge Questionnaire for Adults in Uganda
Bukenya, Richard; Ahmed, Abhiya; Andrade, Jeanette M.; Grigsby-Toussaint, Diana S.; Muyonga, John; Andrade, Juan E.
2017-01-01
This study sought to develop and validate a general nutrition knowledge questionnaire (GNKQ) for Ugandan adults. The initial draft consisted of 133 items on five constructs associated with nutrition knowledge; expert recommendations (16 items), food groups (70 items), selecting food (10 items), nutrition and disease relationship (23 items), and food fortification in Uganda (14 items). The questionnaire validity was evaluated in three studies. For the content validity (study 1), a panel of five content matter nutrition experts reviewed the GNKQ draft before and after face validity. For the face validity (study 2), head teachers and health workers (n = 27) completed the questionnaire before attending one of three focus groups to review the clarity of the items. For the construct and test-rest reliability (study 3), head teachers (n = 40) from private and public primary schools and nutrition (n = 52) and engineering (n = 49) students from Makerere University took the questionnaire twice (two weeks apart). Experts agreed (content validity index, CVI > 0.9; reliability, Gwet’s AC1 > 0.85) that all constructs were relevant to evaluate nutrition knowledge. After the focus groups, 29 items were identified as unclear, requiring major (n = 5) and minor (n = 24) reviews. The final questionnaire had acceptable internal consistency (Cronbach α > 0.95), test-retest reliability (r = 0.89), and differentiated (p < 0.001) nutrition knowledge scores between nutrition (67 ± 5) and engineering (39 ± 11) students. Only the construct on nutrition recommendations was unreliable (Cronbach α = 0.51, test-retest r = 0.55), which requires further optimization. The final questionnaire included topics on food groups (41 items), selecting food (2 items), nutrition and disease relationship (14 items), and food fortification in Uganda (22 items) and had good content, construct, and test-retest reliability to evaluate nutrition knowledge among Ugandan adults. PMID:28230779
The reliability of multidimensional neuropsychological measures: from alpha to omega.
Watkins, Marley W
To demonstrate that Coefficient omega, a model-based estimate, is more a more appropriate index of reliability than coefficient alpha for the multidimensional scales that are commonly employed by neuropsychologists. As an illustration, a structural model of an overarching general factor and four first-order factors for the WAIS-IV based on the standardization sample of 2200 participants was identified and omega coefficients were subsequently computed for WAIS-IV composite scores. Alpha coefficients were ≥ .90 and omega coefficients ranged from .75 to .88 for WAIS-IV factor index scores, indicating that the blend of general and group factor variance in each index score created a reliable multidimensional composite. However, the amalgam of variance from general and group factors did not allow the precision of Full Scale IQ (FSIQ) and factor index scores to be disentangled. In contrast, omega hierarchical coefficients were low for all four factor index scores (.10-.41), indicating that most of the reliable variance of each factor index score was due to the general intelligence factor. In contrast, the omega hierarchical coefficient for the FSIQ score was .84. Meaningful interpretation of WAIS-IV factor index scores as unambiguous indicators of group factors is imprecise, thereby fostering unreliable identification of neurocognitive strengths and weaknesses, whereas the WAIS-IV FSIQ score can be interpreted as a reliable measure of general intelligence. It was concluded that neuropsychologists should base their clinical decisions on reliable scores as indexed by coefficient omega.
Wagner, Flávia; Martel, Michelle M; Cogo-Moreira, Hugo; Maia, Carlos Renato Moreira; Pan, Pedro Mario; Rohde, Luis Augusto; Salum, Giovanni Abrahão
2016-01-01
The best structural model for attention-deficit/hyperactivity disorder (ADHD) symptoms remains a matter of debate. The objective of this study is to test the fit and factor reliability of competing models of the dimensional structure of ADHD symptoms in a sample of randomly selected and high-risk children and pre-adolescents from Brazil. Our sample comprised 2512 children aged 6-12 years from 57 schools in Brazil. The ADHD symptoms were assessed using parent report on the development and well-being assessment (DAWBA). Fit indexes from confirmatory factor analysis were used to test unidimensional, correlated, and bifactor models of ADHD, the latter including "g" ADHD and "s" symptom domain factors. Reliability of all models was measured with omega coefficients. A bifactor model with one general factor and three specific factors (inattention, hyperactivity, impulsivity) exhibited the best fit to the data, according to fit indices, as well as the most consistent factor loadings. However, based on omega reliability statistics, the specific inattention, hyperactivity, and impulsivity dimensions provided very little reliable information after accounting for the reliable general ADHD factor. Our study presents some psychometric evidence that ADHD specific ("s") factors might be unreliable after taking common ("g" factor) variance into account. These results are in accordance with the lack of longitudinal stability among subtypes, the absence of dimension-specific molecular genetic findings and non-specific effects of treatment strategies. Therefore, researchers and clinicians might most effectively rely on the "g" ADHD to characterize ADHD dimensional phenotype, based on currently available symptom items.
Reliability generalization of the Multigroup Ethnic Identity Measure-Revised (MEIM-R).
Herrington, Hayley M; Smith, Timothy B; Feinauer, Erika; Griner, Derek
2016-10-01
[Correction Notice: An Erratum for this article was reported in Vol 63(5) of Journal of Counseling Psychology (see record 2016-33161-001). The name of author Erika Feinauer was misspelled as Erika Feinhauer. All versions of this article have been corrected.] Individuals' strength of ethnic identity has been linked with multiple positive indicators, including academic achievement and overall psychological well-being. The measure researchers use most often to assess ethnic identity, the Multigroup Ethnic Identity Measure (MEIM), underwent substantial revision in 2007. To inform scholars investigating ethnic identity, we performed a reliability generalization analysis on data from the revised version (MEIM-R) and compared it with data from the original MEIM. Random-effects weighted models evaluated internal consistency coefficients (Cronbach's alpha). Reliability coefficients for the MEIM-R averaged α = .88 across 37 samples, a statistically significant increase over the average of α = .84 for the MEIM across 75 studies. Reliability coefficients for the MEIM-R did not differ across study and participant characteristics such as sample gender and ethnic composition. However, consistently lower reliability coefficients averaging α = .81 were found among participants with low levels of education, suggesting that greater attention to data reliability is warranted when evaluating the ethnic identity of individuals such as middle-school students. Future research will be needed to ascertain whether data with other measures of aspects of personal identity (e.g., racial identity, gender identity) also differ as a function of participant level of education and associated cognitive or maturation processes. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Test battery for measuring the perception and recognition of facial expressions of emotion
Wilhelm, Oliver; Hildebrandt, Andrea; Manske, Karsten; Schacht, Annekathrin; Sommer, Werner
2014-01-01
Despite the importance of perceiving and recognizing facial expressions in everyday life, there is no comprehensive test battery for the multivariate assessment of these abilities. As a first step toward such a compilation, we present 16 tasks that measure the perception and recognition of facial emotion expressions, and data illustrating each task's difficulty and reliability. The scoring of these tasks focuses on either the speed or accuracy of performance. A sample of 269 healthy young adults completed all tasks. In general, accuracy and reaction time measures for emotion-general scores showed acceptable and high estimates of internal consistency and factor reliability. Emotion-specific scores yielded lower reliabilities, yet high enough to encourage further studies with such measures. Analyses of task difficulty revealed that all tasks are suitable for measuring emotion perception and emotion recognition related abilities in normal populations. PMID:24860528
Code of Federal Regulations, 2011 CFR
2011-01-01
... HUMAN RELIABILITY PROGRAM Establishment of and Procedures for the Human Reliability Program General Provisions § 712.1 Purpose. This part establishes the policies and procedures for a Human Reliability Program... judgment and reliability may be impaired by physical or mental/personality disorders, alcohol abuse, use of...
De Vet, Emely; De Ridder, Denise; Stok, Marijn; Brunso, Karen; Baban, Adriana; Gaspar, Tania
2014-09-02
Applying self-regulation strategies have proven important in eating behaviors, but it remains subject to investigation what strategies adolescents report to use to ensure healthy eating, and adequate measures are lacking. Therefore, we developed and validated a self-regulation questionnaire applied to eating (TESQ-E) for adolescents. Study 1 reports a four-step approach to develop the TESQ-E questionnaire (n = 1097). Study 2 was a cross-sectional survey among adolescents from nine European countries (n = 11,392) that assessed the TESQ-E, eating-related behaviors, dietary intake and background characteristics. In study 3, the TESQ-E was administered twice within four weeks to evaluate test-retest reliability (n = 140). Study 4 was a cross-sectional survey (n = 93) that assessed the TESQ-E and related psychological constructs (e.g., motivation, autonomy, self-control). All participants were aged between 10 and 17 years. Study 1 resulted in a 24-item questionnaire assessing adolescent-reported use of six specific strategies for healthy eating that represent three general self-regulation approaches. Study 2 showed that the easy-to-administer theory-based TESQ-E has a clear factor structure and good subscale reliabilities. The questionnaire was related to eating-related behaviors and dietary intake, indicating predictive validity. Study 3 showed good test-retest reliabilities for the TESQ-E. Study 4 indicated that TESQ-E was related to but also distinguishable from general self-regulation and motivation measures. The TESQ-E provides a reliable and valid measure to assess six theory-based self-regulation strategies that adolescents may use to ensure their healthy eating.
How do Residents of Recovery Houses Experience Confrontation between Entry and 12-Month Follow-Up?
Polcin, Douglas L.; Galloway, Gantt P.; Bond, Jason; Korcha, Rachael; Greenfield, Thomas K.
2010-01-01
The role of confrontation in recovery has been vigorously debated. Proponents suggest that confrontation can help break down denial and increase motivation. Critics point to counseling studies showing confrontation harms the therapeutic alliance and increases resistance. Frequently missing in these debates is an operational definition of confrontation that can be reliably measured. The Alcohol and Drug Confrontation Scale (ADCS) is a new 72-item measure that defines confrontation as “warnings about potential harm” that might result from substance use. Previous psychometric work using a sample of residents of recovery homes at intake (N=323) indicated the ADCS had acceptable reliability and validity. Confrontation from different sources (e.g., family, friends and professionals) was generally experienced as supportive and helpful. The goals of the current study were twofold: 1) to see if the psychometric properties of the ADCS were maintained at 6 and 12 month follow up, and 2) to see if experiences and perceptions of confrontation changed over time. Despite minor variations in the factor structure between baseline and follow up, the ADCS generally maintained good reliability and validity. At follow up, the amount of confrontation participants received declined, but it continued to be generally experienced as supportive and helpful. PMID:20464806
Assessing the reliability of ecotoxicological studies: An overview of current needs and approaches.
Moermond, Caroline; Beasley, Amy; Breton, Roger; Junghans, Marion; Laskowski, Ryszard; Solomon, Keith; Zahner, Holly
2017-07-01
In general, reliable studies are well designed and well performed, and enough details on study design and performance are reported to assess the study. For hazard and risk assessment in various legal frameworks, many different types of ecotoxicity studies need to be evaluated for reliability. These studies vary in study design, methodology, quality, and level of detail reported (e.g., reviews, peer-reviewed research papers, or industry-sponsored studies documented under Good Laboratory Practice [GLP] guidelines). Regulators have the responsibility to make sound and verifiable decisions and should evaluate each study for reliability in accordance with scientific principles regardless of whether they were conducted in accordance with GLP and/or standardized methods. Thus, a systematic and transparent approach is needed to evaluate studies for reliability. In this paper, 8 different methods for reliability assessment were compared using a number of attributes: categorical versus numerical scoring methods, use of exclusion and critical criteria, weighting of criteria, whether methods are tested with case studies, domain of applicability, bias toward GLP studies, incorporation of standard guidelines in the evaluation method, number of criteria used, type of criteria considered, and availability of guidance material. Finally, some considerations are given on how to choose a suitable method for assessing reliability of ecotoxicity studies. Integr Environ Assess Manag 2017;13:640-651. © 2016 The Authors. Integrated Environmental Assessment and Management published by Wiley Periodicals, Inc. on behalf of Society of Environmental Toxicology & Chemistry (SETAC). © 2016 The Authors. Integrated Environmental Assessment and Management published by Wiley Periodicals, Inc. on behalf of Society of Environmental Toxicology & Chemistry (SETAC).
Boterhoven de Haan, Katrina L; Hafekost, Jennifer; Lawrence, David; Sawyer, Michael G; Zubrick, Stephen R
2015-03-01
The General Functioning 12-item subscale (GF12) of The McMaster Family Assessment Device (FAD) has been validated as a single index measure to assess family functioning. This study reports on the reliability and validity of using only the six positive items from the General Functioning subscale (GF6+). Existing data from two Western Australian studies, the Raine Study (RS) and the Western Australian Child Health Survey (WACHS), was used to analyze the psychometric properties of the GF6+ subscale. The results demonstrated that the GF6+ subscale had virtually equivalent psychometric properties and was able to identify almost all of the same families who had healthy or unhealthy levels of functioning as the full GF12 subscale. In consideration of the constraints faced by large-scale population-based surveys, the findings of this study support the use of a GF6+ subscale from the FAD, as a quick and effective tool to assess the overall functioning of families. © 2014 Family Process Institute.
Manzoni, Gian Mauro; Rossi, Alessandro; Marazzi, Nicoletta; Agosti, Fiorenza; De Col, Alessandra; Pietrabissa, Giada; Castelnuovo, Gianluca; Molinari, Enrico; Sartorio, Allessandro
2018-01-01
Objective This study was aimed to examine the feasibility, validity, and reliability of the Italian Pediatric Quality of Life Inventory Multidimensional Fatigue Scale (PedsQL™ MFS) for adult inpatients with severe obesity. Methods 200 inpatients (81% females) with severe obesity (BMI ≥ 35 kg/m2) completed the PedsQL MFS (General Fatigue, Sleep/Rest Fatigue and Cognitive Fatigue domains), the Fatigue Severity Scale, and the Center for Epidemiologic Studies Depression Scale immediately after admission to a 3-week residential body weight reduction program. A randomized subsample of 48 patients re-completed the PedsQL MFS after 3 days. Results Confirmatory factor analysis showed that a modified hierarchical model with two items moved from the Sleep/Rest Fatigue domain to the General Fatigue domain and a second-order latent factor best fitted the data. Internal consistency and test-retest reliabilities were acceptable to high in all scales, and small to high statistically significant correlations were found with all convergent measures, with the exception of BMI. Significant floor effects were found in two scales (Cognitive Fatigue and Sleep/Rest Fatigue). Conclusion The Italian modified PedsQL MFS for adults showed to be a valid and reliable tool for the assessment of fatigue in inpatients with severe obesity. Future studies should assess its discriminant validity as well as its responsiveness to weight reduction. PMID:29402854
A Study of the Accuracy and Reliability of Articles about Alopecia in Newspapers.
Kim, Hyojin; Park, In Ho; Kim, Do Hyeong; Park, So Hee; Cho, Gyeong Je; Seol, Jung Eun
2018-06-01
There is growing interest in alopecia among the general population. Many people obtain information from easily accessible media rather than from doctors; thus, the media can play an important role in shaping public opinion. The goal of this study was to evaluate the content and reliability of newspaper articles on alopecia. Newspapers were categorized into three groups: one group of print newspapers and two groups of online newspapers. Online newspapers were further divided into two groups according to type of publishing company; one publishes both print and online newspapers and the other publishes online newspapers only. The most frequently subscribed or circulated newspaper in each group was selected. Articles containing information on alopecia were selected from 3 years of each newspaper and evaluated for reliability. Most articles in each group used the general term "alopecia" instead of naming a specific hair loss disease. The majority of articles were based on consultation with experts. Assessment of the accuracy of articles with three grade scales showed that the percentage with high accuracy was 38.9%, 47.2%, and 23.3%. Assessment of reliability scores for five selected articles in each group showed that there were statistically significant differences between common readers and dermatologists ( p <0.05). The results of this study suggest that closer monitoring of the media is required to supply easily accessible, balanced, and trustworthy information regarding alopecia.
Boonstra, Anne M; Schiphorst Preuper, Henrica R; Reneman, Michiel F; Posthumus, Jitze B; Stewart, Roy E
2008-06-01
To determine the reliability and concurrent validity of a visual analogue scale (VAS) for disability as a single-item instrument measuring disability in chronic pain patients was the objective of the study. For the reliability study a test-retest design and for the validity study a cross-sectional design was used. A general rehabilitation centre and a university rehabilitation centre was the setting for the study. The study population consisted of patients over 18 years of age, suffering from chronic musculoskeletal pain; 52 patients in the reliability study, 344 patients in the validity study. Main outcome measures were as follows. Reliability study: Spearman's correlation coefficients (rho values) of the test and retest data of the VAS for disability; validity study: rho values of the VAS disability scores with the scores on four domains of the Short-Form Health Survey (SF-36) and VAS pain scores, and with Roland-Morris Disability Questionnaire scores in chronic low back pain patients. Results were as follows: in the reliability study rho values varied from 0.60 to 0.77; and in the validity study rho values of VAS disability scores with SF-36 domain scores varied from 0.16 to 0.51, with Roland-Morris Disability Questionnaire scores from 0.38 to 0.43 and with VAS pain scores from 0.76 to 0.84. The conclusion of the study was that the reliability of the VAS for disability is moderate to good. Because of a weak correlation with other disability instruments and a strong correlation with the VAS for pain, however, its validity is questionable.
[Testing reliability and validity of reduced substitutes for leadership scales(rd-SLS)].
Kim, Jeong-Hee
2005-10-01
This paper was conducted to test the reliability and validity of rd-SLS, developed by Podsakoff, et al. (1993) which measured 'substitutes for leadership'. The subjects were 345 nurses in 5 general hospitals. Cronbach's and the Guttman split-half coefficient were used to test the reliability of rd-SLS. Factor analysis, and the correlations of the rv-SLS and SLS with rd-SLS were used for convergent and discriminant validity. Cronbach's data was 0.76 and the Guttman split-half coefficient was 0.52. Twelve factors evolved by factor analysis, which explained 70.4% of the total variance. This result was similar to previous study results. However, 'Indifference toward organizational rewards'-related items were classified two factors. It was not clear t hat the rd-SLS consisted of 13 concepts(factors). The correlations of the rv-SLS and SLS with the rd-SLS were 0.93 and 0.87 respectively. The rd-SLS showed a moderate degree of validity and reliability. Thus, it is recommended to use the rd-SLS in general nursing organizations for screening for leadership substitutes. In addition, it is necessary to clarify the concept of organizational rewards. In a further study, the factor structure of the rd-SLS may be considered.
Ohno, Shotaro; Takahashi, Kana; Inoue, Aimi; Takada, Koki; Ishihara, Yoshiaki; Tanigawa, Masaru; Hirao, Kazuki
2017-12-01
This study aims to examine the smallest detectable change (SDC) and test-retest reliability of the Center for Epidemiologic Studies Depression Scale (CES-D), General Self-Efficacy Scale (GSES), and 12-item General Health Questionnaire (GHQ-12). We tested 154 young adults at baseline and 2 weeks later. We calculated the intra-class correlation coefficients (ICCs) for test-retest reliability with a two-way random effects model for agreement. We then calculated the standard error of measurement (SEM) for agreement using the ICC formula. The SEM for agreement was used to calculate SDC values at the individual level (SDC ind ) and group level (SDC group ). The study participants included 137 young adults. The ICCs for all self-reported outcome measurement scales exceeded 0.70. The SEM of CES-D was 3.64, leading to an SDC ind of 10.10 points and SDC group of 0.86 points. The SEM of GSES was 1.56, leading to an SDC ind of 4.33 points and SDC group of 0.37 points. The SEM of GHQ-12 with bimodal scoring was 1.47, leading to an SDC ind of 4.06 points and SDC group of 0.35 points. The SEM of GHQ-12 with Likert scoring was 2.44, leading to an SDC ind of 6.76 points and SDC group of 0.58 points. To confirm that the change was not a result of measurement error, a score of self-reported outcome measurement scales would need to change by an amount greater than these SDC values. This has important implications for clinicians and epidemiologists when assessing outcomes. © 2017 John Wiley & Sons, Ltd.
Preliminary study of the reliability of imaging charge coupled devices
NASA Technical Reports Server (NTRS)
Beall, J. R.; Borenstein, M. D.; Homan, R. A.; Johnson, D. L.; Wilson, D. D.; Young, V. F.
1978-01-01
Imaging CCDs are capable of low light level response and high signal-to-noise ratios. In space applications they offer the user the ability to achieve extremely high resolution imaging with minimum circuitry in the photo sensor array. This work relates the CCD121H Fairchild device to the fundamentals of CCDs and the representative technologies. Several failure modes are described, construction is analyzed and test results are reported. In addition, the relationship of the device reliability to packaging principles is analyzed and test data presented. Finally, a test program is defined for more general reliability evaluation of CCDs.
FANTASTIC Lifestyle Assessment: Part 5 Measuring Lifestyle in Family Practice
Kason, Yvonne; Ylanko, Veli J.
1984-01-01
Family physicians generally agree that they should play an active role in disease prevention and health promotion. However, until recently no valid and reliable tool was available to help physicians clinically assess patients' lifestyle. The authors have studied the validity and reliability of a new five point-scale version of the FANTASTIC Lifestyle Assessment, used in a family practice. Also, the authors polled their patients on their opinions of their doctor assessing lifestyle. They found that the FANTASTIC was a reliable instrument, which their patients thought was useful and appropriate for their physician to be using. PMID:21279064
[Santa Claus is perceived as reliable and friendly: results of the Danish Christmas 2013 survey].
Amin, Faisal Mohammad; West, Anders Sode; Jørgensen, Carina Sleiborg; Simonsen, Sofie Amalie; Lindberg, Ulrich; Tranum-Jensen, Jørgen; Hougaard, Anders
2013-12-02
Several studies have indicated that the population in general perceives doctors as reliable. In the present study perceptions of reliability and kindness attributed to another socially significant archetype, Santa Claus, have been comparatively examined in relation to the doctor. In all, 52 randomly chosen participants were shown a film, where a narrator dressed either as Santa Claus or as a doctor tells an identical story. Structured interviews were then used to assess the subjects' perceptions of reliability and kindness in relation to the narrator's appearance. We found a strong inclination for Santa Claus being perceived as friendlier than the doctor (p = 0.053). However, there was no significant difference in the perception of reliability between Santa Claus and the doctor (p = 0.524). The positive associations attributed to Santa Claus probably cause that he is perceived friendlier than the doctor who may be associated with more serious and unpleasant memories of illness and suffering. Surprisingly, and despite him being an imaginary person, Santa Claus was assessed as being as reliable as the doctor.
General motor function assessment scale--reliability of a Norwegian version.
Langhammer, Birgitta; Lindmark, Birgitta
2014-01-01
The General Motor Function assessment scale (GMF) measures activity-related dependence, pain and insecurity among older people in frail health. The aim of the present study was to translate the GMF into a Norwegian version (N-GMF) and establish its reliability and clinical feasibility. The procedure used in translating the GMF was a forward and backward process, testing a convenience sample of 30 frail elderly people with it. The intra-rater reliability tests were performed by three physiotherapists, and the inter-reliability test was done by the same three plus nine independent colleagues. The statistical analyses were performed with a pairwise analysis for intra- and inter-rater reliability, using Cronbach's α, Percentage Agreement (PA), Svensson's rank transformable method and Cohen's κ. The Cronbach's α coefficients for the different subscales of N-GMF were 0.68 for Dependency, 0.73 for Pain and 0.75 for Insecurity. Intra-rater reliability: The variation in the PA for the total score was 40-70% in Dependence, 30-40% in Pain and 30-60% in Insecurity. The Relative Rank Variant (RV) indicated a modest individual bias and an augmented rank-order agreement coefficient ra of 0.96, 0.96 and 0.99, respectively. The variation in the κ statistics was 0.27-0.62 for Dependence, 0.17-0.35 for Pain and 0.13-0.47 for Insecurity. Inter-rater reliability: The PA between different testers in Dependence, Pain and Insecurity was 74%, 89% and 74%, respectively. The augmented rank-order agreement coefficients were: for Dependence r(a) = 0.97; for Pain, r(a) = 0.99; and for Insecurity, r(a) = 0.99. The N-GMF is a fairly reliable instrument for use with frail elderly people, with intra-rater and inter-rater reliability moderate in Dependence and slight to fair in Pain and Insecurity. The clinical usefulness was stressed in regard to its main focus, the frail elderly, and for communication within a multidisciplinary team. Implications for Rehabilitation The Norwegian-General Motor Function Assessment Scale (N-GMF) is a reliable instrument. The N-GMF is an instrument for screening and assessment of activity-related dependence, pain and insecurity in frail older people. The N-GMF may be used as a tool of communication in a multidisciplinary team.
Reliability and Validity of Prototype Diagnosis for Adolescent Psychopathology.
Haggerty, Greg; Zodan, Jennifer; Mehra, Ashwin; Zubair, Ayyan; Ghosh, Krishnendu; Siefert, Caleb J; Sinclair, Samuel J; DeFife, Jared
2016-04-01
The current study investigated the interrater reliability and validity of prototype ratings of 5 common adolescent psychiatric disorders: attention-deficit/hyperactivity disorder, conduct disorder, major depressive disorder, generalized anxiety disorder, and posttraumatic stress disorder. One hundred fifty-seven adolescent inpatient participants consented to participate in this study. We compared ratings from 2 inpatient clinicians, blinded to each other's ratings and patient measures, after their separate initial diagnostic interview to assess interrater reliability. Prototype ratings completed by clinicians after their initial diagnostic interview with adolescent inpatients and outpatients were compared with patient-reported behavior problems and parents' report of their child's behavioral problems. Prototype ratings demonstrated good interrater reliability. Clinicians' prototype ratings showed predicted relationships with patient-reported behavior problems and parent-reported behavior problems. Prototype matching seems to be a possible alternative for psychiatric diagnosis. Prototype ratings showed good interrater reliability based on clinicians unique experiences with the patient (as opposed to video-/audio-recorded material) with no training.
Application of the Modified Erikson Psychosocial Stage Inventory: 25 Years in Review.
Darling-Fisher, Cynthia S
2018-04-01
The Modified Erikson Psychosocial Stage Inventory (MEPSI) is an 80-item, comprehensive measure of psychosocial development based on Erikson's theory with published reliability and validity data. Although designed as a comprehensive measure, some researchers have used individual subscales for specific developmental stages as a measure; however, these subscale reliability scores have not been generally shared. This article reviewed the literature to evaluate the use of the MEPSI: the major research questions, samples/populations studied, and individual subscale and total reliability and validity data. In total, 16 research articles (1990-2011) and 28 Dissertations/Theses (1991-2016) from nursing, social work, psychology, criminal justice, and religious studies met criteria. Results support the MEPSI's global reliability (aggregate scores ranged .89-.99) and validity in terms of consistent patterns of changes observed in the predicted direction. Reliability and validity data for individual subscales were more variable. Limitations of the tool and recommendations for possible revision and future research are addressed.
Federal Register 2010, 2011, 2012, 2013, 2014
2013-12-05
..., RM13-14-000 and RM13-15-000] Monitoring System Conditions--Transmission Operations Reliability...) 502-6817, [email protected] . Robert T. Stroh (Legal Information), Office of the General... Reliability Standards ``address the important reliability goal of ensuring that the transmission system is...
Hallgren, Kevin A.; Greenfield, Brenna L.; Ladd, Benjamin O.
2016-01-01
Background Behavioral economic theories of drinking posit that the reinforcing value of engaging in activities with versus without alcohol influences drinking behavior. Measures of the reinforcement value of drugs and alcohol have been used in previous research, but little work has examined the psychometric properties of these measures. Objectives The present study aims to evaluate the factor structure, test-retest reliability, and concurrent validity of an alcohol-only version of the Adolescent Reinforcement Survey Schedule (ARSS-AUV). Methods A sample of 157 college student drinkers completed the ARSS-AUV at two time points 2–3 days apart. Test-retest reliability, hierarchical factor analysis, and correlations with other drinking measures were examined. Results Single, unidimensional general factors accounted for a majority of the variance in alcohol and alcohol-free reinforcement items. Residual factors emerged that typically represented alcohol or alcohol-free reinforcement while doing activities with friends, romantic or sexual partners, and family members. Individual ARSS-AUV items had fair-to-good test-retest reliability, while general and residual factors had excellent test-retest reliability. General alcohol reinforcement and alcohol reinforcement from friends and romantic partners were positively correlated with past-year alcohol consumption, heaviest drinking episode, and alcohol-related negative consequences. Alcohol-free reinforcement indices were unrelated to alcohol use or consequences. Conclusions/Importance The ARSS-AUV appears to demonstrate good reliability and mixed concurrent validity among college student drinkers. The instrument may provide useful information about alcohol reinforcement from various activities and people and could provide clinically-relevant information for prevention and treatment programs. PMID:27096713
Hallgren, Kevin A; Greenfield, Brenna L; Ladd, Benjamin O
2016-06-06
Behavioral economic theories of drinking posit that the reinforcing value of engaging in activities with versus without alcohol influences drinking behavior. Measures of the reinforcement value of drugs and alcohol have been used in previous research, but little work has examined the psychometric properties of these measures. The present study aims to evaluate the factor structure, test-retest reliability, and concurrent validity of an alcohol-only version of the Adolescent Reinforcement Survey Schedule (ARSS-AUV). A sample of 157 college student drinkers completed the ARSS-AUV at two time points 2-3 days apart. Test-retest reliability, hierarchical factor analysis, and correlations with other drinking measures were examined. Single, unidimensional general factors accounted for a majority of the variance in alcohol and alcohol-free reinforcement items. Residual factors emerged that typically represented alcohol or alcohol-free reinforcement while doing activities with friends, romantic or sexual partners, and family members. Individual ARSS-AUV items had fair-to-good test-retest reliability, while general and residual factors had excellent test-retest reliability. General alcohol reinforcement and alcohol reinforcement from friends and romantic partners were positively correlated with past-year alcohol consumption, heaviest drinking episode, and alcohol-related negative consequences. Alcohol-free reinforcement indices were unrelated to alcohol use or consequences. The ARSS-AUV appears to demonstrate good reliability and mixed concurrent validity among college student drinkers. The instrument may provide useful information about alcohol reinforcement from various activities and people and could provide clinically-relevant information for prevention and treatment programs.
Chuang, Li-Ling; Chuang, Yu-Fen; Hsu, Miao-Ju; Huang, Ying-Zu; Wong, Alice M K; Chang, Ya-Ju
2018-01-01
Fatigue is a common symptom in the general population and has a substantial effect on individuals' quality of life. The Multidimensional Fatigue Inventory (MFI) has been widely used to quantify the impact of fatigue, but no Traditional Chinese translation has yet been validated. The goal of this study was to translate the MFI from English into Traditional Chinese ('the MFI-TC') and subsequently to examine its validity and reliability. The study recruited a convenience sample of 123 people from various age groups in Taiwan. The MFI was examined using a two-step process: (1) translation and back-translation of the instrument; and (2) examination of construct validity, convergent validity, internal consistency, test-retest reliability, and measurement error. The validity and reliability of the MFI-TC were assessed by factor analysis, Spearman rho correlation coefficient, Cronbach's alpha coefficient, intraclass correlation coefficient (ICC), minimal detectable change (MDC), and Bland-Altman analysis. All participants completed the Short-Form-36 Health Survey Taiwan Form (SF-36-T) and the Chinese version of the Pittsburgh Sleep Quality Index (PSQI) concurrently to test the convergent validity of the MFI-TC. Test-retest reliability was assessed by readministration of the MFI-TC after a 1-week interval. Factor analysis confirmed the four dimensions of fatigue: general/physical fatigue, reduced activity, reduced motivation, and mental fatigue. A four-factor model was extracted, combining general fatigue and physical fatigue as one factor. The results demonstrated moderate convergent validity when correlating fatigue (MFI-TC) with quality of life (SF-36-T) and sleep disturbances (PSQI) (Spearman's rho = 0.68 and 0.47, respectively). Cronbach's alpha for the MFI-TC total scale and subscales ranged from 0.73 (mental fatigue subscale) to 0.92 (MFI-TC total scale). ICCs ranged from 0.85 (reduced motivation) to 0.94 (MFI-TC total scale), and the MDC ranged from 2.33 points (mental fatigue) to 9.5 points (MFI-TC total scale). The Bland-Altman analyses showed no significant systematic bias between the repeated assessments. The results support the use of the Traditional Chinese version of the MFI as a comprehensive instrument for measuring specific aspects of fatigue. Clinicians and researchers should consider interpreting general fatigue and physical fatigue as one subscale when measuring fatigue in Traditional Chinese-speaking populations.
Chuang, Li-Ling; Chuang, Yu-Fen; Hsu, Miao-Ju; Huang, Ying-Zu; Wong, Alice M. K.
2018-01-01
Background Fatigue is a common symptom in the general population and has a substantial effect on individuals’ quality of life. The Multidimensional Fatigue Inventory (MFI) has been widely used to quantify the impact of fatigue, but no Traditional Chinese translation has yet been validated. The goal of this study was to translate the MFI from English into Traditional Chinese (‘the MFI-TC’) and subsequently to examine its validity and reliability. Methods The study recruited a convenience sample of 123 people from various age groups in Taiwan. The MFI was examined using a two-step process: (1) translation and back-translation of the instrument; and (2) examination of construct validity, convergent validity, internal consistency, test-retest reliability, and measurement error. The validity and reliability of the MFI-TC were assessed by factor analysis, Spearman rho correlation coefficient, Cronbach’s alpha coefficient, intraclass correlation coefficient (ICC), minimal detectable change (MDC), and Bland-Altman analysis. All participants completed the Short-Form-36 Health Survey Taiwan Form (SF-36-T) and the Chinese version of the Pittsburgh Sleep Quality Index (PSQI) concurrently to test the convergent validity of the MFI-TC. Test-retest reliability was assessed by readministration of the MFI-TC after a 1-week interval. Results Factor analysis confirmed the four dimensions of fatigue: general/physical fatigue, reduced activity, reduced motivation, and mental fatigue. A four-factor model was extracted, combining general fatigue and physical fatigue as one factor. The results demonstrated moderate convergent validity when correlating fatigue (MFI-TC) with quality of life (SF-36-T) and sleep disturbances (PSQI) (Spearman's rho = 0.68 and 0.47, respectively). Cronbach’s alpha for the MFI-TC total scale and subscales ranged from 0.73 (mental fatigue subscale) to 0.92 (MFI-TC total scale). ICCs ranged from 0.85 (reduced motivation) to 0.94 (MFI-TC total scale), and the MDC ranged from 2.33 points (mental fatigue) to 9.5 points (MFI-TC total scale). The Bland-Altman analyses showed no significant systematic bias between the repeated assessments. Conclusions The results support the use of the Traditional Chinese version of the MFI as a comprehensive instrument for measuring specific aspects of fatigue. Clinicians and researchers should consider interpreting general fatigue and physical fatigue as one subscale when measuring fatigue in Traditional Chinese-speaking populations. PMID:29746466
Versey, Nathan G; Gore, Christopher J; Halson, Shona L; Plowman, Jamie S; Dawson, Brian T
2011-09-01
We determined the validity and reliability of heat flow thermistors, flexible thermocouple probes and general purpose thermistors compared with a calibrated reference thermometer in a stirred water bath. Validity (bias) was defined as the difference between the observed and criterion values, and reliability as the repeatability (standard deviation or typical error) of measurement. Data were logged every 5 s for 10 min at water temperatures of 14, 26 and 38 °C for ten heat flow thermistors and 24 general purpose thermistors, and at 35, 38 and 41 °C for eight flexible thermocouple probes. Statistical analyses were conducted using spreadsheets for validity and reliability, where an acceptable bias was set at ±0.1 °C. None of the heat flow thermistors, 17% of the flexible thermocouple probes and 71% of the general purpose thermistors met the validity criterion for temperature. The inter-probe reliabilities were 0.03 °C for heat flow thermistors, 0.04 °C for flexible thermocouple probes and 0.09 °C for general purpose thermistors. The within trial intra-probe reliability of all three temperature probes was 0.01 °C. The results suggest that these temperature sensors should be calibrated individually before use at relevant temperatures and the raw data corrected using individual linear regression equations.
Great apes are sensitive to prior reliability of an informant in a gaze following task.
Schmid, Benjamin; Karg, Katja; Perner, Josef; Tomasello, Michael
2017-01-01
Social animals frequently rely on information from other individuals. This can be costly in case the other individual is mistaken or even deceptive. Human infants below 4 years of age show proficiency in their reliance on differently reliable informants. They can infer the reliability of an informant from few interactions and use that assessment in later interactions with the same informant in a different context. To explore whether great apes share that ability, in our study we confronted great apes with a reliable or unreliable informant in an object choice task, to see whether that would in a subsequent task affect their gaze following behaviour in response to the same informant. In our study, prior reliability of the informant and habituation during the gaze following task affected both great apes' automatic gaze following response and their more deliberate response of gaze following behind barriers. As habituation is very context specific, it is unlikely that habituation in the reliability task affected the gaze following task. Rather it seems that apes employ a reliability tracking strategy that results in a general avoidance of additional information from an unreliable informant.
Nissan, Michael E; Gupta, Amar; Rayess, Hani; Black, Kevin Z; Carron, Michael
2018-02-01
Physicians should be aware of both websites and videos available online regarding the otoplasty procedure to provide quality care. This study systematically analyzes the authorships, reliability, quality, and readability of the websites, as well as the authorships and primary objectives of the videos regarding otoplasty. Validated instruments were used to analyze the reliability, quality, and readability of websites, and videos were systematically categorized and analyzed. A Google search was conducted, and the first five pages of results were included in this study. After excluding unrelated websites, the remaining 44 websites were categorized by authorship (physician, patient, academic, or unaffiliated) and were analyzed using the validated DISCERN instrument for reliability and quality, as well as various other validated instruments to measure readability. A YouTube search was also conducted, and the first 50 relevant videos were included in the study. These videos were categorized by authorship and their primary objective. Website authorships were physician-dominated. Reliability, quality, and overall DISCERN score differ between the four authorship groups by a statistically significant margin (Kruskall-Wallis test, p < 0.05). Unaffiliated websites were the most reliable, and physician websites were the least reliable. Academic websites were of the highest quality, and patient websites were of the lowest quality. Readability did not differ significantly between the groups, though the readability measurements made showed a general lack of material easily readable by the general public. YouTube was likewise dominated by physician-authored videos. While the physician-authored videos sought mainly to inform and to advertise, patient-authored videos sought mainly to provide the patient's perspective. Academic organizations showed very little representation on YouTube, and the YouTube views on otoplasty videos were dominated by the top 20 videos, which represented over 93% of the total views of videos included in this study. Thieme Medical Publishers 333 Seventh Avenue, New York, NY 10001, USA.
Sorsdahl, Katherine; Stein, Dan J; Myers, Bronwyn
2017-04-01
The Social Problem Solving Inventory-Revised Short-Form (SPSI-R:SF) has been used in several countries to identify problem-solving deficits among clinical and general populations in order to guide cognitive-behavioural interventions. Yet, very few studies have evaluated its psychometric properties. Three language versions of the questionnaire were administered to a general population sample comprising 1000 participants (771 English-, 178 Afrikaans- and 101 Xhosa-speakers). Of these participants, 210 were randomly selected to establish test-retest reliability (70 in each language). Principal component analysis was performed to examine the applicability of the factor structure of the original questionnaire to the South African data. Supplementary psychometric analyses were performed, including internal consistency and test-retest reliability. Collectively, results provide initial evidence of the reliability and validity of the SPSI-R:SF for the assessment of problem solving deficits in South Africa. Further studies that explore how the Afrikaans language version of the SPSI-R:SF can be improved and that establish the predictive validity of scores on the SPSI-R:SF are needed. © 2015 International Union of Psychological Science.
Buchan, Jena; Janda, Monika; Box, Robyn; Rogers, Laura; Hayes, Sandi
2015-03-18
No tool exists to measure self-efficacy for overcoming lymphedema-related exercise barriers in individuals with cancer-related lymphedema. However, an existing scale measures confidence to overcome general exercise barriers in cancer survivors. Therefore, the purpose of this study was to develop, validate and assess the reliability of a subscale, to be used in conjunction with the general barriers scale, for determining exercise barriers self-efficacy in individuals facing lymphedema-related exercise barriers. A lymphedema-specific exercise barriers self-efficacy subscale was developed and validated using a cohort of 106 cancer survivors with cancer-related lymphedema, from Brisbane, Australia. An initial ten-item lymphedema-specific barrier subscale was developed and tested, with participant feedback and principal components analysis results used to guide development of the final version. Validity and test-retest reliability analyses were conducted on the final subscale. The final lymphedema-specific subscale contained five items. Principal components analysis revealed these items loaded highly (>0.75) on a separate factor when tested with a well-established nine-item general barriers scale. The final five-item subscale demonstrated good construct and criterion validity, high internal consistency (Cronbach's alpha = 0.93) and test-retest reliability (ICC = 0.67, p < 0.01). A valid and reliable lymphedema-specific subscale has been developed to assess exercise barriers self-efficacy in individuals with cancer-related lymphedema. This scale can be used in conjunction with an existing general exercise barriers scale to enhance exercise adherence in this understudied patient group.
DiCesare, Christopher A.; Bates, Nathaniel A.; Barber Foss, Kim D.; Thomas, Staci M.; Wordeman, Samuel C.; Sugimoto, Dai; Roewer, Benjamin D.; Medina McKeon, Jennifer M.; Di Stasi, Stephanie; Noehren, Brian W.; Ford, Kevin R.; Kiefer, Adam W.; Hewett, Timothy E.; Myer, Gregory D.
2015-01-01
Background: Anterior cruciate ligament (ACL) injuries are physically and financially devastating but affect a relatively small percentage of the population. Prospective identification of risk factors for ACL injury necessitates a large sample size; therefore, study of this injury would benefit from a multicenter approach. Purpose: To determine the reliability of kinematic and kinetic measures of a single-leg cross drop task across 3 institutions. Study Design: Controlled laboratory study. Methods: Twenty-five female high school volleyball players participated in this study. Three-dimensional motion data of each participant performing the single-leg cross drop were collected at 3 institutions over a period of 4 weeks. Coefficients of multiple correlation were calculated to assess the reliability of kinematic and kinetic measures during the landing phase of the movement. Results: Between-centers reliability for kinematic waveforms in the frontal and sagittal planes was good, but moderate in the transverse plane. Between-centers reliability for kinetic waveforms was good in the sagittal, frontal, and transverse planes. Conclusion: Based on these findings, the single-leg cross drop task has moderate to good reliability of kinematic and kinetic measures across institutions after implementation of a standardized testing protocol. Clinical Relevance: Multicenter collaborations can increase study numbers and generalize results, which is beneficial for studies of relatively rare phenomena, such as ACL injury. An important step is to determine the reliability of risk assessments across institutions before a multicenter collaboration can be initiated. PMID:26779550
Aerts, Frank; Carrier, Kathy; Alwood, Becky
2016-01-01
Background: The assessment of clinical manifestation of muscle fatigue is an effective procedure in establishing therapeutic exercise dose. Few studies have evaluated physical therapist reliability in establishing muscle fatigue through detection of changes in quality of movement patterns in a live setting. Objective: The purpose of this study is to evaluate the inter-rater reliability of physical therapists’ ability to detect altered movement patterns due to muscle fatigue. Design: A reliability study in a live setting with multiple raters. Participants: Forty-four healthy individuals (ages 19-35) were evaluated by six physical therapists in a live setting. Methods: Participants were evaluated by physical therapists for altered movement patterns during resisted shoulder rotation. Each participant completed a total of four tests: right shoulder internal rotation, right shoulder external rotation, left shoulder internal rotation and left shoulder external rotation. Results: For all tests combined, the inter-rater reliability for a single rater scoring ICC (2,1) was .65 (95%, .60, .71) This corresponds to moderate inter-rater reliability between physical therapists. Limitations: The results of this study apply only to healthy participants and therefore cannot be generalized to a symptomatic population. Conclusion: Moderate inter-rater reliability was found between physical therapists in establishing muscle fatigue through the observation of sustained altered movement patterns during dynamic resistive shoulder internal and external rotation. PMID:27347241
An evaluation of general practice websites in the UK.
Howitt, Alistair; Clement, Sarah; de Lusignan, Simon; Thiru, Krish; Goodwin, Daryl; Wells, Sally
2002-10-01
General practice websites are an emerging phenomenon, but there have been few critical evaluations of their content. Previously developed rating instruments to assess medical websites have been criticized for failing to report their reliability and validity. The purpose of this study was to develop a rating instrument for assessing UK general practice websites, and then to evaluate them critically. The STaRNet Website Assessment Tool (SWAT) was developed listing criteria that general practice websites may meet, which was then used to evaluate a random sample of websites drawn from an electronic database. A second assessor rated a subsample of the sites to assess the tool's inter-rater reliability. The setting was an information technology group of a general practice research network using a random sample of 108 websites identified from the database. The main outcome measures were identification of rating criteria and frequency counts from the website rating instrument. Ninety (93.3%) sites were accessible, of which 84 were UK general practice websites. Criteria most frequently met were those describing the scope of the website and their functionality. Apart from e-mail to practices, criteria related to electronic communication were rarely met. Criteria relating to the quality of information were least often met. Inter-rater reliability kappa values for the items in the tool ranged from -0.06 to 1.0 (mean 0.59). Values were >0.6 for 15 out of 25 criteria assessed in 40 sites which were rated by two assessors. General practice websites offer a wide range of information. They are technically satisfactory, but do not exploit fully the potential for electronic doctor-patient communication. The quality of information they provide is poor. The instrument may be developed as a template for general practices producing or revising their own websites.
Piqueras, Jose A; Martín-Vivar, María; Sandin, Bonifacio; San Luis, Concepción; Pineda, David
2017-08-15
Anxiety and depression are among the most common mental disorders during childhood and adolescence. Among the instruments for the brief screening assessment of symptoms of anxiety and depression, the Revised Child Anxiety and Depression Scale (RCADS) is one of the more widely used. Previous studies have demonstrated the reliability of the RCADS for different assessment settings and different versions. The aims of this study were to examine the mean reliability of the RCADS and the influence of the moderators on the RCADS reliability. We searched in EBSCO, PsycINFO, Google Scholar, Web of Science, and NCBI databases and other articles manually from lists of references of extracted articles. A total of 146 studies were included in our meta-analysis. The RCADS showed robust internal consistency reliability in different assessment settings, countries, and languages. We only found that reliability of the RCADS was significantly moderated by the version of RCADS. However, these differences in reliability between different versions of the RCADS were slight and can be due to the number of items. We did not examine factor structure, factorial invariance across gender, age, or country, and test-retest reliability of the RCADS. The RCADS is a reliable instrument for cross-cultural use, with the advantage of providing more information with a low number of items in the assessment of both anxiety and depression symptoms in children and adolescents. Copyright © 2017. Published by Elsevier B.V.
Cobb, Stephen C; Joshi, Mukta N; Pomeroy, Robin L
2016-12-01
In-vitro and invasive in-vivo studies have reported relatively independent motion in the medial and lateral forefoot segments during gait. However, most current surface-based models have not defined medial and lateral forefoot or midfoot segments. The purpose of the current study was to determine the reliability of a 7-segment foot model that includes medial and lateral midfoot and forefoot segments during walking gait. Three-dimensional positions of marker clusters located on the leg and 6 foot segments were tracked as 10 participants completed 5 walking trials. To examine the reliability of the foot model, coefficients of multiple correlation (CMC) were calculated across the trials for each participant. Three-dimensional stance time series and range of motion (ROM) during stance were also calculated for each functional articulation. CMCs for all of the functional articulations were ≥ 0.80. Overall, the rearfoot complex (leg-calcaneus segments) was the most reliable articulation and the medial midfoot complex (calcaneus-navicular segments) was the least reliable. With respect to ROM, reliability was greatest for plantarflexion/dorsiflexion and least for abduction/adduction. Further, the stance ROM and time-series patterns results between the current study and previous invasive in-vivo studies that have assessed actual bone motion were generally consistent.
NASA Technical Reports Server (NTRS)
Wallace, Dolores R.
2003-01-01
In FY01 we learned that hardware reliability models need substantial changes to account for differences in software, thus making software reliability measurements more effective, accurate, and easier to apply. These reliability models are generally based on familiar distributions or parametric methods. An obvious question is 'What new statistical and probability models can be developed using non-parametric and distribution-free methods instead of the traditional parametric method?" Two approaches to software reliability engineering appear somewhat promising. The first study, begin in FY01, is based in hardware reliability, a very well established science that has many aspects that can be applied to software. This research effort has investigated mathematical aspects of hardware reliability and has identified those applicable to software. Currently the research effort is applying and testing these approaches to software reliability measurement, These parametric models require much project data that may be difficult to apply and interpret. Projects at GSFC are often complex in both technology and schedules. Assessing and estimating reliability of the final system is extremely difficult when various subsystems are tested and completed long before others. Parametric and distribution free techniques may offer a new and accurate way of modeling failure time and other project data to provide earlier and more accurate estimates of system reliability.
Computer-Aided Reliability Estimation
NASA Technical Reports Server (NTRS)
Bavuso, S. J.; Stiffler, J. J.; Bryant, L. A.; Petersen, P. L.
1986-01-01
CARE III (Computer-Aided Reliability Estimation, Third Generation) helps estimate reliability of complex, redundant, fault-tolerant systems. Program specifically designed for evaluation of fault-tolerant avionics systems. However, CARE III general enough for use in evaluation of other systems as well.
Saito, Rintaro; Suzuki, Harukazu; Hayashizaki, Yoshihide
2003-04-12
Recent screening techniques have made large amounts of protein-protein interaction data available, from which biologically important information such as the function of uncharacterized proteins, the existence of novel protein complexes, and novel signal-transduction pathways can be discovered. However, experimental data on protein interactions contain many false positives, making these discoveries difficult. Therefore computational methods of assessing the reliability of each candidate protein-protein interaction are urgently needed. We developed a new 'interaction generality' measure (IG2) to assess the reliability of protein-protein interactions using only the topological properties of their interaction-network structure. Using yeast protein-protein interaction data, we showed that reliable protein-protein interactions had significantly lower IG2 values than less-reliable interactions, suggesting that IG2 values can be used to evaluate and filter interaction data to enable the construction of reliable protein-protein interaction networks.
Spaan, Suzanne; Pronk, Anjoeka; Koch, Holger M; Jusko, Todd A; Jaddoe, Vincent W V; Shaw, Pamela A; Tiemeier, Henning M; Hofman, Albert; Pierik, Frank H; Longnecker, Matthew P
2015-05-01
The widespread use of organophosphate (OP) pesticides has resulted in ubiquitous exposure in humans, primarily through their diet. Exposure to OP pesticides may have adverse health effects, including neurobehavioral deficits in children. The optimal design of new studies requires data on the reliability of urinary measures of exposure. In the present study, urinary concentrations of six dialkyl phosphate (DAP) metabolites, the main urinary metabolites of OP pesticides, were determined in 120 pregnant women participating in the Generation R Study in Rotterdam. Intra-class correlation coefficients (ICCs) across serial urine specimens taken at <18, 18-25, and >25 weeks of pregnancy were determined to assess reliability. Geometric mean total DAP metabolite concentrations were 229 (GSD 2.2), 240 (GSD 2.1), and 224 (GSD 2.2) nmol/g creatinine across the three periods of gestation. Metabolite concentrations from the serial urine specimens in general correlated moderately. The ICCs for the six DAP metabolites ranged from 0.14 to 0.38 (0.30 for total DAPs), indicating weak to moderate reliability. Although the DAP metabolite levels observed in this study are slightly higher and slightly more correlated than in previous studies, the low to moderate reliability indicates a high degree of within-person variability, which presents challenges for designing well-powered epidemiological studies.
DiCesare, Christopher A; Bates, Nathaniel A; Barber Foss, Kim D; Thomas, Staci M; Wordeman, Samuel C; Sugimoto, Dai; Roewer, Benjamin D; Medina McKeon, Jennifer M; Di Stasi, Stephanie; Noehren, Brian W; Ford, Kevin R; Kiefer, Adam W; Hewett, Timothy E; Myer, Gregory D
2015-12-01
Anterior cruciate ligament (ACL) injuries are physically and financially devastating but affect a relatively small percentage of the population. Prospective identification of risk factors for ACL injury necessitates a large sample size; therefore, study of this injury would benefit from a multicenter approach. To determine the reliability of kinematic and kinetic measures of a single-leg cross drop task across 3 institutions. Controlled laboratory study. Twenty-five female high school volleyball players participated in this study. Three-dimensional motion data of each participant performing the single-leg cross drop were collected at 3 institutions over a period of 4 weeks. Coefficients of multiple correlation were calculated to assess the reliability of kinematic and kinetic measures during the landing phase of the movement. Between-centers reliability for kinematic waveforms in the frontal and sagittal planes was good, but moderate in the transverse plane. Between-centers reliability for kinetic waveforms was good in the sagittal, frontal, and transverse planes. Based on these findings, the single-leg cross drop task has moderate to good reliability of kinematic and kinetic measures across institutions after implementation of a standardized testing protocol. Multicenter collaborations can increase study numbers and generalize results, which is beneficial for studies of relatively rare phenomena, such as ACL injury. An important step is to determine the reliability of risk assessments across institutions before a multicenter collaboration can be initiated.
Fatehi, Zahra; Baradaran, Hamid Reza; Asadpour, Mohamad; Rezaeian, Mohsen
2017-01-01
Background: Individuals' listening styles differs based on their characters, professions and situations. This study aimed to assess the validity and reliability of Listening Styles Profile- Revised (LSP- R) in Iranian students. Methods: After translating into Persian, LSP-R was employed in a sample of 240 medical and nursing Persian speaking students in Iran. Statistical analysis was performed to test the reliability and validity of the LSP-R. Results: The study revealed high internal consistency and good test-retest reliability for the Persian version of the questionnaire. The Cronbach's alpha coefficient was 0.72 and intra-class correlation coefficient 0.87. The means for the content validity index and the content validity ratio (CVR) were 0.90 and 0.83, respectively. Exploratory factor analysis (EFA) yielded a four-factor solution accounted for 60.8% of the observed variance. Majority of medical students (73%) as well as majority of nursing students (70%) stated that their listening styles were task-oriented. Conclusion: In general, the study finding suggests that the Persian version of LSP-R is a valid and reliable instrument for assessing listening styles profile in the studied sample.
Lievaart, Marien; Franken, Ingmar H A; Hovens, Johannes E
2016-03-01
The most commonly used instrument for measuring anger is the State-Trait Anger Expression Inventory-2 (STAXI-2; Spielberger, 1999). This study further examines the validity of the STAXI-2 and compares anger scores between several clinical and nonclinical samples. Reliability, concurrent, and construct validity were investigated in Dutch undergraduate students (N = 764), a general population sample (N = 1211), and psychiatric outpatients (N = 226). The results support the reliability and validity of the STAXI-2. Concurrent validity was strong, with meaningful correlations between the STAXI-2 scales and anger-related constructs in both clinical and nonclinical samples. Importantly, patients showed higher experience and expression of anger than the general population sample. Additionally, forensic outpatients with addiction problems reported higher Anger Expression-Out than general psychiatric outpatients. Our conclusion is that the STAXI-2 is a suitable instrument to measure both the experience and the expression of anger in both general and clinical populations. © 2016 Wiley Periodicals, Inc.
Gomez, Rapson; Watson, Shaun D
2017-01-01
For the Social Phobia Scale (SPS) and the Social Interaction Anxiety Scale (SIAS) together, this study examined support for a bifactor model, and also the internal consistency reliability and external validity of the factors in this model. Participants ( N = 526) were adults from the general community who completed the SPS and SIAS. Confirmatory factor analysis (CFA) of their ratings indicated good support for the bifactor model. For this model, the loadings for all but six items were higher on the general factor than the specific factors. The three positively worded items had negligible loadings on the general factor. The general factor explained most of the common variance in the SPS and SIAS, and demonstrated good model-based internal consistency reliability (omega hierarchical) and a strong association with fear of negative evaluation and extraversion. The practical implications of the findings for the utilization of the SPS and SIAS, and the theoretical and clinical implications for social anxiety are discussed.
Gomez, Rapson; Watson, Shaun D.
2017-01-01
For the Social Phobia Scale (SPS) and the Social Interaction Anxiety Scale (SIAS) together, this study examined support for a bifactor model, and also the internal consistency reliability and external validity of the factors in this model. Participants (N = 526) were adults from the general community who completed the SPS and SIAS. Confirmatory factor analysis (CFA) of their ratings indicated good support for the bifactor model. For this model, the loadings for all but six items were higher on the general factor than the specific factors. The three positively worded items had negligible loadings on the general factor. The general factor explained most of the common variance in the SPS and SIAS, and demonstrated good model-based internal consistency reliability (omega hierarchical) and a strong association with fear of negative evaluation and extraversion. The practical implications of the findings for the utilization of the SPS and SIAS, and the theoretical and clinical implications for social anxiety are discussed. PMID:28210232
A General Reliability Model for Ni-BaTiO3-Based Multilayer Ceramic Capacitors
NASA Technical Reports Server (NTRS)
Liu, Donhang
2014-01-01
The evaluation of multilayer ceramic capacitors (MLCCs) with Ni electrode and BaTiO3 dielectric material for potential space project applications requires an in-depth understanding of their reliability. A general reliability model for Ni-BaTiO3 MLCC is developed and discussed. The model consists of three parts: a statistical distribution; an acceleration function that describes how a capacitor's reliability life responds to the external stresses, and an empirical function that defines contribution of the structural and constructional characteristics of a multilayer capacitor device, such as the number of dielectric layers N, dielectric thickness d, average grain size, and capacitor chip size A. Application examples are also discussed based on the proposed reliability model for Ni-BaTiO3 MLCCs.
A General Reliability Model for Ni-BaTiO3-Based Multilayer Ceramic Capacitors
NASA Technical Reports Server (NTRS)
Liu, Donhang
2014-01-01
The evaluation for potential space project applications of multilayer ceramic capacitors (MLCCs) with Ni electrode and BaTiO3 dielectric material requires an in-depth understanding of the MLCCs reliability. A general reliability model for Ni-BaTiO3 MLCCs is developed and discussed in this paper. The model consists of three parts: a statistical distribution; an acceleration function that describes how a capacitors reliability life responds to external stresses; and an empirical function that defines the contribution of the structural and constructional characteristics of a multilayer capacitor device, such as the number of dielectric layers N, dielectric thickness d, average grain size r, and capacitor chip size A. Application examples are also discussed based on the proposed reliability model for Ni-BaTiO3 MLCCs.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mays, S.E.; Poloski, J.P.; Sullivan, W.H.
1982-07-01
This report describes a risk study of the Browns Ferry, Unit 1, nuclear plant. The study is one of four such studies sponsored by the NRC Office of Research, Division of Risk Assessment, as part of its Interim Reliability Evaluation Program (IREP), Phase II. This report is contained in four volumes: a main report and three appendixes. Appendix C generally describes the methods used to estimate accident sequence frequency values. Information is presented concerning the approach, example collection, failure data, candidate dominant sequences, uncertainty analysis, and sensitivity analysis.
Reliability of videotaped observational gait analysis in patients with orthopedic impairments
Brunnekreef, Jaap J; van Uden, Caro JT; van Moorsel, Steven; Kooloos, Jan GM
2005-01-01
Background In clinical practice, visual gait observation is often used to determine gait disorders and to evaluate treatment. Several reliability studies on observational gait analysis have been described in the literature and generally showed moderate reliability. However, patients with orthopedic disorders have received little attention. The objective of this study is to determine the reliability levels of visual observation of gait in patients with orthopedic disorders. Methods The gait of thirty patients referred to a physical therapist for gait treatment was videotaped. Ten raters, 4 experienced, 4 inexperienced and 2 experts, individually evaluated these videotaped gait patterns of the patients twice, by using a structured gait analysis form. Reliability levels were established by calculating the Intraclass Correlation Coefficient (ICC), using a two-way random design and based on absolute agreement. Results The inter-rater reliability among experienced raters (ICC = 0.42; 95%CI: 0.38–0.46) was comparable to that of the inexperienced raters (ICC = 0.40; 95%CI: 0.36–0.44). The expert raters reached a higher inter-rater reliability level (ICC = 0.54; 95%CI: 0.48–0.60). The average intra-rater reliability of the experienced raters was 0.63 (ICCs ranging from 0.57 to 0.70). The inexperienced raters reached an average intra-rater reliability of 0.57 (ICCs ranging from 0.52 to 0.62). The two expert raters attained ICC values of 0.70 and 0.74 respectively. Conclusion Structured visual gait observation by use of a gait analysis form as described in this study was found to be moderately reliable. Clinical experience appears to increase the reliability of visual gait analysis. PMID:15774012
ERIC Educational Resources Information Center
Onwuegbuzie, Anthony J.; Daniel, Larry G.
The purposes of this paper are to identify common errors made by researchers when dealing with reliability coefficients and to outline best practices for reporting and interpreting reliability coefficients. Common errors that researchers make are: (1) stating that the instruments are reliable; (2) incorrectly interpreting correlation coefficients;…
ERIC Educational Resources Information Center
Helms, LuAnn Sherbeck
This paper discusses the fact that reliability is about scores and not tests and how reliability limits effect sizes. The paper also explores the classical reliability coefficients of stability, equivalence, and internal consistency. Stability is concerned with how stable test scores will be over time, while equivalence addresses the relationship…
NASA Astrophysics Data System (ADS)
Liu, Yiming; Shi, Yimin; Bai, Xuchao; Zhan, Pei
2018-01-01
In this paper, we study the estimation for the reliability of a multicomponent system, named N- M-cold-standby redundancy system, based on progressive Type-II censoring sample. In the system, there are N subsystems consisting of M statistically independent distributed strength components, and only one of these subsystems works under the impact of stresses at a time and the others remain as standbys. Whenever the working subsystem fails, one from the standbys takes its place. The system fails when the entire subsystems fail. It is supposed that the underlying distributions of random strength and stress both belong to the generalized half-logistic distribution with different shape parameter. The reliability of the system is estimated by using both classical and Bayesian statistical inference. Uniformly minimum variance unbiased estimator and maximum likelihood estimator for the reliability of the system are derived. Under squared error loss function, the exact expression of the Bayes estimator for the reliability of the system is developed by using the Gauss hypergeometric function. The asymptotic confidence interval and corresponding coverage probabilities are derived based on both the Fisher and the observed information matrices. The approximate highest probability density credible interval is constructed by using Monte Carlo method. Monte Carlo simulations are performed to compare the performances of the proposed reliability estimators. A real data set is also analyzed for an illustration of the findings.
Lange, Toni; Matthijs, Omer; Jain, Nitin B; Schmitt, Jochen; Lützner, Jörg; Kopkow, Christian
2017-03-01
Shoulder pain in the general population is common and to identify the aetiology of shoulder pain, history, motion and muscle testing, and physical examination tests are usually performed. The aim of this systematic review was to summarise and evaluate intrarater and inter-rater reliability of physical examination tests in the diagnosis of shoulder pathologies. A comprehensive systematic literature search was conducted using MEDLINE, EMBASE, Allied and Complementary Medicine Database (AMED) and Physiotherapy Evidence Database (PEDro) through 20 March 2015. Methodological quality was assessed using the Quality Appraisal of Reliability Studies (QAREL) tool by 2 independent reviewers. The search strategy revealed 3259 articles, of which 18 finally met the inclusion criteria. These studies evaluated the reliability of 62 test and test variations used for the specific physical examination tests for the diagnosis of shoulder pathologies. Methodological quality ranged from 2 to 7 positive criteria of the 11 items of the QAREL tool. This review identified a lack of high-quality studies evaluating inter-rater as well as intrarater reliability of specific physical examination tests for the diagnosis of shoulder pathologies. In addition, reliability measures differed between included studies hindering proper cross-study comparisons. PROSPERO CRD42014009018. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/.
de Vroege, Lars; Emons, Wilco H M; Sijtsma, Klaas; van der Feltz-Cornelis, Christina M
2018-01-01
The Bermond-Vorst Alexithymia Questionnaire (BVAQ) has been validated in student samples and small clinical samples, but not in the general population; thus, representative general-population norms are lacking. We examined the factor structure of the BVAQ in Longitudinal Internet Studies for the Social Sciences panel data from the Dutch general population ( N = 974). Factor analyses revealed a first-order five-factor model and a second-order two-factor model. However, in the second-order model, the factor interpreted as analyzing ability loaded on both the affective factor and the cognitive factor. Further analyses showed that the first-order test scores are more reliable than the second-order test scores. External and construct validity were addressed by comparing BVAQ scores with a clinical sample of patients suffering from somatic symptom and related disorder (SSRD) ( N = 235). BVAQ scores differed significantly between the general population and patients suffering from SSRD, suggesting acceptable construct validity. Age was positively associated with alexithymia. Males showed higher levels of alexithymia. The BVAQ is a reliable alternative measure for measuring alexithymia.
Medical student quality-of-life in the clerkships: a scale validation study.
Brannick, Michael T; Horn, Gregory T; Schnaus, Michael J; Wahi, Monika M; Goldin, Steven B
2015-04-01
Many aspects of medical school are stressful for students. To empirically assess student reactions to clerkship programs, or to assess efforts to improve such programs, educators must measure the overall well-being of the students reliably and validly. The purpose of the study was to develop and validate a measure designed to achieve these goals. The authors developed a measure of quality of life for medical students by sampling (public domain) items tapping general happiness, fatigue, and anxiety. A quality-of-life scale was developed by factor analyzing responses to the items from students in two different clerkships from 2005 to 2008. Reliability was assessed using Cronbach's alpha. Validity was assessed by factor analysis, convergence with additional theoretically relevant scales, and sensitivity to change over time. The refined nine-item measure is a Likert scaled survey of quality-of-life items comprised of two domains: exhaustion and general happiness. The resulting scale demonstrated good reliability and factorial validity at two time points for each of the two samples. The quality-of-life measure also correlated with measures of depression and the amount of sleep reported during the clerkships. The quality-of-life measure appeared more sensitive to changes over time than did the depression measure. The measure is short and can be easily administered in a survey. The scale appears useful for program evaluation and more generally as an outcome variable in medical educational research.
Reliability of the Cooking Task in adults with acquired brain injury.
Poncet, Frédérique; Swaine, Bonnie; Taillefer, Chantal; Lamoureux, Julie; Pradat-Diehl, Pascale; Chevignard, Mathilde
2015-01-01
Acquired brain injury (ABI) often leads to deficits in executive functioning (EF) responsible for severe and long-standing disabilities in daily life activities. The Cooking Task is an ecological and valid test of EF involving multi-tasking in a real environment. Given its complex scoring system, it is important to establish the tool's reliability. The objective of the study was to examine the reliability of the Cooking Task (internal consistency, inter-rater and test-retest reliability). A total of 160 patients with ABI (113 men, mean age 37 years, SD = 14.3) were tested using the Cooking Task. For test-retest reliability, patients were assessed by the same rater on two occasions (mean interval 11 days) while two raters independently and simultaneously observed and scored patients' performances to estimate inter-rater reliability. Internal consistency was high for the global scale (Cronbach α = .74). Inter-rater reliability (n = 66) for total errors was also high (ICC = .93), however the test-retest reliability (n = 11) was poor (ICC = .36). In general the Cooking Task appears to be a reliable tool. The low test-retest results were expected given the importance of EF in the performance of novel tasks.
DOT National Transportation Integrated Search
2006-03-01
There have been several studies that have investigated interactions between light and heavy vehicles. These have primarily consisted of crash database analyses where Police Accident Reports have been studied. These approaches are generally reliable, ...
ERIC Educational Resources Information Center
Shogren, Karrie A.; Shaw, Leslie A.; Raley, Sheida K.; Wehmeyer, Michael L.; Niemiec, Ryan M.; Adkins, Megan
2018-01-01
This article reports the results of an examination of the endorsement, reliability, and factorial validity of the VIA--Youth and assessment of character strengths and virtues developed for the general population in youth with and without intellectual disability. Findings suggest that, generally, youth with intellectual disability endorsed…
Vasconcelos-Raposo, José; Fernandes, Helder Miguel; Teixeira, Carla M
2013-01-01
The purpose of the present study was to assess the factor structure and reliability of the Depression, Anxiety and Stress Scales (DASS-21) in a large Portuguese community sample. Participants were 1020 adults (585 women and 435 men), with a mean age of 36.74 (SD = 11.90) years. All scales revealed good reliability, with Cronbach's alpha values between .80 (anxiety) and .84 (depression). The internal consistency of the total score was .92. Confirmatory factor analysis revealed that the best-fitting model (*CFI = .940, *RMSEA = .038) consisted of a latent component of general psychological distress (or negative affectivity) plus orthogonal depression, anxiety and stress factors. The Portuguese version of the DASS-21 showed good psychometric properties (factorial validity and reliability) and thus can be used as a reliable and valid instrument for measuring depression, anxiety and stress symptoms.
A Study of the Accuracy and Reliability of Articles about Alopecia in Newspapers
Park, In Ho; Kim, Do Hyeong; Park, So Hee; Cho, Gyeong Je; Seol, Jung Eun
2018-01-01
Background There is growing interest in alopecia among the general population. Many people obtain information from easily accessible media rather than from doctors; thus, the media can play an important role in shaping public opinion. Objective The goal of this study was to evaluate the content and reliability of newspaper articles on alopecia. Methods Newspapers were categorized into three groups: one group of print newspapers and two groups of online newspapers. Online newspapers were further divided into two groups according to type of publishing company; one publishes both print and online newspapers and the other publishes online newspapers only. The most frequently subscribed or circulated newspaper in each group was selected. Articles containing information on alopecia were selected from 3 years of each newspaper and evaluated for reliability. Results Most articles in each group used the general term “alopecia” instead of naming a specific hair loss disease. The majority of articles were based on consultation with experts. Assessment of the accuracy of articles with three grade scales showed that the percentage with high accuracy was 38.9%, 47.2%, and 23.3%. Assessment of reliability scores for five selected articles in each group showed that there were statistically significant differences between common readers and dermatologists (p<0.05). Conclusion The results of this study suggest that closer monitoring of the media is required to supply easily accessible, balanced, and trustworthy information regarding alopecia. PMID:29853745
Manzoni, Gian Mauro; Rossi, Alessandro; Marazzi, Nicoletta; Agosti, Fiorenza; De Col, Alessandra; Pietrabissa, Giada; Castelnuovo, Gianluca; Molinari, Enrico; Sartorio, Allessandro
2018-01-01
This study was aimed to examine the feasibility, validity, and reliability of the Italian Pediatric Quality of Life Inventory Multidimensional Fatigue Scale (PedsQL™ MFS) for adult inpatients with severe obesity. 200 inpatients (81% females) with severe obesity (BMI ≥ 35 kg/m2) completed the PedsQL MFS (General Fatigue, Sleep/Rest Fatigue and Cognitive Fatigue domains), the Fatigue Severity Scale, and the Center for Epidemiologic Studies Depression Scale immediately after admission to a 3-week residential body weight reduction program. A randomized subsample of 48 patients re-completed the PedsQL MFS after 3 days. Confirmatory factor analysis showed that a modified hierarchical model with two items moved from the Sleep/Rest Fatigue domain to the General Fatigue domain and a second-order latent factor best fitted the data. Internal consistency and test-retest reliabilities were acceptable to high in all scales, and small to high statistically significant correlations were found with all convergent measures, with the exception of BMI. Significant floor effects were found in two scales (Cognitive Fatigue and Sleep/Rest Fatigue). The Italian modified PedsQL MFS for adults showed to be a valid and reliable tool for the assessment of fatigue in inpatients with severe obesity. Future studies should assess its discriminant validity as well as its responsiveness to weight reduction. © 2018 The Author(s) Published by S. Karger GmbH, Freiburg.
Indrebø, Kirsten Lerum; Andersen, John Roger; Natvig, Gerd Karin
2014-01-01
The purpose of this study was to adapt the Ostomy Adjustment Scale to a Norwegian version and to assess its construct validity and 2 components of its reliability (internal consistency and test-retest reliability). One hundred fifty-eight of 217 patients (73%) with a colostomy, ileostomy, or urostomy participated in the study. Slightly more than half (56%) were men. Their mean age was 64 years (range, 26-91 years). All respondents had undergone ostomy surgery at least 3 months before participation in the study. The Ostomy Adjustment Scale was translated into Norwegian according to standard procedures for forward and backward translation. The questionnaire was sent to the participants via regular post. The Cronbach alpha and test-retest were computed to assess reliability. Construct validity was evaluated via correlations between each item and score sums; correlations were used to analyze relationships between the Ostomy Adjustment Scale and the 36-item Short Form Health Survey, the Quality of Life Scale, the Hospital Anxiety & Depression Scale, and the General Self-Efficacy Scale. The Cronbach alpha was 0.93, and test-retest reliability r was 0.69. The average correlation quotient item to sum score was 0.49 (range, 0.31-0.73). Results showed moderate negative correlations between the Ostomy Adjustment Scale and the Hospital Anxiety and Depression Scale (-0.37 and -0.40), and moderate positive correlations between the Ostomy Adjustment Scale and the 36-item Short Form Health Survey, the Quality of Life Scale, and the General Self-Efficacy Scale (0.30-0.45) with the exception of the pain domain in the Short Form 36 (0.28). Regression analysis showed linear associations between the Ostomy Adjustment Scale and sociodemographic and clinical variables with the exception of education. The Norwegian language version of the Ostomy Adjustment Scale was found to possess construct validity, along with internal consistency and test-retest reliability. The instrument is sensitive for sociodemographic and clinical variables pertinent to persons with urostomies, colostomies, and ileostomies.
Program for computer aided reliability estimation
NASA Technical Reports Server (NTRS)
Mathur, F. P. (Inventor)
1972-01-01
A computer program for estimating the reliability of self-repair and fault-tolerant systems with respect to selected system and mission parameters is presented. The computer program is capable of operation in an interactive conversational mode as well as in a batch mode and is characterized by maintenance of several general equations representative of basic redundancy schemes in an equation repository. Selected reliability functions applicable to any mathematical model formulated with the general equations, used singly or in combination with each other, are separately stored. One or more system and/or mission parameters may be designated as a variable. Data in the form of values for selected reliability functions is generated in a tabular or graphic format for each formulated model.
Day-to-day reliability of gait characteristics in rats.
Raffalt, Peter C; Nielsen, Louise R; Madsen, Stefan; Munk Højberg, Laurits; Pingel, Jessica; Nielsen, Jens Bo; Wienecke, Jacob; Alkjær, Tine
2018-04-27
The purpose of the present study was to determine the day-to-day reliability in stride characteristics in rats during treadmill walking obtained with two-dimensional (2D) motion capture. Kinematics were recorded from 26 adult rats during walking at 8 m/min, 12 m/min and 16 m/min on two separate days. Stride length, stride time, contact time, swing time and hip, knee and ankle joint range of motion were extracted from 15 strides. The relative reliability was assessed using intra-class correlation coefficients (ICC(1,1)) and (ICC(3,1)). The absolute reliability was determined using measurement error (ME). Across walking speeds, the relative reliability ranged from fair to good (ICCs between 0.4 and 0.75). The ME was below 91 mm for strides lengths, below 55 ms for the temporal stride variables and below 6.4° for the joint angle range of motion. In general, the results indicated an acceptable day-to-day reliability of the gait pattern parameters observed in rats during treadmill walking. The results of the present study may serve as a reference material that can help future intervention studies on rat gait characteristics both with respect to the selection of outcome measures and in the interpretation of the results. Copyright © 2018 Elsevier Ltd. All rights reserved.
McKay, J; Murphy, D J; Bowie, P; Schmuck, M-L; Lough, M; Eva, K W
2007-04-01
To establish the content validity and specific aspects of reliability for an assessment instrument designed to provide formative feedback to general practitioners (GPs) on the quality of their written analysis of a significant event. Content validity was quantified by application of a content validity index. Reliability testing involved a nested design, with 5 cells, each containing 4 assessors, rating 20 unique significant event analysis (SEA) reports (10 each from experienced GPs and GPs in training) using the assessment instrument. The variance attributable to each identified variable in the study was established by analysis of variance. Generalisability theory was then used to investigate the instrument's ability to discriminate among SEA reports. Content validity was demonstrated with at least 8 of 10 experts endorsing all 10 items of the assessment instrument. The overall G coefficient for the instrument was moderate to good (G>0.70), indicating that the instrument can provide consistent information on the standard achieved by the SEA report. There was moderate inter-rater reliability (G>0.60) when four raters were used to judge the quality of the SEA. This study provides the first steps towards validating an instrument that can provide educational feedback to GPs on their analysis of significant events. The key area identified to improve instrument reliability is variation among peer assessors in their assessment of SEA reports. Further validity and reliability testing should be carried out to provide GPs, their appraisers and contractual bodies with a validated feedback instrument on this aspect of the general practice quality agenda.
Frosini, Francesco; Miniati, Roberto; Grillone, Saverio; Dori, Fabrizio; Gentili, Guido Biffi; Belardinelli, Andrea
2016-11-14
The following study proposes and tests an integrated methodology involving Health Technology Assessment (HTA) and Failure Modes, Effects and Criticality Analysis (FMECA) for the assessment of specific aspects related to robotic surgery involving safety, process and technology. The integrated methodology consists of the application of specific techniques coming from the HTA joined to the aid of the most typical models from reliability engineering such as FMEA/FMECA. The study has also included in-site data collection and interviews to medical personnel. The total number of robotic procedures included in the analysis was 44: 28 for urology and 16 for general surgery. The main outcomes refer to the comparative evaluation between robotic, laparoscopic and open surgery. Risk analysis and mitigation interventions come from FMECA application. The small sample size available for the study represents an important bias, especially for the clinical outcomes reliability. Despite this, the study seems to confirm the better trend for robotics' surgical times with comparison to the open technique as well as confirming the robotics' clinical benefits in urology. More complex situation is observed for general surgery, where robotics' clinical benefits directly measured are the lowest blood transfusion rate.
Evaluating the care of general medicine inpatients: how good is implicit review?
Hayward, R A; McMahon, L F; Bernard, A M
1993-04-01
Peer review often consists of implicit evaluations by physician reviewers of the quality and appropriateness of care. This study evaluated the ability of implicit review to measure reliably various aspects of care on a general medicine inpatient service. Retrospective review of patients' charts, using structured implicit review, of a stratified random sample of consecutive admissions to a general medicine ward. A university teaching hospital. Twelve internists were trained in structured implicit review and reviewed 675 patient admissions (with 20% duplicate reviews for a total of 846 reviews). Although inter-rater reliabilities for assessments of overall quality of care and preventable deaths (kappa = 0.5) were adequate for aggregate comparisons (for example, comparing mean ratings on two hospital wards), they were inadequate for reliable evaluations of single patients using one or two reviewers. Reviewers' agreement about most focused quality problems (for example, timeliness of diagnostic evaluation and clinical readiness at time of discharge) and about the appropriateness of hospital ancillary resource use was poor (kappa < or = 0.2). For most focused implicit measures, bias due to specific reviewers who were systematically more harsh or lenient (particularly for evaluation of resource-use appropriateness) accounted for much of the variation in reviewers' assessments, but this was not a substantial problem for the measure of overall quality. Reviewers rarely reported being unable to evaluate the quality of care because of deficiencies in documentation in the patient's chart. For assessment of overall quality and preventable deaths of general medicine inpatients, implicit review by peers had moderate degrees of reliability, but for most other specific aspects of care, physician reviewers could not agree. Implicit review was particularly unreliable at evaluating the appropriateness of hospital resource use and the patient's readiness for discharge, two areas where this type of review is often used.
Cai, Gaigai; Chen, Xuefeng; Li, Bing; Chen, Baojia; He, Zhengjia
2012-01-01
The reliability of cutting tools is critical to machining precision and production efficiency. The conventional statistic-based reliability assessment method aims at providing a general and overall estimation of reliability for a large population of identical units under given and fixed conditions. However, it has limited effectiveness in depicting the operational characteristics of a cutting tool. To overcome this limitation, this paper proposes an approach to assess the operation reliability of cutting tools. A proportional covariate model is introduced to construct the relationship between operation reliability and condition monitoring information. The wavelet packet transform and an improved distance evaluation technique are used to extract sensitive features from vibration signals, and a covariate function is constructed based on the proportional covariate model. Ultimately, the failure rate function of the cutting tool being assessed is calculated using the baseline covariate function obtained from a small sample of historical data. Experimental results and a comparative study show that the proposed method is effective for assessing the operation reliability of cutting tools. PMID:23201980
A general software reliability process simulation technique
NASA Technical Reports Server (NTRS)
Tausworthe, Robert C.
1991-01-01
The structure and rationale of the generalized software reliability process, together with the design and implementation of a computer program that simulates this process are described. Given assumed parameters of a particular project, the users of this program are able to generate simulated status timelines of work products, numbers of injected anomalies, and the progress of testing, fault isolation, repair, validation, and retest. Such timelines are useful in comparison with actual timeline data, for validating the project input parameters, and for providing data for researchers in reliability prediction modeling.
Covariate-free and Covariate-dependent Reliability.
Bentler, Peter M
2016-12-01
Classical test theory reliability coefficients are said to be population specific. Reliability generalization, a meta-analysis method, is the main procedure for evaluating the stability of reliability coefficients across populations. A new approach is developed to evaluate the degree of invariance of reliability coefficients to population characteristics. Factor or common variance of a reliability measure is partitioned into parts that are, and are not, influenced by control variables, resulting in a partition of reliability into a covariate-dependent and a covariate-free part. The approach can be implemented in a single sample and can be applied to a variety of reliability coefficients.
Inter-rater reliability of twelve diagnostic systems of schizophrenia.
Helmes, E; Landmark, J; Kazarian, S S
1983-05-01
The present and past symptomatology of 31 chronic schizophrenics was rated by four independent judges, two experienced clinical psychiatrists and two psychiatric residents, in a context more representative of actual clinical practice than most research studies. Ratings were made on 64 symptoms derived from 12 diagnostic systems, based on either live or videotaped interviews for present symptomatology and case records for past symptomatology. Inter-rater reliabilities were higher for present than for past symptoms, and in general did not approach those reported for highly trained raters. There were no differences between live and videotaped interviews. Diagnostic systems differed widely in rater agreement. The most consistent across both past and present symptomatology were the systems of Langfeldt, Schneider, and DSM-III, for which the level of reliability was consistent with other studies.
Soleimani, Mohammad Ali; Yaghoobzadeh, Ameneh; Bahrami, Nasim; Sharif, Saeed Pahlevan; Sharif Nia, Hamid
2016-10-01
In this study, 398 Iranian cancer patients completed the 15-item Templer's Death Anxiety Scale (TDAS). Tests of internal consistency, principal components analysis, and confirmatory factor analysis were conducted to assess the internal consistency and factorial validity of the Persian TDAS. The construct reliability statistic and average variance extracted were also calculated to measure construct reliability, convergent validity, and discriminant validity. Principal components analysis indicated a 3-component solution, which was generally supported in the confirmatory analysis. However, acceptable cutoffs for construct reliability, convergent validity, and discriminant validity were not fulfilled for the three subscales that were derived from the principal component analysis. This study demonstrated both the advantages and potential limitations of using the TDAS with Persian-speaking cancer patients.
Grant, Andrew J; Vermunt, Jan D; Kinnersley, Paul; Houston, Helen
2007-01-01
Background Portfolio learning enables students to collect evidence of their learning. Component tasks making up a portfolio can be devised that relate directly to intended learning outcomes. Reflective tasks can stimulate students to recognise their own learning needs. Assessment of portfolios using a rating scale relating to intended learning outcomes offers high content validity. This study evaluated a reflective portfolio used during a final-year attachment in general practice (family medicine). Students were asked to evaluate the portfolio (which used significant event analysis as a basis for reflection) as a learning tool. The validity and reliability of the portfolio as an assessment tool were also measured. Methods 81 final-year medical students completed reflective significant event analyses as part of a portfolio created during a three-week attachment (clerkship) in general practice (family medicine). As well as two reflective significant event analyses each portfolio contained an audit and a health needs assessment. Portfolios were marked three times; by the student's GP teacher, the course organiser and by another teacher in the university department of general practice. Inter-rater reliability between pairs of markers was calculated. A questionnaire enabled the students' experience of portfolio learning to be determined. Results Benefits to learning from reflective learning were limited. Students said that they thought more about the patients they wrote up in significant event analyses but information as to the nature and effect of this was not forthcoming. Moderate inter-rater reliability (Spearman's Rho .65) was found between pairs of departmental raters dealing with larger numbers (20 – 60) of portfolios. Inter-rater reliability of marking involving GP tutors who only marked 1 – 3 portfolios was very low. Students rated highly their mentoring relationship with their GP teacher but found the portfolio tasks time-consuming. Conclusion The inter-rater reliability observed in this study should be viewed alongside the high validity afforded by the authenticity of the learning tasks (compared with a sample of a student's learning taken by an exam question). Validity is enhanced by the rating scale which directly connects the grade given with intended learning outcomes. The moderate inter-rater reliability may be increased if a portfolio is completed over a longer period of time and contains more component pieces of work. The questionnaire used in this study only accessed limited information about the effect of reflection on students' learning. Qualitative methods of evaluation would determine the students experience in greater depth. It would be useful to evaluate the effects of reflective learning after students have had more time to get used to this unfamiliar method of learning and to overcome any problems in understanding the task. PMID:17397544
Grant, Andrew J; Vermunt, Jan D; Kinnersley, Paul; Houston, Helen
2007-03-30
Portfolio learning enables students to collect evidence of their learning. Component tasks making up a portfolio can be devised that relate directly to intended learning outcomes. Reflective tasks can stimulate students to recognise their own learning needs. Assessment of portfolios using a rating scale relating to intended learning outcomes offers high content validity. This study evaluated a reflective portfolio used during a final-year attachment in general practice (family medicine). Students were asked to evaluate the portfolio (which used significant event analysis as a basis for reflection) as a learning tool. The validity and reliability of the portfolio as an assessment tool were also measured. 81 final-year medical students completed reflective significant event analyses as part of a portfolio created during a three-week attachment (clerkship) in general practice (family medicine). As well as two reflective significant event analyses each portfolio contained an audit and a health needs assessment. Portfolios were marked three times; by the student's GP teacher, the course organiser and by another teacher in the university department of general practice. Inter-rater reliability between pairs of markers was calculated. A questionnaire enabled the students' experience of portfolio learning to be determined. Benefits to learning from reflective learning were limited. Students said that they thought more about the patients they wrote up in significant event analyses but information as to the nature and effect of this was not forthcoming. Moderate inter-rater reliability (Spearman's Rho .65) was found between pairs of departmental raters dealing with larger numbers (20-60) of portfolios. Inter-rater reliability of marking involving GP tutors who only marked 1-3 portfolios was very low. Students rated highly their mentoring relationship with their GP teacher but found the portfolio tasks time-consuming. The inter-rater reliability observed in this study should be viewed alongside the high validity afforded by the authenticity of the learning tasks (compared with a sample of a student's learning taken by an exam question). Validity is enhanced by the rating scale which directly connects the grade given with intended learning outcomes. The moderate inter-rater reliability may be increased if a portfolio is completed over a longer period of time and contains more component pieces of work. The questionnaire used in this study only accessed limited information about the effect of reflection on students' learning. Qualitative methods of evaluation would determine the students experience in greater depth. It would be useful to evaluate the effects of reflective learning after students have had more time to get used to this unfamiliar method of learning and to overcome any problems in understanding the task.
Reliability of rapid reporting of cancers in New Hampshire.
Celaya, Maria O; Riddle, Bruce L; Cherala, Sai S; Armenti, Karla R; Rees, Judy R
2010-01-01
The New Hampshire State Cancer Registry (NHSCR) has a 2-phase reporting system. An abbreviated, "rapid" report of cancer diagnosis or treatment is due to the central registry within 45 days of diagnosis and a more detailed, definitive report is due within 180 days. Rapid reports are used for various research studies, but researchers who contact patients are warned that the rapid reports may contain inaccuracies. This study aimed to assess the reliability of rapid cancer reports. For diagnosis years 2000-2004, we compared the rapid and definitive reports submitted to NHSCR. We calculated the sensitivity and positive predictive value of rapid reports; the reliability of key data items overall and for major sites; and the time between diagnosis and submission of the report. Rapid reports identified incident cancer cases with a sensitivity of 88.5%. The overall accuracy of key data items was high. The accuracy of primary sites identified by rapid reports was high generally but lower for ovarian and unknown primaries. A subset analysis showed that 47% of cancers were reported within 90 days of diagnosis. Rapid reports submitted to NHSCR are generally of high quality and present a useful opportunity for research investigations in New Hampshire.
Aeromedical transportation and general aviation.
DOT National Transportation Integrated Search
1971-04-01
The advantages of aircraft in providing military medical evacuation are well documented. Training and experience have resulted in a reliable and safe military medical evacuation system. Many studies have been done or are in process which pertain to c...
Spaan, Suzanne; Pronk, Anjoeka; Koch, Holger M.; Jusko, Todd A.; Jaddoe, Vincent W.V.; Shaw, Pamela A.; Tiemeier, Henning M.; Hofman, Albert; Pierik, Frank H.; Longnecker, Matthew P.
2014-01-01
The widespread use of organophosphate (OP) pesticides has resulted in ubiquitous exposure in humans, primarily through their diet. Exposure to OP pesticides may have adverse health effects, including neurobehavioral deficits in children. The optimal design of new studies requires data on the reliability of urinary measures of exposure. In the present study, urinary concentrations of six dialkyl phosphate (DAP) metabolites, the main urinary metabolites of OP pesticides, were determined in 120 pregnant women participating in the Generation R Study in Rotterdam. Intra-class correlation coefficients (ICCs) across serial urine specimens taken at <18, 18–25, and >25 weeks of pregnancy were determined to assess reliability. Geometric mean total DAP metabolite concentrations were 229 (GSD 2.2), 240 (GSD 2.1), and 224 (GSD 2.2) nmol/g creatinine across the three periods of gestation. Metabolite concentrations from the serial urine specimens in general correlated moderately. The ICCs for the six DAP metabolites ranged from 0.14 to 0.38 (0.30 for total DAPs), indicating weak to moderate reliability. Although the DAP metabolite levels observed in this study are slightly higher and slightly more correlated than in previous studies, the low to moderate reliability indicates a high degree of within-person variability, which presents challenges for designing well-powered epidemiologic studies. PMID:25515376
Poulton, B C
1996-01-01
BACKGROUND: Primary health care services are the most frequently used in the health care system. Consumer feedback on these services is important. Research in this area relates mainly to doctor-patient relationships which fails to reflect the multidisciplinary nature of primary health care. AIM: A pilot study aimed to examine the feasibility of using a patient satisfaction questionnaire designed for use with general practitioner consultations as an instrument for measuring patient satisfaction with community nurses. METHOD: The questionnaire measuring patient satisfaction with general practitioner consultations was adapted for measuring satisfaction with contacts with a nurse practitioner, district nurses, practice nurses and health visitors. A total of 1575 patients in three practices consulting general practitioners or community nurses were invited to complete a questionnaire. Data were subjected to principal components analysis and the dimensions identified were tested for internal reliability and replicability. To establish discriminant validity, patients' mean satisfaction scores for consultations with general practitioners, the nurse practitioner, health visitors and nurses (district and practice nurses) were compared. RESULTS: Questionnaires were returned relating to 400 general practitioner, 54 nurse practitioner, 191 district/practice nurse and 83 health visitor consultations (overall response rate 46%). Principal components analysis demonstrated a factor structure similar to that found in an earlier study of the consultation satisfaction questionnaire. Three dimensions of patient satisfaction were identified: professional care, depth of relationship and perceived time spent with the health professional. The dimensions were found to have acceptable levels of reliability. Factor structures obtained from data relating to general practitioner and community nurse consultations were found to correlate significantly. Comparison between health professionals showed that patients rated satisfaction with professional care significantly more highly for nurses than for general practitioners and health visitors. Patients' rating of satisfaction with the depth of relationships with health visitors was significantly lower than their ratings of this relationship with the other groups of health professionals. There were so significant differences between health professional groups regarding patients' ratings of satisfaction with the perceived amount of time spent with health professionals. CONCLUSION: The pilot study showed that it is possible to use the consultation satisfaction questionnaire for both general practitioners and community nurses. Comparison between health professional groups should be undertaken with caution as data were available for only a small number of consultations with some of the groups of health professionals studied. PMID:8745848
Windschitl, Paul D; Rose, Jason P; Stalkfleet, Michael T; Smith, Andrew R
2008-08-01
People are often egocentric when judging their likelihood of success in competitions, leading to overoptimism about winning when circumstances are generally easy and to overpessimism when the circumstances are difficult. Yet, egocentrism might be grounded in a rational tendency to favor highly reliable information (about the self) more so than less reliable information (about others). A general theory of probability called extended support theory was used to conceptualize and assess the role of egocentrism and its consequences for the accuracy of people's optimism in 3 competitions (Studies 1-3, respectively). Also, instructions were manipulated to test whether people who were urged to avoid egocentrism would show improved or worsened accuracy in their likelihood judgments. Egocentrism was found to have a potentially helpful effect on one form of accuracy, but people generally showed too much egocentrism. Debias instructions improved one form of accuracy but had no impact on another. The advantages of using the EST framework for studying optimism and other types of judgments (e.g., comparative ability judgments) are discussed. (c) 2008 APA, all rights reserved
Reliability of robotic system during general surgical procedures in a university hospital.
Buchs, Nicolas C; Pugin, François; Volonté, Francesco; Morel, Philippe
2014-01-01
Data concerning the reliability of robotic systems are scarce, especially for general surgery. The aim of this study was to assess the incidence and consequences of robotic malfunction in a teaching institution. From January 2006 to September 2012, 526 consecutive robotic general surgical procedures were performed. All failures were prospectively recorded in a computerized database and reviewed retrospectively. Robotic malfunctions occurred in 18 cases (3.4%). These dysfunctions concerned the robotic instruments in 9 cases, the robotic arms in 4 cases, the surgical console in 3 cases, and the optical system in 2 cases. Two malfunctions were considered critical, and 1 led to a laparoscopic conversion (conversion rate due to malfunction, .2%). Overall, there were more dysfunctions at the beginning of the study period (2006 to 2010) than more recently (2011 to 2012) (4.2% vs 2.6%, P = .35). The robotic system malfunction rate was low. Most malfunctions could be resolved during surgery, allowing the procedures to be completed safely. With increased experience, the system malfunction rate seems to be reduced. Copyright © 2014 Elsevier Inc. All rights reserved.
Longitudinal Models of Reliability and Validity: A Latent Curve Approach.
ERIC Educational Resources Information Center
Tisak, John; Tisak, Marie S.
1996-01-01
Dynamic generalizations of reliability and validity that will incorporate longitudinal or developmental models, using latent curve analysis, are discussed. A latent curve model formulated to depict change is incorporated into the classical definitions of reliability and validity. The approach is illustrated with sociological and psychological…
We will make you like our research: The development of a susceptibility-to-persuasion scale.
Modic, David; Anderson, Ross; Palomäki, Jussi
2018-01-01
Psychological and other persuasive mechanisms across diverse contexts are well researched, with many studies of the effectiveness of specific persuasive techniques on distinct types of human behaviour. In the present paper, our specific interest lies in the development of a generalized modular psychometric tool to measure individuals' susceptibility to persuasion. The scale is constructed using items from previously developed and validated particulate scales established in the domains of social psychology and behavioural economics. In the first study we establish the Susceptibility to Persuasion-II (StP-II) scale, containing 54 items, 10 subscales and further 6 sub-sub scales. In Study 2 we establish the scale's construct validity and reconfirm its reliability. We present a valid and reliable modular psychometric tool that measures general susceptibility to persuasive techniques. Since its inception, we have successfully implemented the StP-II scale to measure susceptibility to persuasion of IT security officers, the role of psychology of persuasion in cybercrime victims and general persuadability levels of Facebook users; these manuscripts are in preparation. We argue that the StP-II scale shows promise in measuring individual differences in susceptibility to persuasion, and is applicable across diverse contexts such as Internet security and cybercrime.
Cheng, Shu-Fen; Rose, Susan
2009-01-01
This study investigated the technical adequacy of curriculum-based measures of written expression (CBM-W) in terms of writing prompts and scoring methods for deaf and hard-of-hearing students. Twenty-two students at the secondary school-level completed 3-min essays within two weeks, which were scored for nine existing and alternative curriculum-based measurement (CBM) scoring methods. The technical features of the nine scoring methods were examined for interrater reliability, alternate-form reliability, and criterion-related validity. The existing CBM scoring method--number of correct minus incorrect word sequences--yielded the highest reliability and validity coefficients. The findings from this study support the use of the CBM-W as a reliable and valid tool for assessing general writing proficiency with secondary students who are deaf or hard of hearing. The CBM alternative scoring methods that may serve as additional indicators of written expression include correct subject-verb agreements, correct clauses, and correct morphemes.
Interformat reliability of digital psychiatric self-report questionnaires: a systematic review.
Alfonsson, Sven; Maathz, Pernilla; Hursti, Timo
2014-12-03
Research on Internet-based interventions typically use digital versions of pen and paper self-report symptom scales. However, adaptation into the digital format could affect the psychometric properties of established self-report scales. Several studies have investigated differences between digital and pen and paper versions of instruments, but no systematic review of the results has yet been done. This review aims to assess the interformat reliability of self-report symptom scales used in digital or online psychotherapy research. Three databases (MEDLINE, Embase, and PsycINFO) were systematically reviewed for studies investigating the reliability between digital and pen and paper versions of psychiatric symptom scales. From a total of 1504 publications, 33 were included in the review, and interformat reliability of 40 different symptom scales was assessed. Significant differences in mean total scores between formats were found in 10 of 62 analyses. These differences were found in just a few studies, which indicates that the results were due to study effects and sample effects rather than unreliable instruments. The interformat reliability ranged from r=.35 to r=.99; however, the majority of instruments showed a strong correlation between format scores. The quality of the included studies varied, and several studies had insufficient power to detect small differences between formats. When digital versions of self-report symptom scales are compared to pen and paper versions, most scales show high interformat reliability. This supports the reliability of results obtained in psychotherapy research on the Internet and the comparability of the results to traditional psychotherapy research. There are, however, some instruments that consistently show low interformat reliability, suggesting that these conclusions cannot be generalized to all questionnaires. Most studies had at least some methodological issues with insufficient statistical power being the most common issue. Future studies should preferably provide information about the transformation of the instrument into digital format and the procedure for data collection in more detail.
Dougherty, Cynthia M.; Johnston, Sandra K.; Thompson, Elaine Adams
2009-01-01
The purpose of this study was to assess the reliability and validity characteristics of two new scales that measure self-efficacy expectations (SE-ICD) and outcome expectations (OE-ICD) in survivors (n=168) of sudden cardiac arrest (SCA), all of whom received an implantable cardioverter defibrillator (ICD). Cronbach's alpha reliability demonstrated good internal consistency (SE-ICD α = 0.93 and OE-ICD α = 0.81). Correlations with other self-efficacy instruments (general self-efficacy and social self-efficacy) were consistently high. The instruments were responsive to change across time with effect sizes of 0.46 for SE-ICD, and 0.26 for OE-ICD. These reliable, valid, and responsive instruments for measurement of self-efficacy expectations and outcome expectations after an ICD can be used in research and clinical settings. PMID:17693214
Optimal sample sizes for the design of reliability studies: power consideration.
Shieh, Gwowen
2014-09-01
Intraclass correlation coefficients are used extensively to measure the reliability or degree of resemblance among group members in multilevel research. This study concerns the problem of the necessary sample size to ensure adequate statistical power for hypothesis tests concerning the intraclass correlation coefficient in the one-way random-effects model. In view of the incomplete and problematic numerical results in the literature, the approximate sample size formula constructed from Fisher's transformation is reevaluated and compared with an exact approach across a wide range of model configurations. These comprehensive examinations showed that the Fisher transformation method is appropriate only under limited circumstances, and therefore it is not recommended as a general method in practice. For advance design planning of reliability studies, the exact sample size procedures are fully described and illustrated for various allocation and cost schemes. Corresponding computer programs are also developed to implement the suggested algorithms.
Using generalizability theory to develop clinical assessment protocols.
Preuss, Richard A
2013-04-01
Clinical assessment protocols must produce data that are reliable, with a clinically attainable minimal detectable change (MDC). In a reliability study, generalizability theory has 2 advantages over classical test theory. These advantages provide information that allows assessment protocols to be adjusted to match individual patient profiles. First, generalizability theory allows the user to simultaneously consider multiple sources of measurement error variance (facets). Second, it allows the user to generalize the findings of the main study across the different study facets and to recalculate the reliability and MDC based on different combinations of facet conditions. In doing so, clinical assessment protocols can be chosen based on minimizing the number of measures that must be taken to achieve a realistic MDC, using repeated measures to minimize the MDC, or simply based on the combination that best allows the clinician to monitor an individual patient's progress over a specified period of time.
Test-retest reliability of the proposed DSM-5 eating disorder diagnostic criteria
Sysko, Robyn; Roberto, Christina A.; Barnes, Rachel D.; Grilo, Carlos M.; Attia, Evelyn; Walsh, B. Timothy
2012-01-01
The proposed DSM-5 classification scheme for eating disorders includes both major and minor changes to the existing DSM-IV diagnostic criteria. It is not known what effect these modifications will have on the ability to make reliable diagnoses. Two studies were conducted to evaluate the short-term test-retest reliability of the proposed DSM-5 eating disorder diagnoses: anorexia nervosa, bulimia nervosa, binge eating disorder, and feeding and eating conditions not elsewhere classified. Participants completed two independent telephone interviews with research assessors (n=70 Study 1; n=55 Study 2). Fair to substantial agreements (κ= 0.80 and 0.54) were observed across eating disorder diagnoses in Study 1 and Study 2, respectively. Acceptable rates of agreement were identified for the individual eating disorder diagnoses, including DSM-5 anorexia nervosa (κ’s of 0.81 to 0.97), bulimia nervosa (κ=0.84), binge eating disorder (κ’s of 0.75 and 0.61), and feeding and eating disorders not elsewhere classified (κ’s of 0.70 and 0.46). Further, improved short-term test-retest reliability was noted when using the DSM-5, in comparison to DSM-IV, criteria for binge eating disorder. Thus, these studies found that trained interviewers can reliably diagnose eating disorders using the proposed DSM-5 criteria; however, additional data from general practice settings and community samples are needed. PMID:22401974
Assuring reliability program effectiveness.
NASA Technical Reports Server (NTRS)
Ball, L. W.
1973-01-01
An attempt is made to provide simple identification and description of techniques that have proved to be most useful either in developing a new product or in improving reliability of an established product. The first reliability task is obtaining and organizing parts failure rate data. Other tasks are parts screening, tabulation of general failure rates, preventive maintenance, prediction of new product reliability, and statistical demonstration of achieved reliability. Five principal tasks for improving reliability involve the physics of failure research, derating of internal stresses, control of external stresses, functional redundancy, and failure effects control. A final task is the training and motivation of reliability specialist engineers.
Structure reliability design and analysis of support ring for cylinder seal
NASA Astrophysics Data System (ADS)
Minmin, Zhao
2017-09-01
In this paper, the general reliability design process of the cross-sectional dimension of the support ring is introduced, which is used for the cylinder sealing. Then, taking a certain section shape support ring as an example, the every size parameters of section are determined from the view point of reliability design. Last, the static strength and reliability of the support ring are analyzed to verify the correctness of the reliability design result.
Test-retest reliability of resting-state magnetoencephalography power in sensor and source space.
Martín-Buro, María Carmen; Garcés, Pilar; Maestú, Fernando
2016-01-01
Several studies have reported changes in spontaneous brain rhythms that could be used as clinical biomarkers or in the evaluation of neuropsychological and drug treatments in longitudinal studies using magnetoencephalography (MEG). There is an increasing necessity to use these measures in early diagnosis and pathology progression; however, there is a lack of studies addressing how reliable they are. Here, we provide the first test-retest reliability estimate of MEG power in resting-state at sensor and source space. In this study, we recorded 3 sessions of resting-state MEG activity from 24 healthy subjects with an interval of a week between each session. Power values were estimated at sensor and source space with beamforming for classical frequency bands: delta (2-4 Hz), theta (4-8 Hz), alpha (8-13 Hz), low beta (13-20 Hz), high beta (20-30 Hz), and gamma (30-45 Hz). Then, test-retest reliability was evaluated using the intraclass correlation coefficient (ICC). We also evaluated the relation between source power and the within-subject variability. In general, ICC of theta, alpha, and low beta power was fairly high (ICC > 0.6) while in delta and gamma power was lower. In source space, fronto-posterior alpha, frontal beta, and medial temporal theta showed the most reliable profiles. Signal-to-noise ratio could be partially responsible for reliability as low signal intensity resulted in high within-subject variability, but also the inherent nature of some brain rhythms in resting-state might be driving these reliability patterns. In conclusion, our results described the reliability of MEG power estimates in each frequency band, which could be considered in disease characterization or clinical trials. © 2015 Wiley Periodicals, Inc.
The Use of Teacher Judgement for Summative Assessment in the USA
ERIC Educational Resources Information Center
Brookhart, Susan M.
2013-01-01
Studies of the use of teacher judgement for summative assessment in the USA are considered in two general categories. (1) Studies of teacher classroom summative assessment, that is, teacher grading practices, have historically and currently emphasised the lack of validity and reliability of these judgements. (2) Studies of how teacher judgement…
PERFORMANCE OF TRICKLING FILTER PLANTS: RELIABILITY, STABILITY, VARIABILITY
Effluent quality variability from trickling filters was examined in this study by statistically analyzing daily effluent BOD5 and suspended solids data from 11 treatment plants. Summary statistics (mean, standard deviation, etc.) were examined to determine the general characteris...
Spatially Regularized Machine Learning for Task and Resting-state fMRI
Song, Xiaomu; Panych, Lawrence P.; Chen, Nan-kuei
2015-01-01
Background Reliable mapping of brain function across sessions and/or subjects in task- and resting-state has been a critical challenge for quantitative fMRI studies although it has been intensively addressed in the past decades. New Method A spatially regularized support vector machine (SVM) technique was developed for the reliable brain mapping in task- and resting-state. Unlike most existing SVM-based brain mapping techniques, which implement supervised classifications of specific brain functional states or disorders, the proposed method performs a semi-supervised classification for the general brain function mapping where spatial correlation of fMRI is integrated into the SVM learning. The method can adapt to intra- and inter-subject variations induced by fMRI nonstationarity, and identify a true boundary between active and inactive voxels, or between functionally connected and unconnected voxels in a feature space. Results The method was evaluated using synthetic and experimental data at the individual and group level. Multiple features were evaluated in terms of their contributions to the spatially regularized SVM learning. Reliable mapping results in both task- and resting-state were obtained from individual subjects and at the group level. Comparison with Existing Methods A comparison study was performed with independent component analysis, general linear model, and correlation analysis methods. Experimental results indicate that the proposed method can provide a better or comparable mapping performance at the individual and group level. Conclusions The proposed method can provide accurate and reliable mapping of brain function in task- and resting-state, and is applicable to a variety of quantitative fMRI studies. PMID:26470627
The Modified Abbreviated Math Anxiety Scale: A Valid and Reliable Instrument for Use with Children.
Carey, Emma; Hill, Francesca; Devine, Amy; Szűcs, Dénes
2017-01-01
Mathematics anxiety (MA) can be observed in children from primary school age into the teenage years and adulthood, but many MA rating scales are only suitable for use with adults or older adolescents. We have adapted one such rating scale, the Abbreviated Math Anxiety Scale (AMAS), to be used with British children aged 8-13. In this study, we assess the scale's reliability, factor structure, and divergent validity. The modified AMAS (mAMAS) was administered to a very large ( n = 1746) cohort of British children and adolescents. This large sample size meant that as well as conducting confirmatory factor analysis on the scale itself, we were also able to split the sample to conduct exploratory and confirmatory factor analysis of items from the mAMAS alongside items from child test anxiety and general anxiety rating scales. Factor analysis of the mAMAS confirmed that it has the same underlying factor structure as the original AMAS, with subscales measuring anxiety about Learning and Evaluation in math. Furthermore, both exploratory and confirmatory factor analysis of the mAMAS alongside scales measuring test anxiety and general anxiety showed that mAMAS items cluster onto one factor (perceived to represent MA). The mAMAS provides a valid and reliable scale for measuring MA in children and adolescents, from a younger age than is possible with the original AMAS. Results from this study also suggest that MA is truly a unique construct, separate from both test anxiety and general anxiety, even in childhood.
Heo, K H; Squires, J; Yovanoff, P
2008-03-01
Accurate and efficient developmental screening measures are critical for early identification of developmental problems; however, few reliable and valid tests are available in Korea as well as other countries outside the USA. The Ages and Stages Questionnaires (ASQ) was chosen for study with young children in Korea. The ASQ was translated into Korean and necessary cross-cultural adaptations were made. The translated version was then distributed and completed by 3220 parents of young children between the ages of 4 months and 5 years. Reliability was studied including domain correlations, internal consistency, and performance of identification cut-off scores for the Korean population. Rasch analyses including tests of Differential Item Functioning, contrasting Korean and US samples were also performed. In general, internal consistency of the Korean ASQ was high, with overall correlations 0.75 for communication, 0.85 for gross motor, 0.74 for fine motor, 0.72 for problem solving, and 0.65 for personal-social. Validity, including concurrent validity, also had strong evidence. Mean scores of children on the Korean translation of the ASQ and the US normative sample were generally similar. Rasch analyses indicated the majority of items functioned similarly across the Korean sample. In general, the ASQ was translated with cultural appropriateness in mind and functioned as a valid and reliable parent-completed screening test to assist in early identification of young children with developmental delays. Further research is needed to confirm these results with a larger and more diverse Korean sample.
López-Pascual, Juan; Cáceres, Magda Liliana; De Rosario, Helios; Page, Álvaro
2016-02-08
The reliability of joint rotation measurements is an issue of major interest, especially in clinical applications. The effect of instrumental errors and soft tissue artifacts on the variability of human motion measures is well known, but the influence of the representation of joint motion has not yet been studied. The aim of the study was to compare the within-subject reliability of three rotation formalisms for the calculation of the shoulder elevation joint angles. Five repetitions of humeral elevation in the scapular plane of 27 healthy subjects were recorded using a stereophotogrammetry system. The humerothoracic joint angles were calculated using the YX'Y" and XZ'Y" Euler angle sequences and the attitude vector. A within-subject repeatability study was performed for the three representations. ICC, SEM and CV were the indices used to estimate the error in the calculation of the angle amplitudes and the angular waveforms with each method. Excellent results were obtained in all representations for the main angle (elevation), but there were remarkable differences for axial rotation and plane of elevation. The YX'Y" sequence generally had the poorest reliability in the secondary angles. The XZ'Y' sequence proved to be the most reliable representation of axial rotation, whereas the attitude vector had the highest reliability in the plane of elevation. These results highlight the importance of selecting the method used to describe the joint motion when within-subjects reliability is an important issue of the experiment. This may be of particular importance when the secondary angles of motions are being studied. Copyright © 2016 Elsevier Ltd. All rights reserved.
A Study of Intonation in the Soccer Results.
ERIC Educational Resources Information Center
Bonnet, G.
1980-01-01
Reports a study which illustrates that a listener can anticipate the score of the opposing team in sports match results from the variation in the announcer's intonation. Investigates how reliable this prediction is and what linguistic features it involves. Relates these findings to general problems in intonation contour interpretation. (PMJ)
The quest for a general theory of aging and longevity.
Gavrilov, Leonid A; Gavrilova, Natalia S
2003-07-16
Extensive studies of phenomena related to aging have produced many diverse findings, which require a general theoretical framework to be organized into a comprehensive body of knowledge. As demonstrated by the success of evolutionary theories of aging, quite general theoretical considerations can be very useful when applied to research on aging. In this theoretical study, we attempt to gain insight into aging by applying a general theory of systems failure known as reliability theory. Considerations of this theory lead to the following conclusions: (i) Redundancy is a concept of crucial importance for understanding aging, particularly the systemic nature of aging. Systems that are redundant in numbers of irreplaceable elements deteriorate (that is, age) over time, even if they are built of elements that do not themselves age. (ii) An apparent aging rate or expression of aging is higher for systems that have higher levels of redundancy. (iii) Redundancy exhaustion over the life course explains a number of observations about mortality, including mortality convergence at later life (when death rates are becoming relatively similar at advanced ages for different populations of the same species) as well as late-life mortality deceleration, leveling off, and mortality plateaus. (iv) Living organisms apparently contain a high load of initial damage from the early stages of development, and therefore their life span and aging patterns may be sensitive to early-life conditions that determine this initial damage load. Thus, the reliability theory provides a parsimonious explanation for many important aging-related phenomena and suggests a number of interesting testable predictions. We therefore suggest adding the reliability theory to the arsenal of methodological approaches applied to research on aging.
Nyitray, Alan G; Harris, Robin B; Abalos, Andrew T; Nielson, Carrie M; Papenfuss, Mary; Giuliano, Anna R
2010-12-01
Accurate knowledge about human sexual behaviors is important for increasing our understanding of human sexuality; however, there have been few studies assessing the reliability of sexual behavior questionnaires designed for community samples of adult men. A test-retest reliability study was conducted on a questionnaire completed by 334 men who had been recruited in Tucson, Arizona. Reliability coefficients and refusal rates were calculated for 39 non-sexual and sexual behavior questionnaire items. Predictors of unreliable reporting for lifetime number of female sexual partners were also assessed. Refusal rates were generally low, with slightly higher refusal rates for questions related to immigration, income, the frequency of sexual intercourse with women, lifetime number of female sexual partners, and the lifetime number of male anal sex partners. Kappa and intraclass correlation coefficients were substantial or almost perfect for all non-sexual and sexual behavior items. Reliability dropped somewhat, but was still substantial, for items that asked about household income and the men's knowledge of their sexual partners' health, including abnormal Pap tests and prior sexually transmitted diseases (STD). Age and lifetime number of female sexual partners were independent predictors of unreliable reporting while years of education was inversely associated with unreliable reporting. These findings among a community sample of adult men are consistent with other test-retest reliability studies with populations of women and adolescents.
Lim, Kheng Seang; Hills, Michael D; Choo, Wan Yuen; Wong, Mee Hoo; Wu, Cathie; Tan, Chong Tin
2013-02-01
Students' attitudes toward epilepsy have been studied in several countries, but none of the studies used a quantitative scale. We aimed to determine the validity and reliability of the Public Attitudes Toward Epilepsy (PATE) scale in a homogenous population consisting of secondary and tertiary students in Malaysia and to quantify their attitudes toward epilepsy, using a web-based survey. A total of 227 respondents with a mean age of 19.6±2.07 years, predominantly Chinese (85%), female (62%), and in a pre-university education level (71%) completed the web-based survey. Psychometric testing showed that the PATE is a valid and reliable scale to be applied in a homogenous population. The mean score in the personal domain was significantly higher than that in the general domain (2.73±0.61 vs. 2.12±0.60, respectively, p<0.001). Compared with a study previously performed on a general population (Lim et al., 2012 [10]), the mean score in the general domain was significantly lower (p<0.01), whereas there was no significant difference between the mean scores in the personal domain. The mean scores in the general domain were significantly lower for those with tertiary education (p<0.001) but did not correlate with gender and ethnicity. The attitudes of secondary and tertiary students are more positive than those of the general population in the general domain but not in the personal domain. Copyright © 2012 Elsevier Inc. All rights reserved.
Discharge reliability in ablative pulsed plasma thrusters
NASA Astrophysics Data System (ADS)
Wu, Zhiwen; Sun, Guorui; Yuan, Shiyue; Huang, Tiankun; Liu, Xiangyang; Xie, Kan; Wang, Ningfei
2017-08-01
Discharge reliability is typically neglected in low-ignition-cycle ablative pulsed plasma thrusters (APPTs). In this study, the discharge reliability of an APPT is assessed analytically and experimentally. The goals of this study are to better understand the ignition characteristics and to assess the accuracy of the analytical method. For each of six sets of operating conditions, 500 tests of a parallel-plate APPT with a coaxial semiconductor spark plug are conducted. The discharge voltage and current are measured with a high-voltage probe and a Rogowski coil, respectively, to determine whether the discharge is successful. Generally, the discharge success rate increases as the discharge voltage increases, and it decreases as the electrode gap and the number of ignitions increases. The theoretical analysis and the experimental results are reasonably consistent. This approach provides a reference for designing APPTs and improving their stability.
Bellón, Juan Ángel; Moreno-Küstner, Berta; Torres-González, Francisco; Montón-Franco, Carmen; GildeGómez-Barragán, María Josefa; Sánchez-Celaya, Marta; Díaz-Barreiros, Miguel Ángel; Vicens, Catalina; de Dios Luna, Juan; Cervilla, Jorge A; Gutierrez, Blanca; Martínez-Cañavate, María Teresa; Oliván-Blázquez, Bárbara; Vázquez-Medrano, Ana; Sánchez-Artiaga, María Soledad; March, Sebastia; Motrico, Emma; Ruiz-García, Victor Manuel; Brangier-Wainberg, Paulette Renée; del Mar Muñoz-García, María; Nazareth, Irwin; King, Michael
2008-01-01
Background The effects of putative risk factors on the onset and/or persistence of depression remain unclear. We aim to develop comprehensive models to predict the onset and persistence of episodes of depression in primary care. Here we explain the general methodology of the predictD-Spain study and evaluate the reliability of the questionnaires used. Methods This is a prospective cohort study. A systematic random sample of general practice attendees aged 18 to 75 has been recruited in seven Spanish provinces. Depression is being measured with the CIDI at baseline, and at 6, 12, 24 and 36 months. A set of individual, environmental, genetic, professional and organizational risk factors are to be assessed at each follow-up point. In a separate reliability study, a proportional random sample of 401 participants completed the test-retest (251 researcher-administered and 150 self-administered) between October 2005 and February 2006. We have also checked 118,398 items for data entry from a random sample of 480 patients stratified by province. Results All items and questionnaires had good test-retest reliability for both methods of administration, except for the use of recreational drugs over the previous six months. Cronbach's alphas were good and their factorial analyses coherent for the three scales evaluated (social support from family and friends, dissatisfaction with paid work, and dissatisfaction with unpaid work). There were 191 (0.16%) data entry errors. Conclusion The items and questionnaires were reliable and data quality control was excellent. When we eventually obtain our risk index for the onset and persistence of depression, we will be able to determine the individual risk of each patient evaluated in primary health care. PMID:18657275
Kumar, Mohit; Yadav, Shiv Prasad
2012-03-01
This paper addresses the fuzzy system reliability analysis using different types of intuitionistic fuzzy numbers. Till now, in the literature, to analyze the fuzzy system reliability, it is assumed that the failure rates of all components of a system follow the same type of fuzzy set or intuitionistic fuzzy set. However, in practical problems, such type of situation rarely occurs. Therefore, in the present paper, a new algorithm has been introduced to construct the membership function and non-membership function of fuzzy reliability of a system having components following different types of intuitionistic fuzzy failure rates. Functions of intuitionistic fuzzy numbers are calculated to construct the membership function and non-membership function of fuzzy reliability via non-linear programming techniques. Using the proposed algorithm, membership functions and non-membership functions of fuzzy reliability of a series system and a parallel systems are constructed. Our study generalizes the various works of the literature. Numerical examples are given to illustrate the proposed algorithm. Copyright © 2011 ISA. Published by Elsevier Ltd. All rights reserved.
Validity and reliability of CHOICE Health Experience Questionnaire: Thai version.
Aiyasanon, Nipa; Premasathian, Nalinee; Nimmannit, Akarin; Jetanavanich, Pantip; Sritippayawan, Suchai
2009-09-01
Assess the reliability and validity of the Thai translation of the CHOICE Health Experience Questionnaire (CHEQ), which is the English-language questionnaire, developed specifically for End-stage-renal disease (ESRD) patients. The CHEQ comprised of two parts, nine general domains of SF-36 (physical function, role-physical, bodily pain, mental health, role-emotional, social function, vitality, general health, and report transition) and 16 dialysis specific domains of the CHEQ (role-physical, mental health, general health, freedom, travel restriction, cognitive function, financial function, restriction diet and fluids, recreation, work, body image, symptoms, sex, sleep, access, and quality of life). The authors translated the CHEQ questionnaire into Thai and confirmed the accuracy by back translation. Pilot study sample was 10 Thai ESRD patients. Then the CHEQ (Thai) was applied to 110 Thai ESRD patients. Twenty-three patients had chronic peritoneal dialysis patients and 87 were chronic intermittent hemodialysis patients. Statistical analysis included descriptive statistics, Mann-Whitney U test, Student's t-test, and Cronbach's alpha. Construct validity was satisfactory with the significant difference less than 0.001 between the low and high group. The reliability coefficient for the Cronbach's alpha of the total scale of the CHEQ (Thai) was 0.98. The Cronbach 's alphas were greater than 0.7 for all domains, range from 0.58 to 0.92, except the social function and quality of life domain (alpha = 0.66 and 0.575). The CHEQ (Thai) is reliable and valid for assessment of Thai ESRD patients receiving chronic dialysis. Its properties are similar to those reported in the original version.
Rakotonarivo, O Sarobidy; Schaafsma, Marije; Hockley, Neal
2016-12-01
While discrete choice experiments (DCEs) are increasingly used in the field of environmental valuation, they remain controversial because of their hypothetical nature and the contested reliability and validity of their results. We systematically reviewed evidence on the validity and reliability of environmental DCEs from the past thirteen years (Jan 2003-February 2016). 107 articles met our inclusion criteria. These studies provide limited and mixed evidence of the reliability and validity of DCE. Valuation results were susceptible to small changes in survey design in 45% of outcomes reporting reliability measures. DCE results were generally consistent with those of other stated preference techniques (convergent validity), but hypothetical bias was common. Evidence supporting theoretical validity (consistency with assumptions of rational choice theory) was limited. In content validity tests, 2-90% of respondents protested against a feature of the survey, and a considerable proportion found DCEs to be incomprehensible or inconsequential (17-40% and 10-62% respectively). DCE remains useful for non-market valuation, but its results should be used with caution. Given the sparse and inconclusive evidence base, we recommend that tests of reliability and validity are more routinely integrated into DCE studies and suggest how this might be achieved. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.
NASA Technical Reports Server (NTRS)
Platt, M. E.; Lewis, E. E.; Boehm, F.
1991-01-01
A Monte Carlo Fortran computer program was developed that uses two variance reduction techniques for computing system reliability applicable to solving very large highly reliable fault-tolerant systems. The program is consistent with the hybrid automated reliability predictor (HARP) code which employs behavioral decomposition and complex fault-error handling models. This new capability is called MC-HARP which efficiently solves reliability models with non-constant failures rates (Weibull). Common mode failure modeling is also a specialty.
ERIC Educational Resources Information Center
Dimitrov, Dimiter M.; Raykov, Tenko; AL-Qataee, Abdullah Ali
2015-01-01
This article is concerned with developing a measure of general academic ability (GAA) for high school graduates who apply to colleges, as well as with the identification of optimal weights of the GAA indicators in a linear combination that yields a composite score with maximal reliability and maximal predictive validity, employing the framework of…
Analysis of the reliability and reproducibility of goniometry compared to hand photogrammetry
de Carvalho, Rosana Martins Ferreira; Mazzer, Nilton; Barbieri, Claudio Henrique
2012-01-01
Objective: To evaluate the intra- and inter-examiner reliability and reproducibility of goniometry in relation to photogrammetry of hand, comparing the angles of thumb abduction, PIP joint flexion of the II finger and MCP joint flexion of the V finger. Methods: The study included 30 volunteers, who were divided into three groups: one group of 10 physiotherapy students, one group of 10 physiotherapists, and a third group of 10 therapists of the hand. Each examiner performed the measurements on the same hand mold, using the goniometer followed by two photogrammetry software programs; CorelDraw® and ALCimagem®. Results: The results revealed that the groups and the methods proposed presented inter-examiner reliability, generally rated as excellent (ICC 0.998 I.C. 95% 0.995 - 0.999). In the intra-examiner evaluation, an excellent level of reliability was found between the three groups. In the comparison between groups for each angle and each method, no significant differences were found between the groups for most of the measurements. Conclusion: Goniometry and photogrammetry are reliable and reproducible methods for evaluating measurements of the hand. However, due to the lack of similar references, detailed studies are needed to define the normal parameters between the methods in the joints of the hand. Level of Evidence II, Diagnostic Study. PMID:24453594
Identifying dyspepsia in the Greek population: translation and validation of a questionnaire.
Anastasiou, Foteini; Antonakis, Nikos; Chaireti, Georgia; Theodorakis, Pavlos N; Lionis, Christos
2006-03-04
Studies on clinical issues, including diagnostic strategies, are considered to be the core content of general practice research. The use of standardised instruments is regarded as an important component for the development of Primary Health Care research capacity. Demand for epidemiological cross-cultural comparisons in the international setting and the use of common instruments and definitions valid to each culture is bigger than ever. Dyspepsia is a common complaint in primary practice but little is known with respect to its incidence in Greece. There are some references about the Helicobacter Pylori infection in patients with functional dyspepsia or gastric ulcer in Greece but there is no specific instrument for the identification of dyspepsia. This paper reports on the validation and translation into Greek, of an English questionnaire for the identification of dyspepsia in the general population and discusses several possibilities of its use in the Greek primary care. The selected English postal questionnaire for the identification of people with dyspepsia in the general population consists of 30 items and was developed in 1995. The translation and cultural adaptation of the questionnaire has been performed according to international standards. For the validation of the instrument the internal consistency of the items was established using the alpha coefficient of Chronbach, the reproducibility (test - retest reliability) was measured by kappa correlation coefficient and the criterion validity was calculated against the diagnosis of the patients' records using also kappa correlation coefficient. The final Greek version of the postal questionnaire for the identification of dyspepsia in the general population was reliably translated. The internal consistency of the questionnaire was good, Chronbach's alpha was found to be 0.88 (95% CI: 0.81-0.93), suggesting that all items were appropriate to measure. Kappa coefficient for reproducibility (test - retest reliability) was found 0.66 (95% CI: 0.62-0.71), whereas the kappa analysis for criterion validity was 0.63 (95% CI: 0.36-0.89). This study indicates that the Greek translation is comparable with the English-language version in terms of validity and reliability, and is suitable for epidemiological research within the Greek primary health care setting.
Reliability program requirements for aeronautical and space system contractors
NASA Technical Reports Server (NTRS)
1987-01-01
General reliability program requirements for NASA contracts involving the design, development, fabrication, test, and/or use of aeronautical and space systems including critical ground support equipment are prescribed. The reliability program requirements require (1) thorough planning and effective management of the reliability effort; (2) definition of the major reliability tasks and their place as an integral part of the design and development process; (3) planning and evaluating the reliability of the system and its elements (including effects of software interfaces) through a program of analysis, review, and test; and (4) timely status indication by formal documentation and other reporting to facilitate control of the reliability program.
Trakman, Gina Louise; Forsyth, Adrienne; Hoye, Russell; Belski, Regina
2018-01-01
The Nutrition for Sport Knowledge Questionnaire (NSKQ) is an 89-item, valid and reliable measure of sports nutrition knowledge (SNK). It takes 25 min to complete and has been subject to low completion and response rates. The aim of this study was to develop an abridged version of the NSKQ (A-NSKQ) and compare response rates, completion rates and NK scores of the NSKQ and A-NSKQ. Rasch analysis was used for the questionnaire validation. The sample ( n = 181) was the same sample that was used in the validation of the full-length NSKQ. Construct validity was assessed using the known-group comparisons method. Temporal stability was assessed using the test-retest reliability method. NK assessment was cross-sectional; responses were collected electronically from members of one non-elite Australian football (AF) and netball club, using Qualtrics Software (Qualtrics, Provo, UT). Validation - The A-NSKQ has 37 items that assess general ( n = 17) and sports ( n = 20) nutrition knowledge (NK). Both sections are unidimensional (Perc5% = 2.84% [general] and 3.41% [sport]). Both sections fit the Rasch Model (overall-interaction statistic mean (SD) = - 0.15 ± 0.96 [general] and 0.22 ± 1.11 [sport]; overall-person interaction statistic mean (SD) = - 0.11 ± 0.61 [general] and 0.08 ± 0.73 [sport]; Chi-Square probability = 0.308 [general] and 0.283 [sport]). Test-retest reliability was confirmed ( r = 0.8, P < 0.001 [general] and r = 0.7, P < 0.001 [sport]). Construct validity was demonstrated (nutrition students = 77% versus non-nutrition students = 60%, P < 0.001 [general] and nutrition students = 60% versus non-nutrition students = 40%, P < 0.001 [sport]. Assessment of NK - 177 usable survey responses from were returned. Response rates were low (7%) but completion rates were high (85%). NK scores on the A-NSKQ (46%) are comparable to results obtained in similar cohorts on the NSKQ (49%). The A-NSKQ took on average 12 min to complete, which is around half the time taken to complete the NSKQ (25 min). The A-NSKQ is a valid and reliable, brief questionnaire designed to assess general NK (GNK) and SNK.
A Model for Estimating the Reliability and Validity of Criterion-Referenced Measures.
ERIC Educational Resources Information Center
Edmonston, Leon P.; Randall, Robert S.
A decision model designed to determine the reliability and validity of criterion referenced measures (CRMs) is presented. General procedures which pertain to the model are discussed as to: Measures of relationship, Reliability, Validity (content, criterion-oriented, and construct validation), and Item Analysis. The decision model is presented in…
A General Approach for Estimating Scale Score Reliability for Panel Survey Data
ERIC Educational Resources Information Center
Biemer, Paul P.; Christ, Sharon L.; Wiesen, Christopher A.
2009-01-01
Scale score measures are ubiquitous in the psychological literature and can be used as both dependent and independent variables in data analysis. Poor reliability of scale score measures leads to inflated standard errors and/or biased estimates, particularly in multivariate analysis. Reliability estimation is usually an integral step to assess…
Wu, X; Lund, M S; Sun, D; Zhang, Q; Su, G
2015-10-01
One of the factors affecting the reliability of genomic prediction is the relationship among the animals of interest. This study investigated the reliability of genomic prediction in various scenarios with regard to the relationship between test and training animals, and among animals within the training data set. Different training data sets were generated from EuroGenomics data and a group of Nordic Holstein bulls (born in 2005 and afterwards) as a common test data set. Genomic breeding values were predicted using a genomic best linear unbiased prediction model and a Bayesian mixture model. The results showed that a closer relationship between test and training animals led to a higher reliability of genomic predictions for the test animals, while a closer relationship among training animals resulted in a lower reliability. In addition, the Bayesian mixture model in general led to a slightly higher reliability of genomic prediction, especially for the scenario of distant relationships between training and test animals. Therefore, to prevent a decrease in reliability, constant updates of the training population with animals from more recent generations are required. Moreover, a training population consisting of less-related animals is favourable for reliability of genomic prediction. © 2015 Blackwell Verlag GmbH.
Jordan, Kelvin; Clarke, Alexandra M; Symmons, Deborah PM; Fleming, Douglas; Porcheret, Mark; Kadam, Umesh T; Croft, Peter
2007-01-01
Background Primary care consultation data are an important source of information on morbidity prevalence. It is not known how reliable such figures are. Aim To compare annual consultation prevalence estimates for musculoskeletal conditions derived from four general practice consultation databases. Design of study Retrospective study of general practice consultation records. Setting Three national general practice consultation databases: i) Fourth Morbidity Statistics from General Practice (MSGP4, 1991/92), ii) Royal College of General Practitioners Weekly Returns Service (RCGP WRS, 2001), and iii) General Practice Research Database (GPRD, 1991 and 2001); and one regional database (Consultations in Primary Care Archive, 2001). Method Age-sex standardised persons consulting annual prevalence rates for musculoskeletal conditions overall, rheumatoid arthritis, osteoarthritis and arthralgia were derived for patients aged 15 years and over. Results GPRD prevalence of any musculoskeletal condition, rheumatoid arthritis and osteoarthritis was lower than that of the other databases. This is likely to be due to GPs not needing to record every consultation made for a chronic condition. MSGP4 gave the highest prevalence for osteoarthritis but low prevalence of arthralgia which reflects encouragement for GPs to use diagnostic rather than symptom codes. Conclusion Considerable variation exists in consultation prevalence estimates for musculoskeletal conditions. Researchers and health service planners should be aware that estimates of disease occurrence based on consultation will be influenced by choice of database. This is likely to be true for other chronic diseases and where alternative symptom labels exist for a disease. RCGP WRS may give the most reliable prevalence figures for musculoskeletal and other chronic diseases. PMID:17244418
5 CFR 841.411 - Appeals procedure.
Code of Federal Regulations, 2011 CFR
2011-01-01
... agency's actuarial analysis are sufficient and reliable (As a general rule, at least 5 years of data... reliable.); (2) The assumptions used in the agency's actuarial analysis are justified; (3) When all...
Inter-arch digital model vs. manual cast measurements: Accuracy and reliability.
Kiviahde, Heikki; Bukovac, Lea; Jussila, Päivi; Pesonen, Paula; Sipilä, Kirsi; Raustia, Aune; Pirttiniemi, Pertti
2017-06-28
The purpose of this study was to evaluate the accuracy and reliability of inter-arch measurements using digital dental models and conventional dental casts. Thirty sets of dental casts with permanent dentition were examined. Manual measurements were done with a digital caliper directly on the dental casts, and digital measurements were made on 3D models by two independent examiners. Intra-class correlation coefficients (ICC), a paired sample t-test or Wilcoxon signed-rank test, and Bland-Altman plots were used to evaluate intra- and inter-examiner error and to determine the accuracy and reliability of the measurements. The ICC values were generally good for manual and excellent for digital measurements. The Bland-Altman plots of all the measurements showed good agreement between the manual and digital methods and excellent inter-examiner agreement using the digital method. Inter-arch occlusal measurements on digital models are accurate and reliable and are superior to manual measurements.
Writing Across the Curriculum: Reliability Testing of a Standardized Rubric.
Minnich, Margo; Kirkpatrick, Amanda J; Goodman, Joely T; Whittaker, Ali; Stanton Chapple, Helen; Schoening, Anne M; Khanna, Maya M
2018-06-01
Rubrics positively affect student academic performance; however, accuracy and consistency of the rubric and its use is imperative. The researchers in this study developed a standardized rubric for use across an undergraduate nursing curriculum, then evaluated the interrater reliability and general usability of the tool. Faculty raters graded papers using the standardized rubric, submitted their independent scoring for interrater reliability analyses, then participated in a focus group discussion regarding rubric use experience. Quantitative analysis of the data showed a high interrater reliability (α = .998). Content analysis of transcription revealed several positive themes: Consistency, Emphasis on Writing Ability, and Ability to Use the Rubric as a Teaching Tool. Areas for improvement included use of value words and difficulty with point allocation. Investigators recommend effective faculty orientation for rubric use and future work in developing a rubric to assess reflective writing. [J Nurs Educ. 2018;57(6):366-370.]. Copyright 2018, SLACK Incorporated.
Development of a scale to assess cancer stigma in the non-patient population.
Marlow, Laura A V; Wardle, Jane
2014-04-23
Illness-related stigma has attracted considerable research interest, but few studies have specifically examined stigmatisation of cancer in the non-patient population. The present study developed and validated a Cancer Stigma Scale (CASS) for use in the general population. An item pool was developed on the basis of previous research into illness-related stigma in the general population and patients with cancer. Two studies were carried out. The first study used Exploratory factor analysis to explore the structure of items in a sample of 462 postgraduate students recruited through a London university. The second study used Confirmatory factor analysis to confirm the structure among 238 adults recruited through an online market research panel. Internal reliability, test-retest reliability and construct validity were also assessed. Exploratory factor analysis suggested six subscales, representing: Awkwardness, Severity, Avoidance, Policy Opposition, Personal Responsibility and Financial Discrimination. Confirmatory factor analysis confirmed this structure with a 25-item scale. All subscales showed adequate to good internal and test-retest reliability in both samples. Construct validity was also good, with mean scores for each subscale varying in the expected directions by age, gender, experience of cancer, awareness of lifestyle risk factors for cancer, and social desirability. Means for the subscales were consistent across the two samples. These findings highlight the complexity of cancer stigma and provide the Cancer Stigma Scale (CASS) which can be used to compare populations, types of cancer and evaluate the effects of interventions designed to reduce cancer stigma in non-patient populations.
NASA Technical Reports Server (NTRS)
Vesely, William E.; Colon, Alfredo E.
2010-01-01
Design Safety/Reliability is associated with the probability of no failure-causing faults existing in a design. Confidence in the non-existence of failure-causing faults is increased by performing tests with no failure. Reliability-Growth testing requirements are based on initial assurance and fault detection probability. Using binomial tables generally gives too many required tests compared to reliability-growth requirements. Reliability-Growth testing requirements are based on reliability principles and factors and should be used.
Development of assessment instruments to measure critical thinking skills
NASA Astrophysics Data System (ADS)
Sumarni, W.; Supardi, K. I.; Widiarti, N.
2018-04-01
Assessment instruments that is commonly used in the school generally have not been orientated on critical thinking skills. The purpose of this research is to develop assessment instruments to measure critical thinking skills, to test validity, reliability, and practicality. This type of research is Research and Development. There are two stages on the preface step, which are field study and literacy study. On the development steps, there some parts, which are 1) instrument construction, 2) expert validity, 3) limited scale tryout and 4) narrow scale try-out. The developed assessment instrument are analysis essay and problem solving. Instruments were declared valid, reliable and practical.
Reliability and Maintainability model (RAM) user and maintenance manual. Part 2
NASA Technical Reports Server (NTRS)
Ebeling, Charles E.
1995-01-01
This report documents the procedures for utilizing and maintaining the Reliability and Maintainability Model (RAM) developed by the University of Dayton for the NASA Langley Research Center (LaRC). The RAM model predicts reliability and maintainability (R&M) parameters for conceptual space vehicles using parametric relationships between vehicle design and performance characteristics and subsystem mean time between maintenance actions (MTBM) and manhours per maintenance action (MH/MA). These parametric relationships were developed using aircraft R&M data from over thirty different military aircraft of all types. This report describes the general methodology used within the model, the execution and computational sequence, the input screens and data, the output displays and reports, and study analyses and procedures. A source listing is provided.
Reliability Concerns in Measuring Respondent Skin Tone by Interviewer Observation
Hannon, Lance; DeFina, Robert
2016-01-01
The current study assesses the intercoder reliability of one of the most important skin tone measurement instruments—the Massey–Martin scale. This scale is used in several high-profile social surveys, but has not yet been psychometrically evaluated. The current evaluation is only possible because, for the first time, the General Social Survey’s 2010–2014 panel used the instrument to guide interviewers’ skin tone observation of the same respondents in two different years (2012 and 2014). Despite the widespread use of the Massey–Martin scale to investigate potential effects of skin tone on social attitudes and outcomes, the data suggest that the measure has low intercoder reliability. Implications for researchers and survey practitioners are discussed. PMID:27274576
Garcia, Danilo; Lundström, Sebastian; Brändström, Sven; Råstam, Maria; Cloninger, C. Robert; Kerekes, Nóra; Nilsson, Thomas; Anckarsäter, Henrik
2013-01-01
Background The Child and Adolescent Twin Study in Sweden (CATSS) is an on-going, large population-based longitudinal twin study. We aimed (1) to investigate the reliability of two different versions (125-items and 238-items) of Cloninger's Temperament and Character Inventory (TCI) used in the CATSS and the validity of extracting the short version from the long version, (2) to compare these personality dimensions between twins and adolescents from the general population, and (3) to investigate the genetic structure of Cloninger's model. Method Reliability and correlation analyses were conducted for both TCI versions, 2,714 CATSS-twins were compared to 631 adolescents from the general population, and the genetic structure was investigated through univariate genetic analyses, using a model-fitting approach with structural equation-modeling techniques based on same-sex twin pairs from the CATSS (423 monozygotic and 408 dizygotic pairs). Results The TCI scores from the short and long versions showed comparable reliability coefficients and were strongly correlated. Twins scored about half a standard deviation higher in the character scales. Three of the four temperament dimensions (Novelty Seeking, Harm Avoidance, and Persistence) had strong genetic and non-shared environmental effects, while Reward Dependence and the three character dimensions had moderate genetic effects, and both shared and non-shared environmental effects. Conclusions Twins showed higher scores in character dimensions compared to adolescents from the general population. At least among adolescents there is a shared environmental influence for all of the character dimensions, but only for one of the temperament dimensions (i.e., Reward Dependence). This specific finding regarding the existence of shared environmental factors behind the character dimensions in adolescence, together with earlier findings showing a small shared environmental effects on character among young adults and no shared environmental effects on character among adults, suggest that there is a shift in type of environmental influence from adolescence to adulthood regarding character. PMID:23940581
NASA Astrophysics Data System (ADS)
Krishnamurthy, Sanjana
This study investigated the impact of different instructional strategies on students' understanding about the cell cycle in a general education biology course. Although several studies have documented gains in students' cell cycle understanding after instruction, these studies generally use only one instructional method, often without a comparison group. The goal of this study was to learn more about students' misconceptions about the cell cycle and how those ideas change after three different evidence-based learning experiences in undergraduate general education. Undergraduate students in six laboratory sections (n = 24; N = 144) in a large public institution in the western United States were surveyed pre- and post-instruction using a 14-item valid and reliable survey of cell cycle knowledge. Cronbach's alpha for the standard scoring convention was 0.264 and for the alternate scoring convention was 0.360, documenting serious problems with inconsistent validity and reliability of the survey. Operating as though the findings are at least a proxy for actual cell cycle knowledge, score comparisons by groups of interest were explored, including pre- and post-instruction differences among demographic groups of interest and three instructional settings: a bead modeling activity, a role-playing game, and 5E instructional strategy. No significant differences were found across groups of interest or by strategy, but some significant item-level differences were found. Implications and discussion of these shifts is noted in lieu of the literature.
Assessing Reliability of Medical Record Reviews for the Detection of Hospital Adverse Events.
Ock, Minsu; Lee, Sang-il; Jo, Min-Woo; Lee, Jin Yong; Kim, Seon-Ha
2015-09-01
The purpose of this study was to assess the inter-rater reliability and intra-rater reliability of medical record review for the detection of hospital adverse events. We conducted two stages retrospective medical records review of a random sample of 96 patients from one acute-care general hospital. The first stage was an explicit patient record review by two nurses to detect the presence of 41 screening criteria (SC). The second stage was an implicit structured review by two physicians to identify the occurrence of adverse events from the positive cases on the SC. The inter-rater reliability of two nurses and that of two physicians were assessed. The intra-rater reliability was also evaluated by using test-retest method at approximately two weeks later. In 84.2% of the patient medical records, the nurses agreed as to the necessity for the second stage review (kappa, 0.68; 95% confidence interval [CI], 0.54 to 0.83). In 93.0% of the patient medical records screened by nurses, the physicians agreed about the absence or presence of adverse events (kappa, 0.71; 95% CI, 0.44 to 0.97). When assessing intra-rater reliability, the kappa indices of two nurses were 0.54 (95% CI, 0.31 to 0.77) and 0.67 (95% CI, 0.47 to 0.87), whereas those of two physicians were 0.87 (95% CI, 0.62 to 1.00) and 0.37 (95% CI, -0.16 to 0.89). In this study, the medical record review for detecting adverse events showed intermediate to good level of inter-rater and intra-rater reliability. Well organized training program for reviewers and clearly defining SC are required to get more reliable results in the hospital adverse event study.
ERIC Educational Resources Information Center
Wilhelm, Anne Garrison; Kim, Sungyeun
2015-01-01
One crucial question for researchers who study teachers' classroom practice is how to maximize information about what is happening in classrooms while minimizing costs. This report extends prior studies of the reliability of the Instructional Quality Assessment (IQA), a widely used classroom observation toolkit, and offers insight into the often…
Measurement of General and Specific Approaches to Physical Activity Parenting: A Systematic Review
McDonald, Samantha; Cohen, Alysia
2013-01-01
Abstract Background Parents play a significant role in shaping youth physical activity (PA). However, interventions targeting PA parenting have been ineffective. Methodological inconsistencies related to the measurement of parental influences may be a contributing factor. The purpose of this article is to review the extant peer-reviewed literature related to the measurement of general and specific parental influences on youth PA. Methods A systematic review of studies measuring constructs of PA parenting was conducted. Computerized searches were completed using PubMed, MEDLINE, Academic Search Premier, SPORTDiscus, and PsycINFO. Reference lists of the identified articles were manually reviewed as well as the authors' personal collections. Articles were selected on the basis of strict inclusion criteria and details regarding the measurement protocols were extracted. A total of 117 articles met the inclusionary criteria. Methodological articles that evaluated the validity and reliability of PA parenting measures (n=10) were reviewed separately from parental influence articles (n=107). Results A significant percentage of studies used measures with indeterminate validity and reliability. A significant percentage of articles did not provide sample items, describe the response format, or report the possible range of scores. No studies were located that evaluated sensitivity to change. Conclusion The reporting of measurement properties and the use of valid and reliable measurement scales need to be improved considerably. PMID:23944923
Parallelizing Timed Petri Net simulations
NASA Technical Reports Server (NTRS)
Nicol, David M.
1993-01-01
The possibility of using parallel processing to accelerate the simulation of Timed Petri Nets (TPN's) was studied. It was recognized that complex system development tools often transform system descriptions into TPN's or TPN-like models, which are then simulated to obtain information about system behavior. Viewed this way, it was important that the parallelization of TPN's be as automatic as possible, to admit the possibility of the parallelization being embedded in the system design tool. Later years of the grant were devoted to examining the problem of joint performance and reliability analysis, to explore whether both types of analysis could be accomplished within a single framework. In this final report, the results of our studies are summarized. We believe that the problem of parallelizing TPN's automatically for MIMD architectures has been almost completely solved for a large and important class of problems. Our initial investigations into joint performance/reliability analysis are two-fold; it was shown that Monte Carlo simulation, with importance sampling, offers promise of joint analysis in the context of a single tool, and methods for the parallel simulation of general Continuous Time Markov Chains, a model framework within which joint performance/reliability models can be cast, were developed. However, very much more work is needed to determine the scope and generality of these approaches. The results obtained in our two studies, future directions for this type of work, and a list of publications are included.
Edgren, Robert; Castrén, Sari; Mäkelä, Marjukka; Pörtfors, Pia; Alho, Hannu; Salonen, Anne H
2016-06-01
This review aims to clarify which instruments measuring at-risk and problem gambling (ARPG) among youth are reliable and valid in light of reported estimates of internal consistency, classification accuracy, and psychometric properties. A systematic search was conducted in PubMed, Medline, and PsycInfo covering the years 2009-2015. In total, 50 original research articles fulfilled the inclusion criteria: target age under 29 years, using an instrument designed for youth, and reporting a reliability estimate. Articles were evaluated with the revised Quality Assessment of Diagnostic Accuracy Studies tool. Reliability estimates were reported for five ARPG instruments. Most studies (66%) evaluated the South Oaks Gambling Screen Revised for Adolescents. The Gambling Addictive Behavior Scale for Adolescents was the only novel instrument. In general, the evaluation of instrument reliability was superficial. Despite its rare use, the Canadian Adolescent Gambling Inventory (CAGI) had a strong theoretical and methodological base. The Gambling Addictive Behavior Scale for Adolescents and the CAGI were the only instruments originally developed for youth. All studies, except the CAGI study, were population based. ARPG instruments for youth have not been rigorously evaluated yet. Further research is needed especially concerning instruments designed for clinical use. Copyright © 2016 The Society for Adolescent Health and Medicine. Published by Elsevier Inc. All rights reserved.
Hasanpour, Neda; Attarbashi Moghadam, Behrouz; Sami, Ramin; Tavakol, Kamran
2016-08-01
The clinical COPD questionnaire (CCQ) has been developed to measure the health status of COPD patients. The aim of this study was to translate CCQ into the Persian language and assess the validity and reliability of the translated version. We used a forward-backward procedure to translate the questionnaire. In a cross-sectional study 100 COPD patients and 50 healthy subjects over 40 years old were selected to assess the reliability and construct validity of the instrument. The face and content validity were used for the questionnaire validity. Validity was examined in a population of patients with COPD, using the Persian validated version of the St George's Respiratory Questionnaire (PSGRQ). In order to assess the questionnaire's reliability, the Intraclass correlation coefficient (ICC) and Cronbach's alpha were calculated. Test-retest reliability was tested by re-administering the Persian version of the CCQ (PCCQ) after 1 week. Test-retest carry out of data demonstrates that the PCCQ has excellent reliability (ICC for all 3 domains were higher than 0.9). Internal consistency was found by Cronbach's alpha to be 0.96, 0.94, 0.97, and 0.98 for the symptom, mental state, functional state and total scores respectively. In addition, the correlation between the components of PCCQ and PSGRQ showed satisfactory construct validity. Analyzing the data from healthy subjects and patients divulged that the PCCQ has acceptable discriminant validity. In general, the PCCQ had satisfactory reliability and validity for assessing health-related quality of life status of Iranian COPD patients.
Reliability of an fMRI Paradigm for Emotional Processing in a Multisite Longitudinal Study
Gee, Dylan G.; McEwen, Sarah C.; Forsyth, Jennifer K.; Haut, Kristen M.; Bearden, Carrie E.; Addington, Jean; Goodyear, Bradley; Cadenhead, Kristin S.; Mirzakhanian, Heline; Cornblatt, Barbara A.; Olvet, Doreen; Mathalon, Daniel H.; McGlashan, Thomas H.; Perkins, Diana O.; Belger, Aysenil; Seidman, Larry J.; Thermenos, Heidi; Tsuang, Ming T.; van Erp, Theo G.M.; Walker, Elaine F.; Hamann, Stephan; Woods, Scott W.; Constable, Todd; Cannon, Tyrone D.
2015-01-01
Multisite neuroimaging studies can facilitate the investigation of brain-related changes in many contexts, including patient groups that are relatively rare in the general population. Though multisite studies have characterized the reliability of brain activation during working memory and motor functional magnetic resonance imaging tasks, emotion processing tasks, pertinent to many clinical populations, remain less explored. A traveling participants study was conducted with eight healthy volunteers scanned twice on consecutive days at each of the eight North American Longitudinal Prodrome Study sites. Tests derived from generalizability theory showed excellent reliability in the amygdala (Eρ2=0.82), inferior frontal gyrus (IFG;Eρ2=0.83), anterior cingulate cortex (ACC;Eρ2=0.76), insula (Eρ2=0.85), and fusiform gyrus (Eρ2=0.91) for maximum activation and fair to excellent reliability in the amygdala (Eρ2=0.44), IFG (Eρ2=0.48), ACC (Eρ2=0.55), insula (Eρ2=0.42), and fusiform gyrus (Eρ2=0.83) for mean activation across sites and test days. For the amygdala, habituation (Eρ2=0.71) was more stable than mean activation. In a second investigation, data from 111 healthy individuals across sites were aggregated in a voxelwise, quantitative meta-analysis. When compared with a mixed effects model controlling for site, both approaches identified robust activation in regions consistent with expected results based on prior single-site research. Overall, regions central to emotion processing showed strong reliability in the traveling participants study and robust activation in the aggregation study. These results support the reliability of blood oxygen level-dependent signal in emotion processing areas across different sites and scanners and may inform future efforts to increase efficiency and enhance knowledge of rare conditions in the population through multisite neuroimaging paradigms. PMID:25821147
Failure-Time Distribution Of An m-Out-of-n System
NASA Technical Reports Server (NTRS)
Scheuer, Ernest M.
1988-01-01
Formulas for reliability extended to more general cases. Useful in analyses of reliabilities of practical systems and structures, especially of redundant systems of identical components, among which operating loads distributed equally.
Golden, Sherita Hill; Sánchez, Brisa N.; DeSantis, Amy S.; Wu, Meihua; Castro, Cecilia; Seeman, Teresa E.; Tadros, Sameh; Shrager, Sandi; Diez Roux, Ana V.
2014-01-01
Collection of salivary cortisol has become increasingly popular in large population-based studies. However, the impact of protocol compliance on day-to-day reliabilities of measures, and the extent to which reliabilities differ systematically according to socio-demographic characteristics, has not been well characterized in large-scale population-based studies to date. Using data on 935 men and women from the Multi-ethnic Study of Atherosclerosis, we investigated whether sampling protocol compliance differs systematically according to socio-demographic factors and whether compliance was associated with cortisol estimates, as well as whether associations of cortisol with both compliance and socio-demographic characteristics were robust to adjustments for one another. We further assessed the day-to-day reliability for cortisol features and the extent to which reliabilities vary according to socio-demographic factors and sampling protocol compliance. Overall, we found higher compliance among persons with higher levels of income and education. Lower compliance was significantly associated with a less pronounced cortisol awakening response (CAR) but was not associated with any other cortisol features, and adjustment for compliance did not affect associations of socio-demographic characteristics with cortisol. Reliability was higher for area under the curve (AUC) and wake up values than for other features, but generally did not vary according to socio-demographic characteristics, with few exceptions. Our findings regarding intra-class correlation coefficients (ICCs) support prior research indicating that multiple day collection is preferable to single day collection, particularly for CAR and slopes, more so than wakeup and AUC. There were few differences in reliability by socio-demographic characteristics. Thus, it is unlikely that group-specific sampling protocols are warranted. PMID:24703168
García-Ramos, Amador; Haff, Guy Gregory; Pestaña-Melero, Francisco Luis; Pérez-Castilla, Alejandro; Rojas, Francisco Javier; Balsalobre-Fernández, Carlos; Jaric, Slobodan
2017-09-05
This study compared the concurrent validity and reliability of previously proposed generalized group equations for estimating the bench press (BP) one-repetition maximum (1RM) with the individualized load-velocity relationship modelled with a two-point method. Thirty men (BP 1RM relative to body mass: 1.08 0.18 kg·kg -1 ) performed two incremental loading tests in the concentric-only BP exercise and another two in the eccentric-concentric BP exercise to assess their actual 1RM and load-velocity relationships. A high velocity (≈ 1 m·s -1 ) and a low velocity (≈ 0.5 m·s -1 ) was selected from their load-velocity relationships to estimate the 1RM from generalized group equations and through an individual linear model obtained from the two velocities. The directly measured 1RM was highly correlated with all predicted 1RMs (r range: 0.847-0.977). The generalized group equations systematically underestimated the actual 1RM when predicted from the concentric-only BP (P <0.001; effect size [ES] range: 0.15-0.94), but overestimated it when predicted from the eccentric-concentric BP (P <0.001; ES range: 0.36-0.98). Conversely, a low systematic bias (range: -2.3-0.5 kg) and random errors (range: 3.0-3.8 kg), no heteroscedasticity of errors (r 2 range: 0.053-0.082), and trivial ES (range: -0.17-0.04) were observed when the prediction was based on the two-point method. Although all examined methods reported the 1RM with high reliability (CV≤5.1%; ICC≥0.89), the direct method was the most reliable (CV<2.0%; ICC≥0.98). The quick, fatigue-free, and practical two-point method was able to predict the BP 1RM with high reliability and practically perfect validity, and therefore we recommend its use over generalized group equations.
Reliability and validity of the Parenting Scale of Inconsistency.
Yoshizumi, Takahiro; Murase, Satomi; Murakami, Takashi; Takai, Jiro
2006-08-01
The purposes of the present study were to develop a Parenting Scale of Inconsistency and to evaluate its initial reliability and validity. The 12 items assess the inconsistency among parents' moods, behaviors, and attitudes toward children. In the primary study, 517 participants completed three measures: the new Parenting Scale of Inconsistency, the Parental Bonding Instrument, and the Depression Scale of the General Health Questionnaire. The Parenting Scale of Inconsistency had good test-retest reliability of .85 and internal consistency of .88 (Cronbach coefficient alpha). Construct validity was good as Inconsistency scores were significantly correlated with the Care and Overprotection scores of the Parental Bonding Instrument and with the Depression scores. Moreover, Inconsistency scores' relation with a dimension of parenting style distinct from Care and Overprotection suggested that the Parenting Scale of Inconsistency had factorial validity. This scale seems a potential measure for examining the relationships between inconsistent parenting and the mental health of children.
Design of low-cost general purpose microcontroller based neuromuscular stimulator.
Koçer, S; Rahmi Canal, M; Güler, I
2000-04-01
In this study, a general purpose, low-cost, programmable, portable and high performance stimulator is designed and implemented. For this purpose, a microcontroller is used in the design of the stimulator. The duty cycle and amplitude of the designed system can be controlled using a keyboard. The performance test of the system has shown that the results are reliable. The overall system can be used as the neuromuscular stimulator under safe conditions.
ERIC Educational Resources Information Center
Rogers, Katherine D.; Young, Alys; Lovell, Karina; Campbell, Malcolm; Scott, Paul R.; Kendal, Sarah
2013-01-01
The present study is aimed to translate 3 widely used clinical assessment measures into British Sign Language (BSL), to pilot the BSL versions, and to establish their validity and reliability. These were the Patient Health Questionnaire (PHQ-9), the Generalized Anxiety Disorder 7-item (GAD-7) scale, and the Work and Social Adjustment Scale (WSAS).…
Gobbi, Erica; Elliot, Catherine; Varnier, Maurizio; Carraro, Attilio
2016-01-01
The purpose of this research was to assess an Italian version of the Physical Activity Questionnaire for Older Children (PAQ-C-It). Three separate studies were conducted, whereby testing general psychometric properties, construct validity, concurrent validity and the factor structure of the PAQ-C-It among general and clinical pediatric population. Study 1 (n = 1170) examined the psychometric properties, internal consistency, factor structure (exploratory factor analysis, EFA) and construct validity with enjoyment perception during physical activity. Study 2 (n = 59) reported on reliability, construct validity with enjoyment and BMI, and on cross-sectional concurrent validity with objectively measured MVPA (tri-axial accelerometry) over the span of seven consecutive days. Study 3 (n = 58) examined the PAQ-C-It reliability, construct validity with BMI and VO2max as the objective measurement among a population of children with congenital heart defects (CHD). In study 2 and 3, the factor structure of the PAQ-C-It was then re-examined with an EFA. The PAQ-C-It showed acceptable to good reliability (alpha .70 to .83). Results on construct validity showed moderate but significant association with enjoyment perception (r = .30 and .36), with BMI (r = -.30 and -.79 for CHD simple form), and with the VO2max (r = .55 for CHD simple form). Significant concurrent validity with the objectively measured MVPA was reported (rho = .30, p < .05). Findings of the EFA suggested a two-factor structure for the PAQ-C-It, with items 2, 3, and 4 contributing little to the total score. This study supports the PAQ-C-It as an appropriate instrument to assess the MVPA levels of Italian children, including children with simple forms of CHD. Support is given to the possible instrument effectiveness on a large international perspective in order to level out data gathering across the globe.
Gobbi, Erica; Elliot, Catherine; Varnier, Maurizio; Carraro, Attilio
2016-01-01
The purpose of this research was to assess an Italian version of the Physical Activity Questionnaire for Older Children (PAQ-C-It). Three separate studies were conducted, whereby testing general psychometric properties, construct validity, concurrent validity and the factor structure of the PAQ-C-It among general and clinical pediatric population. Study 1 (n = 1170) examined the psychometric properties, internal consistency, factor structure (exploratory factor analysis, EFA) and construct validity with enjoyment perception during physical activity. Study 2 (n = 59) reported on reliability, construct validity with enjoyment and BMI, and on cross-sectional concurrent validity with objectively measured MVPA (tri-axial accelerometry) over the span of seven consecutive days. Study 3 (n = 58) examined the PAQ-C-It reliability, construct validity with BMI and VO2max as the objective measurement among a population of children with congenital heart defects (CHD). In study 2 and 3, the factor structure of the PAQ-C-It was then re-examined with an EFA. The PAQ-C-It showed acceptable to good reliability (alpha .70 to .83). Results on construct validity showed moderate but significant association with enjoyment perception (r = .30 and .36), with BMI (r = -.30 and -.79 for CHD simple form), and with the VO2max (r = .55 for CHD simple form). Significant concurrent validity with the objectively measured MVPA was reported (rho = .30, p < .05). Findings of the EFA suggested a two-factor structure for the PAQ-C-It, with items 2, 3, and 4 contributing little to the total score. This study supports the PAQ-C-It as an appropriate instrument to assess the MVPA levels of Italian children, including children with simple forms of CHD. Support is given to the possible instrument effectiveness on a large international perspective in order to level out data gathering across the globe. PMID:27228050
The development and validation of the Perceived Health Competence Scale.
Smith, M S; Wallston, K A; Smith, C A
1995-03-01
A sense of competence or self-efficacy is associated with many positive outcomes, particularly in the area of health behavior. A measure of a sense of competence in the domain of health behavior has not been developed. Most measures are either general measures of a general sense of self-efficacy or are very specific to a particular health behavior. The Perceived Health Competence Scale (PHCS), a domain-specific measure of the degree to which an individual feels capable of effectively managing his or her health outcomes, was developed to provide a measure of perceived competence at an intermediate level of specificity. Five studies using three different types of samples (students, adults and persons with a chronic illness) provide evidence for the reliability and validity of the PHCS. The eight items of the PHCS combine both outcome and behavioral expectancies. Results from the five studies indicate that the scale has good internal consistency and test-retest reliability. The construct validity of the scale is demonstrated through the support obtained for substantive hypotheses regarding the correlates of perceived health competence, such as health behavior intentions, general sense of competence and health locus of control.
Calella, Patrizia; Iacullo, Vittorio Maria; Valerio, Giuliana
2017-04-29
Good knowledge of nutrition is widely thought to be an important aspect to maintaining a balanced and healthy diet. The aim of this study was to develop and validate a new reliable tool to measure the general and the sport nutrition knowledge (GeSNK) in people who used to practice sports at different levels. The development of (GeSNK) was carried out in six phases as follows: (1) item development and selection by a panel of experts; (2) pilot study in order to assess item difficulty and item discrimination; (3) measurement of the internal consistency; (4) reliability assessment with a 2-week test-retest analysis; (5) concurrent validity was tested by administering the questionnaire along with other two similar tools; (6) construct validity by administering the questionnaire to three groups of young adults with different general nutrition and sport nutrition knowledge. The final questionnaire, consisted of 62 items of the original 183 questions. It is a consistent, valid, and suitable instrument that can be applied over time, making it a promising tool to look at the relationship between nutrition knowledge, demographic characteristics, and dietary behavior in adolescents and young adults.
Liem, Franziskus; Mérillat, Susan; Bezzola, Ladina; Hirsiger, Sarah; Philipp, Michel; Madhyastha, Tara; Jäncke, Lutz
2015-03-01
FreeSurfer is a tool to quantify cortical and subcortical brain anatomy automatically and noninvasively. Previous studies have reported reliability and statistical power analyses in relatively small samples or only selected one aspect of brain anatomy. Here, we investigated reliability and statistical power of cortical thickness, surface area, volume, and the volume of subcortical structures in a large sample (N=189) of healthy elderly subjects (64+ years). Reliability (intraclass correlation coefficient) of cortical and subcortical parameters is generally high (cortical: ICCs>0.87, subcortical: ICCs>0.95). Surface-based smoothing increases reliability of cortical thickness maps, while it decreases reliability of cortical surface area and volume. Nevertheless, statistical power of all measures benefits from smoothing. When aiming to detect a 10% difference between groups, the number of subjects required to test effects with sufficient power over the entire cortex varies between cortical measures (cortical thickness: N=39, surface area: N=21, volume: N=81; 10mm smoothing, power=0.8, α=0.05). For subcortical regions this number is between 16 and 76 subjects, depending on the region. We also demonstrate the advantage of within-subject designs over between-subject designs. Furthermore, we publicly provide a tool that allows researchers to perform a priori power analysis and sensitivity analysis to help evaluate previously published studies and to design future studies with sufficient statistical power. Copyright © 2014 Elsevier Inc. All rights reserved.
Gutiérrez-Vilahú, Lourdes; Massó-Ortigosa, Núria; Rey-Abella, Ferran; Costa-Tutusaus, Lluís; Guerra-Balic, Myriam
2016-05-01
People with Down syndrome present skeletal abnormalities in their feet that can be analyzed by commonly used gold standard indices (the Hernández-Corvo index, the Chippaux-Smirak index, the Staheli arch index, and the Clarke angle) based on footprint measurements. The use of Photoshop CS5 software (Adobe Systems Software Ireland Ltd, Dublin, Ireland) to measure footprints has been validated in the general population. The present study aimed to assess the reliability and validity of this footprint assessment technique in the population with Down syndrome. Using optical podography and photography, 44 footprints from 22 patients with Down syndrome (11 men [mean ± SD age, 23.82 ± 3.12 years] and 11 women [mean ± SD age, 24.82 ± 6.81 years]) were recorded in a static bipedal standing position. A blinded observer performed the measurements using a validated manual method three times during the 4-month study, with 2 months between measurements. Test-retest was used to check the reliability of the Photoshop CS5 software measurements. Validity and reliability were obtained by intraclass correlation coefficient (ICC). The reliability test for all of the indices showed very good values for the Photoshop CS5 method (ICC, 0.982-0.995). Validity testing also found no differences between the techniques (ICC, 0.988-0.999). The Photoshop CS5 software method is reliable and valid for the study of footprints in young people with Down syndrome.
Surgery resident selection and evaluation. A critical incident study.
Edwards, J C; Currie, M L; Wade, T P; Kaminski, D L
1993-03-01
This article reports a study of the process of selecting and evaluating general surgery residents. In personnel psychology terms, a job analysis of general surgery was conducted using the Critical Incident Technique (CIT). The researchers collected 235 critical incidents through structured interviews with 10 general surgery faculty members and four senior residents. The researchers then directed the surgeons in a two-step process of sorting the incidents into categories and naming the categories. The final essential categories of behavior to define surgical competence were derived through discussion among the surgeons until a consensus was formed. Those categories are knowledge/self-education, clinical performance, diagnostic skills, surgical skills, communication skills, reliability, integrity, compassion, organization skills, motivation, emotional control, and personal appearance. These categories were then used to develop an interview evaluation form for selection purposes and a performance evaluation form to be used throughout residency training. Thus a continuum of evaluation was established. The categories and critical incidents were also used to structure the interview process, which has demonstrated increased interview validity and reliability in many other studies. A handbook for structuring the interviews faculty members conduct with applicants was written, and an interview training session was held with the faculty. The process of implementation of the structured selection interviews is being documented currently through qualitative research.
Cross-institutional stability of behavioral criteria desirable for success in radiology residency.
Altmaier, E; Smith, W L; Wood, P; Ross, R; Montgomery, W J; Klattee, E; Imray, T; Shields, J; Franken, E A
1989-03-01
Certain dimensions of job performance are critical to radiology residents, and several of these dimensions are noncognitive in nature (eg, interpersonal skills, conscientiousness, recognition of limits). Our initial study examined these factors in only one residency program, so the general nature of these dimensions must be documented. The current study was a cross institutional analysis involving 31 faculty radiologists at three separate academic institutions (82% of total faculty) who participated in a critical incident interview to obtain data on important resident behaviors and attitudes. The resultant 172 incidents were sorted by two physicians into the six categories (knowledge, technical skills, attitudes toward self and [both recognitions of limits and confidence in abilities], conscientiousness, curiosity, and interpersonal skills); inter-rater reliability was 92%, kappa = .89. A Chi square analysis revealed similar distributions of incidents across categories (x2 12 = 17.22) among the three institutions, supporting the general reliability of these dimensions across the institutions studied. Further, the distributions of these incidents demonstrated that the noncognitive dimensions again were given considerable importance by faculty radiologists. For example, more than 40% of the critical incidents pertained to the conscientiousness dimension. These findings documented the generalization of these behavioral dimensions across several sites and support their importance in selection and evaluation of residents.
We will make you like our research: The development of a susceptibility-to-persuasion scale
Modic, David; Anderson, Ross
2018-01-01
Psychological and other persuasive mechanisms across diverse contexts are well researched, with many studies of the effectiveness of specific persuasive techniques on distinct types of human behaviour. In the present paper, our specific interest lies in the development of a generalized modular psychometric tool to measure individuals’ susceptibility to persuasion. The scale is constructed using items from previously developed and validated particulate scales established in the domains of social psychology and behavioural economics. In the first study we establish the Susceptibility to Persuasion–II (StP-II) scale, containing 54 items, 10 subscales and further 6 sub-sub scales. In Study 2 we establish the scale’s construct validity and reconfirm its reliability. We present a valid and reliable modular psychometric tool that measures general susceptibility to persuasive techniques. Since its inception, we have successfully implemented the StP-II scale to measure susceptibility to persuasion of IT security officers, the role of psychology of persuasion in cybercrime victims and general persuadability levels of Facebook users; these manuscripts are in preparation. We argue that the StP-II scale shows promise in measuring individual differences in susceptibility to persuasion, and is applicable across diverse contexts such as Internet security and cybercrime. PMID:29543845
Mixed Phylogenetic Signal in Fish Toxicity Data across Chemical Classes
Chemical use in society is growing rapidly and is one of the five major pressures on biodiversity worldwide. Since empirical toxicity studies of pollutants generally focus on a handful of model organisms, reliable approaches are needed to assess sensitivity to chemicals across th...
Kaminer, Y; Blitz, C; Burleson, J A; Kadden, R M; Rounsaville, B J
1998-07-01
The state of the art for treatment efficacy studies now requires manual guided treatments and tests of therapist adherence. This report provides findings regarding adherence assessment of therapists participating in an investigation of treatment matching in adolescent substance abusers. The Group Sessions Rating Scale (GSRS), a group-therapy process measure, was studied to determine its appropriateness for assessing group treatment of adolescents with a) substance use disorders (SUD), b) interrater reliability, c) internal consistency, and d) ability to discriminate the active ingredients of cognitive-behavioral therapy (CBT) from interactional therapy (IT). Interrater reliabilities were moderate to high, with those for CBT generally higher than those for IT. Internal consistency of CBT items was moderate, whereas those of IT were moderately high. Discriminability between the two treatment modalities was high. The frequency of active ingredients was generally therapy-specific: high for the relevant and low for the nonrelevant therapeutic modality items. The GSRS was found to be effective in the measurement of treatment process in adolescents with SUD.
Leboeuf, C; Love, A; Crisp, T C
1989-04-01
The subjective complaints of 41 chronic low back pain sufferers attending a chiropractic clinic were assessed twice prior to therapy with a widely used psychological self-report assessment tool, the Middlesex Hospital Questionnaire (MHQ) and a newly developed VAS Disability Scales Questionnaire (DISQ), both of which investigate various aspects of certain basic positions and activities. Reliability was generally acceptable with these two questionnaires. Subjects participating in the study were commonly found to score within the normal range on the MHQ, indicating that psychological disturbance was not a major feature of their presentation. However, mild mood disturbance was commonly reported, and a more sensitive tool may need to be developed for this type of mildly affected chronic low back pain sufferers. The DISQ generally indicated subjects were mildly to moderately affected by their low back trouble and that sitting and leisure activities were the most pain provoking. Recommendations for further development of the disability scale are made.
Mindful attention and awareness: relationships with psychopathology and emotion regulation.
Gregório, Sónia; Pinto-Gouveia, José
2013-01-01
The growing interest in mindfulness from the scientific community has originated several self-report measures of this psychological construct. The Mindful Attention and Awareness Scale (MAAS) is a self-report measure of mindfulness at a trait-level. This paper aims at exploring MAAS psychometric characteristics and validating it for the Portuguese population. The first two studies replicate some of the original author's statistical procedures in two different samples from the Portuguese general community population, in particular confirmatory factor analyses. Results from both analyses confirmed the scale single-factor structure and indicated a very good reliability. Moreover, cross-validation statistics showed that this single-factor structure is valid for different respondents from the general community population. In the third study the Portuguese version of the MAAS was found to have good convergent and discriminant validities. Overall the findings support the psychometric validity of the Portuguese version of MAAS and suggest this is a reliable self-report measure of trait-mindfulness, a central construct in Clinical Psychology research and intervention fields.
Pinto-Gouveia, José; Carvalho, Teresa; Cunha, Marina; Duarte, Joana; Walser, Robyn D
2015-10-01
The Acceptance and Action Questionnaire-Trauma Specific (AAQ-TS) is a self-report measure designed to assess-trauma-related psychological (in)flexibility, as conceptualized in Acceptance and Commitment Therapy. However, there are no studies to date regarding its psychometric properties. This study explores such properties in the Portuguese version of the AAQ-TS, in Portuguese Colonial War Veterans. A Principal Components Analysis (PCA) was conducted in a sample from the general population of war Veterans (N=371). Confirmatory Factor Analysis (CFA) as well as reliability and convergent validity studies were performed in a different sample from the same population (N=312). For the discriminant validity a clinical sample with a war-related PTSD (N=42) and a non-clinical sample without PTSD (N=44) were used. The CFA suggested a re-specified 15-item model with good global adjustment and factorial validity. The AAQ-TS showed internal consistency, a good temporal reliability, convergent validity with psychopathological symptoms (related to PTSD, anxiety, depression and stress) and peritraumatic dissociation (altered awareness and depersonalization/derealization). The questionnaire also discriminates between war Veterans with and without a PTSD diagnosis. The major limitation relates to the samples' characteristics and sampling methods, which can limit the generalization of results. The Portuguese version of the AAQ-TS is a reliable and valid measure to assess experiential avoidance related to trauma in Portuguese Colonial War Veterans. Copyright © 2015 Elsevier B.V. All rights reserved.
Assessment of the reliability of protein-protein interactions and protein function prediction.
Deng, Minghua; Sun, Fengzhu; Chen, Ting
2003-01-01
As more and more high-throughput protein-protein interaction data are collected, the task of estimating the reliability of different data sets becomes increasingly important. In this paper, we present our study of two groups of protein-protein interaction data, the physical interaction data and the protein complex data, and estimate the reliability of these data sets using three different measurements: (1) the distribution of gene expression correlation coefficients, (2) the reliability based on gene expression correlation coefficients, and (3) the accuracy of protein function predictions. We develop a maximum likelihood method to estimate the reliability of protein interaction data sets according to the distribution of correlation coefficients of gene expression profiles of putative interacting protein pairs. The results of the three measurements are consistent with each other. The MIPS protein complex data have the highest mean gene expression correlation coefficients (0.256) and the highest accuracy in predicting protein functions (70% sensitivity and specificity), while Ito's Yeast two-hybrid data have the lowest mean (0.041) and the lowest accuracy (15% sensitivity and specificity). Uetz's data are more reliable than Ito's data in all three measurements, and the TAP protein complex data are more reliable than the HMS-PCI data in all three measurements as well. The complex data sets generally perform better in function predictions than do the physical interaction data sets. Proteins in complexes are shown to be more highly correlated in gene expression. The results confirm that the components of a protein complex can be assigned to functions that the complex carries out within a cell. There are three interaction data sets different from the above two groups: the genetic interaction data, the in-silico data and the syn-express data. Their capability of predicting protein functions generally falls between that of the Y2H data and that of the MIPS protein complex data. The supplementary information is available at the following Web site: http://www-hto.usc.edu/-msms/AssessInteraction/.
A general graphical user interface for automatic reliability modeling
NASA Technical Reports Server (NTRS)
Liceaga, Carlos A.; Siewiorek, Daniel P.
1991-01-01
Reported here is a general Graphical User Interface (GUI) for automatic reliability modeling of Processor Memory Switch (PMS) structures using a Markov model. This GUI is based on a hierarchy of windows. One window has graphical editing capabilities for specifying the system's communication structure, hierarchy, reconfiguration capabilities, and requirements. Other windows have field texts, popup menus, and buttons for specifying parameters and selecting actions. An example application of the GUI is given.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Peters, Valerie A.; Ogilvie, Alistair B.
2012-01-01
This report addresses the general data requirements for reliability analysis of fielded wind turbines and other wind plant equipment. The report provides a rationale for why this data should be collected, a list of the data needed to support reliability and availability analysis, and specific data recommendations for a Computerized Maintenance Management System (CMMS) to support automated analysis. This data collection recommendations report was written by Sandia National Laboratories to address the general data requirements for reliability analysis of operating wind turbines. This report is intended to help develop a basic understanding of the data needed for reliability analysis frommore » a Computerized Maintenance Management System (CMMS) and other data systems. The report provides a rationale for why this data should be collected, a list of the data needed to support reliability and availability analysis, and specific recommendations for a CMMS to support automated analysis. Though written for reliability analysis of wind turbines, much of the information is applicable to a wider variety of equipment and analysis and reporting needs. The 'Motivation' section of this report provides a rationale for collecting and analyzing field data for reliability analysis. The benefits of this type of effort can include increased energy delivered, decreased operating costs, enhanced preventive maintenance schedules, solutions to issues with the largest payback, and identification of early failure indicators.« less
A new tool to evaluate postgraduate training posts: the Job Evaluation Survey Tool (JEST).
Wall, David; Goodyear, Helen; Singh, Baldev; Whitehouse, Andrew; Hughes, Elizabeth; Howes, Jonathan
2014-10-02
Three reports in 2013 about healthcare and patient safety in the UK, namely Berwick, Francis and Keogh have highlighted the need for junior doctors' views about their training experience to be heard. In the UK, the General Medical Council (GMC) quality assures medical training programmes and requires postgraduate deaneries to undertake quality management and monitoring of all training posts in their area. The aim of this study was to develop a simple trainee questionnaire for evaluation of postgraduate training posts based on the GMC, UK standards and to look at the reliability and validity including comparison with a well-established and internationally validated tool, the Postgraduate Hospital Educational Environment Measure (PHEEM). The Job Evaluation Survey Tool (JEST), a fifteen item job evaluation questionnaire was drawn up in 2006, piloted with Foundation doctors (2007), field tested with specialist paediatric registrars (2008) and used over a three year period (2008-11) by Foundation Doctors. Statistical analyses including descriptives, reliability, correlation and factor analysis were undertaken and JEST compared with PHEEM. The JEST had a reliability of 0.91 in the pilot study of 76 Foundation doctors, 0.88 in field testing of 173 Paediatric specialist registrars and 0.91 in three years of general use in foundation training with 3367 doctors completing JEST. Correlation of JEST with PHEEM was 0.80 (p < 0.001). Factor analysis showed two factors, a teaching factor and a social and lifestyle one. The JEST has proved to be a simple, valid and reliable evaluation tool in the monitoring and evaluation of postgraduate hospital training posts.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wong, S.M.; Boccio, J.L.; Karimian, S.
1986-01-01
In this paper, a trial application of reliability technology to the emergency diesel generator system at the Trojan Nuclear Power Plant is presented. An approach for formulating a reliability program plan for this system is being developed. The trial application has shown that a reliability program process, using risk- and reliability-based techniques, can be interwoven into current plant operational activities to help in controlling, analyzing, and predicting faults that can challenge safety systems. With the cooperation of the utility, Portland General Electric Co., this reliability program can eventually be implemented at Trojan to track its effectiveness.
Subject-level reliability analysis of fast fMRI with application to epilepsy.
Hao, Yongfu; Khoo, Hui Ming; von Ellenrieder, Nicolas; Gotman, Jean
2017-07-01
Recent studies have applied the new magnetic resonance encephalography (MREG) sequence to the study of interictal epileptic discharges (IEDs) in the electroencephalogram (EEG) of epileptic patients. However, there are no criteria to quantitatively evaluate different processing methods, to properly use the new sequence. We evaluated different processing steps of this new sequence under the common generalized linear model (GLM) framework by assessing the reliability of results. A bootstrap sampling technique was first used to generate multiple replicated data sets; a GLM with different processing steps was then applied to obtain activation maps, and the reliability of these maps was assessed. We applied our analysis in an event-related GLM related to IEDs. A higher reliability was achieved by using a GLM with head motion confound regressor with 24 components rather than the usual 6, with an autoregressive model of order 5 and with a canonical hemodynamic response function (HRF) rather than variable latency or patient-specific HRFs. Comparison of activation with IED field also favored the canonical HRF, consistent with the reliability analysis. The reliability analysis helps to optimize the processing methods for this fast fMRI sequence, in a context in which we do not know the ground truth of activation areas. Magn Reson Med 78:370-382, 2017. © 2016 International Society for Magnetic Resonance in Medicine. © 2016 International Society for Magnetic Resonance in Medicine.
Rasch Analysis of the General Self-Efficacy Scale in Workers with Traumatic Limb Injuries.
Wu, Tzu-Yi; Yu, Wan-Hui; Huang, Chien-Yu; Hou, Wen-Hsuan; Hsieh, Ching-Lin
2016-09-01
Purpose The purpose of this study was to apply Rasch analysis to examine the unidimensionality and reliability of the General Self-Efficacy Scale (GSE) in workers with traumatic limb injuries. Furthermore, if the items of the GSE fitted the Rasch model's assumptions, we transformed the raw sum ordinal scores of the GSE into Rasch interval scores. Methods A total of 1076 participants completed the GSE at 1 month post injury. Rasch analysis was used to examine the unidimensionality and person reliability of the GSE. The unidimensionality of the GSE was verified by determining whether the items fit the Rasch model's assumptions: (1) item fit indices: infit and outfit mean square (MNSQ) ranged from 0.6 to 1.4; and (2) the eigenvalue of the first factor extracted from principal component analysis (PCA) for residuals was <2. Person reliability was calculated. Results The unidimensionality of the 10-item GSE was supported in terms of good item fit statistics (infit and outfit MNSQ ranging from 0.92 to 1.32) and acceptable eigenvalues (1.6) of the first factor of the PCA, with person reliability = 0.89. Consequently, the raw sum scores of the GSE were transformed into Rasch scores. Conclusions The results indicated that the items of GSE are unidimensional and have acceptable person reliability in workers with traumatic limb injuries. Additionally, the raw sum scores of the GSE can be transformed into Rasch interval scores for prospective users to quantify workers' levels of self-efficacy and to conduct further statistical analyses.
Guan, Ng Chong; Seng, Loh Huai; Hway Ann, Anne Yee; Hui, Koh Ong
2015-03-01
This study was aimed at validating the simplified Chinese version of the Multidimensional Scale of Perceived Support (MSPSS-SCV) among a group of medical and dental students in University Malaya. Two hundred and two students who took part in this study were given the MSPSS-SCV, the Medical Outcome Study social support survey, the Malay version of the Beck Depression Inventory, the Malay version of the General Health Questionnaire, and the English version of the MSPSS. After 1 week, these students were again required to complete the MSPSS-SCV but with the item sequences shuffled. This scale displayed excellent internal consistency (Cronbach's α = .924), high test-retest reliability (.71), parallel form reliability (.92; Spearman's ρ, P < .01), and validity. In conclusion, the MSPSS-SCV demonstrated sound psychometric properties in measuring social support among a group of medical and dental students. It could therefore be used as a simple screening tool among young educated Malaysian adolescents. © 2013 APJPH.
Paap, Kenneth R; Sawi, Oliver
2016-12-01
Studies testing for individual or group differences in executive functioning can be compromised by unknown test-retest reliability. Test-retest reliabilities across an interval of about one week were obtained from performance in the antisaccade, flanker, Simon, and color-shape switching tasks. There is a general trade-off between the greater reliability of single mean RT measures, and the greater process purity of measures based on contrasts between mean RTs in two conditions. The individual differences in RT model recently developed by Miller and Ulrich was used to evaluate the trade-off. Test-retest reliability was statistically significant for 11 of the 12 measures, but was of moderate size, at best, for the difference scores. The test-retest reliabilities for the Simon and flanker interference scores were lower than those for switching costs. Standard practice evaluates the reliability of executive-functioning measures using split-half methods based on data obtained in a single day. Our test-retest measures of reliability are lower, especially for difference scores. These reliability measures must also take into account possible day effects that classical test theory assumes do not occur. Measures based on single mean RTs tend to have acceptable levels of reliability and convergent validity, but are "impure" measures of specific executive functions. The individual differences in RT model shows that the impurity problem is worse than typically assumed. However, the "purer" measures based on difference scores have low convergent validity that is partly caused by deficiencies in test-retest reliability. Copyright © 2016 Elsevier B.V. All rights reserved.
The quality of orthodontic practice websites.
Parekh, J; Gill, D S
2014-05-01
To evaluate orthodontic practice websites for the reliability of information presented, accessibility, usability for patients and compliance to General Dental Council (GDC) regulations on ethical advertising. World Wide Web. The term 'orthodontic practice' was entered into three separate search engines. The 30 websites from the UK were selected and graded according to the LIDA tool (a validated method of evaluating healthcare websites) for accessibility, usability of the website and reliability of information on orthodontic treatment. The websites were then evaluated against the GDC's Principles for ethical advertising in nine different criteria. On average, each website fulfilled six out of nine points of the GDC's criteria, with inclusion of a complaints policy being the most poorly fulfilled criteria. The mean LIDA score (a combination of usability, reliability and accessibility) was 102/144 (standard deviation 8.38). The websites scored most poorly on reliability (average 43% SD 11.7), with no single website reporting a clear, reliable method of content production. Average accessibility was 81% and usability 73%. In general, websites did not comply with GDC guidelines on ethical advertising. Furthermore, practitioners should consider reporting their method of information production, particularly when making claims about efficiency and speed of treatment in order to improve reliability.
Mieritz, Rune M; Bronfort, Gert; Jakobsen, Markus D; Aagaard, Per; Hartvigsen, Jan
2014-09-01
A basic premise for any instrument measuring spinal motion is that reliable outcomes can be obtained on a relevant sample under standardized conditions. The purpose of this study was to assess the overall reliability and measurement error of regional spinal sagittal plane motion in patients with chronic low back pain (LBP), and then to evaluate the influence of body mass index, examiner, gender, stability of pain, and pain distribution on reliability and measurement error. This study comprises a test-retest design separated by 7 to 14 days. The patient cohort consisted of 220 individuals with chronic LBP. Kinematics of the lumbar spine were sampled during standardized spinal extension-flexion testing using a 6-df instrumented spatial linkage system. Test-retest reliability and measurement error were evaluated using interclass correlation coefficients (ICC(1,1)) and Bland-Altman limits of agreement (LOAs). The overall test-retest reliability (ICC(1,1)) for various motion parameters ranged from 0.51 to 0.70, and relatively wide LOAs were observed for all parameters. Reliability measures in patient subgroups (ICC(1,1)) ranged between 0.34 and 0.77. In general, greater (ICC(1,1)) coefficients and smaller LOAs were found in subgroups with patients examined by the same examiner, patients with a stable pain level, patients with a body mass index less than below 30 kg/m(2), patients who were men, and patients in the Quebec Task Force classifications Group 1. This study shows that sagittal plane kinematic data from patients with chronic LBP may be sufficiently reliable in measurements of groups of patients. However, because of the large LOAs, this test procedure appears unusable at the individual patient level. Furthermore, reliability and measurement error varies substantially among subgroups of patients. Copyright © 2014 Elsevier Inc. All rights reserved.
A Reliability Generalization of the Parental Authority Questionnaire
ERIC Educational Resources Information Center
Dean, Lynn M.
2016-01-01
How parents interact with their children impacts many crucial facets of children's lives. Over the last 4 decades, researchers have identified four different parenting styles: authoritative, authoritarian, permissive, and disengaged. Hundreds of studies conducted all over the world, have identified correlations between parenting style and many…
76 FR 13018 - Submission for OMB Review; Comment Request
Federal Register 2010, 2011, 2012, 2013, 2014
2011-03-09
... statistical surveys that yield quantitative results that can be generalized to the population of study. This... information will not be used for quantitative information collections that are designed to yield reliably... generic mechanisms that are designed to yield quantitative results. Total Burden Estimate for the...
Teaching Historical Contextualization: The Construction of a Reliable Observation Instrument
ERIC Educational Resources Information Center
Huijgen, Tim; van de Grift, Wim; van Boxtel, Carla; Holthuis, Paul
2017-01-01
Since the 1970s, many observation instruments have been constructed to map teachers' general pedagogic competencies. However, few of these instruments focus on teachers' subject-specific competencies. This study presents the development of the "Framework for Analyzing the Teaching of Historical Contextualization" (FAT-HC). This…
The Cross Validation of the Attitudes toward Mainstreaming Scale (ATMS).
ERIC Educational Resources Information Center
Berryman, Joan D.; Neal, W. R. Jr.
1980-01-01
Reliability and factorial validity of the Attitudes Toward Mainstreaming Scale was supported in a cross-validation study with teachers. Three factors emerged: learning capability, general mainstreaming, and traditional limiting disabilities. Factor intercorrelations varied from .42 to .55; correlations between total scores and individual factors…
Limits to detection of generalized synchronization in delay-coupled chaotic oscillators.
Kato, Hideyuki; Soriano, Miguel C; Pereda, Ernesto; Fischer, Ingo; Mirasso, Claudio R
2013-12-01
We study how reliably generalized synchronization can be detected and characterized from time-series analysis. To that end, we analyze synchronization in a generalized sense of delay-coupled chaotic oscillators in unidirectional ring configurations. The generalized synchronization condition can be verified via the auxiliary system approach; however, in practice, this might not always be possible. Therefore, in this study, widely used indicators to directly quantify generalized and phase synchronization from noise-free time series of two oscillators are employed complementarily to the auxiliary system approach. In our analysis, none of the indices provide the consistent results of the auxiliary system approach. Our findings indicate that it is a major challenge to directly detect synchronization in a generalized sense between two oscillators that are connected via a chain of other oscillators, even if the oscillators are identical. This has major consequences for the interpretation of the dynamics of coupled systems and applications thereof.
The Effect of Incorrect Reliability Information on Expectations, Perceptions, and Use of Automation.
Barg-Walkow, Laura H; Rogers, Wendy A
2016-03-01
We examined how providing artificially high or low statements about automation reliability affected expectations, perceptions, and use of automation over time. One common method of introducing automation is providing explicit statements about the automation's capabilities. Research is needed to understand how expectations from such introductions affect perceptions and use of automation. Explicit-statement introductions were manipulated to set higher-than (90%), same-as (75%), or lower-than (60%) levels of expectations in a dual-task scenario with 75% reliable automation. Two experiments were conducted to assess expectations, perceptions, compliance, reliance, and task performance over (a) 2 days and (b) 4 days. The baseline assessments showed initial expectations of automation reliability matched introduced levels of expectation. For the duration of each experiment, the lower-than groups' perceptions were lower than the actual automation reliability. However, the higher-than groups' perceptions were no different from actual automation reliability after Day 1 in either study. There were few differences between groups for automation use, which generally stayed the same or increased with experience using the system. Introductory statements describing artificially low automation reliability have a long-lasting impact on perceptions about automation performance. Statements including incorrect automation reliability do not appear to affect use of automation. Introductions should be designed according to desired outcomes for expectations, perceptions, and use of the automation. Low expectations have long-lasting effects. © 2015, Human Factors and Ergonomics Society.
SIERRA - A 3-D device simulator for reliability modeling
NASA Astrophysics Data System (ADS)
Chern, Jue-Hsien; Arledge, Lawrence A., Jr.; Yang, Ping; Maeda, John T.
1989-05-01
SIERRA is a three-dimensional general-purpose semiconductor-device simulation program which serves as a foundation for investigating integrated-circuit (IC) device and reliability issues. This program solves the Poisson and continuity equations in silicon under dc, transient, and small-signal conditions. Executing on a vector/parallel minisupercomputer, SIERRA utilizes a matrix solver which uses an incomplete LU (ILU) preconditioned conjugate gradient square (CGS, BCG) method. The ILU-CGS method provides a good compromise between memory size and convergence rate. The authors have observed a 5x to 7x speedup over standard direct methods in simulations of transient problems containing highly coupled Poisson and continuity equations such as those found in reliability-oriented simulations. The application of SIERRA to parasitic CMOS latchup and dynamic random-access memory single-event-upset studies is described.
Expected Utility Based Decision Making under Z-Information and Its Application.
Aliev, Rashad R; Mraiziq, Derar Atallah Talal; Huseynov, Oleg H
2015-01-01
Real-world decision relevant information is often partially reliable. The reasons are partial reliability of the source of information, misperceptions, psychological biases, incompetence, and so forth. Z-numbers based formalization of information (Z-information) represents a natural language (NL) based value of a variable of interest in line with the related NL based reliability. What is important is that Z-information not only is the most general representation of real-world imperfect information but also has the highest descriptive power from human perception point of view as compared to fuzzy number. In this study, we present an approach to decision making under Z-information based on direct computation over Z-numbers. This approach utilizes expected utility paradigm and is applied to a benchmark decision problem in the field of economics.
Tarescavage, Anthony M; Wygant, Dustin B; Boutacoff, Lana I; Ben-Porath, Yossef S
2013-12-01
In the current study, we examined the reliability, validity, and clinical utility of Minnesota Multiphasic Personality Inventory-2-Restructured Form (MMPI-2-RF; Ben-Porath & Tellegen, 2011) scores in a sample of 759 bariatric surgery candidates. We provide descriptives for all scales, internal consistency and standard error of measurement estimates for all substantive scales, external correlates of substantive scales using chart review and self-report criteria, and relative risk ratios to assess the clinical utility of the instrument. Results generally support the reliability, validity, and clinical utility of MMPI-2-RF scale scores in the psychological evaluation of bariatric surgery candidates. Limitations, future directions, and practical application of these results are discussed. (c) 2013 APA, all rights reserved.
Family Self-Efficacy for Diabetes Management: Psychometric Testing.
Mcewen, Marylyn M; Pasvogel, Alice; Murdaugh, Carolyn L
2016-01-01
Type 2 diabetes mellitus (T2DM) self-management among Hispanic adults occurs in a family context. Self-efficacy (SE) affects T2DM self-management behaviors; however, no instruments are available to measure family diabetes self-efficacy. The study's purpose was to test the psychometric properties of the Family Self-Efficacy for Diabetes Scale (FSE). Family members (n = 113) of adults with T2DM participated. Psychometric analysis included internal consistency reliability and concurrent and construct validity. Internal consistency reliability was .86. Items loaded on 2 factors, Family SE for Supporting Healthy Behaviors and Family SE for Supporting General Health, accounting for 71% of the variance. FSE correlated significantly with 3 diabetes-related instruments. The FSE is a reliable and valid instrument. Further testing is needed in diverse populations and geographic areas.
Illustrated structural application of universal first-order reliability method
NASA Technical Reports Server (NTRS)
Verderaime, V.
1994-01-01
The general application of the proposed first-order reliability method was achieved through the universal normalization of engineering probability distribution data. The method superimposes prevailing deterministic techniques and practices on the first-order reliability method to surmount deficiencies of the deterministic method and provide benefits of reliability techniques and predictions. A reliability design factor is derived from the reliability criterion to satisfy a specified reliability and is analogous to the deterministic safety factor. Its application is numerically illustrated on several practical structural design and verification cases with interesting results and insights. Two concepts of reliability selection criteria are suggested. Though the method was developed to support affordable structures for access to space, the method should also be applicable for most high-performance air and surface transportation systems.
Papadakaki, Maria; Prokopiadou, Dimitra; Petridou, Eleni; Kogevinas, Manolis; Lionis, Christos
2012-06-01
The current article aims to translate the PREMIS (Physician Readiness to Manage Intimate Partner Violence) survey into the Greek language and test its validity and reliability in a sample of primary care physicians. The validation study was conducted in 2010 and involved all the general practitioners serving two adjacent prefectures of Greece (n = 80). Maximum-likelihood factor analysis (MLF) was used to extract key survey factors. The instrument was further assessed for the following psychometric properties: (a) scale reliability, (b) item-specific reliability, (c) test-retest reliability, (d) scale construct validity, and (e) internal predictive validity. The MLF analysis of 23 opinion items revealed a seven-factor solution (preparation, constraint, workplace issues, screening, self-efficacy, alcohol/drugs, victim understanding), which was statistically sound (p = .293). Most of the newly derived scales displayed satisfactory internal consistency (α ≥ .60), high item-specific reliability, strong construct, and internal predictive validity (F = 2.82; p = .004), and high repeatability when retested with 20 individuals (intraclass correlation coefficient [ICC] > .70). The tool was found appropriate to facilitate the identification of competence deficits and the evaluation of training initiatives.
Schoenmakers, Birgitte; Wens, Johan
2014-03-04
To investigate if the psychometric qualities of an OSCE consisting of more complex simulated patient encounters remain valid and reliable in the assessment of postgraduate trainees in general practice. In this intervention study without control group, the traditional OSCE was formally replaced by the new, complex version. The study population was composed by all postgraduate trainees (second and third phase) in general practice during the ongoing academic year. Data were handled and collected as part of the formal assessment program. Univariate analyses, the variance of scores and multivariate analyses were performed to assess the test qualities. A total of 340 students participated. Average final scores were slightly higher for third-phase students (t-test, p =0.05). Overall test scores were equally distributed on station level, circuit level and phase level. A multiple regression analysis revealed that test scores were dependent on the stations and circuits, but not on the master phase. In a changing learning environment, assessment and evaluation strategies require reorientation. The reliability and validity of the OSCE remain subject to discussion. In particular, when it comes to content and design, the traditional OSCE might underestimate the performance level of postgraduate trainees in general practice. A reshaping of this OSCE to a more sophisticated design with more complex patient encounters appears to restore the validity of the test results.
2016-05-23
general model for heterogeneous granular media under compaction and (ii) the lack of a reliable multiscale discrete -to-continuum framework for...dynamics. These include a continuum- discrete model of heat dissipation/diffusion and a continuum- discrete model of compaction of a granular material with...the lack of a general model for het- erogeneous granular media under compac- tion and (ii) the lack of a reliable multi- scale discrete -to-continuum
Staff Report to the Secretary on Electricity Markets and Reliability
DOE Office of Scientific and Technical Information (OSTI.GOV)
None
Energy Secretary Rick Perry issued a memo in April of 2017 requesting a study and directing his staff to develop a report to include an assessment of the reliability and resilience of the electric grid and an overview of the evolution of electricity markets. Various factors have emerged over the past 15 years which have impacted power supply and demand in different ways. This study, prepared by experts throughout the Department, contains a comprehensive analysis of these factors and the corresponding data, and presents a series of recommendations meant to inform and guide policy makers, regulators, and the general public.more » Potential areas for further research are also presented.« less
Systematic review found AMSTAR, but not R(evised)-AMSTAR, to have good measurement properties.
Pieper, Dawid; Buechter, Roland Brian; Li, Lun; Prediger, Barbara; Eikermann, Michaela
2015-05-01
To summarize all available evidence on measurement properties in terms of reliability, validity, and feasibility of the Assessment of Multiple Systematic Reviews (AMSTAR) tool, including R(evised)-AMSTAR. MEDLINE, EMBASE, Psycinfo, and CINAHL were searched for studies containing information on measurement properties of the tools in October 2013. We extracted data on study characteristics and measurement properties. These data were analyzed following measurement criteria. We included 13 studies, four of them were labeled as validation studies. Nine articles dealt with AMSTAR, two articles dealt with R-AMSTAR, and one article dealt with both instruments. In terms of interrater reliability, most items showed a substantial agreement (>0.6). The median intraclass correlation coefficient (ICC) for the overall score of AMSTAR was 0.83 (range 0.60-0.98), indicating a high agreement. In terms of validity, ICCs were very high with all but one ICC lower than 0.8 when the AMSTAR score was compared with scores from other tools. Scoring AMSTAR takes between 10 and 20 minutes. AMSTAR seems to be reliable and valid. Further investigations for systematic reviews of other study designs than randomized controlled trials are needed. R-AMSTAR should be further investigated as evidence for its use is limited and its measurement properties have not been studied sufficiently. In general, test-retest reliability should be investigated in future studies. Copyright © 2015 Elsevier Inc. All rights reserved.
A new technique in the global reliability of cyclic communications network
NASA Technical Reports Server (NTRS)
Sjogren, Jon A.
1989-01-01
The global reliability of a communications network is the probability that given any pair of nodes, there exists a viable path between them. A characterization of connectivity, for a given class of networks, can enable one to find this reliability. Such a characterization is described for a useful class of undirected networks called daisy-chained or braided networks. This leads to a new method of quickly computing the global reliability of these networks. Asymptotic behavior in terms of component reliability is related to geometric properties of the given graph. Generalization of the technique is discussed.
[Reliability and Validity of the Scale for Homophobia in Medicine Students].
Campo-Arias, Adalberto; Lafaurie, María Mercedes; Gaitán-Duarte, Hernando G
2012-12-01
There are several scales to quantify homophobia in different populations. However, the reliability and validity of these instruments among Colombian students are unknown. Consequently, this work is intended to assess reliability (inner consistency) as well as the validity of the Scale for Homophobia in Medicine students from a private university in Bogotá (Colombia). Methodological study with 199 Medicine students from 1st to 5th semester that filled out the Homophobia Scale form, the general welfare questionnaire, the Attitude Towards Gays and Lesbians Scale (ATGL), WHO-5 (divergent validity) and the Francis Scale of Attitude Toward Christianity (nomologic validity). Pearson's correlations were computed, the Cronbach's alfa coefficient, the omega coefficient (construct's reliability) and confirmatory factorial analysis. The Scale for Homophobia showed an alpha Cronbach coefficient of 0,785, an omega coefficient of 0,790 and a Pearson correlation with the ATGL of 0,844; with WHO-5, -0,059; and a Francis Scale of Attitude Toward Christianity, 0,187. The Scale toward Homophobia exhibited a relevant factor of 44,7% of the total variance. The Scale for Homophobia showed acceptable reliability and validity. New studies should investigate the stability of the scale and the nomologic validity regarding other constructs. Copyright © 2012 Asociación Colombiana de Psiquiatría. Publicado por Elsevier España. All rights reserved.
Development of a scale to assess cancer stigma in the non-patient population
2014-01-01
Background Illness-related stigma has attracted considerable research interest, but few studies have specifically examined stigmatisation of cancer in the non-patient population. The present study developed and validated a Cancer Stigma Scale (CASS) for use in the general population. Methods An item pool was developed on the basis of previous research into illness-related stigma in the general population and patients with cancer. Two studies were carried out. The first study used Exploratory factor analysis to explore the structure of items in a sample of 462 postgraduate students recruited through a London university. The second study used Confirmatory factor analysis to confirm the structure among 238 adults recruited through an online market research panel. Internal reliability, test-retest reliability and construct validity were also assessed. Results Exploratory factor analysis suggested six subscales, representing: Awkwardness, Severity, Avoidance, Policy Opposition, Personal Responsibility and Financial Discrimination. Confirmatory factor analysis confirmed this structure with a 25-item scale. All subscales showed adequate to good internal and test-retest reliability in both samples. Construct validity was also good, with mean scores for each subscale varying in the expected directions by age, gender, experience of cancer, awareness of lifestyle risk factors for cancer, and social desirability. Means for the subscales were consistent across the two samples. Conclusions These findings highlight the complexity of cancer stigma and provide the Cancer Stigma Scale (CASS) which can be used to compare populations, types of cancer and evaluate the effects of interventions designed to reduce cancer stigma in non-patient populations. PMID:24758482
di Giuseppe, Romina; Hirche, Frank; Montonen, Jukka; Buijsse, Brian; Dierkes, Jutta; Stangl, Gabriele I; Boeing, Heiner; Weikert, Cornelia
2012-11-01
Identified as a biomarker of altered calcium-phosphorus metabolism in chronic kidney disease, fibroblast growth factor 23 (FGF-23) can also be used as a biomarker of risk for cardiovascular disease in the general population. However, it is crucial to first evaluate the reproducibility (reliability) of plasma FGF-23 concentrations. We assessed the reliability of plasma FGF-23 concentrations using replicate blood samples taken four months apart of 207 participants from the European Prospective Investigation into Cancer and Nutrition-Potsdam Study. Plasma FGF-23 concentrations at baseline (geometric mean: 24.7 RU/mL; 95% confidence interval [CI] in RU/mL: 21.8-27.9) were not significantly different from those measured four months later (geometric mean: 23.7 RU/mL; 95% CI in RU/mL: 20.6-27.1; P = 0.42). The intraclass correlation coefficients were 0.69 (95% CI: 0.62-0.76) for all; 0.64 (95% CI: 0.50-0.75) for men and 0.73 (95% CI: 0.64-0.81) for women. Plasma FGF-23 concentrations showed good reliability over time. Our findings suggest that in epidemiological studies, a single plasma FGF-23 measurement may be sufficient to derive the relative risk in prospective cohort studies.
Ku, David Tawei; Shen, Chun-Yi
2009-01-01
The Felder-Soloman Index of Learning Styles (ILS) has been a popular instrument for measuring learning styles of college students for the past two decades. Even though several researchers have translated the ILS into Chinese for their own studies, a Chinese version has not been standardized and evaluated, nor has anyone reported on its reliability and validity. Based on data collected from 2,748 students at a large private university in Taiwan, this study investigates the reliability and validity of the Chinese version of the ILS. In addition, through factor analysis and structural equation modeling (SEM) analysis, problematic test items are identified for further modification. Results show that the reliability of each scale of the ILS has a pattern similar to that of previous studies. The study therefore investigates the identified problematic elements and discusses two key points: (1) the language and translation problems and (2) precision and design. In addition, results of the significant interaction effects of analysis of variance (ANOVA) for active/reflective and sensing/intuitive scales indicate the effect of college differences depends on the levels between genders. Moreover, in general, female students are significantly more intuitive and global and less visual than male students. Other detailed analysis of academic disciplines and gender onILS are also reported.
Duncan, Laura; Georgiades, Kathy; Wang, Li; Van Lieshout, Ryan J; MacMillan, Harriet L; Ferro, Mark A; Lipman, Ellen L; Szatmari, Peter; Bennett, Kathryn; Kata, Anna; Janus, Magdalena; Boyle, Michael H
2017-12-04
The goals of the study were to examine test-retest reliability, informant agreement and convergent and discriminant validity of nine DSM-IV-TR psychiatric disorders classified by parent and youth versions of the Mini International Neuropsychiatric Interview for Children and Adolescents (MINI-KID). Using samples drawn from the general population and child mental health outpatient clinics, 283 youth aged 9 to 18 years and their parents separately completed the MINI-KID with trained lay interviewers on two occasions 7 to 14 days apart. Test-retest reliability estimates based on kappa (κ) went from 0.33 to 0.79 across disorders, samples and informants. Parent-youth agreement on disorders was low (average κ = 0.20). Confirmatory factor analysis provided evidence supporting convergent and discriminant validity. The MINI-KID disorder classifications yielded estimates of test-retest reliability and validity comparable to other standardized diagnostic interviews in both general population and clinic samples. These findings, in addition to the brevity and low administration cost, make the MINI-KID a good candidate for use in epidemiological research and clinical practice. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Power Cycle Testing of Power Switches: A Literature Survey
DOE Office of Scientific and Technical Information (OSTI.GOV)
GopiReddy, Lakshmi Reddy; Tolbert, Leon M.; Ozpineci, Burak
Reliability of power converters and lifetime prediction has been a major topic of research in the last few decades, especially for traction applications. The main failures in high power semiconductors are caused by thermomechanical fatigue. Power cycling and temperature cycling are the two most common thermal acceleration tests used in assessing reliability. The objective of this paper is to study the various power cycling tests found in the literature and to develop generalized steps in planning application specific power cycling tests. A comparison of different tests based on the failures, duration, test circuits, and monitored electrical parameters is presented.
Power Cycle Testing of Power Switches: A Literature Survey
GopiReddy, Lakshmi Reddy; Tolbert, Leon M.; Ozpineci, Burak
2014-09-18
Reliability of power converters and lifetime prediction has been a major topic of research in the last few decades, especially for traction applications. The main failures in high power semiconductors are caused by thermomechanical fatigue. Power cycling and temperature cycling are the two most common thermal acceleration tests used in assessing reliability. The objective of this paper is to study the various power cycling tests found in the literature and to develop generalized steps in planning application specific power cycling tests. A comparison of different tests based on the failures, duration, test circuits, and monitored electrical parameters is presented.
A comparison of three observational techniques for assessing postural loads in industry.
Kee, Dohyung; Karwowski, Waldemar
2007-01-01
This study aims to compare 3 observational techniques for assessing postural load, namely, OWAS, RULA, and REBA. The comparison was based on the evaluation results generated by the classification techniques using 301 working postures. All postures were sampled from the iron and steel, electronics, automotive, and chemical industries, and a general hospital. While only about 21% of the 301 postures were classified at the action category/level 3 or 4 by both OWAS and REBA, about 56% of the postures were classified into action level 3 or 4 by RULA. The inter-method reliability for postural load category between OWAS and RULA was just 29.2%, and the reliability between RULA and REBA was 48.2%. These results showed that compared to RULA, OWAS, and REBA generally underestimated postural loads for the analyzed postures, irrespective of industry, work type, and whether or not the body postures were in a balanced state.
NASA Technical Reports Server (NTRS)
Liu, Donhang
2014-01-01
This presentation includes a summary of NEPP-funded deliverables for the Base-Metal Electrodes (BMEs) capacitor task, development of a general reliability model for BME capacitors, and a summary and future work.
Larsen, Camilla Marie; Juul-Kristensen, Birgit; Lund, Hans; Søgaard, Karen
2014-10-01
The aims were to compile a schematic overview of clinical scapular assessment methods and critically appraise the methodological quality of the involved studies. A systematic, computer-assisted literature search using Medline, CINAHL, SportDiscus and EMBASE was performed from inception to October 2013. Reference lists in articles were also screened for publications. From 50 articles, 54 method names were identified and categorized into three groups: (1) Static positioning assessment (n = 19); (2) Semi-dynamic (n = 13); and (3) Dynamic functional assessment (n = 22). Fifteen studies were excluded for evaluation due to no/few clinimetric results, leaving 35 studies for evaluation. Graded according to the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN checklist), the methodological quality in the reliability and validity domains was "fair" (57%) to "poor" (43%), with only one study rated as "good". The reliability domain was most often investigated. Few of the assessment methods in the included studies that had "fair" or "good" measurement property ratings demonstrated acceptable results for both reliability and validity. We found a substantially larger number of clinical scapular assessment methods than previously reported. Using the COSMIN checklist the methodological quality of the included measurement properties in the reliability and validity domains were in general "fair" to "poor". None were examined for all three domains: (1) reliability; (2) validity; and (3) responsiveness. Observational evaluation systems and assessment of scapular upward rotation seem suitably evidence-based for clinical use. Future studies should test and improve the clinimetric properties, and especially diagnostic accuracy and responsiveness, to increase utility for clinical practice.
Monolithic ceramic analysis using the SCARE program
NASA Technical Reports Server (NTRS)
Manderscheid, Jane M.
1988-01-01
The Structural Ceramics Analysis and Reliability Evaluation (SCARE) computer program calculates the fast fracture reliability of monolithic ceramic components. The code is a post-processor to the MSC/NASTRAN general purpose finite element program. The SCARE program automatically accepts the MSC/NASTRAN output necessary to compute reliability. This includes element stresses, temperatures, volumes, and areas. The SCARE program computes two-parameter Weibull strength distributions from input fracture data for both volume and surface flaws. The distributions can then be used to calculate the reliability of geometrically complex components subjected to multiaxial stress states. Several fracture criteria and flaw types are available for selection by the user, including out-of-plane crack extension theories. The theoretical basis for the reliability calculations was proposed by Batdorf. These models combine linear elastic fracture mechanics (LEFM) with Weibull statistics to provide a mechanistic failure criterion. Other fracture theories included in SCARE are the normal stress averaging technique and the principle of independent action. The objective of this presentation is to summarize these theories, including their limitations and advantages, and to provide a general description of the SCARE program, along with example problems.
Trampisch, U; Platen, P; Burghaus, I; Moschny, A; Wilm, S; Thiem, U; Hinrichs, T
2010-12-01
A questionnaire (Q) to measure physical activity (PA) of persons ≥70 years for epidemiological research is lacking. The aim was to develop the PRISCUS-PAQ and test the reliability in community-dwelling people (≥70 years). Validated PA questionnaires were translated and adapted to design the PRISCUS-PAQ. Its test-retest reliability for 91 randomly selected people (36% men) aged 70-98 (76±5) years ranged from 0.47 (walking) to 0.82 (riding a bicycle). The overall activity score was 0.59 as determined by the intraclass correlation coefficient (ICC). Recording of general activities, e.g., housework (ICC=0.59), was in general less reliable than athletic activities, e.g., gymnastics (ICC=0.76). The PRISCUS-PAQ, which is a short instrument with acceptable reliability to collect the physical activity of the elderly in a telephone interview, will be used to collect data in a large cohort of older people in the German research consortium PRISCUS.
Development, Validity, and Reliability of the Campus Residential Experience Survey
ERIC Educational Resources Information Center
Sriram, Rishi; Scales, Laine; Shushok, Frank, Jr.
2017-01-01
The importance of living on campus is well established, but extant research that examines administrator perceptions of what comprises the best educational experience for students living on campus is generally unavailable. This study reports the development of a psychometric instrument designed to uncover underlying paradigms and attitudes of…
Federal Register 2010, 2011, 2012, 2013, 2014
2011-03-04
... provides useful insights on perceptions and opinions, but are not statistical surveys that yield quantitative results that can be generalized to the population of study. This feedback will provide insights... used for quantitative information collections that are designed to yield reliably actionable results...
ERIC Educational Resources Information Center
Robertson, Marjorie J.
1986-01-01
Reviews literature on the homeless reporting higher rates of psychiatric disorder, psychological distress, and previous psychiatric hospitalization compared to the general population. However, understandardized methodology and lack of consistent findings across studies prohibit reliable prevalence estimates of mental disorder among the homeless.…
ERIC Educational Resources Information Center
Rantanen, Pekka
2013-01-01
A multilevel analysis approach was used to analyse students' evaluation of teaching (SET). The low value of inter-rater reliability stresses that any solid conclusions on teaching cannot be made on the basis of single feedbacks. To assess a teacher's general teaching effectiveness, one needs to evaluate four randomly chosen course implementations.…
Baker, Richard S; Bazargan, Mohsen; Calderón, José L; Hays, Ron D
2006-08-01
To compare the psychometric performance of Spanish versions of the 25-item National Eye Institute Visual Function Questionnaire (NEI VFQ-25) and the NEI VFQ-39 administered to Latino patients with the psychometric performance of the standard English NEI VFQ-25 and NEI VFQ-39 administered to non-Latino patients. Clinic-based cross-sectional survey. Four hundred three patients (160 Latinos and 243 non-Latinos) recruited from general ophthalmology clinics of an urban public hospital over a 6-month period. Structured face-to-face interviews were conducted in Spanish and English to collect data for the NEI VFQ-25 and NEI VFQ-39. We calculated the mean, standard deviation, and percentage of participants having the minimum (floor) and maximum (ceiling) possible score for each item and scale. Internal consistency reliability of the NEI VFQ-25 and NEI VFQ-39 was estimated using the Cronbach alpha and average inter-item correlation. Construct validity for the instruments was assessed by comparing scores for participants classified as having normal versus impaired visual acuity. Instrument scales for general health; general vision; ocular pain; near activities; distance activities; vision-specific social functioning, mental health, role difficulties, and dependency; driving; color vision; and peripheral vision. Internal consistency reliability was significantly lower in the Spanish version than in the English version for 3 scales of the NEI VFQ-25. More importantly, 3 scales in the Spanish version manifested inadequate reliability (alpha< or =0.70), compared with only 1 inadequately reliable subscale in the English version. Reliability coefficients associated with the Spanish NEI VFQ-39 scales exceeded commonly accepted minimum standards. Comparison of reliability coefficients between Latino and non-Latino subgroups demonstrated statistically significant differences for 4 scales: Ocular Pain, Mental Health, Role Difficulties, and Dependency. In each case, the Latino group had the lower internal consistency reliability. However, only for the Ocular Pain subscale was reliability both significantly lower and inadequate (alpha<0.70). Overall performance of the NEI VFQ in Latino populations is adequate. However, in the absence of modifications to improve the reliability of specific Spanish version subscales, comparisons between Latino and non-Latino subgroups using the NEI VFQ must be interpreted with appropriate caution.
Development and validation of a Spanish diabetes-specific numeracy measure: DNT-15 Latino.
White, Richard O; Osborn, Chandra Y; Gebretsadik, Tebeb; Kripalani, Sunil; Rothman, Russell L
2011-09-01
Although deficits in health literacy and numeracy have been described among Latinos, the impact of low numeracy on diabetes outcomes has not been studied. Study objectives were (1) to establish the reliability and validity of a 15-item Spanish, diabetes-specific numeracy measure (Diabetes Numeracy Test [DNT]-15 Latino) and (2) to examine the relationship between diabetes-specific numeracy and diabetes-related outcomes among a sample of Latino adults with diabetes. Data collection included patient demographics, health literacy, general numeracy, diabetes-specific numeracy, acculturation, self-efficacy, self-care behaviors, and most recent glycosylated hemoglobin (HbA1c). Participants (n=144) were on average 47.8 years old (SD=12.1). The majority were female (62%), uninsured (81%), and of Mexican nationality (78%) and reported low levels of acculturation (96%). The DNT-15 Latino had high internal reliability (Kruder-Richardson 20=0.78). The DNT-15 Latino demonstrated construct validity, correlating with measures of health literacy (ρ=0.291), general numeracy (ρ=0.500), education (ρ=0.361), and income (ρ=0.270) (P<0.001 for each). The DNT-15 Latino was significantly associated with acculturation but unrelated to self-efficacy, self-care behaviors, insulin use, and HbA1c. The DNT-15 Latino is a reliable and valid measure of diabetes-specific numeracy for Latino patients with diabetes; however, additional studies are needed to further explore the association between diabetes-specific numeracy and acculturation and their impact on diabetes-related outcomes for Latinos.
Reliability and Probabilistic Risk Assessment - How They Play Together
NASA Technical Reports Server (NTRS)
Safie, Fayssal M.; Stutts, Richard G.; Zhaofeng, Huang
2015-01-01
PRA methodology is one of the probabilistic analysis methods that NASA brought from the nuclear industry to assess the risk of LOM, LOV and LOC for launch vehicles. PRA is a system scenario based risk assessment that uses a combination of fault trees, event trees, event sequence diagrams, and probability and statistical data to analyze the risk of a system, a process, or an activity. It is a process designed to answer three basic questions: What can go wrong? How likely is it? What is the severity of the degradation? Since 1986, NASA, along with industry partners, has conducted a number of PRA studies to predict the overall launch vehicles risks. Planning Research Corporation conducted the first of these studies in 1988. In 1995, Science Applications International Corporation (SAIC) conducted a comprehensive PRA study. In July 1996, NASA conducted a two-year study (October 1996 - September 1998) to develop a model that provided the overall Space Shuttle risk and estimates of risk changes due to proposed Space Shuttle upgrades. After the Columbia accident, NASA conducted a PRA on the Shuttle External Tank (ET) foam. This study was the most focused and extensive risk assessment that NASA has conducted in recent years. It used a dynamic, physics-based, integrated system analysis approach to understand the integrated system risk due to ET foam loss in flight. Most recently, a PRA for Ares I launch vehicle has been performed in support of the Constellation program. Reliability, on the other hand, addresses the loss of functions. In a broader sense, reliability engineering is a discipline that involves the application of engineering principles to the design and processing of products, both hardware and software, for meeting product reliability requirements or goals. It is a very broad design-support discipline. It has important interfaces with many other engineering disciplines. Reliability as a figure of merit (i.e. the metric) is the probability that an item will perform its intended function(s) for a specified mission profile. In general, the reliability metric can be calculated through the analyses using reliability demonstration and reliability prediction methodologies. Reliability analysis is very critical for understanding component failure mechanisms and in identifying reliability critical design and process drivers. The following sections discuss the PRA process and reliability engineering in detail and provide an application where reliability analysis and PRA were jointly used in a complementary manner to support a Space Shuttle flight risk assessment.
Effects of a shade-matching light and background color on reliability in tooth shade selection.
Najafi-Abrandabadi, Siamak; Vahidi, Farhad; Janal, Malvin N
2018-01-01
The purpose of this study was to evaluate the effects of a shade-matching light (Rite-Lite-2, AdDent) and different viewing backgrounds on reliability in a test of shade tab matching. Four members of the Prosthodontic faculty matched 10 shade tabs selected for a range of shades against the shade guide. All raters were tested for color blindness and were calibrated prior to the study. Matching took place under four combinations of conditions: with operatory light or the shade-matching light, and using either a pink or a blue background. Reliability was quantified with the kappa statistic, separately for agreement of value, hue, and chroma for each shade tab. In general, raters showed fair to moderate levels of agreement when judging the value of the shade tabs, but could not agree on the hue and chroma of the stimuli. The pink background led to higher levels of agreement than the blue background, and the shade-matching light improved agreement when used in conjunction with the pink but not the blue background. Moderate levels of agreement were found in matching shade tab value. Agreement was generally better when using the pink rather than the blue background, regardless of light source. The use of the shade-matching light tended to amplify the advantage of the pink background.
Evaluation of the Cardiac Depression Visual Analogue Scale in a medical and non-medical sample.
Di Benedetto, Mirella; Sheehan, Matthew
2014-01-01
Comorbid depression and medical illness is associated with a number of adverse health outcomes such as lower medication adherence and higher rates of subsequent mortality. Reliable and valid psychological measures capable of detecting a range of depressive symptoms found in medical settings are needed. The Cardiac Depression Visual Analogue Scale (CDVAS) is a recently developed, brief six-item measure originally designed to assess the range and severity of depressive symptoms within a cardiac population. The current study aimed to further investigate the psychometric properties of the CDVAS in a general and medical sample. The sample consisted of 117 participants, whose mean age was 40.0 years (SD = 19.0, range 18-84). Participants completed the CDVAS, the Cardiac Depression Scale (CDS), the Depression Anxiety Stress Scales (DASS) and a demographic and health questionnaire. The CDVAS was found to have adequate internal reliability (α = .76), strong concurrent validity with the CDS (r = .89) and the depression sub-scale of the DASS (r = .70), strong discriminant validity and strong predictive validity. The principal components analysis revealed that the CDVAS measured only one component, providing further support for the construct validity of the scale. Results of the current study indicate that the CDVAS is a short, simple, valid and reliable measure of depressive symptoms suitable for use in a general and medical sample.
Balaguier, Romain; Madeleine, Pascal; Vuillerme, Nicolas
2016-01-01
The assessment of pressure pain threshold (PPT) provides a quantitative value related to the mechanical sensitivity to pain of deep structures. Although excellent reliability of PPT has been reported in numerous anatomical locations, its absolute and relative reliability in the lower back region remains to be determined. Because of the high prevalence of low back pain in the general population and because low back pain is one of the leading causes of disability in industrialized countries, assessing pressure pain thresholds over the low back is particularly of interest. The purpose of this study study was (1) to evaluate the intra- and inter- absolute and relative reliability of PPT within 14 locations covering the low back region of asymptomatic individuals and (2) to determine the number of trial required to ensure reliable PPT measurements. Fifteen asymptomatic subjects were included in this study. PPTs were assessed among 14 anatomical locations in the low back region over two sessions separated by one hour interval. For the two sessions, three PPT assessments were performed on each location. Reliability was assessed computing intraclass correlation coefficients (ICC), standard error of measurement (SEM) and minimum detectable change (MDC) for all possible combinations between trials and sessions. Bland-Altman plots were also generated to assess potential bias in the dataset. Relative reliability for both intra- and inter- session was almost perfect with ICC ranged from 0.85 to 0.99. With respect to the intra-session, no statistical difference was reported for ICCs and SEM regardless of the conducted comparisons between trials. Conversely, for inter-session, ICCs and SEM values were significantly larger when two consecutive PPT measurements were used for data analysis. No significant difference was observed for the comparison between two consecutive measurements and three measurements. Excellent relative and absolute reliabilities were reported for both intra- and inter-session. Reliable measurements can be equally achieved when using the mean of two or three consecutive PPT measurements, as usually proposed in the literature, or with only the first one. Although reliability was almost perfect regardless of the conducted comparison between PPT assessments, our results suggest using two consecutive measurements to obtain higher short term absolute reliability.
Establishing monitoring programs for travel time reliability. [supporting datasets
DOT National Transportation Integrated Search
2014-01-01
The objective of this project was to develop system designs for programs to monitor travel time reliability and to prepare a guidebook that practitioners and others can use to design, build, operate, and maintain such systems. Generally, such travel ...
The reliable solution and computation time of variable parameters logistic model
NASA Astrophysics Data System (ADS)
Wang, Pengfei; Pan, Xinnong
2018-05-01
The study investigates the reliable computation time (RCT, termed as T c) by applying a double-precision computation of a variable parameters logistic map (VPLM). Firstly, by using the proposed method, we obtain the reliable solutions for the logistic map. Secondly, we construct 10,000 samples of reliable experiments from a time-dependent non-stationary parameters VPLM and then calculate the mean T c. The results indicate that, for each different initial value, the T cs of the VPLM are generally different. However, the mean T c trends to a constant value when the sample number is large enough. The maximum, minimum, and probable distribution functions of T c are also obtained, which can help us to identify the robustness of applying a nonlinear time series theory to forecasting by using the VPLM output. In addition, the T c of the fixed parameter experiments of the logistic map is obtained, and the results suggest that this T c matches the theoretical formula-predicted value.
Interrater reliability of videotaped observational gait-analysis assessments.
Eastlack, M E; Arvidson, J; Snyder-Mackler, L; Danoff, J V; McGarvey, C L
1991-06-01
The purpose of this study was to determine the interrater reliability of videotaped observational gait-analysis (VOGA) assessments. Fifty-four licensed physical therapists with varying amounts of clinical experience served as raters. Three patients with rheumatoid arthritis who demonstrated an abnormal gait pattern served as subjects for the videotape. The raters analyzed each patient's most severely involved knee during the four subphases of stance for the kinematic variables of knee flexion and genu valgum. Raters were asked to determine whether these variables were inadequate, normal, or excessive. The temporospatial variables analyzed throughout the entire gait cycle were cadence, step length, stride length, stance time, and step width. Generalized kappa coefficients ranged from .11 to .52. Intraclass correlation coefficients (2,1) and (3,1) were slightly higher. Our results indicate that physical therapists' VOGA assessments are only slightly to moderately reliable and that improved interrater reliability of the assessments of physical therapists utilizing this technique is needed. Our data suggest that there is a need for greater standardization of gait-analysis training.
Reference values for the muscle power sprint test in 6- to 12-year-old children.
Douma-van Riet, Danielle; Verschuren, Olaf; Jelsma, Dorothee; Kruitwagen, Cas; Smits-Engelsman, Bouwien; Takken, Tim
2012-01-01
The aims of this study were (1) to develop centile reference values for anaerobic performance of Dutch children tested using the Muscle Power Sprint Test (MPST) and (2) to examine the test-retest reliability of the MPST. Children who were developing typically (178 boys and 201 girls) and aged 6 to 12 years (mean = 8.9 years) were recruited. The MPST was administered to 379 children, and test-retest reliability was examined in 47 children. MPST scores were transformed into centile curves, which were created using generalized additive models for location, scale, and shape. Height-related reference curves were created for both genders. Excellent (intraclass correlation coefficient = 0.98) test-retest reliability was demonstrated. The reference values for the MPST of children who are developing typically and aged 6 to 12 years can serve as a clinical standard in pediatric physical therapy practice. The MPST is a reliable and practical method for determining anaerobic performance in children.
The Comprehensive Snack Parenting Questionnaire (CSPQ): Development and Test-Retest Reliability.
Gevers, Dorus W M; Kremers, Stef P J; de Vries, Nanne K; van Assema, Patricia
2018-04-26
The narrow focus of existing food parenting instruments led us to develop a food parenting practices instrument measuring the full range of food practices constructs with a focus on snacking behavior. We present the development of the questionnaire and our research on the test-retest reliability. The developed Comprehensive Snack Parenting Questionnaire (CSPQ) covers 21 constructs. Test-retest reliability was assessed by calculating intra class correlation coefficients and percentage agreement after two administrations of the CSPQ among a sample of 66 Dutch parents. Test-retest reliability analysis revealed acceptable intra class correlation coefficients (≥0.41) or agreement scores (≥0.60) for all items. These results, together with earlier work, suggest sufficient psychometric characteristics. The comprehensive, but brief CSPQ opens up chances for highly essential but unstudied research questions to understand and predict children’s snack intake. Example applications include studying the interactional nature of food parenting practices or interactions of food parenting with general parenting or child characteristics.
NASA Technical Reports Server (NTRS)
Bavuso, Salvatore J.; Rothmann, Elizabeth; Dugan, Joanne Bechta; Trivedi, Kishor S.; Mittal, Nitin; Boyd, Mark A.; Geist, Robert M.; Smotherman, Mark D.
1994-01-01
The Hybrid Automated Reliability Predictor (HARP) integrated Reliability (HiRel) tool system for reliability/availability prediction offers a toolbox of integrated reliability/availability programs that can be used to customize the user's application in a workstation or nonworkstation environment. HiRel consists of interactive graphical input/output programs and four reliability/availability modeling engines that provide analytical and simulative solutions to a wide host of reliable fault-tolerant system architectures and is also applicable to electronic systems in general. The tool system was designed to be compatible with most computing platforms and operating systems, and some programs have been beta tested, within the aerospace community for over 8 years. Volume 1 provides an introduction to the HARP program. Comprehensive information on HARP mathematical models can be found in the references.
Bell, Steven; Britton, Annie
2015-10-01
Retrospective measures of alcohol intake are becoming increasingly popular; however, the reliability of such measures remains uncertain. This study assessed the reliability of a retrospective decade-based life-course alcohol consumption questionnaire, based on the standardized Alcohol Use Disorder Identification Test-Consumption (AUDIT-C) administered in older age in a well-characterized cohort study. A retrospective alcohol life-grid was administered to 5980 participants (72% male, mean age 70 years) in the Whitehall II study covering frequency of drinking, number of drinks in a typical drinking day and frequency of consuming six or more drinks in a single drinking occasion in the teens (16-19 years) through to the 80s. A subsample of 385 individuals completed a repeat survey to determine test-retest reliability. Retrospective measures were also compared with prospectively ascertained information and used to predict objectively measured systolic blood pressure to test their predictive validity. Across all decades of life, test-retest reliability was generally good (κ range = 0.62-0.78 for frequency, 0.55-0.62 for usual number of drinks and 0.57-0.65 for frequency of consuming six or more drinks in a single occasion). The concordance between prospective and retrospective measures was consistently moderate to high. The life-grid method performed better than a single question in identifying life-time abstainers. Retrospective measures were also related to systolic blood pressure in the manner anticipated. A retrospective decade-based AUDIT-C grid administered in older age provides a relatively reliable measure of alcohol consumption across the life-course. © 2015 The Authors. Addiction published by John Wiley & Sons Ltd on behalf of Society for the Study of Addiction.
How reliable and accurate is the AO/OTA comprehensive classification for adult long-bone fractures?
Meling, Terje; Harboe, Knut; Enoksen, Cathrine H; Aarflot, Morten; Arthursson, Astvaldur J; Søreide, Kjetil
2012-07-01
Reliable classification of fractures is important for treatment allocation and study comparisons. The overall accuracy of scoring applied to a general population of fractures is little known. This study aimed to investigate the accuracy and reliability of the comprehensive Arbeitsgemeinschaft für Osteosynthesefragen/Orthopedic Trauma Association classification for adult long-bone fractures and identify factors associated with poor coding agreement. Adults (>16 years) with long-bone fractures coded in a Fracture and Dislocation Registry at the Stavanger University Hospital during the fiscal year 2008 were included. An unblinded reference code dataset was generated for the overall accuracy assessment by two experienced orthopedic trauma surgeons. Blinded analysis of intrarater reliability was performed by rescoring and of interrater reliability by recoding of a randomly selected fracture sample. Proportion of agreement (PA) and kappa (κ) statistics are presented. Uni- and multivariate logistic regression analyses of factors predicting accuracy were performed. During the study period, 949 fractures were included and coded by 26 surgeons. For the intrarater analysis, overall agreements were κ = 0.67 (95% confidence interval [CI]: 0.64-0.70) and PA 69%. For interrater assessment, κ = 0.67 (95% CI: 0.62-0.72) and PA 69%. The accuracy of surgeons' blinded recoding was κ = 0.68 (95% CI: 0.65- 0.71) and PA 68%. Fracture type, frequency of the fracture, and segment fractured significantly influenced accuracy whereas the coder's experience did not. Both the reliability and accuracy of the comprehensive Arbeitsgemeinschaft für Osteosynthesefragen/Orthopedic Trauma Association classification for long-bone fractures ranged from substantial to excellent. Variations in coding accuracy seem to be related more to the fracture itself than the surgeon. Diagnostic study, level I.
Cape, John; Morris, Elena; Burd, Mary; Buszewicz, Marta
2008-01-01
Background How GPs understand mental health problems determines their treatment choices; however, measures describing GPs' thinking about such problems are not currently available. Aim To develop a measure of the complexity of GP explanations of common mental health problems and to pilot its reliability and validity. Design of study A qualitative development of the measure, followed by inter-rater reliability and validation pilot studies. Setting General practices in North London. Method Vignettes of simulated consultations with patients with mental health problems were videotaped, and an anchored measure of complexity of psychosocial explanation in response to these vignettes was developed. Six GPs, four psychologists, and two lay people viewed the vignettes. Their responses were rated for complexity, both using the anchored measure and independently by two experts in primary care mental health. In a second reliability and revalidation study, responses of 50 GPs to two vignettes were rated for complexity. The GPs also completed a questionnaire to determine their interest and training in mental health, and they completed the Depression Attitudes Questionnaire. Results Inter-rater reliability of the measure of complexity of explanation in both pilot studies was satisfactory (intraclass correlation coefficient = 0.78 and 0.72). The measure correlated with expert opinion as to what constitutes a complex explanation, and the responses of psychologists, GPs, and lay people differed in measured complexity. GPs with higher complexity scores had greater interest, more training in mental health, and more positive attitudes to depression. Conclusion Results suggest that the complexity of GPs' psychosocial explanations about common mental health problems can be reliably and validly assessed by this new standardised measure. PMID:18505616
Tackling the challenges of matching biomedical ontologies.
Faria, Daniel; Pesquita, Catia; Mott, Isabela; Martins, Catarina; Couto, Francisco M; Cruz, Isabel F
2018-01-15
Biomedical ontologies pose several challenges to ontology matching due both to the complexity of the biomedical domain and to the characteristics of the ontologies themselves. The biomedical tracks in the Ontology Matching Evaluation Initiative (OAEI) have spurred the development of matching systems able to tackle these challenges, and benchmarked their general performance. In this study, we dissect the strategies employed by matching systems to tackle the challenges of matching biomedical ontologies and gauge the impact of the challenges themselves on matching performance, using the AgreementMakerLight (AML) system as the platform for this study. We demonstrate that the linear complexity of the hash-based searching strategy implemented by most state-of-the-art ontology matching systems is essential for matching large biomedical ontologies efficiently. We show that accounting for all lexical annotations (e.g., labels and synonyms) in biomedical ontologies leads to a substantial improvement in F-measure over using only the primary name, and that accounting for the reliability of different types of annotations generally also leads to a marked improvement. Finally, we show that cross-references are a reliable source of information and that, when using biomedical ontologies as background knowledge, it is generally more reliable to use them as mediators than to perform lexical expansion. We anticipate that translating traditional matching algorithms to the hash-based searching paradigm will be a critical direction for the future development of the field. Improving the evaluation carried out in the biomedical tracks of the OAEI will also be important, as without proper reference alignments there is only so much that can be ascertained about matching systems or strategies. Nevertheless, it is clear that, to tackle the various challenges posed by biomedical ontologies, ontology matching systems must be able to efficiently combine multiple strategies into a mature matching approach.
Identifying dyspepsia in the Greek population: translation and validation of a questionnaire
Anastasiou, Foteini; Antonakis, Nikos; Chaireti, Georgia; Theodorakis, Pavlos N; Lionis, Christos
2006-01-01
Background Studies on clinical issues, including diagnostic strategies, are considered to be the core content of general practice research. The use of standardised instruments is regarded as an important component for the development of Primary Health Care research capacity. Demand for epidemiological cross-cultural comparisons in the international setting and the use of common instruments and definitions valid to each culture is bigger than ever. Dyspepsia is a common complaint in primary practice but little is known with respect to its incidence in Greece. There are some references about the Helicobacter Pylori infection in patients with functional dyspepsia or gastric ulcer in Greece but there is no specific instrument for the identification of dyspepsia. This paper reports on the validation and translation into Greek, of an English questionnaire for the identification of dyspepsia in the general population and discusses several possibilities of its use in the Greek primary care. Methods The selected English postal questionnaire for the identification of people with dyspepsia in the general population consists of 30 items and was developed in 1995. The translation and cultural adaptation of the questionnaire has been performed according to international standards. For the validation of the instrument the internal consistency of the items was established using the alpha coefficient of Chronbach, the reproducibility (test – retest reliability) was measured by kappa correlation coefficient and the criterion validity was calculated against the diagnosis of the patients' records using also kappa correlation coefficient. Results The final Greek version of the postal questionnaire for the identification of dyspepsia in the general population was reliably translated. The internal consistency of the questionnaire was good, Chronbach's alpha was found to be 0.88 (95% CI: 0.81–0.93), suggesting that all items were appropriate to measure. Kappa coefficient for reproducibility (test – retest reliability) was found 0.66 (95% CI: 0.62–0.71), whereas the kappa analysis for criterion validity was 0.63 (95% CI: 0.36–0.89). Conclusion This study indicates that the Greek translation is comparable with the English-language version in terms of validity and reliability, and is suitable for epidemiological research within the Greek primary health care setting. PMID:16515708
Baschung Pfister, Pierrette; Sterkele, Iris; Maurer, Britta; de Bie, Rob A.; Knols, Ruud H.
2018-01-01
Manual muscle testing (MMT) and hand-held dynamometry (HHD) are commonly used in people with inflammatory myopathy (IM), but their clinimetric properties have not yet been sufficiently studied. To evaluate the reliability and validity of MMT and HHD, maximum isometric strength was measured in eight muscle groups across three measurement events. To evaluate reliability of HHD, intra-class correlation coefficients (ICC), the standard error of measurements (SEM) and smallest detectable changes (SDC) were calculated. To measure reliability of MMT linear Cohen`s Kappa was computed for single muscle groups and ICC for total score. Additionally, correlations between MMT8 and HHD were evaluated with Spearman Correlation Coefficients. Fifty people with myositis (56±14 years, 76% female) were included in the study. Intra-and interrater reliability of HHD yielded excellent ICCs (0.75–0.97) for all muscle groups, except for interrater reliability of ankle extension (0.61). The corresponding SEMs% ranged from 8 to 28% and the SDCs% from 23 to 65%. MMT8 total score revealed excellent intra-and interrater reliability (ICC>0.9). Intrarater reliability of single muscle groups was substantial for shoulder and hip abduction, elbow and neck flexion, and hip extension (0.64–0.69); moderate for wrist (0.53) and knee extension (0.49) and fair for ankle extension (0.35). Interrater reliability was moderate for neck flexion (0.54) and hip abduction (0.44); fair for shoulder abduction, elbow flexion, wrist and ankle extension (0.20–0.33); and slight for knee extension (0.08). Correlations between the two tests were low for wrist, knee, ankle, and hip extension; moderate for elbow flexion, neck flexion and hip abduction; and good for shoulder abduction. In conclusion, the MMT8 total score is a reliable assessment to consider general muscle weakness in people with myositis but not for single muscle groups. In contrast, our results confirm that HHD can be recommended to evaluate strength of single muscle groups. PMID:29596450
Broyles, S T; Drazba, K T; Church, T S; Chaput, J-P; Fogelholm, M; Hu, G; Kuriyan, R; Kurpad, A; Lambert, E V; Maher, C; Maia, J; Matsudo, V; Olds, T; Onywera, V; Sarmiento, O L; Standage, M; Tremblay, M S; Tudor-Locke, C; Zhao, P; Katzmarzyk, P T
2015-01-01
Objectives: Schools are an important setting to enable and promote physical activity. Researchers have created a variety of tools to perform objective environmental assessments (or ‘audits') of other settings, such as neighborhoods and parks; yet, methods to assess the school physical activity environment are less common. The purpose of this study is to describe the approach used to objectively measure the school physical activity environment across 12 countries representing all inhabited continents, and to report on the reliability and feasibility of this methodology across these diverse settings. Methods: The International Study of Childhood Obesity, Lifestyle and the Environment (ISCOLE) school audit tool (ISAT) data collection required an in-depth training (including field practice and certification) and was facilitated by various supporting materials. Certified data collectors used the ISAT to assess the environment of all schools enrolled in ISCOLE. Sites completed a reliability audit (simultaneous audits by two independent, certified data collectors) for a minimum of two schools or at least 5% of their school sample. Item-level agreement between data collectors was assessed with both the kappa statistic and percent agreement. Inter-rater reliability of school summary scores was measured using the intraclass correlation coefficient. Results: Across the 12 sites, 256 schools participated in ISCOLE. Reliability audits were conducted at 53 schools (20.7% of the sample). For the assessed environmental features, inter-rater reliability (kappa) ranged from 0.37 to 0.96; 18 items (42%) were assessed with almost perfect reliability (κ=0.80–0.96), and a further 24 items (56%) were assessed with substantial reliability (κ=0.61–0.79). Likewise, scores that summarized a school's support for physical activity were highly reliable, with the exception of scores assessing aesthetics and perceived suitability of the school grounds for sport, informal games and general play. Conclusions: This study suggests that the ISAT can be used to conduct reliable objective audits of the school physical activity environment across diverse, international school settings. PMID:27152183
GP preferences for information systems: conjoint analysis of speed, reliability, access and users.
Wyatt, Jeremy C; Batley, Richard P; Keen, Justin
2010-10-01
To elicit the preferences and trade-offs of UK general practitioners about key features of health information systems, to help inform the design of such systems in future. A stated choice study to uncover implicit preferences based on a binary choice between scenarios presented in random order. were all 303 general practice members of the UK Internet service provider, Medix who were approached by email to participate. The main outcome measure was the number of seconds delay in system response that general practitioners were willing to trade off for each key system feature: the reliability of the system, the sites from which the system could be accessed and which staff are able to view patient data. Doctors valued speed of response most in information systems but would be prepared to wait 28 seconds to access a system in exchange for improved reliability from 95% to 99%, a further 2 seconds for an improvement to 99.9% and 27 seconds for access to data from anywhere including their own home compared with one place in a single health care premises. However, they would require a system that was 14 seconds faster to compensate for allowing social care as well as National Health Service staff to read patient data. These results provide important new evidence about which system characteristics doctors value highly, and hence which characteristics designers need to focus on when large scale health information systems are planned. © 2010 Blackwell Publishing Ltd.
Ngune, Irene; Jiwa, Moyez; McManus, Alexandra; Hughes, Jeff; Parsons, Richard; Hodder, Rupert; Entriken, Fiona
2014-01-01
Treatment for colorectal cancer (CRC) may result in physical, social, and psychological needs that affect patients' quality of life post-treatment. A comprehensive assessment should be conducted to identify these needs in CRC patients post treatment, however, there is a lack of tools and processes available in general practice. This study aimed to develop a patient-completed needs screening tool that identifies potentially unmet physical, psychological, and social needs in CRC and facilitates consultation with a general practitioner (GP) to address these needs. The development of the self-assessment tool for patients (SATp) included a review of the literature; face and content validity with reference to an expert panel; psychometric testing including readability, internal consistency, and test-retest reliability; and usability in clinical practice. The SATp contains 25 questions. The tool had internal consistency (Cronbach's alpha 0.70-0.97), readability (reading ease 82.5%), and test-retest reliability (kappa 0.689-1.000). A total of 66 patients piloted the SATp. Participants were on average 69.2 (SD 9.9) years old and had a median follow-up period of 26.7 months. The SATp identified a total of 547 needs (median 7 needs/per patient; IQR [3-12.25]). Needs were categorised into social (175[32%]), psychological (175[32%]), and physical (197[36%]) domains. SATp is a reliable self-assessment tool useful for identifying CRC patient needs. Further testing of this tool for validity and usability is underway.
Purba, Fredrick Dermawan; Hunfeld, Joke A M; Iskandarsyah, Aulia; Fitriana, Titi Sahidah; Sadarjoen, Sawitri S; Passchier, Jan; Busschbach, Jan J V
2018-01-01
The objective of this study is to obtain population norms and to assess test-retest reliability of EQ-5D-5L and WHOQOL-BREF for the Indonesian population. A representative sample of 1056 people aged 17-75 years was recruited from the Indonesian general population. We used a multistage stratified quota sampling method with respect to residence, gender, age, education level, religion and ethnicity. Respondents completed EQ-5D-5L and WHOQOL-BREF with help from an interviewer. Norms data for both instruments were reported. For the test-retest evaluations, a sub-sample of 206 respondents completed both instruments twice. The total sample and test-retest sub-sample were representative of the Indonesian general population. The EQ-5D-5L shows almost perfect agreement between the two tests (Gwet's AC: 0.85-0.99 and percentage agreement: 90-99%) regarding the five dimensions. However, the agreement of EQ-VAS and index scores can be considered as poor (ICC: 0.45 and 0.37 respectively). For the WHOQOL-BREF, ICCs of the four domains were between 0.70 and 0.79, which indicates moderate to good agreement. For EQ-5D-5L, it was shown that female and older respondents had lower EQ-index scores, whilst rural, younger and higher-educated respondents had higher EQ-VAS scores. For WHOQOL-BREF: male, younger, higher-educated, high-income respondents had the highest scores in most of the domains, overall quality of life, and health satisfaction. This study provides representative estimates of self-reported health status and quality of life for the general Indonesian population as assessed by the EQ-5D-5L and WHOQOL-BREF instruments. The descriptive system of the EQ-5D-5L and the WHOQOL-BREF have high test-retest reliability while the EQ-VAS and the index score of EQ-5D-5L show poor agreement between the two tests. Our results can be useful to researchers and clinicians who can compare their findings with respect to these concepts with those of the Indonesian general population.
Validity and reliability of the South African health promoting schools monitoring questionnaire
Struthers, Patricia; de Koker, Petra; Lerebo, Wondwossen; Blignaut, Renette J.
2017-01-01
Summary Health promoting schools, as conceptualised by the World Health Organisation, have been developed in many countries to facilitate the health-education link. In 1994, the concept of health promoting schools was introduced in South Africa. In the process of becoming a health promoting school, it is important for schools to monitor and evaluate changes and developments taking place. The Health Promoting Schools (HPS) Monitoring Questionnaire was developed to obtain opinions of students about their school as a health promoting school. It comprises 138 questions in seven sections: socio-demographic information; General health promotion programmes; health related Skills and knowledge; Policies; Environment; Community-school links; and support Services. This paper reports on the reliability and face validity of the HPS Monitoring Questionnaire. Seven experts reviewed the questionnaire and agreed that it has satisfactory face validity. A test-retest reliability study was conducted with 83 students in three high schools in Cape Town, South Africa. The kappa-coefficients demonstrate mostly fair (κ-scores between 0.21 and 0.4) to moderate (κ-scores between 0.41 and 0.6) agreement between test-retest General and Environment items; poor (κ-scores up to 0.2) agreement between Skills and Community test-retest items, fair agreement between Policies items, and for most of the questions focussing on Services a fair agreement was found. The study is a first effort at providing a tool that may be used to monitor and evaluate students’ opinions about changes in health promoting schools. Although the HPS Monitoring Questionnaire has face validity, the results of the reliability testing were inconclusive. Further research is warranted. PMID:27694227
Badia, X; Mascaró, J M; Lozano, R
1999-10-01
The aim of this study was to assess the feasibility, validity, reliability and sensitivity to change of a Spanish version of the Dermatology Life Quality Index (DLQI) in patients with mild to moderate eczema and psoriasis who were treated with topical corticosteroids. The final study sample comprised 237 patients (48% eczema). Discriminant validity was tested by comparing patients' scores with those of a random sample of the general population (n = 100), and convergent validity by analysing correlations between DLQI scores, measures of clinical severity, and domain scores on the Nottingham Health Profile (NHP). Internal consistency and test-retest reliability were tested in clinically stable patients (n = 94), and responsiveness in a clinically unstable group (n = 143) initiating treatment with topical corticosteroids. Patient scores were significantly higher than general population scores (4.3 vs. 0. 27, P < 0.001). Correlations with NHP domains ranged from 0.12 to 0. 32, and there was significant correlation with clinical measures (r = 0.26, P < 0.001). Reliability was good (Cronbach's alpha = 0.83; intraclass correlation coefficient = 0.88), and the instrument proved responsive to change (effect size for the total group of de novo patients = 0.70), though the great majority of changes occurred in items 1 and 2. The NHP Emotional Reactions and Mobility domains were more responsive than some DLQI domains. In clinical trials of treatments for mild to moderate eczema and psoriasis, it is likely that only items 1 and 2 of the DLQI will be needed, and it is probably advisable to include generic instruments alongside the DLQI.
Validation of the Brazilian Portuguese Version of Geriatric Anxiety Inventory--GAI-BR.
Massena, Patrícia Nitschke; de Araújo, Narahyana Bom; Pachana, Nancy; Laks, Jerson; de Pádua, Analuiza Camozzato
2015-07-01
The Geriatric Anxiety Inventory (GAI) is a recently developed scale aiming to evaluate symptoms of anxiety in later life. This 20-item scale uses dichotomous answers highlighting non-somatic anxiety complaints of elderly people. The present study aimed to evaluate the psychometric properties of the Brazilian Portuguese version GAI (GAI-BR) in a sample from community and outpatient psychogeriatric clinic. A mixed convenience sample of 72 subjects was recruited for answering the research protocol. The interview procedures were structured with questionnaires about sociodemographic data, clinical health status, anxiety, and depression previously validated instruments, Mini-Mental State Examination, Mini International Neuropsychiatric Interview, and GAI-BR. Twenty-two percent of the sample were interviewed twice for test-retest reliability. For internal consistency analyses, the Cronbach's α test was applied. The Spearman correlation test was applied to evaluate the test-retest GAI-BR reliability. A ROC (receiver operating characteristic) curve study was made to estimate the GAI-BR area under curve, cut-off points, sensitivity, and specificity for the Generalized Anxiety Disorder diagnosis. The GAI-BR version showed high internal consistency (Cronbach's α = 0.91) and strong and significant test-retest reliability (ρ = 0.85, p < 0.001). It also showed moderate and significant correlation with the Beck Anxiety Inventory (ρ = 0.68, p < 0.001) and the State-Trait Anxiety Inventory (ρ = 0.61, p < 0.001) showing evidence of concurrent validation. The cut-off point of 13 estimated by ROC curve analyses showed sensitivity of 83.3% and specificity of 84.6% to detect Generalized Anxiety Disorder (DSM-IV). GAI-BR has demonstrated very good psychometric properties and can be a reliable instrument to measure anxiety in Brazilian elderly people.
Kwan, Yu Heng; Fong, Warren Weng Seng; Lui, Nai Lee; Yong, Si Ting; Cheung, Yin Bun; Malhotra, Rahul; Østbye, Truls; Thumboo, Julian
2016-12-01
The Short Form 36 Health Survey (SF-36) is a popular health-related quality of life (HrQoL) tool. However, few studies have assessed its psychometric properties in patients with spondyloarthritis (SpA). We therefore aimed to assess the reliability and validity of the SF-36 in patients with SpA in Singapore. Cross-sectional data from a registry of 196 SpA patients recruited from a dedicated tertiary referral clinic in Singapore from 2011 to 2014 was used. Analyses were guided by the COnsensus-based Standards for the selection of health Measurement INstruments framework. Internal consistency reliability was assessed using Cronbach's alpha. Construct validity was assessed through 33 a priori hypotheses by correlations of the eight subscales and two summary scores of SF-36 with other health outcomes. Known-group construct validity was assessed by comparison of the means of the subscales and summary scores of the SF-36 of SpA patients and the general population of Singapore using student's t tests. Among 196 patients (155 males (79.0 %), median (range) age: 36 (17-70), 166 Chinese (84.6 %)), SF-36 scales showed high internal consistency ranging from 0.88 to 0.90. Convergent construct validity was supported as shown by fulfillment of all hypotheses. Divergent construct validity was supported, as SF-36 MCS was not associated with PGA, pain and HAQ. Known-group construct validity showed SpA patients had lower scores of 3.8-12.5 when compared to the general population at p < 0.001. This study supports the SF-36 as a valid and reliable measure of HrQoL for use in patients with SpA at a single time point.
Validity and reliability of the South African health promoting schools monitoring questionnaire.
Struthers, Patricia; Wegner, Lisa; de Koker, Petra; Lerebo, Wondwossen; Blignaut, Renette J
2017-04-01
Health promoting schools, as conceptualised by the World Health Organisation, have been developed in many countries to facilitate the health-education link. In 1994, the concept of health promoting schools was introduced in South Africa. In the process of becoming a health promoting school, it is important for schools to monitor and evaluate changes and developments taking place. The Health Promoting Schools (HPS) Monitoring Questionnaire was developed to obtain opinions of students about their school as a health promoting school. It comprises 138 questions in seven sections: socio-demographic information; General health promotion programmes; health related Skills and knowledge; Policies; Environment; Community-school links; and support Services. This paper reports on the reliability and face validity of the HPS Monitoring Questionnaire. Seven experts reviewed the questionnaire and agreed that it has satisfactory face validity. A test-retest reliability study was conducted with 83 students in three high schools in Cape Town, South Africa. The kappa-coefficients demonstrate mostly fair (κ-scores between 0.21 and 0.4) to moderate (κ-scores between 0.41 and 0.6) agreement between test-retest General and Environment items; poor (κ-scores up to 0.2) agreement between Skills and Community test-retest items, fair agreement between Policies items, and for most of the questions focussing on Services a fair agreement was found. The study is a first effort at providing a tool that may be used to monitor and evaluate students' opinions about changes in health promoting schools. Although the HPS Monitoring Questionnaire has face validity, the results of the reliability testing were inconclusive. Further research is warranted. © The Author 2016. Published by Oxford University Press.
García-Ramos, Amador; Feriche, Belén; Pérez-Castilla, Alejandro; Padial, Paulino; Jaric, Slobodan
2017-07-01
This study aimed to explore the strength of the force-velocity (F-V) relationship of lower limb muscles and the reliability of its parameters (maximum force [F 0 ], slope [a], maximum velocity [V 0 ], and maximum power [P 0 ]). Twenty-three men were tested in two different jump types (squat and countermovement jump: SJ and CMJ), performed under two different loading conditions (free weight and Smith machine: Free and Smith) with 0, 17, 30, 45, 60, and 75 kg loads. The maximum and averaged values of F and V were obtained for the F-V relationship modelling. All F-V relationships were strong and linear independently whether observed from the averaged across the participants (r ≥ 0.98) or individual data (r = 0.94-0.98), while their parameters were generally highly reliable (F 0 [CV: 4.85%, ICC: 0.87], V 0 [CV: 6.10%, ICC: 0.82], a [CV: 10.5%, ICC: 0.81], and P 0 [CV: 3.5%, ICC: 0.93]). Both the strength of the F-V relationships and the reliability of their parameters were significantly higher for (1) the CMJ over the SJ, (2) the Free over the Smith loading type, and (3) the maximum over the averaged F and V variables. In conclusion, although the F-V relationships obtained from all the jumps tested were linear and generally highly reliable, the less appropriate choice for testing the F-V relationship could be through the averaged F and V data obtained from the SJ performed either in a Free weight or in a Smith machine. Insubstantial differences exist among the other combinations tested.
Dikken, Jeroen; Hoogerduijn, Jita G; Lagerwey, Mary D; Shortridge-Baggett, Lillie; Klaassen, Sharon; Schuurmans, Marieke J
In clinical practice, identifying positive and negative attitudes toward older patients is very important to improve quality of care provided to them. The Older People in Acute Care Survey - United States (OPACS-US) is an instrument measuring hospital nurses attitudes regarding older patients. However, psychometrics have never been assessed. Furthermore, knowledge being related to attitude and behavior should also be measured complementing the OPACS-US. The purpose of this study was to assess structural validity and reliability of the OPACS-US and assess whether the OPACS-US can be complemented with the Knowledge about Older Patients-Quiz (KOP-Q). A multicenter cross sectional design was conducted. Registered nurses (n = 130, mean age 39,9 years; working experience 14,6 years) working in four general hospitals were included in the study. Nurses completed the OPACS-US section A: practice experiences, B: general opinion and the KOP-Q online. Findings demonstrated that the OPACS-US is a valid and reliable survey instrument that measures practice experiences and general opinion. Furthermore, the OPACS-US can be combined with the KOP-Q adding a knowledge construct, and is ready for use within education and/or quality improvement programs in the USA. Copyright © 2017 Elsevier Inc. All rights reserved.
Babor, Thomas F; Xuan, Ziming; Proctor, Dwayne
2008-03-01
The purposes of this study were to develop reliable procedures to monitor the content of alcohol advertisements broadcast on television and in other media, and to detect violations of the content guidelines of the alcohol industry's self-regulation codes. A set of rating-scale items was developed to measure the content guidelines of the 1997 version of the U.S. Beer Institute Code. Six focus groups were conducted with 60 college students to evaluate the face validity of the items and the feasibility of the procedure. A test-retest reliability study was then conducted with 74 participants, who rated five alcohol advertisements on two occasions separated by 1 week. Average correlations across all advertisements using three reliability statistics (r, rho, and kappa) were almost all statistically significant and the kappas were good for most items, which indicated high test-retest agreement. We also found high interrater reliabilities (intraclass correlations) among raters for item-level and guideline-level violations, indicating that regardless of the specific item, raters were consistent in their general evaluations of the advertisements. Naïve (untrained) raters can provide consistent (reliable) ratings of the main content guidelines proposed in the U.S. Beer Institute Code. The rating procedure may have future applications for monitoring compliance with industry self-regulation codes and for conducting research on the ways in which alcohol advertisements are perceived by young adults and other vulnerable populations.
COSTING MODELS FOR WATER SUPPLY DISTRIBUTION: PART III- PUMPS, TANKS, AND RESERVOIRS
Distribution systems are generally designed to ensure hydraulic reliability. Storage tanks, reservoirs and pumps are critical in maintaining this reliability. Although storage tanks, reservoirs and pumps are necessary for maintaining adequate pressure, they may also have a negati...
Space Operations Center System Analysis: Requirements for a Space Operations Center, revision A
NASA Technical Reports Server (NTRS)
Woodcock, G. R.
1982-01-01
The system and program requirements for a space operations center as defined by systems analysis studies are presented as a guide for future study and systems definition. Topics covered include general requirements for safety, maintainability, and reliability, service and habitat modules, the health maintenance facility; logistics modules; the docking tunnel; and subsystem requirements (structures, electrical power, environmental control/life support; extravehicular activity; data management; communications and tracking; docking/berthing; flight control/propulsion; and crew support). Facilities for flight support, construction, satellite and mission servicing, and fluid storage are included as well as general purpose support equipment.
Bartels, Meike; Cath, Danielle C.; Boomsma, Dorret I.
2008-01-01
The factor structure of the Dutch translation of the Autism-Spectrum Quotient (AQ; a continuous, quantitative measure of autistic traits) was evaluated with confirmatory factor analyses in a large general population and student sample. The criterion validity of the AQ was examined in three matched patient groups (autism spectrum conditions (ASC), social anxiety disorder, and obsessive–compulsive disorder). A two factor model, consisting of a “Social interaction” factor and “Attention to detail” factor could be identified. The internal consistency and test–retest reliability of the AQ were satisfactory. High total AQ and factor scores were specific to ASC patients. Men scored higher than women and science students higher than non-science students. The Dutch translation of the AQ is a reliable instrument to assess autism spectrum conditions. PMID:18302013
The general movement assessment in non-European low- and middle-income countries.
Tomantschger, Iris; Herrero, Dafne; Einspieler, Christa; Hamamura, Cristina; Voos, Mariana Calil; Marschik, Peter B
2018-02-05
Abnormal general movements are among the most reliable markers for cerebral palsy. General movements are part of the spontaneous motor repertoire and are present from early fetal life until the end of the first half year after term. In addition to its high sensitivity (98%) and specificity (91%), the assessment of general movements is non-invasive and time- and cost-efficient. It is therefore ideal for assessing the integrity of the young nervous system, most notably in lowresource settings. Studies on the general movements assessment in low- and middle-income countries such as China, India, Iran, or South Africa are still rare but increasing. In Brazil, too, researchers have demonstrated that the evaluation of general movements adds to the functional assessment of the young nervous system. Applying general movements assessment in vulnerable populations in Brazil is therefore highly recommended.
NASA Technical Reports Server (NTRS)
Motyka, P.
1983-01-01
A methodology for quantitatively analyzing the reliability of redundant avionics systems, in general, and the dual, separated Redundant Strapdown Inertial Measurement Unit (RSDIMU), in particular, is presented. The RSDIMU is described and a candidate failure detection and isolation system presented. A Markov reliability model is employed. The operational states of the system are defined and the single-step state transition diagrams discussed. Graphical results, showing the impact of major system parameters on the reliability of the RSDIMU system, are presented and discussed.
Analysis of whisker-toughened CMC structural components using an interactive reliability model
NASA Technical Reports Server (NTRS)
Duffy, Stephen F.; Palko, Joseph L.
1992-01-01
Realizing wider utilization of ceramic matrix composites (CMC) requires the development of advanced structural analysis technologies. This article focuses on the use of interactive reliability models to predict component probability of failure. The deterministic William-Warnke failure criterion serves as theoretical basis for the reliability model presented here. The model has been implemented into a test-bed software program. This computer program has been coupled to a general-purpose finite element program. A simple structural problem is presented to illustrate the reliability model and the computer algorithm.
The nonverbal expression of pride: evidence for cross-cultural recognition.
Tracy, Jessica L; Robins, Richard W
2008-03-01
The present research tests whether recognition for the nonverbal expression of pride generalizes across cultures. Study 1 provided the first evidence for cross-cultural recognition of pride, demonstrating that the expression generalizes across Italy and the United States. Study 2 found that the pride expression generalizes beyond Western cultures; individuals from a preliterate, highly isolated tribe in Burkina Faso, West Africa, reliably recognized pride, regardless of whether it was displayed by African or American targets. These Burkinabe participants were unlikely to have learned the pride expression through cross-cultural transmission, so their recognition suggests that pride may be a human universal. Studies 3 and 4 used drawn figures to systematically manipulate the ethnicity and gender of targets showing the expression, and demonstrated that pride recognition generalizes across male and female targets of African, Asian, and Caucasian descent. Discussion focuses on the implications of the findings for the universality of the pride expression.
Comparison of fMRI paradigms assessing visuospatial processing: Robustness and reproducibility
Herholz, Peer; Zimmermann, Kristin M.; Westermann, Stefan; Frässle, Stefan; Jansen, Andreas
2017-01-01
The development of brain imaging techniques, in particular functional magnetic resonance imaging (fMRI), made it possible to non-invasively study the hemispheric lateralization of cognitive brain functions in large cohorts. Comprehensive models of hemispheric lateralization are, however, still missing and should not only account for the hemispheric specialization of individual brain functions, but also for the interactions among different lateralized cognitive processes (e.g., language and visuospatial processing). This calls for robust and reliable paradigms to study hemispheric lateralization for various cognitive functions. While numerous reliable imaging paradigms have been developed for language, which represents the most prominent left-lateralized brain function, the reliability of imaging paradigms investigating typically right-lateralized brain functions, such as visuospatial processing, has received comparatively less attention. In the present study, we aimed to establish an fMRI paradigm that robustly and reliably identifies right-hemispheric activation evoked by visuospatial processing in individual subjects. In a first study, we therefore compared three frequently used paradigms for assessing visuospatial processing and evaluated their utility to robustly detect right-lateralized brain activity on a single-subject level. In a second study, we then assessed the test-retest reliability of the so-called Landmark task–the paradigm that yielded the most robust results in study 1. At the single-voxel level, we found poor reliability of the brain activation underlying visuospatial attention. This suggests that poor signal-to-noise ratios can become a limiting factor for test-retest reliability. This represents a common detriment of fMRI paradigms investigating visuospatial attention in general and therefore highlights the need for careful considerations of both the possibilities and limitations of the respective fMRI paradigm–in particular, when being interested in effects at the single-voxel level. Notably, however, when focusing on the reliability of measures of hemispheric lateralization (which was the main goal of study 2), we show that hemispheric dominance (quantified by the lateralization index, LI, with |LI| >0.4) of the evoked activation could be robustly determined in more than 62% and, if considering only two categories (i.e., left, right), in more than 93% of our subjects. Furthermore, the reliability of the lateralization strength (LI) was “fair” to “good”. In conclusion, our results suggest that the degree of right-hemispheric dominance during visuospatial processing can be reliably determined using the Landmark task, both at the group and single-subject level, while at the same time stressing the need for future refinements of experimental paradigms and more sophisticated fMRI data acquisition techniques. PMID:29059201
Junkes, Monica C; Fraiz, Fabian C; Sardenberg, Fernanda; Lee, Jessica Y; Paiva, Saul M; Ferreira, Fernanda M
2015-01-01
The aim of the present study was to translate, perform the cross-cultural adaptation of the Rapid Estimate of Adult Literacy in Dentistry to Brazilian-Portuguese language and test the reliability and validity of this version. After translation and cross-cultural adaptation, interviews were conducted with 258 parents/caregivers of children in treatment at the pediatric dentistry clinics and health units in Curitiba, Brazil. To test the instrument's validity, the scores of Brazilian Rapid Estimate of Adult Literacy in Dentistry (BREALD-30) were compared based on occupation, monthly household income, educational attainment, general literacy, use of dental services and three dental outcomes. The BREALD-30 demonstrated good internal reliability. Cronbach's alpha ranged from 0.88 to 0.89 when words were deleted individually. The analysis of test-retest reliability revealed excellent reproducibility (intraclass correlation coefficient = 0.983 and Kappa coefficient ranging from moderate to nearly perfect). In the bivariate analysis, BREALD-30 scores were significantly correlated with the level of general literacy (rs = 0.593) and income (rs = 0.327) and significantly associated with occupation, educational attainment, use of dental services, self-rated oral health and the respondent's perception regarding his/her child's oral health. However, only the association between the BREALD-30 score and the respondent's perception regarding his/her child's oral health remained significant in the multivariate analysis. The BREALD-30 demonstrated satisfactory psychometric properties and is therefore applicable to adults in Brazil.
Junkes, Monica C.; Fraiz, Fabian C.; Sardenberg, Fernanda; Lee, Jessica Y.; Paiva, Saul M.; Ferreira, Fernanda M.
2015-01-01
Objective The aim of the present study was to translate, perform the cross-cultural adaptation of the Rapid Estimate of Adult Literacy in Dentistry to Brazilian-Portuguese language and test the reliability and validity of this version. Methods After translation and cross-cultural adaptation, interviews were conducted with 258 parents/caregivers of children in treatment at the pediatric dentistry clinics and health units in Curitiba, Brazil. To test the instrument's validity, the scores of Brazilian Rapid Estimate of Adult Literacy in Dentistry (BREALD-30) were compared based on occupation, monthly household income, educational attainment, general literacy, use of dental services and three dental outcomes. Results The BREALD-30 demonstrated good internal reliability. Cronbach’s alpha ranged from 0.88 to 0.89 when words were deleted individually. The analysis of test-retest reliability revealed excellent reproducibility (intraclass correlation coefficient = 0.983 and Kappa coefficient ranging from moderate to nearly perfect). In the bivariate analysis, BREALD-30 scores were significantly correlated with the level of general literacy (rs = 0.593) and income (rs = 0.327) and significantly associated with occupation, educational attainment, use of dental services, self-rated oral health and the respondent’s perception regarding his/her child's oral health. However, only the association between the BREALD-30 score and the respondent’s perception regarding his/her child's oral health remained significant in the multivariate analysis. Conclusion The BREALD-30 demonstrated satisfactory psychometric properties and is therefore applicable to adults in Brazil. PMID:26158724
Reliability Assessment of Graphite Specimens under Multiaxial Stresses
NASA Technical Reports Server (NTRS)
Sookdeo, Steven; Nemeth, Noel N.; Bratton, Robert L.
2008-01-01
An investigation was conducted to predict the failure strength response of IG-100 nuclear grade graphite exposed to multiaxial stresses. As part of this effort, a review of failure criteria accounting for the stochastic strength response is provided. The experimental work was performed in the early 1990s at the Oak Ridge National Laboratory (ORNL) on hollow graphite tubes under the action of axial tensile loading and internal pressurization. As part of the investigation, finite-element analysis (FEA) was performed and compared with results of FEA from the original ORNL report. The new analysis generally compared well with the original analysis, although some discrepancies in the location of peak stresses was noted. The Ceramics Analysis and Reliability Evaluation of Structures Life prediction code (CARES/Life) was used with the FEA results to predict the quadrants I (tensile-tensile) and quadrant IV (compression-tension) strength response of the graphite tubes for the principle of independent action (PIA), the Weibull normal stress averaging (NSA), and the Batdorf multiaxial failure theories. The CARES/Life reliability analysis showed that all three failure theories gave similar results in quadrant I but that in quadrant IV, the PIA and Weibull normal stress-averaging theories were not conservative, whereas the Batdorf theory was able to correlate well with experimental results. The conclusion of the study was that the Batdorf theory should generally be used to predict the reliability response of graphite and brittle materials in multiaxial loading situations.
NASA Technical Reports Server (NTRS)
Ciciora, J. A.; Leonard, S. D.; Johnson, N.; Amell, J.
1984-01-01
In order to derive general design guidelines for automated systems a study was conducted on the utilization and acceptance of existing automated systems as currently employed in several commercial fields. Four principal study area were investigated by means of structured interviews, and in some cases questionnaires. The study areas were aviation, a both scheduled airline and general commercial aviation; process control and factory applications; office automation; and automation in the power industry. The results of over eighty structured interviews were analyzed and responses categoried as various human factors issues for use by both designers and users of automated equipment. These guidelines address such items as general physical features of automated equipment; personnel orientation, acceptance, and training; and both personnel and system reliability.
Panagiotopoulou, O.; Wilshin, S. D.; Rayfield, E. J.; Shefelbine, S. J.; Hutchinson, J. R.
2012-01-01
Finite element modelling is well entrenched in comparative vertebrate biomechanics as a tool to assess the mechanical design of skeletal structures and to better comprehend the complex interaction of their form–function relationships. But what makes a reliable subject-specific finite element model? To approach this question, we here present a set of convergence and sensitivity analyses and a validation study as an example, for finite element analysis (FEA) in general, of ways to ensure a reliable model. We detail how choices of element size, type and material properties in FEA influence the results of simulations. We also present an empirical model for estimating heterogeneous material properties throughout an elephant femur (but of broad applicability to FEA). We then use an ex vivo experimental validation test of a cadaveric femur to check our FEA results and find that the heterogeneous model matches the experimental results extremely well, and far better than the homogeneous model. We emphasize how considering heterogeneous material properties in FEA may be critical, so this should become standard practice in comparative FEA studies along with convergence analyses, consideration of element size, type and experimental validation. These steps may be required to obtain accurate models and derive reliable conclusions from them. PMID:21752810
Validity and Reliability of the 8-Item Work Limitations Questionnaire.
Walker, Timothy J; Tullar, Jessica M; Diamond, Pamela M; Kohl, Harold W; Amick, Benjamin C
2017-12-01
Purpose To evaluate factorial validity, scale reliability, test-retest reliability, convergent validity, and discriminant validity of the 8-item Work Limitations Questionnaire (WLQ) among employees from a public university system. Methods A secondary analysis using de-identified data from employees who completed an annual Health Assessment between the years 2009-2015 tested research aims. Confirmatory factor analysis (CFA) (n = 10,165) tested the latent structure of the 8-item WLQ. Scale reliability was determined using a CFA-based approach while test-retest reliability was determined using the intraclass correlation coefficient. Convergent/discriminant validity was tested by evaluating relations between the 8-item WLQ with health/performance variables for convergent validity (health-related work performance, number of chronic conditions, and general health) and demographic variables for discriminant validity (gender and institution type). Results A 1-factor model with three correlated residuals demonstrated excellent model fit (CFI = 0.99, TLI = 0.99, RMSEA = 0.03, and SRMR = 0.01). The scale reliability was acceptable (0.69, 95% CI 0.68-0.70) and the test-retest reliability was very good (ICC = 0.78). Low-to-moderate associations were observed between the 8-item WLQ and the health/performance variables while weak associations were observed between the demographic variables. Conclusions The 8-item WLQ demonstrated sufficient reliability and validity among employees from a public university system. Results suggest the 8-item WLQ is a usable alternative for studies when the more comprehensive 25-item WLQ is not available.
Gaining perspective on what we've lost: the reliability of encoded anecdotes in historical ecology.
Al-Abdulrazzak, Dalal; Naidoo, Robin; Palomares, Maria Lourdes D; Pauly, Daniel
2012-01-01
Historical data are essential in fisheries management and conservation, especially for species that suffered significant population declines prior to ecological data collection. Within the field of historical marine ecology, studies have relied on anecdotal evidence, such as written accounts by explorers and interviews of different generations of resource users, to demonstrate the former abundance of certain species and the extent of their ranges. Yet, do we all agree on how these anecdotes are interpreted? This study examines the way that different people interpret anecdotes extracted from historical narratives. We outsource a survey to 50 randomly selected people using Amazon Mechanical Turk (www.mturk.com) and ask them to 'code' historical anecdotes based on their perceived abundance of species. We perform intercoder reliability tests to show that people's perceptions of historical anecdotes are generally consistent. The results speak to the reliability of using people's perceptions to acquire quantitative data, and provide novel insights into the use of anecdotal evidence to inform historical ecology.
Ponton-Carss, Alicia; Hutchison, Carol; Violato, Claudio
2011-10-01
The purpose of this study was to investigate the reliability and validity of a performance assessment of communication, professionalism, and surgical skills competencies for surgery residents. Fourteen residents from the general surgery program of the University of Calgary were assessed in 7 surgical simulation stations that included communication and professionalism skills. The internal consistency reliability of the checklists and global rating scales combined was adequate for communication (α = .75-.92) and surgical skills (α = .86-.96), but not for professionalism (α = 0). There was evidence of validity as surgical skills performance improved as a function of postgraduate year level but not for the professionalism checklist. Surgical skills and communication correlated in the 2 stations assessed (r = .55 and .57; P < .05). There is evidence for both reliability and validity for simultaneously assessing surgical skills and communication skills. Further instrument development is required to assess professionalism in a structured examination context. Copyright © 2011 Elsevier Inc. All rights reserved.
Reliability of concussion history in former professional football players.
Kerr, Zachary Y; Marshall, Stephen W; Guskiewicz, Kevin M
2012-03-01
The reliability of athletes to recall and self-report a concussion history has never been quantified. This study examined the reliability of the self-report concussion history measure and explored determinants of recall in the number of self-reported concussions in a group of retired professional football players. In 2001, a short questionnaire was administered to a cohort of former professional football players to ascertain the number of self-reported concussions they sustained during their professional playing careers. In 2010, the same instrument was readministered to a subset (n = 899) of the original cohort to assess reliability. Overall reliability was moderate (weighted Cohen κ = 0.48). The majority (62.1%) reported the same number of concussions in both administrations (2001 and 2010); 31.4% reported more concussions in the second administration. Compared with the "same number reported" group, the "greater number reported" group had more deficits in the second administration in their Short Form 36 physical health (composite score combining physical functioning, role physical, bodily pain, general health) and mental health (e.g., composite score combining vitality, social functioning, role emotional) scales. The self-reported concussion history had moderate reliability in former professional football players, on the basis of two administrations of the same instrument, 9 yr apart. However, changes in health status may be differentially associated with recall of concussions.
Selenide isotope generator for the Galileo mission. Reliability program plan
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1978-10-01
The reliability program plan for the Selenide Isotope Generator (SIG) program is presented. It delineates the specific tasks that will be accomplished by Teledyne Energy Systems and its suppliers during design, development, fabrication and test of deliverable Radioisotopic Thermoelectric Generators (RTG), Electrical Heated Thermoelectric Generators (ETG) and associated Ground Support Equipment (GSE). The Plan is formulated in general accordance with procedures specified in DOE Reliability Engineering Program Requirements Publication No. SNS-2, dated June 17, 1974. The Reliability Program Plan presented herein defines the total reliability effort without further reference to Government Specifications. The reliability tasks to be accomplished are delineatedmore » herein and become the basis for contract compliance to the extent specified in the SIG contract Statement of Work.« less
Lord, Sarah Peregrine; Can, Doğan; Yi, Michael; Marin, Rebeca; Dunn, Christopher W.; Imel, Zac E.; Georgiou, Panayiotis; Narayanan, Shrikanth; Steyvers, Mark; Atkins, David C.
2014-01-01
The current paper presents novel methods for collecting MISC data and accurately assessing reliability of behavior codes at the level of the utterance. The MISC 2.1 was used to rate MI interviews from five randomized trials targeting alcohol and drug use. Sessions were coded at the utterance-level. Utterance-based coding reliability was estimated using three methods and compared to traditional reliability estimates of session tallies. Session-level reliability was generally higher compared to reliability using utterance-based codes, suggesting that typical methods for MISC reliability may be biased. These novel methods in MI fidelity data collection and reliability assessment provided rich data for therapist feedback and further analyses. Beyond implications for fidelity coding, utterance-level coding schemes may elucidate important elements in the counselor-client interaction that could inform theories of change and the practice of MI. PMID:25242192
Lord, Sarah Peregrine; Can, Doğan; Yi, Michael; Marin, Rebeca; Dunn, Christopher W; Imel, Zac E; Georgiou, Panayiotis; Narayanan, Shrikanth; Steyvers, Mark; Atkins, David C
2015-02-01
The current paper presents novel methods for collecting MISC data and accurately assessing reliability of behavior codes at the level of the utterance. The MISC 2.1 was used to rate MI interviews from five randomized trials targeting alcohol and drug use. Sessions were coded at the utterance-level. Utterance-based coding reliability was estimated using three methods and compared to traditional reliability estimates of session tallies. Session-level reliability was generally higher compared to reliability using utterance-based codes, suggesting that typical methods for MISC reliability may be biased. These novel methods in MI fidelity data collection and reliability assessment provided rich data for therapist feedback and further analyses. Beyond implications for fidelity coding, utterance-level coding schemes may elucidate important elements in the counselor-client interaction that could inform theories of change and the practice of MI. Copyright © 2015 Elsevier Inc. All rights reserved.
[Reliability of static posturography in elderly persons].
Bauer, C M; Gröger, I; Rupprecht, R; Tibesku, C O; Gassmann, K G
2010-08-01
Static posturography is used to quantify body sway. It is used to assess the balance of elderly persons who are prone to falls. There is still no general opinion concerning the reliability of force platform measurements. The aim of this study was to test the reliability of force platform parameters when measuring elderly persons. The reliability of 11 force platform parameters was tested measuring 30 elderly persons. The following parameters were calculated: mean speed of center of pressure displacement in mm/s, length of sway in mm, sway area in mm(2), amplitudes of center of pressure movement, the axis of oscillation in degrees and the person's angles of inclination in degrees. Three measurements were taken on the same day, with a resting period of 2 min. Four different test conditions were used: normal standing and narrow stand with eyes open and eyes closed, respectively. Reliability was determined by using intraclass correlation coefficients. Six parameters had excellent reliability with a correlation coefficient of >0.9: mean speed of center of pressure movement during narrow stand, area of sway during narrow stand, length of sway during normal and narrow stand, and the angle of inclination in the sagittal plane during normal stand and narrow stand. The condition "narrow stand eyes closed" proved to be the most reliable test position. Six parameters proved to have excellent reliability and are recommended to be used in further investigations. Narrow stand with eyes closed should be used as the test position. The tested protocol proved to be reliable. Whether these parameters can be used to predict falls in elderly persons remains to be investigated.
Code of Federal Regulations, 2010 CFR
2010-07-01
... PROGRAM REGULATION Adjudication § 154.40 General. (a) The standard which must be met for clearance or assignment to sensitive duties is that, based on all available information, the person's loyalty, reliability...
DOT National Transportation Integrated Search
2014-01-01
The objective of this project was to develop system designs for programs to monitor travel time reliability and to prepare a guidebook that practitioners and others can use to design, build, operate, and maintain such systems. Generally, such travel ...
Pilot testing of SHRP 2 reliability data and analytical products: Minnesota. [supporting datasets
DOT National Transportation Integrated Search
2014-01-01
The objective of this project was to develop system designs for programs to monitor travel time reliability and to prepare a guidebook that practitioners and others can use to design, build, operate, and maintain such systems. Generally, such travel ...
7 CFR 1788.2 - General insurance requirements.
Code of Federal Regulations, 2010 CFR
2010-01-01
... consistent with cost-effectiveness, reliability, safety, and expedition. It is recognized that Prudent... accomplish the desired result at the lowest reasonable cost consistent with cost-effectiveness, reliability... which is used or useful in the borrower's business and which shall be covered by insurance, unless each...
Malisova, Olga; Bountziouka, Vassiliki; Panagiotakos, Demosthenes B; Zampelas, Antonis; Kapsokefalou, Maria
2012-03-01
There is a need to develop a questionnaire as a research tool for the evaluation of water balance in the general population. The water balance questionnaire (WBQ) was designed to evaluate water intake from fluid and solid foods and drinking water, and water loss from urine, faeces and sweat at sedentary conditions and physical activity. For validation purposes, the WBQ was administrated in 40 apparently healthy participants aged 22-57 years (37.5% males). Hydration indices in urine (24 h volume, osmolality, specific gravity, pH, colour) were measured through established procedures. Furthermore, the questionnaire was administered twice to 175 subjects to evaluate its reliability. Kendall's τ-b and the Bland and Altman method were used to assess the questionnaire's validity and reliability. The proposed WBQ to assess water balance in healthy individuals was found to be valid and reliable, and it could thus be a useful tool in future projects that aim to evaluate water balance.
Hagiwara, Akiko; Ito, Naomi; Sawai, Kazuhiko; Kazuma, Keiko
2008-09-01
In Japan, there are no valid and reliable physical activity questionnaires for elderly people. In this study, we translated the Physical Activity Scale for the Elderly (PASE) into Japanese and assessed its validity and reliability. Three hundred and twenty-five healthy and elderly subjects over 65 years were enrolled. Concurrent validity was evaluated by Spearman's rank correlation coefficient between PASE scores and an accelerometer (waking steps and energy expenditure), a physical activity questionnaire for adults in general (the Japan Arteriosclerosis Longitudinal Study Physical Activity Questionnaire, JALSPAQ), grip strength, mid-thigh muscle area per bodyweight, static valance and bodyfat percentage. Reliability was evaluated by the test-retest method over a period of 3-4 weeks. The mean PASE score in this study was 114.9. The PASE score was significantly correlated with walking steps (rho = 0.17, P = 0.014), energy expenditure (rho = 0.16, P = 0.024), activity measured with the JALSPAQ (rho = 0.48, P < 0.001), mid-thigh muscle area per bodyweight (rho = 0.15, P = 0.006) and static balance (rho = 0.19, P = 0.001). The proportion of consistency in the response between the first and second surveys was adequately high. The intraclass correlation coefficient for the PASE score was 0.65. The Japanese version of PASE was shown to have acceptable validity and reliability. The PASE is useful to measure the physical activity of elderly people in Japan.
ERIC Educational Resources Information Center
Shirazi, Mandana; Sadeghi, Majid; Emami, A.; Kashani, A. Sabouri; Parikh, Sagar; Alaeddini, F.; Arbabi, Mohammad; Wahlstrom, Rolf
2011-01-01
Objective: Standardized patients (SPs) have been developed to measure practitioner performance in actual practice settings, but results have not been fully validated for psychiatric disorders. This study describes the process of creating reliable and valid SPs for unannounced assessment of general-practitioners' management of depression disorders…
Incorporating Nonparametric Statistics into Delphi Studies in Library and Information Science
ERIC Educational Resources Information Center
Ju, Boryung; Jin, Tao
2013-01-01
Introduction: The Delphi technique is widely used in library and information science research. However, many researchers in the field fail to employ standard statistical tests when using this technique. This makes the technique vulnerable to criticisms of its reliability and validity. The general goal of this article is to explore how…
Federal Register 2010, 2011, 2012, 2013, 2014
2011-09-08
... statistical surveys that yield quantitative results that can be generalized to the population of study. This... information will not be used for quantitative information collections that are designed to yield reliably... generic mechanisms that are designed to yield quantitative results. The FHWA received no comments in...
On the comparison between MASS and generalized-SCIDAR techniques
NASA Astrophysics Data System (ADS)
Masciadri, E.; Lombardi, G.; Lascaux, F.
2014-02-01
The Multi-Aperture Scintillation Sensor (MASS) and the Generalized-Scintillation Detection and Ranging (Generalized-SCIDAR) are two instruments conceived to measure the optical turbulence vertical distribution on the whole troposphere and low stratosphere (˜20 km) widely used in the astronomical context. In this paper, we perform a detailed analysis/comparison of measurements provided by the two instruments and taken during the extended site testing campaign carried out in 2007 at Cerro Paranal and promoted by the European Southern Observatory. The main and final goal of the study is to provide a detailed estimation of the measurements reliability, i.e. dispersion of turbulence measurements done by the two instruments at different heights above the ground. This information is directly related to our ability in estimating the absolute value of the turbulence stratification. To better analyse the uncertainties between the MASS and the Generalized-SCIDAR we took advantage of the availability of measurements taken during the same campaign by a third independent instrument called Differential Image Motion Monitor (DIMM) measuring the integrated turbulence extended on the whole 20 km. Such a cross-check comparison permitted us to define the reliability of the instruments and their measurements, their limits and the contexts in which their use can present some risk.
NG, Chong Guan; CHIN, Soo Cheng; YEE, Anne Hway Ann; LOH, Huai Seng; SULAIMAN, Ahmad Hatim; Sherianne Sook Kuan, WONG; HABIL, Mohamed Hussain
2014-01-01
Background: The Snaith-Hamilton Pleasure Scale (SHAPS) is a self-assessment scale designed to evaluate anhedonia in various psychiatric disorders. In order to facilitate its use in Malaysian settings, our current study aimed to examine the validity of a Malay-translated version of the SHAPS (SHAPS-M). Methods: In this cross-sectional study, a total of 44 depressed patients and 82 healthy subjects were recruited from a university out-patient clinic. All participants were given both the Malay and English versions of the SHAPS, Fawcett-Clark Pleasure Scale (FCPS), General Health Questionnaire 12 (GHQ-12), and the Beck Depression Inventory (BDI) to assess their hedonic state, general mental health condition and levels of depression. Results: The results showed that the SHAPS-M has impressive internal consistency (α = 0.96), concurrent validity and good parallel-form reliability (intraclass coefficient, ICC = 0.65). Conclusion: In addition to demonstrating good psychometric properties, the SHAPS-M is easy to administer. Therefore, it is a valid, reliable, and suitable questionnaire for assessing anhedonia among depressed patients in Malaysia. PMID:25246837
Science to support the understanding of Ohio's water resources, 2014-15
Shaffer, Kimberly; Kula, Stephanie P.
2014-01-01
The U.S. Geological Survey (USGS) works in cooperation with local, State, and other Federal agencies, as well as universities, to furnish decision makers, policy makers, USGS scientists, and the general public with reliable scientific information and tools to assist them in management, stewardship, and use of Ohio’s natural resources. The diversity of scientific expertise among USGS personnel enables them to carry out large- and small-scale multidisciplinary studies. The USGS is unique among government organizations because it has neither regulatory nor developmental authority—its sole product is impartial, credible, relevant, and timely scientific information, equally accessible and available to everyone. The USGS Ohio Water Science Center provides reliable hydrologic and water-related ecological information to aid in the understanding of the use and management of the Nation’s water resources, in general, and Ohio’s water resources, in particular. This fact sheet provides an overview of current (2014) or recently completed USGS studies and data activities pertaining to water resources in Ohio. More information regarding projects of the USGS Ohio Water Science Center is available at http://oh.water.usgs.gov/.
Ng, Chong Guan; Chin, Soo Cheng; Yee, Anne Hway Ann; Loh, Huai Seng; Sulaiman, Ahmad Hatim; Sherianne Sook Kuan, Wong; Habil, Mohamed Hussain
2014-05-01
The Snaith-Hamilton Pleasure Scale (SHAPS) is a self-assessment scale designed to evaluate anhedonia in various psychiatric disorders. In order to facilitate its use in Malaysian settings, our current study aimed to examine the validity of a Malay-translated version of the SHAPS (SHAPS-M). In this cross-sectional study, a total of 44 depressed patients and 82 healthy subjects were recruited from a university out-patient clinic. All participants were given both the Malay and English versions of the SHAPS, Fawcett-Clark Pleasure Scale (FCPS), General Health Questionnaire 12 (GHQ-12), and the Beck Depression Inventory (BDI) to assess their hedonic state, general mental health condition and levels of depression. The results showed that the SHAPS-M has impressive internal consistency (α = 0.96), concurrent validity and good parallel-form reliability (intraclass coefficient, ICC = 0.65). In addition to demonstrating good psychometric properties, the SHAPS-M is easy to administer. Therefore, it is a valid, reliable, and suitable questionnaire for assessing anhedonia among depressed patients in Malaysia.
Validation of the Spanish version of the Index of Spouse Abuse.
Plazaola-Castaño, Juncal; Ruiz-Pérez, Isabel; Escribà-Agüir, Vicenta; Jiménez-Martín, Juan Manuel; Hernández-Torres, Elisa
2009-04-01
Partner violence against women is a major public health problem. Although there are currently a number of validated screening and diagnostic tools that can be used to evaluate this type of violence, such tools are not available in Spain. The aim of this study is to analyze the validity and reliability of the Spanish version of the Index of Spouse Abuse (ISA). A cross-sectional study was carried out in 2005 in two health centers in Granada, Spain, in 390 women between 18 and 70 years old. Analyses of the factorial structure, internal consistency, test-retest reliability, and construct validity were conducted. Cutoff points for each subscale were also defined. For the construct validity analysis, the SF-36 perceived general health dimension, the Rosenberg Self-Esteem Scale and the Goldberg 12-item General Health Questionnaire were included. The psychometric analysis shows that the instrument has good internal consistency, reproducibility, and construct validity. The scale is useful for the analysis of partner violence against women in both a research setting and a healthcare setting.
The reliability of the pass/fail decision for assessments comprised of multiple components.
Möltner, Andreas; Tımbıl, Sevgi; Jünger, Jana
2015-01-01
The decision having the most serious consequences for a student taking an assessment is the one to pass or fail that student. For this reason, the reliability of the pass/fail decision must be determined for high quality assessments, just as the measurement reliability of the point values. Assessments in a particular subject (graded course credit) are often composed of multiple components that must be passed independently of each other. When "conjunctively" combining separate pass/fail decisions, as with other complex decision rules for passing, adequate methods of analysis are necessary for estimating the accuracy and consistency of these classifications. To date, very few papers have addressed this issue; a generally applicable procedure was published by Douglas and Mislevy in 2010. Using the example of an assessment comprised of several parts that must be passed separately, this study analyzes the reliability underlying the decision to pass or fail students and discusses the impact of an improved method for identifying those who do not fulfill the minimum requirements. The accuracy and consistency of the decision to pass or fail an examinee in the subject cluster Internal Medicine/General Medicine/Clinical Chemistry at the University of Heidelberg's Faculty of Medicine was investigated. This cluster requires students to separately pass three components (two written exams and an OSCE), whereby students may reattempt to pass each component twice. Our analysis was carried out using the method described by Douglas and Mislevy. Frequently, when complex logical connections exist between the individual pass/fail decisions in the case of low failure rates, only a very low reliability for the overall decision to grant graded course credit can be achieved, even if high reliabilities exist for the various components. For the example analyzed here, the classification accuracy and consistency when conjunctively combining the three individual parts is relatively low with κ=0.49 or κ=0.47, despite the good reliability of over 0.75 for each of the three components. The option to repeat each component twice leads to a situation in which only about half of the candidates who do not satisfy the minimum requirements would fail the overall assessment, while the other half is able to continue their studies despite having deficient knowledge and skills. The method put forth by Douglas and Mislevy allows the analysis of the decision accuracy and consistency for complex combinations of scores from different components. Even in the case of highly reliable components, it is not necessarily so that a reliable pass/fail decision has been reached - for instance in the case of low failure rates. Assessments must be administered with the explicit goal of identifying examinees that do not fulfill the minimum requirements.
The reliability of the pass/fail decision for assessments comprised of multiple components
Möltner, Andreas; Tımbıl, Sevgi; Jünger, Jana
2015-01-01
Objective: The decision having the most serious consequences for a student taking an assessment is the one to pass or fail that student. For this reason, the reliability of the pass/fail decision must be determined for high quality assessments, just as the measurement reliability of the point values. Assessments in a particular subject (graded course credit) are often composed of multiple components that must be passed independently of each other. When “conjunctively” combining separate pass/fail decisions, as with other complex decision rules for passing, adequate methods of analysis are necessary for estimating the accuracy and consistency of these classifications. To date, very few papers have addressed this issue; a generally applicable procedure was published by Douglas and Mislevy in 2010. Using the example of an assessment comprised of several parts that must be passed separately, this study analyzes the reliability underlying the decision to pass or fail students and discusses the impact of an improved method for identifying those who do not fulfill the minimum requirements. Method: The accuracy and consistency of the decision to pass or fail an examinee in the subject cluster Internal Medicine/General Medicine/Clinical Chemistry at the University of Heidelberg’s Faculty of Medicine was investigated. This cluster requires students to separately pass three components (two written exams and an OSCE), whereby students may reattempt to pass each component twice. Our analysis was carried out using the method described by Douglas and Mislevy. Results: Frequently, when complex logical connections exist between the individual pass/fail decisions in the case of low failure rates, only a very low reliability for the overall decision to grant graded course credit can be achieved, even if high reliabilities exist for the various components. For the example analyzed here, the classification accuracy and consistency when conjunctively combining the three individual parts is relatively low with κ=0.49 or κ=0.47, despite the good reliability of over 0.75 for each of the three components. The option to repeat each component twice leads to a situation in which only about half of the candidates who do not satisfy the minimum requirements would fail the overall assessment, while the other half is able to continue their studies despite having deficient knowledge and skills. Conclusion: The method put forth by Douglas and Mislevy allows the analysis of the decision accuracy and consistency for complex combinations of scores from different components. Even in the case of highly reliable components, it is not necessarily so that a reliable pass/fail decision has been reached – for instance in the case of low failure rates. Assessments must be administered with the explicit goal of identifying examinees that do not fulfill the minimum requirements. PMID:26483855
Acoustic method respiratory rate monitoring is useful in patients under intravenous anesthesia.
Ouchi, Kentaro; Fujiwara, Shigeki; Sugiyama, Kazuna
2017-02-01
Respiratory depression can occur during intravenous general anesthesia without tracheal intubation. A new acoustic method for respiratory rate monitoring, RRa ® (Masimo Corp., Tokyo, Japan), has been reported to show good reliability in post-anesthesia care and emergency units. The purpose of this study was to investigate the reliability of the acoustic method for measurement of respiratory rate during intravenous general anesthesia, as compared with capnography. Patients with dental anxiety undergoing dental treatment under intravenous anesthesia without tracheal intubation were enrolled in this study. Respiratory rate was recorded every 30 s using the acoustic method and capnography, and detectability of respiratory rate was investigated for both methods. This study used a cohort study design. In 1953 recorded respiratory rate data points, the number of detected points by the acoustic method (1884, 96.5 %) was significantly higher than that by capnography (1682, 86.1 %) (P < 0.0001). In the intraoperative period, there was a significant difference in the LOA (95 % limits of agreement of correlation between difference and average of the two methods)/ULLOA (under the lower limit of agreement) in terms of use or non-use of a dental air turbine (P < 0.0001). In comparison between capnography, the acoustic method is useful for continuous monitoring of respiratory rate in spontaneously breathing subjects undergoing dental procedures under intravenous general anesthesia. However, the acoustic method might not accurately detect in cases in with dental air turbine.
Paiva, Carlos Eduardo; Carneseca, Estela Cristina; Barroso, Eliane Marçon; de Camargos, Mayara Goulart; Alfano, Ana Camila Callado; Rugno, Fernanda Capella; Paiva, Bianca Sakamoto Ribeiro
2014-08-01
The European Organization for Research and Treatment of Cancer Core Quality of Life Questionnaire (EORTC QLQ-C30) is considered a valid instrument for use in Brazil. However, the previous Brazilian validation study included only 30 lung cancer patients and only measured test-retest reliability. The aim of this study was to evaluate the psychometric properties of the EORTC QLQ-C30 in a sample of cancer patients at different educational levels who completed the instrument administered by an interviewer. Data from six prospective studies conducted by the same group of researchers were combined in this study (N = 986). Reliability was assessed using Cronbach's alpha coefficient, all values of which were >0.7, with the exception of cognitive functioning, social functioning, and nausea and vomiting (α = 0.57, α = 0.69, and α = 0.68, respectively). In multi-trait scaling analysis, convergent and divergent validity were considered adequate (validity indices were 91.6 and 97.4%). In general, moderate to strong correlations were found between the subscales of the EORTC QLQ-C30 and its respective dimensions from the WHOQOL-bref, the hospital anxiety and depression scale, and the Edmonton Symptom Assessment System (ESAS) instruments. In addition, the EORTC QLQ-C30 was able to differentiate groups of patients with distinct performance statuses and types of treatment (known-group validation). Statistical analyses were also performed on educational status, yielding similar results. Detailed psychometric property data using the EORTC QLQ-C30 in Brazil are added by this study. In addition, we demonstrated that this instrument is in general reliable and valid regardless of the patient educational level.
[CRISPR/CAS9, the King of Genome Editing Tools].
Bannikov, A V; Lavrov, A V
2017-01-01
The discovery of CRISPR/Cas9 brought a hope for having an efficient, reliable, and readily available tool for genome editing. CRISPR/Cas9 is certainly easy to use, while its efficiency and reliability remain the focus of studies. The review describes the general principles of the organization and function of Cas nucleases and a number of important issues to be considered while planning genome editing experiments with CRISPR/Cas9. The issues include evaluation of the efficiency and specificity for Cas9, sgRNA selection, Cas9 variants designed artificially, and use of homologous recombination and nonhomologous end joining in DNA editing.
Reliability of self-reported antisocial personality disorder symptoms among substance abusers.
Cottler, L B; Compton, W M; Ridenour, T A; Ben Abdallah, A; Gallagher, T
1998-02-01
It is estimated that from 20 to 60% of substance abusers meet criteria for Antisocial Personality Disorder (APD). An accurate and reliable diagnosis is important because persons meeting criteria for APD, by the nature of their disorder, are less likely to change behaviors and more likely to relapse to both substance abuse and high risk behaviors. To understand more about the reliability of the disorder and symptoms of APD, the Diagnostic Interview Schedule Version III-R (DIS) was administered to 453 substance abusers ascertained from treatment programs and from the general population (St Louis Epidemiological Catchment Area (ECA) follow-up study). Estimates of the 1 week, test-retest reliability for the childhood conduct disorder criterion, the adult antisocial behavior criterion, and APD diagnosis fell in the good agreement range, as measured by kappa. The internal consistency of these DIS symptoms was adequate to acceptable. Individual DIS criteria designed to measure childhood conduct disorder ranged from fair to good for most items; reliability was slightly higher for the adult antisocial behavior symptom items. Finally, self-reported 'liars' were no more unreliable in their reports of their behaviors than 'non-liars'.
Garcia, Darren J.; Skadberg, Rebecca M.; Schmidt, Megan; ...
2018-03-05
The Diagnostic and Statistical Manual of Mental Disorders (5th ed. [DSM–5]; American Psychiatric Association, 2013) Section III Alternative Model for Personality Disorders (AMPD) represents a novel approach to the diagnosis of personality disorder (PD). In this model, PD diagnosis requires evaluation of level of impairment in personality functioning (Criterion A) and characterization by pathological traits (Criterion B). Questions about clinical utility, complexity, and difficulty in learning and using the AMPD have been expressed in recent scholarly literature. We examined the learnability, interrater reliability, and clinical utility of the AMPD using a vignette methodology and graduate student raters. Results showed thatmore » student clinicians can learn Criterion A of the AMPD to a high level of interrater reliability and agreement with expert ratings. Interrater reliability of the 25 trait facets of the AMPD varied but showed overall acceptable levels of agreement. Examination of severity indexes of PD impairment showed the level of personality functioning (LPF) added information beyond that of global assessment of functioning (GAF). Clinical utility ratings were generally strong. Lastly, the satisfactory interrater reliability of components of the AMPD indicates the model, including the LPF, is very learnable.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Garcia, Darren J.; Skadberg, Rebecca M.; Schmidt, Megan
The Diagnostic and Statistical Manual of Mental Disorders (5th ed. [DSM–5]; American Psychiatric Association, 2013) Section III Alternative Model for Personality Disorders (AMPD) represents a novel approach to the diagnosis of personality disorder (PD). In this model, PD diagnosis requires evaluation of level of impairment in personality functioning (Criterion A) and characterization by pathological traits (Criterion B). Questions about clinical utility, complexity, and difficulty in learning and using the AMPD have been expressed in recent scholarly literature. We examined the learnability, interrater reliability, and clinical utility of the AMPD using a vignette methodology and graduate student raters. Results showed thatmore » student clinicians can learn Criterion A of the AMPD to a high level of interrater reliability and agreement with expert ratings. Interrater reliability of the 25 trait facets of the AMPD varied but showed overall acceptable levels of agreement. Examination of severity indexes of PD impairment showed the level of personality functioning (LPF) added information beyond that of global assessment of functioning (GAF). Clinical utility ratings were generally strong. Lastly, the satisfactory interrater reliability of components of the AMPD indicates the model, including the LPF, is very learnable.« less
Ruan, W June; Goldstein, Risë B; Chou, S Patricia; Smith, Sharon M; Saha, Tulshi D; Pickering, Roger P; Dawson, Deborah A; Huang, Boji; Stinson, Frederick S; Grant, Bridget F
2008-01-01
This study presents test-retest reliability statistics and information on internal consistency for new diagnostic modules and risk factors for alcohol, drug, and psychiatric disorders from the Alcohol Use Disorder and Associated Disabilities Interview Schedule-IV (AUDADIS-IV). Test-retest statistics were derived from a random sample of 1899 adults selected from 34,653 respondents who participated in the 2004-2005 Wave 2 National Epidemiologic Survey on Alcohol and Related Conditions (NESARC). Internal consistency of continuous scales was assessed using the entire Wave 2 NESARC. Both test and retest interviews were conducted face-to-face. Test-retest and internal consistency results for diagnoses and symptom scales associated with posttraumatic stress disorder, attention-deficit/hyperactivity disorder, and borderline, narcissistic, and schizotypal personality disorders were predominantly good (kappa>0.63; ICC>0.69; alpha>0.75) and reliability for risk factor measures fell within the good to excellent range (intraclass correlations=0.50-0.94; alpha=0.64-0.90). The high degree of reliability found in this study suggests that new AUDADIS-IV diagnostic measures can be useful tools in research settings. The availability of highly reliable measures of risk factors for alcohol, drug, and psychiatric disorders will contribute to the validity of conclusions drawn from future research in the domains of substance use disorder and psychiatric epidemiology.
Leading Change: Transitioning the AFMS into a High Reliability Organization
2016-02-16
Belief and Drive Big Results (New York, NY: Free Press, 2012), 17. 61 William Riley, “High Reliability and Implications for Nursing Leaders,” Journal of... Nursing Management 17, no. 2 (March 2009): 241. 62 The Joint Commission, “About Us,” 25 November 2015, http://www.jointcommission.org/about_us...Air Force Surgeon General. Trusted Care Concept of Operations, October 2015. Riley, William. “High reliability and implications for nursing leaders
ERIC Educational Resources Information Center
Noh, Younghee
2010-01-01
This study aimed to improve the current state of electronic resource evaluation in libraries. While the use of Web DB, e-book, e-journal, and other e-resources such as CD-ROM, DVD, and micro materials is increasing in libraries, their use is not comprehensively factored into the general evaluation of libraries and may diminish the reliability of…
Montazeri, Ali; Torkan, Behnaz; Omidvari, Sepideh
2007-04-04
The Edinburgh Postnatal Depression Scale (EPDS) is a widely used instrument to measure postnatal depression. This study aimed to translate and to test the reliability and validity of the EPDS in Iran. The English language version of the EPDS was translated into Persian (Iranian language) and was used in this study. The questionnaire was administered to a consecutive sample of 100 women with normal (n = 50) and caesarean section (n = 50) deliveries at two points in time: 6 to 8 weeks and 12 to 14 weeks after delivery. Statistical analysis was performed to test the reliability and validity of the EPDS. Overall 22% of women at time 1 and 18% at time 2 reported experiencing postpartum depression. In general, the Iranian version of the EPDS was found to be acceptable to almost all women. Cronbach's alpha coefficient (to test reliability) was found to be 0.77 at time 1 and 0.86 at time 2. In addition, test-rest reliability was performed and the intraclass correlation coefficient was found to be 0.80. Validity as performed using known groups comparison showed satisfactory results. The questionnaire discriminated well between sub-groups of women differing in mode of delivery in the expected direction. The factor analysis indicated a three-factor structure that jointly accounted for 58% of the variance. This preliminary validation study of the Iranian version of the EPDS proved that it is an acceptable, reliable and valid measure of postnatal depression. It seems that the EPDS not only measures postpartum depression but also may be measuring something more.
A Psychometric Properties of the Malay-version Police Stress Questionnaire
IRNIZA, Rasdi; EMILIA, Zainal Abidin; MUHAMMAD SALILUDDIN, Suhainizam; NIZAM ISHA, Ahmad Shahrul
2014-01-01
Background: Police Stress Questionnaire (PSQ) was developed to measure police-specific stressors. The present study was the first to have translated the PSQ to Malay. This study aims to test the reliability, construct validity, and component structure of the Malay-version PSQ. Methods: A set of survey consisted of the Malay-version PSQ, General Health Questionnaire (GHQ-12), Job Content Questionnaire (JCQ), Global Stress Questionnaire (GSQ) and General Self-rated Health (GSRH) were distributed to 300 traffic police officers in Kuala Lumpur and all traffic police officers in a few districts of Pahang and Negeri Sembilan. Results: The response rate was 65.5% (N = 262). The reported Cronbach’s alpha coefficient was 0.93 for Operational PSQ (PSQ-Op) and 0.94 for Organisational PSQ (PSQ-Org). Findings indicated that the PSQ had positive construct validity with the GSRH, GSQ, and GHQ. After excluding four factors related to lifestyles, all police-specific stressors were highly loaded (0.50) in one component. Conclusion: It is confirmed that the Malay-version PSQ, excluding the four factors related to lifestyle, was uni-dimensional, reliable, and a valid questionnaire. This study proffers a potentially better instrument for assessing the stressors among Malaysian police. PMID:25977621
Code of Federal Regulations, 2010 CFR
2010-01-01
... 10 Energy 4 2010-01-01 2010-01-01 false Applicability. 712.2 Section 712.2 Energy DEPARTMENT OF ENERGY HUMAN RELIABILITY PROGRAM Establishment of and Procedures for the Human Reliability Program General Provisions § 712.2 Applicability. The HRP applies to all applicants for, or current employees of...
Reliability and Factorial Validity of the Artes de Lenguaje.
ERIC Educational Resources Information Center
Powers, Stephen; And Others
1984-01-01
Spanish speaking first graders were administered the Artes de Lenguage (ADL)--a Spanish, criterion-referenced, language arts test. Reliability analyses indicated the adequacy of three of the four subscales (Phonetic Analysis, Vocabulary Development, Comprehension Skills, and General Skills). A principal factors analysis of the intercorrelation…
Evaluating the reliability of an injury prevention screening tool: Test-retest study.
Gittelman, Michael A; Kincaid, Madeline; Denny, Sarah; Wervey Arnold, Melissa; FitzGerald, Michael; Carle, Adam C; Mara, Constance A
2016-10-01
A standardized injury prevention (IP) screening tool can identify family risks and allow pediatricians to address behaviors. To assess behavior changes on later screens, the tool must be reliable for an individual and ideally between household members. Little research has examined the reliability of safety screening tool questions. This study utilized test-retest reliability of parent responses on an existing IP questionnaire and also compared responses between household parents. Investigators recruited parents of children 0 to 1 year of age during admission to a tertiary care children's hospital. When both parents were present, one was chosen as the "primary" respondent. Primary respondents completed the 30-question IP screening tool after consent, and they were re-screened approximately 4 hours later to test individual reliability. The "second" parent, when present, only completed the tool once. All participants received a 10-dollar gift card. Cohen's Kappa was used to estimate test-retest reliability and inter-rater agreement. Standard test-retest criteria consider Kappa values: 0.0 to 0.40 poor to fair, 0.41 to 0.60 moderate, 0.61 to 0.80 substantial, and 0.81 to 1.00 as almost perfect reliability. One hundred five families participated, with five lost to follow-up. Thirty-two (30.5%) parent dyads completed the tool. Primary respondents were generally mothers (88%) and Caucasian (72%). Test-retest of the primary respondents showed their responses to be almost perfect; average 0.82 (SD = 0.13, range 0.49-1.00). Seventeen questions had almost perfect test-retest reliability and 11 had substantial reliability. However, inter-rater agreement between household members for 12 objective questions showed little agreement between responses; inter-rater agreement averaged 0.35 (SD = 0.34, range -0.19-1.00). One question had almost perfect inter-rater agreement and two had substantial inter-rater agreement. The IP screening tool used by a single individual had excellent test-retest reliability for nearly all questions. However, when a reporter changes from pre- to postintervention, differences may reflect poor reliability or different subjective experiences rather than true change.
Reliability of Examination Findings in Suspected Community-Acquired Pneumonia.
Florin, Todd A; Ambroggio, Lilliam; Brokamp, Cole; Rattan, Mantosh S; Crotty, Eric J; Kachelmeyer, Andrea; Ruddy, Richard M; Shah, Samir S
2017-09-01
The authors of national guidelines emphasize the use of history and examination findings to diagnose community-acquired pneumonia (CAP) in outpatient children. Little is known about the interrater reliability of the physical examination in children with suspected CAP. This was a prospective cohort study of children with suspected CAP presenting to a pediatric emergency department from July 2013 to May 2016. Children aged 3 months to 18 years with lower respiratory signs or symptoms who received a chest radiograph were included. We excluded children hospitalized ≤14 days before the study visit and those with a chronic medical condition or aspiration. Two clinicians performed independent examinations and completed identical forms reporting examination findings. Interrater reliability for each finding was reported by using Fleiss' kappa (κ) for categorical variables and intraclass correlation coefficient (ICC) for continuous variables. No examination finding had substantial agreement (κ/ICC > 0.8). Two findings (retractions, wheezing) had moderate to substantial agreement (κ/ICC = 0.6-0.8). Nine findings (abdominal pain, pleuritic pain, nasal flaring, skin color, overall impression, cool extremities, tachypnea, respiratory rate, and crackles/rales) had fair to moderate agreement (κ/ICC = 0.4-0.6). Eight findings (capillary refill time, cough, rhonchi, head bobbing, behavior, grunting, general appearance, and decreased breath sounds) had poor to fair reliability (κ/ICC = 0-0.4). Only 3 examination findings had acceptable agreement, with the lower 95% confidence limit >0.4: wheezing, retractions, and respiratory rate. In this study, we found fair to moderate reliability of many findings used to diagnose CAP. Only 3 findings had acceptable levels of reliability. These findings must be considered in the clinical management and research of pediatric CAP. Copyright © 2017 by the American Academy of Pediatrics.
Reliability of the Q Force; a mobile instrument for measuring isometric quadriceps muscle strength.
Douma, K W; Regterschot, G R H; Krijnen, W P; Slager, G E C; van der Schans, C P; Zijlstra, W
2016-01-01
The ability to generate muscle strength is a pre-requisite for all human movement. Decreased quadriceps muscle strength is frequently observed in older adults and is associated with a decreased performance and activity limitations. To quantify the quadriceps muscle strength and to monitor changes over time, instruments and procedures with a sufficient reliability are needed. The Q Force is an innovative mobile muscle strength measurement instrument suitable to measure in various degrees of extension. Measurements between 110 and 130° extension present the highest values and the most significant increase after training. The objective of this study is to determine the test-retest reliability of muscle strength measurements by the Q Force in older adults in 110° extension. Forty-one healthy older adults, 13 males and 28 females were included in the study. Mean (SD) age was 81.9 (4.89) years. Isometric muscle strength of the Quadriceps muscle was assessed with the Q Force at 110° of knee extension. Participants were measured at two sessions with a three to eight day interval between sessions. To determine relative reliability, the intraclass correlation coefficient (ICC) was calculated. To determine absolute reliability, Bland and Altman Limits of Agreement (LOA) were calculated and t-tests were performed. Relative reliability of the Q Force is good to excellent as all ICC coefficients are higher than 0.75. Generally a large 95 % LOA, reflecting only moderate absolute reliability, is found as exemplified for the peak torque left leg of -18.6 N to 33.8 N and the right leg of -9.2 N to 26.4 N was between 15.7 and 23.6 Newton representing 25.2 % to 39.9 % of the size of the mean. Small systematic differences in mean were found between measurement session 1 and 2. The present study shows that the Q Force has excellent relative test-retest reliability, but limited absolute test-retest reliability. Since the Q Force is relatively cheap and mobile it is suitable for application in various clinical settings, however, its capability to detect changes in muscle force over time is limited but comparable to existing instruments.
Abery, Philip; Kuys, Suzanne; Lynch, Mary; Low Choy, Nancy
2018-05-23
To design and establish reliability of a local stroke audit tool by engaging allied health clinicians within a privately funded hospital. Design: Two-stage study involving a modified Delphi process to inform stroke audit tool development and inter-tester reliability. Allied health clinicians. A modified Delphi process to select stroke guideline recommendations for inclusion in the audit tool. Reliability study: 1 allied health representative from each discipline audited 10 clinical records with sequential admissions to acute and rehabilitation services. Recommendations were admitted to the audit tool when 70% agreement was reached, with 50% set as the reserve agreement. Inter-tester reliability was determined using intra-class correlation coefficients (ICCs) across 10 clinical records. Twenty-two participants (92% female, 50% physiotherapists, 17% occupational therapists) completed the modified Delphi process. Across 6 voting rounds, 8 recommendations reached 70% agreement and 2 reached 50% agreement. Two recommendations (nutrition/hydration; goal setting) were added to ensure representation for all disciplines. Substantial consistency across raters was established for the audit tool applied in acute stroke (ICC .71; range .48 to .90) and rehabilitation (ICC.78; range .60 to .93) services. Allied health clinicians within a privately funded hospital generally agreed in an audit process to develop a reliable stroke audit tool. Allied health clinicians agreed on stroke guideline recommendations to inform a stroke audit tool. The stroke audit tool demonstrated substantial consistency supporting future use for service development. This process, which engages local clinicians, could be adopted by other facilities to design reliable audit tools to identify local service gaps to inform changes to clinical practice. © 2018 John Wiley & Sons, Ltd.
Reliability and Validity of the Turkish Version of the Gastrointestinal Symptom Rating Scale.
Turan, Nuray; Aşt, Türkinaz Atabek; Kaya, Nurten
The purpose of this methodological study is to investigate the validity and reliability of the Turkish version of the Gastrointestinal Symptom Rating Scale (GSRS). The scale was adapted to the Turkish language via backward translation. Content validity was examined by referring to experts. Reliability was examined via test-retest reliability and internal consistency, and validity was examined with divergent and convergent validity. The Epworth Sleepiness Scale (ESS) and the Marlowe-Crowne Social Desirability Scale (MCSDS) were used for divergent validity. As for convergent validity, the Constipation Severity Instrument (CSI) and the Patient Assessment of Constipation Quality of Life Scale (PAC-QOLQ) were utilized. The relationship between the GSRS and the health-related quality of life (36-item short-form health survey [SF-36]) was also analyzed. The study population consisted of patients in orthopedic clinic who volunteered to participate. Test-retest reliability was examined with the participation of 30 patients; internal consistency and validity were examined with 150 patients. Test-retest reliability correlation coefficients of the GSRS varied from 0.39 to 0.87 for all items. For internal consistency, the GSRS's item total correlation was found to be 0.17-0.67, and Cronbach α was 0.82 for all items. There was a positive linear significant correlation between the GSRS, CSI, and PAC-QOLQ. There was no significant correlation between the GSRS, MCSDS, and ESS. Higher GSRS scores inversely correlated with general quality of life (SF-36). The Turkish version of the GSRS has been found to be a reliable and valid instrument for assessing patients' gastrointestinal symptoms. Therefore, this instrument can be confidently used with Turkish individuals.
Intraobserver reliability of contact pachymetry in children.
Weise, Katherine K; Kaminski, Brett; Melia, Michele; Repka, Michael X; Bradfield, Yasmin S; Davitt, Bradley V; Johnson, David A; Kraker, Raymond T; Manny, Ruth E; Matta, Noelle S; Schloff, Susan
2013-04-01
Central corneal thickness (CCT) is an important measurement in the treatment and management of pediatric glaucoma and potentially of refractive error, but data regarding reliability of CCT measurement in children are limited. The purpose of this study was to evaluate the reliability of CCT measurement with the use of handheld contact pachymetry in children. We conducted a multicenter intraobserver test-retest reliability study of more than 3,400 healthy eyes in children aged from newborn to 17 years by using a handheld contact pachymeter (Pachmate DGH55; DGH Technology Inc, Exton, PA) in 2 clinical settings--with the use of topical anesthesia in the office and with the patient under general anesthesia in a surgical facility. The overall standard error of measurement, including only measurements with standard deviation ≤5 μm, was 8 μm; the corresponding coefficient of repeatability, or limits within which 95% of test-retest differences fell, was ±22.3 μm. However, standard error of measurement increased as CCT increased, from 6.8 μm for CCT less than 525 μm, to 12.9 μm for CCT 625 μm and greater. The standard error of measurement including measurements with standard deviation >5 μm was 10.5 μm. Age, sex, race/ethnicity group, and examination setting did not influence the magnitude of test-retest differences. CCT measurement reliability in children via the Pachmate DGH55 handheld contact pachymeter is similar to that reported for adults. Because thicker CCT measurements are less reliable than thinner measurements, a second measure may be helpful when the first exceeds 575 μm. Reliability is also improved by disregarding measurements with instrument-reported standard deviations >5 μm. Copyright © 2013 American Association for Pediatric Ophthalmology and Strabismus. Published by Mosby, Inc. All rights reserved.
Hua, Bin; Abbas, Estelle; Hayes, Alan; Ryan, Peter; Nelson, Lisa; O'Brien, Kylie
2012-11-01
Chinese medicine (CM) has its own diagnostic indicators that are used as evidence of change in a patient's condition. The majority of studies investigating efficacy of Chinese herbal medicine (CHM) have utilized biomedical diagnostic endpoints. For CM clinical diagnostic variables to be incorporated into clinical trial designs, there would need to be evidence that these diagnostic variables are reliable. Previous studies have indicated that the reliability of CM syndrome diagnosis is variable. Little information is known about where the variability stems from--the basic data collection level or the synthesis of diagnostic data, or both. No previous studies have investigated systematically the reliability of all four diagnostic methods used in the CM diagnostic process (Inquiry, Inspection, Auscultation/Olfaction, and Palpation). The objective of this study was to assess the inter-rater reliability of data collected using the four diagnostic methods of CM in Australian patients with knee osteoarthritis (OA), in order to investigate if CM variables could be used with confidence as diagnostic endpoints in a clinical trial investigating the efficacy of a CHM in treating OA. An inter-rater reliability study was conducted as a substudy of a clinical trial investigating the treatment of knee OA with Chinese herbal medicine. Two (2) experienced CM practitioners conducted a CM examination separately, within 2 hours of each other, in 40 participants. A CM assessment form was utilized to record the diagnostic data. Cohen's κ coefficient was used as a measure of the level of agreement between 2 practitioners. There was a relatively good level of agreement for Inquiry and Auscultation variables, and, in general, a low level of agreement for (visual) Inspection and Palpation variables. There was variation in the level of agreement between 2 practitioners on clinical information collected using the Four Diagnostic Methods of a CM examination. Some aspects of CM diagnosis appear to be reliable, while others are not. Based on these results, it was inappropriate to use CM diagnostic variables as diagnostic endpoints in the main study, which was an investigation of efficacy of CHM treatment of knee OA.
A simulated training model for laparoscopic pyloromyotomy: Is 3D printing the way of the future?
Williams, Andrew; McWilliam, Morgan; Ahlin, James; Davidson, Jacob; Quantz, Mackenzie A; Bütter, Andreana
2018-05-01
Hypertrophic pyloric stenosis (HPS) is a common neonatal condition treated with open or laparoscopic pyloromyotomy. 3D-printed organs offer realistic simulations to practice surgical techniques. The purpose of this study was to validate a 3D HPS stomach model and assess model reliability and surgical realism. Medical students, general surgery residents, and adult and pediatric general surgeons were recruited from a single center. Participants were videotaped three times performing a laparoscopic pyloromyotomy using box trainers and 3D-printed stomachs. Attempts were graded independently by three reviewers using GOALS and Task Specific Assessments (TSA). Participants were surveyed using the Index of Agreement of Assertions on Model Accuracy (IAAMA). Participants reported their experience levels as novice (22%), inexperienced (26%), intermediate (19%), and experienced (33%). Interrater reliability was similar for overall average GOALS and TSA scores. There was a significant improvement in GOALS (p<0.0001) and TSA scores (p=0.03) between attempts and overall. Participants felt the model accurately simulated a laparoscopic pyloromyotomy (82%) and would be a useful tool for beginners (100%). A 3D-printed stomach model for simulated laparoscopic pyloromyotomy is a useful training tool for learners to improve laparoscopic skills. The GOALS and TSA provide reliable technical skills assessments. II. Copyright © 2018 Elsevier Inc. All rights reserved.
Kliem, Sören; Lohmann, Anna; Mößle, Thomas; Brähler, Elmar
2017-12-04
Suicidal ideation has been identified as one of the major predictors of attempted or actual suicide. Routinely screening individuals for endorsing suicidal thoughts could save lives and protect many from severe psychological consequences following the suicide of loved ones. The aim of this study was to validate the German version of the Beck Scale for Suicide Ideation (BSS) in a sample representative for the Federal Republic of Germany. All 2450 participants completed the first part of the Scale, the BSS-Screen. A risk group of n = 112 individuals (4.6%) with active or passive suicidal ideation was identified and subsequently completed the entire BSS. Satisfactory internal reliability (α = .97 for the BSS-Screen; α = .94 for the entire BSS) and excellent model fit indices for the one-dimensional factorial structure of the BSS-Screen (CFI = .998; TLI = .995; RMSEA = .045 [95%-CI: .030-.061]) were confirmed. Measurement invariance analyses supported strict invariance across gender, age, and depression status. We found correlations with related self-report measures in expected directions comparable to previous studies, indicating satisfactory construct validity. Our study involved cross sectional data, hence neither predictive validity nor retest-reliability were examined. As only the risk group of n = 112 individuals completed the entire measure, confirmatory factor analyses could not be conducted for the full BSS. The German translation of the BSS is a reliable and valid instrument for assessing suicidal ideation in the general population. Using it as a screening device in general and specialized medical care could substantially advance suicide prevention.
Navabi, Nader; Hashemipour, Maryam A; Roughani, Aida
2017-02-01
Oral cancer is a global health problem; however, many dentists lack the necessary skills, knowledge and capacity to diagnose oral cancers early. This study aimed to examine the validity and reliability of a Persian short-form version of a standardised questionnaire to assess dentists' knowledge, practice and attitudes towards oral cancer. This cross-sectional analytical study was carried out in May 2015 in Tehran, Iran. An original 39-item English-language questionnaire developed by Yellowitz et al . was translated into Persian using forward and backward translation methods. A total of 15 dental professionals were asked to assess the questionnaire for content validity. Based on their feedback, a 20-item short-form version was prepared, including six demographic, six knowledge, four attitude and four practice items. The translated short-form questionnaire was subsequently distributed to 973 general dental practitioners attending a dental conference in Tehran. Internal consistency and reliability were assessed with Cronbach's alpha coefficient and item-total correlation calculations. A total of 13 professionals and 313 general dentists participated in the study (response rates: 86.7% and 32.2%, respectively). After the elimination of six items (two knowledge, two attitude and two practice items), the validity and reliability of the questionnaire was confirmed. The final Persian 14-item version of the questionnaire had acceptable validity and internal consistency. These results indicate that researchers can use this translated short-form version to evaluate oral cancer knowledge, attitudes and practices among Persian-speaking dentists; this will allow for a comparison of data between different populations.
Dennett, Hugh W; McKone, Elinor; Tavashmi, Raka; Hall, Ashleigh; Pidcock, Madeleine; Edwards, Mark; Duchaine, Bradley
2012-06-01
Many research questions require a within-class object recognition task matched for general cognitive requirements with a face recognition task. If the object task also has high internal reliability, it can improve accuracy and power in group analyses (e.g., mean inversion effects for faces vs. objects), individual-difference studies (e.g., correlations between certain perceptual abilities and face/object recognition), and case studies in neuropsychology (e.g., whether a prosopagnosic shows a face-specific or object-general deficit). Here, we present such a task. Our Cambridge Car Memory Test (CCMT) was matched in format to the established Cambridge Face Memory Test, requiring recognition of exemplars across view and lighting change. We tested 153 young adults (93 female). Results showed high reliability (Cronbach's alpha = .84) and a range of scores suitable both for normal-range individual-difference studies and, potentially, for diagnosis of impairment. The mean for males was much higher than the mean for females. We demonstrate independence between face memory and car memory (dissociation based on sex, plus a modest correlation between the two), including where participants have high relative expertise with cars. We also show that expertise with real car makes and models of the era used in the test significantly predicts CCMT performance. Surprisingly, however, regression analyses imply that there is an effect of sex per se on the CCMT that is not attributable to a stereotypical male advantage in car expertise.
Complementary Reliability-Based Decodings of Binary Linear Block Codes
NASA Technical Reports Server (NTRS)
Fossorier, Marc P. C.; Lin, Shu
1997-01-01
This correspondence presents a hybrid reliability-based decoding algorithm which combines the reprocessing method based on the most reliable basis and a generalized Chase-type algebraic decoder based on the least reliable positions. It is shown that reprocessing with a simple additional algebraic decoding effort achieves significant coding gain. For long codes, the order of reprocessing required to achieve asymptotic optimum error performance is reduced by approximately 1/3. This significantly reduces the computational complexity, especially for long codes. Also, a more efficient criterion for stopping the decoding process is derived based on the knowledge of the algebraic decoding solution.
76 FR 11437 - Application To Export Electric Energy; Societe Generale Energy Corp.
Federal Register 2010, 2011, 2012, 2013, 2014
2011-03-02
... DEPARTMENT OF ENERGY [OE Docket No. EA-376] Application To Export Electric Energy; Societe Generale Energy Corp. AGENCY: Office of Electricity Delivery and Energy Reliability, DOE. ACTION: Notice of application. SUMMARY: Societe Generale Energy Corp. (SGEC) has applied for authority to transmit electric...
Nurses as Evaluators of the Humanistic Behavior of Internal Medicine Residents.
ERIC Educational Resources Information Center
Butterfield, Paula S.; And Others
1987-01-01
The reliability of a 13-item questionnaire designed to assess the humanistic behaviors of internal medicine residents and the reliability of nurses as raters of those behaviors were examined. Residents were evaluated by nurses on two general medicine services and on cardiology and hematology-oncology services. (Author/MLW)
Digital avionics design and reliability analyzer
NASA Technical Reports Server (NTRS)
1981-01-01
The description and specifications for a digital avionics design and reliability analyzer are given. Its basic function is to provide for the simulation and emulation of the various fault-tolerant digital avionic computer designs that are developed. It has been established that hardware emulation at the gate-level will be utilized. The primary benefit of emulation to reliability analysis is the fact that it provides the capability to model a system at a very detailed level. Emulation allows the direct insertion of faults into the system, rather than waiting for actual hardware failures to occur. This allows for controlled and accelerated testing of system reaction to hardware failures. There is a trade study which leads to the decision to specify a two-machine system, including an emulation computer connected to a general-purpose computer. There is also an evaluation of potential computers to serve as the emulation computer.
Performance of Lung Ultrasound in Detecting Peri-Operative Atelectasis after General Anesthesia.
Yu, Xin; Zhai, Zhenping; Zhao, Yongfeng; Zhu, Zhiming; Tong, Jianbin; Yan, Jianqin; Ouyang, Wen
2016-12-01
The aim of this prospective observational study was to evaluate the performance of lung ultrasound (LUS) in detecting post-operative atelectasis in adult patients under general anesthesia. Forty-six patients without pulmonary comorbidities who were scheduled for elective neurosurgery were enrolled in the study. A total of 552 pairs of LUS clips and thoracic computed tomography (CT) images were ultimately analyzed to determine the presence of atelectasis in 12 prescribed lung regions. The accuracy of LUS in detecting peri-operative atelectasis was evaluated with thoracic CT as gold standard. Levels of agreement between the two observers for LUS and the two observers for thoracic CT were analyzed using the κ reliability test. The quantitative correlation between LUS scores of aeration and the volumetric data of atelectasis in thoracic CT were further evaluated. LUS had reliable performance in post-operative atelectasis, with a sensitivity of 87.7%, specificity of 92.1% and diagnostic accuracy of 90.8%. The levels of agreement between the two observers for LUS and for thoracic CT were both satisfactory, with κ coefficients of 0.87 (p < 0.0001) and 0.93 (p < 0.0001), respectively. In patients in the supine position, LUS scores were highly correlated with the atelectasis volume of CT (r = 0.58, p < 0.0001). Thus, LUS provides a fast, reliable and radiation-free method to identify peri-operative atelectasis in adults. Copyright © 2016. Published by Elsevier Inc.
Clark, Ross A; Mentiplay, Benjamin F; Pua, Yong-Hao; Bower, Kelly J
2018-03-01
The use of force platform technologies to assess standing balance is common across a range of clinical areas. Numerous researchers have evaluated the low-cost Wii Balance Board (WBB) for its utility in assessing balance, with variable findings. This review aimed to systematically evaluate the reliability and concurrent validity of the WBB for assessment of static standing balance. Articles were retrieved from six databases (Medline, SCOPUS, EMBASE, CINAHL, Web of Science, Inspec) from 2007 to 2017. After independent screening by two reviewers, 25 articles were included. Two reviewers performed the data extraction and quality assessment. Test-retest reliability was investigated in 12 studies, with intraclass correlation coefficients or Pearson's correlation values showing a range from poor to excellent reliability (range: 0.27 to 0.99). Concurrent validity (i.e. comparison with another force platform) was examined in 21 studies, and was generally found to be excellent in studies examining the association between the same outcome measures collected on both devices. For studies reporting predominantly poor to moderate validity, potentially influential factors included the choice of 1) criterion reference (e.g. not a common force platform), 2) test duration (e.g. <30 s for double leg), 3) outcome measure (e.g. comparing a centre of pressure variable from the WBB with a summary score from the force platform), 4) data acquisition platform (studies using Apple iOS reported predominantly moderate validity), and 5) low sample size. In conclusion, evidence suggests that the WBB can be used as a reliable and valid tool for assessing standing balance. Protocol registration number: PROSPERO 2017: CRD42017058122. Copyright © 2018 Elsevier B.V. All rights reserved.
Adaptations of advanced safety and reliability techniques to petroleum and other industries
NASA Technical Reports Server (NTRS)
Purser, P. E.
1974-01-01
The underlying philosophy of the general approach to failure reduction and control is presented. Safety and reliability management techniques developed in the industries which have participated in the U.S. space and defense programs are described along with adaptations to nonaerospace activities. The examples given illustrate the scope of applicability of these techniques. It is indicated that any activity treated as a 'system' is a potential user of aerospace safety and reliability management techniques.
Balancing reliability and cost to choose the best power subsystem
NASA Technical Reports Server (NTRS)
Suich, Ronald C.; Patterson, Richard L.
1991-01-01
A mathematical model is presented for computing total (spacecraft) subsystem cost including both the basic subsystem cost and the expected cost due to the failure of the subsystem. This model is then used to determine power subsystem cost as a function of reliability and redundancy. Minimum cost and maximum reliability and/or redundancy are not generally equivalent. Two example cases are presented. One is a small satellite, and the other is an interplanetary spacecraft.
Ensemble-Based Parameter Estimation in a Coupled General Circulation Model
Liu, Y.; Liu, Z.; Zhang, S.; ...
2014-09-10
Parameter estimation provides a potentially powerful approach to reduce model bias for complex climate models. Here, in a twin experiment framework, the authors perform the first parameter estimation in a fully coupled ocean–atmosphere general circulation model using an ensemble coupled data assimilation system facilitated with parameter estimation. The authors first perform single-parameter estimation and then multiple-parameter estimation. In the case of the single-parameter estimation, the error of the parameter [solar penetration depth (SPD)] is reduced by over 90% after ~40 years of assimilation of the conventional observations of monthly sea surface temperature (SST) and salinity (SSS). The results of multiple-parametermore » estimation are less reliable than those of single-parameter estimation when only the monthly SST and SSS are assimilated. Assimilating additional observations of atmospheric data of temperature and wind improves the reliability of multiple-parameter estimation. The errors of the parameters are reduced by 90% in ~8 years of assimilation. Finally, the improved parameters also improve the model climatology. With the optimized parameters, the bias of the climatology of SST is reduced by ~90%. Altogether, this study suggests the feasibility of ensemble-based parameter estimation in a fully coupled general circulation model.« less
Markell, Hannah M.; Newman, Michelle G.; Gallop, Robert; Gibbons, Mary Beth Connolly; Rickels, Karl; Crits-Christoph, Paul
2014-01-01
Using data from a study of combined cognitive behavioral therapy (CBT) and venlafaxine XR in the treatment of generalized anxiety disorder (GAD), the current article examines the reliability and convergent validity of scales, and preliminary outcomes, for African American compared to European American patients. Internal consistency and short-term stability coefficients for African Americans (n=42) were adequate and similar or higher compared to those found for European Americans (n=164) for standard scales used in GAD treatment research. Correlations among outcome measures among African Americans were in general not significantly different for African Americans compared to European Americans. A subset of patients with DSM-IV–diagnosed GAD (n = 24 African Americans; n = 52 European Americans) were randomly selected to be offered the option of adding 12 sessions of CBT to venlafaxine XR treatment. Of those offered CBT, 33.3% (n = 8) of the African Americans, and 32.6% (n = 17) of the European Americans accepted and attended at least one CBT treatment session. The outcomes for African Americans receiving combined treatment were not significantly different from European Americans receiving combined treatment on primary or secondary efficacy measures. PMID:24912462
Validation of a French version of the pure procrastination scale (PPS).
Rebetez, Marie My Lien; Rochat, Lucien; Gay, Philippe; Van der Linden, Martial
2014-08-01
Procrastination is a widespread phenomenon that affects everyone's day-to-day life and interferes with the clinical treatment of several psychopathological states. To assess this construct, Steel (2010) developed the Pure Procrastination Scale (PPS), a short scale intended to capture the general notion of dysfunctional delay. The aim of the current study was to present a French version of this questionnaire. To this end, the 12 items of the PPS were translated into French and data were collected from an online survey in a sample of 245 French-speaking individuals from the general population. The results revealed that one item had problematic face validity; it was therefore removed. Exploratory and confirmatory analyses performed on the resulting 11-item version of the French PPS indicated that the scale was composed of two factors ("voluntary delay" and "observed delay") depending on a common, higher-order construct ("general procrastination"). Good internal consistency and test-retest reliability were found. External validity was supported by specific relationships with measures of personality traits, impulsivity, and subjective well-being. The French PPS therefore presents satisfactory psychometric properties and may be considered a reliable and valid instrument for research, teaching and clinical practice. Copyright © 2014 Elsevier Inc. All rights reserved.
2004-01-01
Background Evaluation is a challenging but necessary part of the development cycle of clinical information systems like the electronic medical records (EMR) system. It is believed that such evaluations should include multiple perspectives, be comparative and employ both qualitative and quantitative methods. Self-administered questionnaires are frequently used as a quantitative evaluation method in medical informatics, but very few validated questionnaires address clinical use of EMR systems. Methods We have developed a task-oriented questionnaire for evaluating EMR systems from the clinician's perspective. The key feature of the questionnaire is a list of 24 general clinical tasks. It is applicable to physicians of most specialties and covers essential parts of their information-oriented work. The task list appears in two separate sections, about EMR use and task performance using the EMR, respectively. By combining these sections, the evaluator may estimate the potential impact of the EMR system on health care delivery. The results may also be compared across time, site or vendor. This paper describes the development, performance and validation of the questionnaire. Its performance is shown in two demonstration studies (n = 219 and 80). Its content is validated in an interview study (n = 10), and its reliability is investigated in a test-retest study (n = 37) and a scaling study (n = 31). Results In the interviews, the physicians found the general clinical tasks in the questionnaire relevant and comprehensible. The tasks were interpreted concordant to their definitions. However, the physicians found questions about tasks not explicitly or only partially supported by the EMR systems difficult to answer. The two demonstration studies provided unambiguous results and low percentages of missing responses. In addition, criterion validity was demonstrated for a majority of task-oriented questions. Their test-retest reliability was generally high, and the non-standard scale was found symmetric and ordinal. Conclusion This questionnaire is relevant for clinical work and EMR systems, provides reliable and interpretable results, and may be used as part of any evaluation effort involving the clinician's perspective of an EMR system. PMID:15018620
Laerum, Hallvard; Faxvaag, Arild
2004-02-09
Evaluation is a challenging but necessary part of the development cycle of clinical information systems like the electronic medical records (EMR) system. It is believed that such evaluations should include multiple perspectives, be comparative and employ both qualitative and quantitative methods. Self-administered questionnaires are frequently used as a quantitative evaluation method in medical informatics, but very few validated questionnaires address clinical use of EMR systems. We have developed a task-oriented questionnaire for evaluating EMR systems from the clinician's perspective. The key feature of the questionnaire is a list of 24 general clinical tasks. It is applicable to physicians of most specialties and covers essential parts of their information-oriented work. The task list appears in two separate sections, about EMR use and task performance using the EMR, respectively. By combining these sections, the evaluator may estimate the potential impact of the EMR system on health care delivery. The results may also be compared across time, site or vendor. This paper describes the development, performance and validation of the questionnaire. Its performance is shown in two demonstration studies (n = 219 and 80). Its content is validated in an interview study (n = 10), and its reliability is investigated in a test-retest study (n = 37) and a scaling study (n = 31). In the interviews, the physicians found the general clinical tasks in the questionnaire relevant and comprehensible. The tasks were interpreted concordant to their definitions. However, the physicians found questions about tasks not explicitly or only partially supported by the EMR systems difficult to answer. The two demonstration studies provided unambiguous results and low percentages of missing responses. In addition, criterion validity was demonstrated for a majority of task-oriented questions. Their test-retest reliability was generally high, and the non-standard scale was found symmetric and ordinal. This questionnaire is relevant for clinical work and EMR systems, provides reliable and interpretable results, and may be used as part of any evaluation effort involving the clinician's perspective of an EMR system.
General Analytical Procedure for Determination of Acidity Parameters of Weak Acids and Bases
Pilarski, Bogusław; Kaliszan, Roman; Wyrzykowski, Dariusz; Młodzianowski, Janusz; Balińska, Agata
2015-01-01
The paper presents a new convenient, inexpensive, and reagent-saving general methodology for the determination of pK a values for components of the mixture of diverse chemical classes weak organic acids and bases in water solution, without the need to separate individual analytes. The data obtained from simple pH-metric microtitrations are numerically processed into reliable pK a values for each component of the mixture. Excellent agreement has been obtained between the determined pK a values and the reference literature data for compounds studied. PMID:25692072
General analytical procedure for determination of acidity parameters of weak acids and bases.
Pilarski, Bogusław; Kaliszan, Roman; Wyrzykowski, Dariusz; Młodzianowski, Janusz; Balińska, Agata
2015-01-01
The paper presents a new convenient, inexpensive, and reagent-saving general methodology for the determination of pK a values for components of the mixture of diverse chemical classes weak organic acids and bases in water solution, without the need to separate individual analytes. The data obtained from simple pH-metric microtitrations are numerically processed into reliable pK a values for each component of the mixture. Excellent agreement has been obtained between the determined pK a values and the reference literature data for compounds studied.
1993-09-24
respectively. That study generally emphasized that the pollutant sources and concentrations indoors can be quite different from those outdoors. [he present...potent source .1 9 Another general argument can be applied to account for upward curvature at the low end, i.e., the lowest concen- tration cannot fall...conclusions can be reached. The technique of z-direction scaling by normalising Rq leads to similar values of Ra because both are averages obtained by
Developing and investigating the use of single-item measures in organizational research.
Fisher, Gwenith G; Matthews, Russell A; Gibbons, Alyssa Mitchell
2016-01-01
The validity of organizational research relies on strong research methods, which include effective measurement of psychological constructs. The general consensus is that multiple item measures have better psychometric properties than single-item measures. However, due to practical constraints (e.g., survey length, respondent burden) there are situations in which certain single items may be useful for capturing information about constructs that might otherwise go unmeasured. We evaluated 37 items, including 18 newly developed items as well as 19 single items selected from existing multiple-item scales based on psychometric characteristics, to assess 18 constructs frequently measured in organizational and occupational health psychology research. We examined evidence of reliability; convergent, discriminant, and content validity assessments; and test-retest reliabilities at 1- and 3-month time lags for single-item measures using a multistage and multisource validation strategy across 3 studies, including data from N = 17 occupational health subject matter experts and N = 1,634 survey respondents across 2 samples. Items selected from existing scales generally demonstrated better internal consistency reliability and convergent validity, whereas these particular new items generally had higher levels of content validity. We offer recommendations regarding when use of single items may be more or less appropriate, as well as 11 items that seem acceptable, 14 items with mixed results that might be used with caution due to mixed results, and 12 items we do not recommend using as single-item measures. Although multiple-item measures are preferable from a psychometric standpoint, in some circumstances single-item measures can provide useful information. (c) 2016 APA, all rights reserved).
Jimenez, Krystal; Vargas, Cristina; Garcia, Karla; Guzman, Herlinda; Angulo, Marco; Billimek, John
2017-02-01
Purpose The purpose of this study was to examine the reliability and validity of a Spanish version of the Beliefs about Medicines Questionnaire (BMQ) as a measure to evaluate beliefs about medications and to differentiate adherent from nonadherent patients among low-income Latino patients with diabetes in the United States. Methods Seventy-three patients were administered the BMQ and surveyed for evidence of medication nonadherence. Internal consistency of the BMQ was assessed by Cronbach's alpha along with performing a confirmatory factor analysis. Criterion validity was assessed by comparing mean scores on 3 subscales of the BMQ (General Overuse, General Harm, and Specific Necessity-Concerns difference score) between adherent patients and patients reporting nonadherence for 3 different reasons (unintentional nonadherence, cost-related nonadherence, and nonadherence due to reasons other than cost) using independent samples t tests. Results The BMQ is a reliable instrument to examine beliefs about medications in this Spanish-speaking population. Construct validity testing shows nearly identical factor loading as the original construct map. General Overuse scores were significantly more negative for patients reporting each reason for nonadherence compared with their adherent counterparts. Necessity-Concerns difference scores were significantly more negative for patients reporting nonadherence for reasons other than cost compared with those who did not report this reason for nonadherence. Conclusion The Spanish version of the BMQ is appropriate to assess beliefs about medications in Latino patients with type 2 diabetes in the United States and may help identify patients who become nonadherent to medications for reasons other than out-of-pocket costs.
Perceived Transcultural Self-Efficacy of Nurses in General Hospitals in Guangzhou, China
Li, Juan; He, Zhuang; Luo, Yong; Zhang, Rong
2016-01-01
Background Conflicts arising from cultural diversity among patients and hospital staff in China have become intense. Hospitals have an urgent need to improve transcultural self-efficacy of nurses for providing effective transcultural nursing. Objective The purpose of the research was to (a) evaluate the current status of perceived transcultural self-efficacy of nurses in general hospitals in Guangzhou, China; (b) explore associations between demographic characteristics of nurses and their perceived transcultural self-efficacy; and (c) assess the reliability and validity of scores on the Chinese version of the Transcultural Self-Efficacy Tool (TSET). Methods A cross-sectional survey of registered nurses from three general hospitals was conducted. Quota and convenience sampling were used. Participants provided demographic information and answered questions on the TSET. Results A total of 1,156 registered nurses took part. Most nurses had a moderate level of self-efficacy on the Cognitive (87.9%), Practical (87%), and Affective (89.2%) TSET subscales. Nurses who were older; who had more years of work experience, higher professional titles, higher incomes, and a minority background; and who were officially employed (not temporary positions) had higher perceived transcultural self-efficacy. Reliability estimated using Cronbach’s alpha was .99 for the total TSET score; reliability for the three subscales ranged from .97 to .98. Confirmatory factor analysis of TSET scores showed good fit with a three-factor model. Conclusion The results of this study can provide insights and guidelines for hospital nursing management to facilitate design of in-service education systems to improve transcultural self-efficacy of nurses. PMID:27454552
Reliability models: the influence of model specification in generation expansion planning
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stremel, J.P.
1982-10-01
This paper is a critical evaluation of reliability methods used for generation expansion planning. It is shown that the methods for treating uncertainty are critical for determining the relative reliability value of expansion alternatives. It is also shown that the specification of the reliability model will not favor all expansion options equally. Consequently, the model is biased. In addition, reliability models should be augmented with an economic value of reliability (such as the cost of emergency procedures or energy not served). Generation expansion evaluations which ignore the economic value of excess reliability can be shown to be inconsistent. The conclusionsmore » are that, in general, a reliability model simplifies generation expansion planning evaluations. However, for a thorough analysis, the expansion options should be reviewed for candidates which may be unduly rejected because of the bias of the reliability model. And this implies that for a consistent formulation in an optimization framework, the reliability model should be replaced with a full economic optimization which includes the costs of emergency procedures and interruptions in the objective function.« less
NASA Astrophysics Data System (ADS)
Mayer, Simon; Jenner, Florian; Aeschbach, Werner
2017-04-01
Applications of inert gases in groundwater hydrology require a profound understanding of underlying biogeochemical processes. Some of these processes are, however, not well understood and therefore require further investigation. This is the first study simultaneously investigating soil air and groundwater in the context of noble gas tracer applications, accounting for seasonal effects in different climate regions. The sampled data confirm a general reliability of common assumptions proposed in the literature. In particular, a solubility-controlled description of excess air formation and of groundwater degassing can be confirmed. This study identifies certain effects which need to be taken into account to reliably evaluate noble gas patterns. First, long-term samplings suggest a permanent temperature-driven equilibration of shallow groundwater with entrapped air bubbles, even some years after recharge. Second, minor groundwater degassing is found to challenge existing excess air model approaches, depending on the amount and the fractionation of excess air. Third, soil air composition data of this study imply a potential bias of noble gas temperatures by up to about 2℃ due to microbial oxygen depletion and a reduced sum value of O2+CO2. This effect causes systematically lower noble gas temperatures in tropical groundwater samples and in shallow mid-latitude groundwater samples after strong recharge during the warm season. However, a general bias of noble gas temperatures in mid-latitudes is probably prevented by a predominant recharge during the cold season, accompanied by nearly atmospheric noble gas mixing ratios in the soil air. Findings of this study provide a remarkable contribution to the reliability of noble gas tracer applications in hydrology, in particular with regard to paleoclimate reconstructions and an understanding of subsurface gas dynamics.
Estimated Student Score Gain on the ACT COMP Exam: Valid Tool for Institutional Assessment?
ERIC Educational Resources Information Center
Banta, Trudy W.; And Others
1987-01-01
An institution can test seniors with the ACT College Outcome Measures Project (COMP) exam, then subtract from the senior score an estimated freshman score. Studies at the University of Tennessee, Knoxville, indicate that this method is not reliable to make judgments about the quality of general education programs. (Author/MLW)
Technical Analysis of Scores on the "Self-Efficacy Self-Report Scale"
ERIC Educational Resources Information Center
Erford, Bradley T.; Schein, Hallie; Duncan, Kelly
2011-01-01
The purpose of this study was to provide preliminary analysis of reliability and validity of scores on the "Self-Efficacy Self-Report Scale", which was designed to assess general self-efficacy in students aged 10 to 17 years. Confirmatory factor analysis on cross-validated samples was conducted revealing a marginal fit of the data to the…
ERIC Educational Resources Information Center
Pearson, L. Carolyn; Moomaw, William
2005-01-01
The purpose of this study was to examine the relationship between teacher autonomy and on-the-job stress, work satisfaction, empowerment, and professionalism. Using a reliable and valid measure of curriculum autonomy and general teaching autonomy (TAS), it was found that as curriculum autonomy increased on-the-job stress decreased, but there was…
ERIC Educational Resources Information Center
Nishimura, Trisha Sugita; Busse, Randy T.
2015-01-01
General and special education teachers (N = 125) completed the Scale of Teachers' Attitudes towards Inclusive Classrooms (STATIC). The internal consistency of the instrument was strong with an alpha of 0.89. The measure demonstrated excellent test-retest reliability (r = 0.99) and a dependent t-test was non-significant, indicating mean group…
Gender, Peers, and Delinquency: A Study of Boys and Girls in Rural France.
ERIC Educational Resources Information Center
Hartjen, Clayton A.; Priyadarsini, S.
2003-01-01
Surveyed rural French students age 13-18 years to investigate the extent to which measures of social control and learning/differential association theories could be generalized to, and help explain, delinquency. Social control measures either did not form reliable scales or were not significantly related to various offense scales. Measures of…
ERIC Educational Resources Information Center
Smiley, Patricia A.; Coulson, Sheri L.; Greene, Joelle K.; Bono, Katherine L.
2010-01-01
Individual differences in emotion, cognitions, and task choice following achievement failure are found among four- to seven-year-olds. However, neither performance deterioration during failure nor generalization after failure--aspects of the helpless pattern in 10-year-olds--have been reliably demonstrated in this age group. In the present study,…
The Specificity of Sound Symbolic Correspondences in Spoken Language.
Tzeng, Christina Y; Nygaard, Lynne C; Namy, Laura L
2017-11-01
Although language has long been regarded as a primarily arbitrary system, sound symbolism, or non-arbitrary correspondences between the sound of a word and its meaning, also exists in natural language. Previous research suggests that listeners are sensitive to sound symbolism. However, little is known about the specificity of these mappings. This study investigated whether sound symbolic properties correspond to specific meanings, or whether these properties generalize across semantic dimensions. In three experiments, native English-speaking adults heard sound symbolic foreign words for dimensional adjective pairs (big/small, round/pointy, fast/slow, moving/still) and for each foreign word, selected a translation among English antonyms that either matched or mismatched with the correct meaning dimension. Listeners agreed more reliably on the English translation for matched relative to mismatched dimensions, though reliable cross-dimensional mappings did occur. These findings suggest that although sound symbolic properties generalize to meanings that may share overlapping semantic features, sound symbolic mappings offer semantic specificity. Copyright © 2016 Cognitive Science Society, Inc.
Houx, P J; Shepherd, J; Blauw, G-J; Murphy, M B; Ford, I; Bollen, E L; Buckley, B; Stott, D J; Jukema, W; Hyland, M; Gaw, A; Norrie, J; Kamper, A M; Perry, I J; MacFarlane, P W; Meinders, A Edo; Sweeney, B J; Packard, C J; Twomey, C; Cobbe, S M; Westendorp, R G
2002-10-01
For large scale follow up studies with non-demented patients in which cognition is an endpoint, there is a need for short, inexpensive, sensitive, and reliable neuropsychological tests that are suitable for repeated measurements. The commonly used Mini-Mental-State-Examination fulfils only the first two requirements. In the PROspective Study of Pravastatin in the Elderly at Risk (PROSPER), 5804 elderly subjects aged 70 to 82 years were examined using a learning test (memory), a coding test (general speed), and a short version of the Stroop test (attention). Data presented here were collected at dual baseline, before randomisation for active treatment. The tests proved to be reliable (with test/retest reliabilities ranging from acceptable (r=0.63) to high (r=0.88) and sensitive to detect small differences in subjects from different age categories. All tests showed significant practice effects: performance increased from the first measurement to the first follow up after two weeks. Normative data are provided that can be used for one time neuropsychological testing as well as for assessing individual and group change. Methods for analysing cognitive change are proposed.
Reliability of the Fox-walk test in patients with rheumatoid arthritis.
Verberkt, Cornelia Antonia; Fridén, Cecilia; Grooten, Wilhelmus Johannes Andreas; Opava, Christina H
2012-01-01
The Fox-walk test is a new method used to estimate aerobic capacity outside a clinical environment, which may be useful in the implementation of daily health-enhancing physical activity. The aim of our study was to investigate the reliability of the test in people with rheumatoid arthritis (RA). Fifteen participants performed the Fox-walk test three times with weekly intervals. The intraclass correlation coefficient (ICC), the standard error of measurement (SEM) and the smallest detectable change (SDC) were used to estimate the reliability. General health perception, lower limb pain and fatigue were measured to determine their potential influence on the reliability. There were no systematic differences between the three test occasions (p = 0.190) and the reliability was almost perfect (ICC = 0.982). None of the covariates influenced the reliability. The SEM was 0.999 ml/kg/min or 3.4% and the SDC was 2.769 ml/kg/min or 9.4%. These findings demonstrate that the Fox-walk test is reliable in people with RA and enables differentiation between people with RA and monitoring progress. The validity of the test among people with RA is still to be determined. • The Fox-walk test is a new method to estimate aerobic capacity and could be performed walking or running. • The test is self administered without expensive equipment and is available in 150 public places in Sweden and several other European countries. • The Fox-walk test is a reliable test for use among people with rheumatoid arthritis monitoring the progress of their physical activity.
Care 3 phase 2 report, maintenance manual
NASA Technical Reports Server (NTRS)
Bryant, L. A.; Stiffler, J. J.
1982-01-01
CARE 3 (Computer-Aided Reliability Estimation, version three) is a computer program designed to help estimate the reliability of complex, redundant systems. Although the program can model a wide variety of redundant structures, it was developed specifically for fault-tolerant avionics systems--systems distinguished by the need for extremely reliable performance since a system failure could well result in the loss of human life. It substantially generalizes the class of redundant configurations that could be accommodated, and includes a coverage model to determine the various coverage probabilities as a function of the applicable fault recovery mechanisms (detection delay, diagnostic scheduling interval, isolation and recovery delay, etc.). CARE 3 further generalizes the class of system structures that can be modeled and greatly expands the coverage model to take into account such effects as intermittent and transient faults, latent faults, error propagation, etc.
Multiple objective optimization in reliability demonstration test
Lu, Lu; Anderson-Cook, Christine Michaela; Li, Mingyang
2016-10-01
Reliability demonstration tests are usually performed in product design or validation processes to demonstrate whether a product meets specified requirements on reliability. For binomial demonstration tests, the zero-failure test has been most commonly used due to its simplicity and use of minimum sample size to achieve an acceptable consumer’s risk level. However, this test can often result in unacceptably high risk for producers as well as a low probability of passing the test even when the product has good reliability. This paper explicitly explores the interrelationship between multiple objectives that are commonly of interest when planning a demonstration test andmore » proposes structured decision-making procedures using a Pareto front approach for selecting an optimal test plan based on simultaneously balancing multiple criteria. Different strategies are suggested for scenarios with different user priorities and graphical tools are developed to help quantify the trade-offs between choices and to facilitate informed decision making. As a result, potential impacts of some subjective user inputs on the final decision are studied to offer insights and useful guidance for general applications.« less
Modification site localization scoring integrated into a search engine.
Baker, Peter R; Trinidad, Jonathan C; Chalkley, Robert J
2011-07-01
Large proteomic data sets identifying hundreds or thousands of modified peptides are becoming increasingly common in the literature. Several methods for assessing the reliability of peptide identifications both at the individual peptide or data set level have become established. However, tools for measuring the confidence of modification site assignments are sparse and are not often employed. A few tools for estimating phosphorylation site assignment reliabilities have been developed, but these are not integral to a search engine, so require a particular search engine output for a second step of processing. They may also require use of a particular fragmentation method and are mostly only applicable for phosphorylation analysis, rather than post-translational modifications analysis in general. In this study, we present the performance of site assignment scoring that is directly integrated into the search engine Protein Prospector, which allows site assignment reliability to be automatically reported for all modifications present in an identified peptide. It clearly indicates when a site assignment is ambiguous (and if so, between which residues), and reports an assignment score that can be translated into a reliability measure for individual site assignments.
Ruff, Jessica; Wang, Tiffany L; Quatman-Yates, Catherine C; Phieffer, Laura S; Quatman, Carmen E
2015-02-01
Commercially available gaming systems (CAGS) such as the Wii Balance Board (WBB) and Microsoft Xbox with Kinect (Xbox Kinect) are increasingly used as balance training and rehabilitation tools. The purpose of this review was to answer the question, "Are commercially available gaming systems valid and reliable instruments for use as clinical diagnostic and functional assessment tools in orthopaedic settings?" and provide a summary of relevant studies, identify their strengths and weaknesses, and generate conclusions regarding general validity/reliability of WBB and Xbox Kinect in orthopaedics. A systematic search was performed using MEDLINE (1996-2013) and Scopus (1996-2013). Inclusion criteria were minimum of 5 subjects, full manuscript provided in English or translated, and studies incorporating investigation of CAG measurement properties. Exclusion criteria included reviews, systematic reviews, summary/clinical commentaries, or case studies; conference proceedings/presentations; cadaveric studies; studies of non-reversible, non-orthopaedic-related musculoskeletal disease; non-human trials; and therapeutic studies not reporting comparative evaluation to already established functional assessment criteria. All studies meeting inclusion and exclusion criteria were appraised for quality by two independent reviewers. Evidence levels (I-V) were assigned to each study based on established methodological criteria. 3 Level II, 7 level III, and 1 Level IV studies met inclusion criteria and provided information related to the use of the WBB and Xbox Kinect as clinical assessment tools in the field of orthopaedics. Studies have used the WBB in a variety of clinical applications, including the measurement of center of pressure (COP), measurement of medial-to-lateral (M/L) or anterior-to-posterior (A/P) symmetry, assessment anatomic landmark positioning, and assessment of fall risk. However, no uniform protocols or outcomes were used to evaluate the quality of the WBB as a clinical assessment tool; therefore a wide range of sensitivities, specificities, accuracies, and validities were reported. Currently it is not possible to make a universal generalization about the clinical utility of CAGS in the field of orthopaedics. However, there is evidence to support using the WBB and the Xbox Kinect as tools to obtain reliable and valid COP measurements. The Wii Fit Game may specifically provide reliable and valid measurements for predicting fall risk. Copyright © 2014 Elsevier Ltd. All rights reserved.
Developing safety performance functions incorporating reliability-based risk measures.
Ibrahim, Shewkar El-Bassiouni; Sayed, Tarek
2011-11-01
Current geometric design guides provide deterministic standards where the safety margin of the design output is generally unknown and there is little knowledge of the safety implications of deviating from these standards. Several studies have advocated probabilistic geometric design where reliability analysis can be used to account for the uncertainty in the design parameters and to provide a risk measure of the implication of deviation from design standards. However, there is currently no link between measures of design reliability and the quantification of safety using collision frequency. The analysis presented in this paper attempts to bridge this gap by incorporating a reliability-based quantitative risk measure such as the probability of non-compliance (P(nc)) in safety performance functions (SPFs). Establishing this link will allow admitting reliability-based design into traditional benefit-cost analysis and should lead to a wider application of the reliability technique in road design. The present application is concerned with the design of horizontal curves, where the limit state function is defined in terms of the available (supply) and stopping (demand) sight distances. A comprehensive collision and geometric design database of two-lane rural highways is used to investigate the effect of the probability of non-compliance on safety. The reliability analysis was carried out using the First Order Reliability Method (FORM). Two Negative Binomial (NB) SPFs were developed to compare models with and without the reliability-based risk measures. It was found that models incorporating the P(nc) provided a better fit to the data set than the traditional (without risk) NB SPFs for total, injury and fatality (I+F) and property damage only (PDO) collisions. Copyright © 2011 Elsevier Ltd. All rights reserved.
Distribution System Reliability Analysis for Smart Grid Applications
NASA Astrophysics Data System (ADS)
Aljohani, Tawfiq Masad
Reliability of power systems is a key aspect in modern power system planning, design, and operation. The ascendance of the smart grid concept has provided high hopes of developing an intelligent network that is capable of being a self-healing grid, offering the ability to overcome the interruption problems that face the utility and cost it tens of millions in repair and loss. To address its reliability concerns, the power utilities and interested parties have spent extensive amount of time and effort to analyze and study the reliability of the generation and transmission sectors of the power grid. Only recently has attention shifted to be focused on improving the reliability of the distribution network, the connection joint between the power providers and the consumers where most of the electricity problems occur. In this work, we will examine the effect of the smart grid applications in improving the reliability of the power distribution networks. The test system used in conducting this thesis is the IEEE 34 node test feeder, released in 2003 by the Distribution System Analysis Subcommittee of the IEEE Power Engineering Society. The objective is to analyze the feeder for the optimal placement of the automatic switching devices and quantify their proper installation based on the performance of the distribution system. The measures will be the changes in the reliability system indices including SAIDI, SAIFI, and EUE. The goal is to design and simulate the effect of the installation of the Distributed Generators (DGs) on the utility's distribution system and measure the potential improvement of its reliability. The software used in this work is DISREL, which is intelligent power distribution software that is developed by General Reliability Co.
Assessing medical students' self-regulation as aptitude in computer-based learning.
Song, Hyuksoon S; Kalet, Adina L; Plass, Jan L
2011-03-01
We developed a Self-Regulation Measure for Computer-based learning (SRMC) tailored toward medical students, by modifying Zimmerman's Self-Regulated Learning Interview Schedule (SRLIS) for K-12 learners. The SRMC's reliability and validity were examined in 2 studies. In Study 1, 109 first-year medical students were asked to complete the SRMC. Bivariate correlation analysis results indicated that the SRMC scores had a moderate degree of correlation with student achievement in a teacher-developed test. In Study 2, 58 third-year clerkship students completed the SRMC. Regression analysis results indicated that the frequency of medical students' usage of self-regulation strategies was associated with their general clinical knowledge measured by a nationally standardized licensing exam. These two studies provided evidence for the reliability and concurrent validity of the SRMC to assess medical students' self-regulation as aptitude. Future work should provide evidence to guide and improve instructional design as well as inform educational policy.
Estimating reliable paediatric reference intervals in clinical chemistry and haematology.
Ridefelt, Peter; Hellberg, Dan; Aldrimer, Mattias; Gustafsson, Jan
2014-01-01
Very few high-quality studies on paediatric reference intervals for general clinical chemistry and haematology analytes have been performed. Three recent prospective community-based projects utilising blood samples from healthy children in Sweden, Denmark and Canada have substantially improved the situation. The present review summarises current reference interval studies for common clinical chemistry and haematology analyses. ©2013 Foundation Acta Paediatrica. Published by John Wiley & Sons Ltd.
Theoretical investigation of gas-surface interactions
NASA Technical Reports Server (NTRS)
Lee, Timothy J.
1989-01-01
Four reprints are presented from four projects which are to be published in a refereed journal. Two are of interest to us and are presented herein. One is a description of a very detailed theoretical study of four anionic hydrogen bonded complexes. The other is a detailed study of the first generally reliable diagnostic for determining the quality of results that may be expected from single reference based electron correlation methods.
Robbins, Shawn M; Caplan, Ryan M; Aponte, Daniel I; St-Onge, Nancy
2017-10-01
External perturbations are utilized to challenge balance and mimic realistic balance threats in patient populations. The reliability of such protocols has not been established. The purpose was to examine test-retest reliability of balance testing with external perturbations. Healthy adults (n=34; mean age 23 years) underwent balance testing over two visits. Participants completed ten balance conditions in which the following parameters were combined: perturbation or non-perturbation, single or double leg, and eyes open or closed. Three trials were collected for each condition. Data were collected on a force plate and external perturbations were applied by translating the plate. Force plate center of pressure (CoP) data were summarized using 13 different CoP measures. Test-retest reliability was examined using intraclass correlation coefficients (ICC) and Bland-Altman plots. CoP measures of total speed and excursion in both anterior-posterior and medial-lateral directions generally had acceptable ICC values for perturbation conditions (ICC=0.46 to 0.87); however, many other CoP measures (e.g. range, area of ellipse) had unacceptable test-retest reliability (ICC<0.70). Improved CoP measures were present on the second visit indicating a potential learning effect. Non-perturbation conditions generally produced more reliable CoP measures than perturbation conditions during double leg standing, but not single leg standing. Therefore, changes to balance testing protocols that include external perturbations should be made to improve test-retest reliability and diminish learning including more extensive participant training and increasing the number of trials. CoP measures that consider all data points (e.g. total speed) are more reliable than those that only consider a few data points. Copyright © 2017 Elsevier B.V. All rights reserved.
O’Connor, David; Potler, Natan Vega; Kovacs, Meagan; Xu, Ting; Ai, Lei; Pellman, John; Vanderwal, Tamara; Parra, Lucas C.; Cohen, Samantha; Ghosh, Satrajit; Escalera, Jasmine; Grant-Villegas, Natalie; Osman, Yael; Bui, Anastasia; Craddock, R. Cameron
2017-01-01
Abstract Background: Although typically measured during the resting state, a growing literature is illustrating the ability to map intrinsic connectivity with functional MRI during task and naturalistic viewing conditions. These paradigms are drawing excitement due to their greater tolerability in clinical and developing populations and because they enable a wider range of analyses (e.g., inter-subject correlations). To be clinically useful, the test-retest reliability of connectivity measured during these paradigms needs to be established. This resource provides data for evaluating test-retest reliability for full-brain connectivity patterns detected during each of four scan conditions that differ with respect to level of engagement (rest, abstract animations, movie clips, flanker task). Data are provided for 13 participants, each scanned in 12 sessions with 10 minutes for each scan of the four conditions. Diffusion kurtosis imaging data was also obtained at each session. Findings: Technical validation and demonstrative reliability analyses were carried out at the connection-level using the Intraclass Correlation Coefficient and at network-level representations of the data using the Image Intraclass Correlation Coefficient. Variation in intrinsic functional connectivity across sessions was generally found to be greater than that attributable to scan condition. Between-condition reliability was generally high, particularly for the frontoparietal and default networks. Between-session reliabilities obtained separately for the different scan conditions were comparable, though notably lower than between-condition reliabilities. Conclusions: This resource provides a test-bed for quantifying the reliability of connectivity indices across subjects, conditions and time. The resource can be used to compare and optimize different frameworks for measuring connectivity and data collection parameters such as scan length. Additionally, investigators can explore the unique perspectives of the brain's functional architecture offered by each of the scan conditions. PMID:28369458
An evaluation of Wikipedia as a resource for patient education in nephrology.
Thomas, Garry R; Eng, Lawson; de Wolff, Jacob F; Grover, Samir C
2013-01-01
Wikipedia, a multilingual online encyclopedia, is a common starting point for patient medical searches. As its articles can be authored and edited by anyone worldwide, the credibility of the medical content of Wikipedia has been openly questioned. Wikipedia medical articles have also been criticized as too advanced for the general public. This study assesses the comprehensiveness, reliability, and readability of nephrology articles on Wikipedia. The International Statistical Classification of Diseases and Related problems, 10th Edition (ICD-10) diagnostic codes for nephrology (N00-N29.8) were used as a topic list to investigate the English Wikipedia database. Comprehensiveness was assessed by the proportion of ICD-10 codes that had corresponding articles. Reliability was measured by both the number of references per article and proportion of references from substantiated sources. Finally, readability was assessed using three validated indices (Flesch-Kincaid grade level, Automated readability index, and Flesch reading ease). Nephrology articles on Wikipedia were relatively comprehensive, with 70.5% of ICD-10 codes being represented. The articles were fairly reliable, with 7.1 ± 9.8 (mean ± SD) references per article, of which 59.7 ± 35.0% were substantiated references. Finally, all three readability indices determined that nephrology articles are written at a college level. Wikipedia is a comprehensive and fairly reliable medical resource for nephrology patients that is written at a college reading level. Accessibility of this information for the general public may be improved by hosting it at alternative Wikipedias targeted at a lower reading level, such as the Simple English Wikipedia. © 2013 Wiley Periodicals, Inc.
Physical and reliability issues in MEMS microrelays with gold contacts
NASA Astrophysics Data System (ADS)
Lafontan, Xavier; Pressecq, Francis; Perez, Guy; Dufaza, Christian; Karam, Jean Michel
2001-10-01
This paper presents the work we have done on micro-relays with gold micro-contacts in MUMPs. Firstly, the theoretical physical principles of MEMS micro-relay are described. This study is divided in two parts: the micro-contact and the micro-actuator. The micro-contact part deals with resistance of constriction, contact area, adhesion, arcing and wear. Whereas the micro-actuator part describes general principles, contact force, restoring force and actuator reliability. Then, in a second part, an innovative electrostatic relay design in MUMPs is presented. The concept, the implementation and the final realization are discussed. Then, in the third part, characterization results are reported. This part particularly focuses on the micro-contact study. Conduction mode, contact area, mechanical and thermal deformation, and adhesion energies are presented.
Test-Retest Reliability of Graph Metrics in Functional Brain Networks: A Resting-State fNIRS Study
Niu, Haijing; Li, Zhen; Liao, Xuhong; Wang, Jinhui; Zhao, Tengda; Shu, Ni; Zhao, Xiaohu; He, Yong
2013-01-01
Recent research has demonstrated the feasibility of combining functional near-infrared spectroscopy (fNIRS) and graph theory approaches to explore the topological attributes of human brain networks. However, the test-retest (TRT) reliability of the application of graph metrics to these networks remains to be elucidated. Here, we used resting-state fNIRS and a graph-theoretical approach to systematically address TRT reliability as it applies to various features of human brain networks, including functional connectivity, global network metrics and regional nodal centrality metrics. Eighteen subjects participated in two resting-state fNIRS scan sessions held ∼20 min apart. Functional brain networks were constructed for each subject by computing temporal correlations on three types of hemoglobin concentration information (HbO, HbR, and HbT). This was followed by a graph-theoretical analysis, and then an intraclass correlation coefficient (ICC) was further applied to quantify the TRT reliability of each network metric. We observed that a large proportion of resting-state functional connections (∼90%) exhibited good reliability (0.6< ICC <0.74). For global and nodal measures, reliability was generally threshold-sensitive and varied among both network metrics and hemoglobin concentration signals. Specifically, the majority of global metrics exhibited fair to excellent reliability, with notably higher ICC values for the clustering coefficient (HbO: 0.76; HbR: 0.78; HbT: 0.53) and global efficiency (HbO: 0.76; HbR: 0.70; HbT: 0.78). Similarly, both nodal degree and efficiency measures also showed fair to excellent reliability across nodes (degree: 0.52∼0.84; efficiency: 0.50∼0.84); reliability was concordant across HbO, HbR and HbT and was significantly higher than that of nodal betweenness (0.28∼0.68). Together, our results suggest that most graph-theoretical network metrics derived from fNIRS are TRT reliable and can be used effectively for brain network research. This study also provides important guidance on the choice of network metrics of interest for future applied research in developmental and clinical neuroscience. PMID:24039763
Bourrinet, P; Conduzorgues, J P; Dutertre, H; Macabies, J; Masson, P; Maurin, J; Mercier, O
1995-02-01
An interlaboratory study was carried out to determine the feasibility and reliability of a method using the hamster cheek pouch as a model for assessing the potential irritative properties of substances intended to be applied to the lips or other mucous membranes. The test substances were applied once daily to both pouches for 14 consecutive days. Local and general tolerances were appraised throughout the study. At the end of the study, histologic examination of the pouches and the main organs was performed. Results of the feasibility study, conducted on various types of commercial products, indicated that this model is suitable for preparations of various consistence and composition. Results of the reliability study, carried out on gel-type preparations containing various concentrations of a known irritant, sodium lauryl sulfate, indicated that the method elicits a dose-dependent reaction for this compound. This hamster cheek pouch method was reproducible for the various parameters under consideration: local tolerance, general tolerance, histologic examination. For all products, results were in good agreement among the various laboratories participating in the study. The French regulatory authorities of the Fraud Repression Department have accepted it as an official method for the evaluation of the potential irritative properties of cosmetics and hygiene products intended to be applied to the lips or other mucous membranes.
Monkeys and humans take local uncertainty into account when localizing a change.
Devkar, Deepna; Wright, Anthony A; Ma, Wei Ji
2017-09-01
Since sensory measurements are noisy, an observer is rarely certain about the identity of a stimulus. In visual perception tasks, observers generally take their uncertainty about a stimulus into account when doing so helps task performance. Whether the same holds in visual working memory tasks is largely unknown. Ten human and two monkey subjects localized a single change in orientation between a sample display containing three ellipses and a test display containing two ellipses. To manipulate uncertainty, we varied the reliability of orientation information by making each ellipse more or less elongated (two levels); reliability was independent across the stimuli. In both species, a variable-precision encoding model equipped with an "uncertainty-indifferent" decision rule, which uses only the noisy memories, fitted the data poorly. In both species, a much better fit was provided by a model in which the observer also takes the levels of reliability-driven uncertainty associated with the memories into account. In particular, a measured change in a low-reliability stimulus was given lower weight than the same change in a high-reliability stimulus. We did not find strong evidence that observers took reliability-independent variations in uncertainty into account. Our results illustrate the importance of studying the decision stage in comparison tasks and provide further evidence for evolutionary continuity of working memory systems between monkeys and humans.
Spathis, Jemima Grace; Connick, Mark James; Beckman, Emma Maree; Newcombe, Peter Anthony; Tweedy, Sean Michael
2015-01-01
Paralympic throwing events for athletes with physical impairments comprise seated and standing javelin, shot put, discus and seated club throwing. Identification of talented throwers would enable prediction of future success and promote participation; however, a valid and reliable talent identification battery for Paralympic throwing has not been reported. This study evaluates the reliability and validity of a talent identification battery for Paralympic throws. Participants were non-disabled so that impairment would not confound analyses, and results would provide an indication of normative performance. Twenty-eight non-disabled participants (13 M; 15 F) aged 23.6 years (±5.44) performed five kinematically distinct criterion throws (three seated, two standing) and nine talent identification tests (three anthropometric, six motor); 23 were tested a second time to evaluate test-retest reliability. Talent identification test-retest reliability was evaluated using Intra-class Correlation Coefficient (ICC) and Bland-Altman plots (Limits of Agreement). Spearman's correlation assessed strength of association between criterion throws and talent identification tests. Reliability was generally acceptable (mean ICC = 0.89), but two seated talent identification tests require more extensive familiarisation. Correlation strength (mean rs = 0.76) indicated that the talent identification tests can be used to validly identify individuals with competitively advantageous attributes for each of the five kinematically distinct throwing activities. Results facilitate further research in this understudied area.
Monkeys and humans take local uncertainty into account when localizing a change
Devkar, Deepna; Wright, Anthony A.; Ma, Wei Ji
2017-01-01
Since sensory measurements are noisy, an observer is rarely certain about the identity of a stimulus. In visual perception tasks, observers generally take their uncertainty about a stimulus into account when doing so helps task performance. Whether the same holds in visual working memory tasks is largely unknown. Ten human and two monkey subjects localized a single change in orientation between a sample display containing three ellipses and a test display containing two ellipses. To manipulate uncertainty, we varied the reliability of orientation information by making each ellipse more or less elongated (two levels); reliability was independent across the stimuli. In both species, a variable-precision encoding model equipped with an “uncertainty–indifferent” decision rule, which uses only the noisy memories, fitted the data poorly. In both species, a much better fit was provided by a model in which the observer also takes the levels of reliability-driven uncertainty associated with the memories into account. In particular, a measured change in a low-reliability stimulus was given lower weight than the same change in a high-reliability stimulus. We did not find strong evidence that observers took reliability-independent variations in uncertainty into account. Our results illustrate the importance of studying the decision stage in comparison tasks and provide further evidence for evolutionary continuity of working memory systems between monkeys and humans. PMID:28877535
Petersen, Solveig; Hägglöf, Bruno; Stenlund, Hans; Bergström, Erik
2009-09-01
To study the psychometric performance of the Swedish version of the Pediatric Quality of Life Inventory (PedsQL) 4.0 generic core scales in a general child population in Sweden. PedsQL forms were distributed to 2403 schoolchildren and 888 parents in two different school settings. Reliability and validity was studied for self-reports and proxy reports, full forms and short forms. Confirmatory factor analysis tested the factor structure and multigroup confirmatory factor analysis tested measurement invariance between boys and girls. Test-retest reliability was demonstrated for all scales and internal consistency reliability was shown with alpha value exceeding 0.70 for all scales but one (self-report short form: social functioning). Child-parent agreement was low to moderate. The four-factor structure of the PedsQL and factorial invariance across sex subgroups were confirmed for the self-report forms and for the proxy short form, while model fit indices suggested improvement of several proxy full-form scales. The Swedish PedsQL 4.0 generic core scales are a reliable and valid tool for health-related quality of life (HRQoL) assessment in Swedish child populations. The proxy full form, however, should be used with caution. The study also support continued use of the PedsQL as a four-factor model, capable of revealing meaningful HRQoL differences between boys and girls.
Physical activity questionnaires for youth: a systematic review of measurement properties.
Chinapaw, Mai J M; Mokkink, Lidwine B; van Poppel, Mireille N M; van Mechelen, Willem; Terwee, Caroline B
2010-07-01
Because of the diversity in available questionnaires, it is not easy for researchers to decide which instrument is most suitable for his or her specific demands. Therefore, we systematically summarized and appraised studies examining measurement properties of self-administered and proxy-reported physical activity (PA) questionnaires in youth. Literature was identified through searching electronic databases (PubMed, EMBASE using 'EMBASE only' and SportDiscus) until May 2009. Studies were included if they reported on the measurement properties of self-administered and proxy-reported PA questionnaires in youth (mean age <18 years) and were published in the English language. Methodological quality and results of included studies was appraised using a standardized checklist (qualitative attributes and measurement properties of PA questionnaires [QAPAQ]). We included 54 manuscripts examining 61 versions of questionnaires. None of the included questionnaires showed both acceptable reliability and validity. Only seven questionnaires received a positive rating for reliability. Reported validity varied, with correlations between PA questionnaires and accelerometers ranging from very low to high (previous day PA recall: correlation coefficient [r] = 0.77). In general, PA questionnaires for adolescents correlated better with accelerometer scores than did those for children. From this systematic review, we conclude that no questionnaires were available with both acceptable reliability and validity. Considerably more high-quality research is required to examine the validity and reliability of promising PA questionnaires for youth.
Brinca, Lilia; Batista, Ana Paula; Tavares, Ana Inês; Pinto, Patrícia N; Araújo, Lara
2015-11-01
The main objective of the present study was to investigate if the type of voice stimuli-sustained vowel, oral reading, and connected speech-results in good intrarater and interrater agreement/reliability. A short-term panel study was performed. Voice samples from 30 native European Portuguese speakers were used in the present study. The speech materials used were (1) the sustained vowel /a/, (2) oral reading of the European Portuguese version of "The Story of Arthur the Rat," and (3) connected speech. After an extensive training with textual and auditory anchors, the judges were asked to rate the severity of dysphonic voice stimuli using the phonation dimensions G, R, and B from the GRBAS scale. The voice samples were judged 6 months and 1 year after the training. Intrarater agreement and reliability were generally very good for all the phonation dimensions and voice stimuli. The highest interrater reliability was obtained using the oral reading stimulus, particularly for phonation dimensions grade (G) and breathiness (B). Roughness (R) was the voice quality that was the most difficult to evaluate, leading to interrater unreliability in all voice quality ratings. Extensive training using textual and auditory anchors and the use of anchors during the voice evaluations appear to be good methods for auditory-perceptual evaluation of dysphonic voices. The best results of interrater reliability were obtained when the oral reading stimulus was used. Breathiness appears to be a voice quality that is easier to evaluate than roughness. Copyright © 2015 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Validity and reliability of the robotic objective structured assessment of technical skills
Siddiqui, Nazema Y.; Galloway, Michael L.; Geller, Elizabeth J.; Green, Isabel C.; Hur, Hye-Chun; Langston, Kyle; Pitter, Michael C.; Tarr, Megan E.; Martino, Martin A.
2015-01-01
Objective Objective structured assessments of technical skills (OSATS) have been developed to measure the skill of surgical trainees. Our aim was to develop an OSATS specifically for trainees learning robotic surgery. Study Design This is a multi-institutional study in eight academic training programs. We created an assessment form to evaluate robotic surgical skill through five inanimate exercises. Obstetrics/gynecology, general surgery, and urology residents, fellows, and faculty completed five robotic exercises on a standard training model. Study sessions were recorded and randomly assigned to three blinded judges who scored performance using the assessment form. Construct validity was evaluated by comparing scores between participants with different levels of surgical experience; inter- and intra-rater reliability were also assessed. Results We evaluated 83 residents, 9 fellows, and 13 faculty, totaling 105 participants; 88 (84%) were from obstetrics/gynecology. Our assessment form demonstrated construct validity, with faculty and fellows performing significantly better than residents (mean scores: 89 ± 8 faculty; 74 ± 17 fellows; 59 ± 22 residents, p<0.01). In addition, participants with more robotic console experience scored significantly higher than those with fewer prior console surgeries (p<0.01). R-OSATS demonstrated good inter-rater reliability across all five drills (mean Cronbach's α: 0.79 ± 0.02). Intra-rater reliability was also high (mean Spearman's correlation: 0.91 ± 0.11). Conclusions We developed an assessment form for robotic surgical skill that demonstrates construct validity, inter- and intra-rater reliability. When paired with standardized robotic skill drills this form may be useful to distinguish between levels of trainee performance. PMID:24807319
Reliability and validity of the Lithuanian Tinnitus Handicap Inventory.
Ulozienė, Ingrida; Balnytė, Renata; Alzbutienė, Giedrė; Arechvo, Irina; Vaitkus, Antanas; Šileikaitė, Milda; Šaferis, Viktoras; Ulozas, Virgilijus
2016-01-01
The aim of this study was to determine the reliability and validity of the Lithuanian version of the Tinnitus Handicap Inventory (THI), a self-report measure of perceived tinnitus handicap. A cross-sectional psychometric validation study was performed in the University Hospital. A total of 248 subjects reporting chronic tinnitus as their primary complaint or secondary to hearing loss were encluded in the study and filled in the Lithuanian version of THI. For assessment of construct validity a subgroup of 55 participants completed the Lithuanian version of the Hospital Anxiety and Depression Scale as a measure of self-perceived levels of anxiety and depression. Test-retest and internal consistency reliability as well as construct validity were calculated. The Lithuanian version of the THI and its subscales showed a robust internal consistency reliability (Cronbach's alpha=0.93) comparable to the original version. Statistically significant correlations were observed between the Lithuanian translation of the THI and the measures of self-perceived levels of anxiety and depression using HADS. Confirmatory factor analysis demonstrated that the three subscales of the THI Lithuanian version corresponded to three different factors, which strongly correlated between themselves. The results suggest that the Lithuanian version of THI maintains its original validity and may serve as reliable and valid measure of general tinnitus related distress that can be used in a clinical setting to quantify the impact of tinnitus on daily living. Copyright © 2016 The Lithuanian University of Health Sciences. Production and hosting by Elsevier Urban & Partner Sp. z o.o. All rights reserved.
Validity and reliability of the Persian version of spatial hearing questionnaire
Delphi, Maryam; Zamiri Abdolahi, Farzaneh; Tyler, Richard; Bakhit, Mahsa; Saki, Nader; Nazeri, Ahmad Reza
2015-01-01
Background: Our hearing ability in space is critical for hearing speech in noisy environment and localization. The Spatial Hearing Questionnaire (SHQ) has been devised to focus only on spatial haring tasks (e.g., lateralization, distance detection and binaural detection). The aim of the present study was to determine the reliability and validity of the Persian translation of the SHQ (Spatial Hearing Questionnaire). Methods: Translation and back-translation, reliability, content and construct validity were investigated. Eighty patients with sensory neural hearing loss (SNHL) (52.50% female and 47.5 % male) with the mean±SD age of 49.02±13.60 years completed SHQ, and they were categorized into mild, moderate, moderate to severe and severe groups based on their hearing threshold. Inclusion criteria in this study were the MMSE questionnaire score of higher than 21, good general health, no history of psychiatric disorders, dizziness or vertigo, dementia or alcohol abuse. Results: The reliability was assessed by Cronbach’s alpha and found to be 0.99. Item-total correlation was between r= 0.84 and 0.92. There was a significant difference between the mean score of PSHQ in the four groups. Based on the factor analysis, two factors were extracted from the questions in P-SHQ: sound localization; and music and speech understanding in noise and quiet. These factors could explain 82.1% and 9.3% of the total variance, respectively. Conclusion: The present study proved the reliability and validity of the Persian version of SHQ (PSHQ). This provides a suitable tool for spatial hearing assessment in clinical/research environments. PMID:26793624
Hemke, Robert; Tzaribachev, Nikolay; Nusman, Charlotte M; van Rossum, Marion A J; Maas, Mario; Doria, Andrea S
2017-08-01
There is increasing evidence that early therapeutic intervention improves longterm joint outcome in juvenile idiopathic arthritis (JIA). Given the existence of highly effective treatments, there is an urgent need for reliable and accurate measures of disease activity and joint damage in JIA. Our objective was to assess the reliability of 2 magnetic resonance imaging (MRI) scoring methods: the Juvenile Arthritis MRI Scoring (JAMRIS) system and the International Prophylaxis Study Group (IPSG) consensus score, for evaluating disease status of the knee in patients with JIA. Four international readers independently scored an MRI dataset of 25 JIA patients with clinical knee involvement. Synovial thickening, joint effusion, bone marrow changes, cartilage lesions, bone erosions, and subchondral cysts were scored using the JAMRIS and IPSG systems. Further, synovial enhancement, infrapatellar fat pad heterogeneity, tendinopathy, and enthesopathy were scored. Interreader reliability was analyzed by using the generalized κ, ICC, and the smallest detectable difference (SDD). ICC regarding interreader reliability ranged from 0.33 (95% CI 0.12-0.52, SDD = 0.29) for enthesopathy up to 0.95 (95% CI 0.92-0.97, SDD = 3.19) for synovial thickening. Good interreader reliability was found concerning joint effusion (ICC 0.93, 95% CI 0.89-0.95, SDD = 0.51), synovial enhancement (ICC 0.90, 95% CI 0.85-0.94, SDD = 9.85), and bone marrow changes (ICC 0.87, 95% CI 0.80-0.92, SDD = 10.94). Moderate to substantial reliability was found concerning cartilage lesions and bone erosions (ICC 0.55-0.72, SDD 1.41-13.65). The preliminary results are promising for most of the scored JAMRIS and IPSG items. However, further refinement of the scoring system is warranted for unsatisfactorily reliable items such as bone erosions, cartilage lesions, and enthesopathy.
Ko, Jooyeon; Kim, MinYoung
2013-03-01
The Gross Motor Function Measure (GMFM-88) is commonly used in the evaluation of gross motor function in children with cerebral palsy (CP). The relative reliability of GMFM-88 has been assessed in children with CP. However, little information is available regarding the absolute reliability or responsiveness of GMFM-88. The purpose of this study was to determine the absolute and relative reliability and the responsiveness of the GMFM-88 in evaluating gross motor function in children with CP. A clinical measurement design was used. Ten raters scored the GMFM-88 in 84 children (mean age=3.7 years, SD=1.9, range=10 months to 9 years 9 months) from video records across all Gross Motor Function Classification System (GMFCS) levels to establish interrater reliability. Two raters participated to assess intrarater reliability. Responsiveness was determined from 3 additional assessments after the baseline assessment. The interrater and intrarater intraclass correlation coefficients (ICCs) with 95% confidence intervals, standard error of measurement (SEM), smallest real difference (SRD), effect size (ES), and standardized response mean (SRM) were calculated. The relative reliability of the GMFM was excellent (ICCs=.952-1.000). The SEM and SRD for total score of the GMFM were acceptable (1.60 and 3.14, respectively). Additionally, the ES and SRM of the dimension goal scores increased gradually in the 3 follow-up assessments (GMFCS levels I and II: ES=0.5, 0.6, and 0.8 and SRM=1.3, 1.8, and 2.0; GMFCS levels III-V: ES=0.4, 0.7, and 0.9 and SRM=1.5, 1.7, and 2.0). Children over 10 years of age with CP were not included in this study, so the results should not be generalized to all children with CP. Both the reliability and the responsiveness of the GMFM-88 are reasonable for measuring gross motor function in children with CP.
HiRel - Reliability/availability integrated workstation tool
NASA Technical Reports Server (NTRS)
Bavuso, Salvatore J.; Dugan, Joanne B.
1992-01-01
The HiRel software tool is described and demonstrated by application to the mission avionics subsystem of the Advanced System Integration Demonstrations (ASID) system that utilizes the PAVE PILLAR approach. HiRel marks another accomplishment toward the goal of producing a totally integrated computer-aided design (CAD) workstation design capability. Since a reliability engineer generally represents a reliability model graphically before it can be solved, the use of a graphical input description language increases productivity and decreases the incidence of error. The graphical postprocessor module HARPO makes it possible for reliability engineers to quickly analyze huge amounts of reliability/availability data to observe trends due to exploratory design changes. The addition of several powerful HARP modeling engines provides the user with a reliability/availability modeling capability for a wide range of system applications all integrated under a common interactive graphical input-output capability.
A Review on VSC-HVDC Reliability Modeling and Evaluation Techniques
NASA Astrophysics Data System (ADS)
Shen, L.; Tang, Q.; Li, T.; Wang, Y.; Song, F.
2017-05-01
With the fast development of power electronics, voltage-source converter (VSC) HVDC technology presents cost-effective ways for bulk power transmission. An increasing number of VSC-HVDC projects has been installed worldwide. Their reliability affects the profitability of the system and therefore has a major impact on the potential investors. In this paper, an overview of the recent advances in the area of reliability evaluation for VSC-HVDC systems is provided. Taken into account the latest multi-level converter topology, the VSC-HVDC system is categorized into several sub-systems and the reliability data for the key components is discussed based on sources with academic and industrial backgrounds. The development of reliability evaluation methodologies is reviewed and the issues surrounding the different computation approaches are briefly analysed. A general VSC-HVDC reliability evaluation procedure is illustrated in this paper.
Software reliability experiments data analysis and investigation
NASA Technical Reports Server (NTRS)
Walker, J. Leslie; Caglayan, Alper K.
1991-01-01
The objectives are to investigate the fundamental reasons which cause independently developed software programs to fail dependently, and to examine fault tolerant software structures which maximize reliability gain in the presence of such dependent failure behavior. The authors used 20 redundant programs from a software reliability experiment to analyze the software errors causing coincident failures, to compare the reliability of N-version and recovery block structures composed of these programs, and to examine the impact of diversity on software reliability using subpopulations of these programs. The results indicate that both conceptually related and unrelated errors can cause coincident failures and that recovery block structures offer more reliability gain than N-version structures if acceptance checks that fail independently from the software components are available. The authors present a theory of general program checkers that have potential application for acceptance tests.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Paret, Paul
The National Renewable Energy Laboratory (NREL) will conduct thermal and reliability modeling on three sets of power modules for the development of a next generation inverter for electric traction drive vehicles. These modules will be chosen by General Motors (GM) to represent three distinct technological approaches to inverter power module packaging. Likely failure mechanisms will be identified in each package and a physics-of-failure-based reliability assessment will be conducted.
Validation of Turkish version of brief negative symptom scale.
Polat Nazlı, Irmak; Ergül, Ceylan; Aydemir, Ömer; Chandhoke, Swati; Üçok, Alp; Gönül, Ali Saffet
2016-11-01
Negative symptoms in schizophrenia have been assessed by many instruments. However, a current consensus on these symptoms has been built and new tools, such as the Brief Negative Symptom Scale (BNSS), are generated. This study aimed to evaluate reliability and validity of the Turkish version of BNSS. The scale was translated to Turkish and backtranslated to English. After the approval of the translation, 75 schizophrenia patients were interviewed with BNSS, Positive and Negative Syndrome Scale (PANSS), Calgary Depression Scale for Schizophrenia (CDSS) and Extrapyramidal Symptom Rating Scale (ESRS). Reliability and validity analyses were then calculated. In the reliability analysis, the Cronbach's alpha coefficient was 0.96 and item-total score correlation coefficients were between 0.655-0.884. The intraclass correlation coefficient was 0.665. The inter-rater reliability was 0.982 (p < 0.0001). In the validity analysis, the total score of BNSS-TR was correlated with PANSS Total Score, Positive Symptoms Subscale, Negative Symptoms Subscale, and General Psychopathology Subscale. CDSS and ESRS were not correlated with BNSS-TR. The factor structure of the scale was consisting the same items as in the original version. Our study confirms that the Turkish version of BNSS is an applicable tool for the evaluation of negative symptoms in schizophrenia.
A Bayesian Framework for Reliability Analysis of Spacecraft Deployments
NASA Technical Reports Server (NTRS)
Evans, John W.; Gallo, Luis; Kaminsky, Mark
2012-01-01
Deployable subsystems are essential to mission success of most spacecraft. These subsystems enable critical functions including power, communications and thermal control. The loss of any of these functions will generally result in loss of the mission. These subsystems and their components often consist of unique designs and applications for which various standardized data sources are not applicable for estimating reliability and for assessing risks. In this study, a two stage sequential Bayesian framework for reliability estimation of spacecraft deployment was developed for this purpose. This process was then applied to the James Webb Space Telescope (JWST) Sunshield subsystem, a unique design intended for thermal control of the Optical Telescope Element. Initially, detailed studies of NASA deployment history, "heritage information", were conducted, extending over 45 years of spacecraft launches. This information was then coupled to a non-informative prior and a binomial likelihood function to create a posterior distribution for deployments of various subsystems uSing Monte Carlo Markov Chain sampling. Select distributions were then coupled to a subsequent analysis, using test data and anomaly occurrences on successive ground test deployments of scale model test articles of JWST hardware, to update the NASA heritage data. This allowed for a realistic prediction for the reliability of the complex Sunshield deployment, with credibility limits, within this two stage Bayesian framework.
Code of Federal Regulations, 2010 CFR
2010-10-01
... reasons of safety, reliability and generally applicable engineering purposes. (b) Requests for access to a... and information relate to a denial of access for reasons of lack of capacity, safety, reliability or engineering standards. (c) A utility shall provide a cable television system operator or telecommunications...
ERIC Educational Resources Information Center
Harris, Larry P.; Wolf, Steven R.
1979-01-01
The article focuses on the controversy over norm-referenced v criterion-referenced measures (CRM) in assessment of learning disorders. The authors contend that while the reliability of CRMs is generally indisputable, the validity of measures designed from local curricula is still dependent on the intuitive judgments of teachers. (Author/SBH)
Reliability Generalization for Childhood Autism Rating Scale
ERIC Educational Resources Information Center
Breidbord, Jonathan; Croudace, Tim J.
2013-01-01
The Childhood Autism Rating Scale (CARS) is a popular behavior-observation instrument that was developed more than 34 years ago and has since been adopted in a wide variety of contexts for assessing the presence and severity of autism symptomatology in both children and adolescents. This investigation of the reliability of CARS scores involves…
Validity and Reliability in Social Science Research
ERIC Educational Resources Information Center
Drost, Ellen A.
2011-01-01
In this paper, the author aims to provide novice researchers with an understanding of the general problem of validity in social science research and to acquaint them with approaches to developing strong support for the validity of their research. She provides insight into these two important concepts, namely (1) validity; and (2) reliability, and…
Teaching the Nature of Science in a Course in Sustainable Agriculture
ERIC Educational Resources Information Center
Cessna, Stephen; Neufeld, Douglas Graber; Horst, S. Jeanne
2013-01-01
Claims of the (non-)sustainability of a given agricultural practice generally hinge on scientific evidence and the reliability of that evidence, or at least the perception of its reliability. Advocates of sustainable agriculture may dismiss science as purely subjective, or at the other extreme, may inappropriately elevate scientific findings to…
Improving vaccination cold chain in the general practice setting.
Page, Sue L; Earnest, Arul; Birden, Hudson; Deaker, Rachelle; Clark, Chris
2008-10-01
This study compared temperature control in different types of vaccine storing refrigerators in general practice and tested knowledge of general practice staff in vaccine storage requirements. Temperature data loggers were set to serially record the temperature within vaccine refrigerators in 28 general practices, recording at 12 minute intervals over a period of 10 days on each occasion. A survey of vaccine storage knowledge and records of divisions of general practice immunisation contacts were also obtained. There was a significant relationship between type of refrigerator and optimal temperature, with the odds ratio for bar style refrigerator being 0.005 (95% CI: 0.001-0.044) compared to the purpose built vaccine refrigerators. Score on a survey of vaccine storage was also positively associated with optimal storage temperature. General practices that invest in purpose built vaccine refrigerators will achieve standards of vaccine cold chain maintenance significantly more reliably than can be achieved through regular cold chain monitoring and practice supports.
Scaled CMOS Technology Reliability Users Guide
NASA Technical Reports Server (NTRS)
White, Mark
2010-01-01
The desire to assess the reliability of emerging scaled microelectronics technologies through faster reliability trials and more accurate acceleration models is the precursor for further research and experimentation in this relevant field. The effect of semiconductor scaling on microelectronics product reliability is an important aspect to the high reliability application user. From the perspective of a customer or user, who in many cases must deal with very limited, if any, manufacturer's reliability data to assess the product for a highly-reliable application, product-level testing is critical in the characterization and reliability assessment of advanced nanometer semiconductor scaling effects on microelectronics reliability. A methodology on how to accomplish this and techniques for deriving the expected product-level reliability on commercial memory products are provided.Competing mechanism theory and the multiple failure mechanism model are applied to the experimental results of scaled SDRAM products. Accelerated stress testing at multiple conditions is applied at the product level of several scaled memory products to assess the performance degradation and product reliability. Acceleration models are derived for each case. For several scaled SDRAM products, retention time degradation is studied and two distinct soft error populations are observed with each technology generation: early breakdown, characterized by randomly distributed weak bits with Weibull slope (beta)=1, and a main population breakdown with an increasing failure rate. Retention time soft error rates are calculated and a multiple failure mechanism acceleration model with parameters is derived for each technology. Defect densities are calculated and reflect a decreasing trend in the percentage of random defective bits for each successive product generation. A normalized soft error failure rate of the memory data retention time in FIT/Gb and FIT/cm2 for several scaled SDRAM generations is presented revealing a power relationship. General models describing the soft error rates across scaled product generations are presented. The analysis methodology may be applied to other scaled microelectronic products and their key parameters.
Automatic specification of reliability models for fault-tolerant computers
NASA Technical Reports Server (NTRS)
Liceaga, Carlos A.; Siewiorek, Daniel P.
1993-01-01
The calculation of reliability measures using Markov models is required for life-critical processor-memory-switch structures that have standby redundancy or that are subject to transient or intermittent faults or repair. The task of specifying these models is tedious and prone to human error because of the large number of states and transitions required in any reasonable system. Therefore, model specification is a major analysis bottleneck, and model verification is a major validation problem. The general unfamiliarity of computer architects with Markov modeling techniques further increases the necessity of automating the model specification. Automation requires a general system description language (SDL). For practicality, this SDL should also provide a high level of abstraction and be easy to learn and use. The first attempt to define and implement an SDL with those characteristics is presented. A program named Automated Reliability Modeling (ARM) was constructed as a research vehicle. The ARM program uses a graphical interface as its SDL, and it outputs a Markov reliability model specification formulated for direct use by programs that generate and evaluate the model.
User-Perceived Reliability of M-for-N (M: N) Shared Protection Systems
NASA Astrophysics Data System (ADS)
Ozaki, Hirokazu; Kara, Atsushi; Cheng, Zixue
In this paper we investigate the reliability of general type shared protection systems i.e. M for N (M: N) that can typically be applied to various telecommunication network devices. We focus on the reliability that is perceived by an end user of one of N units. We assume that any failed unit is instantly replaced by one of the M units (if available). We describe the effectiveness of such a protection system in a quantitative manner. The mathematical analysis gives the closed-form solution of the availability, the recursive computing algorithm of the MTTFF (Mean Time to First Failure) and the MTTF (Mean Time to Failure) perceived by an arbitrary end user. We also show that, under a certain condition, the probability distribution of TTFF (Time to First Failure) can be approximated by a simple exponential distribution. The analysis provides useful information for the analysis and the design of not only the telecommunication network devices but also other general shared protection systems that are subject to service level agreements (SLA) involving user-perceived reliability measures.
Temperature Monitoring and Perioperative Thermoregulation
Sessler, Daniel I.
2008-01-01
Most clinically available thermometers accurately report the temperature of whatever tissue is being measured. The difficulty is that no reliably core-temperature measuring sites are completely non-invasive and easy to use — especially in patients not having general anesthesia. Nonetheless, temperature can be reliably measured in most patients. Body temperature should be measured in patients having general anesthesia exceeding 30 minutes in duration, and in patients having major operations under neuraxial anesthesia. Core body temperature is normally tightly regulated. All general anesthetics produce a profound dose-dependent reduction in the core temperature triggering cold defenses including arterio-venous shunt vasoconstriction and shivering. Anesthetic-induced impairment of normal thermoregulatory control, and the resulting core-to-peripheral redistribution of body heat, is the primary cause of hypothermia in most patients. Neuraxial anesthesia also impairs thermoregulatory control, although to a lesser extant than general anesthesia. Prolonged epidural analgesia is associated with hyperthermia whose cause remains unknown. PMID:18648241
Severity of illness index for surgical departments in a Cuban hospital: a revalidation study.
Armas-Bencomo, Amadys; Tamargo-Barbeito, Teddy Osmin; Fuentes-Valdés, Edelberto; Jiménez-Paneque, Rosa Eugenia
2017-03-08
In the context of the evaluation of hospital services, the incorporation of severity indices allows an essential control variable for performance comparisons in time and space through risk adjustment. The severity index for surgical services was developed in 1999 and validated as a general index for surgical services. Sixteen years later the hospital context is different in many ways and a revalidation was considered necessary to guarantee its current usefulness. To evaluate the validity and reliability of the surgical services severity index to warrant its reasonable use under current conditions. A descriptive study was carried out in the General Surgery service of the "Hermanos Ameijeiras" Clinical Surgical Hospital of Havana, Cuba during the second half of 2010. We reviewed the medical records of 511 patients discharged from this service. Items were the same as the original index as were their weighted values. Conceptual or construct validity, criterion validity and inter-rater reliability as well as internal consistency of the proposed index were evaluated. Construct validity was expressed as a significant association between the value of the severity index for surgical services and discharge status. A significant association was also found, although weak, with length of hospital stay. Criterion validity was demonstrated through the correlations between the severity index for surgical services and other similar indices. Regarding criterion validity, the Horn index showed a correlation of 0.722 (95% CI: 0.677-0.761) with our index. With the POSSUM score, correlation was 0.454 (95% CI: 0.388-0.514) with mortality risk and 0.539 (95% CI: 0.462-0.607) with morbidity risk. Internal consistency yielded a standardized Cronbach's alpha of 0.8; inter-rater reliability resulted in a reliability coefficient of 0.98 for the quantitative index and a weighted global Kappa coefficient of 0.87 for the ordinal surgical index of severity for surgical services (IGQ). The validity and reliability of the proposed index was satisfactory in all aspects evaluated. The surgical services severity index may be used in the original context and is easily adaptable to other contexts as well.
Light aircraft crash safety program
NASA Technical Reports Server (NTRS)
Thomson, R. G.; Hayduk, R. J.
1974-01-01
NASA is embarked upon research and development tasks aimed at providing the general aviation industry with a reliable crashworthy airframe design technology. The goals of the NASA program are: reliable analytical techniques for predicting the nonlinear behavior of structures; significant design improvements of airframes; and simulated full-scale crash test data. The analytical tools will include both simplified procedures for estimating energy absorption characteristics and more complex computer programs for analysis of general airframe structures under crash loading conditions. The analytical techniques being developed both in-house and under contract are described, and a comparison of some analytical predictions with experimental results is shown.
A microRNA detection system based on padlock probes and rolling circle amplification
Jonstrup, Søren Peter; Koch, Jørn; Kjems, Jørgen
2006-01-01
The differential expression and the regulatory roles of microRNAs (miRNAs) are being studied intensively these years. Their minute size of only 19–24 nucleotides and strong sequence similarity among related species call for enhanced methods for reliable detection and quantification. Moreover, miRNA expression is generally restricted to a limited number of specific cells within an organism and therefore requires highly sensitive detection methods. Here we present a simple and reliable miRNA detection protocol based on padlock probes and rolling circle amplification. It can be performed without specialized equipment and is capable of measuring the content of specific miRNAs in a few nanograms of total RNA. PMID:16888321
An Online Risk Monitor System (ORMS) to Increase Safety and Security Levels in Industry
NASA Astrophysics Data System (ADS)
Zubair, M.; Rahman, Khalil Ur; Hassan, Mehmood Ul
2013-12-01
The main idea of this research is to develop an Online Risk Monitor System (ORMS) based on Living Probabilistic Safety Assessment (LPSA). The article highlights the essential features and functions of ORMS. The basic models and modules such as, Reliability Data Update Model (RDUM), running time update, redundant system unavailability update, Engineered Safety Features (ESF) unavailability update and general system update have been described in this study. ORMS not only provides quantitative analysis but also highlights qualitative aspects of risk measures. ORMS is capable of automatically updating the online risk models and reliability parameters of equipment. ORMS can support in the decision making process of operators and managers in Nuclear Power Plants.
A microRNA detection system based on padlock probes and rolling circle amplification.
Jonstrup, Søren Peter; Koch, Jørn; Kjems, Jørgen
2006-09-01
The differential expression and the regulatory roles of microRNAs (miRNAs) are being studied intensively these years. Their minute size of only 19-24 nucleotides and strong sequence similarity among related species call for enhanced methods for reliable detection and quantification. Moreover, miRNA expression is generally restricted to a limited number of specific cells within an organism and therefore requires highly sensitive detection methods. Here we present a simple and reliable miRNA detection protocol based on padlock probes and rolling circle amplification. It can be performed without specialized equipment and is capable of measuring the content of specific miRNAs in a few nanograms of total RNA.
Ovarian and cervical cancer awareness: development of two validated measurement tools.
Simon, Alice E; Wardle, Jane; Grimmett, Chloe; Power, Emily; Corker, Elizabeth; Menon, Usha; Matheson, Lauren; Waller, Jo
2012-07-01
The aim of the study was to develop and validate measures of awareness of symptoms and risk factors for ovarian and cervical cancer (Ovarian and Cervical Cancer Awareness Measures). Potentially relevant items were extracted from the literature and generated by experts. Four validation studies were carried out to establish reliability and validity. Women aged 21-67 years (n=146) and ovarian and cervical cancer experts (n=32) were included in the studies. Internal reliability was assessed psychometrically. Test-retest reliability was assessed over a 1-week interval. To establish construct validity, Cancer Awareness Measure (CAM) scores of cancer experts were compared with equally well-educated comparison groups. Sensitivity to change was tested by randomly assigning participants to read either a leaflet giving information about ovarian/cervical cancer or a leaflet with control information, and then completing the ovarian/cervical CAM. Internal reliability (Cronbach's α=0.88 for the ovarian CAM and α=0.84 for the cervical CAM) and test-retest reliability (r=0.84 and r=0.77 for the ovarian and cervical CAMs, respectively) were both high. Validity was demonstrated with cancer experts achieving higher scores than controls [ovarian CAM: t(36)= -5.6, p<0.001; cervical CAM: t(38)= -3.7, p=0.001], and volunteers who were randomised to read a cancer leaflet scored higher than those who received a control leaflet [ovarian CAM: t(49)=7.5, p<0.001; cervical CAM: t(48)= -5.5, p<0.001]. This study demonstrates the psychometric properties of the ovarian and cervical CAMs and supports their utility in assessing ovarian and cervical cancer awareness in the general population.
Ovarian and cervical cancer awareness: development of two validated measurement tools
Simon, Alice E; Wardle, Jane; Grimmett, Chloe; Power, Emily; Corker, Elizabeth; Menon, Usha; Matheson, Lauren; Waller, Jo
2012-01-01
Background The aim of the study was to develop and validate measures of awareness of symptoms and risk factors for ovarian and cervical cancer (Ovarian and Cervical Cancer Awareness Measures). Methods Potentially relevant items were extracted from the literature and generated by experts. Four validation studies were carried out to establish reliability and validity. Women aged 21–67 years (n=146) and ovarian and cervical cancer experts (n=32) were included in the studies. Internal reliability was assessed psychometrically. Test-retest reliability was assessed over a 1-week interval. To establish construct validity, Cancer Awareness Measure (CAM) scores of cancer experts were compared with equally well-educated comparison groups. Sensitivity to change was tested by randomly assigning participants to read either a leaflet giving information about ovarian/cervical cancer or a leaflet with control information, and then completing the ovarian/cervical CAM. Results Internal reliability (Cronbach's α=0.88 for the ovarian CAM and α=0.84 for the cervical CAM) and test-retest reliability (r=0.84 and r=0.77 for the ovarian and cervical CAMs, respectively) were both high. Validity was demonstrated with cancer experts achieving higher scores than controls [ovarian CAM: t(36)= –5.6, p<0.001; cervical CAM: t(38)= –3.7, p=0.001], and volunteers who were randomised to read a cancer leaflet scored higher than those who received a control leaflet [ovarian CAM: t(49)=7.5, p<0.001; cervical CAM: t(48)= –5.5, p<0.001]. Conclusions This study demonstrates the psychometric properties of the ovarian and cervical CAMs and supports their utility in assessing ovarian and cervical cancer awareness in the general population. PMID:21933805
Schaefer, Lauren M; Harriger, Jennifer A; Heinberg, Leslie J; Soderberg, Taylor; Kevin Thompson, J
2017-02-01
The Sociocultural Attitudes Toward Appearance Questionnaire-4 (SATAQ-4) is a measure of internalization of appearance ideals (i.e., personal acceptance of societal ideals) and appearance pressures (i.e., pressures to achieve the societal ideal). The current study sought to address limitations of the scale in order to increase precision in the measurement of muscular ideal internalization, include an assessment of one's desire for attractiveness, and broaden the measurement of appearance-related pressures. The factor structure, reliability and construct validity of the SATAQ-4-Revised were examined among college women (N = 1,114) in Study 1, adolescent girls (N = 275) in Study 2, and college men (N = 290) in Study 3. Factor analysis among college women indicated a 7-factor 31-item scale, labeled the SATAQ-4R-Female: (1) Internalization: Thin/Low Body Fat, (2) Internalization: Muscular, (3) Internalization: General Attractiveness, (4) Pressures: Family, (5) Pressures: Media, (6) Pressures: Peers, and (7) Pressures: Significant Others. SATAQ-4R-Female subscales demonstrated good reliability and construct validity among college women. Examination of the SATAQ-4R-Female among adolescent girls suggested a six-factor scale in which peer and significant others items comprised a single subscale. The scale demonstrated good reliability and construct validity in adolescent girls. Examination of the SATAQ-4R among men produced a 28-item scale with seven factors paralleling the factors identified among college women. This scale, labeled the SATAQ-4R-Male, demonstrated good reliability and construct validity. Results support the reliability and validity of SATAQ-4R-Female in college women and adolescent girls, and the SATAQ-4R-Male in college men. © 2016 Wiley Periodicals, Inc.(Int J Eat Disord 2017; 50:104-117). © 2016 Wiley Periodicals, Inc.
Oh, HyunSoo; Lee, Seul; Kim, JiSun; Lee, EunJu; Min, HyoNam; Cho, OkJa; Seo, WhaSook
2015-07-01
This study was conducted to develop a family relocation stress scale by modifying the Son's Relocation Stress Syndrome Scale, to examine its clinical validity and reliability and to confirm its suitability for measuring family relocation stress. The transfer of ICU patients to general wards is a significant anxiety-producing event for family members. However, no relocation stress scale has been developed specifically for families. A nonexperimental, correlation design was adopted. The study subjects were 95 family members of 95 ICU patients at a university hospital located in Incheon, South Korea. Face and construct validities of the devised family relocation stress scale were examined. Construct validity was examined using factor analysis and by using a nomological validity test. Reliability was also examined. Face and content validity of the scale were verified by confirming that its items adequately measured family relocation stress. Factor analysis yielded four components, and the total variance explained by these four components was 63·0%, which is acceptable. Nomological validity was well supported by significant relationships between relocation stress and degree of preparation for relocation, patient self-care ability, family burden and satisfaction with the relocation process. The devised scale was also found to have good reliability. The family relocation stress scale devised in this study was found to have good validity and reliability, and thus, is believed to offer a means of assessing family relocation stress. The findings of this study provide a reliable and valid assessment tool when nurses prepare families for patient transfer from an ICU to a ward setting, and may also provide useful information to those developing an intervention programme for family relocation stress management. © 2015 John Wiley & Sons Ltd.
Löwe, Bernd; Decker, Oliver; Müller, Stefanie; Brähler, Elmar; Schellberg, Dieter; Herzog, Wolfgang; Herzberg, Philipp Yorck
2008-03-01
The 7-item Generalized Anxiety Disorder Scale (GAD-7) is a practical self-report anxiety questionnaire that proved valid in primary care. However, the GAD-7 was not yet validated in the general population and thus far, normative data are not available. To investigate reliability, construct validity, and factorial validity of the GAD-7 in the general population and to generate normative data. Nationally representative face-to-face household survey conducted in Germany between May 5 and June 8, 2006. Five thousand thirty subjects (53.6% female) with a mean age (SD) of 48.4 (18.0) years. The survey questionnaire included the GAD-7, the 2-item depression module from the Patient Health Questionnaire (PHQ-2), the Rosenberg Self-Esteem Scale, and demographic characteristics. Confirmatory factor analyses substantiated the 1-dimensional structure of the GAD-7 and its factorial invariance for gender and age. Internal consistency was identical across all subgroups (alpha = 0.89). Intercorrelations with the PHQ-2 and the Rosenberg Self-Esteem Scale were r = 0.64 (P < 0.001) and r = -0.43 (P < 0.001), respectively. As expected, women had significantly higher mean (SD) GAD-7 anxiety scores compared with men [3.2 (3.5) vs. 2.7 (3.2); P < 0.001]. Normative data for the GAD-7 were generated for both genders and different age levels. Approximately 5% of subjects had GAD-7 scores of 10 or greater, and 1% had GAD-7 scores of 15 or greater. Evidence supports reliability and validity of the GAD-7 as a measure of anxiety in the general population. The normative data provided in this study can be used to compare a subject's GAD-7 score with those determined from a general population reference group.
Modeling and experimental characterization of electromigration in interconnect trees
NASA Astrophysics Data System (ADS)
Thompson, C. V.; Hau-Riege, S. P.; Andleigh, V. K.
1999-11-01
Most modeling and experimental characterization of interconnect reliability is focussed on simple straight lines terminating at pads or vias. However, laid-out integrated circuits often have interconnects with junctions and wide-to-narrow transitions. In carrying out circuit-level reliability assessments it is important to be able to assess the reliability of these more complex shapes, generally referred to as `trees.' An interconnect tree consists of continuously connected high-conductivity metal within one layer of metallization. Trees terminate at diffusion barriers at vias and contacts, and, in the general case, can have more than one terminating branch when they include junctions. We have extended the understanding of `immortality' demonstrated and analyzed for straight stud-to-stud lines, to trees of arbitrary complexity. This leads to a hierarchical approach for identifying immortal trees for specific circuit layouts and models for operation. To complete a circuit-level-reliability analysis, it is also necessary to estimate the lifetimes of the mortal trees. We have developed simulation tools that allow modeling of stress evolution and failure in arbitrarily complex trees. We are testing our models and simulations through comparisons with experiments on simple trees, such as lines broken into two segments with different currents in each segment. Models, simulations and early experimental results on the reliability of interconnect trees are shown to be consistent.
New Aspects of Probabilistic Forecast Verification Using Information Theory
NASA Astrophysics Data System (ADS)
Tödter, Julian; Ahrens, Bodo
2013-04-01
This work deals with information-theoretical methods in probabilistic forecast verification, particularly concerning ensemble forecasts. Recent findings concerning the "Ignorance Score" are shortly reviewed, then a consistent generalization to continuous forecasts is motivated. For ensemble-generated forecasts, the presented measures can be calculated exactly. The Brier Score (BS) and its generalizations to the multi-categorical Ranked Probability Score (RPS) and to the Continuous Ranked Probability Score (CRPS) are prominent verification measures for probabilistic forecasts. Particularly, their decompositions into measures quantifying the reliability, resolution and uncertainty of the forecasts are attractive. Information theory sets up a natural framework for forecast verification. Recently, it has been shown that the BS is a second-order approximation of the information-based Ignorance Score (IGN), which also contains easily interpretable components and can also be generalized to a ranked version (RIGN). Here, the IGN, its generalizations and decompositions are systematically discussed in analogy to the variants of the BS. Additionally, a Continuous Ranked IGN (CRIGN) is introduced in analogy to the CRPS. The useful properties of the conceptually appealing CRIGN are illustrated, together with an algorithm to evaluate its components reliability, resolution, and uncertainty for ensemble-generated forecasts. This algorithm can also be used to calculate the decomposition of the more traditional CRPS exactly. The applicability of the "new" measures is demonstrated in a small evaluation study of ensemble-based precipitation forecasts.
Yanez, B; Pearman, T; Lis, C G; Beaumont, J L; Cella, D
2013-04-01
Health-related quality-of-life (HRQOL) assessments in research and clinical oncology settings are increasingly important. HRQOL instruments need to be rapid and still maintain the ability to capture the most relevant patient issues in a valid and reliable manner. The current study develops and validates the FACT-G7, a rapid version of the Functional Assessment of Cancer Therapy-General (FACT-G). Oncology patients with advanced cancer (N = 533) from 11 diseases sites ranked the symptoms and concerns they viewed as 'the very most important' when undergoing cancer treatment, completed the FACT-G, and additional HRQOL measures. Oncology patients' scores were referenced across a general US population sample (N = 2000). We selected the highest priority cancer-related symptoms and concerns endorsed by patients for inclusion in the FACT-G7. Fatigue and ability to enjoy life were ranked the most highly. The results provide preliminary support for the FACT-G7's internal consistency reliability (α = 0.74) and validity as evidenced by moderate-to-strong relationships with expected criteria. The references for the general population are summarized. The FACT-G7 can be used to assess top-rated symptoms and concerns for a broad spectrum of advanced cancers in clinical practice and research.
NASA Technical Reports Server (NTRS)
Bavuso, Salvatore J.; Rothmann, Elizabeth; Mittal, Nitin; Koppen, Sandra Howell
1994-01-01
The Hybrid Automated Reliability Predictor (HARP) integrated Reliability (HiRel) tool system for reliability/availability prediction offers a toolbox of integrated reliability/availability programs that can be used to customize the user's application in a workstation or nonworkstation environment. HiRel consists of interactive graphical input/output programs and four reliability/availability modeling engines that provide analytical and simulative solutions to a wide host of highly reliable fault-tolerant system architectures and is also applicable to electronic systems in general. The tool system was designed at the outset to be compatible with most computing platforms and operating systems, and some programs have been beta tested within the aerospace community for over 8 years. This document is a user's guide for the HiRel graphical preprocessor Graphics Oriented (GO) program. GO is a graphical user interface for the HARP engine that enables the drawing of reliability/availability models on a monitor. A mouse is used to select fault tree gates or Markov graphical symbols from a menu for drawing.
Hybrid automated reliability predictor integrated work station (HiREL)
NASA Technical Reports Server (NTRS)
Bavuso, Salvatore J.
1991-01-01
The Hybrid Automated Reliability Predictor (HARP) integrated reliability (HiREL) workstation tool system marks another step toward the goal of producing a totally integrated computer aided design (CAD) workstation design capability. Since a reliability engineer must generally graphically represent a reliability model before he can solve it, the use of a graphical input description language increases productivity and decreases the incidence of error. The captured image displayed on a cathode ray tube (CRT) screen serves as a documented copy of the model and provides the data for automatic input to the HARP reliability model solver. The introduction of dependency gates to a fault tree notation allows the modeling of very large fault tolerant system models using a concise and visually recognizable and familiar graphical language. In addition to aiding in the validation of the reliability model, the concise graphical representation presents company management, regulatory agencies, and company customers a means of expressing a complex model that is readily understandable. The graphical postprocessor computer program HARPO (HARP Output) makes it possible for reliability engineers to quickly analyze huge amounts of reliability/availability data to observe trends due to exploratory design changes.
Reliability and Validity of Athletes Disability Index Questionnaire.
Noormohammadpour, Pardis; Hosseini Khezri, Alireza; Farahbakhsh, Farzin; Mansournia, Mohammad Ali; Smuck, Matthew; Kordi, Ramin
2018-03-01
The purpose of this study was to evaluate validity and reliability of a new proposed questionnaire for assessment of functional disability in athletes with low back pain (LBP). Validity and reliability study. Elite athletes participating in different fields of sports. Participants were 165 male and female athletes (between 12 and 50 years old) with LBP. Athlete Disability Index (ADI) Questionnaire which is developed by the authors for assessing LBP-related disability in athletes, Oswestry Disability Index (ODI), and the Roland-Morris Disability Questionnaire (RDQ). Self-reported responses were collected regarding LBP-related disability through ADI, ODI, and RDQ. The test-retest reliability was strong, and intraclass correlation value ranged between 0.74 and 0.94. The Cronbach alpha coefficient value of 0.91 (P < 0.001) demonstrated excellent internal consistency of the questionnaire. The correlation coefficient between ADI and ODI was r = 0.918 (P < 0.0001), between ADI and RDQ was r = 0.669 (P < 0.0001), and between ADI and visual analog scale was r = 0.626 (P < 0.001). According to ODI and RDQ, disability levels were mild in the large majority of subjects (91.5% and 86.0%, respectively). Alternatively, disability assessments by the ADI did not cluster at the mild level and ranged more broadly from mild to very high. The ADI is a reliable and valid instrument for assessing disability in athletes with LBP. Compared with the available LBP disability questionnaires used in the general population, ADI can more precisely stratify the disability levels of athletes due to LBP.
Ruan, W. June; Goldstein, Risë B.; Chou, S. Patricia; Smith, Sharon M.; Saha, Tulshi D.; Pickering, Roger P.; Dawson, Deborah A.; Huang, Boji; Stinson, Frederick S.; Grant, Bridget F.
2008-01-01
This study presents test-retest reliability statistics and information on internal consistency for new diagnostic modules and risk factor of alcohol, drug, and psychiatric disorders the Alcohol Use Disorder and Associated Disabilities Interview Schedule-IV (AUDADIS-IV). Test-retest statistics were derived from a random sample of 1,899 adults selected from 34,653 respondents who participated in the 2004–2005 Wave 2 National Epidemiologic Survey on Alcohol and Related Conditions (NESARC). Internal consistency of continuous scales was assessed using the entire Wave 2 NESARC. Both test and retest interviews were conducted face-to-face. Test-retest and internal consistency results for diagnoses and symptom scales associated with posttraumatic stress disorder, attention-deficit/hyperactivity disorder, and borderline, narcissistic, and schizotypal personality disorders were predominantly good (kappa > 0.63; ICC > 0.69; alpha > 0.75) and reliability for risk factor measures fell within the good to excellent range (intraclass correlations = 0.50–0.94; alpha = 0.64–0.90). The high degree of reliability found in this study suggests that new AUDADIS-IV diagnostic measures can be useful tools in research settings. The availability of highly reliable measures of risk factors of alcohol, drug, and psychiatric disorders will contribute to the validity of conclusions drawn from future research in the domains of substance use disorder and psychiatric epidemiology. PMID:17706375
Properties of the DASS-21 in an Australian Community Adolescent Population.
Shaw, T; Campbell, M A; Runions, K C; Zubrick, S R
2017-07-01
Although developed for adults, the Depression Anxiety Stress Scales-Short Version (DASS-21) has been used in many research studies with adolescent samples. Evidence as to the applicability of the DASS subscale scores to represent the distinct states of depression, anxiety, and stress as experienced by adolescents is mixed, and the age at which it may be possible to differentiate these 3 states using the DASS-21 has not yet been determined. This study evaluated evidence for a multifactor structure in the DASS-21 in adolescents and the specificity of the 3 subscales for adolescents in general and at different ages. Data were from a large cross-sectional survey of 2,873 school students in Grades 6-12 (aged 12-18 years) in Australia. We conducted confirmatory bifactor analyses testing a general mental health distress factor and 3 domain-specific factors for anxiety, depression, and stress for the whole sample and across gender by age groups. The internal consistency reliability of the DASS total and subscale scores was determined using omega coefficients. Analyses identified that most of the variation in the items was explained by the dominance of a single, general factor and the subscales lacked specificity across all age groups. The DASS-21 can be reliably used to measure general distress in adolescents, but the subscales fail to discriminate between the 3 states. Our results indicate that this lack of discrimination does not reduce with increasing age. These findings caution against the use of adult theoretical models and measures within adolescent populations. © 2016 Wiley Periodicals, Inc.
Regional Frequency and Uncertainty Analysis of Extreme Precipitation in Bangladesh
NASA Astrophysics Data System (ADS)
Mortuza, M. R.; Demissie, Y.; Li, H. Y.
2014-12-01
Increased frequency of extreme precipitations, especially those with multiday durations, are responsible for recent urban floods and associated significant losses of lives and infrastructures in Bangladesh. Reliable and routinely updated estimation of the frequency of occurrence of such extreme precipitation events are thus important for developing up-to-date hydraulic structures and stormwater drainage system that can effectively minimize future risk from similar events. In this study, we have updated the intensity-duration-frequency (IDF) curves for Bangladesh using daily precipitation data from 1961 to 2010 and quantified associated uncertainties. Regional frequency analysis based on L-moments is applied on 1-day, 2-day and 5-day annual maximum precipitation series due to its advantages over at-site estimation. The regional frequency approach pools the information from climatologically similar sites to make reliable estimates of quantiles given that the pooling group is homogeneous and of reasonable size. We have used Region of influence (ROI) approach along with homogeneity measure based on L-moments to identify the homogenous pooling groups for each site. Five 3-parameter distributions (i.e., Generalized Logistic, Generalized Extreme value, Generalized Normal, Pearson Type Three, and Generalized Pareto) are used for a thorough selection of appropriate models that fit the sample data. Uncertainties related to the selection of the distributions and historical data are quantified using the Bayesian Model Averaging and Balanced Bootstrap approaches respectively. The results from this study can be used to update the current design and management of hydraulic structures as well as in exploring spatio-temporal variations of extreme precipitation and associated risk.