DOE Office of Scientific and Technical Information (OSTI.GOV)
Hamachi La Commare, Kristina
Metrics for reliability, such as the frequency and duration of power interruptions, have been reported by electric utilities for many years. This study examines current utility practices for collecting and reporting electricity reliability information and discusses challenges that arise in assessing reliability because of differences among these practices. The study is based on reliability information for year 2006 reported by 123 utilities in 37 states representing over 60percent of total U.S. electricity sales. We quantify the effects that inconsistencies among current utility reporting practices have on comparisons of System Average Interruption Duration Index (SAIDI) and System Average Interruption Frequency Indexmore » (SAIFI) reported by utilities. We recommend immediate adoption of IEEE Std. 1366-2003 as a consistent method for measuring and reporting reliability statistics.« less
Walker, Lorraine O; Kirby, Russell S
2010-11-01
Early parenting practices are significant to public health because of their linkages to child health outcomes. This paper focuses on the current state of the science regarding conceptual frameworks that incorporate early parenting practices in epidemiologic research and evidence supporting reliability and validity of self-report measures of such practices. Guided by a provisional definition of early parenting practices, literature searches were conducted using PubMed and Sociological Abstracts. Twenty-five published studies that included parent-report measures of early parenting practices met inclusion criteria. Findings on conceptual frameworks were analyzed qualitatively, whereas evidence of reliability and validity were organized into four domains (safety, feeding and oral health, development promotion, and discipline) and summarized in tabular form. Quantitative estimates of measures of reliability and validity were extracted, where available. We found two frameworks incorporating early parenting: one a program theory and the other a predictive model. We found no reported evidence of the reliability or validity of parent-report measures of safety or feeding and oral health practices. Evidence for reliability and validity were reported with greater frequency for development promotion and discipline practices, but report of the most pertinent type of reliability estimation, test-retest reliability, was rare. Failure to examine associations of early parenting practices with any child outcomes within most studies resulted in missed opportunities to indirectly estimate validity of parenting practice measures. Stronger evidence concerning specific measurement properties of early parenting practices is important to advancing maternal-child research, surveillance, and practice.
Apollo experience report: Reliability and quality assurance
NASA Technical Reports Server (NTRS)
Sperber, K. P.
1973-01-01
The reliability of the Apollo spacecraft resulted from the application of proven reliability and quality techniques and from sound management, engineering, and manufacturing practices. Continual assessment of these techniques and practices was made during the program, and, when deficiencies were detected, adjustments were made and the deficiencies were effectively corrected. The most significant practices, deficiencies, adjustments, and experiences during the Apollo Program are described in this report. These experiences can be helpful in establishing an effective base on which to structure an efficient reliability and quality assurance effort for future space-flight programs.
ERIC Educational Resources Information Center
Green, Carlton E.; Chen, Cynthia E.; Helms, Janet E.; Henze, Kevin T.
2011-01-01
Helms, Henze, Sass, and Mifsud (2006) defined good practices for internal consistency reporting, interpretation, and analysis consistent with an alpha-as-data perspective. Their viewpoint (a) expands on previous arguments that reliability coefficients are group-level summary statistics of samples' responses rather than stable properties of scales…
Reliability reporting across studies using the Buss Durkee Hostility Inventory.
Vassar, Matt; Hale, William
2009-01-01
Empirical research on anger and hostility has pervaded the academic literature for more than 50 years. Accurate measurement of anger/hostility and subsequent interpretation of results requires that the instruments yield strong psychometric properties. For consistent measurement, reliability estimates must be calculated with each administration, because changes in sample characteristics may alter the scale's ability to generate reliable scores. Therefore, the present study was designed to address reliability reporting practices for a widely used anger assessment, the Buss Durkee Hostility Inventory (BDHI). Of the 250 published articles reviewed, 11.2% calculated and presented reliability estimates for the data at hand, 6.8% cited estimates from a previous study, and 77.1% made no mention of score reliability. Mean alpha estimates of scores for BDHI subscales generally fell below acceptable standards. Additionally, no detectable pattern was found between reporting practices and publication year or journal prestige. Areas for future research are also discussed.
Score Reliability of Adolescent Alcohol Screening Measures: A Meta-Analytic Inquiry
ERIC Educational Resources Information Center
Shields, Alan L.; Campfield, Delia C.; Miller, Christopher S.; Howell, Ryan T.; Wallace, Kimberly; Weiss, Roger D.
2008-01-01
This study describes the reliability reporting practices in empirical studies using eight adolescent alcohol screening tools and characterizes and explores variability in internal consistency estimates across samples. Of 119 observed administrations of these instruments, 40 (34%) reported usable reliability information. The Personal Experience…
A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research.
Koo, Terry K; Li, Mae Y
2016-06-01
Intraclass correlation coefficient (ICC) is a widely used reliability index in test-retest, intrarater, and interrater reliability analyses. This article introduces the basic concept of ICC in the content of reliability analysis. There are 10 forms of ICCs. Because each form involves distinct assumptions in their calculation and will lead to different interpretations, researchers should explicitly specify the ICC form they used in their calculation. A thorough review of the research design is needed in selecting the appropriate form of ICC to evaluate reliability. The best practice of reporting ICC should include software information, "model," "type," and "definition" selections. When coming across an article that includes ICC, readers should first check whether information about the ICC form has been reported and if an appropriate ICC form was used. Based on the 95% confident interval of the ICC estimate, values less than 0.5, between 0.5 and 0.75, between 0.75 and 0.9, and greater than 0.90 are indicative of poor, moderate, good, and excellent reliability, respectively. This article provides a practical guideline for clinical researchers to choose the correct form of ICC and suggests the best practice of reporting ICC parameters in scientific publications. This article also gives readers an appreciation for what to look for when coming across ICC while reading an article.
Lindsley, Kristina; Li, Tianjing; Ssemanda, Elizabeth; Virgili, Gianni; Dickersin, Kay
2016-01-01
Topic Are existing systematic reviews of interventions for age-related macular degeneration incorporated into clinical practice guidelines? Clinical relevance High-quality systematic reviews should be used to underpin evidence-based clinical practice guidelines and clinical care. We have examined the reliability of systematic reviews of interventions for age-related macular degeneration (AMD) and described the main findings of reliable reviews in relation to clinical practice guidelines. Methods Eligible publications are systematic reviews of the effectiveness of treatment interventions for AMD. We searched a database of systematic reviews in eyes and vision and employed no language or date restrictions; the database is up-to-date as of May 6, 2014. Two authors independently screened records for eligibility and abstracted and assessed the characteristics and methods of each review. We classified reviews as “reliable” when they reported eligibility criteria, comprehensive searches, appraisal of methodological quality of included studies, appropriate statistical methods for meta-analysis, and conclusions based on results. We mapped treatment recommendations from the American Academy of Ophthalmology Preferred Practice Patterns (AAO PPP) for AMD to the identified systematic reviews and assessed whether any reliable systematic review was cited or could have been cited to support each treatment recommendation. Results Of 1,570 systematic reviews in our database, 47 met our inclusion criteria. Most of the systematic reviews targeted neovascular AMD and investigated anti-vascular endothelial growth factor (anti-VEGF) interventions, dietary supplements or photodynamic therapy. We classified over two-thirds (33/47) of the reports as reliable. The quality of reporting varied, with criteria for reliable reporting met more often for Cochrane reviews and for reviews whose authors disclosed conflicts of interest. Although most systematic reviews were reliable, anti-VEGF agents and photodynamic therapy were the only interventions identified as effective by reliable reviews. Of 35 treatment recommendations extracted from the AAO PPP, 15 could have been supported with reliable systematic reviews; however, only one recommendation had an accompanying intervention systematic review citation, which we assessed as a reliable systematic review. No reliable systematic review was identified for 20 treatment recommendations, highlighting areas of evidence gaps. Conclusions For AMD, reliable systematic reviews exist for many treatment recommendations in the AAO PPP and should be used to support these recommendations. We also identified areas where no high-level evidence exists. Mapping clinical practice guidelines to existing systematic reviews is one way to highlight areas where evidence generation or evidence synthesis is either available or needed. PMID:26804762
Lindsley, Kristina; Li, Tianjing; Ssemanda, Elizabeth; Virgili, Gianni; Dickersin, Kay
2016-04-01
Are existing systematic reviews of interventions for age-related macular degeneration incorporated into clinical practice guidelines? High-quality systematic reviews should be used to underpin evidence-based clinical practice guidelines and clinical care. We examined the reliability of systematic reviews of interventions for age-related macular degeneration (AMD) and described the main findings of reliable reviews in relation to clinical practice guidelines. Eligible publications were systematic reviews of the effectiveness of treatment interventions for AMD. We searched a database of systematic reviews in eyes and vision without language or date restrictions; the database was up to date as of May 6, 2014. Two authors independently screened records for eligibility and abstracted and assessed the characteristics and methods of each review. We classified reviews as reliable when they reported eligibility criteria, comprehensive searches, methodologic quality of included studies, appropriate statistical methods for meta-analysis, and conclusions based on results. We mapped treatment recommendations from the American Academy of Ophthalmology (AAO) Preferred Practice Patterns (PPPs) for AMD to systematic reviews and citations of reliable systematic reviews to support each treatment recommendation. Of 1570 systematic reviews in our database, 47 met inclusion criteria; most targeted neovascular AMD and investigated anti-vascular endothelial growth factor (VEGF) interventions, dietary supplements, or photodynamic therapy. We classified 33 (70%) reviews as reliable. The quality of reporting varied, with criteria for reliable reporting met more often by Cochrane reviews and reviews whose authors disclosed conflicts of interest. Anti-VEGF agents and photodynamic therapy were the only interventions identified as effective by reliable reviews. Of 35 treatment recommendations extracted from the PPPs, 15 could have been supported with reliable systematic reviews; however, only 1 recommendation cited a reliable intervention systematic review. No reliable systematic review was identified for 20 treatment recommendations, highlighting areas of evidence gaps. For AMD, reliable systematic reviews exist for many treatment recommendations in the AAO PPPs and should be cited to support these recommendations. We also identified areas where no high-level evidence exists. Mapping clinical practice guidelines to existing systematic reviews is one way to highlight areas where evidence generation or evidence synthesis is either available or needed. Copyright © 2016 American Academy of Ophthalmology. Published by Elsevier Inc. All rights reserved.
Treating Cronbach's Alpha Reliability Coefficients as Data in Counseling Research
ERIC Educational Resources Information Center
Helms, Janet E.; Henze, Kevin T.; Sass, Terry L.; Mifsud, Venus A.
2006-01-01
Scientific associations and measurement experts in psychology and education have voiced various standards and best-practice recommendations concerning reliability data over the years. Yet in the counseling psychology literature, there is virtually no single-source compilation and articulation of good practices for reporting, analyzing, and…
Tapering Practices of Strongman Athletes: Test-Retest Reliability Study
Pritchard, Hayden J; Keogh, Justin WL
2017-01-01
Background Little is currently known about the tapering practices of strongman athletes. We have developed an Internet-based comprehensive self-report questionnaire examining the training and tapering practices of strongman athletes. Objective The objective of this study was to document the test-retest reliability of questions associated with the Internet-based comprehensive self-report questionnaire on the tapering practices of strongman athletes. The information will provide insight on the reliability and usefulness of the online questionnaire for use with strongman athletes. Methods Invitations to complete an Internet questionnaire were sent via Facebook Messenger to identified strongman athletes. The survey consisted of four main areas of inquiry, including demographics and background information, training practices, tapering, and tapering practices. Of the 454 athletes that completed the survey over the 8-week period, 130 athletes responded on Facebook Messenger indicating that they intended to complete, or had completed, the survey. These participants were asked if they could complete the online questionnaire a second time for a test-retest reliability analysis. Sixty-four athletes (mean age 33.3 years, standard deviation [SD] 7.7; mean height 178.2 cm, SD 11.0; mean body mass 103.7 kg, SD 24.8) accepted this invitation and completed the survey for the second time after a minimum 7-day period from the date of their first completion. Agreement between athlete responses was measured using intraclass correlation coefficients (ICCs) and kappa statistics. Confidence intervals (at 95%) were reported for all measures and significance was set at P<.05. Results Test-retest reliability for demographic and training practices items were significant (P<.001) and showed excellent (ICC range=.84 to .98) and fair to almost perfect agreement (κ range=.37-.85). Moderate to excellent agreements (ICC range=.56-.84; P<.01) were observed for all tapering practice measures except for the number of days athletes started their usual taper before a strongman competition (ICC=.30). When the number of days were categorized with additional analyses, moderate reliability was observed (κ=.43; P<.001). Fair to substantial agreement was observed for the majority of tapering practices measures (κrange=.38-.73; P<.001) except for how training frequency (κ=.26) and the percentage and type of resistance training performed, which changed in the taper (κ=.20). Good to excellent agreement (ICC=.62-.93; P<.05) was observed for items relating to strongman events and traditional exercises performed during the taper. Only the time at which the Farmer’s Walk was last performed before competition showed poor reliability (ICC=.27). Conclusions We have developed a low cost, self-reported, online retrospective questionnaire, which provided stable and reliable answers for most of the demographic, training, and tapering practice questions. The results of this study support the inferences drawn from the Tapering Practices of Strongman Athletes Study. PMID:29089292
The quality of orthodontic practice websites.
Parekh, J; Gill, D S
2014-05-01
To evaluate orthodontic practice websites for the reliability of information presented, accessibility, usability for patients and compliance to General Dental Council (GDC) regulations on ethical advertising. World Wide Web. The term 'orthodontic practice' was entered into three separate search engines. The 30 websites from the UK were selected and graded according to the LIDA tool (a validated method of evaluating healthcare websites) for accessibility, usability of the website and reliability of information on orthodontic treatment. The websites were then evaluated against the GDC's Principles for ethical advertising in nine different criteria. On average, each website fulfilled six out of nine points of the GDC's criteria, with inclusion of a complaints policy being the most poorly fulfilled criteria. The mean LIDA score (a combination of usability, reliability and accessibility) was 102/144 (standard deviation 8.38). The websites scored most poorly on reliability (average 43% SD 11.7), with no single website reporting a clear, reliable method of content production. Average accessibility was 81% and usability 73%. In general, websites did not comply with GDC guidelines on ethical advertising. Furthermore, practitioners should consider reporting their method of information production, particularly when making claims about efficiency and speed of treatment in order to improve reliability.
ERIC Educational Resources Information Center
Onwuegbuzie, Anthony J.; Daniel, Larry G.
The purposes of this paper are to identify common errors made by researchers when dealing with reliability coefficients and to outline best practices for reporting and interpreting reliability coefficients. Common errors that researchers make are: (1) stating that the instruments are reliable; (2) incorrectly interpreting correlation coefficients;…
Score Reliability: A Retrospective Look Back at 12 Years of Reliability Generalization Studies
ERIC Educational Resources Information Center
Vacha-Haase, Tammi; Thompson, Bruce
2011-01-01
The present study was conducted to characterize (a) the features of the thousands of primary reports synthesized in 47 reliability generalization (RG) measurement meta-analysis studies and (b) typical methodological practice within the RG literature to date. With respect to the treatment of score reliability in the literature, in an astounding…
Psychometrics Matter in Health Behavior: A Long-term Reliability Generalization Study.
Pickett, Andrew C; Valdez, Danny; Barry, Adam E
2017-09-01
Despite numerous calls for increased understanding and reporting of reliability estimates, social science research, including the field of health behavior, has been slow to respond and adopt such practices. Therefore, we offer a brief overview of reliability and common reporting errors; we then perform analyses to examine and demonstrate the variability of reliability estimates by sample and over time. Using meta-analytic reliability generalization, we examined the variability of coefficient alpha scores for a well-designed, consistent, nationwide health study, covering a span of nearly 40 years. For each year and sample, reliability varied. Furthermore, reliability was predicted by a sample characteristic that differed among age groups within each administration. We demonstrated that reliability is influenced by the methods and individuals from which a given sample is drawn. Our work echoes previous calls that psychometric properties, particularly reliability of scores, are important and must be considered and reported before drawing statistical conclusions.
NASA Technical Reports Server (NTRS)
1970-01-01
Reliability Abstracts and Technical Reviews is an abstract and critical analysis service covering published and report literature on reliability. The service is designed to provide information on theory and practice of reliability as applied to aerospace and an objective appraisal of the quality, significance, and applicability of the literature abstracted.
McKinney, Christy M; Harris, T Robert; Caetano, Raul
2009-01-01
Little is known about the reliability of self-reported child physical abuse (CPA) or CPA reporting practices. We estimated reliability and prevalence of self-reported CPA and identified factors predictive of inconsistent CPA reporting among 2,256 participants using surveys administered in 1995 and 2000. Reliability of CPA was fair to moderate (kappa = 0.41). Using a positive report from either survey, the prevalence of moderate (61.8%) and severe (12.0%) CPA was higher than at either survey alone. Compared to consistent reporters of having experienced CPA, inconsistent reporters were less likely to be > or = 30 years old (vs. 18-29) or Black (vs. White) and more likely to have < 12 years of education (vs. 12), have no alcohol-related problems (vs. having problems), or report one type (vs. > or = 2) of CPA. These findings may assist researchers conducting and interpreting studies of CPA.
NASA Technical Reports Server (NTRS)
Miller, James; Leggett, Jay; Kramer-White, Julie
2008-01-01
A team directed by the NASA Engineering and Safety Center (NESC) collected methodologies for how best to develop safe and reliable human rated systems and how to identify the drivers that provide the basis for assessing safety and reliability. The team also identified techniques, methodologies, and best practices to assure that NASA can develop safe and reliable human rated systems. The results are drawn from a wide variety of resources, from experts involved with the space program since its inception to the best-practices espoused in contemporary engineering doctrine. This report focuses on safety and reliability considerations and does not duplicate or update any existing references. Neither does it intend to replace existing standards and policy.
Report #18-P-0105, February 28, 2018. There is a risk that the state, local and tribal agencies that monitor ambient air quality are not always implementing the EPA’s recommended QA practices for validating ozone data.
Spiritual and religious beliefs and practices of family physicians: a national survey.
Daaleman, T P; Frey, B
1999-02-01
The current movement in American medicine toward patient-centered or relationship-centered care highlights the importance of assessing physician core beliefs and personal philosophies. Religious and spiritual beliefs are often entwined within this domain. The purpose of this study was to identify the personal religious and spiritual beliefs and practices of family physicians and to test a valid and reliable measure of religiosity that would be useful in physician populations. An anonymous survey was mailed to a random sample of active members of the American Academy of Family Physicians who had the self-designated professional activity of direct patient care. Physicians reported their religious and spiritual beliefs and practices, including frequency of religious service attendance and private prayer or spiritual practice, and self-reported intrinsic or subjective religiosity. Seventy-four percent of the surveyed physicians reported at least weekly or monthly service attendance, and 79% reported a strong religious or spiritual orientation. A small percentage (4.5%) of physicians stated they do not believe in God. A 3-dimensional religiosity scale that assessed organized religious activity, nonorganized religious activity, and intrinsic religiosity was determined to be a valid and reliable measure (alpha = .87) of physician religious and spiritual beliefs and practices. Family physicians report religious and spiritual beliefs and practices at rates that are comparable with the general population.
Prospective patients rate practice factors: development of a questionnaire.
St Louis, Brian Lingg; Firestone, Allen R; Johnston, William; Shanker, Shiva; Vig, Katherine W L
2011-02-01
The importance that prospective patients place on practice characteristics when choosing an orthodontic practice has not been extensively reported. The objective of this research was to develop a valid and reliable questionnaire to address the relative importance of orthodontic office and doctor characteristics for prospective patients or parents of child patients during the initial orthodontic office consultation. An initial questionnaire, based on published literature, was field-tested on 16 subjects to assess its validity. Based on the field test, the questionnaire was modified and tested for reliability by using a test-retest method. The questionnaire covered the following areas: doctor, office, staff, and finances. The reliability study included 2 groups of subjects: 12 consecutive prospective adult patients and 41 consecutive parents of prospective child patients. The questionnaires consisted of 43 and 50 questions for the adult patients and the parents of patients, respectively. The subjects rated the importance of practice characteristics in their selection of an orthodontic practice using a 100-mm visual analog scale anchored at "not important at all" and "most important." Reliability was analyzed by using the intraclass correlation coefficient (ICC). Summary scores of all 53 subjects showed excellent reliability (ICC, 0.88; range, 0.61-1.0). Summary scores of all 50 questions showed acceptable reliability (ICC, 0.70; range, 0.45-0.88). Twenty-one questions had excellent reliability (ICC, >.75), and 29 questions had fair-to-good reliability (ICC, 0.41-0.75). No questions showed poor reliability (ICC, <0.4). The pilot study data indicated that the overall reliability of the questionnaire is acceptable. Copyright © 2011 American Association of Orthodontists. Published by Mosby, Inc. All rights reserved.
76 FR 35031 - Notice of Quarterly Report (January 1, 2011-March 31, 2011)
Federal Register 2010, 2011, 2012, 2013, 2014
2011-06-15
... Number of farmers Profitability of trained in Commercial cultivation, Agriculture. services to Number of.... Knowledge of good hygiene practices. Households with reliable water services. Enterprises with reliable... by public outreach efforts. Personnel trained. Number of Services Fonciers Ruraux (rural land service...
Observing practice leadership in intellectual and developmental disability services.
Beadle-Brown, J; Bigby, C; Bould, E
2015-12-01
Improving staff performance is an issue in services for people with intellectual disability. Practice leadership, where the front line leader of a staff team focuses on service user outcomes in everything they do and provides coaching, modeling, supervision and organisation to the team, has been identified as important in improving staff performance. To date this finding is based only on self-report measures. This paper describes and tests an observational measure of practice leadership based on an interview with the front-line manager, a review of paperwork and observations in 58 disability services in Australia. The measure showed good internal consistency and acceptable inter-rater reliability. Practice leadership was associated with staff practice and outcomes for service users. The observed measure of practice leadership appears to be a useful tool for assessing whether leadership within a service promotes enabling and empowering support by staff. It was found to discriminate higher and lower performing services in terms of active support. The measure had good reliability and validity although some further testing is required to give a complete picture of the possible uses and reliability of the measure. The measure is potentially useful in contexts of both research and service development. The confirmation of previous findings from self-report measures that practice leadership is related to the quality of staff practice and outcomes for service users has implications for policy and practice in terms of the training of managers and structures for organisational management. © 2015 MENCAP and International Association of the Scientific Study of Intellectual and Developmental Disabilities and John Wiley & Sons Ltd.
76 FR 60536 - Notice of Quarterly Report (April 1, 2011-June 30, 2011)
Federal Register 2010, 2011, 2012, 2013, 2014
2011-09-29
... Number of farmers of cultivation, trained in commercial services to agriculture. agriculture and Number.... Knowledge of good hygiene practices. Households with reliable water services. Enterprises with reliable... Antananarivo. Average time for Land Services Offices to issue a duplicate copy of a title. Average cost to a...
Stolarova, Margarita; Wolf, Corinna; Rinker, Tanja; Brielmann, Aenne
2014-01-01
This report has two main purposes. First, we combine well-known analytical approaches to conduct a comprehensive assessment of agreement and correlation of rating-pairs and to dis-entangle these often confused concepts, providing a best-practice example on concrete data and a tutorial for future reference. Second, we explore whether a screening questionnaire developed for use with parents can be reliably employed with daycare teachers when assessing early expressive vocabulary. A total of 53 vocabulary rating pairs (34 parent–teacher and 19 mother–father pairs) collected for two-year-old children (12 bilingual) are evaluated. First, inter-rater reliability both within and across subgroups is assessed using the intra-class correlation coefficient (ICC). Next, based on this analysis of reliability and on the test-retest reliability of the employed tool, inter-rater agreement is analyzed, magnitude and direction of rating differences are considered. Finally, Pearson correlation coefficients of standardized vocabulary scores are calculated and compared across subgroups. The results underline the necessity to distinguish between reliability measures, agreement and correlation. They also demonstrate the impact of the employed reliability on agreement evaluations. This study provides evidence that parent–teacher ratings of children's early vocabulary can achieve agreement and correlation comparable to those of mother–father ratings on the assessed vocabulary scale. Bilingualism of the evaluated child decreased the likelihood of raters' agreement. We conclude that future reports of agreement, correlation and reliability of ratings will benefit from better definition of terms and stricter methodological approaches. The methodological tutorial provided here holds the potential to increase comparability across empirical reports and can help improve research practices and knowledge transfer to educational and therapeutic settings. PMID:24994985
Benbassat, Jochanan; Baumal, Reuben
2010-08-01
To review the reported reliability (reproducibility, inter-examiner agreement) and validity (sensitivity, specificity and likelihood ratios) of respiratory physical examination (PE) signs, and suggest an approach to teaching these signs to medical students. Review of the literature. We searched Paper Chase between 1966 and June 2009 to identify and evaluate published studies on the diagnostic accuracy of respiratory PE signs. Most studies have reported low to fair reliability and sensitivity values. However, some studies have found high specificites for selected PE signs. None of the studies that we reviewed adhered to all of the STARD criteria for reporting diagnostic accuracy. Possible flaws in study designs may have led to underestimates of the observed diagnostic accuracy of respiratory PE signs. The reported poor reliabilities may have been due to differences in the PE skills of the participating examiners, while the sensitivities may have been confounded by variations in the severity of the diseases of the participating patients. IMPLICATION FOR PRACTICE AND MEDICAL EDUCATION: Pending the results of properly controlled studies, the reported poor reliability and sensitivity of most respiratory PE signs do not necessarily detract from their clinical utility. Therefore, we believe that a meticulously performed respiratory PE, which aims to explore a diagnostic hypothesis, as opposed to a PE that aims to detect a disease in an asymptomatic person, remains a cornerstone of clinical practice. We propose teaching the respiratory PE signs according to their importance, beginning with signs of life-threatening conditions and those that have been reported to have a high specificity, and ending with signs that are "nice to know," but are no longer employed because of the availability of more easily performed tests.
Baumal, Reuben
2010-01-01
OBJECTIVE To review the reported reliability (reproducibility, inter-examiner agreement) and validity (sensitivity, specificity and likelihood ratios) of respiratory physical examination (PE) signs, and suggest an approach to teaching these signs to medical students. METHODS Review of the literature. We searched Paper Chase between 1966 and June 2009 to identify and evaluate published studies on the diagnostic accuracy of respiratory PE signs. RESULTS Most studies have reported low to fair reliability and sensitivity values. However, some studies have found high specificites for selected PE signs. None of the studies that we reviewed adhered to all of the STARD criteria for reporting diagnostic accuracy. CONCLUSIONS Possible flaws in study designs may have led to underestimates of the observed diagnostic accuracy of respiratory PE signs. The reported poor reliabilities may have been due to differences in the PE skills of the participating examiners, while the sensitivities may have been confounded by variations in the severity of the diseases of the participating patients. IMPLICATION FOR PRACTICE AND MEDICAL EDUCATION Pending the results of properly controlled studies, the reported poor reliability and sensitivity of most respiratory PE signs do not necessarily detract from their clinical utility. Therefore, we believe that a meticulously performed respiratory PE, which aims to explore a diagnostic hypothesis, as opposed to a PE that aims to detect a disease in an asymptomatic person, remains a cornerstone of clinical practice. We propose teaching the respiratory PE signs according to their importance, beginning with signs of life-threatening conditions and those that have been reported to have a high specificity, and ending with signs that are "nice to know," but are no longer employed because of the availability of more easily performed tests. PMID:20349154
A content validated questionnaire for assessment of self reported venous blood sampling practices
2012-01-01
Background Venous blood sampling is a common procedure in health care. It is strictly regulated by national and international guidelines. Deviations from guidelines due to human mistakes can cause patient harm. Validated questionnaires for health care personnel can be used to assess preventable "near misses"--i.e. potential errors and nonconformities during venous blood sampling practices that could transform into adverse events. However, no validated questionnaire that assesses nonconformities in venous blood sampling has previously been presented. The aim was to test a recently developed questionnaire in self reported venous blood sampling practices for validity and reliability. Findings We developed a questionnaire to assess deviations from best practices during venous blood sampling. The questionnaire contained questions about patient identification, test request management, test tube labeling, test tube handling, information search procedures and frequencies of error reporting. For content validity, the questionnaire was confirmed by experts on questionnaires and venous blood sampling. For reliability, test-retest statistics were used on the questionnaire answered twice. The final venous blood sampling questionnaire included 19 questions out of which 9 had in total 34 underlying items. It was found to have content validity. The test-retest analysis demonstrated that the items were generally stable. In total, 82% of the items fulfilled the reliability acceptance criteria. Conclusions The questionnaire could be used for assessment of "near miss" practices that could jeopardize patient safety and gives several benefits instead of assessing rare adverse events only. The higher frequencies of "near miss" practices allows for quantitative analysis of the effect of corrective interventions and to benchmark preanalytical quality not only at the laboratory/hospital level but also at the health care unit/hospital ward. PMID:22260505
A content validated questionnaire for assessment of self reported venous blood sampling practices.
Bölenius, Karin; Brulin, Christine; Grankvist, Kjell; Lindkvist, Marie; Söderberg, Johan
2012-01-19
Venous blood sampling is a common procedure in health care. It is strictly regulated by national and international guidelines. Deviations from guidelines due to human mistakes can cause patient harm. Validated questionnaires for health care personnel can be used to assess preventable "near misses"--i.e. potential errors and nonconformities during venous blood sampling practices that could transform into adverse events. However, no validated questionnaire that assesses nonconformities in venous blood sampling has previously been presented. The aim was to test a recently developed questionnaire in self reported venous blood sampling practices for validity and reliability. We developed a questionnaire to assess deviations from best practices during venous blood sampling. The questionnaire contained questions about patient identification, test request management, test tube labeling, test tube handling, information search procedures and frequencies of error reporting. For content validity, the questionnaire was confirmed by experts on questionnaires and venous blood sampling. For reliability, test-retest statistics were used on the questionnaire answered twice. The final venous blood sampling questionnaire included 19 questions out of which 9 had in total 34 underlying items. It was found to have content validity. The test-retest analysis demonstrated that the items were generally stable. In total, 82% of the items fulfilled the reliability acceptance criteria. The questionnaire could be used for assessment of "near miss" practices that could jeopardize patient safety and gives several benefits instead of assessing rare adverse events only. The higher frequencies of "near miss" practices allows for quantitative analysis of the effect of corrective interventions and to benchmark preanalytical quality not only at the laboratory/hospital level but also at the health care unit/hospital ward.
Stone, Thomas H; Kisamore, Jennifer L; Jawahar, I M
2008-04-01
Interest and research on academic misconduct has become more salient in part due to recent publicized academic and organizational scandals. The current study investigated a possible interaction between perception of the university's academic culture and personality, conceptualized as Reliability, on students' perceptions of academic misconduct. A convenience sample of 217 university business students (91 men, 126 women), whose average age was 22.3 yr. (SD = 4.4) was tested. Reliability was measured with an occupational scale included in the Hogan Personality Inventory. Two hierarchical regression analyses were conducted using Cheating Intentions and Likelihood of Reporting Cheating as criteria. Age, Reliability, Integrity Culture, and the interaction between scores on Reliability and Integrity Culture were entered as predictors. Only Age and Reliability scores were significant predictors of Cheating Intentions, while all variables were significant predictors for Likelihood of Reporting Cheating. Suggestions for practice and research are provided.
Wade, Kimberley A; Nash, Robert A; Lindsay, D Stephen
2018-05-01
Wixted, Mickes, and Fisher (this issue) take issue with the common trope that eyewitness memory is inherently unreliable. They draw on a large body of mock-crime research and a small number of field studies, which indicate that high-confidence eyewitness reports are usually accurate, at least when memory is uncontaminated and suitable interviewing procedures are used. We agree with the thrust of Wixted et al.'s argument and welcome their invitation to confront the mass underselling of eyewitnesses' potential reliability. Nevertheless, we argue that there is a comparable risk of overselling eyewitnesses' reliability. Wixted et al.'s reasoning implies that near-pristine conditions or uncontaminated memories are normative, but there are at least two good reasons to doubt this. First, psychological science does not yet offer a good understanding of how often and when eyewitness interviews might deviate from best practice in ways that compromise the accuracy of witnesses' reports. Second, witnesses may frequently be exposed to preinterview influences that could corrupt reports obtained in best-practice interviews.
Reliability of self-reported smoking history and age at initial tobacco use.
Huerta, Michael; Chodick, Gabriel; Balicer, Ran D; Davidovitch, Nadav; Grotto, Itamar
2005-08-01
Many studies use questionnaires to determine smoking status and age of smoking onset. This study aimed to determine the reliability of self-reported smoking history and age of smoking initiation. The proportion of inconsistent answers and correlation coefficients of reported age of initial smoking were measured by an answer-reanswer analysis of questionnaires in an ongoing, two-step, population-based survey of health behavior. Interviews were conducted on the day of recruitment to and the day of discharge from mandatory military service in Israel among a sample of 25,437 young men and women recruited between 1986 and 2000. Of 7276 participants reporting current or past smoking upon recruitment, 559 (7.7%) reported never having smoked upon discharge, thus demonstrating prima facie inconsistency. Variables significantly associated with reliable reporting in a multivariate logistic regression model were female gender (P = 0.04) and more than 4 years of military service (P < 0.01). 6010 subjects who reported a positive smoking history at both recruitment and discharge were available for analysis of reliability of reported age at smoking onset. Intraclass correlation coefficients for recruitment/discharge consistency in reported age at first cigarette were 0.73 (95% CI: 0.71-0.74) and 0.76 (95% CI: 0.74-0.78) for men and women, respectively. Eastern origin, lower subject education level, and lower paternal education level were also associated with lower reliability. Our results showed a relatively high level of answer-reanswer reliability, with some variance attributable to personal characteristics. These results suggest that self-reported age at onset of tobacco use is practical and reliable in normative, young adult populations. However, time elapsed between questionnaires and demographic and lifestyle characteristics may affect reliability rates, and thus should be carefully regarded in future studies.
ERIC Educational Resources Information Center
Miller, Christopher S.; Shields, Alan L.; Campfield, Delia; Wallace, Kim A.; Weiss, Roger D.
2007-01-01
Three drug and alcohol use screening scales are embedded within the Minnesota Multiphasic Personality Inventory--2: the MacAndrew Alcoholism Scale (MAC) and its revised version (MAC-R), the Addiction Acknowledgement Scale (AAS), and the Addiction Potential Scale (APS). The current study evaluated the reliability reporting practices among 210…
A reliable sewage quality abnormal event monitoring system.
Li, Tianling; Winnel, Melissa; Lin, Hao; Panther, Jared; Liu, Chang; O'Halloran, Roger; Wang, Kewen; An, Taicheng; Wong, Po Keung; Zhang, Shanqing; Zhao, Huijun
2017-09-15
With closing water loop through purified recycled water, wastewater becomes a part of source water, requiring reliable wastewater quality monitoring system (WQMS) to manage wastewater source and mitigate potential health risks. However, the development of reliable WQMS is fatally constrained by severe contamination and biofouling of sensors due to the hostile analytical environment of wastewaters, especially raw sewages, that challenges the limit of existing sensing technologies. In this work, we report a technological solution to enable the development of WQMS for real-time abnormal event detection with high reliability and practicality. A vectored high flow hydrodynamic self-cleaning approach and a dual-sensor self-diagnostic concept are adopted for WQMS to effectively encounter vital sensor failing issues caused by contamination and biofouling and ensure the integrity of sensing data. The performance of the WQMS has been evaluated over a 3-year trial period at different sewage catchment sites across three Australian states. It has demonstrated that the developed WQMS is capable of continuously operating in raw sewage for a prolonged period up to 24 months without maintenance and failure, signifying the high reliability and practicality. The demonstrated WQMS capability to reliably acquire real-time wastewater quality information leaps forward the development of effective wastewater source management system. The reported self-cleaning and self-diagnostic concepts should be applicable to other online water quality monitoring systems, opening a new way to encounter the common reliability and stability issues caused by sensor contamination and biofouling. Copyright © 2017 Elsevier Ltd. All rights reserved.
Validation of the Evidence-Based Practice Process Assessment Scale
ERIC Educational Resources Information Center
Rubin, Allen; Parrish, Danielle E.
2011-01-01
Objective: This report describes the reliability, validity, and sensitivity of a scale that assesses practitioners' perceived familiarity with, attitudes of, and implementation of the evidence-based practice (EBP) process. Method: Social work practitioners and second-year master of social works (MSW) students (N = 511) were surveyed in four sites…
TEST–RETEST RELIABILITY OF CAPABILITY MEASUREMENT IN THE UK GENERAL POPULATION
Al-Janabi, Hareth; Flynn, Terry N; Peters, Tim J; Bryan, Stirling; Coast, Joanna
2015-01-01
Although philosophically attractive, it may be difficult, in practice, to measure individuals' capabilities (what they are able to do in their lives) as opposed to their functionings (what they actually do). To examine whether capability information could be reliably self-reported, we administered a measure of self-reported capability (the Investigating Choice Experiments Capability Measure for Adults, ICECAP-A) on two occasions, 2 weeks apart, alongside a self-reported health measure (the EuroQol Five Dimensional Questionnaire with 3 levels, EQ-5D-3L). We found that respondents were able to report capabilities with a moderate level of consistency, although somewhat less reliably than their health status. The more socially orientated nature of some of the capability questions may account for the difference. © 2014 The Authors Health Economics Published by John Wiley & Sons Ltd. PMID:25204621
Failure analysis of solid rocket apogee motors
NASA Technical Reports Server (NTRS)
Martin, P. J.
1972-01-01
The analysis followed five selected motors through initial design, development, test, qualification, manufacture, and final flight reports. An audit was conducted at the manufacturing plants to complement the literature search with firsthand observations of the current philosophies and practices that affect reliability of the motors. A second literature search emphasized acquisition of spacecraft and satellite data bearing on solid motor reliability. It was concluded that present practices at the plants yield highly reliable flight hardware. Reliability can be further improved by new developments of aft-end bonding and initiator/igniter nondestructive test methods, a safe/arm device, and an insulation formulation. Minimum diagnostic instrumentation is recommended for all motor flights. Surplus motors should be used in margin testing. Criteria should be established for pressure and zone curing. The motor contractor should be represented at launch. New design analyses should be made of stretched motors and spacecraft/motor pairs.
Edwards, Mark J.; Urbanski, Carly R.; Sebire, Simon J.
2013-01-01
Abstract Background Parent-focused interventions may help to reduce youth screen-viewing (SV). This review synthesized current information on the links between parenting styles, parenting practices, and youth SV with a focus on measurement. Methods A systematic review of electronic databases was conducted. Results In all, 29 of 1189 studies met the inclusion criteria. Parenting practices were divided into rule and nonrule-based practices. Seven rules: (1) Limits on total time (n studies=23); (2) limits on time of day (n=7); (3) content restriction (n=11); (4) mealtime rules (n=2); (5) parental supervision (n=3); (6) contingent screentime (n=3); and (7) no-TV policy (n=1) were reported. Two nonrule-based practices were reported: Co-viewing (n=6) and encouragement to view (n=2). Three studies (10.3%) provided information on parenting styles. Only 12 studies (41.4%) provided information on the reliability/validity of the outcome measure, 15 (51.7%) studies provided information on the reliability/validity of the parenting measure, and 6 (20.7%) provided information on the reliability/validity of both outcome and exposure measures. Conclusions There is mixed evidence that parenting styles and media-related parenting practices are associated with youth SV. The assessment of parental influence of youth media use is hampered by the diversity of measures that have been used. There is a need for new measures that assess a range of media parenting practices that are relevant to multiple forms of SV. PMID:23944925
Regan, Annette K; Gibbs, Robyn A; Effler, Paul V
2018-05-31
To evaluate the reliability of information in general practice (GP) electronic health records (EHRs), 2100 adult patients were randomly selected for interview regarding the presence of specific medical conditions and recent influenza vaccination. Agreement between self-report and data extracted from EHRs was compared using Cohen's kappa coefficient (k) and interpreted in accordance with Altman's Kappa Benchmarking criteria; 377 (18%) patients declined participation, and 608 (29%) could not be contacted. Of 1115 (53%) remaining, 856 (77%) were active patients (≥3 visits to the GP practice in the last two years) who provided complete information for analysis. Although a higher proportion of patients self-reported being vaccinated or having a medical condition compared to the EHR (50.7% vs 36.9%, and 39.4% vs 30.3%, respectively), there was "good" agreement between self-report and EHR for both vaccination status (κ = 0.67) and medical conditions (κ = 0.66). These findings suggest EHR may be useful for public health surveillance. Crown Copyright © 2018. Published by Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Genc, Evrim
The primary purpose of this study was to develop a valid and reliable instrument to examine science teachers' assessment beliefs and practices in science classrooms. The present study also investigated the relationship between teachers' beliefs and practices in terms of assessment issues in science, their perceptions of the factors that influenced their assessment practices and their feelings towards high-stakes testing. The participants of the study were 408 science teachers teaching at middle and high school levels in the State of Florida. Data were collected through two modes of administration of the instrument as a paper-and-pencil and a web-based form. The response rate for paper-and-pencil administration was estimated as 68% whereas the response for the web administration was found to be 27%. Results from the various dimensions of validity and reliability analyses revealed that the 24 item-four-factor belief and practice measures were psychometrically sound and conceptually anchored measures of science teachers' assessment beliefs and self-reported practices. Reliability estimates for the belief measure ranged from .83 to .91 whereas alpha values for the practice measure ranged from .56 to .90. Results from the multigroup analysis supported that the instrument has the same theoretical structure across both administration groups. Therefore, future researchers may use either a paper-and-pencil or web-based format of the instrument. This study underscored a discrepancy between what teachers believe and how they act in classroom settings. It was emphasized that certain factors were mediating the dynamics between the belief and the practice. The majority of teachers reported that instruction time, class size, professional development activities, availability of school funding, and state testing mandates impact their assessment routines. Teachers reported that both the preparation process and the results of the test created unbelievable tension both on students and teachers. Implications of the study indicated that it would be valuable to conduct alignment studies to examine whether state tests are fully aligned with the state standards and classroom assessment. Perhaps, such analyses would assist state level decision makers in reconsidering the current policies and "unintended" influences of mandated tests on classroom practices.
Research in Mathematics Education: Multiple Methods for Multiple Uses
ERIC Educational Resources Information Center
Battista, Michael; Smith, Margaret S.; Boerst, Timothy; Sutton, John; Confrey, Jere; White, Dorothy; Knuth, Eric; Quander, Judith
2009-01-01
Recent federal education policies and reports have generated considerable debate about the meaning, methods, and goals of "scientific research" in mathematics education. Concentrating on the critical problem of determining which educational programs and practices reliably improve students' mathematics achievement, these policies and reports focus…
ERIC Educational Resources Information Center
Rubin, Allen; Parrish, Danielle E.
2010-01-01
Objective: This report describes the development and preliminary findings regarding the reliability, validity, and sensitivity of a scale that has been developed to assess practitioners' perceived familiarity with, attitudes about, and implementation of the phases of the evidence-based practice (EBP) process. Method: After a panel of national…
ERIC Educational Resources Information Center
Wilhelm, Anne Garrison; Kim, Sungyeun
2015-01-01
One crucial question for researchers who study teachers' classroom practice is how to maximize information about what is happening in classrooms while minimizing costs. This report extends prior studies of the reliability of the Instructional Quality Assessment (IQA), a widely used classroom observation toolkit, and offers insight into the often…
Seitsinger, Anne M; Felner, Robert D; Brand, Stephen; Burns, Amy
2008-08-01
As schools move forward with comprehensive school reform, parents' roles have shifted and been redefined. Parent-teacher communication is critical to student success, yet how schools and teachers contact parents is the subject of few studies. Evaluations of school-change efforts require reliable and useful measures of teachers' practices in communicating with parents. The structure of teacher-parent-contact practices was examined using data from multiple, longitudinal cohorts of schools and teachers from a large-scale project and found to be a reliable and stable measure of parent contact across building levels and localities. Teacher/school practices in contacting parents were found to be significantly related to parent reports of school contact performance and student academic adjustment and achievement. Implications for school improvement efforts are discussed.
Hogue, Aaron; Dauber, Sarah; Lichvar, Emily; Bobek, Molly; Henderson, Craig E
2015-03-01
Developing therapist-report fidelity tools to support quality delivery of evidence-based practices in usual care is a top priority for implementation science. This study tested the reliability and accuracy of two groups of community therapists who reported on their use of family therapy (FT) and motivational interviewing/cognitive-behavioral therapy (MI/CBT) interventions during routine treatment of inner-city adolescents with conduct and substance use problems. Study cases (n = 45) were randomized into two conditions: (a) Routine Family Therapy (RFT), consisting of a single site that featured family therapy as its standard of care for behavioral treatment; or (b) Treatment As Usual (TAU), consisting of five sites that featured non-family approaches. Therapists and trained observational raters provided FT and MI/CBT adherence ratings on 157 sessions (104 RFT, 53 TAU). Overall therapist reliability was adequate for averaged FT ratings (ICC = .66) but almost non-existent for MI/CBT (ICC = .06); moreover, both RFT and TAU therapists were more reliable in reporting on FT than on MI/CBT. Both groups of therapists overestimated the extent to which they implemented FT and MI/CBT interventions. Results offer support for the feasibility of using existing therapist-report methods to anchor quality assurance procedures for FT interventions in real-world settings, though not for MI/CBT.
Validation of an instrument to assess toddler feeding practices of Latino mothers.
Chaidez, Virginia; Kaiser, Lucia L
2011-08-01
This paper describes qualitative and quantitative aspects of testing a 34-item Toddler-Feeding Questionnaire (TFQ), designed for use in Latino families, and the associations between feeding practices and toddler dietary outcomes. Qualitative methods included review by an expert panel for content validity and cognitive testing of the tool to assess face validity. Quantitative analyses included use of exploratory factor analysis for construct validity; Pearson's correlations for test-retest reliability; Cronbach's alpha (α) for internal reliability; and multivariate regression for investigating relationships between feeding practices and toddler diet and anthropometry. Interviews were conducted using a convenience sample of 94 Latino mother and toddler dyads obtained largely through the Supplemental Nutrition Program for Women, Infants and Children (WIC). Data collection included household characteristics, self-reported early-infant feeding practices, the toddler's dietary intake, and anthropometric measurements. Factor analysis suggests the TFQ contains three subscales: indulgent; authoritative; and environmental influences. The TFQ demonstrated acceptable reliability for most measures. As hypothesized, indulgent practices in Latino toddlers were associated with increased energy consumption and higher intakes of total fat, saturated fat, and sweetened beverages. This tool may be useful in future research exploring the relationship of toddler feeding practices to nutritional outcomes in Latino families. Copyright © 2011 Elsevier Ltd. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hill, Roger R.; Klise, Geoffrey Taylor; Balfour, John R.
Characterizing the factors that affect reliability of a photovoltaic (PV) power plant is an important aspect of optimal asset management. This document describes the many factors that affect operation and maintenance (O&M) of a PV plant, identifies the data necessary to quantify those factors, and describes how data might be used by O&M service providers and others in the PV industry. This document lays out data needs from perspectives of reliability, availability, and key performance indicators and is intended to be a precursor for standardizing terminology and data reporting, which will improve data sharing, analysis, and ultimately PV plant performance.
Abstract:The National STEPP Program seeks to improve water quality by accelerating the effective implementation and adoption of innovative stormwater management technologies. Itwill attempt to accomplish this by establishing practices through highly reliable, and cost-effective S...
ERIC Educational Resources Information Center
Begeny, John C.; Easton, Julia E.; Upright, James J.; Tunstall, Kali R.; Ehrenbock, Cassia A.
2014-01-01
Within the realm of school-based interventions, implementation integrity is important for practical, legal, and ethical purposes. Unfortunately, evidence suggests that proper monitoring of implementation integrity is often absent from both research and practice. School psychology practitioners and researchers have reported that a major barrier to…
Confirmatory factor analysis of the Chinese Breast Cancer Screening Beliefs Questionnaire.
Kwok, Cannas; Fethney, Judith; White, Kate
2012-01-01
Chinese women have been consistently reported as having low breast cancer screening practices. The Chinese Breast Cancer Screening Beliefs Questionnaire (CBCSB) was designed to assess Chinese Australian women's beliefs, knowledge, and attitudes toward breast cancer and screening practices. The objectives of the study were to confirm the factor structure of the CBCSB with a new, larger sample of immigrant Chinese Australian women and to report its clinical validity. A convenience sample of 785 Chinese Australian women was recruited from Chinese community organizations and shopping malls. Cronbach α was used to assess internal consistency reliability, and Amos v18 was used for confirmatory factor analysis. Clinical validity was assessed through linear regression using SPSS v18. The 3-factor structure of the CBCSB was confirmed, although the model required respecification to arrive at a suitable model fit as measured by the goodness-of-fit index (0.98), adjusted goodness-of-fit index (0.97), normed fit index (0.95), and root mean square error of approximation (0.031). Internal consistency reliability coefficients were satisfactory (>.6). Women who engaged in all 3 types of screening had more proactive attitudes to health checkups and perceived less barriers to mammographic screening. The CBCSB is a valid and reliable tool for assessing Chinese women's beliefs, knowledge, and attitudes about breast cancer and breast cancer screening practices. The CBCSB can be used for providing practicing nurses with insights into the provision of culturally sensitive breast health education.
Zhou, Guiyun; Stoltzfus, Jill C; Houldin, Arlene D; Parks, Susan M; Swan, Beth Ann
2010-11-01
To establish initial reliability and validity of a Web-based survey focused on oncology advanced practice nurses' (APNs') knowledge, attitudes, and practice behaviors regarding advanced care planning, and to obtain preliminary understanding of APNs' knowledge, attitudes, and practice behaviors and perceived barriers to advanced care planning. Descriptive, cross-sectional, pilot survey study. The eastern United States. 300 oncology APNs. Guided by the Theory of Planned Behavior, a knowledge, attitudes, and practice behaviors survey was developed and reviewed for content validity. The survey was distributed to 300 APNs via e-mail and sent again to the 89 APNs who responded to the initial survey. Exploratory factor analysis was used to examine the construct validity and test-retest reliability of the survey's attitudinal and practice behavior portions. Respondents' demographics, knowledge, attitudes, practice behaviors, and perceived barriers to advanced care planning practice. Exploratory factor analysis yielded a five-factor solution from the survey's attitudes and practice behavior portions with internal consistency using Cronbach alpha. Respondents achieved an average of 67% correct answers in the 12-item knowledge section and scored positively in attitudes toward advanced care planning. Their practice behavior scores were marginally positive. The most common reported barriers were from patients' and families' as well as physicians' reluctance to discuss advanced care planning. The attitudinal and practice behaviors portions of the survey demonstrated preliminary construct validity and test-retest reliability. Regarding advanced care planning, respondents were moderately knowledgeable, but their advanced care planning practice was not routine. Validly assessing oncology APNs' knowledge, attitudes, and practice behaviors regarding advanced care planning will enable more tailored approaches to improve end-of-life care outcomes.
McEvoy, Maureen Patricia; Williams, Marie T; Olds, Timothy Stephen
2010-01-01
Previous survey tools operationalising knowledge, attitudes or beliefs about evidence-based practice (EBP) have shortcomings in content, psychometric properties and target audience. This study developed and psychometrically assessed a self-report trans-professional questionnaire to describe an EBP profile. Sixty-six items were collated from existing EBP questionnaires and administered to 526 academics and students from health and non-health backgrounds. Principal component factor analysis revealed the presence of five factors (Relevance, Terminology, Confidence, Practice and Sympathy). Following expert panel review and pilot testing, the 58-item final questionnaire was disseminated to 105 subjects on two occasions. Test-retest and internal reliability were quantified using intra-class correlation coefficients (ICCs) and Cronbach's alpha, convergent validity against a commonly used EBP questionnaire by Pearson's correlation coefficient and discriminative validity via analysis of variance (ANOVA) based on exposure to EBP training. The final questionnaire demonstrated acceptable internal consistency (Cronbach's alpha 0.96), test-retest reliability (ICCs range 0.77-0.94) and convergent validity (Practice 0.66, Confidence 0.80 and Sympathy 0.54). Three factors (Relevance, Terminology and Confidence) distinguished EBP exposure groups (ANOVA p < 0.001-0.004). The evidence-based practice profile (EBP(2)) questionnaire is a reliable instrument with the ability to discriminate for three factors, between respondents with differing EBP exposures.
NASA Astrophysics Data System (ADS)
Wetzel, Angela Payne
Previous systematic reviews indicate a lack of reporting of reliability and validity evidence in subsets of the medical education literature. Psychology and general education reviews of factor analysis also indicate gaps between current and best practices; yet, a comprehensive review of exploratory factor analysis in instrument development across the continuum of medical education had not been previously identified. Therefore, the purpose for this study was critical review of instrument development articles employing exploratory factor or principal component analysis published in medical education (2006--2010) to describe and assess the reporting of methods and validity evidence based on the Standards for Educational and Psychological Testing and factor analysis best practices. Data extraction of 64 articles measuring a variety of constructs that have been published throughout the peer-reviewed medical education literature indicate significant errors in the translation of exploratory factor analysis best practices to current practice. Further, techniques for establishing validity evidence tend to derive from a limited scope of methods including reliability statistics to support internal structure and support for test content. Instruments reviewed for this study lacked supporting evidence based on relationships with other variables and response process, and evidence based on consequences of testing was not evident. Findings suggest a need for further professional development within the medical education researcher community related to (1) appropriate factor analysis methodology and reporting and (2) the importance of pursuing multiple sources of reliability and validity evidence to construct a well-supported argument for the inferences made from the instrument. Medical education researchers and educators should be cautious in adopting instruments from the literature and carefully review available evidence. Finally, editors and reviewers are encouraged to recognize this gap in best practices and subsequently to promote instrument development research that is more consistent through the peer-review process.
Roley, Susanne Smith; Mailloux, Zoe; Parham, L. Diane; Koomar, Jane; Schaaf, Roseann C.; Van Jaarsveld, Annamarie; Cohn, Ellen
2014-01-01
This study examined the reliability and validity of the structural section of the Ayres Sensory Integration® Fidelity Measure© (ASIFM), which provides a method for monitoring the extent to which an intervention was implemented as conceptualized in studies of occupational therapy using sensory integration intervention methods (OT–SI). We examined the structural elements of the measure, including content of assessment reports, availability of specific equipment and adequate space, safety monitoring, and integration of communication with parents and other team members, such as collaborative goal setting with parents or family and teacher education, into the intervention program. Analysis of self-report ratings by 259 occupational therapists from 185 different facilities indicated that the structural section of the ASIFM has acceptable interrater reliability (r ≥ .82) and significantly differentiates between settings in which therapists reportedly do and do not practice OT–SI (p < .001). PMID:25184462
Weigh-in-Motion systems evaluation : final report.
DOT National Transportation Integrated Search
1976-04-01
This relatively short-term project was initiated in order to perfect installation, operation, and maintenance practices necessary for continued accurate and reliable operation of a computerized Weigh-in-Motion system which will be used to gather truc...
Instruments to measure behavioural and psychological symptoms of dementia.
van der Linde, Rianne M; Stephan, Blossom C M; Dening, Tom; Brayne, Carol
2014-03-01
Reliable and valid measurement of behavioural and psychological symptoms of dementia (BPSD) is important for research and clinical practice. Here we provide an overview of the different instruments and discuss issues involved in the choice of the most appropriate instrument to measure BPSD in research. A list of BPSD instruments was generated. For each instrument Pubmed and SCOPUS were searched for articles that reported on their use or quality. Eighty-three instruments that are used to measure BPSD were identified. Instruments differ in length and detail, whether the interview is with participants, informants or by observation, the target sample and the time frames for use. Reliability and validity is generally good, but reported in few independent samples. When choosing a BPSD instrument for research the research question should be carefully scrutinised and the symptoms of interest, population, quality, detail, time frame and practical issues should be considered. Copyright © 2014 John Wiley & Sons, Ltd.
Electrical service reliability: the customer perspective
DOE Office of Scientific and Technical Information (OSTI.GOV)
Samsa, M.E.; Hub, K.A.; Krohm, G.C.
1978-09-01
Electric-utility-system reliability criteria have traditionally been established as a matter of utility policy or through long-term engineering practice, generally with no supportive customer cost/benefit analysis as justification. This report presents results of an initial study of the customer perspective toward electric-utility-system reliability, based on critical review of over 20 previous and ongoing efforts to quantify the customer's value of reliable electric service. A possible structure of customer classifications is suggested as a reasonable level of disaggregation for further investigation of customer value, and these groups are characterized in terms of their electricity use patterns. The values that customers assign tomore » reliability are discussed in terms of internal and external cost components. A list of options for effecting changes in customer service reliability is set forth, and some of the many policy issues that could alter customer-service reliability are identified.« less
2011-01-01
Background In healthcare, a gap exists between what is known from research and what is practiced. Understanding this gap depends upon our ability to robustly measure research utilization. Objectives The objectives of this systematic review were: to identify self-report measures of research utilization used in healthcare, and to assess the psychometric properties (acceptability, reliability, and validity) of these measures. Methods We conducted a systematic review of literature reporting use or development of self-report research utilization measures. Our search included: multiple databases, ancestry searches, and a hand search. Acceptability was assessed by examining time to complete the measure and missing data rates. Our approach to reliability and validity assessment followed that outlined in the Standards for Educational and Psychological Testing. Results Of 42,770 titles screened, 97 original studies (108 articles) were included in this review. The 97 studies reported on the use or development of 60 unique self-report research utilization measures. Seven of the measures were assessed in more than one study. Study samples consisted of healthcare providers (92 studies) and healthcare decision makers (5 studies). No studies reported data on acceptability of the measures. Reliability was reported in 32 (33%) of the studies, representing 13 of the 60 measures. Internal consistency (Cronbach's Alpha) reliability was reported in 31 studies; values exceeded 0.70 in 29 studies. Test-retest reliability was reported in 3 studies with Pearson's r coefficients > 0.80. No validity information was reported for 12 of the 60 measures. The remaining 48 measures were classified into a three-level validity hierarchy according to the number of validity sources reported in 50% or more of the studies using the measure. Level one measures (n = 6) reported evidence from any three (out of four possible) Standards validity sources (which, in the case of single item measures, was all applicable validity sources). Level two measures (n = 16) had evidence from any two validity sources, and level three measures (n = 26) from only one validity source. Conclusions This review reveals significant underdevelopment in the measurement of research utilization. Substantial methodological advances with respect to construct clarity, use of research utilization and related theory, use of measurement theory, and psychometric assessment are required. Also needed are improved reporting practices and the adoption of a more contemporary view of validity (i.e., the Standards) in future research utilization measurement studies. PMID:21794144
Santelmann, Hanno; Franklin, Jeremy; Bußhoff, Jana; Baethge, Christopher
2016-10-01
Schizoaffective disorder is a common diagnosis in clinical practice but its nosological status has been subject to debate ever since it was conceptualized. Although it is key that diagnostic reliability is sufficient, schizoaffective disorder has been reported to have low interrater reliability. Evidence based on systematic review and meta-analysis methods, however, is lacking. Using a highly sensitive literature search in Medline, Embase, and PsycInfo we identified studies measuring the interrater reliability of schizoaffective disorder in comparison to schizophrenia, bipolar disorder, and unipolar disorder. Out of 4126 records screened we included 25 studies reporting on 7912 patients diagnosed by different raters. The interrater reliability of schizoaffective disorder was moderate (meta-analytic estimate of Cohen's kappa 0.57 [95% CI: 0.41-0.73]), and substantially lower than that of its main differential diagnoses (difference in kappa between 0.22 and 0.19). Although there was considerable heterogeneity, analyses revealed that the interrater reliability of schizoaffective disorder was consistently lower in the overwhelming majority of studies. The results remained robust in subgroup and sensitivity analyses (e.g., diagnostic manual used) as well as in meta-regressions (e.g., publication year) and analyses of publication bias. Clinically, the results highlight the particular importance of diagnostic re-evaluation in patients diagnosed with schizoaffective disorder. They also quantify a widely held clinical impression of lower interrater reliability and agree with earlier meta-analysis reporting low test-retest reliability. Copyright © 2016. Published by Elsevier B.V.
Method matters: Understanding diagnostic reliability in DSM-IV and DSM-5.
Chmielewski, Michael; Clark, Lee Anna; Bagby, R Michael; Watson, David
2015-08-01
Diagnostic reliability is essential for the science and practice of psychology, in part because reliability is necessary for validity. Recently, the DSM-5 field trials documented lower diagnostic reliability than past field trials and the general research literature, resulting in substantial criticism of the DSM-5 diagnostic criteria. Rather than indicating specific problems with DSM-5, however, the field trials may have revealed long-standing diagnostic issues that have been hidden due to a reliance on audio/video recordings for estimating reliability. We estimated the reliability of DSM-IV diagnoses using both the standard audio-recording method and the test-retest method used in the DSM-5 field trials, in which different clinicians conduct separate interviews. Psychiatric patients (N = 339) were diagnosed using the SCID-I/P; 218 were diagnosed a second time by an independent interviewer. Diagnostic reliability using the audio-recording method (N = 49) was "good" to "excellent" (M κ = .80) and comparable to the DSM-IV field trials estimates. Reliability using the test-retest method (N = 218) was "poor" to "fair" (M κ = .47) and similar to DSM-5 field-trials' estimates. Despite low test-retest diagnostic reliability, self-reported symptoms were highly stable. Moreover, there was no association between change in self-report and change in diagnostic status. These results demonstrate the influence of method on estimates of diagnostic reliability. (c) 2015 APA, all rights reserved).
Measurement of the True Dynamic and Static Pressures in Flight
NASA Technical Reports Server (NTRS)
Kiel, Georg
1939-01-01
In this report, two reliable methods are presented, with the aid of which the undisturbed flight dynamic pressure and the true static pressure may be determined without error. These problems were solved chiefly through practical flight tests.
77 FR 34075 - Notice of Quarterly Report (January 1, 2012-March 31, 2012)
Federal Register 2010, 2011, 2012, 2013, 2014
2012-06-08
... trained in commercial cultivation, agriculture. services to Number of agriculture and agribusinesses... improved Latrines. Knowledge of good hygiene practices. Households with reliable water services... stakeholders reached by public outreach efforts. Personnel trained. [[Page 34089
ERIC Educational Resources Information Center
Torrance, Harry
2018-01-01
There are sound educational and examining reasons for the use of coursework assessment and practical assessment of student work by teachers in schools for purposes of reporting examination grades. Coursework and practical work test a range of different curriculum goals to final papers and increase the validity and reliability of the result.…
Applying Best Business Practices from Corporate Performance Management to DoD
2013-01-01
leading or governing large, complex corporations and are experienced in creating reliable solutions to complex management issues guided by best business ...recommendations and effective solutions aimed at improving DoD. Defense Business Board Corporate Performance Management REPORT FY13-03 Task...Group 1 Applying Best Business Practices from Corporate Performance Management to DoD TASK The Deputy Secretary of Defense (DEPSECDEF
Hildebrand, Martin; Noteborn, Mirthe G C
2015-01-01
The use of brief, reliable, valid, and practical measures of substance use is critical for conducting individual (risk and need) assessments in probation practice. In this exploratory study, the basic psychometric properties of the Alcohol Use Disorders Identification Test (AUDIT) and the Drug Use Disorders Identification Test (DUDIT) are evaluated. The instruments were administered as an oral interview instead of a self-report questionnaire. The sample comprised 383 offenders (339 men, 44 women). A subset of 56 offenders (49 men, 7 women) participated in the interrater reliability study. Data collection took place between September 2011 and November 2012. Overall, both instruments have acceptable levels of interrater reliability for total scores and acceptable to good interrater reliabilities for most of the individual items. Confirmatory factor analyses (CFA) indicated that the a priori one-, two- and three-factor solutions for the AUDIT did not fit the observed data very well. Principal axis factoring (PAF) supported a two-factor solution for the AUDIT that included a level of alcohol consumption/consequences factor (Factor 1) and a dependence factor (Factor 2), with both factors explaining substantial variance in AUDIT scores. For the DUDIT, CFA and PAF suggest that a one-factor solution is the preferred model (accounting for 62.61% of total variance). The Dutch language versions of the AUDIT and the DUDIT are reliable screening instruments for use with probationers and both instruments can be reliably administered by probation officers in probation practice. However, future research on concurrent and predictive validity is warranted.
Kwok, Cannas; Pillay, Rona; Lee, Chun Fan
2016-01-01
Indian women have been consistently reported as having low participation in breast cancer screening practices. A valid and reliable instrument to explore their breast cancer beliefs is essential for development of interventions to promote breast cancer screening practices. The aim of this study was to report the psychometric properties of the Breast Cancer Screening Beliefs Questionnaire (BCSBQ) in an Indian community in Australia. A convenience sample of 242 Indian Australian women was recruited from Indian community organizations and personal networking. Explanatory factor analysis was conducted to study the factor structure. Clinical validity was examined by Cuzick's nonparametric test, and Cronbach's α was used to assess internal consistency reliability. Exploratory factor analysis showed a similar fit to the hypothesized 3-factor structure. The frequency of breast cancer screening practices was significantly associated with attitudes toward general health check-up. Knowledge and perceptions about the breast cancer scale were not significantly associated with clinical breast examinations and mammography. Perceived barriers to mammography were much less evident among women who engaged in breast awareness and clinical breast examination. Results indicated that the BCSBQ had satisfactory validity and internal consistency. Cronbach's α of the 3 subscales ranged from .81 to .91. The BCSBQ is a culturally appropriate, valid, and reliable instrument for assessing the beliefs, knowledge, and attitudes about breast cancer and breast cancer screening practices among women of Indian ethnic extraction living in Australia. The BCSBQ can be used to provide nurses with information relevant for the development of culturally sensitive breast health education programs.
Rater methodology for stroboscopy: a systematic review.
Bonilha, Heather Shaw; Focht, Kendrea L; Martin-Harris, Bonnie
2015-01-01
Laryngeal endoscopy with stroboscopy (LES) remains the clinical gold standard for assessing vocal fold function. LES is used to evaluate the efficacy of voice treatments in research studies and clinical practice. LES as a voice treatment outcome tool is only as good as the clinician interpreting the recordings. Research using LES as a treatment outcome measure should be evaluated based on rater methodology and reliability. The purpose of this literature review was to evaluate the rater-related methodology from studies that use stroboscopic findings as voice treatment outcome measures. Systematic literature review. Computerized journal databases were searched for relevant articles using terms: stroboscopy and treatment. Eligible articles were categorized and evaluated for the use of rater-related methodology, reporting of number of raters, types of raters, blinding, and rater reliability. Of the 738 articles reviewed, 80 articles met inclusion criteria. More than one-third of the studies included in the review did not report the number of raters who participated in the study. Eleven studies reported results of rater reliability analysis with only two studies reporting good inter- and intrarater reliability. The comparability and use of results from treatment studies that use LES are limited by a lack of rigor in rater methodology and variable, mostly poor, inter- and intrarater reliability. To improve our ability to evaluate and use the findings from voice treatment studies that use LES features as outcome measures, greater consistency of reporting rater methodology characteristics across studies and improved rater reliability is needed. Copyright © 2015 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
The disinfection of drinking water. Final report
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
The current status of theoretically possible methods for disinfecting drinking water is reviewed. The specific biocidal activity of each of the disinfectants is considered, as well as information (or lack of it) on the practical application and reliability of the methods.
Educational Testing and Validity of Conclusions in the Scholarship of Teaching and Learning
Beltyukova, Svetlana A.; Martin, Beth A.
2013-01-01
Validity and its integral evidence of reliability are fundamentals for educational and psychological measurement, and standards of educational testing. Herein, we describe these standards of educational testing, along with their subtypes including internal consistency, inter-rater reliability, and inter-rater agreement. Next, related issues of measurement error and effect size are discussed. This article concludes with a call for future authors to improve reporting of psychometrics and practical significance with educational testing in the pharmacy education literature. By increasing the scientific rigor of educational research and reporting, the overall quality and meaningfulness of SoTL will be improved. PMID:24249848
El-Choueifati, Nisrine; Purcell, Alison; McCabe, Patricia; Heard, Robert; Munro, Natalie
2014-06-01
Early childhood educators (ECEs) have an important role in promoting positive outcomes for children's language and literacy development. This paper reports the development of a new tool, The Interaction Communication and Literacy (ICL) Skills Audit, and pilots its reliability and validity. Intra- and inter-rater reliability was examined by three speech-language pathologists (SLPs). Five skill areas relating to ECE language and literacy practice were rated. The face and content validity of the ICL Skills Audit was examined by expert SLPs (n = 8) and expert ECEs (n = 4) via questionnaire. The overall intra-rater reliability for the ICL Skills Audit was excellent with percentage close agreement (PCA) of 91-94. Inter-rater agreement was PCA 68-80. Expert SLPs and ECEs agreed that the content was comprehensive and practical. Based on this preliminary study, the ICL Skills Audit appears to be a promising tool that can be used by SLPs and ECEs in collaboration to measure the skills of ECEs in the areas of language and literacy support. Future psychometric and outcome research on the revised ICL Skills Audit is warranted.
The Baby Care Questionnaire: A measure of parenting principles and practices during infancy☆
Winstanley, Alice; Gattis, Merideth
2013-01-01
The current report provides a new framework to explore the role of parenting practices and principles during infancy. We identify structure and attunement as key parenting principles during infancy. Structure represents reliance on regularity and routines in daily life. Attunement represents reliance on infant cues and close physical contact. We suggest parents’ relative endorsement of these parenting principles is related to their choices about practices such as feeding, holding and night-time sleeping. We designed the Baby Care Questionnaire to measure parents’ endorsement of structure and attunement, as well as their daily parenting practices. We report data demonstrating the factor structure, reliability and validity of the BCQ. The BCQ, to our knowledge, is the first comprehensive measure of parenting practices and principles during infancy. We conclude with a discussion of future directions for the measure. PMID:24050932
An evaluation of general practice websites in the UK.
Howitt, Alistair; Clement, Sarah; de Lusignan, Simon; Thiru, Krish; Goodwin, Daryl; Wells, Sally
2002-10-01
General practice websites are an emerging phenomenon, but there have been few critical evaluations of their content. Previously developed rating instruments to assess medical websites have been criticized for failing to report their reliability and validity. The purpose of this study was to develop a rating instrument for assessing UK general practice websites, and then to evaluate them critically. The STaRNet Website Assessment Tool (SWAT) was developed listing criteria that general practice websites may meet, which was then used to evaluate a random sample of websites drawn from an electronic database. A second assessor rated a subsample of the sites to assess the tool's inter-rater reliability. The setting was an information technology group of a general practice research network using a random sample of 108 websites identified from the database. The main outcome measures were identification of rating criteria and frequency counts from the website rating instrument. Ninety (93.3%) sites were accessible, of which 84 were UK general practice websites. Criteria most frequently met were those describing the scope of the website and their functionality. Apart from e-mail to practices, criteria related to electronic communication were rarely met. Criteria relating to the quality of information were least often met. Inter-rater reliability kappa values for the items in the tool ranged from -0.06 to 1.0 (mean 0.59). Values were >0.6 for 15 out of 25 criteria assessed in 40 sites which were rated by two assessors. General practice websites offer a wide range of information. They are technically satisfactory, but do not exploit fully the potential for electronic doctor-patient communication. The quality of information they provide is poor. The instrument may be developed as a template for general practices producing or revising their own websites.
ERIC Educational Resources Information Center
Nese, Joseph F. T.; Lai, Cheng-Fei; Anderson, Daniel; Jamgochian, Elisa M.; Kamata, Akihito; Saez, Leilani; Park, Bitnara J.; Alonzo, Julie; Tindal, Gerald
2010-01-01
In this technical report, data are presented on the practical utility, reliability, and validity of the easyCBM[R] mathematics (2009-2010 version) measures for students in grades 3-8 within four districts in two states. Analyses include: minimum acceptable within-year growth; minimum acceptable year-end benchmark performance; internal and…
Pharmacy Service Orientation: a measure of organizational culture in pharmacy practice sites.
Clark, Bartholomew E; Mount, Jeanine K
2006-03-01
The importance of organizational culture in shaping everyday organizational life is well accepted, but little work has focused on organizational culture in pharmacy. Examining new pharmacists' experiences at various practice sites may help us to understand how these shape their professional ethos and practice habits. (1) Present development and assessment of the Pharmacy Service Orientation (PSO) measure, a tool for assessing pharmacists' impressions of pharmacy practice sites. (2) Use data gathered from a sample of new pharmacists to explore potential predictors of PSO, including type of practice site, type of pharmacy work experience, and type of pharmacy degree. Mail survey of randomly selected class of 1999 pharmacy graduates within 3 months of graduation (response rate: 259 of 1,850; 14%), each of whom reported on up to 6 different pharmacy practice sites for a total of 1,192 pharmacy observations. Pharmacy Service Orientation is scored on a 1-10 semantic differential scale and reliability was assessed using Cronbach's alpha. Predictors of PSO were explored using t test and ordinary least squares regression procedures. Reliability of the PSO across all observations was 0.86. When divided according to recency of experience and type of experience, reliabilities ranged from 0.78 to 0.87. Analysis of potential predictors of PSO showed that non-corporate-community sites had significantly greater pharmaceutical care-oriented cultures (mean PSOs of 7.42 and 5.13, respectively; P<.001). The same pattern was seen for academic and nonacademic worksites (mean PSOs of 7.46 and 6.01, respectively; P<.001). The pharmacist's pharmacy degree type was not predictive of PSO. Multivariate regression results showed that type of practice site and type of pharmacy work experience explained more than 25% of the observed variance in PSO. Pharmacy Service Orientation is a reliable measure. Statistically significant differences in PSO comparisons by degree and by experience type are explained by significant differences between the PSOs of corporate-community and non-corporate-community sites.
Assessing the knowledge to practice gap: The asthma practices of community pharmacists
Guirguis, Lisa M.
2017-01-01
Background: Community pharmacists are well positioned to identify patients with poorly controlled asthma and trained to optimize asthma therapy. Yet, over 90% of patients with asthma live with uncontrolled disease. We sought to understand the current state of asthma management in practice in Alberta and explore the potential use of the Chat, Check and Chart (CCC) model to enhance pharmacists’ care for patients with asthma. Methods: An 18-question survey was used to examine pharmacists’ monitoring of asthma control and prior use of the CCC tools. Descriptive statistics were used to characterize the response rate, sample demographics, asthma management and CCC use. Survey validity and reliability were established. Results: One hundred randomly selected pharmacists completed the online survey with a 40% (100/250) response rate. A third of responding pharmacists reported talking to most patients about asthma symptoms and medication, with a greater focus on talking with patients on new prescriptions over those with ongoing therapies. Fewer than 1 in 10 pharmacists routinely talked to most patients about asthma action plans (AAPs). The majority of pharmacists (76%) were familiar with the CCC model, and 83% of those reported that the CCC model influenced their practice anywhere from somewhat (45%) to a great deal (38%). Both scales had good reliability, and factor analysis provided support for scale validity. Conclusions: There was considerable variability in pharmacists’ activities in monitoring asthma. Pharmacists rarely used AAPs. The CCC model had a high level of self-reported familiarity, use and influence among pharmacists. PMID:29317938
Oropharyngeal dysphagia: surveying practice patterns of the speech-language pathologist.
Martino, Rosemary; Pron, Gaylene; Diamant, Nicholas E
2004-01-01
The present study was designed to obtain a comprehensive view of the dysphagia assessment practice patterns of speech-language pathologists and their opinion on the importance of these practices using survey methods and taking into consideration clinician, patient, and practice-setting variables. A self-administered mail questionnaire was developed following established methodology to maximize response rates. Eight dysphagia experts independently rated the new survey for content validity. Test-retest reliability was assessed with a random sample of 23 participants. The survey was sent to 50 speech-language pathologists randomly selected from the Canadian professional association database of members who practice in dysphagia. Surveys were mailed according to the Dillman Total Design Method and included an incentive offer. High survey (64%) and item response (95%) rates were achieved and clinicians were reliable reporters of their practice behaviors (ICC>0.60). Of all the clinical assessment items, 36% were reported with high (>80%) utilization and 24% with low (<20%) utilization, the former pertaining to tongue motion and vocal quality after food/fluid intake and the latter to testing of oral sensation without food. One-third (33%) of instrumental assessment items were highly utilized and included assessment of bolus movement and laryngeal response to bolus misdirection. Overall, clinician experience and teaching institutions influenced greater utilization. Opinions of importance were similar to utilization behaviors (r = 0.947, p = 0.01). Of all patients referred for dysphagia assessment, full clinical assessments were administered to 71% of patients but instrumental assessments to only 36%. A hierarchical model of practice behavior is proposed to explain this pattern of progressively decreasing item utilization.
Hand assessment in older adults with musculoskeletal hand problems: a reliability study.
Myers, Helen L; Thomas, Elaine; Hay, Elaine M; Dziedzic, Krysia S
2011-01-07
Musculoskeletal hand pain is common in the general population. This study aims to investigate the inter- and intra-observer reliability of two trained observers conducting a simple clinical interview and physical examination for hand problems in older adults. The reliability of applying the American College of Rheumatology (ACR) criteria for hand osteoarthritis to community-dwelling older adults will also be investigated. Fifty-five participants aged 50 years and over with a current self-reported hand problem and registered with one general practice were recruited from a previous health questionnaire study. Participants underwent a standardised, structured clinical interview and physical examination by two independent trained observers and again by one of these observers a month later. Agreement beyond chance was summarised using Kappa statistics and intra-class correlation coefficients. Median values for inter- and intra-observer reliability for clinical interview questions were found to be "substantial" and "moderate" respectively [median agreement beyond chance (Kappa) was 0.75 (range: -0.03, 0.93) for inter-observer ratings and 0.57 (range: -0.02, 1.00) for intra-observer ratings]. Inter- and intra-observer reliability for physical examination items was variable, with good reliability observed for some items, such as grip and pinch strength, and poor reliability observed for others, notably assessment of altered sensation, pain on resisted movement and judgements based on observation and palpation of individual features at single joints, such as bony enlargement, nodes and swelling. Moderate agreement was observed both between and within observers when applying the ACR criteria for hand osteoarthritis. Standardised, structured clinical interview is reliable for taking a history in community-dwelling older adults with self reported hand problems. Agreement between and within observers for physical examination items is variable. Low Kappa values may have resulted, in part, from a low prevalence of clinical signs and symptoms in the study participants. The decision to use clinical interview and hand assessment variables in clinical practice or further research in primary care should include consideration of clinical applicability and training alongside reliability. Further investigation is required to determine the relationship between these clinical questions and assessments and the clinical course of hand pain and hand problems in community-dwelling older adults.
Cairnduff, Victoria; Dean, Moira; Koidis, Anastasios
2016-09-01
Food preparation and storage behaviors in the home deviating from the "best practice" food safety recommendations may result in foodborne illnesses. Currently, there are limited tools available to fully evaluate the consumer knowledge, perceptions, and behavior in the area of refrigerator safety. The current study aimed to develop a valid and reliable tool in the form of a questionnaire, the Consumer Refrigerator Safety Questionnaire (CRSQ), for assessing systematically all these aspects. Items relating to refrigerator safety knowledge (n =17), perceptions (n =46), and reported behavior (n =30) were developed and pilot tested by an expert reference group and various consumer groups to assess face and content validity (n =20), item difficulty and consistency (n =55), and construct validity (n =23). The findings showed that the CRSQ has acceptable face and content validity with acceptable levels of item difficulty. Item consistency was observed for 12 of 15 in refrigerator safety knowledge. Further, all 5 of the subscales of consumer perceptions of refrigerator safety practices relating to risk of developing foodborne disease showed acceptable internal consistency (Cronbach's α value > 0.8). Construct validity of the CRSQ was shown to be very good (P = 0.022). The CRSQ exhibited acceptable test-retest reliability at 14 days with the majority of knowledge items (93.3%) and reported behavior items (96.4%) having correlation coefficients of greater than 0.70. Overall, the CRSQ was deemed valid and reliable in assessing refrigerator safety knowledge and behavior; therefore, it has the potential for future use in identifying groups of individuals at increased risk of deviating from recommended refrigerator safety practices, as well as the assessment of refrigerator safety knowledge and behavior for use before and after an intervention.
Pati, Sanghamitra; Hussain, Mohammad Akhtar; Swain, Subhashisa; Salisbury, Chris; Metsemakers, Job F M; Knottnerus, J André; van den Akker, Marjan
2016-01-01
Multimorbidity remains an underexplored domain in Indian primary care. We undertook a study to assess the prevalence, correlates, and outcomes of multimorbidity in primary care settings in India. This paper describes the process of development and validation of our data collection tool "Multimorbidity Assessment Questionnaire for Primary Care (MAQ-PC)." An iterative process comprising desk review, chart review, and expert consultations was undertaken to generate the questionnaire. The MAQ-PC contained items on chronic conditions, health care utilization, health related quality of life, disease severity, and sociodemographics. It was first tested with twelve adults for comprehensibility followed by test-retest reliability with 103 patients from four primary care practices. For interrater reliability, two interviewers separately administered the questionnaire to sixteen patients. MAQ-PC displayed strong internal consistency (Cronbach's alpha: 0.69), interrater reliability (Cohen's Kappa: 0.78-1), and test-retest reliability (ICC: 0.970-0.741). Substantial concordance between self-report and physician diagnosis (Scott Kappa: 0.59-1.0) was observed for listed chronic conditions indicating strong concurrent validity. Nearly 54% had one chronic condition and 23.3% had multimorbidity. Our findings demonstrate MAQ-PC to be a valid and reliable measure of multimorbidity in primary care practice and suggest its potential utility in multimorbidity research in India.
Reliability and Validity of the Evidence-Based Practice Confidence (EPIC) Scale
ERIC Educational Resources Information Center
Salbach, Nancy M.; Jaglal, Susan B.; Williams, Jack I.
2013-01-01
Introduction: The reliability, minimal detectable change (MDC), and construct validity of the evidence-based practice confidence (EPIC) scale were evaluated among physical therapists (PTs) in clinical practice. Methods: A longitudinal mail survey was conducted. Internal consistency and test-retest reliability were estimated using Cronbach's alpha…
NASA reliability preferred practices for design and test
NASA Technical Reports Server (NTRS)
1991-01-01
Given here is a manual that was produced to communicate within the aerospace community design practices that have contributed to NASA mission success. The information represents the best technical advice that NASA has to offer on reliability design and test practices. Topics covered include reliability practices, including design criteria, test procedures, and analytical techniques that have been applied to previous space flight programs; and reliability guidelines, including techniques currently applied to space flight projects, where sufficient information exists to certify that the technique will contribute to mission success.
Photovoltaic module reliability improvement through application testing and failure analysis
NASA Technical Reports Server (NTRS)
Dumas, L. N.; Shumka, A.
1982-01-01
During the first four years of the U.S. Department of Energy (DOE) National Photovoltatic Program, the Jet Propulsion Laboratory Low-Cost Solar Array (LSA) Project purchased about 400 kW of photovoltaic modules for test and experiments. In order to identify, report, and analyze test and operational problems with the Block Procurement modules, a problem/failure reporting and analysis system was implemented by the LSA Project with the main purpose of providing manufacturers with feedback from test and field experience needed for the improvement of product performance and reliability. A description of the more significant types of failures is presented, taking into account interconnects, cracked cells, dielectric breakdown, delamination, and corrosion. Current design practices and reliability evaluations are also discussed. The conducted evaluation indicates that current module designs incorporate damage-resistant and fault-tolerant features which address field failure mechanisms observed to date.
Development and initial validation of the internalization of Asian American stereotypes scale.
Shen, Frances C; Wang, Yu-Wei; Swanson, Jane L
2011-07-01
This research consists of four studies on the initial reliability and validity of the Internalization of Asian American Stereotypes Scale (IAASS), a self-report instrument that measures the degree Asian Americans have internalized racial stereotypes about their own group. The results from the exploratory and confirmatory factor analyses support a stable four-factor structure of the IAASS: Difficulties with English Language Communication, Pursuit of Prestigious Careers, Emotional Reservation, and Expected Academic Success. Evidence for concurrent and discriminant validity is presented. High internal-consistency and test-retest reliability estimates are reported. A discussion of how this scale can contribute to research and practice regarding internalized stereotyping among Asian Americans is provided.
Reddy, Linda A; Dudek, Christopher M; Fabiano, Gregory A; Peters, Stephanie
2015-12-01
This article presents information about the construct validity and reliability of a new teacher self-report measure of classroom instructional and behavioral practices (the Classroom Strategies Scales-Teacher Form; CSS-T). The theoretical underpinnings and empirical basis for the instructional and behavioral management scales are presented. Information is provided about the construct validity, internal consistency, test-retest reliability, and freedom from item-bias of the scales. Given previous investigations with the CSS Observer Form, it was hypothesized that internal consistency would be adequate and that confirmatory factor analyses (CFA) of CSS-T data from 293 classrooms would offer empirical support for the CSS-T's Total, Composite and subscales, and yield a similar factor structure to that of the CSS Observer Form. Goodness-of-fit indices of χ2/df, Root Mean Square Error of Approximation, Goodness of Fit Index, and Adjusted Goodness of Fit Index suggested satisfactory fit of proposed CFA models whereas the Comparative Fit Index did not. Internal consistency estimates of .93 and .94 were obtained for the Instructional Strategies and Behavioral Strategies Total scales respectively. Adequate test-retest reliability was found for instructional and behavioral total scales (r = .79, r = .84, percent agreement 93% and 93%). The CSS-T evidences freedom from item bias on important teacher demographics (age, educational degree, and years of teaching experience). Implications of results are discussed. (c) 2015 APA, all rights reserved).
ERIC Educational Resources Information Center
Harris, Judy R.
2011-01-01
This study provides evidence that peer marking can be a reliable tool for assessing laboratory reports in large cohorts. It was conducted over a 4-yr period with first-year undergraduates ([asymptotically equivalent to]180 students/cohort) taking a mammalian physiology course, but the procedure adopted would be applicable to any other…
Design for reliability: NASA reliability preferred practices for design and test
NASA Technical Reports Server (NTRS)
Lalli, Vincent R.
1994-01-01
This tutorial summarizes reliability experience from both NASA and industry and reflects engineering practices that support current and future civil space programs. These practices were collected from various NASA field centers and were reviewed by a committee of senior technical representatives from the participating centers (members are listed at the end). The material for this tutorial was taken from the publication issued by the NASA Reliability and Maintainability Steering Committee (NASA Reliability Preferred Practices for Design and Test. NASA TM-4322, 1991). Reliability must be an integral part of the systems engineering process. Although both disciplines must be weighed equally with other technical and programmatic demands, the application of sound reliability principles will be the key to the effectiveness and affordability of America's space program. Our space programs have shown that reliability efforts must focus on the design characteristics that affect the frequency of failure. Herein, we emphasize that these identified design characteristics must be controlled by applying conservative engineering principles.
Reliability and validity of electrothermometers and associated thermocouples.
Jutte, Lisa S; Knight, Kenneth L; Long, Blaine C
2008-02-01
Examine thermocouple model uncertainty (reliability+validity). First, a 3x3 repeated measures design with independent variables electrothermometers and thermocouple model. Second, a 1x3 repeated measures design with independent variable subprobe. Three electrothermometers, 3 thermocouple models, a multi-sensor probe and a mercury thermometer measured a stable water bath. Temperature and absolute temperature differences between thermocouples and a mercury thermometer. Thermocouple uncertainty was greater than manufactures'claims. For all thermocouple models, validity and reliability were better in the Iso-Themex than the Datalogger, but there were no practical differences between models within an electrothermometers. Validity of multi-sensor probes and thermocouples within a probe were not different but were greater than manufacturers'claims. Reliability of multiprobes and thermocouples within a probe were within manufacturers claims. Thermocouple models vary in reliability and validity. Scientists should test and report the uncertainty of their equipment rather than depending on manufactures' claims.
Spanish version of Pregnancy Symptoms Inventory: transcultural adaptation and reliability.
Oviedo-Caro, Miguel A; Bueno-Antequera, Javier; Munguía-Izquierdo, Diego
2017-09-01
To transcultural adapt and analyze the reliability of Spanish version of Pregnancy Symptoms Inventory (PSI) and assess the prevalence of pregnancy symptoms in Spanish pregnant women. A subsample of 120 healthy pregnant women answered the PSI twice and a sample of 280 report the prevalence and limitation of pregnancy symptoms. The reliability was examined by means of percent agreement and weighted Kappa coefficients. The prevalence of pregnancy symptoms was evaluated by the frequency of answers. Perfect and perfect-acceptable agreement was observed in 82% and 96% of the pregnant women, respectively. Weighted Kappa coefficients ranged from 0.589 to 0.889, indicating a good reliability. The most frequent symptoms perceived by Spanish pregnant women were urinary frequency, poor sleep, increased vaginal discharge and tiredness. Spanish Pregnancy Symptoms Inventory is a brief, conceptually equivalent and satisfactory reliable tool that allows an early assessment of the wide range of pregnancy symptoms in the health care practices.
Palmer, Janice L; Coats, Mary A; Roe, Catherine M; Hanko, Shelly M; Xiong, Chengjie; Morris, John C
2010-06-01
This paper is a report of a study to establish the inter-rater reliability of advanced practice nurse and neurologist neurological assessments which included ratings with the Unified Parkinson's Disease Rating Scale-Motor Exam. Around the world, advanced practice nurses are performing tasks once completed only by physicians. To promote consumer and provider confidence, it is important to establish that nurse and physician ratings using assessment tools are similar. In addition in research settings, when different raters are used, establishment of inter-rater reliability for study assessments is needed. Advanced practice nurses and neurologists independently recorded findings on neurological examinations of 46 participants in a study conducted between August 2007 and January 2008. An intraclass correlation coefficient was calculated to estimate overall agreement between the nurse and neurologist ratings. Agreement for individual items measured on a dichotomous scale was assessed by calculating Cohen's kappa. There was substantial agreement between advanced practice nurses and neurologists on the mean Unified Parkinson's Disease Rating Scale-Motor Exam ratings (intraclass correlation coefficient = 0.65) and the U.S. National Alzheimer's Coordinating Center Uniform Data Set neurological examination ratings of unremarkable findings (kappa = 0.74) and of gait disorder (kappa = 0.73). Moderate agreement (kappa = 0.53) was reached for the rating of whether all Unified Parkinson's Disease Rating Scale-Motor Exam items were normal. These findings are consistent with studies of the inter-rater agreement of the Unified Parkinson's Disease Rating Scale-Motor Exam and support the conduct of neurological assessments by advanced practice nurses.
[An instrument in Spanish to evaluate the performance of clinical teachers by students].
Bitran, Marcela; Mena, Beltrán; Riquelme, Arnoldo; Padilla, Oslando; Sánchez, Ignacio; Moreno, Rodrigo
2010-06-01
The modernization of clinical teaching has called for the creation of faculty development programs, and the design of suitable instruments to evaluate clinical teachers' performance. To report the development and validation of an instrument in Spanish designed to measure the students' perceptions of their clinical teachers' performance and to provide them with feedback to improve their teaching practices. In a process that included the active participation of authorities, professors in charge of courses and internships, clinical teachers, students and medical education experts, we developed a 30-item questionnaire called MEDUC30 to evaluate the performance of clinical teachers by their students. The internal validity was assessed by factor analysis of 5214 evaluations of 265 teachers, gathered from 2004 to 2007. The reliability was measured with the Cronbach's alpha coefficient and the generalizability coefficient (g). MEDUC30 had good content and construct validity. Its internal structure was compatible with four factors: patient-centered teaching, teaching skills, assessment skills and learning climate, and it proved to be consistent with the structure anticipated by the theory. The scores were highly reliable (Cronbach's alpha: 0.97); five evaluations per teacher were sufficient to reach a reliability coefficient (g) of 0.8. MEDUC30 is a valid, reliable and useful instrument to evaluate the performance of clinical teachers. To our knowledge, this is the first instrument in Spanish for which solid validity and reliability evidences have been reported. We hope that MEDUC30 will be used to improve medical education in Spanish-speaking medical schools, providing teachers a specific feedback upon which to improve their pedagogical practice, and authorities with valuable information for the assessment of their faculty.
Best Practices for Reliable and Robust Spacecraft Structures
NASA Technical Reports Server (NTRS)
Raju, Ivatury S.; Murthy, P. L. N.; Patel, Naresh R.; Bonacuse, Peter J.; Elliott, Kenny B.; Gordon, S. A.; Gyekenyesi, J. P.; Daso, E. O.; Aggarwal, P.; Tillman, R. F.
2007-01-01
A study was undertaken to capture the best practices for the development of reliable and robust spacecraft structures for NASA s next generation cargo and crewed launch vehicles. In this study, the NASA heritage programs such as Mercury, Gemini, Apollo, and the Space Shuttle program were examined. A series of lessons learned during the NASA and DoD heritage programs are captured. The processes that "make the right structural system" are examined along with the processes to "make the structural system right". The impact of technology advancements in materials and analysis and testing methods on reliability and robustness of spacecraft structures is studied. The best practices and lessons learned are extracted from these studies. Since the first human space flight, the best practices for reliable and robust spacecraft structures appear to be well established, understood, and articulated by each generation of designers and engineers. However, these best practices apparently have not always been followed. When the best practices are ignored or short cuts are taken, risks accumulate, and reliability suffers. Thus program managers need to be vigilant of circumstances and situations that tend to violate best practices. Adherence to the best practices may help develop spacecraft systems with high reliability and robustness against certain anomalies and unforeseen events.
A Performance-Based Method of Student Evaluation
ERIC Educational Resources Information Center
Nelson, G. E.; And Others
1976-01-01
The Problem Oriented Medical Record (which allows practical definition of the behavioral terms thoroughness, reliability, sound analytical sense, and efficiency as they apply to the identification and management of patient problems) provides a vehicle to use in performance based type evaluation. A test-run use of the record is reported. (JT)
16 CFR 436.9 - Additional prohibitions.
Code of Federal Regulations, 2012 CFR
2012-01-01
... practice in violation of Section 5 of the Federal Trade Commission Act for any franchise seller covered by... a franchise from the franchisor or operated a franchise of the type offered by the franchisor. (2) Can provide an independent and reliable report about the franchise or the experiences of any current...
16 CFR 436.9 - Additional prohibitions.
Code of Federal Regulations, 2014 CFR
2014-01-01
... practice in violation of Section 5 of the Federal Trade Commission Act for any franchise seller covered by... a franchise from the franchisor or operated a franchise of the type offered by the franchisor. (2) Can provide an independent and reliable report about the franchise or the experiences of any current...
16 CFR 436.9 - Additional prohibitions.
Code of Federal Regulations, 2013 CFR
2013-01-01
... practice in violation of Section 5 of the Federal Trade Commission Act for any franchise seller covered by... a franchise from the franchisor or operated a franchise of the type offered by the franchisor. (2) Can provide an independent and reliable report about the franchise or the experiences of any current...
16 CFR 436.9 - Additional prohibitions.
Code of Federal Regulations, 2010 CFR
2010-01-01
... practice in violation of Section 5 of the Federal Trade Commission Act for any franchise seller covered by... a franchise from the franchisor or operated a franchise of the type offered by the franchisor. (2) Can provide an independent and reliable report about the franchise or the experiences of any current...
16 CFR 436.9 - Additional prohibitions.
Code of Federal Regulations, 2011 CFR
2011-01-01
... practice in violation of Section 5 of the Federal Trade Commission Act for any franchise seller covered by... a franchise from the franchisor or operated a franchise of the type offered by the franchisor. (2) Can provide an independent and reliable report about the franchise or the experiences of any current...
Raad, Jennifer M; Bellinger, Skylar; McCormick, Erica; Roberts, Michael C; Steele, Ric G
2008-08-01
To replicate Sifers, Puddy, Warren, and Roberts (2002) examining reporting rates of demographic, methodological, and ethical information in articles published during 1997, and to compare these rates to those found in articles published during 2005, in order to determine whether and how reporting practices of these variables have changed over time. We examined reporting demographic, methodological, and ethical information in articles in four journals: Journal of Pediatric Psychology, Journal of Clinical Child and Adolescent Psychology, Journal of Abnormal Child Psychology, and Child Development. Reporting rates during 2005 were compared to articles published during 1997. These four journals improved on many of the 23 variables compared to Sifers et al. including increases in the reporting of ethnicity, attrition, child assent procedures, socioeconomic status, reliability, and reward/incentive offered to participants. Improvements in descriptive information have implications for interpretation, replication, and generalizability of research findings.
Sample size requirements for the design of reliability studies: precision consideration.
Shieh, Gwowen
2014-09-01
In multilevel modeling, the intraclass correlation coefficient based on the one-way random-effects model is routinely employed to measure the reliability or degree of resemblance among group members. To facilitate the advocated practice of reporting confidence intervals in future reliability studies, this article presents exact sample size procedures for precise interval estimation of the intraclass correlation coefficient under various allocation and cost structures. Although the suggested approaches do not admit explicit sample size formulas and require special algorithms for carrying out iterative computations, they are more accurate than the closed-form formulas constructed from large-sample approximations with respect to the expected width and assurance probability criteria. This investigation notes the deficiency of existing methods and expands the sample size methodology for the design of reliability studies that have not previously been discussed in the literature.
Barkham, M; Margison, F; Leach, C; Lucock, M; Mellor-Clark, J; Evans, C; Benson, L; Connell, J; Audin, K; McGrath, G
2001-04-01
To complement the evidence-based practice paradigm, the authors argued for a core outcome measure to provide practice-based evidence for the psychological therapies. Utility requires instruments that are acceptable scientifically, as well as to service users, and a coordinated implementation of the measure at a national level. The development of the Clinical Outcomes in Routine Evaluation-Outcome Measure (CORE-OM) is summarized. Data are presented across 39 secondary-care services (n = 2,710) and within an intensively evaluated single service (n = 1,455). Results suggest that the CORE-OM is a valid and reliable measure for multiple settings and is acceptable to users and clinicians as well as policy makers. Baseline data levels of patient presenting problem severity, including risk, are reported in addition to outcome benchmarks that use the concept of reliable and clinically significant change. Basic quality improvement in outcomes for a single service is considered.
Reliability Practice at NASA Goddard Space Flight Center
NASA Technical Reports Server (NTRS)
Pruessner, Paula S.; Li, Ming
2008-01-01
This paper describes in brief the Reliability and Maintainability (R&M) Programs performed directly by the reliability branch at Goddard Space Flight Center (GSFC). The mission assurance requirements flow down is explained. GSFC practices for PRA, reliability prediction/fault tree analysis/reliability block diagram, FMEA, part stress and derating analysis, worst case analysis, trend analysis, limit life items are presented. Lessons learned are summarized and recommendations on improvement are identified.
Vadillo, Miguel A; Hardwicke, Tom E; Shanks, David R
2016-05-01
When a series of studies fails to replicate a well-documented effect, researchers might be tempted to use a "vote counting" approach to decide whether the effect is reliable-that is, simply comparing the number of successful and unsuccessful replications. Vohs's (2015) response to the absence of money priming effects reported by Rohrer, Pashler, and Harris (2015) provides an example of this approach. Unfortunately, vote counting is a poor strategy to assess the reliability of psychological findings because it neglects the impact of selection bias and questionable research practices. In the present comment, we show that a range of meta-analytic tools indicate irregularities in the money priming literature discussed by Rohrer et al. and Vohs, which all point to the conclusion that these effects are distorted by selection bias, reporting biases, or p-hacking. This could help to explain why money-priming effects have proven unreliable in a number of direct replication attempts in which biases have been minimized through preregistration or transparent reporting. Our major conclusion is that the simple proportion of significant findings is a poor guide to the reliability of research and that preregistered replications are an essential means to assess the reliability of money-priming effects. (c) 2016 APA, all rights reserved).
How reliable are forensic evaluations of legal sanity?
Gowensmith, W Neil; Murrie, Daniel C; Boccaccini, Marcus T
2013-04-01
When different clinicians evaluate the same criminal defendant's legal sanity, do they reach the same conclusion? Because Hawaii law requires multiple, independent evaluations when questions about legal sanity arise, Hawaii allows for the first contemporary study of the reliability of legal sanity opinions in routine practice in the United States. We examined 483 evaluation reports, addressing 165 criminal defendants, in which up to three forensic psychiatrists or psychologists offered independent opinions on a defendant's legal sanity. Evaluators reached unanimous agreement regarding legal sanity in only 55.1% of cases. Evaluators tended to disagree more often when a defendant was under the influence of drugs or alcohol at the time of the offense. But evaluators tended to agree more often when they agreed about diagnosing a psychotic disorder, or when the defendant had been psychiatrically hospitalized shortly before the offense. In court, judges followed the majority opinion among evaluators in 91% of cases. But when judges disagreed with the majority opinion, they usually did so to find defendants legally sane, rather than insane. Overall, this study indicates that reliability among practicing forensic evaluators addressing legal sanity may be poorer than the field has tended to assume. Although agreement appears more likely in some cases than others, the frequent disagreements suggest a need for improved training and practice.
Using an evidence-based approach to measure outcomes in clinical practice.
MacDermid, Joy C; Grewal, Ruby; MacIntyre, Norma J
2009-02-01
Evaluation of the outcome of evidence-based practice decisions in individual patients or patient groups is step five in the evidence-based practice approach. Outcome measures are any measures that reflect patient status. Status or outcome measures can be used to detect change over time (eg, treatment effects), to discriminate among clinical groups, or to predict future outcomes (eg, return to work). A variety of reliable and valid physical impairment and disability measures are available to assess treatment outcomes in hand surgery and therapy. Evidence from research studies that includes normative data, standard error of measurement, or comparative scores for important clinical subgroups can be used to set treatment goals, monitor recovery, and compare individual patient outcomes to those reported in the literature. Clinicians tend to rely on impairment measures, such as radiographic measures, grip strength, and range of motion, although self-report measures are known to be equally reliable and more related to global effects, such as return-to-work. The process of selecting and implementing outcome measures is crucial. This process works best when team members are involved and willing to trial new measures. In this way, the team can develop customized outcome assessment procedures that meet their needs for assessing individual patients and providing data for program evaluation.
Bussières, André E.; Terhorst, Lauren; Leach, Matthew; Stuber, Kent; Evans, Roni; Schneider, Michael J.
2015-01-01
Objectives: To identify Canadian chiropractors’ attitudes, skills and use of evidence based practice (EBP), as well as their level of awareness of previously published chiropractic clinical practice guidelines (CPGs). Methods: 7,200 members of the Canadian Chiropractic Association were invited by e-mail to complete an online version of the Evidence Based practice Attitude & utilisation SurvEy (EBASE); a valid and reliable measure of participant attitudes, skills and use of EBP. Results: Questionnaires were completed by 554 respondents. Most respondents (>75%) held positive attitudes toward EBP. Over half indicated a high level of self-reported skills in EBP, and over 90% expressed an interest in improving these skills. A majority of respondents (65%) reported over half of their practice was based on evidence from clinical research, and only half (52%) agreed that chiropractic CPGs significantly impacted on their practice. Conclusions: While most Canadian chiropractors held positive attitudes towards EBP, believed EBP was useful, and were interested in improving their skills in EBP, many did not use research evidence or CPGs to guide clinical decision making. Our findings should be interpreted cautiously due to the low response rate. PMID:26816412
Palmer, Janice L.; Coats, Mary A.; Roe, Catherine M.; Hanko, Shelly M.; Xiong, Chengjie; Morris, John C.
2010-01-01
Aim This paper is a report of a study to establish the inter-rater reliability of advanced practice nurse and neurologist neurological assessments which included ratings with the Unified Parkinson’s Disease Rating Scale-Motor Exam. Background Around the world, advanced practice nurses are performing tasks once completed by only physicians. To promote consumer and provider confidence, it is important to establish that nurse and physician ratings using assessment tools are similar. In addition in research settings, when different raters are used, establishment of inter-rater reliability for study assessments is needed. Method Advanced practice nurses and neurologists independently recorded findings on neurological examinations of 46 participants in a study conducted between August 2007 and January 2008. An intraclass correlation coefficient was calculated to estimate overall agreement between the nurse and neurologist ratings. Agreement for individual items measured on a dichotomous scale was assessed by calculating Cohen’s kappa. Results There was substantial agreement between advanced practice nurses and neurologists on the mean Unified Parkinson’s Disease Rating Scale-Motor Exam ratings (intraclass correlation coefficient = 0.65) and the U.S. National Alzheimer’s Coordinating Center Uniform Data Set neurological examination ratings of unremarkable findings (kappa = 0.74) and of gait disorder (kappa = 0.73). Moderate agreement (kappa = 0.53) was reached for the rating of whether all Unified Parkinson’s Disease Rating Scale-Motor Exam items were normal. Conclusion These findings are consistent with studies of the inter-rater agreement of the Unified Parkinson’s Disease Rating Scale-Motor Exam and support the conduct of neurological assessments by advanced practice nurses. PMID:20546368
McKay, J; Murphy, D J; Bowie, P; Schmuck, M-L; Lough, M; Eva, K W
2007-04-01
To establish the content validity and specific aspects of reliability for an assessment instrument designed to provide formative feedback to general practitioners (GPs) on the quality of their written analysis of a significant event. Content validity was quantified by application of a content validity index. Reliability testing involved a nested design, with 5 cells, each containing 4 assessors, rating 20 unique significant event analysis (SEA) reports (10 each from experienced GPs and GPs in training) using the assessment instrument. The variance attributable to each identified variable in the study was established by analysis of variance. Generalisability theory was then used to investigate the instrument's ability to discriminate among SEA reports. Content validity was demonstrated with at least 8 of 10 experts endorsing all 10 items of the assessment instrument. The overall G coefficient for the instrument was moderate to good (G>0.70), indicating that the instrument can provide consistent information on the standard achieved by the SEA report. There was moderate inter-rater reliability (G>0.60) when four raters were used to judge the quality of the SEA. This study provides the first steps towards validating an instrument that can provide educational feedback to GPs on their analysis of significant events. The key area identified to improve instrument reliability is variation among peer assessors in their assessment of SEA reports. Further validity and reliability testing should be carried out to provide GPs, their appraisers and contractual bodies with a validated feedback instrument on this aspect of the general practice quality agenda.
Assessing Health Status in Inflammatory Bowel Disease using a Novel Single-Item Numeric Rating Scale
Surti, Bijal; Spiegel, Brennan; Ippoliti, Andrew; Vasiliauskas, Eric; Simpson, Peter; Shih, David; Targan, Stephan; McGovern, Dermot; Melmed, Gil Y.
2014-01-01
Background Current instruments used to measure disease activity and health-related quality of life (HRQOL) in patients with Crohn’s disease (CD) and ulcerative colitis (UC) are often cumbersome, time-consuming, and expensive; although used in clinical trials, they are not convenient for clinical practice. A numeric rating scale (NRS) is a quick, inexpensive, and convenient patient-reported outcome (PRO) that can capture the patient’s overall perception of health. Aims To assess the validity, reliability, and responsiveness of an NRS and evaluate its use in clinical practice in patients with CD and UC. Methods We prospectively evaluated patient-reported NRS scores and measured correlations between NRS and a range of severity measures, including physician-reported NRS, Crohn’s disease activity index (CDAI), Harvey-Bradshaw index (HBI), inflammatory bowel disease questionnaire (IBDQ), and C-reactive protein (CRP) in patients with CD. Subsequently, we evaluated the correlation between the NRS and standard measures of health status (HBI or simple colitis clinical activity index [SCCAI]) and laboratory tests (sedimentation rate [ESR], CRP, and fecal calprotectin) in patients with CD and UC. Results The patient-reported NRS showed excellent correlation with CDAI (R2=0.59, p<0.0001), IBDQ (R2=0.66, p<0.0001), and HBI (R2=0.32, p<0.0001) in patients with CD. The NRS showed poor, but statistically significant correlation with SCCAI (R2=0.25, p<0.0001) in patients with UC. The NRS did not correlate with CRP, ESR, or calprotectin. The NRS was reliable and responsive to change. Conclusions The NRS is a valid, reliable, and responsive measure that may be useful to evaluate patients with CD and possibly UC. PMID:23250673
Surti, Bijal; Spiegel, Brennan; Ippoliti, Andrew; Vasiliauskas, Eric A; Simpson, Peter; Shih, David Q; Targan, Stephan R; McGovern, Dermot P B; Melmed, Gil Y
2013-05-01
Current instruments used to measure disease activity and health-related quality of life in patients with Crohn's disease (CD) and ulcerative colitis (UC) are often cumbersome, time-consuming, and expensive; although used in clinical trials, they are not convenient for clinical practice. A numeric rating scale (NRS) is a quick, inexpensive, and convenient patient-reported outcome that can capture the patient's overall perception of health. The aim of this study was to assess the validity, reliability, and responsiveness of an NRS and evaluate its use in clinical practice in patients with CD and UC. We prospectively evaluated patient-reported NRS scores and measured correlations between NRS and a range of severity measures, including physician-reported NRS, Crohn's disease activity index (CDAI), Harvey-Bradshaw index (HBI), inflammatory bowel disease questionnaire (IBDQ), and C-reactive protein (CRP) in patients with CD. Subsequently, we evaluated the correlation between the NRS and standard measures of health status (HBI or simple colitis clinical activity index [SCCAI]) and laboratory tests (sedimentation rate [ESR], CRP, and fecal calprotectin) in patients with CD and UC. The patient-reported NRS showed excellent correlation with CDAI (R (2) = 0.59, p < 0.0001), IBDQ (R (2) = 0.66, p < 0.0001), and HBI (R (2) = 0.32, p < 0.0001) in patients with CD. The NRS showed poor, but statistically significant correlation with SCCAI (R (2) = 0.25, p < 0.0001) in patients with UC. The NRS did not correlate with CRP, ESR, or calprotectin. The NRS was reliable and responsive to change. The NRS is a valid, reliable, and responsive measure that may be useful to evaluate patients with CD and possibly UC.
Aligning English Language Testing with Curriculum
ERIC Educational Resources Information Center
Palacio, Marcela; Gaviria, Sandra; Brown, James Dean
2016-01-01
Frustrations with traditional testing led a group of teachers at the English for adults program at Universidad EAFIT (Colombia) to design tests aligned with the institutional teaching philosophy and classroom practices. This article reports on a study of an item-by-item evaluation of a series of English exams for validity and reliability in an…
Reliability Reporting Practices in Youth Life Satisfaction Research
ERIC Educational Resources Information Center
Vassar, Matt; Hale, William
2007-01-01
Due to the emergence of positive psychology in recent years, a growing line of research has focused on aspects of psychological wellness rather than psychopathology. Within the context of positive psychology, life satisfaction has emerged as a key variable of study in relation to adult and youth populations. Accurate measurement of life…
Predicting Human Thermal Comfort in Automobiles
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rugh, J.; Bharathan, D.; Chaney, L.
The objects of this report are to: (1) increase national energy security by reducing fuel use for vehicle climate control systems; (2) show/demonstrate technology that can reduce the fuel used by LD vehicles' ancillary systems; and (3) develop tools to evaluate the effectiveness of energy-efficient systems including--comfort, cost, practicality, ease-of-use, and reliability.
Ibrahim, Edward F; Petrou, Charalambos; Galanos, Antonis
2015-01-01
Background The purpose of the present study was to validate the Functional Shoulder Score (FSS), a new patient-reported outcome score specifically designed to evaluate patients with rotator cuff disorders. Methods One hundred and nineteen patients were assessed using two shoulder scoring systems [the FSS and the Constant–Murley Score (CMS)] at 3 weeks pre- and 6 months post-arthroscopic rotator cuff surgery. The reliability, validity, responsiveness and interpretability of the FSS were evaluated. Results Reliability analysis (test–retest) showed an intraclass correlation coefficient value of 0.96 [95% confidence interval (CI) = 0.92 to 0.98]. Internal consistency analysis revealed a Cronbach's alpha coefficient of 0.93. The Pearson correlation coefficient FSS-CMS was 0.782 pre-operatively and 0.737 postoperatively (p < 0.0005). There was a statistically significant increase in FSS scores postoperatively, an effect size of 3.06 and standardized response mean of 2.80. The value for minimal detectable change was ±8.38 scale points (based on a 90% CI) and the minimal clinically important difference for improvement was 24.7 ± 5.4 points. Conclusions The FSS is a patient-reported outcome measure that can easily be incorporated into clinical practice, providing a quick, reliable, valid and practical measure for rotator cuff problems. The questionnaire is highly sensitive to clinical change. PMID:27582986
Bandiera, Glen; Abrahams, Caroline; Ruetalo, Mariela; Hanson, Mark D; Nickell, Leslie; Spadafora, Salvatore
2015-12-01
Medical education institutions have a social mandate to produce a diverse physician workforce that meets the public's needs. Recent reports have framed the admission process outcome of undergraduate and postgraduate medical education (UGME and PGME) programs as a key determinant of the collective contributions graduating cohorts will make to society, creating a sense of urgency around the issue of who gets accepted. The need for evidence-informed residency application and selection processes is growing because of the increasing size and diversity of the applicant pool and the need for equity, fairness, social accountability, and health human resource planning. The selection literature, however, is dominated by a UGME focus and emphasizes determination of desirable qualities of future physicians and selection instrument reliability and validity. Gaps remain regarding PGME selection, particularly the creation of specialty-specific selection criteria, suitable outcome measures, and reliable selection systems.In this Perspective, the authors describe the University of Toronto's centralized approach to defining system-level best practices for residency application and selection. Over the 2012-2013 academic year, the Best Practices in Application and Selection working group reviewed relevant literature and reports, consulted content experts, surveyed local practices, and conducted iterative stakeholder consultations on draft recommendations. Strong agreement arose around the resulting 13 principles and 24 best practices, which had either empirical support or face validity. These recommendations, which are shared in this article, have been adopted by the university's PGME advisory committee and will inform a national initiative to improve trainees' transition from UGME to PGME in Canada.
Emergency physician evaluation of PA and NP practice patterns.
Phillips, Andrew W; Klauer, Kevin M; Kessler, Chad S
2018-05-01
The unprecedented surge in physician assistants (PAs) and NPs in the ED developed quickly in recent years, but scope of practice and practice patterns are not well described. We conducted two cross-sectional electronic surveys of the American College of Emergency Physicians' council. Survey construction was informed by interviews and evaluated with validity and reliability studies. Univariate analyses to establish associations also were performed. Most councilors' departments employ PAs and NPs (72.4% of 163 responses). Supervisory requirements varied greatly among respondents for the same emergency severity index (ESI) level. Regardless of experience level, NPs were reported to use significantly more resources than PAs; chi-square(4) = 105.292, P < .001 for less-experienced PAs or NPs; chi-square(4) = 120.415, P < .001 for more experienced PAs or NPs. Councilors reported great variation in PA and NP scope of practice. The results also suggest that new graduate PAs may be more clinically prepared to practice in the ED than new graduate NPs.
Medical and pharmacy students' perceptions of the grading and assessment practices.
Kasanda, C D; Mitonga, K H; Veii, K; Zimba, R F
2013-01-01
Many students at the University of Namibia have frequently complained about ineffective assessment practices used at the institution. On many occasions, these complaints have not been substantiated with evidence of any kind. The purpose of this study was to obtain some empirical evidence that would ascertain undergraduate students' perceptions of the University of Namibia's grading and assessment practices. Using a structured scaled questionnaire, data were obtained from a representative sample of the University's undergraduate students studying for Medical and Pharmacy degrees. The questionnaire items covered matters related to students' experiences of assessment practices, feedback on assessment tasks, reliability and validity of assessment tools used by lecturers, efficacy of processes of administering examinations, perceptions of irregular and unfair assessment practices, impact of assessment regimes on students' cost of studies, motivation, morale, rate of progression in studies and graduation, the degree of compliance with assessment ethics and on academic quality assurance. According to the data reported in this article, the majority of the respondents perceived that the Schools of Medicine and Pharmacy at the University of Namibia applied assessment practices that yielded reliable and valid results. This was the case because most lecturers in the two schools used appropriate assessment tools and provided their students with prompt and informative feedback on the results of assignments, tests and examinations. In addition, most respondents reported that whereas examination procedures used in the two schools were efficient and effective, lecturers graded examination scripts fairly. These and other results are discussed in the article to communicate the message that the assessment procedures used in the Schools of Medicine and Pharmacy at the University of Namibia would promote effective learning and understanding amongst students as they were of high quality.
Test-retest reliability and practice effects of the Wechsler Memory Scale-III.
Lo, Ada H Y; Humphreys, Michael; Byrne, Gerard J; Pachana, Nancy A
2012-09-01
Although serial administration of cognitive tests is increasingly common, there is a paucity of research on test-retest reliabilities and practice effects, both of which are important for evaluating changes in functioning. Reliability is generally conceptualized as involving short-lasting changes in performance. However, when repeated testing occurs over a period of years, there will be some longer lasting effects. The implications of these longer lasting effects and practice effects on reliability were examined in the context of repeated administrations of the Wechsler Memory Scale-III in 339 community-dwelling women aged 40-79 years over 2 to 7 years. The results showed that Logical Memory and Verbal Paired Associates subtests were consistently the most reliable subtests across the age cohorts. The magnitude of practice effects varied as a function of subtests and age. The largest practice effects were found in the youngest age cohort, especially on the Faces, Logical Memory, and Verbal Paired Associates subtests. ©2012 The British Psychological Society.
Development and psychometric properties of the Ethics Environment Questionnaire.
McDaniel, C
1997-09-01
The author reports on the development and the psychometric properties of the Ethics Environment Questionnaire (EEQ), an instrument by which to measure the opinions of health-care providers about ethics in their clinical practice organizations. The EEQ was developed to increase the number of valid and reliable measures pertaining to ethics in health-care delivery. The EEQ is a 20-item self-administered questionnaire using a Likert-type 5-point format, offering ease of administration. It is applicable to a cross-section of health-care practitioners and health-care facilities. The mean administration time is 10 minutes. The EEQ represents testing on 450 respondents in acute care settings among a cross-section of acute care facilities. Internal consistency reliability using Cronbach's alpha coefficient is 0.93, and the test-retest reliability is 0.88. Construct, content, and criterion validity are established. The scale is unidimensional, with factor loadings exceeding the minimum preset criterion. Mean score is 3.1 out of 5.0, with scores of 3.5 and above interpreted as reflective of a positive ethics environment. The EEQ provides a measure of ethics in health-care organizations among multi-practitioners in clinical practice on a valid, reliable, cost effective, and easily administered instrument that requires minimum investment of personnel time.
Creating Highly Reliable Accountable Care Organizations.
Vogus, Timothy J; Singer, Sara J
2016-12-01
Accountable Care Organizations' (ACOs) pursuit of the triple aim of higher quality, lower cost, and improved population health has met with mixed results. To improve the design and implementation of ACOs we look to organizations that manage similarly complex, dynamic, and tightly coupled conditions while sustaining exceptional performance known as high-reliability organizations. We describe the key processes through which organizations achieve reliability, the leadership and organizational practices that enable it, and the role that professionals can play when charged with enacting it. Specifically, we present concrete practices and processes from health care organizations pursuing high-reliability and from early ACOs to illustrate how the triple aim may be met by cultivating mindful organizing, practicing reliability-enhancing leadership, and identifying and supporting reliability professionals. We conclude by proposing a set of research questions to advance the study of ACOs and high-reliability research. © The Author(s) 2016.
Validation of the Korean Version of the Breast Cancer Screening Beliefs Questionnaire.
Kwok, Cannas; Lee, Mi-Joung; Lee, Chun Fan
Korean immigrant women have been consistently reported as having low participation in breast cancer screening practices. A valid and reliable instrument to explore factors that affect their cancer screening behaviors is essential. The aim of this study was to report the psychometric properties of the Korean version of the Breast Cancer Screening Beliefs Questionnaire (BCSBQ). A convenience sample of 249 Korean Australian women was recruited through a number of Korean community organizations in Sydney. Exploratory factor analysis supports a similar fit for the original 3-factor structure of our data set. A significant association was found between the attitudes of these women toward general health checkups and the frequency of their performance of the breast awareness practices and having mammograms. Furthermore, it was found that knowledge and perceptions about the breast cancer scales were significantly associated with education level and that barriers to mammographic screening were much less evident among women who engaged in the 3 screening practices. The results indicated that the Korean version of the BCSBQ had satisfactory validity and internal consistency. The Cronbach's α of the 3 subscales ranged between .80 and .88. The Korean version of the BCSBQ was confirmed to be a culturally appropriate, valid, and reliable instrument for assessing the beliefs, knowledge, and attitudes to breast cancer and breast cancer screening practices among women of Korean background living in Australia. The Korean version of the BCBSQ can provide nurses with insights into the development of culturally sensitive breast health education programs.
Panepinto, Julie A; Torres, Sylvia; Bendo, Cristiane B; McCavit, Timothy L; Dinu, Bogdan; Sherman-Bien, Sandra; Bemrich-Stolz, Christy; Varni, James W
2014-01-01
Sickle cell disease (SCD) is an inherited blood disorder characterized by a chronic hemolytic anemia that can contribute to fatigue and global cognitive impairment in patients. The study objective was to report on the feasibility, reliability, and validity of the PedsQL™ Multidimensional Fatigue Scale in SCD for pediatric patient self-report ages 5-18 years and parent proxy-report for ages 2-18 years. This was a cross-sectional multi-site study whereby 240 pediatric patients with SCD and 303 parents completed the 18-item PedsQL™ Multidimensional Fatigue Scale. Participants also completed the PedsQL™ 4.0 Generic Core Scales. The PedsQL™ Multidimensional Fatigue Scale evidenced excellent feasibility, excellent reliability for the Total Scale Scores (patient self-report α = 0.90; parent proxy-report α = 0.95), and acceptable reliability for the three individual scales (patient self-report α = 0.77-0.84; parent proxy-report α = 0.90-0.97). Intercorrelations of the PedsQL™ Multidimensional Fatigue Scale with the PedsQL™ Generic Core Scales were predominantly in the large (≥0.50) range, supporting construct validity. PedsQL™ Multidimensional Fatigue Scale Scores were significantly worse with large effects sizes (≥0.80) for patients with SCD than for a comparison sample of healthy children, supporting known-groups discriminant validity. Confirmatory factor analysis demonstrated an acceptable to excellent model fit in SCD. The PedsQL™ Multidimensional Fatigue Scale demonstrated acceptable to excellent measurement properties in SCD. The results demonstrate the relative severity of fatigue symptoms in pediatric patients with SCD, indicating the potential clinical utility of multidimensional assessment of fatigue in patients with SCD in clinical research and practice. © 2013 Wiley Periodicals, Inc.
Varni, James W; Burwinkle, Tasha M; Katz, Ernest R; Meeske, Kathy; Dickinson, Paige
2002-04-01
The Pediatric Quality of Life Inventory (PedsQL) is a modular instrument designed to measure health-related quality of life (HRQOL) in children and adolescents ages 2-18 years. The PedsQL 4.0 Generic Core Scales are multidimensional child self-report and parent proxy-report scales developed as the generic core measure to be integrated with the PedsQL disease specific modules. The PedsQL Multidimensional Fatigue Scale was designed to measure fatigue in pediatric patients. The PedsQL 3.0 Cancer Module was designed to measure pediatric cancer specific HRQOL. The PedsQL Generic Core Scales, Multidimensional Fatigue Scale, and Cancer Module were administered to 339 families (220 child self-reports; 337 parent proxy-reports). Internal consistency reliability for the PedsQL Generic Core Total Scale Score (alpha = 0.88 child, 0.93 parent report), Multidimensional Fatigue Total Scale Score (alpha = 0.89 child, 0.92 parent report) and most Cancer Module Scales (average alpha = 0.72 child, 0.87 parent report) demonstrated reliability acceptable for group comparisons. Validity was demonstrated using the known-groups method. The PedsQL distinguished between healthy children and children with cancer as a group, and among children on-treatment versus off-treatment. The validity of the PedsQL Multidimensional Fatigue Scale was further demonstrated through hypothesized intercorrelations with dimensions of generic and cancer specific HRQOL. The results demonstrate the reliability and validity of the PedsQL Generic Core Scales, Multidimensional Fatigue Scale, and Cancer Module in pediatric cancer. The PedsQL may be utilized as an outcome measure in clinical trials, research, and clinical practice. Copyright 2002 American Cancer Society.
PedsQL™ Multidimensional Fatigue Scale in Sickle Cell Disease: Feasibility, Reliability and Validity
Panepinto, Julie A.; Torres, Sylvia; Bendo, Cristiane B.; McCavit, Timothy L.; Dinu, Bogdan; Sherman-Bien, Sandra; Bemrich-Stolz, Christy; Varni, James W.
2013-01-01
Background Sickle cell disease (SCD) is an inherited blood disorder characterized by a chronic hemolytic anemia that can contribute to fatigue and global cognitive impairment in patients. The study objective was to report on the feasibility, reliability, and validity of the PedsQL™ Multidimensional Fatigue Scale in SCD for pediatric patient self-report ages 5–18 years and parent proxy-report for ages 2–18 years. Procedure This was a cross-sectional multi-site study whereby 240 pediatric patients with SCD and 303 parents completed the 18-item PedsQL™ Multidimensional Fatigue Scale. Participants also completed the PedsQL™ 4.0 Generic Core Scales. Results The PedsQL™ Multidimensional Fatigue Scale evidenced excellent feasibility, excellent reliability for the Total Scale Scores (patient self-report α = 0.90; parent proxy-report α = 0.95), and acceptable reliability for the three individual scales (patient self-report α = 0.77–0.84; parent proxy-report α = 0.90–0.97). Intercorrelations of the PedsQL™ Multidimensional Fatigue Scale with the PedsQL™ Generic Core Scales were predominantly in the large (≥ 0.50) range, supporting construct validity. PedsQL™ Multidimensional Fatigue Scale Scores were significantly worse with large effects sizes (≥0.80) for patients with SCD than for a comparison sample of healthy children, supporting known-groups discriminant validity. Confirmatory factor analysis demonstrated an acceptable to excellent model fit in SCD. Conclusions The PedsQL™ Multidimensional Fatigue Scale demonstrated acceptable to excellent measurement properties in SCD. The results demonstrate the relative severity of fatigue symptoms in pediatric patients with SCD, indicating the potential clinical utility of multidimensional assessment of fatigue in patients with SCD in clinical research and practice. PMID:24038960
Zawawi, Khalid H; Afify, Ahmed R; Yousef, Mohammed K; Othman, Hisham I; Al-Dharrab, Ayman A
2015-01-01
This longitudinal study was aimed to investigate the association between didactic grades and practical skills for dental students and whether didactic grades can reliability predict the dental students' practical performance. Didactic and practical grades for graduates from the Faculty of Dentistry, King Abdulaziz University, between the years 2009 and 2011 were collected. Four courses were selected: Dental Anatomy, Operative Dentistry, Prosthodontics, and Orthodontics. Pearson product-moment correlation analyses between didactic and practical scores were conducted. There was only a significant correlation between didactic and practical scores for the Dental Anatomy course (P<0.001). There was also a significant correlation between all four subjects in the didactic scores (P<0.001). Only the scores of male students showed a significant correlation in the Operative Dentistry course (P<0.001). There were no correlations between Orthodontic grades. Moreover, a poor degree of reliability was found between didactic and practical scores for all subjects. Based on the findings of this study, the relationship between didactic grades and practical performance is course specific. Didactic grades do not reliably predict the students' practical skills. Measuring practical performances should be independent from didactic grading.
Zawawi, Khalid H; Afify, Ahmed R; Yousef, Mohammed K; Othman, Hisham I; Al-Dharrab, Ayman A
2015-01-01
Objectives This longitudinal study was aimed to investigate the association between didactic grades and practical skills for dental students and whether didactic grades can reliability predict the dental students’ practical performance. Materials and methods Didactic and practical grades for graduates from the Faculty of Dentistry, King Abdulaziz University, between the years 2009 and 2011 were collected. Four courses were selected: Dental Anatomy, Operative Dentistry, Prosthodontics, and Orthodontics. Pearson product-moment correlation analyses between didactic and practical scores were conducted. Results There was only a significant correlation between didactic and practical scores for the Dental Anatomy course (P<0.001). There was also a significant correlation between all four subjects in the didactic scores (P<0.001). Only the scores of male students showed a significant correlation in the Operative Dentistry course (P<0.001). There were no correlations between Orthodontic grades. Moreover, a poor degree of reliability was found between didactic and practical scores for all subjects. Conclusion Based on the findings of this study, the relationship between didactic grades and practical performance is course specific. Didactic grades do not reliably predict the students’ practical skills. Measuring practical performances should be independent from didactic grading. PMID:25878519
Professional monitoring and critical incident reporting using personal digital assistants.
Bent, Paul D; Bolsin, Stephen N; Creati, Bernie J; Patrick, Andrew J; Colson, Mark E
2002-11-04
To assess the practicality of using personal digital assistants (PDAs) for the collection of logbook data, procedural performance data and critical incident reports in anaesthetic trainees. Pilot study. Two tertiary referral centres (in Victoria and New Zealand) and a large district hospital in Queensland. Six accredited Australian and New Zealand College of Anaesthetists (ANZCA) registrars and their ANZCA training supervisors. Registrars and supervisors underwent initial training for one hour, and supervisors were provided with ongoing support. Reliable use of the program, average time for data entry and number of procedures logged. ANZCA trainees reliably enter data into PDAs. The data can be transferred to a central database, where they can be remotely analysed before results are fed back to trainees. This technology can be used to monitor professional performance in ANZCA trainees.
Evidence-based practice guidelines in OHS: are they agree-able?
Hulshof, Carel; Hoenen, John
2007-01-01
The purpose of this study was to evaluate the acceptance, validity, reliability and feasibility of the AGREE (Appraisal of Guidelines and REsearch and Evaluation) instrument to assess the quality of evidence-based practice guidelines for occupational physicians. In total, 6 practice guidelines of the Netherlands Society of Occupational Medicine (NVAB) were appraised by 20 occupational health professionals and experts in guideline development or implementation. Although appraisers often disagreed on individual item scores, the internal consistency and interrater reliability for most domains was sufficient. The AGREE criteria were in general considered relevant and no major suggestions for additional items for use in the context of occupational health were brought up. The domain scores for the individual guidelines show a wide variety: 'applicability' had on average the lowest mean score (53%) while 'scope and purpose' had the highest one (87%). Low scores indicate where improvements are possible and necessary, e.g. by providing more information about the development. Key experts in occupational health report that AGREE is a relevant and easy to use instrument to evaluate quality aspects and the included criteria provide a good framework to develop or update evidence-based practice guidelines in the field of occupational health.
Federal Register 2010, 2011, 2012, 2013, 2014
2011-02-24
... including cybersecurity best practices, media security and reliability best practices, transition to Next... Cybersecurity and Communications Reliability Public Safety and Homeland Security Bureau, Federal Communications... Chief for Cybersecurity and Communications Reliability. [FR Doc. 2011-4211 Filed 2-23-11; 8:45 am...
Gorlin, Eugenia I; Dalrymple, Kristy; Chelminski, Iwona; Zimmerman, Mark
2016-08-30
Despite growing recognition that the symptoms and functional impairments of Attention Deficit/Hyperactivity Disorder (ADHD) persist into adulthood, only a few psychometrically sound diagnostic measures have been developed for the assessment of ADHD in adults, and none have been validated for use in a broad treatment-seeking psychiatric sample. The current study presents the reliability and validity of a semi-structured DSM-based diagnostic interview module for ADHD, which was administered to 1194 adults presenting to an outpatient psychiatric practice. The module showed excellent internal consistency and interrater reliability, good convergent and discriminant validity (as indexed by relatively high correlations with self-report measures of ADHD and ADHD-related constructs and little or no correlation with other, non-ADHD symptom domains), and good construct validity (as indexed by significantly higher rates of psychosocial impairment and self-reported family history of ADHD in individuals who meet criteria for an ADHD diagnosis). This instrument is thus a reliable and valid diagnostic tool for the detection of ADHD in adults presenting for psychiatric evaluation and treatment. Published by Elsevier Ireland Ltd.
Hanauer, David I.; Bauerle, Cynthia
2015-01-01
Science, technology, engineering, and mathematics education reform efforts have called for widespread adoption of evidence-based teaching in which faculty members attend to student outcomes through assessment practice. Awareness about the importance of assessment has illuminated the need to understand what faculty members know and how they engage with assessment knowledge and practice. The Faculty Self-Reported Assessment Survey (FRAS) is a new instrument for evaluating science faculty assessment knowledge and experience. Instrument validation was composed of two distinct studies: an empirical evaluation of the psychometric properties of the FRAS and a comparative known-groups validation to explore the ability of the FRAS to differentiate levels of faculty assessment experience. The FRAS was found to be highly reliable (α = 0.96). The dimensionality of the instrument enabled distinction of assessment knowledge into categories of program design, instrumentation, and validation. In the known-groups validation, the FRAS distinguished between faculty groups with differing levels of assessment experience. Faculty members with formal assessment experience self-reported higher levels of familiarity with assessment terms, higher frequencies of assessment activity, increased confidence in conducting assessment, and more positive attitudes toward assessment than faculty members who were novices in assessment. These results suggest that the FRAS can reliably and validly differentiate levels of expertise in faculty knowledge of assessment. PMID:25976653
Limit states and reliability-based pipeline design. Final report
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zimmerman, T.J.E.; Chen, Q.; Pandey, M.D.
1997-06-01
This report provides the results of a study to develop limit states design (LSD) procedures for pipelines. Limit states design, also known as load and resistance factor design (LRFD), provides a unified approach to dealing with all relevant failure modes combinations of concern. It explicitly accounts for the uncertainties that naturally occur in the determination of the loads which act on a pipeline and in the resistance of the pipe to failure. The load and resistance factors used are based on reliability considerations; however, the designer is not faced with carrying out probabilistic calculations. This work is done during developmentmore » and periodic updating of the LSD document. This report provides background information concerning limits states and reliability-based design (Section 2), gives the limit states design procedures that were developed (Section 3) and provides results of the reliability analyses that were undertaken in order to partially calibrate the LSD method (Section 4). An appendix contains LSD design examples in order to demonstrate use of the method. Section 3, Limit States Design has been written in the format of a recommended practice. It has been structured so that, in future, it can easily be converted to a limit states design code format. Throughout the report, figures and tables are given at the end of each section, with the exception of Section 3, where to facilitate understanding of the LSD method, they have been included with the text.« less
ERIC Educational Resources Information Center
Wetzel, Angela Payne
2011-01-01
Previous systematic reviews indicate a lack of reporting of reliability and validity evidence in subsets of the medical education literature. Psychology and general education reviews of factor analysis also indicate gaps between current and best practices; yet, a comprehensive review of exploratory factor analysis in instrument development across…
Report on the Formal Trialling of the Australian Second Language Proficiency Ratings (ASLPR).
ERIC Educational Resources Information Center
Ingram, D. E.
Results of testing the validity and reliability of the Australian second language proficiency ratings (ASLPR) are presented. In addition to providing a framework for planning English as a second language programs for immigrants to Australia, the ASLPR was developed to specify second language proficiency in terms of practical language skills. The…
School Safety in a Post-Sandy Hook World
ERIC Educational Resources Information Center
Trump, Kenneth S.
2014-01-01
In this report the author, who is a school safety expert, provides information about school safety in a post-Sandy Hook world. He presents the following: (1) Continuum of Threats and Responses; (2) The role social media plays; (3) Reliable Best Practices; (4) Policy and Funding--Climate and Context; (5) Policy and Funding--Things to Avoid; and (6)…
ERIC Educational Resources Information Center
St. Louis, Kenneth O.
2011-01-01
Purpose: The "Public Opinion Survey of Human Attributes-Stuttering" ("POSHA-S") was developed to make available worldwide a standard measure of public attitudes toward stuttering that is practical, reliable, valid, and translatable. Mean data from past field studies as comparisons for interpretation of "POSHA-S" results are reported. Method: Means…
Bryant, Elizabeth; Murtagh, Shemane; Finucane, Laura; McCrum, Carol; Mercer, Christopher; Smith, Toby; Canby, Guy; Rowe, David A; Moore, Ann P
2018-05-11
In response for the need of a freely available, stand-alone, validated outcome measure for use within musculoskeletal (MSK) physiotherapy practice, sensitive enough to measure clinical effectiveness, we developed an MSK patient reported outcome measure. This study examined the validity and reliability of the newly developed Brighton musculoskeletal Patient-Reported Outcome Measure (BmPROM) within physiotherapy outpatient settings. Two hundred twenty-four patients attending physiotherapy outpatient departments in South East England with an MSK condition participated in this study. The BmPROM was assessed for user friendliness (rated feedback, N = 224), reliability (internal consistency and test-retest reliability, n = 42), validity (internal and external construct validity, N = 224), and responsiveness (internal, n = 25). Exploratory factor analysis indicated that a two-factor model provides a good fit to the data. Factors were representative of "Functionality" and "Wellbeing". Correlations observed between the BmPROM and SF-36 domains provided evidence of convergent validity. Reliability results indicated that both subscales were internally consistent with alphas above the acceptable limits for both "Functionality" (α = .85, 95% CI [.81, .88]) and 'Wellbeing' (α = .80, 95% CI [.75, .84]). Test-retest analyses (n = 42) demonstrated a high degree of reliability between "Functionality" (ICC = .84; 95% CI [.72, .91]) and "Wellbeing" scores (ICC = .84; 95% CI [.72, .91]). Further examination of test-retest reliability through the Bland-Altman analysis demonstrated that the difference between "Functionality" and "Wellbeing" test scores did not vary as a function of absolute test score. Large treatment effect sizes were found for both subscales (Functionality d = 1.10; Wellbeing 1.03). The BmPROM is a reliable and valid outcome measure for use in evaluating physiotherapy treatment of MSK conditions. Copyright © 2018 John Wiley & Sons, Ltd.
A Best Practice for Developing Availability Guarantee Language in Photovoltaic (PV) O&M Agreements.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Klise, Geoffrey Taylor; Balfour, John
This document outlines the foundation for developing language that can be utilized in an Equipment Availability Guarantee, typically included in an O&M services agreement between a PV system or plant owner and an O&M services provider, or operator. Many of the current PV O&M service agreement Availability Guarantees are based on contracts used for traditional power generation, which create challenges for owners and operators due to the variable nature of grid-tied photovoltaic generating technologies. This report documents language used in early PV availability guarantees and presents best practices and equations that can be used to more openly communicate how themore » reliability of the PV system and plant equipment can be expressed in an availability guarantee. This work will improve the bankability of PV systems by providing greater transparency into the equipment reliability state to all parties involved in an O&M services contract.« less
2011-01-01
Background Psychometric properties include validity, reliability and sensitivity to change. Establishing the psychometric properties of an instrument which measures three-dimensional human posture are essential prior to applying it in clinical practice or research. Methods This paper reports the findings of a systematic literature review which aimed to 1) identify non-invasive three-dimensional (3D) human posture-measuring instruments; and 2) assess the quality of reporting of the methodological procedures undertaken to establish their psychometric properties, using a purpose-build critical appraisal tool. Results Seventeen instruments were identified, of which nine were supported by research into psychometric properties. Eleven and six papers respectively, reported on validity and reliability testing. Rater qualification and reference standards were generally poorly addressed, and there was variable quality reporting of rater blinding and statistical analysis. Conclusions There is a lack of current research to establish the psychometric properties of non-invasive 3D human posture-measuring instruments. PMID:21569486
Oster, Natalia V; Carney, Patricia A; Allison, Kimberly H; Weaver, Donald L; Reisch, Lisa M; Longton, Gary; Onega, Tracy; Pepe, Margaret; Geller, Berta M; Nelson, Heidi D; Ross, Tyler R; Tosteson, Aanna N A; Elmore, Joann G
2013-02-05
Diagnostic test sets are a valuable research tool that contributes importantly to the validity and reliability of studies that assess agreement in breast pathology. In order to fully understand the strengths and weaknesses of any agreement and reliability study, however, the methods should be fully reported. In this paper we provide a step-by-step description of the methods used to create four complex test sets for a study of diagnostic agreement among pathologists interpreting breast biopsy specimens. We use the newly developed Guidelines for Reporting Reliability and Agreement Studies (GRRAS) as a basis to report these methods. Breast tissue biopsies were selected from the National Cancer Institute-funded Breast Cancer Surveillance Consortium sites. We used a random sampling stratified according to woman's age (40-49 vs. ≥50), parenchymal breast density (low vs. high) and interpretation of the original pathologist. A 3-member panel of expert breast pathologists first independently interpreted each case using five primary diagnostic categories (non-proliferative changes, proliferative changes without atypia, atypical ductal hyperplasia, ductal carcinoma in situ, and invasive carcinoma). When the experts did not unanimously agree on a case diagnosis a modified Delphi method was used to determine the reference standard consensus diagnosis. The final test cases were stratified and randomly assigned into one of four unique test sets. We found GRRAS recommendations to be very useful in reporting diagnostic test set development and recommend inclusion of two additional criteria: 1) characterizing the study population and 2) describing the methods for reference diagnosis, when applicable.
Inter-rater reliability of twelve diagnostic systems of schizophrenia.
Helmes, E; Landmark, J; Kazarian, S S
1983-05-01
The present and past symptomatology of 31 chronic schizophrenics was rated by four independent judges, two experienced clinical psychiatrists and two psychiatric residents, in a context more representative of actual clinical practice than most research studies. Ratings were made on 64 symptoms derived from 12 diagnostic systems, based on either live or videotaped interviews for present symptomatology and case records for past symptomatology. Inter-rater reliabilities were higher for present than for past symptoms, and in general did not approach those reported for highly trained raters. There were no differences between live and videotaped interviews. Diagnostic systems differed widely in rater agreement. The most consistent across both past and present symptomatology were the systems of Langfeldt, Schneider, and DSM-III, for which the level of reliability was consistent with other studies.
Tarescavage, Anthony M; Wygant, Dustin B; Boutacoff, Lana I; Ben-Porath, Yossef S
2013-12-01
In the current study, we examined the reliability, validity, and clinical utility of Minnesota Multiphasic Personality Inventory-2-Restructured Form (MMPI-2-RF; Ben-Porath & Tellegen, 2011) scores in a sample of 759 bariatric surgery candidates. We provide descriptives for all scales, internal consistency and standard error of measurement estimates for all substantive scales, external correlates of substantive scales using chart review and self-report criteria, and relative risk ratios to assess the clinical utility of the instrument. Results generally support the reliability, validity, and clinical utility of MMPI-2-RF scale scores in the psychological evaluation of bariatric surgery candidates. Limitations, future directions, and practical application of these results are discussed. (c) 2013 APA, all rights reserved.
Mayo, Ann M
2015-01-01
It is important for CNSs and other APNs to consider the reliability and validity of instruments chosen for clinical practice, evidence-based practice projects, or research studies. Psychometric testing uses specific research methods to evaluate the amount of error associated with any particular instrument. Reliability estimates explain more about how well the instrument is designed, whereas validity estimates explain more about scores that are produced by the instrument. An instrument may be architecturally sound overall (reliable), but the same instrument may not be valid. For example, if a specific group does not understand certain well-constructed items, then the instrument does not produce valid scores when used with that group. Many instrument developers may conduct reliability testing only once, yet continue validity testing in different populations over many years. All CNSs should be advocating for the use of reliable instruments that produce valid results. Clinical nurse specialists may find themselves in situations where reliability and validity estimates for some instruments that are being utilized are unknown. In such cases, CNSs should engage key stakeholders to sponsor nursing researchers to pursue this most important work.
Measures of satisfaction with care during labour and birth: a comparative review
2013-01-01
Background Satisfaction is the one of the most frequently reported outcome measures for quality of care. Assessment of satisfaction with maternity services is crucial, and psychometrically sound measures are needed if this is to inform health practices. This paper comparatively reviews current measures of satisfaction with care during labour and birth. Methods A review of the literature was conducted. Studies were located through computerised databases and hand searching references of identified articles and reviews. Inclusion criteria were that the questionnaire was a multi-item scale of satisfaction with care during labour and birth, and some form of psychometric information (either information about questionnaire construction, or reliability, or validity) had to be reported. Results Nine questionnaires of satisfaction with care during labour and birth were identified. Instruments varied in psychometric properties and dimensions. Most described questionnaire construction and tested some form of reliability and validity. Measures were generally not based on the main theoretical models of satisfaction and varied in scope and application to different types of samples (e.g. satisfaction following caesarean section). For an in-depth measure of satisfaction with intrapartum care, the Intrapartal-Specific Quality from the Patient’s Perspective questionnaire (QPP-I) is recommended. Brief measures with good reliability and validity are provided by the Six Simple Questions (SSQ) or Perceptions of Care Adjective Checklist (PCACL-R). Conclusions Despite the interest in measures of satisfaction there are only a small number of validated measures of satisfaction with care during labour and birth. It is important that brief, reliable and valid measures are available for use in general and specific populations in order to assist research and inform practice. PMID:23656701
Lyon, Aaron R; Pullmann, Michael D; Dorsey, Shannon; Martin, Prerna; Grigore, Alexandra A; Becker, Emily M; Jensen-Doss, Amanda
2018-05-11
Measurement-based care (MBC) is an increasingly popular, evidence-based practice, but there are no tools with established psychometrics to evaluate clinician use of MBC practices in mental health service delivery. The current study evaluated the reliability, validity, and factor structure of scores generated from a brief, standardized tool to measure MBC practices, the Current Assessment Practice Evaluation-Revised (CAPER). Survey data from a national sample of 479 mental health clinicians were used to conduct exploratory and confirmatory factor analyses, as well as reliability and validity analyses (e.g., relationships between CAPER subscales and clinician MBC attitudes). Analyses revealed competing two- and three-factor models. Regardless of the model used, scores from CAPER subscales demonstrated good reliability and convergent and divergent validity with MBC attitudes in the expected directions. The CAPER appears to be a psychometrically sound tool for assessing clinician MBC practices. Future directions for development and application of the tool are discussed.
Test-retest reliability and practice effects of a rapid screen of mild traumatic brain injury.
De Monte, Veronica Eileen; Geffen, Gina Malke; Kwapil, Karleigh
2005-07-01
Test-retest reliabilities and practice effects of measures from the Rapid Screen of Concussion (RSC), in addition to the Digit Symbol Substitution Test (Digit Symbol), were examined. Twenty five male participants were tested three times; each testing session scheduled a week apart. The test-retest reliability estimates for most measures were reasonably good, ranging from .79 to .97. An exception was the delayed word recall test, which has had a reliability estimate of .66 for the first retest, and .59 for the second retest. Practice effects were evident from Times 1 to 2 on the sentence comprehension and delayed recall subtests of the RSC, Digit Symbol and a composite score. There was also a practice effect of the same magnitude found from Time 2 to Time 3 on Digit Symbol, delayed recall and the composite score. Statistics on measures for both the first and second retest intervals, with associated practice effects, are presented to enable the calculation of reliable change indices (RCI). The RCI may be used to assess any improvement in cognitive functioning after mild Traumatic Brain Injury.
The kappa statistic in rehabilitation research: an examination.
Tooth, Leigh R; Ottenbacher, Kenneth J
2004-08-01
The number and sophistication of statistical procedures reported in medical rehabilitation research is increasing. Application of the principles and methods associated with evidence-based practice has contributed to the need for rehabilitation practitioners to understand quantitative methods in published articles. Outcomes measurement and determination of reliability are areas that have experienced rapid change during the past decade. In this study, distinctions between reliability and agreement are examined. Information is presented on analytical approaches for addressing reliability and agreement with the focus on the application of the kappa statistic. The following assumptions are discussed: (1) kappa should be used with data measured on a categorical scale, (2) the patients or objects categorized should be independent, and (3) the observers or raters must make their measurement decisions and judgments independently. Several issues related to using kappa in measurement studies are described, including use of weighted kappa, methods of reporting kappa, the effect of bias and prevalence on kappa, and sample size and power requirements for kappa. The kappa statistic is useful for assessing agreement among raters, and it is being used more frequently in rehabilitation research. Correct interpretation of the kappa statistic depends on meeting the required assumptions and accurate reporting.
ERIC Educational Resources Information Center
Gadermann, Anne M.; Guhn, Martin; Zumbo, Bruno D.
2012-01-01
This paper provides a conceptual, empirical, and practical guide for estimating ordinal reliability coefficients for ordinal item response data (also referred to as Likert, Likert-type, ordered categorical, or rating scale item responses). Conventionally, reliability coefficients, such as Cronbach's alpha, are calculated using a Pearson…
Assessing the reliability of ecotoxicological studies: An overview of current needs and approaches.
Moermond, Caroline; Beasley, Amy; Breton, Roger; Junghans, Marion; Laskowski, Ryszard; Solomon, Keith; Zahner, Holly
2017-07-01
In general, reliable studies are well designed and well performed, and enough details on study design and performance are reported to assess the study. For hazard and risk assessment in various legal frameworks, many different types of ecotoxicity studies need to be evaluated for reliability. These studies vary in study design, methodology, quality, and level of detail reported (e.g., reviews, peer-reviewed research papers, or industry-sponsored studies documented under Good Laboratory Practice [GLP] guidelines). Regulators have the responsibility to make sound and verifiable decisions and should evaluate each study for reliability in accordance with scientific principles regardless of whether they were conducted in accordance with GLP and/or standardized methods. Thus, a systematic and transparent approach is needed to evaluate studies for reliability. In this paper, 8 different methods for reliability assessment were compared using a number of attributes: categorical versus numerical scoring methods, use of exclusion and critical criteria, weighting of criteria, whether methods are tested with case studies, domain of applicability, bias toward GLP studies, incorporation of standard guidelines in the evaluation method, number of criteria used, type of criteria considered, and availability of guidance material. Finally, some considerations are given on how to choose a suitable method for assessing reliability of ecotoxicity studies. Integr Environ Assess Manag 2017;13:640-651. © 2016 The Authors. Integrated Environmental Assessment and Management published by Wiley Periodicals, Inc. on behalf of Society of Environmental Toxicology & Chemistry (SETAC). © 2016 The Authors. Integrated Environmental Assessment and Management published by Wiley Periodicals, Inc. on behalf of Society of Environmental Toxicology & Chemistry (SETAC).
McBride, Ruth; Leroux, Brian; Lindblad, Anne; Williams, O Dale; Lehmann, Maryann; Rindal, D Brad; Botello-Harbaum, Maria; Gilbert, Gregg H; Gillette, Jane; Demko, Catherine
2013-05-01
The National Institute of Dental and Craniofacial Research funded three practice-based research networks (PBRNs), NW-PRECEDENT, PEARL and DPBRN to conduct studies relevant to practicing general dentists. These PBRNs collaborated to develop a questionnaire to assess the impact of network participation on changes in practice patterns. This report presents results from the initial administration of the questionnaire. Questionnaires were administered to network dentists and a non-network reference group. Practice patterns including caries diagnosis and treatment, pulp cap materials, third molar extraction, dentine hypersensitivity treatments and endodontic treatment and restoration were assessed by network, years in practice, and level of network participation. Test-retest reliability of the questionnaire was evaluated. 950 practitioners completed the questionnaire. Test-retest reliability was good-excellent (kappa>0.4) for most questions. Significant differences in responses by network were not observed. The use of caries risk assessment forms differed by both network participation (p<0.001) and years since dental degree (p=0.026). Recent dental graduates are more likely to recommend third molar removal for preventive reasons (p=0.003). Practitioners in the CONDOR research networks are similar to their US colleagues. As a group, however, these practitioners show a more evidence-based approach to their practice. Dental PBRNs have the potential to improve the translation of evidence into daily practice. Designing methods to assess practice change and the associated factors is essential to addressing this important issue. Copyright © 2013 Elsevier Ltd. All rights reserved.
Goldhahn, Jörg; Beaton, Dorcas; Ladd, Amy; Macdermid, Joy; Hoang-Kim, Amy
2014-02-01
Lack of standardization of outcome measurement has hampered an evidence-based approach to clinical practice and research. We adopted a process of reviewing evidence on current use of measures and appropriate theoretical frameworks for health and disability to inform a consensus process that was focused on deriving the minimal set of core domains in distal radius fracture. We agreed on the following seven core recommendations: (1) pain and function were regarded as the primary domains, (2) very brief measures were needed for routine administration in clinical practice, (3) these brief measures could be augmented by additional measures that provide more detail or address additional domains for clinical research, (4) measurement of pain should include measures of both intensity and frequency as core attributes, (5) a numeric pain scale, e.g. visual analogue scale or visual numeric scale or the pain subscale of the patient-reported wrist evaluation (PRWE) questionnaires were identified as reliable, valid and feasible measures to measure these concepts, (6) for function, either the Quick Disability of the arm, shoulder and hand questionnaire or PRWE-function subscale was identified as reliable, valid and feasible measures, and (7) a measure of participation and treatment complications should be considered core outcomes for both clinical practice and research. We used a sound methodological approach to form a comprehensive foundation of content for outcomes in the area of distal radius fractures. We recommend the use of symptom and function as separate domains in the ICF core set in clinical research or practice for patients with wrist fracture. Further research is needed to provide more definitive measurement properties of measures across all domains.
Reliability of two social cognition tests: The combined stories test and the social knowledge test.
Thibaudeau, Élisabeth; Cellard, Caroline; Legendre, Maxime; Villeneuve, Karèle; Achim, Amélie M
2018-04-01
Deficits in social cognition are common in psychiatric disorders. Validated social cognition measures with good psychometric properties are necessary to assess and target social cognitive deficits. Two recent social cognition tests, the Combined Stories Test (COST) and the Social Knowledge Test (SKT), respectively assess theory of mind and social knowledge. Previous studies have shown good psychometric properties for these tests, but the test-retest reliability has never been documented. The aim of this study was to evaluate the test-retest reliability and the inter-rater reliability of the COST and the SKT. The COST and the SKT were administered twice to a group of forty-two healthy adults, with a delay of approximately four weeks between the assessments. Excellent test-retest reliability was observed for the COST, and a good test-retest reliability was observed for the SKT. There was no evidence of practice effect. Furthermore, an excellent inter-rater reliability was observed for both tests. This study shows a good reliability of the COST and the SKT that adds to the good validity previously reported for these two tests. These good psychometrics properties thus support that the COST and the SKT are adequate measures for the assessment of social cognition. Copyright © 2018. Published by Elsevier B.V.
Deliberate Imagery Practice: The Reliability of Using a Retrospective Recall Methodology
ERIC Educational Resources Information Center
Cumming, Jennifer; Hall, Craig; Starkes, Janet L.
2005-01-01
This study examined the reliability of a retrospective recall methodology for providing evidence of deliberate imagery practice. A secondary purpose was to determine which imagery activities constituted the sport-specific definition of deliberate practice (Starkes, Deakin, Allard, Hodges, & Hayes, 1996). Ninety-three Canadian athletes from one…
Mötteli, S; Barbey, J; Keller, C; Bucher, T; Siegrist, M
2016-04-01
As a high-quality diet is associated with a lower risk for several diseases and all-cause mortality, current nutrition education tools provide people with information regarding how to build a healthy and a balanced meal. To assess this basic nutrition knowledge, the research aim was to develop and validate a brief scale to measure the Practical Knowledge about Balanced meals (PKB-7). A pool of 25 items was pretested with experts and laypeople before being tested on a random sample in Switzerland (n=517). For item selection, a Rasch model analysis was applied. The validity and reliability of the new scale were assessed by three additional studies including laypeople (n=597; n=145) and nutrition experts (n=59). The final scale consists of seven multiple-choice items, which met the assumptions of the Rasch model. The validity of the new scale was shown by several aspects: the Rasch model was replicated in a second study, and nutrition experts achieved significantly higher scores than laypeople (t(148)=20.27, P<0.001, d=1.78). In addition, the PKB-7 scale was correlated with other nutrition-related constructs and associated with reported vegetable consumption. Test-retest reliability (r=0.68, P<0.001) was acceptable. The PKB-7 scale is a reliable and a valid Rasch-based instrument in Swiss citizens aged between 18 and 80 years for measuring the practical knowledge about balanced meals based on current dietary guidelines. This brief and easy-to-use scale is intended for application in both research and practice.
Horn, Katyana Kowalchuk; Jennings, Sophie; Richardson, Gillian; Vliet, Ditte Van; Hefford, Cheryl; Abbott, J Haxby
2012-01-01
Systematic review of the literature. To summarize peer-reviewed literature on the reliability, validity, and responsiveness of the Patient-Specific Functional Scale (PSFS), and to identify its use as an outcome measure. Searches were performed of several electronic databases from 1995 to May 2010. Studies included were published articles containing (1) primary research investigating the psychometric and clinimetrics of the PSFS or (2) the implementation of the PSFS as an outcome measure. We assessed the methodological quality of studies included in the first category. Two hundred forty-two articles published from 1994 to May 2010 were identified. Of these, 66 met the inclusion criteria for this review, with 13 reporting the measurement properties of the PSFS, 55 implementing the PSFS as an outcome measure, and 2 doing both of the above. The PSFS was reported to be valid, reliable, and responsive in populations with knee dysfunction, cervical radiculopathy, acute low back pain, mechanical low back pain, and neck dysfunction. The PSFS was found to be reliable and responsive in populations with chronic low back pain. The PSFS was also reported to be valid, reliable, or responsive in individuals with a limited number of acute, subacute, and chronic conditions. This review found that the PSFS is also being used as an outcome measure in many other conditions, despite a lack of published evidence supporting its validity in these conditions. Although the use of the PSFS as an outcome measure is increasing in physiotherapy practice, there are gaps in the research literature regarding its validity, reliability, and responsiveness in many health conditions.
A critical appraisal of instruments to measure outcomes of interprofessional education.
Oates, Matthew; Davidson, Megan
2015-04-01
Interprofessional education (IPE) is believed to prepare health professional graduates for successful collaborative practice. A range of instruments have been developed to measure the outcomes of IPE. An understanding of the psychometric properties of these instruments is important if they are to be used to measure the effectiveness of IPE. This review set out to identify instruments available to measure outcomes of IPE and collaborative practice in pre-qualification health professional students and to critically appraise the psychometric properties of validity, responsiveness and reliability against contemporary standards for instrument design. Instruments were selected from a pool of extant instruments and subjected to critical appraisal to determine whether they satisfied inclusion criteria. The qualitative and psychometric attributes of the included instruments were appraised using a checklist developed for this review. Nine instruments were critically appraised, including the widely adopted Readiness for Interprofessional Learning Scale (RIPLS) and the Interdisciplinary Education Perception Scale (IEPS). Validity evidence for instruments was predominantly based on test content and internal structure. Ceiling effects and lack of scale width contribute to the inability of some instruments to detect change in variables of interest. Limited reliability data were reported for two instruments. Scale development and scoring protocols were generally reported by instrument developers, but the inconsistent application of scoring protocols for some instruments was apparent. A number of instruments have been developed to measure outcomes of IPE in pre-qualification health professional students. Based on reported validity evidence and reliability data, the psychometric integrity of these instruments is limited. The theoretical test construction paradigm on which instruments have been developed may be contributing to the failure of some instruments to detect change in variables of interest following an IPE intervention. These limitations should be considered in any future research on instrument design. © 2015 John Wiley & Sons Ltd.
Callwood, Alison; Cooke, Debbie; Bolger, Sarah; Lemanska, Agnieszka; Allan, Helen
2018-01-01
Universities in the United Kingdom (UK) are required to incorporate values based recruitment (VBR) into their healthcare student selection processes. This reflects an international drive to strengthen the quality of healthcare service provision. This paper presents novel findings in relation to the reliability and predictive validity of multiple mini interviews (MMIs); one approach to VBR widely being employed by universities. To examine the reliability (internal consistency) and predictive validity of MMIs using end of Year One practice outcomes of under-graduate pre-registration adult, child, mental health nursing, midwifery and paramedic practice students. Cross-discipline evaluation study. One university in the United Kingdom. Data were collected in two streams: applicants to A) The September 2014 and 2015 Midwifery Studies programmes; B) September 2015 adult; Child and Mental Health Nursing and Paramedic Practice programmes. Fifty-seven midwifery students commenced their programme in 2014 and 69 in 2015; 47 and 54 agreed to participate and completed Year One respectively. 333 healthcare students commenced their programmes in September 2015. Of these, 281 agreed to participate and completed their first year (180 adult, 33 child and 34 mental health nursing and 34 paramedic practice students). Stream A featured a seven station four-minute model with one interviewer at each station and in Stream B a six station model was employed. Cronbach's alpha was used to assess MMI station internal consistency and Pearson's moment correlation co-efficient to explore associations between participants' admission MMI score and end of Year one clinical practice outcomes (OSCE and mentor grading). Stream A: Significant correlations are reported between midwifery applicant's MMI scores and end of Year One practice outcomes. A multivariate linear regression model demonstrated that MMI score significantly predicted end of Year One practice outcomes controlling for age and academic entry level: coefficients 0.195 (p=0.002) and 0.116 (p=0.002) for OSCE and mentor grading respectively. In Stream B no significant correlations were found between MMI score and practice outcomes measured by mentor grading. Internal consistency for each MMI station was 'excellent' with values ranging from 0.966-0.974 across Streams A and B. This novel, cross-discipline study shows that MMIs are reliable VBR tools which have predictive validity when a seven station model is used. These data are important given the current international use of different MMI models in healthcare student selection processes. Copyright © 2017. Published by Elsevier Ltd.
Insightful practice: a reliable measure for medical revalidation
Guthrie, Bruce; Sullivan, Frank M; Mercer, Stewart W; Russell, Andrew; Bruce, David A
2012-01-01
Background Medical revalidation decisions need to be reliable if they are to reassure on the quality and safety of professional practice. This study tested an innovative method in which general practitioners (GPs) were assessed on their reflection and response to a set of externally specified feedback. Setting and participants 60 GPs and 12 GP appraisers in the Tayside region of Scotland, UK. Methods A feedback dataset was specified as (1) GP-specific data collected by GPs themselves (patient and colleague opinion; open book self-evaluated knowledge test; complaints) and (2) Externally collected practice-level data provided to GPs (clinical quality and prescribing safety). GPs' perceptions of whether the feedback covered UK General Medical Council specified attributes of a ‘good doctor’ were examined using a mapping exercise. GPs' professionalism was examined in terms of appraiser assessment of GPs' level of insightful practice, defined as: engagement with, insight into and appropriate action on feedback data. The reliability of assessment of insightful practice and subsequent recommendations on GPs' revalidation by face-to-face and anonymous assessors were investigated using Generalisability G-theory. Main outcome measures Coverage of General Medical Council attributes by specified feedback and reliability of assessor recommendations on doctors' suitability for revalidation. Results Face-to-face assessment proved unreliable. Anonymous global assessment by three appraisers of insightful practice was highly reliable (G=0.85), as were revalidation decisions using four anonymous assessors (G=0.83). Conclusions Unlike face-to-face appraisal, anonymous assessment of insightful practice offers a valid and reliable method to decide GP revalidation. Further validity studies are needed. PMID:22653078
Kostopoulou, O
The paper describes the process of developing a taxonomy of patient safety in general practice. The methodologies employed included fieldwork, task analysis and confidential reporting of patient-safety events in five West Midlands practices. Reported events were traced back to their root causes and contributing factors. The resulting taxonomy is based on a theoretical model of human cognition, includes multiple levels of classification to reflect the chain of causation and considers affective and physiological influences on performance. Events are classified at three levels. At level one, the information-processing model of cognition is used to classify errors. At level two, immediate causes are identified, internal and external to the individual. At level three, more remote causal factors are classified as either 'work organization' or 'technical' with subcategories. The properties of the taxonomy (validity, reliability, comprehensiveness) as well as its usability and acceptability remain to be tested with potential users.
Reliability Analysis of Systems Subject to First-Passage Failure
NASA Technical Reports Server (NTRS)
Lutes, Loren D.; Sarkani, Shahram
2009-01-01
An obvious goal of reliability analysis is the avoidance of system failure. However, it is generally recognized that it is often not feasible to design a practical or useful system for which failure is impossible. Thus it is necessary to use techniques that estimate the likelihood of failure based on modeling the uncertainty about such items as the demands on and capacities of various elements in the system. This usually involves the use of probability theory, and a design is considered acceptable if it has a sufficiently small probability of failure. This report contains findings of analyses of systems subject to first-passage failure.
ERIC Educational Resources Information Center
Chen, Ai-Hong; Buari, Noor Halilah; Jufri, Shauqiah
2017-01-01
Passages with continuous sentences are commonly used for the assessment of reading performance related to visual function, and rehabilitation in optometric practices. Passages created in native languages are crucial for a reliable interpretation in a real scenario. This study aimed to report the development of SAH Reading Passage Compendium…
ERIC Educational Resources Information Center
Arkoudis, Sophie; O'Loughlin, Kieran
2004-01-01
This article reports on a collaborative study involving ESL teachers in an Australian English Language Centre as they work through some of their concerns about reliability and validity in their assessment practices. The focus of this article is on how teachers work with the Curriculum Standards Framework (CSF) as an assessment tool. The discussion…
ERIC Educational Resources Information Center
What Works Clearinghouse, 2012
2012-01-01
This study examined five instruments used to assess the effectiveness of teacher practices based on classroom observations. The study first examined whether observers could reliably assess teachers with each instrument, and then examined how well each instrument, along with other information, predicted student achievement. The study reported that,…
Parent-Child Agreement of Anxiety Symptoms in Youth with Autism Spectrum Disorders
ERIC Educational Resources Information Center
Blakeley-Smith, Audrey; Reaven, Judy; Ridge, Katherine; Hepburn, Susan
2012-01-01
Best practice for assessment of anxiety symptoms in children suggests that child self-report is an important element to consider. Yet, it is not known if it is a reliable assessment method for children with high-functioning autism spectrum disorders (ASDs). The present study examines the extent to which verbally fluent children with ASD and their…
Barriers to medication error reporting among hospital nurses.
Rutledge, Dana N; Retrosi, Tina; Ostrowski, Gary
2018-03-01
The study purpose was to report medication error reporting barriers among hospital nurses, and to determine validity and reliability of an existing medication error reporting barriers questionnaire. Hospital medication errors typically occur between ordering of a medication to its receipt by the patient with subsequent staff monitoring. To decrease medication errors, factors surrounding medication errors must be understood; this requires reporting by employees. Under-reporting can compromise patient safety by disabling improvement efforts. This 2017 descriptive study was part of a larger workforce engagement study at a faith-based Magnet ® -accredited community hospital in California (United States). Registered nurses (~1,000) were invited to participate in the online survey via email. Reported here are sample demographics (n = 357) and responses to the 20-item medication error reporting barriers questionnaire. Using factor analysis, four factors that accounted for 67.5% of the variance were extracted. These factors (subscales) were labelled Fear, Cultural Barriers, Lack of Knowledge/Feedback and Practical/Utility Barriers; each demonstrated excellent internal consistency. The medication error reporting barriers questionnaire, originally developed in long-term care, demonstrated good validity and excellent reliability among hospital nurses. Substantial proportions of American hospital nurses (11%-48%) considered specific factors as likely reporting barriers. Average scores on most barrier items were categorised "somewhat unlikely." The highest six included two barriers concerning the time-consuming nature of medication error reporting and four related to nurses' fear of repercussions. Hospitals need to determine the presence of perceived barriers among nurses using questionnaires such as the medication error reporting barriers and work to encourage better reporting. Barriers to medication error reporting make it less likely that nurses will report medication errors, especially errors where patient harm is not apparent or where an error might be hidden. Such under-reporting impedes collection of accurate medication error data and prevents hospitals from changing harmful practices. © 2018 John Wiley & Sons Ltd.
Measures of shoulder protraction and thoracolumbar rotation.
Schenkman, M; Laub, K C; Kuchibhatla, M; Ray, L; Shinberg, M
1997-05-01
Physical therapists need objective measures that can be used reliably with a variety of subject groups to document upper quadrant function. Two aspects of upper quadrant motion, shoulder protraction and thoracolumbar rotation, are assessed routinely in clinical practice, but no standard measurement techniques have been reported. We hypothesized that there would be significant differences, by age and state of health, for both shoulder protraction and thoracolumbar rotation. The purposes of this study were: 1) to develop measurement approaches for shoulder protraction and thoracolumbar rotation; 2) to determine if there are significant differences in these motions for four subject groups: healthy young, healthy elders, functionally limited elders, and people with Parkinson's disease; and 3) to describe between-rater and within-rater reliability for these measures. Fifty-five subjects participated in this investigation. All subjects were rated by a physical therapist and two research assistants. Using an analysis of variance followed by Scheffe's post hoc analysis, significant differences were demonstrated between the groups. Between-rater and within-rater reliability ranged from ICCs of 0.54 to 0.95. Clinicians can use these measures to quantify aspects of upper quadrant function treated routinely in physical therapy practice. These measures also have applicability for researchers.
Rodger, Sylvia; Turpin, Merrill; Copley, Jodie; Coleman, Allison; Chien, Chi-Wen; Caine, Anne-Maree; Brown, Ted
2014-08-01
The reliable evaluation of occupational therapy students completing practice education placements along with provision of appropriate feedback is critical for both students and for universities from a quality assurance perspective. This study describes the development of a comment bank for use with an online version of the Student Practice Evaluation Form-Revised Edition (SPEF-R Online) and investigates its reliability. A preliminary bank of 109 individual comments (based on previous students' placement performance) was developed via five stages. These comments reflected all 11 SPEF-R domains. A purpose-designed online survey was used to examine the reliability of the comment bank. A total of 37 practice educators returned surveys, 31 of which were fully completed. Participants were asked to rate each individual comment using the five-point SPEF-R rating scale. One hundred and two of 109 comments demonstrated satisfactory agreement with their respective default ratings that were determined by the development team. At each domain level, the intra-class correlation coefficients (ranging between 0.86 and 0.96) also demonstrated good to excellent inter-rater reliability. There were only seven items that required rewording prior to inclusion in the final SPEF-R Online comment bank. The development of the SPEF-R Online comment bank offers a source of reliable comments (consistent with the SPEF-R rating scale across different domains) and aims to assist practice educators in providing reliable and timely feedback to students in a user-friendly manner. © 2014 Occupational Therapy Australia.
Media Reporting of Practice-Changing Clinical Trials in Oncology: A North American Perspective.
Andrew, Peter; Vickers, Michael M; O'Connor, Stephen; Valdes, Mario; Tang, Patricia A
2016-03-01
Media reporting of clinical trials impacts patient-oncologist interactions. We sought to characterize the accuracy of media and Internet reporting of practice-changing clinical trials in oncology. The first media articles referencing 17 practice-changing clinical trials were collected from 4 media outlets: newspapers, cable news, cancer websites, and industry websites. Measured outcomes were media reporting score, social media score, and academic citation score. The media reporting score was a measure of completeness of information detailed in media articles as scored by a 15-point scoring instrument. The social media score represented the ubiquity of social media presence referencing 17 practice-changing clinical trials in cancer as determined by the American Society of Clinical Oncology in its annual report, entitled Clinical Cancer Advances 2012; social media score was calculated from Twitter, Facebook, and Google searches. The academic citation score comprised total citations from Google Scholar plus the Scopus database, which represented the academic impact per clinical cancer advance. From 170 media articles, 107 (63%) had sufficient data for analysis. Cohen's κ coefficient demonstrated reliability of the media reporting score instrument with a coefficient of determination of 94%. Per the media reporting score, information was most complete from industry, followed by cancer websites, newspapers, and cable news. The most commonly omitted items, in descending order, were study limitations, exclusion criteria, conflict of interest, and other. The social media score was weakly correlated with academic citation score. Media outlets appear to have set a low bar for coverage of many practice-changing advances in oncology, with reports of scientific breakthroughs often omitting basic study facts and cautions, which may mislead the public. The media should be encouraged to use a standardized reporting template and provide accessible references to original source information whenever feasible. ©AlphaMed Press.
Illustrated structural application of universal first-order reliability method
NASA Technical Reports Server (NTRS)
Verderaime, V.
1994-01-01
The general application of the proposed first-order reliability method was achieved through the universal normalization of engineering probability distribution data. The method superimposes prevailing deterministic techniques and practices on the first-order reliability method to surmount deficiencies of the deterministic method and provide benefits of reliability techniques and predictions. A reliability design factor is derived from the reliability criterion to satisfy a specified reliability and is analogous to the deterministic safety factor. Its application is numerically illustrated on several practical structural design and verification cases with interesting results and insights. Two concepts of reliability selection criteria are suggested. Though the method was developed to support affordable structures for access to space, the method should also be applicable for most high-performance air and surface transportation systems.
Trend analysis of dental practice rent and mortgage expenses: 1989-1995.
Brown, L J; Lazar, V
1999-03-01
The authors analyzed trends for rent and mortgage, as reported by independent active private practitioners, for the period of 1989-1995. Rent and mortgage were analyzed overall and by different characteristics. In the ADA's annual "Survey of Dental Practice," dentists reported gross billings and net incomes, as well as itemized practice expenses. The authors tabulated survey responses over time to develop trends and compare rent and mortgage expenses for each year. If trends over time were not exhibited, the authors combined multiple years of data to develop more reliable statistics. Approximately three-fourths of independent active private practitioners reported rent but no mortgage expenses. The percentages of practitioners who reported rent or mortgage was stable over time. The percentage reporting rent only as an expense increased with the number of dentists in the practice, the age of the reporting dentist and the number of office locations. In contrast, the opposite was true for dentists reporting mortgage only as an expense. The authors found that office rental is more common than is office ownership and that mortgages as a percentage of gross billings and in dollar amounts were similar to rents. They also found that because the cost of office space as a percentage of gross billings decreased as the number of dentists in the practice increased, limited economies of scale may be present in the cost of dental office space. There is not a large financial advantage to ownership, so renting is likely to remain an important way to acquire office space. As only limited economies of scale exist in office space expenses, many group practices may choose to rent office space.
Is the medical justice system broken?
Howard, Philip K
2003-09-01
The current lawsuit culture is creating a crisis in US health care. The broad perception that anyone can sue for almost anything has fundamentally altered the practice of medicine, eroding the quality and availability of health care. Current reform proposals to "cap" one category of damages are not nearly ambitious enough. Providing relief to doctors squeezed by insurance premiums is important but will not heal the deep distrust that skews daily decisions, nor will it provide incentives to overhaul outdated practices. The United States needs an entirely new system of medical justice. Its first goal is to be reliable: reliable in protecting patients against bad practices, reliable in protecting physicians who act reasonably, and reliable in interpreting standards of care.
Patrick, Donald L; Giuliano, François; Ho, Kai Fai; Gagnon, Dennis D; McNulty, Pauline; Rothman, Margaret
2009-02-01
To evaluate the reliability and validity of the Premature Ejaculation Profile (PEP), a self-reported outcome instrument for evaluating domains of PE and its treatment, comprised of four single-item measures, a profile, and an index score. Data were from men participating in observational studies in the USA (PE, 207 men; non-PE, 1380) and Europe (PE, 201; non-PE, 914) and from men with PE (1238) participating in a phase III randomized, placebo-controlled clinical trial of dapoxetine. The PEP contains four measures: perceived control over ejaculation, personal distress related to ejaculation, satisfaction with sexual intercourse, and interpersonal difficulty related to ejaculation, each assessed on five-point response scales. Test-retest reliability, known-groups validity, and ability to detect a patient-reported global impression of change (PGI) in condition were evaluated for the individual PEP measures and a PEP index score (the mean of all four measures). Profile analysis was conducted using multivariate analysis of variance. All PEP measures showed acceptable reliability (intraclass correlation coefficients ranged from 0.66 to 0.83) and mean scores for all measures differed significantly between PE and non-PE groups (P < 0.001). Men who reported a reduction in PE with treatment in the phase III trial had significantly greater scores on each of the four measures. The PEP profiles of men with and without PE differed significantly (P < 0.001) in both observational studies; higher levels of PGI were associated with higher PEP profiles (P < 0.001). The PEP index score also showed acceptable reliability and was significantly different between the PE and non-PE groups (P < 0.001). Men who reported an improvement in PE with treatment in the phase III trial had significantly greater PEP index scores. In the phase III trial, nausea was the most common adverse event with dapoxetine. The PEP provides a reliable, valid, and interpretable measure for use in monitoring outcomes of men with PE.
Ajja, Rahma; Beets, Michael W; Chandler, Jessica; Kaczynski, Andrew T; Ward, Dianne S
2015-08-01
There is a growing interest in evaluating the physical activity (PA) and healthy eating (HE) policy and practice environment characteristics in settings frequented by youth (≤18years). This review evaluates the measurement properties of audit tools designed to assess PA and HE policy and practice environment characteristics in settings that care for youth (e.g., childcare, school, afterschool, summer camp). Three electronic databases, reference lists, educational department and national health organizations' web pages were searched between January 1980 and February 2014 to identify tools assessing PA and/or HE policy and practice environments in settings that care for youth (≤18years). Sixty-five audit tools were identified of which 53 individual tools met the inclusion criteria. Thirty-three tools assessed both the PA and HE domains, 6 assessed the PA domain and 14 assessed the HE domain solely. The majority of the tools were self-assessment tools (n=40), and were developed to assess the PA and/or HE environment in school settings (n=33), childcare (n=12), and after school programs (n=4). Four tools assessed the community at-large and had sections for assessing preschool, school and/or afterschool settings within the tool. The majority of audit tools lacked validity and/or reliability data (n=42). Inter-rater reliability and construct validity were the most frequently reported reliability (n=7) and validity types (n=5). Limited attention has been given to establishing the reliability and validity of audit tools for settings that care for youth. Future efforts should be directed towards establishing a strong measurement foundation for these important environmental audit tools. Published by Elsevier Inc.
Test-retest and between-site reliability in a multicenter fMRI study.
Friedman, Lee; Stern, Hal; Brown, Gregory G; Mathalon, Daniel H; Turner, Jessica; Glover, Gary H; Gollub, Randy L; Lauriello, John; Lim, Kelvin O; Cannon, Tyrone; Greve, Douglas N; Bockholt, Henry Jeremy; Belger, Aysenil; Mueller, Bryon; Doty, Michael J; He, Jianchun; Wells, William; Smyth, Padhraic; Pieper, Steve; Kim, Seyoung; Kubicki, Marek; Vangel, Mark; Potkin, Steven G
2008-08-01
In the present report, estimates of test-retest and between-site reliability of fMRI assessments were produced in the context of a multicenter fMRI reliability study (FBIRN Phase 1, www.nbirn.net). Five subjects were scanned on 10 MRI scanners on two occasions. The fMRI task was a simple block design sensorimotor task. The impulse response functions to the stimulation block were derived using an FIR-deconvolution analysis with FMRISTAT. Six functionally-derived ROIs covering the visual, auditory and motor cortices, created from a prior analysis, were used. Two dependent variables were compared: percent signal change and contrast-to-noise-ratio. Reliability was assessed with intraclass correlation coefficients derived from a variance components analysis. Test-retest reliability was high, but initially, between-site reliability was low, indicating a strong contribution from site and site-by-subject variance. However, a number of factors that can markedly improve between-site reliability were uncovered, including increasing the size of the ROIs, adjusting for smoothness differences, and inclusion of additional runs. By employing multiple steps, between-site reliability for 3T scanners was increased by 123%. Dropping one site at a time and assessing reliability can be a useful method of assessing the sensitivity of the results to particular sites. These findings should provide guidance toothers on the best practices for future multicenter studies.
Varni, James W; Limbers, Christine A; Burwinkle, Tasha M
2007-01-03
Health-related quality of life (HRQOL) measurement has emerged as an important health outcome in clinical trials, clinical practice improvement strategies, and healthcare services research and evaluation. While pediatric patient self-report should be considered the standard for measuring perceived HRQOL, there are circumstances when children are too young, too cognitively impaired, too ill or fatigued to complete a HRQOL instrument, and reliable and valid parent proxy-report instruments are needed in such cases. Further, it is typically parents' perceptions of their children's HRQOL that influences healthcare utilization. Data from the PedsQL DatabaseSM were utilized to test the reliability and validity of parent proxy-report at the individual age subgroup level for ages 2-16 years as recommended by recent FDA guidelines. The sample analyzed represents parent proxy-report age data on 13,878 children ages 2 to 16 years from the PedsQL 4.0 Generic Core Scales DatabaseSM. Parents were recruited from general pediatric clinics, subspecialty clinics, and hospitals in which their children were being seen for well-child checks, mild acute illness, or chronic illness care (n = 3,718, 26.8%), and from a State Children's Health Insurance Program (SCHIP) in California (n = 10,160, 73.2%). The percentage of missing item responses for the parent proxy-report sample as a whole was 2.1%, supporting feasibility. The majority of the parent proxy-report scales across the age subgroups exceeded the minimum internal consistency reliability standard of 0.70 required for group comparisons, while the Total Scale Scores across the age subgroups approached or exceeded the reliability criterion of 0.90 recommended for analyzing individual patient scale scores. Construct validity was demonstrated utilizing the known groups approach. For each PedsQL scale and summary score, across age subgroups, healthy children demonstrated a statistically significant difference in HRQOL (better HRQOL) than children with a known chronic health condition, with most effect sizes in the medium to large effect size range. The results demonstrate the feasibility, reliability, and validity of parent proxy-report at the individual age subgroup for ages 2-16 years. These analyses are consistent with recent FDA guidelines which require instrument development and validation testing for children and adolescents within fairly narrow age groupings and which determine the lower age limit at which reliable and valid responses across age categories are achievable. Even as pediatric patient self-report is advocated, there remains a fundamental role for parent proxy-report in pediatric clinical trials and health services research.
Failure Analysis Study and Long-Term Reliability of Optical Assemblies with End-Face Damage
NASA Technical Reports Server (NTRS)
Kichak, Robert A.; Ott, Melanie N.; Leidecker, Henning W.; Chuska, Richard F.; Greenwell, Christopher J.
2008-01-01
In June 2005, the NESC received a multi-faceted request to determine the long term reliability of fiber optic termini on the ISS that exhibited flaws not manufactured to best workmanship practices. There was a lack of data related to fiber optic workmanship as it affects the long term reliability of optical fiber assemblies in a harsh environment. A fiber optic defect analysis was requested which would find and/or create various types of chips, spalls, scratches, etc., that were identified by the ISS personnel. Once the defects and causes were identified the next step would be to perform long term reliability testing of similar assemblies with similar defects. The goal of the defect analysis would be for the defects to be observed and documented for deterioration of fiber optic performance. Though this report mostly discusses what has been determined as evidence of poor manufacturing processes, it also concludes the majority of the damage could have been avoided with a rigorous process in place.
Design Protocols and Analytical Strategies that Incorporate Structural Reliability Models
NASA Technical Reports Server (NTRS)
Duffy, Stephen F.
1995-01-01
The general goal of this project is to establish design protocols that enable the engineer to analyze and predict certain types of behavior in ceramic composites. Sections of the final report addresses the following: Description of the Problem that Motivated the Technology Development, Description of the New Technology that was Developed, Unique and Novel Features of the Technology and Results/Benefits of Application (year by year accomplishments), and Utilization of New Technology in Non-Aerospace Applications. Activities for this reporting period included the development of a design analysis as part of a cooperative agreement with general Electric Aircraft Engines. The effort focused on modifying the Toughened Ceramics Analysis and Reliability Evaluation of Structures (TCARES) algorithm for use in the design of engine components fabricated from NiAl. Other activities related to the development of an ASTM standard practice for estimating Weibull parameters. The standard focuses on the evaluation and reporting of uniaxial strength data, and the estimation of probability distribution parameters for ceramics which fail in a brittle fashion.
Zbrozek, Arthur; Hebert, Joy; Gogates, Gregory; Thorell, Rod; Dell, Christopher; Molsen, Elizabeth; Craig, Gretchen; Grice, Kenneth; Kern, Scottie; Hines, Sheldon
2013-06-01
Outcomes research literature has many examples of high-quality, reliable patient-reported outcome (PRO) data entered directly by electronic means, ePRO, compared to data entered from original results on paper. Clinical trial managers are increasingly using ePRO data collection for PRO-based end points. Regulatory review dictates the rules to follow with ePRO data collection for medical label claims. A critical component for regulatory compliance is evidence of the validation of these electronic data collection systems. Validation of electronic systems is a process versus a focused activity that finishes at a single point in time. Eight steps need to be described and undertaken to qualify the validation of the data collection software in its target environment: requirements definition, design, coding, testing, tracing, user acceptance testing, installation and configuration, and decommissioning. These elements are consistent with recent regulatory guidance for systems validation. This report was written to explain how the validation process works for sponsors, trial teams, and other users of electronic data collection devices responsible for verifying the quality of the data entered into relational databases from such devices. It is a guide on the requirements and documentation needed from a data collection systems provider to demonstrate systems validation. It is a practical source of information for study teams to ensure that ePRO providers are using system validation and implementation processes that will ensure the systems and services: operate reliably when in practical use; produce accurate and complete data and data files; support management control and comply with any existing regulations. Furthermore, this short report will increase user understanding of the requirements for a technology review leading to more informed and balanced recommendations or decisions on electronic data collection methods. Copyright © 2013 International Society for Pharmacoeconomics and Outcomes Research (ISPOR). Published by Elsevier Inc. All rights reserved.
Zaki, Rafdzah; Bulgiba, Awang; Nordin, Noorhaire; Azina Ismail, Noor
2013-06-01
Reliability measures precision or the extent to which test results can be replicated. This is the first ever systematic review to identify statistical methods used to measure reliability of equipment measuring continuous variables. This studyalso aims to highlight the inappropriate statistical method used in the reliability analysis and its implication in the medical practice. In 2010, five electronic databases were searched between 2007 and 2009 to look for reliability studies. A total of 5,795 titles were initially identified. Only 282 titles were potentially related, and finally 42 fitted the inclusion criteria. The Intra-class Correlation Coefficient (ICC) is the most popular method with 25 (60%) studies having used this method followed by the comparing means (8 or 19%). Out of 25 studies using the ICC, only 7 (28%) reported the confidence intervals and types of ICC used. Most studies (71%) also tested the agreement of instruments. This study finds that the Intra-class Correlation Coefficient is the most popular method used to assess the reliability of medical instruments measuring continuous outcomes. There are also inappropriate applications and interpretations of statistical methods in some studies. It is important for medical researchers to be aware of this issue, and be able to correctly perform analysis in reliability studies.
Kilov, Andrea M; Togher, Leanne; Power, Emma
2015-01-01
To determine test-re-test reliability of the 'Computer User Profile' (CUP) in people with and without TBI. The CUP was administered on two occasions to people with and without TBI. The CUP investigated the nature and frequency of participants' computer and Internet use. Intra-class correlation coefficients and kappa coefficients were conducted to measure reliability of individual CUP items. Descriptive statistics were used to summarize content of responses. Sixteen adults with TBI and 40 adults without TBI were included in the study. All participants were reliable in reporting demographic information, frequency of social communication and leisure activities and computer/Internet habits and usage. Adults with TBI were reliable in 77% of their responses to survey items. Adults without TBI were reliable in 88% of their responses to survey items. The CUP was practical and valuable in capturing information about social, leisure, communication and computer/Internet habits of people with and without TBI. Adults without TBI scored more items with satisfactory reliability overall in their surveys. Future studies may include larger samples and could also include an exploration of how people with/without TBI use other digital communication technologies. This may provide further information on determining technology readiness for people with TBI in therapy programmes.
Leifker, Feea R.; Patterson, Thomas L.; Bowie, Christopher R.; Mausbach, Brent T.; Harvey, Philip D.
2010-01-01
Performance-based measures of the ability to perform social and everyday living skills are being more widely used to assess functional capacity in people with serious mental illnesses such as schizophrenia and bipolar disorder. Since they are also being used as outcome measures in pharmacological and cognitive remediation studies aimed at cognitive impairments in schizophrenia, understanding their measurement properties and potential sensitivity to change is important. In this study, the test-retest reliability, practice effects, and reliable change indices of two different performance-based functional capacity measures, the UCSD Performance-based skills assessment (UPSA) and Social skills performance assessment (SSPA) were examined over several different retest intervals in two different samples of people with schizophrenia (n’s=238 and 116) and a healthy comparison sample (n=109). These psychometric properties were compared to those of a neuropsychological assessment battery. Test-retest reliabilities of the long form of the UPSA ranged from r=.63 to r=.80 over follow-up periods up to 36 months in people with schizophrenia, while brief UPSA reliabilities ranged from r=.66 to r=.81. Test-retest reliability of the NP performance scores ranged from r=.77 to r=.79. Test-retest reliabilities of the UPSA were lower in healthy controls, while NP performance was slightly more reliable. SSPA test-retest reliability was lower. Practice effect sizes ranged from .05 to .16 for the UPSA and .07 to .19 for the NP assessment in patients, with HC having more practice effects. Reliable change intervals were consistent across NP and both FC measures, indicating equal potential for detection of change. These performance-based measures of functional capacity appear to have similar potential to be sensitive to change compared to NP performance in people with schizophrenia. PMID:20399613
Komro, Kelli A; Livingston, Melvin D; Kominsky, Terrence K; Livingston, Bethany J; Garrett, Brady A; Molina, Mildred Maldonado; Boyd, Misty L
2015-01-01
Objective: American Indians (AIs) suffer from significant alcohol-related health disparities, and increased risk begins early. This study examined the reliability and validity of measures to be used in a preventive intervention trial. Reliability and validity across racial/ethnic subgroups are crucial to evaluate intervention effectiveness and promote culturally appropriate evidence-based practice. Method: To assess reliability and validity, we used three baseline surveys of high school students participating in a preventive intervention trial within the jurisdictional service area of the Cherokee Nation in northeastern Oklahoma. The 15-minute alcohol risk survey included 16 multi-item scales and one composite score measuring key proximal, primary, and moderating variables. Forty-four percent of the students indicated that they were AI (of whom 82% were Cherokee), including 23% who reported being AI only (n = 435) and 18% both AI and White (n = 352). Forty-seven percent reported being White only (n = 901). Results: Scales were adequately reliable for the full sample and across race/ethnicity defined by AI, AI/White, and White subgroups. Among the full sample, all scales had acceptable internal consistency, with minor variation across race/ethnicity. All scales had extensive to exemplary test–retest reliability and showed minimal variation across race/ethnicity. The eight proximal and two primary outcome scales were each significantly associated with the frequency of alcohol use during the past month in both the cross-sectional and the longitudinal models, providing support for both criterion validity and predictive validity. For most scales, interpretation of the strength of association and statistical significance did not differ between the racial/ethnic subgroups. Conclusions: The results support the reliability and validity of scales of a brief questionnaire measuring risk and protective factors for alcohol use among AI adolescents, primarily members of the Cherokee Nation. PMID:25486402
Tahmasebi, Neda; Shafie, Bijan; Karimi, Hamid; Mazaheri, Masood
The fourth version of the Stuttering Severity Instrument (SSI-4) has been available since 2009. It has some modifications and new features which make it more appropriate at least for clinical practice, although further documentation is needed. The objective of the current research was to translate SSI-4 into Persian language and to discuss its relative and absolute reliability as well as its criterion validity for Persian adults who stutter (PWS). We also aimed to study how the new subjective self-reports of the SSI-4 complement the stuttering severity score obtained from the SSI-3 or the SSI-4. The cross-cultural guideline recommended by the International Quality of Life Assessment project was used to translate the SSI-4 into Persian language. Thirty five PWS from ages 17 to 42 were recruited and 10 speech and language pathologists assessed their stuttering severity using either the SSI-4 or stuttering severity ratings (SR) to test validity and reliability of the Persian translated version. A very high inter-judge relative reliability along with a poor absolute inter-judge reliability was found for the SSI-4 scores. The results were more promising for the intra-judge absolute reliability. Test-retest reliability of the complementary questions to the SSI-4 was also found acceptable. However, no strong relationship was found between the SSI-4 scores and its complementary questions. The Persian version of the SSI-4 can be used reliably by trained SLPs for research and clinical purposes, but not to document small changes in stuttering severity. We argue that the response of participants to the complementary self-report questions should also be considered in calculating their stuttering severity score. Copyright © 2018 Elsevier Inc. All rights reserved.
Transferring Aviation Practices into Clinical Medicine for the Promotion of High Reliability.
Powell-Dunford, Nicole; McPherson, Mark K; Pina, Joseph S; Gaydos, Steven J
2017-05-01
Aviation is a classic example of a high reliability organization (HRO)-an organization in which catastrophic events are expected to occur without control measures. As health care systems transition toward high reliability, aviation practices are increasingly transferred for clinical implementation. A PubMed search using the terms aviation, crew resource management, and patient safety was undertaken. Manuscripts authored by physician pilots and accident investigation regulations were analyzed. Subject matter experts involved in adoption of aviation practices into the medical field were interviewed. A PubMed search yielded 621 results with 22 relevant for inclusion. Improved clinical outcomes were noted in five research trials in which aviation practices were adopted, particularly with regard to checklist usage and crew resource-management training. Effectiveness of interventions was influenced by intensity of application, leadership involvement, and provision of staff training. The usefulness of incorporating mishap investigation techniques has not been established. Whereas aviation accident investigation is highly standardized, the investigation of medical error is characterized by variation. The adoption of aviation practices into clinical medicine facilitates an evolution toward high reliability. Evidence for the efficacy of the checklist and crew resource-management training is robust. Transference of aviation accident investigation practices is preliminary. A standardized, independent investigation process could facilitate the development of a safety culture commensurate with that achieved in the aviation industry.Powell-Dunford N, McPherson MK, Pina JS, Gaydos SJ. Transferring aviation practices into clinical medicine for the promotion of high reliability. Aerosp Med Hum Perform. 2017; 88(5):487-491.
Defense AT and L. Volume 43, Number 1
2014-02-01
community on the strategic direction of future space system acquisitions. Should we continue and/or improve aggregated space systems over time or move...desirable. Schedulers must have a firm command of project manage- ment theory and practice and possess leadership and communication skills. Scorecard...idea was recommended by the Defense Sci- ence Board’s Munitions System Reliability report. In the future, such systems conceivably could communicate
Pesch, Megan H; Lumeng, Julie C
2017-12-15
Behavioral coding of videotaped eating and feeding interactions can provide researchers with rich observational data and unique insights into eating behaviors, food intake, food selection as well as interpersonal and mealtime dynamics of children and their families. Unlike self-report measures of eating and feeding practices, the coding of videotaped eating and feeding behaviors can allow for the quantitative and qualitative examinations of behaviors and practices that participants may not self-report. While this methodology is increasingly more common, behavioral coding protocols and methodology are not widely shared in the literature. This has important implications for validity and reliability of coding schemes across settings. Additional guidance on how to design, implement, code and analyze videotaped eating and feeding behaviors could contribute to advancing the science of behavioral nutrition. The objectives of this narrative review are to review methodology for the design, operationalization, and coding of videotaped behavioral eating and feeding data in children and their families, and to highlight best practices. When capturing eating and feeding behaviors through analysis of videotapes, it is important for the study and coding to be hypothesis driven. Study design considerations include how to best capture the target behaviors through selection of a controlled experimental laboratory environment versus home mealtime, duration of video recording, number of observations to achieve reliability across eating episodes, as well as technical issues in video recording and sound quality. Study design must also take into account plans for coding the target behaviors, which may include behavior frequency, duration, categorization or qualitative descriptors. Coding scheme creation and refinement occur through an iterative process. Reliability between coders can be challenging to achieve but is paramount to the scientific rigor of the methodology. Analysis approach is dependent on the how data were coded and collapsed. Behavioral coding of videotaped eating and feeding behaviors can capture rich data "in-vivo" that is otherwise unobtainable from self-report measures. While data collection and coding are time-intensive the data yielded can be extremely valuable. Additional sharing of methodology and coding schemes around eating and feeding behaviors could advance the science and field.
Bengtsson, Ulrika; Kjellgren, Karin; Höfer, Stefan; Taft, Charles; Ring, Lena
2014-10-01
Self-management support tools using technology may improve adherence to hypertension treatment. There is a need for user-friendly tools facilitating patients' understanding of the interconnections between blood pressure, wellbeing and lifestyle. This study aimed to examine comprehension, comprehensiveness and relevance of items, and further to evaluate the usability and reliability of an interactive hypertension-specific mobile phone self-report system. Areas important in supporting self-management and candidate items were derived from five focus group interviews with patients and healthcare professionals (n = 27), supplemented by a literature review. Items and response formats were drafted to meet specifications for mobile phone administration and were integrated into a mobile phone data-capture system. Content validity and usability were assessed iteratively in four rounds of cognitive interviews with patients (n = 21) and healthcare professionals (n = 4). Reliability was examined using a test-retest. Focus group analyses yielded six areas covered by 16 items. The cognitive interviews showed satisfactory item comprehension, relevance and coverage; however, one item was added. The mobile phone self-report system was reliable and perceived easy to use. The mobile phone self-report system appears efficiently to capture information relevant in patients' self-management of hypertension. Future studies need to evaluate the effectiveness of this tool in improving self-management of hypertension in clinical practice.
Federal Register 2010, 2011, 2012, 2013, 2014
2013-02-13
... Location Accuracy, Network Security Best Practices, DNSSEC Implementation Practices for ISPs, Secure BGP... FEDERAL COMMUNICATIONS COMMISSION Federal Advisory Committee Act; Communications Security... persons that the Federal Communications Commission's (FCC) Communications Security, Reliability, and...
Monroe, Todd; Carter, Michael
2012-09-01
Cognitive scales are used frequently in geriatric research and practice. These instruments are constructed with underlying assumptions that are a part of their validation process. A common measurement scale used in older adults is the Folstein Mini Mental State Exam (MMSE). The MMSE was designed to screen for cognitive impairment and is used often in geriatric research. This paper has three aims. Aim one was to explore four potential threats to validity in the use of the MMSE: (1) administering the exam without meeting the underlying assumptions, (2) not reporting that the underlying assumptions were assessed prior to test administration, (3) use of variable and inconsistent cut-off scores for the determination of presence of cognitive impairment, and (4) failure to adjust the scores based on the demographic characteristics of the tested subject. Aim two was to conduct a literature search to determine if the assumptions of (1) education level assessment, (2) sensory assessment, and (3) language fluency were being met and clearly reported in published research using the MMSE. Aim three was to provide recommendations to minimalize threats to validity in research studies that use cognitive scales, such as the MMSE. We found inconsistencies in published work in reporting whether or not subjects meet the assumptions that underlie a reliable and valid MMSE score. These inconsistencies can pose threats to the reliability of exam results. Fourteen of the 50 studies reviewed reported inclusion of all three of these assumptions. Inconsistencies in reporting the inclusion of the underlying assumptions for a reliable score could mean that subjects were not appropriate to be tested by use of the MMSE or that an appropriate test administration of the MMSE was not clearly reported. Thus, the research literature could have threats to both validity and reliability based on misuse of or improper reported use of the MMSE. Six recommendations are provided to minimalize these threats in future research.
2016-01-01
When a series of studies fails to replicate a well-documented effect, researchers might be tempted to use a “vote counting” approach to decide whether the effect is reliable—that is, simply comparing the number of successful and unsuccessful replications. Vohs’s (2015) response to the absence of money priming effects reported by Rohrer, Pashler, and Harris (2015) provides an example of this approach. Unfortunately, vote counting is a poor strategy to assess the reliability of psychological findings because it neglects the impact of selection bias and questionable research practices. In the present comment, we show that a range of meta-analytic tools indicate irregularities in the money priming literature discussed by Rohrer et al. and Vohs, which all point to the conclusion that these effects are distorted by selection bias, reporting biases, or p-hacking. This could help to explain why money-priming effects have proven unreliable in a number of direct replication attempts in which biases have been minimized through preregistration or transparent reporting. Our major conclusion is that the simple proportion of significant findings is a poor guide to the reliability of research and that preregistered replications are an essential means to assess the reliability of money-priming effects. PMID:27077759
Choi, Kang Rok; Kim, Daeho; Jang, Eun Young; Bae, Hwallip; Kim, Seok Hyeon
2017-01-01
Traumatic events and adverse stressful experiences are major etiological factors in a wide variety of physical and mental disorders. Developing psychological instruments that can be easily administered and that have good psychometric properties have become an integral part for research and practice. This study investigated the reliability and validity of the Korean version of the Lifetime Stressor Checklist-Revised (LSC-R) in a consecutive sample of psychiatric outpatients. The LSC-R is a 30-item self-reporting questionnaire examining lifetime traumatic and non-traumatic stressors. A final sample of 258 outpatients with anxiety or depressive disorders was recruited at the psychiatric department of a university-affiliated teaching hospital. Self-reported data included the Life Events Checklist (LEC), the Zung Self-Rating Depression and Anxiety Scales, and the Impact of Events Scale-Revised, in addition to the LSC-R. A convenience sample of 50 college students completed the LSC-R on two occasions separated by a three week-interval for test-retest reliability. Mean kappa for temporal stability was high (κ=0.651) and Cronbach alpha was moderate (α=0.724). Convergent validity was excellent with corresponding items on the LEC. Concurrent validity was good for symptoms of post-traumatic stress disorder, depression, and anxiety. An exploratory factor analysis revealed that 11 factors explained 64.3 % of the total variance. This study demonstrated good psychometric properties of the Korean version of the LSC-R, further supporting its use in clinical research and practice with a Korean speaking population.
Validation of the Practice Environment Scale to the Brazilian culture.
Gasparino, Renata C; Guirardello, Edinêis de B
2017-07-01
To validate the Brazilian version of the Practice Environment Scale. The Practice Environment Scale is a tool that evaluates the presence of characteristics that are favourable for professional nursing practice because a better work environment contributes to positive results for patients, professionals and institutions. Methodological study including 209 nurses. Validity was assessed via a confirmatory factor analysis using structural equation modelling, in which the correlations between the instrument and the following variables were tested: burnout, job satisfaction, safety climate, perception of quality of care and intention to leave the job. Subgroups were compared and the reliability was assessed using Cronbach's alpha and the composite reliability. Factor analysis resulted in exclusion of seven items. Significant correlations were obtained between the subscales and all variables in the study. The reliability was considered acceptable. The Brazilian version of the Practice Environment Scale is a valid and reliable tool used to assess the characteristics that promote professional nursing practice. Use of this tool in Brazilian culture should allow managers to implement changes that contribute to the achievement of better results, in addition to identifying and comparing the environments of health institutions. © 2017 John Wiley & Sons Ltd.
Morrison, Janet D; Becker, Heather; Stuifbergen, Alexa K
2017-12-01
Careful consideration of intervention fidelity is critical to establishing the validity and reliability of research findings, yet such reports are often lacking in the research literature. It is imperative that intervention fidelity be methodically evaluated and reported to promote the translation of effective interventions into sound evidence-based practice. The purpose of this article is to explore strategies used to promote intervention fidelity, incorporating examples from a multisite clinical trial, that illustrate the National Institutes of Health Behavior Change Consortium's 5 domains for recommended treatment practices: (1) study design, (2) facilitator training, (3) intervention delivery, (4) intervention receipt, and (5) intervention enactment. A multisite randomized clinical trial testing the efficacy of a computer-assisted cognitive rehabilitation intervention for adults with multiple sclerosis is used to illustrate strategies promoting intervention fidelity. Data derived from audiotapes of intervention classes, audits of computer exercises completed by participants, participant class attendance, and goal attainment scaling suggested relatively high fidelity to the intervention protocol. This study illustrates how to report intervention fidelity in the literature guided by best practice strategies, which may serve to promote fidelity monitoring and reporting in future studies.
Bastianelli, Karen M S; Nelson, Lucas; Palombi, Laura
The public is largely unaware of changing pharmacy roles and continues to underutilize pharmacists as mainly a dispensing service, often overlooking direct patient care components. This paper evaluates the public perceptions of pharmacists' through student led point-of-care screenings and comprehensive medication reviews. An IRB approved longitudinal retrospective study design was used to survey participants who attended College of Pharmacy sponsored health fairs in 2013. The survey contained questions related to patient understanding of the current pharmacy scope of practice, patient's reported level of comfort with the expanding roles of pharmacy, insurance coverage, demographics, income and whom the patients report that they seek first for medical related advice. One hundred participants were surveyed. By a large majority, the health care practitioner that patients reported that they currently sought out for general health information was a physician. All of the participants reported that they considered pharmacists as reliable sources of general health information. Of the 96 participants that completed the survey question regarding their willingness to seek out a pharmacist in the future to have their blood pressure, blood sugar, cholesterol, and/or bone density checked, 89 (92.7%) surveyed answered yes. Only 50 patients (50%) reported being aware that pharmacists have the capability to perform these point-of-care screenings. Participants responded that they felt "comfortable" (78 patients, 83.4%) followed by "trusting" (36 patients, 38.7%). The patients surveyed recognized pharmacists as the medication experts; however, many did not understand the capacity to which pharmacists' scope of practice allows them to deliver care to patients. In this study, almost all of the patients stated that they would consider seeking out a pharmacist to receive point-of-care screenings, but only half were aware that a pharmacist is capable of conducting these screenings. All felt that pharmacists are a reliable source of general health information and medication related needs, regardless of the patients' education level or household income. With the shortage of primary care providers, expansion of the scope of pharmaceutical practice is necessary. Although the public may not fully understand pharmacists' scope of practice, they are comfortable with pharmacists' knowledge related to general health needs outside of medications. The reported public comfort and trust of pharmacist knowledge and skills justify expanding the scope of pharmaceutical practice. Copyright © 2017 Elsevier Ltd. All rights reserved.
Reliability and validity in a nutshell.
Bannigan, Katrina; Watson, Roger
2009-12-01
To explore and explain the different concepts of reliability and validity as they are related to measurement instruments in social science and health care. There are different concepts contained in the terms reliability and validity and these are often explained poorly and there is often confusion between them. To develop some clarity about reliability and validity a conceptual framework was built based on the existing literature. The concepts of reliability, validity and utility are explored and explained. Reliability contains the concepts of internal consistency and stability and equivalence. Validity contains the concepts of content, face, criterion, concurrent, predictive, construct, convergent (and divergent), factorial and discriminant. In addition, for clinical practice and research, it is essential to establish the utility of a measurement instrument. To use measurement instruments appropriately in clinical practice, the extent to which they are reliable, valid and usable must be established.
Low-income mothers’ feeding goals predict observed home mealtime and child feeding practices
Pesch, Megan H.; Miller, Alison L.; Appugliese, Danielle P.; Kaciroti, Niko; Rosenblum, Katherine L.; Lumeng, Julie C.
2016-01-01
Background Mothers’ goals are important for health behavior change, and engagement in child obesity interventions. It is unknown if maternal feeding goals are associated with observed home mealtime or feeding practices. The objective of this study was to examine the association of four common feeding goals (restrict junk food, promote fruit or vegetable intake, promote autonomy in eating, and prevent obesity) with mothers’ observed home mealtime and feeding practices. Methods Low-income mothers (N = 265) of children (mean child age 70.8 months) participated in a semi-structured interview about child feeding. A coding scheme was developed and reliably applied to identify mothers’ feeding goals from transcripts. Mothers’ observed home mealtime and feeding practices were reliably coded from home mealtimes and a laboratory eating protocol. Mothers completed a questionnaire and reported demographics. Participant weights and heights were obtained. Regression models were used to test the association of each feeding goal with observed maternal practice, controlling for covariates. Results The goal of restricting junk food was associated with the child always eating at a table (OR 2.87, 95% CI (1.39–5.96) p =0.005), but not with the mother restricting junk food. The goal of promoting fruit or vegetable intake was associated with observationally promoting vegetables (OR 1.41, 95% CI (1.09–1.84), p = 0.01). The goals of promoting autonomy and preventing obesity were not associated with any observed maternal home mealtime or feeding practices. Conclusions While mothers’ goals to restrict junk food and promote fruit or vegetable intake were associated with observed home mealtime and feeding practices, promoting autonomy and preventing obesity were not. Increased understanding of why low-income mothers may not translate certain feeding goals into practices may inform childhood obesity interventions. PMID:27558923
Barrett, Eva; McCreesh, Karen; Lewis, Jeremy
2014-02-01
A wide array of instruments are available for non-invasive thoracic kyphosis measurement. Guidelines for selecting outcome measures for use in clinical and research practice recommend that properties such as validity and reliability are considered. This systematic review reports on the reliability and validity of non-invasive methods for measuring thoracic kyphosis. A systematic search of 11 electronic databases located studies assessing reliability and/or validity of non-invasive thoracic kyphosis measurement techniques. Two independent reviewers used a critical appraisal tool to assess the quality of retrieved studies. Data was extracted by the primary reviewer. The results were synthesized qualitatively using a level of evidence approach. 27 studies satisfied the eligibility criteria and were included in the review. The reliability, validity and both reliability and validity were investigated by sixteen, two and nine studies respectively. 17/27 studies were deemed to be of high quality. In total, 15 methods of thoracic kyphosis were evaluated in retrieved studies. All investigated methods showed high (ICC ≥ .7) to very high (ICC ≥ .9) levels of reliability. The validity of the methods ranged from low to very high. The strongest levels of evidence for reliability exists in support of the Debrunner kyphometer, Spinal Mouse and Flexicurve index, and for validity supports the arcometer and Flexicurve index. Further reliability and validity studies are required to strengthen the level of evidence for the remaining methods of measurement. This should be addressed by future research. Copyright © 2013 Elsevier Ltd. All rights reserved.
Crockford, Christopher; Newton, Judith; Lonergan, Katie; Madden, Caoifa; Mays, Iain; O'Sullivan, Meabhdh; Costello, Emmet; Pinto-Grau, Marta; Vajda, Alice; Heverin, Mark; Pender, Niall; Al-Chalabi, Ammar; Hardiman, Orla; Abrahams, Sharon
2018-02-01
Cognitive impairment affects approximately 50% of people with amyotrophic lateral sclerosis (ALS). Research has indicated that impairment may worsen with disease progression. The Edinburgh Cognitive and Behavioural ALS Screen (ECAS) was designed to measure neuropsychological functioning in ALS, with its alternate forms (ECAS-A, B, and C) allowing for serial assessment over time. The aim of the present study was to establish reliable change scores for the alternate forms of the ECAS, and to explore practice effects and test-retest reliability of the ECAS's alternate forms. Eighty healthy participants were recruited, with 57 completing two and 51 completing three assessments. Participants were administered alternate versions of the ECAS serially (A-B-C) at four-month intervals. Intra-class correlation analysis was employed to explore test-retest reliability, while analysis of variance was used to examine the presence of practice effects. Reliable change indices (RCI) and regression-based methods were utilized to establish change scores for the ECAS alternate forms. Test-retest reliability was excellent for ALS Specific, ALS Non-Specific, and ECAS Total scores of the combined ECAS A, B, and C (all > .90). No significant practice effects were observed over the three testing sessions. RCI and regression-based methods produced similar change scores. The alternate forms of the ECAS possess excellent test-retest reliability in a healthy control sample, with no significant practice effects. The use of conservative RCI scores is recommended. Therefore, a change of ≥8, ≥4, and ≥9 for ALS Specific, ALS Non-Specific, and ECAS Total score is required for reliable change.
Alsous, Mervat; Alhalaiqa, Fadwa; Abu Farha, Rana; Abdel Jalil, Mariam; McElnay, James; Horne, Robert
2017-01-01
Objectives to evaluate the reliability and discriminant validity of Arabic translation of the Medication Adherence Report Scale (MARS) and the Beliefs about Medication Questionnaire-specific (BMQ-specific). Methods Having developed Arabic translations of the study instruments, a cross-sectional study was carried out between March and October 2015 in two multidisciplinary governmental hospitals in Jordan. An expert panel monitored the forward and backward translation of the MARS and BMQ. Standard Arabic was used (with no specific dialect inclusion) to allow greater generalisability across Arabic speaking countries. Once the Arabic translations of the questionnaires were developed they were tested for consistency, validity and reliability on a group of children with chronic diseases and their parents. Results A total of 258 parents and 208 children were included in the study. The median age of participated children and parents was 15 years and 42 years respectively. Principle component analysis of all questionnaires indicated that all had good construct validity as they clearly measured one construct. The questionnaires were deemed reliable based on the results of Cronbach alpha coefficient. Furthermore, reliability of the questionnaires was demonstrated by test-retest intraclass correlation coefficients (ICC) which ranged from good to excellent for all scales (ICC>0.706). The Pearson correlation coefficient ranged from 0.546–0.805 for the entire sample which indicated a significant moderate to strong positive correlation between MARS and BMQ items at time 1 and 2. Reported adherence was greater than 59% using MARS-children and MARS-parents scales, and was correlated with beliefs in necessity and independent of the concerns regarding medications. Conclusion The Arabic translations of both BMQ and MARS for use in children and their parents have good internal consistency and proved to be valid and reliable tools that can be used by researchers in clinical practice to measure adherence and beliefs about medications in Arabic speaking patient populations. PMID:28192467
Alsous, Mervat; Alhalaiqa, Fadwa; Abu Farha, Rana; Abdel Jalil, Mariam; McElnay, James; Horne, Robert
2017-01-01
to evaluate the reliability and discriminant validity of Arabic translation of the Medication Adherence Report Scale (MARS) and the Beliefs about Medication Questionnaire-specific (BMQ-specific). Having developed Arabic translations of the study instruments, a cross-sectional study was carried out between March and October 2015 in two multidisciplinary governmental hospitals in Jordan. An expert panel monitored the forward and backward translation of the MARS and BMQ. Standard Arabic was used (with no specific dialect inclusion) to allow greater generalisability across Arabic speaking countries. Once the Arabic translations of the questionnaires were developed they were tested for consistency, validity and reliability on a group of children with chronic diseases and their parents. A total of 258 parents and 208 children were included in the study. The median age of participated children and parents was 15 years and 42 years respectively. Principle component analysis of all questionnaires indicated that all had good construct validity as they clearly measured one construct. The questionnaires were deemed reliable based on the results of Cronbach alpha coefficient. Furthermore, reliability of the questionnaires was demonstrated by test-retest intraclass correlation coefficients (ICC) which ranged from good to excellent for all scales (ICC>0.706). The Pearson correlation coefficient ranged from 0.546-0.805 for the entire sample which indicated a significant moderate to strong positive correlation between MARS and BMQ items at time 1 and 2. Reported adherence was greater than 59% using MARS-children and MARS-parents scales, and was correlated with beliefs in necessity and independent of the concerns regarding medications. The Arabic translations of both BMQ and MARS for use in children and their parents have good internal consistency and proved to be valid and reliable tools that can be used by researchers in clinical practice to measure adherence and beliefs about medications in Arabic speaking patient populations.
Judging nursing information on the WWW: a theoretical understanding.
Cader, Raffik; Campbell, Steve; Watson, Don
2009-09-01
This paper is a report of a study of the judgement processes nurses use when evaluating World Wide Web information related to nursing practice. The World Wide Web has increased the global accessibility of online health information. However, the variable nature of the quality of World Wide Web information and its perceived level of reliability may lead to misinformation. This makes demands on healthcare professionals, and on nurses in particular, to ensure that health information of reliable quality is selected for use in practice. A grounded theory approach was adopted. Semi-structured interviews and focus groups were used to collect data, between 2004 and 2005, from 20 nurses undertaking a postqualification graduate course at a university and 13 nurses from a local hospital in the United Kingdom. A theoretical framework emerged that gave insight into the judgement process nurses use when evaluating World Wide Web information. Participants broke the judgement process down into specific tasks. In addition, they used tacit, process and propositional knowledge and intuition, quasi-rational cognition and analysis to undertake these tasks. World Wide Web information cues, time available and nurses' critical skills were influencing factors in their judgement process. Addressing the issue of quality and reliability associated with World Wide Web information is a global challenge. This theoretical framework could contribute towards meeting this challenge.
Concordance of chart and billing data with direct observation in dental practice.
Demko, Catherine A; Victoroff, Kristin Zakariasen; Wotman, Stephen
2008-10-01
The commonly used methods of chart review, billing data summaries and practitioner self-reporting have not been examined for their ability to validly and reliably represent time use and service delivery in routine dental practice. A more thorough investigation of these data sources would provide insight into the appropriateness of each approach for measuring various clinical behaviors. The aim of this study was to assess the validity of commonly used methods such as dental chart review, billing data, or practitioner self-report compared with a 'gold standard' of information derived from direct observation of routine dental visits. A team of trained dental hygienists directly observed 3751 patient visits in 120 dental practices and recorded the behaviors and procedures performed by dentists and hygienists during patient contact time. Following each visit, charts and billing records were reviewed for the performed and billed procedures. Dental providers characterized their frequency of preventive service delivery through self-administered surveys. We standardized the observation and abstraction methods to obtain optimal measures from each of the multiple data sources. Multi-rater kappa coefficients were computed to monitor standardization, while sensitivity, specificity, and kappa coefficients were calculated to compare the various data sources with direct observation. Chart audits were more sensitive than billing data for all observed procedures and demonstrated higher agreement with directly observed data. Chart and billing records were not sensitive for several prevention-related tasks (oral cancer screening and oral hygiene instruction). Provider self-reports of preventive behaviors were always over-estimated compared with direct observation. Inter-method reliability kappa coefficients for 13 procedures ranged from 0.197 to 0.952. These concordance findings suggest that strengths and weaknesses of data collection sources should be considered when investigating delivery of dental services especially when using practitioner survey data. Future investigations can more fully rely on charted information rather than billing data and provider self-report for most dental procedures, but nonbillable procedures and most counseling interactions will not be captured with routine charting and billing practices.
Shiying Tian; Mohamed A. Youssef; R. Wayne Skaggs; Devendra Amatya; George M. Chescheir
2012-01-01
This paper reports results of a study to test the reliability of the DRAINMOD-FOREST model for predicting water, soil carbon (C) and nitrogen (N) dynamics in intensively managed forests. The study site, two adjacent loblolly pine (Pinus taeda L.) plantations (referred as D2 and D3), are located in the coastal plain of North Carolina, USA. Controlled drainage (with weir...
Is the MARS questionnaire a reliable measure of medication adherence in childhood asthma?
Garcia-Marcos, Patricia W; Brand, Paul L P; Kaptein, Adrian A; Klok, Ted
2016-12-01
To assess the reliability of the Medication Adherence Report Scale (MARS-5) for assessing adherence in clinical practice and research. Prospective cohort study following electronically measured inhaled corticosteroids (ICS) adherence for 1 year in 2-13-year-old children with persistent asthma. The relationship between electronically measured adherence and MARS-5 scores (ranging from 5 to 25) was assessed by Spearman's rank correlation coefficient. A ROC (receiver operating characteristic) curve was performed testing MARS-5 against electronically measured adherence. Sensitivity, specificity, positive and negative likelihood ratios of the closest MARS-5 cut-off values to the top left-hand corner of the ROC curve were calculated. High MARS scores were obtained (median 24, interquartile range 22-24). Despite a statistically significant correlation between MARS-5 and electronically assessed adherence (Spearman's rho = 0.47; p < 0.0001), there was considerable variation of adherence rates at every MARS-5 score. The area under the ROC curve was 0.7188. A MARS-5 score ≥23 had the best predictive ability for electronically assessed adherence, but positive and negative likelihood ratios were too small to be useful (1.65 and 0.27, respectively). Self-report using MARS-5 is too inaccurate to be a useful measure of adherence in children with asthma, both in clinical practice and in research.
Eggleton, Kyle; Goodyear-Smith, Felicity; Henning, Marcus; Jones, Rhys; Shulruf, Boaz
2017-03-01
The aim of this study was to develop an instrument (University of Auckland General Practice Report of Educational Environment: UAGREE) with robust psychometric properties that measured the educational environment of undergraduate primary care. The questions were designed to incorporate measurements of the teaching of cultural competence. Following a structured consensus process and an initial pilot, a list of 55 questions was developed. All Year 5 and 6 students completing a primary care attachment at Auckland University were invited to complete the questionnaire. The results were analysed using exploratory factor analysis and confirmatory factor analysis resulting in a 16-item instrument. Three factors were identified explaining 53% of the variance. The items' reliability within the factors were high (Learning: 0.894; Teaching: 0.871; Cultural competence: 0.857). Multiple groups analysis by gender; and separately across ethnic groups did not find significant differences between groups. UAGREE is a specific instrument measuring the undergraduate primary care educational environment. Its questions fit within established theoretical educational environment frameworks and the incorporation of cultural competence questions reflects the importance of teaching cultural competence within medicine. The psychometric properties of UAGREE suggest that it is a reliable and valid measure of the primary care education environment.
Federal Register 2010, 2011, 2012, 2013, 2014
2013-03-12
... Alerting, E9-1-1 Location Accuracy, Network Security Best Practices, DNSSEC Implementation Practices for... FEDERAL COMMUNICATIONS COMMISSION Federal Advisory Committee Act; Communications Security... Security, Reliability, and Interoperability Council (CSRIC) meeting that was scheduled for March 6, 2013 is...
Veneziano, Louis; Veneziano, Carol
2002-09-01
In order to provide an objective measure of problematic behavioral patterns among juvenile delinquents in residential facilities, the Dormitory Observation Report (DOR) was developed. The DOR assesses 11 dimensions of problematic behavioral patterns (e.g., physical assaultiveness, manipulativeness), as well as three dimensions of desirable behavioral patterns expected in an institutional setting (e.g., independent functioning, personal hygiene, care of surroundings). Empirical study regarding the reliability and validity of the DOR are reported, and the results are discussed in terms of the theoretical and practical implications of this instrument. Copyright 2002 Wiley Periodicals, Inc.
Robinson, Hilde Stendal; Dagfinrud, Hanne
2017-05-31
Low back pain (LBP) is the most common reported musculoskeletal disorder, with large prevalence numbers and high costs. Focus on early identification of patients at risk of developing chronic LBP has increased. The Keele Start Back Tool (SBT) is a questionnaire aiming at screening prognostic indicators in LBP patients, categorizing patients into risk-groups and guide treatment. The aim of this study was to explore the Norwegian version of the SBT with regard to reliability of the SBT-scoring and the screening ability in LBP patients in primary care physiotherapy. LBP patients answered a package of questionnaires twice, with 1-3 days in between, containing SBT, Hannover functional ability questionnaire, pain intensity questions and demographics. The relative and absolute reliability of SBT was calculated using intraclass correlation coefficient (ICC) and the smallest detectable change respectively. Independent sample t-tests were used for group comparisons. Fifty-two patients with LBP. Mean age (SD) was 45 (12) years and 62% were female. The ICC (95% CI) for SBT total score and psychosocial subscore was 0.89 (0.82, 0.94) and 0.82 (0.70, 0.90) respectively. None of the participants were allocated to the high risk group. The medium risk group reported significantly more pain last week and more activity limitations than the low risk group at both test and retest (0.001 ≤ p ≤ 0.003), whereas no significant difference between the groups was found on pain now (0.05 ≤ p ≤ 0.16). The Norwegian version of the SBT was reliable and the screening ability was good as the subgrouping of patients into risk-groups reflected the severity of their back problems. The SBT may be an applicable and useful tool in physiotherapy practice.
Eliasson, Kristina; Palm, Peter; Nyman, Teresia; Forsman, Mikael
2017-07-01
A common way to conduct practical risk assessments is to observe a job and report the observed long term risks for musculoskeletal disorders. The aim of this study was to evaluate the inter- and intra-observer reliability of ergonomists' risk assessments without the support of an explicit risk assessment method. Twenty-one experienced ergonomists assessed the risk level (low, moderate, high risk) of eight upper body regions, as well as the global risk of 10 video recorded work tasks. Intra-observer reliability was assessed by having nine of the ergonomists repeat the procedure at least three weeks after the first assessment. The ergonomists made their risk assessment based on his/her experience and knowledge. The statistical parameters of reliability included agreement in %, kappa, linearly weighted kappa, intraclass correlation and Kendall's coefficient of concordance. The average inter-observer agreement of the global risk was 53% and the corresponding weighted kappa (K w ) was 0.32, indicating fair reliability. The intra-observer agreement was 61% and 0.41 (K w ). This study indicates that risk assessments of the upper body, without the use of an explicit observational method, have non-acceptable reliability. It is therefore recommended to use systematic risk assessment methods to a higher degree. Copyright © 2017 The Authors. Published by Elsevier Ltd.. All rights reserved.
Bourne, Richard S; Shulman, Rob; Tomlin, Mark; Borthwick, Mark; Berry, Will; Mills, Gary H
2017-04-01
To identify between and within profession-rater reliability of clinical impact grading for common critical care prescribing error and optimisation cases. To identify representative clinical impact grades for each individual case. Electronic questionnaire. 5 UK NHS Trusts. 30 Critical care healthcare professionals (doctors, pharmacists and nurses). Participants graded severity of clinical impact (5-point categorical scale) of 50 error and 55 optimisation cases. Case between and within profession-rater reliability and modal clinical impact grading. Between and within profession rater reliability analysis used linear mixed model and intraclass correlation, respectively. The majority of error and optimisation cases (both 76%) had a modal clinical severity grade of moderate or higher. Error cases: doctors graded clinical impact significantly lower than pharmacists (-0.25; P < 0.001) and nurses (-0.53; P < 0.001), with nurses significantly higher than pharmacists (0.28; P < 0.001). Optimisation cases: doctors graded clinical impact significantly lower than nurses and pharmacists (-0.39 and -0.5; P < 0.001, respectively). Within profession reliability grading was excellent for pharmacists (0.88 and 0.89; P < 0.001) and doctors (0.79 and 0.83; P < 0.001) but only fair to good for nurses (0.43 and 0.74; P < 0.001), for optimisation and error cases, respectively. Representative clinical impact grades for over 100 common prescribing error and optimisation cases are reported for potential clinical practice and research application. The between professional variability highlights the importance of multidisciplinary perspectives in assessment of medication error and optimisation cases in clinical practice and research. © The Author 2017. Published by Oxford University Press in association with the International Society for Quality in Health Care. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com
DOE Office of Scientific and Technical Information (OSTI.GOV)
Davis, Lynn; Arquit Niederberger, Anne
Abstract— Lighting systems have the ability to transform the economic and educational infrastructure of disadvantaged communities, and eradicating “light poverty” has become one of the primary goals of the International Year of Light 2015. Solid-state lighting (SSL) technology, based on light-emitting diode (LED) light sources, has emerged as the next generation of lighting technology, with a current global market penetration of roughly 5%. This paper will report on recent research on understanding SSL lighting system reliability (failure modes, environmental stressors, electrical power quality); discuss the implications of SSL technology reliability for providing lighting services; and suggest practical approaches to ensuremore » SSL reliability to benefit humanity. Among the key findings from this work is that LED sources can be extremely reliable, withstanding a broad range of environmental stresses without failure. Nonetheless, SSL lighting systems can have a negative impact on electrical power reliability, as well as on the affordability of lighting services, without attention to the quality of the accompanying power infrastructure. It is therefore critical to ensure that the performance of the power supply electronics used in lighting systems is matched to the quality of the power source, when evaluating energy efficient lighting choices.« less
Design of high reliability organizations in health care.
Carroll, J S; Rudolph, J W
2006-12-01
To improve safety performance, many healthcare organizations have sought to emulate high reliability organizations from industries such as nuclear power, chemical processing, and military operations. We outline high reliability design principles for healthcare organizations including both the formal structures and the informal practices that complement those structures. A stage model of organizational structures and practices, moving from local autonomy to formal controls to open inquiry to deep self-understanding, is used to illustrate typical challenges and design possibilities at each stage. We suggest how organizations can use the concepts and examples presented to increase their capacity to self-design for safety and reliability.
Verbakel, Natasha J; Zwart, Dorien L M; Langelaan, Maaike; Verheij, Theo J M; Wagner, Cordula
2013-09-17
Patient safety has been a priority in primary healthcare in the last years. The prevailing culture is seen as an important condition for patient safety in practice and several tools to measure patient safety culture have therefore been developed. Although Dutch primary care consists of different professions, such as general practice, dental care, dietetics, physiotherapy and midwifery, a safety culture questionnaire was only available for general practices. The purpose of this study was to modify and validate this existing questionnaire to a generic questionnaire for all professions in Dutch primary care. A validated Dutch questionnaire for general practices was modified to make it usable for all Dutch primary care professions. Subsequently, this questionnaire was administered to a random sample of 2400 practices from eleven primary care professions. The instrument's factor structure, reliability and validity were examined using confirmatory and explorative factor analyses. 921 questionnaires were returned. Of these, 615 were eligible for factor analysis. The resulting SCOPE-PC questionnaire consisted of seven dimensions: 'open communication and learning from errors', 'handover and teamwork', 'adequate procedures and working conditions', 'patient safety management', 'support and fellowship', 'intention to report events' and 'organisational learning' with a total of 41 items. All dimensions had good reliability with Cronbach's alphas ranging from 0.70-0.90, and the questionnaire had a good construct validity. The SCOPE-PC questionnaire has sound psychometric characteristics for use by the different professions in Dutch primary care to gain insight in their safety culture.
Photovoltaic balance-of-system designs and costs at PVUSA
NASA Astrophysics Data System (ADS)
Reyes, A. B.; Jennings, C.
1995-05-01
This report is one in a series of 1994-1995 PVUSA reports that document PVUSA lessons learned at demonstration sites in California and Texas. During the last 7 years (1988 to 1994), 16 PV systems ranging from 20 kW to 500 kW have been installed. Six 20-kW emerging module technology (EMT) arrays and three turnkey (i.e., vendor designed and integrated) utility-scale systems were procured and installed at PVUSA's main test site in Davis, California. PVUSA host utilities have installed a total of seven EMT arrays and utility-scale systems in their service areas. Additional systems at Davis and host utility sites are planned. One of PVUSA's key objectives is to evaluate the performance, reliability, and cost of PV balance-of-system (BOS). In the procurement stage PVUSA encouraged innovative design to improve upon present practice by reducing maintenance, improving reliability, or lowering manufacturing or construction costs. The project team worked closely with suppliers during the design stage not only to ensure designs met functional and safety specifications, but to provide suggestions for improvement. This report, intended for the photovoltaic (PV) industry and for utility project managers and engineers considering PV plant construction and ownership, documents PVUSA utility-scale system design and cost lessons learned. Complementary PVUSA topical reports document: construction and safety experience; five-year assessment of EMTs; validation of the Kerman 500-kW grid-support PV plant benefits; PVUSA instrumentation and data analysis techniques; procurement, acceptance, and rating practices for PV power plants; experience with power conditioning units and power quality.
Falloon, I R H; Mizuno, M; Murakami, M; Roncone, R; Unoka, Z; Harangozo, J; Pullman, J; Gedye, R; Held, T; Hager, B; Erickson, D; Burnett, K
2005-01-01
To develop a reliable standardized assessment of psychiatric symptoms for use in clinical practice. A 50-item interview, the Current Psychiatric State 50 (CPS-50), was used to assess 237 patients with a range of psychiatric diagnoses. Ratings were made by interviewers after a 2-day training. Comparisons of inter-rater reliability on each item and on eight clinical subscales were made across four international centres and between psychiatrists and non-psychiatrists. A principal components analysis was used to validate these clinical scales. Acceptable inter-rater reliability (intra-class coefficient > 0.80) was found for 46 of the 50 items, and for all eight subscales. There was no difference between centres or between psychiatrists and non-psychiatrists. The principal components analysis factors were similar to the clinical scales. The CPS-50 is a reliable standardized assessment of current mental status that can be used in clinical practice by all mental health professionals after brief training. Blackwell Munksgaard 2004
2013-01-01
During the last decade, there has been a rapid increase in development of instruments to measure parent food practices. Because these instruments often measure different constructs, or define common constructs differently, an evaluation of these instruments is needed. A systematic review of the literature was conducted to identify existing measures of parent food practices and to assess the quality of their development. The initial search used terms capturing home environment, parenting behaviors, feeding practices and eating behaviors, and was performed in October of 2009 using PubMed/Medline, PsychInfo, Web of knowledge (ISI), and ERIC, and updated in July of 2012. A review of titles and abstracts was used to narrow results, after which full articles were retrieved and reviewed. Only articles describing development of measures of parenting food practices designed for families with children 2-12 years old were retained for the current review. For each article, two reviewers extracted data and appraised the quality of processes used for instrument development and evaluation. The initial search yielded 28,378 unique titles; review of titles and abstracts narrowed the pool to 1,352 articles; from which 57 unique instruments were identified. The review update yielded 1,772 new titles from which14 additional instruments were identified. The extraction and appraisal process found that 49% of instruments clearly identified and defined concepts to be measured, and 46% used theory to guide instrument development. Most instruments (80%) had some reliability testing, with internal consistency being the most common (79%). Test-retest or inter-rater reliability was reported for less than half the instruments. Some form of validity evidence was reported for 84% of instruments. Construct validity was most commonly presented (86%), usually with analysis of associations with child diet or weight/BMI. While many measures of food parenting practices have emerged, particularly in recent years, few have demonstrated solid development methods. Substantial variation in items across different scales/constructs makes comparison between instruments extremely difficult. Future efforts should be directed toward consensus development of food parenting practices constructs and measures. PMID:23688157
Medication administration via enteral tubes: a survey of nurses' practices.
Phillips, Nicole Margaret; Endacott, Ruth
2011-12-01
This article is a report of a study examining the practices of acute care nurses when administering medication via enteral tubes. Administering medication via enteral tubes is predominantly a nursing responsibility across countries. It is important to establish what nurses actually do when giving enteral medication to inform policy and continuing education development. In 2007, a survey was conducted using a random sample of acute care nurses at two large metropolitan hospitals in Melbourne, Australia. There were 181 Registered Nurses who participated in the study; 92 (50.8%) practised in intensive care units, 52 (28.7%) in surgical areas, 30 (16.6%) in medical areas and 7 (3.9%) were from combined medical-surgical areas. The questionnaire was developed by the researchers and a pilot study was conducted in August 2006 to test reliability, face validity and user-friendliness of the tool. Nurses reported using a range of methods to verify enteral tube position prior to administering enteral medication; some were unreliable methods. A majority reported administering enteric-coated and slow or extended release forms of medication, and giving solid forms of medication when liquid form was available. Nearly all (96%) reported flushing a tube after giving medication, 28% before, and 12% always flushed between each medication. Enteral medication administration practices are inconsistent. Some nurses are using unsafe practices and may therefore compromise patient care. © 2011 Blackwell Publishing Ltd.
[Individual responses of arterial pressure to geomagnetic activity in practically healthy subjects].
Zenchenko, T A; Dimitrova, S; Stoilova, I; Breus, T K
2009-01-01
Dynamic analysis of arterial blood pressure in relation to the Earth's magnetic field perturbations was performed in 77 practically healthy volunteers (staff of Bulgarian Academy of Sciences). Almost half of them proved magneto-sensitive, i.e. experienced AP elevation with increased geomagnetic activity. The probability of development of magnetic sensitivity was independent of age and gender but increased in volunteers having even mild cardiovascular pathology. These subjects complained of worsened health condition upon a rise in geomagnetic activity. However, some volunteers reported deteriorated well-being without AD elevation. It means that AD measurement may be insufficient for reliable monitoring magnetic sensitivity.
Efficient entanglement distribution over 200 kilometers.
Dynes, J F; Takesue, H; Yuan, Z L; Sharpe, A W; Harada, K; Honjo, T; Kamada, H; Tadanaga, O; Nishida, Y; Asobe, M; Shields, A J
2009-07-06
Here we report the first demonstration of entanglement distribution over a record distance of 200 km which is of sufficient fidelity to realize secure communication. In contrast to previous entanglement distribution schemes, we use detection elements based on practical avalanche photodiodes (APDs) operating in a self-differencing mode. These APDs are low-cost, compact and easy to operate requiring only electrical cooling to achieve high single photon detection efficiency. The self-differencing APDs in combination with a reliable parametric down-conversion source demonstrate that entanglement distribution over ultra-long distances has become both possible and practical. Consequently the outlook is extremely promising for real world entanglement-based communication between distantly separated parties.
Development and evaluation of superconducting circuit elements
NASA Technical Reports Server (NTRS)
Haertling, Gene H.; Lee, Burtrand; Hsi, Dennis; Modi, Vibhakar; Marone, Matt
1990-01-01
An approach to the application of high Tc ceramic superconductors to practical circuit elements was developed and demonstrated. This method, known as the rigid-conductor process (RCP), involves the combination of a pre-formed, sintered, and tested superconductor material with an appropriate, rigid substrate via an epoxy adhesive which also serves to encapsulate the element from the ambient environment. Emphasis was on the practical means to achieve functional, reliable, and reproducible components. Although all of the work described in this report involved a YBa2Cu3Osub(7-x) high Tc superconductor material, the techniques developed and conclusions reached are equally applicable to other high Tc materials.
Principles of Statistics: What the Sports Medicine Professional Needs to Know.
Riemann, Bryan L; Lininger, Monica R
2018-07-01
Understanding the results and statistics reported in original research remains a large challenge for many sports medicine practitioners and, in turn, may be among one of the biggest barriers to integrating research into sports medicine practice. The purpose of this article is to provide minimal essentials a sports medicine practitioner needs to know about interpreting statistics and research results to facilitate the incorporation of the latest evidence into practice. Topics covered include the difference between statistical significance and clinical meaningfulness; effect sizes and confidence intervals; reliability statistics, including the minimal detectable difference and minimal important difference; and statistical power. Copyright © 2018 Elsevier Inc. All rights reserved.
Verloo, Henk; Desmedt, Mario; Morin, Diane
2017-09-01
To evaluate two psychometric properties of the French versions of the Evidence-Based Practice Beliefs and Evidence-Based Practice Implementation scales, namely their internal consistency and construct validity. The Evidence-Based Practice Beliefs and Evidence-Based Practice Implementation scales developed by Melnyk et al. are recognised as valid, reliable instruments in English. However, no psychometric validation for their French versions existed. Secondary analysis of a cross sectional survey. Source data came from a cross-sectional descriptive study sample of 382 nurses and other allied healthcare providers. Cronbach's alpha was used to evaluate internal consistency, and principal axis factor analysis and varimax rotation were computed to determine construct validity. The French Evidence-Based Practice Beliefs and Evidence-Based Practice Implementation scales showed excellent reliability, with Cronbach's alphas close to the scores established by Melnyk et al.'s original versions. Principal axis factor analysis showed medium-to-high factor loading scores without obtaining collinearity. Principal axis factor analysis with varimax rotation of the 16-item Evidence-Based Practice Beliefs scale resulted in a four-factor loading structure. Principal axis factor analysis with varimax rotation of the 17-item Evidence-Based Practice Implementation scale revealed a two-factor loading structure. Further research should attempt to understand why the French Evidence-Based Practice Implementation scale showed a two-factor loading structure but Melnyk et al.'s original has only one. The French versions of the Evidence-Based Practice Beliefs and Evidence-Based Practice Implementation scales can both be considered valid and reliable instruments for measuring Evidence-Based Practice beliefs and implementation. The results suggest that the French Evidence-Based Practice Beliefs and Evidence-Based Practice Implementation scales are valid and reliable and can therefore be used to evaluate the effectiveness of organisational strategies aimed at increasing professionals' confidence in Evidence-Based Practice, supporting its use and implementation. © 2017 John Wiley & Sons Ltd.
Mohan, Arjun; Sethi, Sanjay
2014-03-01
Despite the increasing awareness of their pathogenesis and clinical consequences, research on and clinical management of acute exacerbations of chronic obstructive lung disease (AECOPDs) have been hindered by the lack of a consistent and reliable definition. Symptom-based definitions of exacerbations are sensitive to events and account for unreported exacerbations. Event (healthcare utilization)-based definitions are somewhat more definitive but miss unreported events. Objective quantification of symptoms in AECOPD is now possible with the development of the Exacerbations of Chronic Obstructive Pulmonary Disease Tool (EXACT-PRO), a patient-reported outcome (PRO) measure. Several studies have revealed that unreported AECOPDs are more frequent than reported events and are associated with long-term adverse consequences. New antibiotic development for AECOPD has been hampered by the lack of validated measures for resolution of exacerbations. As a result of these observations, a unique collaborative effort between academia, industry and regulatory agencies resulted in the development of the EXACT-PRO. It consists of 14 questions that generate a score between 0 and 100, and it has been shown to have excellent reliability and validity. In the absence of a reliable biomarker, the definition and measurement of exacerbations has been subjective and imprecise. PRO measures such as EXACT can provide much needed objectivity in assessing symptom-defined exacerbations, which may translate into a uniform outcome measure in clinical trials. With further development and validation, it may have a role in clinical practice in the earlier detection of exacerbations, stratification of an exacerbation severity and the assessment of clinical response to treatment.
The Practice of Respect in the ICU.
Brown, Samuel M; Azoulay, Elie; Benoit, Dominique; Butler, Terri Payne; Folcarelli, Patricia; Geller, Gail; Rozenblum, Ronen; Sands, Ken; Sokol-Hessner, Lauge; Talmor, Daniel; Turner, Kathleen; Howell, Michael D
2018-06-01
Although "respect" and "dignity" are intuitive concepts, little formal work has addressed their systematic application in the ICU setting. After convening a multidisciplinary group of relevant experts, we undertook a review of relevant literature and collaborative discussions focused on the practice of respect in the ICU. We report the output of this process, including a summary of current knowledge, a conceptual framework, and a research program for understanding and improving the practice of respect and dignity in the ICU. We separate our report into findings and proposals. Findings include the following: 1) dignity and respect are interrelated; 2) ICU patients and families are vulnerable to disrespect; 3) violations of respect and dignity appear to be common in the ICU and overlap substantially with dehumanization; 4) disrespect may be associated with both primary and secondary harms; and 5) systemic barriers complicate understanding and the reliable practice of respect in the ICU. Proposals include: 1) initiating and/or expanding a field of research on the practice of respect in the ICU; 2) treating "failures of respect" as analogous to patient safety events and using existing quality and safety mechanisms for improvement; and 3) identifying both benefits and potential unintended consequences of efforts to improve the practice of respect. Respect and dignity are important considerations in the ICU, even as substantial additional research remains to be done.
Nowicka, Paulina; Sorjonen, Kimmo; Pietrobelli, Angelo; Flodmark, Carl-Erik; Faith, Myles S
2014-10-01
The Child Feeding Questionnaire (CFQ) assesses parental feeding attitudes, beliefs and practices concerned with child feeding and obesity proneness. The questionnaire has been developed in the U.S., and validation studies in other countries are limited. The aim of this study was to examine the psychometric properties of the CFQ in Sweden and the associations between parenting practices and children's weight status. Based on records from the Swedish population register, all mothers of 4-year-olds (n = 3007) from the third largest city in Sweden, Malmö, were contacted by mail. Those who returned the CFQ together with a background questionnaire (n = 876) received the CFQ again to enable test-retest evaluation; 564 mothers completed the CFQ twice. We used confirmatory factor analysis to test whether the original 7-factor model was supported. Good fit (CFI = 0.94, TLI = 0.95, RMSEA = 0.04, SRMR = 0.05) was obtained after minor modifications such as dropping 2 items on restriction and adding 3 error covariances. The internal reliability and the 2-week test-retest reliability were good. The scores on restriction were the lowest ever reported. When the influence of parenting practices on child BMI (dependent variable) was examined in a structural equation model (SEM), child BMI had a positive association with restriction and a negative association with pressure to eat. Restriction was positively influenced by concern about child weight. The second SEM treated parenting practices as dependent variables. Parental foreign origin and child BMI had direct effects on restriction, while pressure to eat was also influenced by parental education. While the results of the study support the usefulness of the CFQ in Sweden, carefully designed cross-cultural comparisons are needed to explain why the levels of restrictive feeding in Swedish families are the lowest reported. Copyright © 2014 The Authors. Published by Elsevier Ltd.. All rights reserved.
Guidance, Navigation and Control (GN&C): Best Practices for Human-Rated Spacecraft Systems
NASA Technical Reports Server (NTRS)
Lebsock, Ken; West, John
2008-01-01
In 2007 the NESC completed an in-depth assessment to identify, define and document engineering considerations for the Design Development Test and Evaluation (DDT&E) of human-rated spacecraft systems. This study had been requested by the Astronaut Office at JSC to help them to better understand what is required to ensure safe, robust, and reliable human-rated spacecraft systems. The 22 GN&C engineering Best Practices described in this paper are a condensed version of what appears in the NESC Technical Report. These Best Practices cover a broad range from fundamental system architectural considerations to more specific aspects (e.g., stability margin recommendations) of GN&C system design and development. 15 of the Best Practices address the early phases of a GN&C System development project and the remaining 7 deal with the later phases. Some of these Best Practices will cross-over between both phases. We recognize that this set of GN&C Best Practices will not be universally applicable to all projects and mission applications.
Martignon, Stefania; Bautista-Mendoza, Gloria; González-Carrera, María; Lafaurie-Villamil, Gloria; Morales, Veicy; Santamaría, Ruth
2008-01-01
Designing three instruments for evaluating oral health knowledge, attitudes and practice in parents/caregivers of low social-economic status 0-5 year-olds. Evaluating the instruments' reliability in terms of internal consistency and analysing items. Three instruments were constructed for evaluating low social-economic status 0-5 year-olds' parents/caregivers' oral health knowledge, attitudes and practice in the municipality of Usaquén , Bogotá , Colombia . 47 parents/caregivers were given a test establishing the instrument's reliability in terms of internal consistency and the adults' level of knowledge, attitudes and practice. A sub-sample was qualitatively analysed (content verification and understanding). Reliability was evaluated using Cronbach's alpha coefficient. Items were analysed for improving constructing and understanding the questions, taking four criteria into account: corrected homogeneity index (CHI), response trend, correlation between items and qualitative analysis. Cronbach's alpha coefficient for knowledge, attitudes and practice was 0,82, 0,80 and 0,62, respectively. Participants' level of knowledge, attitudes and practice was acceptable (60 %, 55 % and 91 %, respectively). This study found two out of the three evaluated instruments to be reliable (knowledge and attitudes); all three of them were then redesigned. The resulting instruments represent a valuable tool which can be used in future studies for describing and evaluating preventative programmes.
Suen, Yi-Nam; Cerin, Ester; Barnett, Anthony; Huang, Wendy Y J; Mellecker, Robin R
2017-09-01
Valid instruments of parenting practices related to children's physical activity (PA) are essential to understand how parents affect preschoolers' PA. This study developed and validated a questionnaire of PA-related parenting practices for Chinese-speaking parents of preschoolers in Hong Kong. Parents (n = 394) completed a questionnaire developed using findings from formative qualitative research and literature searches. Test-retest reliability was determined on a subsample (n = 61). Factorial validity was assessed using confirmatory factor analysis. Subscale internal consistency was determined. The scale of parenting practices encouraging PA comprised 2 latent factors: Modeling, structure and participatory engagement in PA (23 items), and Provision of appropriate places for child's PA (4 items). The scale of parenting practices discouraging PA scale encompassed 4 latent factors: Safety concern/overprotection (6 items), Psychological/behavioral control (5 items), Promoting inactivity (4 items), and Promoting screen time (2 items). Test-retest reliabilities were moderate to excellent (0.58 to 0.82), and internal subscale reliabilities were acceptable (0.63 to 0.89). We developed a theory-based questionnaire for assessing PA-related parenting practices among Chinese-speaking parents of Hong Kong preschoolers. While some items were context and culture specific, many were similar to those previously found in other populations, indicating a degree of construct generalizability across cultures.
Anatomic pathology laboratory information systems: a review.
Park, Seung Lyung; Pantanowitz, Liron; Sharma, Gaurav; Parwani, Anil Vasdev
2012-03-01
The modern anatomic pathology laboratory depends on a reliable information infrastructure to register specimens, record gross and microscopic findings, regulate laboratory workflow, formulate and sign out report(s), disseminate them to the intended recipients across the whole health system, and support quality assurance measures. This infrastructure is provided by the Anatomical Pathology Laboratory Information Systems (APLIS), which have evolved over decades and now are beginning to support evolving technologies like asset tracking and digital imaging. As digital pathology transitions from "the way of the future" to "the way of the present," the APLIS continues to be one of the key effective enablers of the scope and practice of pathology. In this review, we discuss the evolution, necessary components, architecture and functionality of the APLIS that are crucial to today's practicing pathologist and address the demands of emerging trends on the future APLIS.
ERIC Educational Resources Information Center
Parkes, Jay
2007-01-01
Reliability consists of both important social and scientific values and methods for evidencing those values, though in practice methods are often conflated with the values. With the two distinctly understood, a reliability argument can be made that articulates the particular reliability values most relevant to the particular measurement situation…
Application of human reliability analysis to nursing errors in hospitals.
Inoue, Kayoko; Koizumi, Akio
2004-12-01
Adverse events in hospitals, such as in surgery, anesthesia, radiology, intensive care, internal medicine, and pharmacy, are of worldwide concern and it is important, therefore, to learn from such incidents. There are currently no appropriate tools based on state-of-the art models available for the analysis of large bodies of medical incident reports. In this study, a new model was developed to facilitate medical error analysis in combination with quantitative risk assessment. This model enables detection of the organizational factors that underlie medical errors, and the expedition of decision making in terms of necessary action. Furthermore, it determines medical tasks as module practices and uses a unique coding system to describe incidents. This coding system has seven vectors for error classification: patient category, working shift, module practice, linkage chain (error type, direct threat, and indirect threat), medication, severity, and potential hazard. Such mathematical formulation permitted us to derive two parameters: error rates for module practices and weights for the aforementioned seven elements. The error rate of each module practice was calculated by dividing the annual number of incident reports of each module practice by the annual number of the corresponding module practice. The weight of a given element was calculated by the summation of incident report error rates for an element of interest. This model was applied specifically to nursing practices in six hospitals over a year; 5,339 incident reports with a total of 63,294,144 module practices conducted were analyzed. Quality assurance (QA) of our model was introduced by checking the records of quantities of practices and reproducibility of analysis of medical incident reports. For both items, QA guaranteed legitimacy of our model. Error rates for all module practices were approximately of the order 10(-4) in all hospitals. Three major organizational factors were found to underlie medical errors: "violation of rules" with a weight of 826 x 10(-4), "failure of labor management" with a weight of 661 x 10(-4), and "defects in the standardization of nursing practices" with a weight of 495 x 10(-4).
Federal Register 2010, 2011, 2012, 2013, 2014
2012-05-14
... practice recommendations on emergency alerting systems such as promoting E9-1-1 reliability and alerting platforms--Emergency Alert System and Common Alerting Protocol. DATES: June 6, 2012. ADDRESSES: Federal... Advisory Committee that will provide recommendations to the FCC regarding best practices and actions the...
Media Reporting of Practice-Changing Clinical Trials in Oncology: A North American Perspective
Vickers, Michael M.; O’Connor, Stephen; Valdes, Mario; Tang, Patricia A.
2016-01-01
Introduction. Media reporting of clinical trials impacts patient-oncologist interactions. We sought to characterize the accuracy of media and Internet reporting of practice-changing clinical trials in oncology. Materials and Methods. The first media articles referencing 17 practice-changing clinical trials were collected from 4 media outlets: newspapers, cable news, cancer websites, and industry websites. Measured outcomes were media reporting score, social media score, and academic citation score. The media reporting score was a measure of completeness of information detailed in media articles as scored by a 15-point scoring instrument. The social media score represented the ubiquity of social media presence referencing 17 practice-changing clinical trials in cancer as determined by the American Society of Clinical Oncology in its annual report, entitled Clinical Cancer Advances 2012; social media score was calculated from Twitter, Facebook, and Google searches. The academic citation score comprised total citations from Google Scholar plus the Scopus database, which represented the academic impact per clinical cancer advance. Results. From 170 media articles, 107 (63%) had sufficient data for analysis. Cohen’s κ coefficient demonstrated reliability of the media reporting score instrument with a coefficient of determination of 94%. Per the media reporting score, information was most complete from industry, followed by cancer websites, newspapers, and cable news. The most commonly omitted items, in descending order, were study limitations, exclusion criteria, conflict of interest, and other. The social media score was weakly correlated with academic citation score. Conclusion. Media outlets appear to have set a low bar for coverage of many practice-changing advances in oncology, with reports of scientific breakthroughs often omitting basic study facts and cautions, which may mislead the public. The media should be encouraged to use a standardized reporting template and provide accessible references to original source information whenever feasible. Implications for Practice: North American newspapers, cable news, cancer websites, and industry websites were searched for their reporting on 17 practice-changing clinical trials in oncology as highlighted by the American Society of Clinical Oncology in its 2012 annual report, Clinical Cancer Advances. Accuracy of reporting across media platforms was evaluated, and the social media buzz and academic interest generated by each clinical trial was gauged. The findings represent, to the authors’ knowledge, the first systematic effort to appraise the reporting of practice-changing clinical trials in oncology across various media platforms. Use of a standardized reporting template by the media is proposed to reduce flaws in their reporting of clinical trials to the public. PMID:26921290
Nutakki, Kavitha; Varni, James W; Swigonski, Nancy L
2018-04-01
The objective of the present study was to report on the measurement properties of the Pediatric Quality of Life Inventory (PedsQL) Neurofibromatosis Type 1 Module for pediatric patients ages 5-25 from the perspectives of patients and parents. The 104-item PedsQL NF1 Module and 23-item PedsQL Generic Core Scales were completed in a multi-site national study by 323 patients and 335 parents (343 families). Patients were diagnosed with NF1 using the National Institutes of Health diagnostic criteria. In addition to a Total Scale Score, 18 unidimensional scales were derived measuring skin itch bother, skin sensations, pain, pain impact, pain management, cognitive functioning, speech, fine motor, balance, vision, perceived physical appearance, communication, worry, treatment anxiety, medicines, stomach discomfort, constipation, and diarrhea. The PedsQL NF1 Module Scales evidenced excellent feasibility, excellent reliability for the Total Scale Scores (patient self-report α = 0.98; parent proxy-report α = 0.98), and good to excellent reliability for the 18 individual scales (patient self-report α = 0.71-0.96; parent proxy-report α = 0.73-0.98). Intercorrelations with the Generic Core Scales supported construct validity. Factor analysis supported the unidimensionality of the 18 individual scales. The PedsQL NF1 Module Scales demonstrated acceptable to excellent measurement properties, and may be utilized as standardized metrics to assess NF1-specific symptoms and problems in clinical research and practice in children, adolescents, and young adults.
Williams, Nathaniel J
2016-05-05
Intentions play a central role in numerous empirically supported theories of behavior and behavior change and have been identified as a potentially important antecedent to successful evidence-based treatment (EBT) implementation. Despite this, few measures of mental health clinicians' EBT intentions exist and available measures have not been subject to thorough psychometric evaluation or testing. This paper evaluates the psychometric properties of the evidence-based treatment intentions (EBTI) scale, a new measure of mental health clinicians' intentions to adopt EBTs. The study evaluates the reliability and validity of inferences made with the EBTI using multi-method, multi-informant criterion variables collected over 12 months from a sample of 197 mental health clinicians delivering services in 13 mental health agencies. Structural, predictive, and discriminant validity evidence is assessed. Findings support the EBTI's factor structure (χ (2) = 3.96, df = 5, p = .556) and internal consistency reliability (α = .80). Predictive validity evidence was provided by robust and significant associations between EBTI scores and clinicians' observer-reported attendance at a voluntary EBT workshop at a 1-month follow-up (OR = 1.92, p < .05), self-reported EBT adoption at a 12-month follow-up (R (2) = .17, p < .001), and self-reported use of EBTs with clients at a 12-month follow-up (R (2) = .25, p < .001). Discriminant validity evidence was provided by small associations with clinicians' concurrently measured psychological work climate perceptions of functionality (R (2) = .06, p < .05), engagement (R (2) = .06, p < .05), and stress (R (2) = .00, ns). The EBTI is a practical and theoretically grounded measure of mental health clinicians' EBT intentions. Scores on the EBTI provide a basis for valid inferences regarding mental health clinicians' intentions to adopt EBTs. Discussion focuses on research and practice applications.
ERIC Educational Resources Information Center
Buchanan, Thomas D.
2011-01-01
Indicators of the proficiency of teacher candidates at applying knowledge of child development to teaching and learning were examined to see if they predict the overall success of the candidates full-time student teaching. The assessment instrument, the Full-Time Student Teaching Summary Report (FSTSR), was found statistically reliable and…
Lai, Jin-Shei; Cella, David; Choi, Seung; Junghaenel, Doerte U; Christodoulou, Christopher; Gershon, Richard; Stone, Arthur
2011-10-01
To illustrate how measurement practices can be advanced by using as an example the fatigue item bank (FIB) and its applications (short forms and computerized adaptive testing [CAT]) that were developed through the National Institutes of Health Patient Reported Outcomes Measurement Information System (PROMIS) Cooperative Group. Psychometric analysis of data collected by an Internet survey company using item response theory-related techniques. A U.S. general population representative sample collected through the Internet. Respondents used for dimensionality evaluation of the PROMIS FIB (N=603) and item calibrations (N=14,931). Not applicable. Fatigue items (112) developed by the PROMIS fatigue domain working group, 13-item Functional Assessment of Chronic Illness Therapy-Fatigue, and 4-item Medical Outcomes Study 36-Item Short Form Health Survey Vitality scale. The PROMIS FIB version 1, which consists of 95 items, showed acceptable psychometric properties. CAT showed consistently better precision than short forms. However, all 3 short forms showed good precision for most participants in that more than 95% of the sample could be measured precisely with reliability greater than 0.9. Measurement practice can be advanced by using a psychometrically sound measurement tool and its applications. This example shows that CAT and short forms derived from the PROMIS FIB can reliably estimate fatigue reported by the U.S. general population. Evaluation in clinical populations is warranted before the item bank can be used for clinical trials. Copyright © 2011 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Hanskamp-Sebregts, Mirelle; Zegers, Marieke; Vincent, Charles; van Gurp, Petra J; de Vet, Henrica C W; Wollersheim, Hub
2016-01-01
Objectives Record review is the most used method to quantify patient safety. We systematically reviewed the reliability and validity of adverse event detection with record review. Design A systematic review of the literature. Methods We searched PubMed, EMBASE, CINAHL, PsycINFO and the Cochrane Library and from their inception through February 2015. We included all studies that aimed to describe the reliability and/or validity of record review. Two reviewers conducted data extraction. We pooled κ values (κ) and analysed the differences in subgroups according to number of reviewers, reviewer experience and training level, adjusted for the prevalence of adverse events. Results In 25 studies, the psychometric data of the Global Trigger Tool (GTT) and the Harvard Medical Practice Study (HMPS) were reported and 24 studies were included for statistical pooling. The inter-rater reliability of the GTT and HMPS showed a pooled κ of 0.65 and 0.55, respectively. The inter-rater agreement was statistically significantly higher when the group of reviewers within a study consisted of a maximum five reviewers. We found no studies reporting on the validity of the GTT and HMPS. Conclusions The reliability of record review is moderate to substantial and improved when a small group of reviewers carried out record review. The validity of the record review method has never been evaluated, while clinical data registries, autopsy or direct observations of patient care are potential reference methods that can be used to test concurrent validity. PMID:27550650
ERIP Project No. 670, Nevada Energy Control Systems, Inc.. Final techincal progress report
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kimber, D.J.
1998-02-11
In order to gauge the effectiveness of the ERIP Project No. 670, Nevada Energy Control Systems, Inc., Grant Number DE-FG01-96EE15670, the Statement of Work must be compared to the achievements by NECSI during the grant period. The following report reflects the aforementioned statement and is coordinated directly with it. The project goal is to gather data and test in order to validate earlier tests of energy savings,safety,reliability and practicality of the NECSI Evaporator Fan Controller in order to fully commercialize and market the product.
Maassen, Gerard H
2010-08-01
In this Journal, Lewis and colleagues introduced a new Reliable Change Index (RCI(WSD)), which incorporated the within-subject standard deviation (WSD) of a repeated measurement design as the standard error. In this note, two opposite errors in using WSD this way are demonstrated. First, being the standard error of measurement of only a single assessment makes WSD too small when practice effects are absent. Then, too many individuals will be designated reliably changed. Second, WSD can grow unlimitedly to the extent that differential practice effects occur. This can even make RCI(WSD) unable to detect any reliable change.
Design of high reliability organizations in health care
Carroll, J S; Rudolph, J W
2006-01-01
To improve safety performance, many healthcare organizations have sought to emulate high reliability organizations from industries such as nuclear power, chemical processing, and military operations. We outline high reliability design principles for healthcare organizations including both the formal structures and the informal practices that complement those structures. A stage model of organizational structures and practices, moving from local autonomy to formal controls to open inquiry to deep self‐understanding, is used to illustrate typical challenges and design possibilities at each stage. We suggest how organizations can use the concepts and examples presented to increase their capacity to self‐design for safety and reliability. PMID:17142607
Marawar, Satyajit V; Madom, Ian A; Palumbo, Mark; Tallarico, Richard A; Ordway, Nathaniel R; Metkar, Umesh; Wang, Dongliang; Green, Adam; Lavelle, William F
2017-01-01
Treating surgeon's visual assessment of axial MRI images to ascertain the degree of stenosis has a critical impact on surgical decision-making. The purpose of this study was to prospectively analyze the impact of surgeon experience on inter-observer and intra-observer reliability of assessing severity of spinal stenosis on MRIs by spine surgeons directly involved in surgical decision-making. Seven fellowship trained spine surgeons reviewed MRI studies of 30 symptomatic patients with lumbar stenosis and graded the stenosis in the central canal, the lateral recess and the foramen at T12-L1 to L5-S1 as none, mild, moderate or severe. No specific instructions were provided to what constituted mild, moderate, or severe stenosis. Two surgeons were "senior" (>fifteen years of practice experience); two were "intermediate" (>four years of practice experience), and three "junior" (< one year of practice experience). The concordance correlation coefficient (CCC) was calculated to assess inter-observer reliability. Seven MRI studies were duplicated and randomly re-read to evaluate inter-observer reliability. Surgeon experience was found to be a strong predictor of inter-observer reliability. Senior inter-observer reliability was significantly higher assessing central(p<0.001), foraminal p=0.005 and lateral p=0.001 than "junior" group.Senior group also showed significantly higher inter-observer reliability that intermediate group assessing foraminal stenosis (p=0.036). In intra-observer reliability the results were contrary to that found in inter-observer reliability. Inter-observer reliability of assessing stenosis on MRIs increases with surgeon experience. Lower intra-observer reliability values among the senior group, although not clearly explained, may be due to the small number of MRIs evaluated and quality of MRI images.Level of evidence: Level 3.
Seeking high reliability in primary care: Leadership, tools, and organization.
Weaver, Robert R
2015-01-01
Leaders in health care increasingly recognize that improving health care quality and safety requires developing an organizational culture that fosters high reliability and continuous process improvement. For various reasons, a reliability-seeking culture is lacking in most health care settings. Developing a reliability-seeking culture requires leaders' sustained commitment to reliability principles using key mechanisms to embed those principles widely in the organization. The aim of this study was to examine how key mechanisms used by a primary care practice (PCP) might foster a reliability-seeking, system-oriented organizational culture. A case study approach was used to investigate the PCP's reliability culture. The study examined four cultural artifacts used to embed reliability-seeking principles across the organization: leadership statements, decision support tools, and two organizational processes. To decipher their effects on reliability, the study relied on observations of work patterns and the tools' use, interactions during morning huddles and process improvement meetings, interviews with clinical and office staff, and a "collective mindfulness" questionnaire. The five reliability principles framed the data analysis. Leadership statements articulated principles that oriented the PCP toward a reliability-seeking culture of care. Reliability principles became embedded in the everyday discourse and actions through the use of "problem knowledge coupler" decision support tools and daily "huddles." Practitioners and staff were encouraged to report unexpected events or close calls that arose and which often initiated a formal "process change" used to adjust routines and prevent adverse events from recurring. Activities that foster reliable patient care became part of the taken-for-granted routine at the PCP. The analysis illustrates the role leadership, tools, and organizational processes play in developing and embedding a reliable-seeking culture across an organization. Progress toward a reliability-seeking, system-oriented approach to care remains ongoing, and movement in that direction requires deliberate and sustained effort by committed leaders in health care.
PedsQL™ Eosinophilic Esophagitis Module: Feasibility, Reliability and Validity
Franciosi, James P.; Hommel, Kevin A.; Bendo, Cristiane B.; King, Eileen C.; Collins, Margaret H.; Eby, Michael D.; Marsolo, Keith; Abonia, J. Pablo; von Tiehl, Karl F.; Putnam, Philip E.; Greenler, Alexandria J.; Greenberg, Allison B.; Bryson, Ronald A.; Davis, Carla M.; Olive, Anthony P.; Gupta, Sandeep K.; Erwin, Elizabeth A.; Klinnert, Mary D.; Spergel, Jonathan M.; Denham, Jolanda M.; Furuta, Glenn T.; Rothenberg, Marc E.; Varni, James W.
2014-01-01
Objective Eosinophilic esophagitis (EoE) is a chronic esophageal inflammatory condition with a paucity of information on health-related quality of life (HRQOL). The objective of the study was to report on the measurement properties of the PedsQL™ EoE Module. Methods The PedsQL™ EoE Module was completed in a multisite study by 196 pediatric EoE patients and 262 EoE parents. Results The PedsQL™ EoE Module scales evidenced excellent feasibility (0.6%–3.1% missing), excellent group comparison reliability across total scale scores (patient α = 0.93; parent proxy α = 0.94), good reliability for the seven individual scales (patient α = 0.75–0.87; parent proxy α = 0.81–0.92), excellent test-retest reliability (patient ICC = 0.88; parent ICC= 0.82), demonstrated no floor effects and low ceiling effects, and demonstrated a high percentage of scaling success for most scales. Intercorrelations with the PedsQL™ Generic Core Scales were in the medium (0.30) to large (0.50) range. PedsQL™ EoE Module scores were worse among patients with active histologic disease (> 5 eos/hpf) compared to those in remission (patient self-report: 63.3 vs. 69.9 [p<0.05]; parent proxy-report: 65.1 vs. 72.3 [p<0.01]), and those treated with dietary restrictions compared to those with no restrictions (patient self-report: 61.6 vs. 74.3 [p< 0.01]; parent proxy-report: 65.5 vs. 74.7 [p<0.01]). Conclusions The results demonstrate excellent measurement properties of the PedsQL™ EoE Module. Patients with active histologic disease and those treated with dietary restrictions demonstrated worse PedsQL™ scores. The PedsQL™ EoE Module may be utilized in evaluation of pediatric EoE disease-specific HRQOL in clinical research and practice. PMID:23478422
Defining and assessing professional competence.
Epstein, Ronald M; Hundert, Edward M
2002-01-09
Current assessment formats for physicians and trainees reliably test core knowledge and basic skills. However, they may underemphasize some important domains of professional medical practice, including interpersonal skills, lifelong learning, professionalism, and integration of core knowledge into clinical practice. To propose a definition of professional competence, to review current means for assessing it, and to suggest new approaches to assessment. We searched the MEDLINE database from 1966 to 2001 and reference lists of relevant articles for English-language studies of reliability or validity of measures of competence of physicians, medical students, and residents. We excluded articles of a purely descriptive nature, duplicate reports, reviews, and opinions and position statements, which yielded 195 relevant citations. Data were abstracted by 1 of us (R.M.E.). Quality criteria for inclusion were broad, given the heterogeneity of interventions, complexity of outcome measures, and paucity of randomized or longitudinal study designs. We generated an inclusive definition of competence: the habitual and judicious use of communication, knowledge, technical skills, clinical reasoning, emotions, values, and reflection in daily practice for the benefit of the individual and the community being served. Aside from protecting the public and limiting access to advanced training, assessments should foster habits of learning and self-reflection and drive institutional change. Subjective, multiple-choice, and standardized patient assessments, although reliable, underemphasize important domains of professional competence: integration of knowledge and skills, context of care, information management, teamwork, health systems, and patient-physician relationships. Few assessments observe trainees in real-life situations, incorporate the perspectives of peers and patients, or use measures that predict clinical outcomes. In addition to assessments of basic skills, new formats that assess clinical reasoning, expert judgment, management of ambiguity, professionalism, time management, learning strategies, and teamwork promise a multidimensional assessment while maintaining adequate reliability and validity. Institutional support, reflection, and mentoring must accompany the development of assessment programs.
Hyde, M K; Chambers, S K; Shum, D; Ip, D; Dunn, J
2016-09-01
This systematic review describes psychosocial and quality of life (QOL) measures used in psycho-oncology research with cancer patients and caregivers in China. Medline and PsycINFO databases were searched (1980-2014). Studies reviewed met the following criteria: English language; peer-reviewed; sampled Chinese cancer patients/caregivers; developed, validated or assessed psychometric properties of psychosocial or QOL outcome measures; and reported validation data. The review examined characteristics of measures and participants, translation and cultural adaptation processes and psychometric properties of the measures. Ninety five studies met review criteria. Common characteristics of studies reviewed were they: assessed primarily QOL measures, sampled patients with breast, colorectal, or head and neck cancer, and validated existing measures (>80%) originating in North America or Europe. Few studies reported difficulties translating measures. Regarding psychometric properties of the measures >50% of studies reported subscale reliabilities <α = 0.70, <50% reported test-retest reliability, and <30% reported divergent validity. Few reported sensitivity, specificity or responsiveness. Improved accuracy and transparency of reporting for translation, cultural adaptation and psychometric testing of psychosocial measures is needed. Developing support structures for translating and validating psychosocial measures would enable this and ensure Chinese psycho-oncology clinical practice and research keeps pace with international focus on patient reported outcome measures and data management. © 2015 John Wiley & Sons Ltd.
Federal Register 2010, 2011, 2012, 2013, 2014
2012-08-13
..., Reliability, and Interoperability Council AGENCY: Federal Communications Commission. ACTION: Notice of public..., Reliability, and Interoperability Council (CSRIC) will hold its fifth meeting. The CSRIC will vote on... to the FCC regarding best practices and actions the FCC can take to ensure the security, reliability...
Teaching nurses teamwork: Integrative review of competency-based team training in nursing education.
Barton, Glenn; Bruce, Anne; Schreiber, Rita
2017-12-20
Widespread demands for high reliability healthcare teamwork have given rise to many educational initiatives aimed at building team competence. Most effort has focused on interprofessional team training however; Registered Nursing teams comprise the largest human resource delivering direct patient care in hospitals. Nurses also influence many other health team outcomes, yet little is known about the team training curricula they receive, and furthermore what specific factors help translate teamwork competency to nursing practice. The aim of this review is to critically analyse empirical published work reporting on teamwork education interventions in nursing, and identify key educational considerations enabling teamwork competency in this group. CINAHL, Web of Science, Academic Search Complete, and ERIC databases were searched and detailed inclusion-exclusion criteria applied. Studies (n = 19) were selected and evaluated using established qualitative-quantitative appraisal tools and a systematic constant comparative approach. Nursing teamwork knowledge is rooted in High Reliability Teams theory and Crew or Crisis Resource Management sources. Constructivist pedagogy is used to teach, practice, and refine teamwork competency. Nursing teamwork assessment is complex; involving integrated yet individualized determinations of knowledge, skills, and attitudes. Future initiatives need consider frontline leadership, supportive followership and skilled communication emphasis. Collective stakeholder support is required to translate teamwork competency into nursing practice. Copyright © 2017 Elsevier Ltd. All rights reserved.
Is the statistic value all we should care about in neuroimaging?
Chen, Gang; Taylor, Paul A; Cox, Robert W
2017-02-15
Here we address an important issue that has been embedded within the neuroimaging community for a long time: the absence of effect estimates in results reporting in the literature. The statistic value itself, as a dimensionless measure, does not provide information on the biophysical interpretation of a study, and it certainly does not represent the whole picture of a study. Unfortunately, in contrast to standard practice in most scientific fields, effect (or amplitude) estimates are usually not provided in most results reporting in the current neuroimaging publications and presentations. Possible reasons underlying this general trend include (1) lack of general awareness, (2) software limitations, (3) inaccurate estimation of the BOLD response, and (4) poor modeling due to our relatively limited understanding of FMRI signal components. However, as we discuss here, such reporting damages the reliability and interpretability of the scientific findings themselves, and there is in fact no overwhelming reason for such a practice to persist. In order to promote meaningful interpretation, cross validation, reproducibility, meta and power analyses in neuroimaging, we strongly suggest that, as part of good scientific practice, effect estimates should be reported together with their corresponding statistic values. We provide several easily adaptable recommendations for facilitating this process. Published by Elsevier Inc.
Kwok, Cannas; Endrawes, Gihane; Lee, Chun Fan
2016-02-01
The aim of the study was to report the psychometric properties of the Arabic version of the Breast Cancer Screening Beliefs Questionnaire (BCSBQ). A convenience sample of 251 Arabic-Australian women was recruited from a number of Arabic community organizations. Construct validity was examined by Cuzick's non-parametric test while Cronbach α was used to assess internal consistency reliability. Explanatory factor analysis was conducted to study the factor structure. The results indicated that the Arabic version of the BCSBQ had satisfactory validity and internal consistency. The Cronbach's alpha of the three subscales ranged between 0.810 and 0.93. The frequency of breast cancer screening practices (breast awareness, clinical breast-examination and mammography) were significantly associated with attitudes towards general health check-up and perceived barriers to mammographic screening. Exploratory factor analysis showed a similar fit for the hypothesized three-factor structure with our data set. The Arabic version of the BCBSQ is a culturally appropriate, valid and reliable instrument for assessing the beliefs, knowledge and attitudes to breast cancer and breast cancer screening practices among Arabic-Australian women. Copyright © 2015 Elsevier Ltd. All rights reserved.
Morrison, Geoffrey Stewart
2014-05-01
In this paper it is argued that one should not attempt to directly assess whether a forensic analysis technique is scientifically acceptable. Rather one should first specify what one considers to be appropriate principles governing acceptable practice, then consider any particular approach in light of those principles. This paper focuses on one principle: the validity and reliability of an approach should be empirically tested under conditions reflecting those of the case under investigation using test data drawn from the relevant population. Versions of this principle have been key elements in several reports on forensic science, including forensic voice comparison, published over the last four-and-a-half decades. The aural-spectrographic approach to forensic voice comparison (also known as "voiceprint" or "voicegram" examination) and the currently widely practiced auditory-acoustic-phonetic approach are considered in light of this principle (these two approaches do not appear to be mutually exclusive). Approaches based on data, quantitative measurements, and statistical models are also considered in light of this principle. © 2013.
High reliability and implications for nursing leaders.
Riley, William
2009-03-01
To review high reliability theory and discuss its implications for the nursing leader. A high reliability organization (HRO) is considered that which has measurable near perfect performance for quality and safety. The author has reviewed the literature, discussed research findings that contribute to improving reliability in health care organizations, and makes five recommendations for how nursing leaders can create high reliability organizations. Health care is not a safe industry and unintended patient harm occurs at epidemic levels. Health care can learn from high reliability theory and practice developed in other high-risk industries. Viewed by HRO standards, unintended patient injury in health care is excessively high and quality is distressingly low. HRO theory and practice can be successfully applied in health care using advanced interdisciplinary teamwork training and deliberate process design techniques. Nursing has a primary leadership function for ensuring patient safety and achieving high quality in health care organizations. Learning HRO theory and methods for achieving high reliability is a foremost opportunity for nursing leaders.
Navy applications experience with small wind power systems
NASA Astrophysics Data System (ADS)
Pal, D.
1985-05-01
This report describes the experience gained and lesson learned from the ongoing field evaluations of seven small, 2-to 20-kW wind energy conversion systems (WECS) at Navy installations located in the Southern California desert, on San Nicolas Island, in California, and in Kaneohe Bay, Hawaii. The field tests show that the WECS's bearings and yaw slip-rings are prone to failure. The failures were attributed to the corrosive environment and poor design practices. Based upon the field tests, it is concluded that a reliable WECS must use a permanent magnet alternator without a gearbox and yaw slip-rings that are driven by a fixed pitch wind turbine rotor. The present state-of-the-art in small WECS technology, including environmental concerns, is reviewed. Also presented is how the technology is advancing to improve reliability and availability for effectively using wind power at Navy bases. The field evaluations are continuing on the small WECS in order to develop operation, maintenance, and reliability data.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Emery, John M.; Coffin, Peter; Robbins, Brian A.
Microstructural variabilities are among the predominant sources of uncertainty in structural performance and reliability. We seek to develop efficient algorithms for multiscale calcu- lations for polycrystalline alloys such as aluminum alloy 6061-T6 in environments where ductile fracture is the dominant failure mode. Our approach employs concurrent multiscale methods, but does not focus on their development. They are a necessary but not sufficient ingredient to multiscale reliability predictions. We have focused on how to efficiently use concurrent models for forward propagation because practical applications cannot include fine-scale details throughout the problem domain due to exorbitant computational demand. Our approach begins withmore » a low-fidelity prediction at the engineering scale that is sub- sequently refined with multiscale simulation. The results presented in this report focus on plasticity and damage at the meso-scale, efforts to expedite Monte Carlo simulation with mi- crostructural considerations, modeling aspects regarding geometric representation of grains and second-phase particles, and contrasting algorithms for scale coupling.« less
Reliability Problems of the Datum: Solutions for Questionnaire Responses.
ERIC Educational Resources Information Center
Bastick, Tony
Questionnaires often ask for estimates, and these estimates are given with different reliabilities. It is difficult to know the different reliabilities of single estimates and to take these into account in subsequent analyses. This paper contains a practical example to show that not taking the reliability of different responses into account can…
Singh, Amika S; Chinapaw, Mai J M; Uijtdewilligen, Léonie; Vik, Froydis N; van Lippevelde, Wendy; Fernández-Alvira, Juan M; Stomfai, Sarolta; Manios, Yannis; van der Sluijs, Maria; Terwee, Caroline; Brug, Johannes
2012-08-13
Insight in parental energy balance-related behaviours, their determinants and parenting practices are important to inform childhood obesity prevention. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. The objective of the current study was to examine the test-retest reliability and construct validity of the parent questionnaire used in the ENERGY-project, assessing parental energy balance-related behaviours, their determinants, and parenting practices among parents of 10-12 year old children. We collected data among parents (n = 316 in the test-retest reliability study; n = 109 in the construct validity study) of 10-12 year-old children in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent interview was assessed using ICC and percentage agreement.All but one item showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Construct validity appeared to be good to excellent for 92 out of 121 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 29 items, construct validity was moderate for 24 and poor for 5 items. The reliability and construct validity of the items of the ENERGY-parent questionnaire on multiple energy balance-related behaviours, their potential determinants, and parenting practices appears to be good. Based on the results of the validity study, we strongly recommend adapting parts of the ENERGY-parent questionnaire if used in future research.
Good Cell Culture Practice for stem cells and stem-cell-derived models.
Pamies, David; Bal-Price, Anna; Simeonov, Anton; Tagle, Danilo; Allen, Dave; Gerhold, David; Yin, Dezhong; Pistollato, Francesca; Inutsuka, Takashi; Sullivan, Kristie; Stacey, Glyn; Salem, Harry; Leist, Marcel; Daneshian, Mardas; Vemuri, Mohan C; McFarland, Richard; Coecke, Sandra; Fitzpatrick, Suzanne C; Lakshmipathy, Uma; Mack, Amanda; Wang, Wen Bo; Yamazaki, Daiju; Sekino, Yuko; Kanda, Yasunari; Smirnova, Lena; Hartung, Thomas
2017-01-01
The first guidance on Good Cell Culture Practice (GCCP) dates back to 2005. This document expands this to include aspects of quality assurance for in vitro cell culture focusing on the increasingly diverse cell types and culture formats used in research, product development, testing and manufacture of biotechnology products and cell-based medicines. It provides a set of basic principles of best practice that can be used in training new personnel, reviewing and improving local procedures, and helping to assure standard practices and conditions for the comparison of data between laboratories and experimentation performed at different times. This includes recommendations for the documentation and reporting of culture conditions. It is intended as guidance to facilitate the generation of reliable data from cell culture systems, and is not intended to conflict with local or higher level legislation or regulatory requirements. It may not be possible to meet all recommendations in this guidance for practical, legal or other reasons. However, when it is necessary to divert from the principles of GCCP, the risk of decreasing the quality of work and the safety of laboratory staff should be addressed and any conclusions or alternative approaches justified. This workshop report is considered a first step toward a revised GCCP 2.0.
Perlea, Paula; Nistor, Cristina Coralia; Iliescu, Mihaela Georgiana; Iliescu, Alexandru Andrei
2015-01-01
Internal root resorption is a consequence of chronic pulp inflammation. Later on, the pulp necrosis followed by a chronic apical periodontitis is installed. Hence, usually, in clinical practice, both lesions have to be simultaneously managed. Conventional periapical radiograph is mandatory in diagnosis. Improving the diagnosis and management of both lesions, cone beam computed tomography proves to be more reliable than conventional radiography.
Pre-university Chemistry Students in a Mimicked Scholarly Peer Review
NASA Astrophysics Data System (ADS)
van Rens, Lisette; Hermarij, Philip; Pilot, Albert; Beishuizen, Jos; Hofman, Herman; Wal, Marjolein
2014-10-01
Peer review is a significant component in scientific research. Introducing peer review into inquiry processes may be regarded as an aim to develop student understanding regarding quality in inquiries. This study examines student understanding in inquiry peer reviews among pre-university chemistry students, aged 16-17, when they enact a design of a mimicked scholarly peer review. This design is based on a model of a human activity system. Twenty-five different schools in Brazil, Germany, Poland and The Netherlands participated. The students (n = 880) conducted in small groups (n = 428) open inquiries on fermentation. All groups prepared an inquiry report for peer review. These reports were published on a website. Groups were randomly paired in an internet symposium, where they posted review comments to their peers. These responses were qualitatively analyzed on small groups' level of understanding regarding seven categories: inquiry question, hypothesis, management of control variables, accurate measurement, presenting results, reliability of results, discussion and conclusion. The mimicked scholarly review prompted a collective practice. Student understanding was significantly well on presenting results, discussion and conclusion, and significantly less on inquiry question and reliability of results. An enacted design, based on a model of a human activity system, created student understanding of quality in inquiries as well as an insight in a peer-reviewing practice. To what extent this model can be applied in a broader context of design research in science education needs further study.
Letters, S; Smith, A J; McHugh, S; Bagg, J
2005-10-22
This study examined methods used for reprocessing endodontic instruments in general dental practice and determined the degree of residual visual contamination and blood contamination on 250 reprocessed files collected from 25 general dental practices. A questionnaire was administered to 25 general dental practitioners to obtain information on the re-processing of used endodontic files. Ten files which had been used and reprocessed were also collected from each practice. These were examined visually under a dissecting light microscope for residual contamination and then tested for blood deposits using the Kastle-Meyer test. Nineteen of the 25 practices used stainless steel hand files. No practitioners used endodontic files as single use devices. Ninety-two per cent of the practitioners discarded and replaced files when they were bent or damaged. Several decontamination methods were reported. The two combinations employed most frequently were manual cleaning and autoclaving or manual cleaning, followed by ultrasonic cleaning and autoclaving. Of the 250 files, 75% showed some degree of visual contamination and seven percent tested positive for residual blood. Blood contaminated files were significantly more heavily contaminated when examined visually. Large variations were found in residual contamination of files collected from practices using the same methods of decontamination. While all practitioners re-used endodontic files, the variations in decontamination methods reported indicate a lack of clarity on best practice. This study demonstrates that endodontic files are not reliably decontaminated by methods currently employed in dental practice.
Noble, Stephanie; Spann, Marisa N; Tokoglu, Fuyuze; Shen, Xilin; Constable, R Todd; Scheinost, Dustin
2017-11-01
Best practices are currently being developed for the acquisition and processing of resting-state magnetic resonance imaging data used to estimate brain functional organization-or "functional connectivity." Standards have been proposed based on test-retest reliability, but open questions remain. These include how amount of data per subject influences whole-brain reliability, the influence of increasing runs versus sessions, the spatial distribution of reliability, the reliability of multivariate methods, and, crucially, how reliability maps onto prediction of behavior. We collected a dataset of 12 extensively sampled individuals (144 min data each across 2 identically configured scanners) to assess test-retest reliability of whole-brain connectivity within the generalizability theory framework. We used Human Connectome Project data to replicate these analyses and relate reliability to behavioral prediction. Overall, the historical 5-min scan produced poor reliability averaged across connections. Increasing the number of sessions was more beneficial than increasing runs. Reliability was lowest for subcortical connections and highest for within-network cortical connections. Multivariate reliability was greater than univariate. Finally, reliability could not be used to improve prediction; these findings are among the first to underscore this distinction for functional connectivity. A comprehensive understanding of test-retest reliability, including its limitations, supports the development of best practices in the field. © The Author 2017. Published by Oxford University Press.
ERIC Educational Resources Information Center
Uggioni, Paula Lazzarin; Salay, Elisabette
2013-01-01
Objective: The objective of this study was to develop a validated and reliable questionnaire to measure consumer knowledge regarding safe practices to prevent microbiological contamination in restaurants and commercial kitchens. Methods: Non-probabilistic samples of individuals were interviewed in the city of Campinas, Brazil. Questionnaire items…
The Development of a Practical and Reliable Assessment Measure for Atopic Dermatitis (ADAM).
ERIC Educational Resources Information Center
Charman, Denise; Varigos, George; Horne, David J. de L.; Oberklaid, Frank
1999-01-01
A study was conducted in Australia to develop a reliable, valid, and practical measure of atopic dermatitis. The test development process and validity evaluation with two doctors and 51 patients are discussed. Results suggest that operational definitions of the scales need to be defined more clearly. The measure satisfies assumptions for a partial…
Powers, John H.; Patrick, Donald L.; Walton, Marc K.; Marquis, Patrick; Cano, Stefan; Hobart, Jeremy; Isaac, Maria; Vamvakas, Spiros; Slagle, Ashley; Molsen, Elizabeth; Burke, Laurie B.
2017-01-01
A clinician-reported outcome (ClinRO) assessment is a type of clinical outcome assessment (COA). ClinRO assessments, like all COAs (patient-reported, observer-reported, or performance outcome assessments), are used to 1) measure patients’ health status and 2) define end points that can be interpreted as treatment benefits of medical interventions on how patients feel, function, or survive in clinical trials. Like other COAs, ClinRO assessments can be influenced by human choices, judgment, or motivation. A ClinRO assessment is conducted and reported by a trained health care professional and requires specialized professional training to evaluate the patient’s health status. This is the second of two reports by the ISPOR Clinical Outcomes Assessment—Emerging Good Practices for Outcomes Research Task Force. The first report provided an overview of COAs including definitions important for an understanding of COA measurement practices. This report focuses specifically on issues related to ClinRO assessments. In this report, we define three types of ClinRO assessments (readings, ratings, and clinician global assessments) and describe emerging good measurement practices in their development and evaluation. The good measurement practices include 1) defining the context of use; 2) identifying the concept of interest measured; 3) defining the intended treatment benefit on how patients feel, function, or survive reflected by the ClinRO assessment and evaluating the relationship between that intended treatment benefit and the concept of interest; 4) documenting content validity; 5) evaluating other measurement properties once content validity is established (including intra- and inter-rater reliability); 6) defining study objectives and end point(s) objectives, and defining study end points and placing study end points within the hierarchy of end points; 7) establishing interpretability in trial results; and 8) evaluating operational considerations for the implementation of ClinRO assessments used as end points in clinical trials. Applying good measurement practices to ClinRO assessment development and evaluation will lead to more efficient and accurate measurement of treatment effects. This is important beyond regulatory approval in that it provides evidence for the uptake of new interventions into clinical practice and provides justification to payers for reimbursement on the basis of the clearly demonstrated added value of the new intervention. PMID:28212963
Powers, John H; Patrick, Donald L; Walton, Marc K; Marquis, Patrick; Cano, Stefan; Hobart, Jeremy; Isaac, Maria; Vamvakas, Spiros; Slagle, Ashley; Molsen, Elizabeth; Burke, Laurie B
2017-01-01
A clinician-reported outcome (ClinRO) assessment is a type of clinical outcome assessment (COA). ClinRO assessments, like all COAs (patient-reported, observer-reported, or performance outcome assessments), are used to 1) measure patients' health status and 2) define end points that can be interpreted as treatment benefits of medical interventions on how patients feel, function, or survive in clinical trials. Like other COAs, ClinRO assessments can be influenced by human choices, judgment, or motivation. A ClinRO assessment is conducted and reported by a trained health care professional and requires specialized professional training to evaluate the patient's health status. This is the second of two reports by the ISPOR Clinical Outcomes Assessment-Emerging Good Practices for Outcomes Research Task Force. The first report provided an overview of COAs including definitions important for an understanding of COA measurement practices. This report focuses specifically on issues related to ClinRO assessments. In this report, we define three types of ClinRO assessments (readings, ratings, and clinician global assessments) and describe emerging good measurement practices in their development and evaluation. The good measurement practices include 1) defining the context of use; 2) identifying the concept of interest measured; 3) defining the intended treatment benefit on how patients feel, function, or survive reflected by the ClinRO assessment and evaluating the relationship between that intended treatment benefit and the concept of interest; 4) documenting content validity; 5) evaluating other measurement properties once content validity is established (including intra- and inter-rater reliability); 6) defining study objectives and end point(s) objectives, and defining study end points and placing study end points within the hierarchy of end points; 7) establishing interpretability in trial results; and 8) evaluating operational considerations for the implementation of ClinRO assessments used as end points in clinical trials. Applying good measurement practices to ClinRO assessment development and evaluation will lead to more efficient and accurate measurement of treatment effects. This is important beyond regulatory approval in that it provides evidence for the uptake of new interventions into clinical practice and provides justification to payers for reimbursement on the basis of the clearly demonstrated added value of the new intervention. Copyright © 2017 International Society for Pharmacoeconomics and Outcomes Research (ISPOR). Published by Elsevier Inc. All rights reserved.
Thin-film module circuit design: Practical and reliability aspects
NASA Technical Reports Server (NTRS)
Daiello, R. V.; Twesme, E. N.
1985-01-01
This paper will address several aspects of the design and construction of submodules based on thin film amorphous silicon (a-Si) p i n solar cells. Starting from presently attainable single cell characteristics, and a realistic set of specifications, practical module designs are discussed from the viewpoints of efficient designs, the fabrication requirements, and reliability concerns. The examples center mostly on series interconnected modules of the superstrate type with detailed discussions of each portion of the structure in relation to its influence on module efficiency. Emphasis is placed on engineering topics such as: area coverage, optimal geometries, and cost and reliability. Practical constraints on achieving optimal designs, along with some examples of potential pitfalls in the manufacture and subsequent performance of a-Si modules are discussed.
Psychometric evaluation of the Revised Professional Practice Environment (RPPE) scale.
Erickson, Jeanette Ives; Duffy, Mary E; Ditomassi, Marianne; Jones, Dorothy
2009-05-01
The purpose was to examine the psychometric properties of the Revised Professional Practice Environment (RPPE) scale. Despite renewed focus on studying health professionals' practice environments, there are still few reliable and valid instruments available to assist nurse administrators in decision making. A psychometric evaluation using a random-sample cross-validation procedure (calibration sample [CS], n = 775; validation sample [VS], n = 775) was undertaken. Cronbach alpha internal consistency reliability of the total score (r = 0.93 [CS] and 0.92 [VS]), resulting subscale scores (r range: 0.80-0.87 [CS], 0.81-0.88 [VS]), and principal components analyses with Varimax rotation and Kaiser normalization (8 components, 59.2% variance [CS], 59.7% [VS]) produced almost identical results in both samples. The multidimensional RPPE is a psychometrically sound measure of 8 components of the professional practice environment in the acute care setting and sufficiently reliable and valid for use as independent subscales in healthcare research.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Logan, Jeffrey S.; Paranhos, Elizabeth; Kozak, Tracy G.
This study focuses on onshore natural gas operations and examines the extent to which oil and gas firms have embraced certain organizational characteristics that lead to 'high reliability' - understood here as strong safety and reliability records over extended periods of operation. The key questions that motivated this study include whether onshore oil and gas firms engaged in exploration and production (E&P) and midstream (i.e., natural gas transmission and storage) are implementing practices characteristic of high reliability organizations (HROs) and the extent to which any such practices are being driven by industry innovations and standards and/or regulatory requirements.
Survey points to practices that reduce refinery maintenance spending
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ricketts, R.
During the past decade, Solomon Associates Inc., Dallas, has conducted several comparative analyses of maintenance costs in the refining industry. These investigations have brought to light maintenance practices and reliability improvement activities that are responsible for the wide range of maintenance costs recorded by refineries. Some of the practices are of an organizational nature and thus are of interest to managers reviewing their operations. The paper discusses maintenance costs; profitability; cost trends; equipment availability; funds application; two basic organizational approached to maintenance (repair-focused organization and reliability-focused organization); low-cost practices; and organizational style.
GN&C Engineering Best Practices for Human-Rated Spacecraft Systems
NASA Technical Reports Server (NTRS)
Dennehy, Cornelius J.; Lebsock, Kenneth; West, John
2007-01-01
The NASA Engineering and Safety Center (NESC) recently completed an in-depth assessment to identify a comprehensive set of engineering considerations for the Design, Development, Test and Evaluation (DDT&E) of safe and reliable human-rated spacecraft systems. Reliability subject matter experts, discipline experts, and systems engineering experts were brought together to synthesize the current "best practices" both at the spacecraft system and subsystems levels. The objective of this paper is to summarize, for the larger Community of Practice, the initial set of Guidance, Navigation and Control (GN&C) engineering Best Practices as identified by this NESC assessment process.
GN&C Engineering Best Practices for Human-Rated Spacecraft System
NASA Technical Reports Server (NTRS)
Dennehy, Cornelius J.; Lebsock, Kenneth; West, John
2008-01-01
The NASA Engineering and Safety Center (NESC) recently completed an in-depth assessment to identify a comprehensive set of engineering considerations for the Design, Development, Test and Evaluation (DDT&E) of safe and reliable human-rated spacecraft systems. Reliability subject matter experts, discipline experts, and systems engineering experts were brought together to synthesize the current "best practices" both at the spacecraft system and subsystems levels. The objective of this paper is to summarize, for the larger Community of Practice, the initial set of Guidance, Navigation and Control (GN&C) engineering Best Practices as identified by this NESC assessment process.
GN&C Engineering Best Practices For Human-Rated Spacecraft Systems
NASA Technical Reports Server (NTRS)
Dennehy, Cornelius J.; Lebsock, Kenneth; West, John
2007-01-01
The NASA Engineering and Safety Center (NESC) recently completed an in-depth assessment to identify a comprehensive set of engineering considerations for the Design, Development, Test and Evaluation (DDT&E) of safe and reliable human-rated spacecraft systems. Reliability subject matter experts, discipline experts, and systems engineering experts were brought together to synthesize the current "best practices" both at the spacecraft system and subsystems levels. The objective of this paper is to summarize, for the larger Community of Practice, the initial set of Guidance, Navigation and Control (GN&C) engineering Best Practices as identified by this NESC assessment process.
O'Connor, Teresia M; Cerin, Ester; Hughes, Sheryl O; Robles, Jessica; Thompson, Deborah I; Mendoza, Jason A; Baranowski, Tom; Lee, Rebecca E
2014-01-15
Latino preschoolers (3-5 year old children) have among the highest rates of obesity. Low levels of physical activity (PA) are a risk factor for obesity. Characterizing what Latino parents do to encourage or discourage their preschooler to be physically active can help inform interventions to increase their PA. The objective was therefore to develop and assess the psychometrics of a new instrument: the Preschooler Physical Activity Parenting Practices (PPAPP) among a Latino sample, to assess parenting practices used to encourage or discourage PA among preschool-aged children. Cross-sectional study of 240 Latino parents who reported the frequency of using PA parenting practices. 95% of respondents were mothers; 42% had more than a high school education. Child mean age was 4.5 (±0.9) years (52% male). Test-retest reliability was assessed in 20%, 2 weeks later. We assessed the fit of a priori models using Confirmatory factor analyses (CFA). In a separate sub-sample (35%), preschool-aged children wore accelerometers to assess associations with their PA and PPAPP subscales. The a-priori models showed poor fit to the data. A modified factor structure for encouraging PPAPP had one multiple-item scale: engagement (15 items), and two single-items (have outdoor toys; not enroll in sport-reverse coded). The final factor structure for discouraging PPAPP had 4 subscales: promote inactive transport (3 items), promote screen time (3 items), psychological control (4 items) and restricting for safety (4 items). Test-retest reliability (ICC) for the two scales ranged from 0.56-0.85. Cronbach's alphas ranged from 0.5-0.9. Several sub-factors correlated in the expected direction with children's objectively measured PA. The final models for encouraging and discouraging PPAPP had moderate to good fit, with moderate to excellent test-retest reliabilities. The PPAPP should be further evaluated to better assess its associations with children's PA and offers a new tool for measuring PPAPP among Latino families with preschool-aged children.
Sato, Kouichi; Takenokishin, Miki; Anzai, Sachi; Sato, Rei; Tatewaki, Miyako; Takahashi, Mikako
2003-12-01
The home drip infusion is a stressful procedure for both patients and families under HPN at home. Effective and reliable guidance for drip infusion management should be provided while patients are still at the hospital to realize reliable home care. We revised the HPN Guidance Manual and report the process in this article. We revised the manual according to the principles that "the guidance is subdivided into 3 steps so that the contents of the guidance can be adjusted for individual patients", that "a lot of photos and illustrations for frequently used 2 models are placed" and that "a video is prepared to make the most of the impact motion pictures can give". We want to use the manual in future clinical practices.
Implementation of an attack scheme on a practical QKD system
NASA Astrophysics Data System (ADS)
Lamas-Linares, Antia; Liu, Qin; Gerhardt, Ilja; Makarov, Vadim; Kurtsiefer, Christian
2010-03-01
We report on an experimental implementation of an attack of a practical quantum key distribution system [1], based on a vulnerability of single photon detectors [2]. An intercept/resend-like attack has been carried out which revealed 100% of the raw key generated between the legitimate communication partners. No increase of the error ratio was observed, which is usually considered a reliable witness for any eavesdropping attempt. We also present an experiment which shows that this attack is not revealed by key distribution protocols probing for eavesdroppers by testing a Bell inequality [3], and discuss implications for practical quantum key distribution.[4pt] [1] I. Marcikic, A. Lamas-Linares, C. Kurtsiefer, Appl. Phys. Lett. 89, 101122 (2006); [2] V. Makarov, New J. Phys. 11, 065003 (2009); [3] A. Ling et al., Phys. Rev. A 78, 020301(R), (2008)
Petróczi, Andrea; Naughton, Declan P
2007-01-01
Background Supplement use by athletes is complex and research supports the alarming notion of misinformed decisions regarding supplements. Hypothesis A frequent divergence between the type of supplements chosen by athletes and the rationale dictating the supplement use is hypothesized. Thus, a potentially dangerous incongruence may exist between rationale and practice. Testing the hypothesis In the continued absence of reliable data on supplement use, an alternative approach of studying the reasons underlying supplement use in athletes is proposed to determine whether there is an incongruence between rationale and practice. Existing data from large scale national surveys can be used to investigate this incongruence. Implications of the hypothesis In this report, analyses of distinctive patterns between the use and rationale for use of supplements among athletes are recommended to explore this potentially dangerous phenomenon. PMID:17535442
Informing Nutrition Care in the Antenatal Period: Pregnant Women's Experiences and Need for Support
Yeatman, Heather; Williamson, Moira
2017-01-01
This study aimed to provide insights into Australian women's experiences in gaining nutrition information during pregnancy. Individual semistructured telephone interviews were conducted with 17 pregnant (across all trimesters) and 9 postpartum women in five Australian states. Data were transcribed and analysed using inductive thematic analysis. Women valued nutrition information, actively sought it, and passively received it mainly from three sources: healthcare providers (HCPs), media, and their social networks. Women reported HCPs as highest for reliability but they had limited time and indifferent approaches. Various media were easily and most frequently accessed but were less reliable. Social networks were considered to be the least reliable and least accessed. Women reported becoming overwhelmed and confused. This in turn influenced their decisions (pragmatic/rational) and their eating behaviours (“overdo it,” “loosen it,” “ignore it,” and “positive response”). Individual and environmental barriers impacted their application of knowledge to dietary practice. Women wanted more constructive and interactive engagement with their HCPs. This study identified the need to establish and maintain mutually respectful environments where women feel able to raise issues with their HCPs throughout their pregnancies and where they are confident that the information they receive will be accurate and meet their needs. PMID:28890896
Patient simulation: a literary synthesis of assessment tools in anesthesiology.
Edler, Alice A; Fanning, Ruth G; Chen, Michael I; Claure, Rebecca; Almazan, Dondee; Struyk, Brain; Seiden, Samuel C
2009-12-20
High-fidelity patient simulation (HFPS) has been hypothesized as a modality for assessing competency of knowledge and skill in patient simulation, but uniform methods for HFPS performance assessment (PA) have not yet been completely achieved. Anesthesiology as a field founded the HFPS discipline and also leads in its PA. This project reviews the types, quality, and designated purpose of HFPS PA tools in anesthesiology. We used the systematic review method and systematically reviewed anesthesiology literature referenced in PubMed to assess the quality and reliability of available PA tools in HFPS. Of 412 articles identified, 50 met our inclusion criteria. Seventy seven percent of studies have been published since 2000; more recent studies demonstrated higher quality. Investigators reported a variety of test construction and validation methods. The most commonly reported test construction methods included "modified Delphi Techniques" for item selection, reliability measurement using inter-rater agreement, and intra-class correlations between test items or subtests. Modern test theory, in particular generalizability theory, was used in nine (18%) of studies. Test score validity has been addressed in multiple investigations and shown a significant improvement in reporting accuracy. However the assessment of predicative has been low across the majority of studies. Usability and practicality of testing occasions and tools was only anecdotally reported. To more completely comply with the gold standards for PA design, both shared experience of experts and recognition of test construction standards, including reliability and validity measurements, instrument piloting, rater training, and explicit identification of the purpose and proposed use of the assessment tool, are required.
Banks, Merrilyn; Hannan-Jones, Mary; Ross, Lynda; Buckley, Ann; Ellick, Jennifer; Young, Adrienne
2017-04-01
To develop and test the reliability of a Meal Quality Audit Tool (MQAT) to audit the quality of hospital meals to assist food service managers and dietitians in identifying areas for improvement. The MQAT was developed using expert opinion and was modified over time with extensive use and feedback. A phased approach was used to assess content validity and test reliability: (i) trial with 60 dietetic students, (ii) trial with 12 food service dietitians in practice and (iii) interrater reliability study. Phases 1 and 2 confirmed content validity and informed minor revision of scoring, language and formatting of the MQAT. To assess reliability of the final MQAT, eight separate meal quality audits of five identical meals were conducted over several weeks in the hospital setting. Each audit comprised an 'expert' team and four 'test' teams (dietitians, food services and ward staff). Interrater reliability was determined using intra-class correlation analysis. There was statistically significant interrater reliability for dimensions of Temperature and Accuracy (P < 0.001) but not for Appearance or Sensory. Composition of the 'test' team appeared to influence results for Appearance and Sensory, with food service-led teams scoring higher on these dimensions. 'Test' teams reported that MQAT was clear and easy to use. MQAT was found to be reliable for Temperature and Accuracy domains, with further work required to improve the reliability of the Appearance and Sensory dimensions. The systematic use of the tool, used in conjunction with patient satisfaction, could provide pertinent and useful information regarding the quality of food services and areas for improvement. © 2017 Dietitians Association of Australia.
ERIC Educational Resources Information Center
Morley, Donald D.
2012-01-01
The vast majority of the research on student evaluation of instruction has assessed the reliability of groups of courses and yielded either a single reliability coefficient for the entire group, or grouped reliability coefficients for each student evaluation of teaching (SET) item. This manuscript argues that these practices constitute a form of…
Baker, Robert A; Bronson, Shahna L; Dickinson, Timothy A; Fitzgerald, David C; Likosky, Donald S; Mellas, Nicholas B; Shann, Kenneth G
2013-09-01
One of the roles of a professional society is to develop standards and guidelines of practice as an instrument to guide safe and effective patient care. The American Society of Extracorporeal Technology (AmSECT) first published its Essentials for Perfusion Practice, Clinical Function: Conduct of Extracorporeal Circulation in 1993. The International Consortium for Evidence-Based Perfusion (ICEBP), a committee within AmSECT, was tasked with updating this document in 2010. The aim of this report is to describe the method of development and content of AmSECT's new professional standards and guidelines. The ICEBP committee independently evaluated and provided input regarding the current "Essentials and Guidelines." Structural changes were made to the entire document, and a draft document was developed, presented, and circulated to the AmSECT Board of Directors and broader membership for comment. Informed by these reviews, a revised document was then presented to the Society for a membership vote. The final document consists of 15 areas of practice covered by 50 Standards and 38 Guidelines (see Appendix 1) with the first standard focusing on the development of institutional protocols to support their implementation and use. A majority of the membership voted to accept the document (81.2% of the voting membership accepting, 18.8% rejecting). After an audit of the balloting process by AmSECT's Ethics Committee, the results were reported to the membership and the document was officially adopted on July 24, 2013. The Standards and Guidelines will serve as a useful guide for cardiac surgical teams that wish to develop institution-specific standards and guidelines to improve the reliability, safety, and effectiveness of adult cardiopulmonary bypass. The ICEBP recognizes that the development of a Standards and Guidelines statement alone will not change care. Safe, reliable, and effective care will be best served through the development and implementation of institutional protocols based on these standards. AmSECT's Standards and Guidelines for Perfusion Practice reflect the changing landscape of our profession as we work toward a safer and optimal provision of cardiopulmonary bypass for all our patients as well as a work environment that is supportive of delivering this care. standards, guidelines, cardiopulmonary bypass, perfusion, cardiac surgery.
Palmer, Clare E; Langbehn, Douglas; Tabrizi, Sarah J; Papoutsi, Marina
2017-01-01
Cognitive impairment is common amongst many neurodegenerative movement disorders such as Huntington's disease (HD) and Parkinson's disease (PD) across multiple domains. There are many tasks available to assess different aspects of this dysfunction, however, it is imperative that these show high test-retest reliability if they are to be used to track disease progression or response to treatment in patient populations. Moreover, in order to ensure effects of practice across testing sessions are not misconstrued as clinical improvement in clinical trials, tasks which are particularly vulnerable to practice effects need to be highlighted. In this study we evaluated test-retest reliability in mean performance across three testing sessions of four tasks that are commonly used to measure cognitive dysfunction associated with striatal impairment: a combined Simon Stop-Signal Task; a modified emotion recognition task; a circle tracing task; and the trail making task. Practice effects were seen between sessions 1 and 2 across all tasks for the majority of dependent variables, particularly reaction time variables; some, but not all, diminished in the third session. Good test-retest reliability across all sessions was seen for the emotion recognition, circle tracing, and trail making test. The Simon interference effect and stop-signal reaction time (SSRT) from the combined-Simon-Stop-Signal task showed moderate test-retest reliability, however, the combined SSRT interference effect showed poor test-retest reliability. Our results emphasize the need to use control groups when tracking clinical progression or use pre-baseline training on tasks susceptible to practice effects.
Palmer, S; Manns, S; Cramp, F; Lewis, R; Clark, E M
2017-12-01
The Bristol Impact of Hypermobility (BIoH) questionnaire is a patient-reported outcome measure developed in conjunction with adults with Joint Hypermobility Syndrome (JHS). It has demonstrated strong concurrent validity with the Short Form-36 (SF-36) physical component score but other psychometric properties have yet to be established. This study aimed to determine its test-retest reliability and smallest detectable change (SDC). A test-retest reliability study. Participants were recruited from the Hypermobility Syndromes Association, a patient organisation in the United Kingdom. Recruitment packs were sent to 1080 adults who had given permission to be contacted about research. BIoH and SF-36 questionnaires were administered at baseline and repeated two weeks later. An 11-point global rating of change scale (-5 to +5) was also administered at two weeks. Test-retest analysis and calculation of the SDC was conducted on 'stable' patients (defined as global rating of change -1 to +1). 462 responses were received. 233 patients reported a 'stable' condition and were included in analysis (95% women; mean (SD) age 44.5 (13.9) years; BIoH score 223.6 (54.0)). The BIoH questionnaire demonstrated excellent test-retest reliability (ICC 0.923, 95% CI 0.900-0.940). The SDC was 42 points (equivalent to 19% of the mean baseline score). The SF-36 physical and mental component scores demonstrated poorer test-retest reliability and larger SDCs (as a proportion of the mean baseline scores). The results provide further evidence of the potential of the BIoH questionnaire to underpin research and clinical practice for people with JHS. Copyright © 2017 Elsevier Ltd. All rights reserved.
Cooper, Dale J; Scammell, Brigitte E; Batt, Mark E; Palmer, Debbie
2018-01-17
The impracticalities and comparative expense of carrying out a clinical assessment is an obstacle in many large epidemiological studies. The purpose of this study was to develop and validate a series of electronic self-reported line drawing instruments based on the modified Beighton scoring system for the assessment of self-reported generalised joint hypermobility. Five sets of line drawings were created to depict the 9-point Beighton score criteria. Each instrument consisted of an explanatory question whereby participants were asked to select the line drawing which best represented their joints. Fifty participants completed the self-report online instrument on two occasions, before attending a clinical assessment. A blinded expert clinical observer then assessed participants' on two occasions, using a standardised goniometry measurement protocol. Validity of the instrument was assessed by participant-observer agreement and reliability by participant repeatability and observer repeatability using unweighted Cohen's kappa (k). Validity and reliability were assessed for each item in the self-reported instrument separately, and for the sum of the total scores. An aggregate score for generalised joint hypermobility was determined based on a Beighton score of 4 or more out of 9. Observer-repeatability between the two clinical assessments demonstrated perfect agreement (k 1.00; 95% CI 1.00, 1.00). Self-reported participant-repeatability was lower but it was still excellent (k 0.91; 95% CI 0.74, 1.00). The participant-observer agreement was excellent (k 0.96; 95% CI 0.87, 1.00). Validity was excellent for the self-report instrument, with a good sensitivity of 0.87 (95% CI 0.81, 0.91) and excellent specificity of 0.99 (95% CI 0.98, 1.00). The self-reported instrument provides a valid and reliable assessment of the presence of generalised joint hypermobility and may have practical use in epidemiological studies.
The Comprehensive Snack Parenting Questionnaire (CSPQ): Development and Test-Retest Reliability.
Gevers, Dorus W M; Kremers, Stef P J; de Vries, Nanne K; van Assema, Patricia
2018-04-26
The narrow focus of existing food parenting instruments led us to develop a food parenting practices instrument measuring the full range of food practices constructs with a focus on snacking behavior. We present the development of the questionnaire and our research on the test-retest reliability. The developed Comprehensive Snack Parenting Questionnaire (CSPQ) covers 21 constructs. Test-retest reliability was assessed by calculating intra class correlation coefficients and percentage agreement after two administrations of the CSPQ among a sample of 66 Dutch parents. Test-retest reliability analysis revealed acceptable intra class correlation coefficients (≥0.41) or agreement scores (≥0.60) for all items. These results, together with earlier work, suggest sufficient psychometric characteristics. The comprehensive, but brief CSPQ opens up chances for highly essential but unstudied research questions to understand and predict children’s snack intake. Example applications include studying the interactional nature of food parenting practices or interactions of food parenting with general parenting or child characteristics.
ERIC Educational Resources Information Center
Riches, Vivienne C.; Parmenter, Trevor R.; Llewellyn, Gwynnyth; Hindmarsh, Gabrielle; Chan, Jeff
2009-01-01
Background: There is an urgent need for developing reliable, valid and practical instruments that assess and classify the support needed by persons with disability to function in their chosen living, working and social environments. I-CAN is an instrument that addresses the frequency and level of support needed (not individual skills or deficits)…
Grigore, Bogdan; Peters, Jaime; Hyde, Christopher; Stein, Ken
2013-11-01
Elicitation is a technique that can be used to obtain probability distribution from experts about unknown quantities. We conducted a methodology review of reports where probability distributions had been elicited from experts to be used in model-based health technology assessments. Databases including MEDLINE, EMBASE and the CRD database were searched from inception to April 2013. Reference lists were checked and citation mapping was also used. Studies describing their approach to the elicitation of probability distributions were included. Data was abstracted on pre-defined aspects of the elicitation technique. Reports were critically appraised on their consideration of the validity, reliability and feasibility of the elicitation exercise. Fourteen articles were included. Across these studies, the most marked features were heterogeneity in elicitation approach and failure to report key aspects of the elicitation method. The most frequently used approaches to elicitation were the histogram technique and the bisection method. Only three papers explicitly considered the validity, reliability and feasibility of the elicitation exercises. Judged by the studies identified in the review, reports of expert elicitation are insufficient in detail and this impacts on the perceived usability of expert-elicited probability distributions. In this context, the wider credibility of elicitation will only be improved by better reporting and greater standardisation of approach. Until then, the advantage of eliciting probability distributions from experts may be lost.
Du, Xin; Zhang, Xinping; Tang, Yuqing; Zhang, Xiaopeng; Wang, Lijun; Wang, Xuan
2015-10-01
Public reporting of performance data is one of the most popular topics in health care field. The aim of this study was to investigate the transparency mechanism, that is, how public reporting influenced general practitioners' (GPs) prescribing practices. GPs who had the license to prescribe medicine of all 10 primary care institutions were surveyed. Data were collected by an instrument, which exhibited satisfactory reliability and validity (Cronbach's alpha > 0.7; average variance extracted > 0.5; composite reliability > 0.7). Data analysis was conducted by structural equation model. The results showed that GPs' perceived value (GP's overall assessment of the worth of the public reporting) and attitude (the psychological reaction to public reporting) had a significantly direct effect on behavioural intention (r = 0.28; r = 0.36), and were affected by information accessibility (r = 0.63; r = 0.32). Attitude had a significant effect on perceived value(r = 0.45). Perceived risk (the perceptions of the possible loss due to public reporting, e.g. decreasing their income) did not have a significant relationship with information accessibility, attitude and behavioural intention(r = -0.09; r = 0.01; r = -0.07). The information accessibility, perceived value and attitude have strong effects on prescribing practices of GPs, whereas perceived risk did not play a role in influencing the prescribing practices. Policymakers need to improve the accessibility of prescription quality indicators and pay attention to the perceived values and attitudes of GPs. Policymakers also need to strengthen the risk education of GPs and attach incentives to transparent regulation. © 2015 John Wiley & Sons, Ltd.
Lorencatto, Fabiana; West, Robert; Seymour, Natalie; Michie, Susan
2013-06-01
There is a difference between interventions as planned and as delivered in practice. Unless we know what was actually delivered, we cannot understand "what worked" in effective interventions. This study aimed to (a) assess whether an established taxonomy of 53 smoking cessation behavior change techniques (BCTs) may be applied or adapted as a method for reliably specifying the content of smoking cessation behavioral support consultations and (b) develop an effective method for training researchers and practitioners in the reliable application of the taxonomy. Fifteen transcripts of audio-recorded consultations delivered by England's Stop Smoking Services were coded into component BCTs using the taxonomy. Interrater reliability and potential adaptations to the taxonomy to improve coding were discussed following 3 coding waves. A coding training manual was developed through expert consensus and piloted on 10 trainees, assessing coding reliability and self-perceived competence before and after training. An average of 33 BCTs from the taxonomy were identified at least once across sessions and coding waves. Consultations contained on average 12 BCTs (range = 8-31). Average interrater reliability was high (88% agreement). The taxonomy was adapted to simplify coding by merging co-occurring BCTs and refining BCT definitions. Coding reliability and self-perceived competence significantly improved posttraining for all trainees. It is possible to apply a taxonomy to reliably identify and classify BCTs in smoking cessation behavioral support delivered in practice, and train inexperienced coders to do so reliably. This method can be used to investigate variability in provision of behavioral support across services, monitor fidelity of delivery, and identify training needs.
Design Optimization Method for Composite Components Based on Moment Reliability-Sensitivity Criteria
NASA Astrophysics Data System (ADS)
Sun, Zhigang; Wang, Changxi; Niu, Xuming; Song, Yingdong
2017-08-01
In this paper, a Reliability-Sensitivity Based Design Optimization (RSBDO) methodology for the design of the ceramic matrix composites (CMCs) components has been proposed. A practical and efficient method for reliability analysis and sensitivity analysis of complex components with arbitrary distribution parameters are investigated by using the perturbation method, the respond surface method, the Edgeworth series and the sensitivity analysis approach. The RSBDO methodology is then established by incorporating sensitivity calculation model into RBDO methodology. Finally, the proposed RSBDO methodology is applied to the design of the CMCs components. By comparing with Monte Carlo simulation, the numerical results demonstrate that the proposed methodology provides an accurate, convergent and computationally efficient method for reliability-analysis based finite element modeling engineering practice.
Portfolio Assessment: Increasing Reliability and Validity.
ERIC Educational Resources Information Center
Griffee, Dale
2002-01-01
Addresses the traditional understanding of reliability as it pertains to writing portfolio assessments. Offers a list of practical actions that can be taken to increase assessment reliability, including explicit definitions of what a portfolio holds, rater training, rater burnout, and consistent rating procedures. (Contains 26 references.) (NB)
Manual and automatic locomotion scoring systems in dairy cows: a review.
Schlageter-Tello, Andrés; Bokkers, Eddie A M; Koerkamp, Peter W G Groot; Van Hertem, Tom; Viazzi, Stefano; Romanini, Carlos E B; Halachmi, Ilan; Bahr, Claudia; Berckmans, Daniël; Lokhorst, Kees
2014-09-01
The objective of this review was to describe, compare and evaluate agreement, reliability, and validity of manual and automatic locomotion scoring systems (MLSSs and ALSSs, respectively) used in dairy cattle lameness research. There are many different types of MLSSs and ALSSs. Twenty-five MLSSs were found in 244 articles. MLSSs use different types of scale (ordinal or continuous) and different gait and posture traits need to be observed. The most used MLSS (used in 28% of the references) is based on asymmetric gait, reluctance to bear weight, and arched back, and is scored on a five-level scale. Fifteen ALSSs were found that could be categorized according to three approaches: (a) the kinetic approach measures forces involved in locomotion, (b) the kinematic approach measures time and distance of variables associated to limb movement and some specific posture variables, and (c) the indirect approach uses behavioural variables or production variables as indicators for impaired locomotion. Agreement and reliability estimates were scarcely reported in articles related to MLSSs. When reported, inappropriate statistical methods such as PABAK and Pearson and Spearman correlation coefficients were commonly used. Some of the most frequently used MLSSs were poorly evaluated for agreement and reliability. Agreement and reliability estimates for the original four-, five- or nine-level MLSS, expressed in percentage of agreement, kappa and weighted kappa, showed large ranges among and sometimes also within articles. After the transformation into a two-level scale, agreement and reliability estimates showed acceptable estimates (percentage of agreement ≥ 75%; kappa and weighted kappa ≥ 0.6), but still estimates showed a large variation between articles. Agreement and reliability estimates for ALSSs were not reported in any article. Several ALSSs use MLSSs as a reference for model calibration and validation. However, varying agreement and reliability estimates of MLSSs make a clear definition of a lameness case difficult, and thus affect the validity of ALSSs. MLSSs and ALSSs showed limited validity for hoof lesion detection and pain assessment. The utilization of MLSSs and ALSSs should aim to the prevention and efficient management of conditions that induce impaired locomotion. Long-term studies comparing MLSSs and ALSSs while applying various strategies to detect and control unfavourable conditions leading to impaired locomotion are required to determine the usefulness of MLSSs and ALSSs for securing optimal production and animal welfare in practice. Copyright © 2014 Elsevier B.V. All rights reserved.
Hartling, L; Wittmeier, K D M; Caldwell, P H; van der Lee, J H; Klassen, T P; Craig, J C; Offringa, M
2011-11-01
Standards for Research in (StaR) Child Health was founded in 2009 to address the paucity and shortcomings of pediatric clinical trials. This initiative involves international experts who are dedicated to developing practical, evidence-based standards to enhance the reliability and relevance of pediatric clinical research. Through a systematic "knowledge to action" plan, StaR Child Health will make efforts to improve and expand the evidence base for child health across the world.
2016-11-14
the best practices in project management related to the establishment of goals and time frames.9 We obtained documentary and testimonial evidence...have created new incentives for DOD to assess its approach for managing the commissaries and exchanges. Congress identified $1.3 billion in fiscal...have relevant and reliable information to help management make informed decisions.8 We also reviewed relevant information in the report compared with
Baert, Isabel A C; Lluch, Enrique; Struyf, Thomas; Peeters, Greta; Van Oosterwijck, Sophie; Tuynman, Joanna; Rufai, Salim; Struyf, Filip
2018-06-01
The therapeutic value of proprioceptive-based exercises in knee osteoarthritis (KOA) management warrants investigation of proprioceptive testing methods easily accessible in clinical practice. To estimate inter- and intrarater reliability of the knee joint position sense (KJPS) test and knee force sense (KFS) test in subjects with and without KOA. Cross-sectional test-retest design. Two blinded raters performed independently repeated measures of the KJPS and KFS test, using an analogue inclinometer and handheld dynamometer, respectively, in eight KOA patients (12 symptomatic knees) and 26 healthy controls (52 asymptomatic knees). Intraclass correlation coefficients (ICCs; model 2,1), standard error of measurement (SEM) and minimal detectable change with 95% confidence bounds (MDC 95 ) were calculated. For KJPS, results showed good to excellent test-retest agreement (ICCs 0.70-0.95 in KOA patients; ICCs 0.65-0.85 in healthy controls). A 2° measurement error (SEM 1°) was reported when measuring KJPS in multiple test positions and calculating mean repositioning error. Testing KOA patients pre and post therapy a repositioning error larger than 4° (MDC 95 ) is needed to consider true change. Measuring KFS using handheld dynamometry showed poor to fair interrater and poor to excellent intrarater reliability in subjects with and without KOA. Measuring KJPS in multiple test positions using an analogue inclinometer and calculating mean repositioning error is reliable and can be used in clinical practice. We do not recommend the use of the KFS test to clinicians. Further research is required to establish diagnostic accuracy and validity of our KJPS test in larger knee pain populations. Copyright © 2017 Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Yu, Zheng
2002-08-01
Facing the new demands of the optical fiber communications market, almost all the performance and reliability of optical network system are dependent on the qualification of the fiber optics components. So, how to comply with the system requirements, the Telcordia / Bellcore reliability and high-power testing has become the key issue for the fiber optics components manufacturers. The qualification of Telcordia / Bellcore reliability or high-power testing is a crucial issue for the manufacturers. It is relating to who is the outstanding one in the intense competition market. These testing also need maintenances and optimizations. Now, work on the reliability and high-power testing have become the new demands in the market. The way is needed to get the 'Triple-Win' goal expected by the component-makers, the reliability-testers and the system-users. To those who are meeting practical problems for the testing, there are following seven topics that deal with how to shoot the common mistakes to perform qualify reliability and high-power testing: ¸ Qualification maintenance requirements for the reliability testing ¸ Lots control for preparing the reliability testing ¸ Sampling select per the reliability testing ¸ Interim measurements during the reliability testing ¸ Basic referencing factors relating to the high-power testing ¸ Necessity of re-qualification testing for the changing of producing ¸ Understanding the similarity for product family by the definitions
System and Software Reliability (C103)
NASA Technical Reports Server (NTRS)
Wallace, Dolores
2003-01-01
Within the last decade better reliability models (hardware. software, system) than those currently used have been theorized and developed but not implemented in practice. Previous research on software reliability has shown that while some existing software reliability models are practical, they are no accurate enough. New paradigms of development (e.g. OO) have appeared and associated reliability models have been proposed posed but not investigated. Hardware models have been extensively investigated but not integrated into a system framework. System reliability modeling is the weakest of the three. NASA engineers need better methods and tools to demonstrate that the products meet NASA requirements for reliability measurement. For the new models for the software component of the last decade, there is a great need to bring them into a form that they can be used on software intensive systems. The Statistical Modeling and Estimation of Reliability Functions for Systems (SMERFS'3) tool is an existing vehicle that may be used to incorporate these new modeling advances. Adapting some existing software reliability modeling changes to accommodate major changes in software development technology may also show substantial improvement in prediction accuracy. With some additional research, the next step is to identify and investigate system reliability. System reliability models could then be incorporated in a tool such as SMERFS'3. This tool with better models would greatly add value in assess in GSFC projects.
Mayo-Wilson, Evan; Ng, Sueko Matsumura; Chuck, Roy S; Li, Tianjing
2017-09-05
Systematic reviews should inform American Academy of Ophthalmology (AAO) Preferred Practice Pattern® (PPP) guidelines. The quality of systematic reviews related to the forthcoming Preferred Practice Pattern® guideline (PPP) Refractive Errors & Refractive Surgery is unknown. We sought to identify reliable systematic reviews to assist the AAO Refractive Errors & Refractive Surgery PPP. Systematic reviews were eligible if they evaluated the effectiveness or safety of interventions included in the 2012 PPP Refractive Errors & Refractive Surgery. To identify potentially eligible systematic reviews, we searched the Cochrane Eyes and Vision United States Satellite database of systematic reviews. Two authors identified eligible reviews and abstracted information about the characteristics and quality of the reviews independently using the Systematic Review Data Repository. We classified systematic reviews as "reliable" when they (1) defined criteria for the selection of studies, (2) conducted comprehensive literature searches for eligible studies, (3) assessed the methodological quality (risk of bias) of the included studies, (4) used appropriate methods for meta-analyses (which we assessed only when meta-analyses were reported), (5) presented conclusions that were supported by the evidence provided in the review. We identified 124 systematic reviews related to refractive error; 39 met our eligibility criteria, of which we classified 11 to be reliable. Systematic reviews classified as unreliable did not define the criteria for selecting studies (5; 13%), did not assess methodological rigor (10; 26%), did not conduct comprehensive searches (17; 44%), or used inappropriate quantitative methods (3; 8%). The 11 reliable reviews were published between 2002 and 2016. They included 0 to 23 studies (median = 9) and analyzed 0 to 4696 participants (median = 666). Seven reliable reviews (64%) assessed surgical interventions. Most systematic reviews of interventions for refractive error are low methodological quality. Following widely accepted guidance, such as Cochrane or Institute of Medicine standards for conducting systematic reviews, would contribute to improved patient care and inform future research.
FANTASTIC Lifestyle Assessment: Part 5 Measuring Lifestyle in Family Practice
Kason, Yvonne; Ylanko, Veli J.
1984-01-01
Family physicians generally agree that they should play an active role in disease prevention and health promotion. However, until recently no valid and reliable tool was available to help physicians clinically assess patients' lifestyle. The authors have studied the validity and reliability of a new five point-scale version of the FANTASTIC Lifestyle Assessment, used in a family practice. Also, the authors polled their patients on their opinions of their doctor assessing lifestyle. They found that the FANTASTIC was a reliable instrument, which their patients thought was useful and appropriate for their physician to be using. PMID:21279064
Enhancing nurses' ethical practice: development of a clinical ethics program.
McDaniel, C
1998-06-01
There is increasing attention paid to ethics under managed care; however, few clinical-based ethics programs are reported. This paper reports the assessment and outcomes of one such program. A quasi-experimental research design with t-tests is used to assess the outcome differences between participants and control groups. There are twenty nurses in each; they are assessed for comparability. Differences are predicted on two outcomes using reliable and valid measures: nurses' time with their patients in ethics discussions, and nurses' opinions regarding their clinical ethics environments. Results reveal a statistically significant difference (p <.05) between the two groups, with modest positive change in the participants. Additional exploratory analyses are reported on variables influential in health care services.
MEMS Reliability Assurance Activities at JPL
NASA Technical Reports Server (NTRS)
Kayali, S.; Lawton, R.; Stark, B.
2000-01-01
An overview of Microelectromechanical Systems (MEMS) reliability assurance and qualification activities at JPL is presented along with the a discussion of characterization of MEMS structures implemented on single crystal silicon, polycrystalline silicon, CMOS, and LIGA processes. Additionally, common failure modes and mechanisms affecting MEMS structures, including radiation effects, are discussed. Common reliability and qualification practices contained in the MEMS Reliability Assurance Guideline are also presented.
Reliability and Availability Evaluation Program Manual.
1982-11-01
research and development. The manual’s purpose was to provide a practical method for making reliability measurements, measurements directly related to... Research , Development, Test and Evaluation. RMA Reliability, Maintainability and Availability. R&R Repair and Refurbishment, Repair and Replacement, etc...length. phenomena such as mechanical wear and A number of researchers in the reliability chemical deterioration. Maintenance should field 14-pages 402
Fernandes, Marcelo José; Ruta, Danny Adolph; Ogden, Graham Richard; Pitts, Nigel Berry; Ogston, Simon Alexander
2006-02-01
To validate the Oral Health Impact Profile (OHIP)-14 in a sample of patients attending general dental practice. Patients with pathology-free impacted wisdom teeth were recruited from six general dental practices in Tayside, Scotland, and followed for a year to assess the development of problems related to impaction. The OHIP-14 was completed at baseline and at 1-year follow-up, and analysed using three different scoring methods: a summary score, a weighted and standardized score and the total number of problems reported. Instrument reliability was measured by assessing internal consistency and test-retest reliability. Construct validity was assessed using a number of variables. Linear regression was then used to model the relationship between OHIP-14 and all significantly correlated variables. Responsiveness was measured using the standardized response mean (SRM). Adjusted R(2)s and SRMs were calculated for each of the three scoring methods. Estimates for the differences between adjusted R(2)s and the differences between SRMs were obtained with 95% confidence intervals. A total of 278 and 169 patients completed the questionnaire at baseline and follow-up, respectively. Reliability - Cronbach's alpha coefficients ranged from 0.30 to 0.75. Alpha coefficients for all 14 items were 0.88 and 0.87 for baseline and follow-up, respectively. Test-retest coefficients ranged from 0.72 to 0.78. Validity - OHIP-14 scores were significantly correlated with number of teeth, education, main activity, the use of mouthwash, frequency of seeing a dentist, the reason for the last dental appointment, smoking, alcohol intake, pain and symptoms. Adjusted R(2)s ranged from 0.123 to 0.202 and there were no statistically significant differences between those for the three different scoring methods. Responsiveness - The SRMs ranged from 0.37 to 0.56 and there was a statistically significant difference between the summary scores method and the total number of problems method for symptomatic patients. The OHIP-14 is a valid and reliable measure of oral health-related quality of life in general dental practice and is responsive to third molar clinical change. The summary score method demonstrated performance as good as, or better than, the other methods studied.
77 FR 69615 - Commission Information Collection Activities (FERC-715); Comment Request
Federal Register 2010, 2011, 2012, 2013, 2014
2012-11-20
... transmission planning; A detailed description of the transmission planning reliability criteria used to..., but not limited to, how reliability criteria are applied and the steps taken in [[Page 69616... performance as measured against its stated reliability criteria using its stated assessment practices. The...
Behrangrad, Shabnam; Kordi Yoosefinejad, Amin
2018-03-01
The purpose of this study is to investigate the validity and reliability of the Persian version of the Multidimensional Assessment of Fatigue Scale (MAFS) in an Iranian population with multiple sclerosis. A self-reported survey on fatigue including the MAFS, Fatigue Impact Scale and demographic measures was completed by 130 patients with multiple sclerosis and 60 healthy persons sampled with a convenience method. Test-retest reliability and validity were evaluated 3 days apart. Construct validity of the MAFS was assessed with the Fatigue Impact Scale. The MAFS had high internal consistency (Cronbach's alpha >0.9) and 3-d test-retest reliability (intraclass correlation coefficient = 0.99). Correlation between the Fatigue Impact Scale and MAFS was high (r = 0.99). Correlation between MAFS scores and the Expanded Disability Status Scale was also strong (r = 0.85). Questionnaire items showed acceptable item-scale correlation (0.968-0.993). The Persian version of the MAFS appears to be a valid and reliable questionnaire. It is an appropriate short multidimensional instrument to assess fatigue in patients with multiple sclerosis in clinical practice and research. Implications for Rehabilitation The Persian version of Multidimensional Assessment of Fatigue is a valid and reliable instrument for the assessment and monitoring the fatigue in Persian-language patients with multiple sclerosis. It is very easy to administer and a time efficient scale in comparison to other instruments evaluating fatigue in patients with multiple sclerosis.
Adnan, Mehnaz; Peterkin, Donald; Lopez, Liza; Mackereth, Graham
2017-02-01
Electronic reporting of Influenza-like illness (eILI) from primary care was implemented and evaluated in three general medical practices in New Zealand during May to September 2015. To measure the uptake of eILI and to identify the system's strength and limitations. Analysis of transactional data from the eILI system; comparative study of influenza-like illness cases reported using manual methods and eILI; questionnaire administered to clinical and operational stakeholders. Over the study period 66% of total ILI cases were reported using eILI. Reporting timeliness improved significantly compared to manual reporting with an average of 24 minutes from submission by the clinician to processing in the national database. Users found the system to be user-friendly. eILI assists clinicians to report ILI cases to public health authorities within a stipulated time period and is associated with faster, more reliable and improved information transfer.
Essential elements of the nursing practice environment in nursing homes: Psychometric evaluation.
de Brouwer, Brigitte Johanna Maria; Kaljouw, Marian J; Schoonhoven, Lisette; van Achterberg, Theo
2017-06-01
To develop and psychometrically test the Essentials of Magnetism II in nursing homes. Increasing numbers and complex needs of older people in nursing homes strain the nursing workforce. Fewer adequately trained staff and increased care complexity raise concerns about declining quality. Nurses' practice environment has been reported to affect quality of care and productivity. The Essentials of Magnetism II © measures processes and relationships of practice environments that contribute to productivity and quality of care and can therefore be useful in identifying processes requiring change to pursue excellent practice environments. However, this instrument was not explicitly evaluated for its use in nursing home settings so far. In a preparatory phase, a cross-sectional survey study focused on face validity of the essentials of magnetism in nursing homes. A second cross-sectional survey design was then used to further test the instrument's validity and reliability. Psychometric testing included evaluation of content and construct validity, and reliability. Nurses (N = 456) working at 44 units of three nursing homes were included. Respondent acceptance, relevance and clarity were adequate. Five of the eight subscales and 54 of the 58 items did meet preset psychometric criteria. All essentials of magnetism are considered relevant for nursing homes. The subscales Adequacy of Staffing, Clinically Competent Peers, Patient Centered Culture, Autonomy and Nurse Manager Support can be used in nursing homes without problems. The other subscales cannot be directly applied to this setting. The valid subscales of the Essentials of Magnetism II instrument can be used to design excellent nursing practice environments that support nurses' delivery of care. Before using the entire instrument, however, the other subscales have to be improved. © 2016 John Wiley & Sons Ltd.
Wang, M; Petrini, M A; Guan, Q
2015-05-01
Family-centred service (FCS) has become essential to parents of children with cerebral palsy (CP) and professionals in Chinese paediatric rehabilitation services. FCS practice meets the unique needs of the child and family, through facilitation of optimal service provision delivered by professionals, and ensures service systems to be flexible, appropriate and actively responsive to the family needs. Parents used the Measure of Processes of Care 20 (MPOC-20) questionnaire to evaluate and verify the efficacy of use in China. The aims of the present study were twofold: to assess the validity and reliability of the Chinese MPOC-20, and investigate the range of parents' satisfaction with service provision in an FCS practice using the MPOC-20. The Chinese MPOC-20 was selected to assess parent satisfaction with service provision of professionals in FCS practice. Participants were parents of children under 8 years of age with CP, who had received rehabilitation services between May 2012 and May 2013, and were receiving rehabilitation services in May 2013 at a hospital outpatient department and a rehabilitation centre. The reliability and validity of the Chinese MPOC-20 were confirmed. Parents evaluated FCS practice with the MPOC-20 survey. Respectful and supportive care was rated with the highest score and providing general information the lowest. Parents according to the data were dissatisfied with the lack of information. Parents fairly evaluated service provision of professionals in FCS practice with the Chinese MPOC-20. Professionals received feedback reports of parents, summaries of the inadequacy of service delivery, and developed and implemented ameliorated measures in the FCS policy to strive to provide exemplary service. © 2014 John Wiley & Sons Ltd.
Monitoring occupational exposure to cancer chemotherapy drugs
NASA Technical Reports Server (NTRS)
Baker, E. S.; Connor, T. H.
1996-01-01
Reports of the health effects of handling cytotoxic drugs and compliance with guidelines for handling these agents are briefly reviewed, and studies using analytical and biological methods of detecting exposure are evaluated. There is little conclusive evidence of detrimental health effects from occupational exposure to cytotoxic drugs. Work practices have improved since the issuance of guidelines for handling these drugs, but compliance with the recommended practices is still inadequate. Of 64 reports published since 1979 on studies of workers' exposure to these drugs, 53 involved studies of changes in cellular or molecular endpoints (biological markers) and 12 described chemical analyses of drugs or their metabolites in urine (2 involved both, and 2 reported the same study). The primary biological markers used were urine mutagenicity, sister chromatid exchange, and chromosomal aberrations; other studies involved formation of micronuclei and measurements of urinary thioethers. The studies had small sample sizes, and the methods were qualitative, nonspecific, subject to many confounders, and possibly not sensitive enough to detect most occupational exposures. Since none of the currently available biological and analytical methods is sufficiently reliable or reproducible for routine monitoring of exposure in the workplace, further studies using these methods are not recommended; efforts should focus instead on wide-spread implementation of improved practices for handling cytotoxic drugs.
Quality Factor Effect on the Wireless Range of Microstrip Patch Antenna Strain Sensors
Daliri, Ali; Galehdar, Amir; Rowe, Wayne S. T.; John, Sabu; Wang, Chun H.; Ghorbani, Kamran
2014-01-01
Recently introduced passive wireless strain sensors based on microstrip patch antennas have shown great potential for reliable health and usage monitoring in aerospace and civil industries. However, the wireless interrogation range of these sensors is limited to few centimeters, which restricts their practical application. This paper presents an investigation on the effect of circular microstrip patch antenna (CMPA) design on the quality factor and the maximum practical wireless reading range of the sensor. The results reveal that by using appropriate substrate materials the interrogation distance of the CMPA sensor can be increased four-fold, from the previously reported 5 to 20 cm, thus improving considerably the viability of this type of wireless sensors for strain measurement and damage detection. PMID:24451457
Ingham, Roger J
2007-07-01
This letter is a response to a recent report by J. S. Yaruss, C. Coleman, and D. Hammer (2006) that described a treatment program for preschool children who stutter. Problems with the Yaruss et al. study fall into four domains: (a) failure to provide clinicians with replicable procedures, (b) failure to collect valid and reliable speech performance data, (c) failure to control for predictable improvement in children who have been stuttering for less than 15 months, and (d) the advocacy of procedures for which there is no credible research evidence. The claims made for the efficacy of this treatment are problematic and essentially violate the principles of evidence-based practice as recommended by the American Speech-Language-Hearing Association (ASHA).
Quality factor effect on the wireless range of microstrip patch antenna strain sensors.
Daliri, Ali; Galehdar, Amir; Rowe, Wayne S T; John, Sabu; Wang, Chun H; Ghorbani, Kamran
2014-01-02
Recently introduced passive wireless strain sensors based on microstrip patch antennas have shown great potential for reliable health and usage monitoring in aerospace and civil industries. However, the wireless interrogation range of these sensors is limited to few centimeters, which restricts their practical application. This paper presents an investigation on the effect of circular microstrip patch antenna (CMPA) design on the quality factor and the maximum practical wireless reading range of the sensor. The results reveal that by using appropriate substrate materials the interrogation distance of the CMPA sensor can be increased four-fold, from the previously reported 5 to 20 cm, thus improving considerably the viability of this type of wireless sensors for strain measurement and damage detection.
Støre-Valen, Jakob; Ryum, Truls; Pedersen, Geir A F; Pripp, Are H; Jose, Paul E; Karterud, Sigmund
2015-09-01
The Global Assessment of Functioning (GAF) Scale is used in routine clinical practice and research to estimate symptom and functional severity and longitudinal change. Concerns about poor interrater reliability have been raised, and the present study evaluated the effect of a Web-based GAF training program designed to improve interrater reliability in routine clinical practice. Clinicians rated up to 20 vignettes online, and received deviation scores as immediate feedback (i.e., own scores compared with expert raters) after each rating. Growth curves of absolute SD scores across the vignettes were modeled. A linear mixed effects model, using the clinician's deviation scores from expert raters as the dependent variable, indicated an improvement in reliability during training. Moderation by content of scale (symptoms; functioning), scale range (average; extreme), previous experience with GAF rating, profession, and postgraduate training were assessed. Training reduced deviation scores for inexperienced GAF raters, for individuals in clinical professions other than nursing and medicine, and for individuals with no postgraduate specialization. In addition, training was most beneficial for cases with average severity of symptoms compared with cases with extreme severity. The results support the use of Web-based training with feedback routines as a means to improve the reliability of GAF ratings performed by clinicians in mental health practice. These results especially pertain to clinicians in mental health practice who do not have a masters or doctoral degree. (c) 2015 APA, all rights reserved.
ERIC Educational Resources Information Center
Weber, Deborah A.
Greater understanding and use of confidence intervals is central to changes in statistical practice (G. Cumming and S. Finch, 2001). Reliability coefficients and confidence intervals for reliability coefficients can be computed using a variety of methods. Estimating confidence intervals includes both central and noncentral distribution approaches.…
77 FR 53877 - Commission Information Collection Activities (FERC-715); Comment Request; Extension
Federal Register 2010, 2011, 2012, 2013, 2014
2012-09-04
...; A detailed description of the transmission planning reliability criteria used to evaluate system... reliability criteria are applied and the steps taken in performing transmission planning studies); and A... reliability criteria using its stated assessment practices. The FERC-715 enables the Commission to use the...
Tamuz, Michal; Harrison, Michael I
2006-01-01
Objective To identify the distinctive contributions of high-reliability theory (HRT) and normal accident theory (NAT) as frameworks for examining five patient safety practices. Data Sources/Study Setting We reviewed and drew examples from studies of organization theory and health services research. Study Design After highlighting key differences between HRT and NAT, we applied the frames to five popular safety practices: double-checking medications, crew resource management (CRM), computerized physician order entry (CPOE), incident reporting, and root cause analysis (RCA). Principal Findings HRT highlights how double checking, which is designed to prevent errors, can undermine mindfulness of risk. NAT emphasizes that social redundancy can diffuse and reduce responsibility for locating mistakes. CRM promotes high reliability organizations by fostering deference to expertise, rather than rank. However, HRT also suggests that effective CRM depends on fundamental changes in organizational culture. NAT directs attention to an underinvestigated feature of CPOE: it tightens the coupling of the medication ordering process, and tight coupling increases the chances of a rapid and hard-to-contain spread of infrequent, but harmful errors. Conclusions Each frame can make a valuable contribution to improving patient safety. By applying the HRT and NAT frames, health care researchers and administrators can identify health care settings in which new and existing patient safety interventions are likely to be effective. Furthermore, they can learn how to improve patient safety, not only from analyzing mishaps, but also by studying the organizational consequences of implementing safety measures. PMID:16898984
Rozen, Warren Matthew; Spychal, Robert T.; Hunter-Smith, David J.
2016-01-01
Background Accurate volumetric analysis is an essential component of preoperative planning in both reconstructive and aesthetic breast procedures towards achieving symmetrization and patient-satisfactory outcome. Numerous comparative studies and reviews of individual techniques have been reported. However, a unifying review of all techniques comparing their accuracy, reliability, and practicality has been lacking. Methods A review of the published English literature dating from 1950 to 2015 using databases, such as PubMed, Medline, Web of Science, and EMBASE, was undertaken. Results Since Bouman’s first description of water displacement method, a range of volumetric assessment techniques have been described: thermoplastic casting, direct anthropomorphic measurement, two-dimensional (2D) imaging, and computed tomography (CT)/magnetic resonance imaging (MRI) scans. However, most have been unreliable, difficult to execute and demonstrate limited practicability. Introduction of 3D surface imaging has revolutionized the field due to its ease of use, fast speed, accuracy, and reliability. However, its widespread use has been limited by its high cost and lack of high level of evidence. Recent developments have unveiled the first web-based 3D surface imaging program, 4D imaging, and 3D printing. Conclusions Despite its importance, an accurate, reliable, and simple breast volumetric analysis tool has been elusive until the introduction of 3D surface imaging technology. However, its high cost has limited its wide usage. Novel adjunct technologies, such as web-based 3D surface imaging program, 4D imaging, and 3D printing, appear promising. PMID:27047788
Falaggis, Konstantinos; Towers, David P; Towers, Catherine E
2012-09-20
Multiwavelength interferometry (MWI) is a well established technique in the field of optical metrology. Previously, we have reported a theoretical analysis of the method of excess fractions that describes the mutual dependence of unambiguous measurement range, reliability, and the measurement wavelengths. In this paper wavelength, selection strategies are introduced that are built on the theoretical description and maximize the reliability in the calculated fringe order for a given measurement range, number of wavelengths, and level of phase noise. Practical implementation issues for an MWI interferometer are analyzed theoretically. It is shown that dispersion compensation is best implemented by use of reference measurements around absolute zero in the interferometer. Furthermore, the effects of wavelength uncertainty allow the ultimate performance of an MWI interferometer to be estimated.
Awadh, Ammar Ihsan; Hassali, Mohamed Azmi; Al-lela, Omer Qutaiba; Bux, Siti Halimah; Elkalmi, Ramadan M; Hadi, Hazrina
2014-10-27
Parents are the main decision makers for their children vaccinations. This fact makes parents' immunization knowledge and practices as predictor factors for immunization uptake and timeliness. The aim of this pilot study was to develop a reliable and valid instrument in Malaysian language to measure immunization knowledge and practice (KP) of Malaysian parents. A cross-sectional prospective pilot survey was conducted among 88 Malaysian parents who attended public health facilities that provide vaccinations. Translated immunization KP questionnaires (Bahasa Melayu version) were used. Descriptive statistics were applied, face and content validity were assessed, and internal consistency, test-retest reliability, and construct validity were determined. The mean ± standard deviation (SD) of the knowledge scores was 7.36 ± 2.29 and for practice scores was 7.13 ± 2.20. Good internal consistency was found for knowledge and practice items (Cronbach's alpha = 0.757 and 0.743 respectively); the test-retest reliability value was 0.740 (p = 0.014). A panel of three specialist pharmacists who are experts in this field judged the face and content validity of the final questionnaire. Parents with up-to-date immunized children had significantly better knowledge and practice scores than parents who did not (p < 0.001 and p = 0.001 respectively), suggesting a good construct validity. A significant difference was found in knowledge and practice scores among parents' age (p = 0.006 and p = 0.029 respectively) and place of living (p = 0.037 and p = 0.043). The parents' knowledge level was positively associated with their practice toward immunization (Spearman's rank correlation coefficient 0.310, p = 0.003). The pilot study concluded that the Bahasa Melayu version of the immunization KP questionnaire has good reliability and validity for measuring the knowledge and practices of Malaysian parents and therefore this version can be used in future research.
Memorial Hermann: high reliability from board to bedside.
Shabot, M Michael; Monroe, Douglas; Inurria, Juan; Garbade, Debbi; France, Anne-Claire
2013-06-01
In 2006 the Memorial Hermann Health System (MHHS), which includes 12 hospitals, began applying principles embraced by high reliability organizations (HROs). Three factors support its HRO journey: (1) aligned organizational structure with transparent management systems and compressed reporting processes; (2) Robust Process Improvement (RPI) with high-reliability interventions; and (3) cultural establishment, sustainment, and evolution. The Quality and Safety strategic plan contains three domains, each with a specific set of measures that provide goals for performance: (1) "Clinical Excellence;" (2) "Do No Harm;" and (3) "Saving Lives," as measured by the Serious Safety Event rate. MHHS uses a uniform approach to performance improvement--RPI, which includes Six Sigma, Lean, and change management, to solve difficult safety and quality problems. The 9 acute care hospitals provide multiple opportunities to integrate high-reliability interventions and best practices across MHHS. For example, MHHS partnered with the Joint Commission Center for Transforming Healthcare in its inaugural project to establish reliable hand hygiene behaviors, which improved MHHS's average hand hygiene compliance rate from 44% to 92% currently. Soon after compliance exceeded 85% at all 12 hospitals, the average rate of central line-associated bloodstream and ventilator-associated pneumonias decreased to essentially zero. MHHS's size and diversity require a disciplined approach to performance improvement and systemwide achievement of measurable success. The most significant cultural change at MHHS has been the expectation for 100% compliance with evidence-based quality measures and 0% incidence of patient harm.
Improved fire protection system for underground fueling areas. Volume II. Final report Sep 77-Oct 81
DOE Office of Scientific and Technical Information (OSTI.GOV)
McDonald, L.; Kennedy, D.; Reid, G.
1981-10-01
The objectives of this investigation were to (1) develop safe practice guidelines that will minimize the chance of fires in underground fueling areas and (2) to develop a low-cost, reliable, automatic fire control system (AFCS) for underground fueling areas. Volume I of the report covered the period from June 21, 1976, to September 30, 1977, and included (1) the preparation of safe practice guidelines for underground fueling areas; (2) preparation of recommended AFCS design concepts for underground fueling areas; and (3) the design, fabrication, and in-mine fire test of an AFCS at Pine Creek Mine, Bishop, Calif. Volume II ofmore » the report covers the period from September 30, 1977, to September 30, 1981, and includes (1) a long-term validation test of the AFCS in the Pine Creek Mine, (2) a study of the environmental effects of aqueous film-forming foam, (3) the design and installation of a system at AMAX Buick Mine, Boss, Mo., (4) the design of a system for enclosed fuel areas, and (5) the design of a system for semipermanent fueling areas.« less
Brown, J. B.; Schmidt, G.; Lent, B.; Sas, G.; Lemelin, J.
2001-01-01
OBJECTIVE: To replicate, in a Francophone community, our prior work determining the reliability and validity of the full Woman Abuse Screening Tool (WAST) and a two-item version (WAST-Short). DESIGN: Questionnaires completed by abused and nonabused women. SETTING: Two women's shelters in Francophone communities in Ontario and Quebec and participants' homes or workplaces. PARTICIPANTS: A convenience sample of 25 abused women currently residing in two women's shelters and a convenience sample of 21 women who reported they were not abused. MAIN OUTCOME MEASURES: Women's responses to French versions of the WAST, the Abuse Risk Inventory (ARI), and comfort in answering the questions were compared. Also, the reliability and validity of French versions of WAST and WAST-Short were assessed. RESULTS: Abused (n = 23) and not abused (n = 21) women were demographically similar. A strong single-factor structure that accounted for 81% of total variance in the French WAST items was identified. The French WAST was found to be highly reliable with a coefficient alpha of .95 and demonstrated construct and discriminant validity. The WAST-Short correctly classified all the nonabused women and 78.7% of the abused women. The abused women reported feeling less comfortable responding to the WAST questions than the nonabused women. CONCLUSION: The French version of the WAST demonstrated good reliability and validity and discriminated between known samples of abused and nonabused women. Even though the French WAST-Short did not perform as well as the English version, results of this study support further evaluation of the WAST for screening women in Francophone or bilingual family practice settings. PMID:11398732
Carter, Amanda G; Creedy, Debra K; Sidebotham, Mary
2015-07-01
Well developed critical thinking skills are essential for nursing and midwifery practices. The development of students' higher-order cognitive abilities, such as critical thinking, is also well recognised in nursing and midwifery education. Measurement of critical thinking development is important to demonstrate change over time and effectiveness of teaching strategies. To evaluate tools designed to measure critical thinking in nursing and midwifery undergraduate students. The following six databases were searched and resulted in the retrieval of 1191 papers: CINAHL, Ovid Medline, ERIC, Informit, PsycINFO and Scopus. After screening for inclusion, each paper was evaluated using the Critical Appraisal Skills Programme Tool. Thirty-four studies met the inclusion criteria and quality appraisal. Sixteen different tools that measure critical thinking were reviewed for reliability and validity and extent to which the domains of critical thinking were evident. Sixty percent of studies utilised one of four standardised commercially available measures of critical thinking. Reliability and validity were not consistently reported and there was a variation in reliability across studies that used the same measure. Of the remaining studies using different tools, there was also limited reporting of reliability making it difficult to assess internal consistency and potential applicability of measures across settings. Discipline specific instruments to measure critical thinking in nursing and midwifery are required, specifically tools that measure the application of critical thinking to practise. Given that critical thinking development occurs over an extended period, measurement needs to be repeated and multiple methods of measurement used over time. Crown Copyright © 2015. Published by Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
McInerney, David; Thyer, Mark; Kavetski, Dmitri; Kuczera, George
2016-04-01
Appropriate representation of residual errors in hydrological modelling is essential for accurate and reliable probabilistic streamflow predictions. In particular, residual errors of hydrological predictions are often heteroscedastic, with large errors associated with high runoff events. Although multiple approaches exist for representing this heteroscedasticity, few if any studies have undertaken a comprehensive evaluation and comparison of these approaches. This study fills this research gap by evaluating a range of approaches for representing heteroscedasticity in residual errors. These approaches include the 'direct' weighted least squares approach and 'transformational' approaches, such as logarithmic, Box-Cox (with and without fitting the transformation parameter), logsinh and the inverse transformation. The study reports (1) theoretical comparison of heteroscedasticity approaches, (2) empirical evaluation of heteroscedasticity approaches using a range of multiple catchments / hydrological models / performance metrics and (3) interpretation of empirical results using theory to provide practical guidance on the selection of heteroscedasticity approaches. Importantly, for hydrological practitioners, the results will simplify the choice of approaches to represent heteroscedasticity. This will enhance their ability to provide hydrological probabilistic predictions with the best reliability and precision for different catchment types (e.g. high/low degree of ephemerality).
Dealing with patients in healthcare: A self-assessment tool.
Gremigni, Paola; Casu, Giulia; Sommaruga, Marinella
2016-06-01
This study aimed to investigate how healthcare personnel self-evaluate their ability to relate to patients in day-to-day practice from a patient-centered perspective, and to test the psychometric properties of a questionnaire developed to assess it. A sample of 600 healthcare personnel, recruited among eight hospitals in various parts of Italy, completed the 16-item Provider-Patient Relationship Questionnaire (PPRQ). A sample of 50 nurses answered the PPRQ twice, at a four-week interval. The PPRQ validity, reliability and susceptibility to social desirability were tested. PPRQ showed good reliability and structural validity, with four first-order factors: effective communication, interest in the patient's agenda, empathy, and patient involvement in care. Correlation with social desirability was negligible. Participants rated themselves as highly competent in communicating with patients, but less interested in involving the patient in care and in the patient's agenda. Differences in PPRQ dimensions were found between groups based on job type and geographic area. PPRQ is a brief self-report measure of the provider-patient relationship with promising psychometric properties in this sample. PPRQ has potential value in promoting a self-reflecting learning environment, whether through training or day-to-day practice. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Clarke, John R
2009-01-01
Surgical errors with minimally invasive surgery differ from those in open surgery. Perforations are typically the result of trocar introduction or electrosurgery. Infections include bioburdens, notably enteric viruses, on complex instruments. Retained foreign objects are primarily unretrieved device fragments and lost gallstones or other specimens. Fires and burns come from illuminated ends of fiber-optic cables and from electrosurgery. Pressure ischemia is more likely with longer endoscopic surgical procedures. Gas emboli can occur. Minimally invasive surgery is more dependent on complex equipment, with high likelihood of failures. Standardization, checklists, and problem reporting are solutions for minimizing failures. The necessity of electrosurgery makes education about best electrosurgical practices important. The recording of minimally invasive surgical procedures is an opportunity to debrief in a way that improves the reliability of future procedures. Safety depends on reliability, designing systems to withstand inevitable human errors. Safe systems are characterized by a commitment to safety, formal protocols for communications, teamwork, standardization around best practice, and reporting of problems for improvement of the system. Teamwork requires shared goals, mental models, and situational awareness in order to facilitate mutual monitoring and backup. An effective team has a flat hierarchy; team members are empowered to speak up if they are concerned about problems. Effective teams plan, rehearse, distribute the workload, and debrief. Surgeons doing minimally invasive surgery have a unique opportunity to incorporate the principles of safety into the development of their discipline.
An instrument for assessment of videotapes of general practitioners' performance.
Cox, J; Mulholland, H
1993-01-01
OBJECTIVES--To identify those important characteristics of doctors' and patients' behaviour that distinguish between "good" and "bad" consultations when viewed on videotape; to use these characteristics to develop a reliable instrument for assessing general practitioners' performance in their own consultations. DESIGN--Questionnaires completed by patients, general practitioner trainers, and general practitioner trainees. Reliability of draft instrument tested by general practitioner trainers. SETTING--All vocational training schemes for general practice in the Northern region of England. SUBJECTS--First stage: 76 patients in seven groups, 108 general practice trainers in 12 groups, and 122 general practice trainees in 10 groups. Second stage: 85 general practice trainers in 12 groups. MAIN OUTCOME MEASURES--Trainers' ratings of importance; alpha coefficients of draft instrument by trainee, group, and consultation. RESULTS--6890 characteristics of good and bad consultations were consolidated into a draft assessment instrument consisting of 46 pairs of definitions separated by six point bipolar scales. Nine statement pairs given low importance ratings by trainers were eliminated, reducing the instrument to 37 statement pairs. To test reliability, general practitioner trainers used the instrument to assess three consultations. With the exception of one group of trainers, all alpha coefficients exceeded the acceptable level of 0.80. CONCLUSION--The instrument produced is reliable for assessing general practitioners' performance in their own consultations. PMID:8490501
Design and validation of instruments to measure knowledge.
Elliott, T E; Regal, R R; Elliott, B A; Renier, C M
2001-01-01
Measuring health care providers' learning after they have participated in educational interventions that use experimental designs requires valid, reliable, and practical instruments. A literature review was conducted. In addition, experience gained from designing and validating instruments for measuring the effect of an educational intervention informed this process. The eight main steps for designing, validating, and testing the reliability of instruments for measuring learning outcomes are presented. The key considerations and rationale for this process are discussed. Methods for critiquing and adapting existent instruments and creating new ones are offered. This study may help other investigators in developing valid, reliable, and practical instruments for measuring the outcomes of educational activities.
Psychometric properties of the Kids' Child Feeding Questionnaire-Restriction.
Stromberg, Sarah E; Minski, Samantha; Wheeler, Paris B; Chardon, Marie L; Janicke, David M
Research exploring parental restrictive feeding is mixed and shows that it both negatively and positively affects children's dietary intake. One hypothesis for these inconsistent findings is the use of parent-report vs. youth-report measures of parental restrictive feeding, but there are limited psychometrically-sound youth-report measures of this construct. Therefore, the current study aims to evaluate the psychometric properties of a measure of parent restrictive feeding practices, the Kids' Child Feeding Questionnaire-Restriction (KCFQ-R), from the youth perspective. The 7-item, youth-report KCFQ-R is composed of the restriction subscale from the Kids' Child Feeding Questionnaire. This measure was completed by 225 youth attending a primary care appointment. Initial exploratory factor analysis and communalities yielded a single factor solution explaining 39.93% of the variability in the data. Internal consistency using the seven items was .73. The KCFQ-R demonstrated external validity through its significant relationship with parent concern about child overweight. Results provide preliminary support that the KCFQ-R is a psychometrically sound and reliable measure of youth-reported parental restrictive feeding practices. Given the mixed research on the effects of parent-reported parental feeding restriction on various child outcomes, this youth-report measure may help clarify these relationships. Future research should examine youth-report measures of other parent feeding domains. Copyright © 2017 Asia Oceania Association for the Study of Obesity. All rights reserved.
Hanauer, David I; Bauerle, Cynthia
2015-01-01
Science, technology, engineering, and mathematics education reform efforts have called for widespread adoption of evidence-based teaching in which faculty members attend to student outcomes through assessment practice. Awareness about the importance of assessment has illuminated the need to understand what faculty members know and how they engage with assessment knowledge and practice. The Faculty Self-Reported Assessment Survey (FRAS) is a new instrument for evaluating science faculty assessment knowledge and experience. Instrument validation was composed of two distinct studies: an empirical evaluation of the psychometric properties of the FRAS and a comparative known-groups validation to explore the ability of the FRAS to differentiate levels of faculty assessment experience. The FRAS was found to be highly reliable (α = 0.96). The dimensionality of the instrument enabled distinction of assessment knowledge into categories of program design, instrumentation, and validation. In the known-groups validation, the FRAS distinguished between faculty groups with differing levels of assessment experience. Faculty members with formal assessment experience self-reported higher levels of familiarity with assessment terms, higher frequencies of assessment activity, increased confidence in conducting assessment, and more positive attitudes toward assessment than faculty members who were novices in assessment. These results suggest that the FRAS can reliably and validly differentiate levels of expertise in faculty knowledge of assessment. © 2015 D. I. Hanauer and C. Bauerle. CBE—Life Sciences Education © 2015 The American Society for Cell Biology. This article is distributed by The American Society for Cell Biology under license from the author(s). It is available to the public under an Attribution–Noncommercial–Share Alike 3.0 Unported Creative Commons License (http://creativecommons.org/licenses/by-nc-sa/3.0).
14 CFR Sec. 2-5 - Revenue and accounting practices.
Code of Federal Regulations, 2010 CFR
2010-01-01
... 14 Aeronautics and Space 4 2010-01-01 2010-01-01 false Revenue and accounting practices. Sec. 2-5... General Accounting Provisions Sec. 2-5 Revenue and accounting practices. (a) Revenue accounting practices... physically verify the reliability of its passenger revenue accounting practice at least once each accounting...
Wang, Weiwen; Sun, Ran; Mulvehill, Alice M; Gilson, Courtney C; Huang, Linda L
2017-02-01
Patient care problems arise when health care consumers and professionals find health information on the Internet because that information is often inaccurate. To mitigate this problem, nurses can develop Web literacy and share that skill with health care consumers. This study evaluated a Web-literacy intervention for undergraduate nursing students to find reliable Web-based health information. A pre- and postsurvey queried undergraduate nursing students in an informatics course; the intervention comprised lecture, in-class practice, and assignments about health Web site evaluation tools. Data were analyzed using Wilcoxon and ANOVA signed-rank tests. Pre-intervention, 75.9% of participants reported using Web sites to obtain health information. Postintervention, 87.9% displayed confidence in using an evaluation tool. Both the ability to critique health Web sites (p = .005) and confidence in finding reliable Internet-based health information (p = .058) increased. Web-literacy education guides nursing students to find, evaluate, and use reliable Web sites, which improves their ability to deliver safer patient care. [J Nurs Educ. 2017;56(2):110-114.]. Copyright 2017, SLACK Incorporated.
Innes, Ev; Straker, Leon
2003-01-01
The purpose of this study was to understand the current beliefs of therapists in Australia, and the strategies they use to address the issues of credibility, reliability, consistency, trustworthiness, validity, generalisability and quality in conducting work-related assessments. In-depth semi-structured interviews were conducted with 26 occupational therapists and physiotherapists from around Australia. Participants expressed the belief that the therapist was the assessment instrument and was central to the credibility of an assessment. Conflict was reported when participants modified standardised assessments in an attempt to focus on context relevant activities and tasks. Participants were aware of the issues of reliability and validity but believed it was not practical to establish these aspects formally in most work-related assessments. The strategies used to achieve credibility, reliability, consistency, trustworthiness, validity, generalisability and quality were similar to those recommended for use in qualitative research. The strategies identified in this study can provide the basis for therapists to examine how they conduct work-related assessments and consider whether they currently use these strategies or have the opportunity to implement others.
The Nature Contact Questionnaire: a measure of healthy workplace exposure.
Largo-Wight, Erin; Chen, W William; Dodd, Virginia; Weiler, Robert
2011-01-01
Understanding and promoting healthy workplaces is an important and growing area of interest in occupational health. Nature contact is a central component to the study of and promotion of healthy places. Previous findings suggest that nature contact influences health via stress appraisal process. Currently, there are no known comprehensive valid and reliable measures of nature contact, which presents obstacles to research and worksite health promotion. This study was designed to develop and test an instrument to measure nature contact at work, entitled the Nature Contact Questionnaire (NCQ), 16-item self-reported checklist to measure actual exposure. A sample of 503 (30% response rate) office staff completed the questionnaire. Office staff were sent an email with a link to the electronic survey twice, two weeks apart. Content and construct validity (KMO=0.68), internal consistency (Alpha=0.64), and test-retest reliability (r=0.85, p<0.01) were established. The NCQ is the first known comprehensive, reliable and valid survey to measure nature contact, which allows research to compare forms of nature contact to best inform practice and design of healthy places.
NASA Astrophysics Data System (ADS)
Gurov, V. V.
2017-01-01
Software tools for educational purposes, such as e-lessons, computer-based testing system, from the point of view of reliability, have a number of features. The main ones among them are the need to ensure a sufficiently high probability of their faultless operation for a specified time, as well as the impossibility of their rapid recovery by the way of replacing it with a similar running program during the classes. The article considers the peculiarities of reliability evaluation of programs in contrast to assessments of hardware reliability. The basic requirements to reliability of software used for carrying out practical and laboratory classes in the form of computer-based training programs are given. The essential requirements applicable to the reliability of software used for conducting the practical and laboratory studies in the form of computer-based teaching programs are also described. The mathematical tool based on Markov chains, which allows to determine the degree of debugging of the training program for use in the educational process by means of applying the graph of the software modules interaction, is presented.
Ward, Jane K; McEachan, Rosemary R C; Lawton, Rebecca; Armitage, Gerry; Watt, Ian; Wright, John
2011-05-27
Patients have the potential to provide a rich source of information on both organisational aspects of safety and patient safety incidents. This project aims to develop two patient safety interventions to promote organisational learning about safety - a patient measure of organisational safety (PMOS), and a patient incident reporting tool (PIRT) - to help the NHS prevent patient safety incidents by learning more about when and why they occur. To develop the PMOS 1) literature will be reviewed to identify similar measures and key contributory factors to error; 2) four patient focus groups will ascertain practicality and feasibility; 3) 25 patient interviews will elicit approximately 60 items across 10 domains; 4) 10 patient and clinician interviews will test acceptability and understanding. Qualitative data will be analysed using thematic content analysis.To develop the PIRT 1) individual and then combined patient and clinician focus groups will provide guidance for the development of three potential reporting tools; 2) nine wards across three hospital directorates will pilot each of the tools for three months. The best performing tool will be identified from the frequency, volume and quality of reports. The validity of both measures will be tested. 300 patients will be asked to complete the PMOS and PIRT during their stay in hospital. A sub-sample (N = 50) will complete the PMOS again one week later. Health professionals in participating wards will also be asked to complete the AHRQ safety culture questionnaire. Case notes for all patients will be reviewed. The psychometric properties of the PMOS will be assessed and a final valid and reliable version developed. Concurrent validity for the PIRT will be assessed by comparing reported incidents with those identified from case note review and the existing staff reporting scheme. In a subsequent study these tools will be used to provide information to wards/units about their priorities for patient safety. A patient panel will provide steering to the research. The PMOS and PIRT aim to provide a reliable means of eliciting patient views about patient safety. Both interventions are likely to have relevance and practical utility for all NHS hospital trusts.
Che, Chong Chin; Hairi, Noran Naqiah; Chong, Mei Chan
2017-09-01
To review systematically the psychometric properties of instruments used to measure intention to work with older people. Nursing students are part of the future healthcare workforce; thus, being aware of their intention to work with older people would give valuable insights to nursing education and practice. Despite a plethora of research on measuring intention to work with older people, a valid and reliable instrument has not been identified. A systematic literature review of evidence and psychometric properties. Eight database searches were conducted between 2006 - 2016. English articles were selected based on inclusion and exclusion criteria. The COSMIN checklist was used to assess instruments reporting a psychometric evaluation of validity and reliability. Of 41 studies identified for full text review, 36 met the inclusion criteria. Seven different types of instruments were identified for psychometric evaluation. Measures of reliability were reported in eight papers and validity in five papers. Evidence for each measurement property was limited, with each instrument demonstrating a lack of information on measurement properties. Based on the COSMIN checklist, the overall quality of the psychometric properties was rated as poor to good. No single instrument was found to be optimal for use. Studies of high methodological quality are needed to properly assess the measurement properties of the instruments that are currently available. Until such studies are available, we recommend using existing instruments with caution. © 2017 John Wiley & Sons Ltd.
Happell, Brenda; Byrne, Louise; Platania-Phung, Chris
2015-01-01
Recovery-oriented services are a goal for policy and practice in the Australian mental health service system. Evidence-based reform requires an instrument to measure knowledge of recovery concepts. The Recovery Knowledge Inventory (RKI) was designed for this purpose, however, its suitability and validity for student health professionals has not been evaluated. The purpose of the current article is to report the psychometric features of the RKI for measuring nursing students' views on recovery. The RKI, a self-report measure, consists of four scales: (I) Roles and Responsibilities, (II) Non-Linearity of the Recovery Process, (III) Roles of Self-Definition and Peers, and (IV) Expectations Regarding Recovery. Confirmatory and exploratory factor analyses of the baseline data (n = 167) were applied to assess validity and reliability. Exploratory factor analyses generally replicated the item structure suggested by the three main scales, however more stringent analyses (confirmatory factor analysis) did not provide strong support for convergent validity. A refined RKI with 16 items had internal reliabilities of α = .75 for Roles and Responsibilities, α = .49 for Roles of Self-Definition and Peers, and α = .72, for Recovery as Non-Linear Process. If the RKI is to be applied to nursing student populations, the conceptual underpinning of the instrument needs to be reworked, and new items should be generated to evaluate and improve scale validity and reliability.
Ebesutani, Chad; Korathu-Larson, Priya; Nakamura, Brad J; Higa-McMillan, Charmaine; Chorpita, Bruce
2017-09-01
To help facilitate the dissemination and implementation of evidence-based assessment practices, we examined the psychometric properties of the shortened 25-item version of the Revised Child Anxiety and Depression Scale-parent report (RCADS-25-P), which was based on the same items as the previously published shortened 25-item child version. We used two independent samples of youth-a school sample ( N = 967, Grades 3-12) and clinical sample ( N = 433; 6-18 years)-to examine the factor structure, reliability, and validity of the RCADS-25-P scale scores. Results revealed that the two-factor structure (i.e., depression and broad anxiety factor) fit the data well in both the school and clinical sample. All reliability estimates, including test-retest indices, exceeded benchmark for good reliability. In the school sample, the RCADS-25-P scale scores converged significantly with related criterion measures and diverged with nonrelated criterion measures. In the clinical sample, the RCADS-25-P scale scores successfully discriminated between those with and without target problem diagnoses. In both samples, child-parent agreement indices were in the expected ranges. Normative data were also reported. The RCADS-25-P thus demonstrated robust psychometric properties across both a school and clinical sample as an effective brief screening instrument to assess for depression and anxiety in children and adolescents.
VanGeest, Jonathan B; Wynia, Matthew K; Cummins, Deborah S; Wilson, Ira B
2002-06-01
This study examined the test-retest reliability of physicians' self-reported manipulation of reimbursement rules for patients. The test-retest reliability of self-report of three specific tactics were examined: (1) exaggerating the severity of patients' conditions, (2) changing a patient's official (billing) diagnosis, and (3) reporting signs or symptoms that patients did not have. The reliability of a scaled summary measure of physicians' manipulation of reimbursement rules was also assessed. Overall, the authors found high levels of test-retest agreement across all three items and the summary measure. These findings suggest that self-report can be used to produce reliable data on this controversial issue. Specifically, the three items reported here can be used to produce a reliable summary measure of physicians' manipulation of reimbursement rules to help patients obtain care that physicians perceive as necessary.
Camacho, Diego R; Schlachta, Christopher M; Serrano, Oscar K; Nguyen, Ninh T
2018-03-30
Surgical telementoring programs (STMPs) as educational tools have consistently demonstrated success in the training of surgeons in a variety of surgical disciplines. The goal of an STMP is to train and educate practicing surgeons by improving or remediating surgical skills or assisting in the safe adoption of new procedures. STMPs may even have a role in assisting with recertification. In 2015, the Society of American Gastrointestinal and Endoscopic Surgeons (SAGES) launched the SAGES Telementoring Initiative at the Project 6 Summit. Herein, we provide a report on the SAGES Project 6 Logistics working group and lay out a plan for the recommended logistical framework to carry out an STMP.
The Utrecht questionnaire (U-CEP) measuring knowledge on clinical epidemiology proved to be valid.
Kortekaas, Marlous F; Bartelink, Marie-Louise E L; de Groot, Esther; Korving, Helen; de Wit, Niek J; Grobbee, Diederick E; Hoes, Arno W
2017-02-01
Knowledge on clinical epidemiology is crucial to practice evidence-based medicine. We describe the development and validation of the Utrecht questionnaire on knowledge on Clinical epidemiology for Evidence-based Practice (U-CEP); an assessment tool to be used in the training of clinicians. The U-CEP was developed in two formats: two sets of 25 questions and a combined set of 50. The validation was performed among postgraduate general practice (GP) trainees, hospital trainees, GP supervisors, and experts. Internal consistency, internal reliability (item-total correlation), item discrimination index, item difficulty, content validity, construct validity, responsiveness, test-retest reliability, and feasibility were assessed. The questionnaire was externally validated. Internal consistency was good with a Cronbach alpha of 0.8. The median item-total correlation and mean item discrimination index were satisfactory. Both sets were perceived as relevant to clinical practice. Construct validity was good. Both sets were responsive but failed on test-retest reliability. One set took 24 minutes and the other 33 minutes to complete, on average. External GP trainees had comparable results. The U-CEP is a valid questionnaire to assess knowledge on clinical epidemiology, which is a prerequisite for practicing evidence-based medicine in daily clinical practice. Copyright © 2016 Elsevier Inc. All rights reserved.
Hanson, Lisa C; Taylor, Nicholas F; McBurney, Helen
2016-09-01
To determine the retest reliability of the 10m incremental shuttle walk test (ISWT) in a mixed cardiac rehabilitation population. Participants completed two 10m ISWTs in a single session in a repeated measures study. Ten participants completed a third 10m ISWT as part of a pilot study. Hospital physiotherapy department. 62 adults aged a mean of 68 years (SD 10) referred to a cardiac rehabilitation program. Retest reliability of the 10m ISWT expressed as relative reliability and measurement error. Relative reliability was expressed in a ratio in the form of an intraclass correlation coefficient (ICC) and measurement error in the form of the standard error of measurement (SEM) and 95% confidence intervals for the group and individual. There was a high level of relative reliability over the two walks with an ICC of .99. The SEMagreement was 17m, and a change of at least 23m for the group and 54m for the individual would be required to be 95% confident of exceeding measurement error. The 10m ISWT demonstrated good retest reliability and is sufficiently reliable to be applied in practice in this population without the use of a practice test. Copyright © 2015 Chartered Society of Physiotherapy. Published by Elsevier Ltd. All rights reserved.
Murphy, Douglas J; Bruce, David A; Mercer, Stewart W; Eva, Kevin W
2009-05-01
To investigate the reliability and feasibility of six potential workplace-based assessment methods in general practice training: criterion audit, multi-source feedback from clinical and non-clinical colleagues, patient feedback (the CARE Measure), referral letters, significant event analysis, and video analysis of consultations. Performance of GP registrars (trainees) was evaluated with each tool to assess the reliabilities of the tools and feasibility, given raters and number of assessments needed. Participant experience of process determined by questionnaire. 171 GP registrars and their trainers, drawn from nine deaneries (representing all four countries in the UK), participated. The ability of each tool to differentiate between doctors (reliability) was assessed using generalisability theory. Decision studies were then conducted to determine the number of observations required to achieve an acceptably high reliability for "high-stakes assessment" using each instrument. Finally, descriptive statistics were used to summarise participants' ratings of their experience using these tools. Multi-source feedback from colleagues and patient feedback on consultations emerged as the two methods most likely to offer a reliable and feasible opinion of workplace performance. Reliability co-efficients of 0.8 were attainable with 41 CARE Measure patient questionnaires and six clinical and/or five non-clinical colleagues per doctor when assessed on two occasions. For the other four methods tested, 10 or more assessors were required per doctor in order to achieve a reliable assessment, making the feasibility of their use in high-stakes assessment extremely low. Participant feedback did not raise any major concerns regarding the acceptability, feasibility, or educational impact of the tools. The combination of patient and colleague views of doctors' performance, coupled with reliable competence measures, may offer a suitable evidence-base on which to monitor progress and completion of doctors' training in general practice.
Failure-Time Distribution Of An m-Out-of-n System
NASA Technical Reports Server (NTRS)
Scheuer, Ernest M.
1988-01-01
Formulas for reliability extended to more general cases. Useful in analyses of reliabilities of practical systems and structures, especially of redundant systems of identical components, among which operating loads distributed equally.
2004-01-01
Laboratories performing waived testing must follow the manufacturer's instructions as well as good laboratory practices to ensure that test results are reliable. Four things to concentrate on to maximize the performance and reliability of waived tests are to: 1. Read and follow the information found in the package inserts. 2. Follow the manufacturer's recommendations for running quality control. 3. Train staff members to perform tests correctly. 4. Follow established policies and procedures for patient testing in the practice.
White-Heisel, Regina; Canfield, James P; Young-Hughes, Sadie
Perceiving imminent safe patient handling and movement (SPH&M) dangers may reduce musculoskeletal (MSK) injuries for nurses in the workplace. The purpose of this study is to develop and validate the 17-item Safe Patient Handling Perception Scale (SPHPS) as an evaluation instrument assessing perceptual risk of MSK injury based on SPH&M knowledge, practice, and resource accessibility in the workplace. Data were collected from a convenience sample (N = 117) of nursing employees at a Veteran Affairs Medical Center. Factor analysis identified three factors: knowledge, practice, and accessibility. The SPHPS demonstrated high levels of reliability, supported by acceptable alpha scores (SPHM knowledge [α = .866], SPHM practices [α = .901], and access to SPHM resources [α = .855]), in addition to the relatively low standard error of measurement scores (SEM). The study outcomes suggest that the SPHPS is a valid and reliable tool that can measure participants' perceived risk factors for MSK injuries.
Self-audit of lockout/tagout in manufacturing workplaces: A pilot study.
Yamin, Samuel C; Parker, David L; Xi, Min; Stanley, Rodney
2017-05-01
Occupational health and safety (OHS) self-auditing is a common practice in industrial workplaces. However, few audit instruments have been tested for inter-rater reliability and accuracy. A lockout/tagout (LOTO) self-audit checklist was developed for use in manufacturing enterprises. It was tested for inter-rater reliability and accuracy using responses of business self-auditors and external auditors. Inter-rater reliability at ten businesses was excellent (κ = 0.84). Business self-auditors had high (100%) accuracy in identifying elements of LOTO practice that were present as well those that were absent (81% accuracy). Reliability and accuracy increased further when problematic checklist questions were removed from the analysis. Results indicate that the LOTO self-audit checklist would be useful in manufacturing firms' efforts to assess and improve their LOTO programs. In addition, a reliable self-audit instrument removes the need for external auditors to visit worksites, thereby expanding capacity for outreach and intervention while minimizing costs. © 2017 Wiley Periodicals, Inc.
Duan, Lili; Liu, Xiao; Zhang, John Z H
2016-05-04
Efficient and reliable calculation of protein-ligand binding free energy is a grand challenge in computational biology and is of critical importance in drug design and many other molecular recognition problems. The main challenge lies in the calculation of entropic contribution to protein-ligand binding or interaction systems. In this report, we present a new interaction entropy method which is theoretically rigorous, computationally efficient, and numerically reliable for calculating entropic contribution to free energy in protein-ligand binding and other interaction processes. Drastically different from the widely employed but extremely expensive normal mode method for calculating entropy change in protein-ligand binding, the new method calculates the entropic component (interaction entropy or -TΔS) of the binding free energy directly from molecular dynamics simulation without any extra computational cost. Extensive study of over a dozen randomly selected protein-ligand binding systems demonstrated that this interaction entropy method is both computationally efficient and numerically reliable and is vastly superior to the standard normal mode approach. This interaction entropy paradigm introduces a novel and intuitive conceptual understanding of the entropic effect in protein-ligand binding and other general interaction systems as well as a practical method for highly efficient calculation of this effect.
Bergamin, Marco; Gobbo, Stefano; Bullo, Valentina; Vendramin, Barbara; Duregon, Federica; Frizziero, Antonio; Di Blasio, Andrea; Cugusi, Lucia; Zaccaria, Marco; Ermolao, Andrea
2017-01-01
Lower extremity muscle mass, strength, power, and physical performance are critical determinants of independent functioning in later life. Isokinetic dynamometers are becoming very common in assessing different features of muscle strength, in both research and clinical practice; however, reliability studies are still needed to support the extended use of those devices. The purpose of this study is to assess the test-retest reliability of knee and ankle isokinetic and isometric strength testing protocols in a sample of older healthy subjects, using a new and untested isokinetic multi-joint evaluation system. Sixteen male and fourteen female older adults (mean age 65.2 ± 4.6 years) were assessed in two testing sessions. Each participant performed a randomized testing procedure that includes different isometric and isokinetic tests for knee and ankle joints. All participants concluded the trial safety and no subject reported any discomfort throughout the overall assessment. Coefficients of correlation between measures were calculated showing moderate to strong effects among all test-retest assessments and paired-sample t test showed only one significant difference (p<0.05) in the maximal isokinetic bilateral knee flexion torque. The multi-joint evaluation system for the assessment of knee and ankle isokinetic and isometric strength provided reliable test-retest measures in healthy older adults. Ib.
Spreco, A; Timpka, T
2016-01-01
Objectives Reliable monitoring of influenza seasons and pandemic outbreaks is essential for response planning, but compilations of reports on detection and prediction algorithm performance in influenza control practice are largely missing. The aim of this study is to perform a metanarrative review of prospective evaluations of influenza outbreak detection and prediction algorithms restricted settings where authentic surveillance data have been used. Design The study was performed as a metanarrative review. An electronic literature search was performed, papers selected and qualitative and semiquantitative content analyses were conducted. For data extraction and interpretations, researcher triangulation was used for quality assurance. Results Eight prospective evaluations were found that used authentic surveillance data: three studies evaluating detection and five studies evaluating prediction. The methodological perspectives and experiences from the evaluations were found to have been reported in narrative formats representing biodefence informatics and health policy research, respectively. The biodefence informatics narrative having an emphasis on verification of technically and mathematically sound algorithms constituted a large part of the reporting. Four evaluations were reported as health policy research narratives, thus formulated in a manner that allows the results to qualify as policy evidence. Conclusions Awareness of the narrative format in which results are reported is essential when interpreting algorithm evaluations from an infectious disease control practice perspective. PMID:27154479
International classification of reliability for implanted cochlear implant receiver stimulators.
Battmer, Rolf-Dieter; Backous, Douglas D; Balkany, Thomas J; Briggs, Robert J S; Gantz, Bruce J; van Hasselt, Andrew; Kim, Chong Sun; Kubo, Takeshi; Lenarz, Thomas; Pillsbury, Harold C; O'Donoghue, Gerard M
2010-10-01
To design an international standard to be used when reporting reliability of the implanted components of cochlear implant systems to appropriate governmental authorities, cochlear implant (CI) centers, and for journal editors in evaluating manuscripts involving cochlear implant reliability. The International Consensus Group for Cochlear Implant Reliability Reporting was assembled to unify ongoing efforts in the United States, Europe, Asia, and Australia to create a consistent and comprehensive classification system for the implanted components of CI systems across manufacturers. All members of the consensus group are from tertiary referral cochlear implant centers. None. A clinically relevant classification scheme adapted from principles of ISO standard 5841-2:2000 originally designed for reporting reliability of cardiac pacemakers, pulse generators, or leads. Standard definitions for device failure, survival time, clinical benefit, reduced clinical benefit, and specification were generated. Time intervals for reporting back to implant centers for devices tested to be "out of specification," categorization of explanted devices, the method of cumulative survival reporting, and content of reliability reports to be issued by manufacturers was agreed upon by all members. The methodology for calculating Cumulative survival was adapted from ISO standard 5841-2:2000. The International Consensus Group on Cochlear Implant Device Reliability Reporting recommends compliance to this new standard in reporting reliability of implanted CI components by all manufacturers of CIs and the adoption of this standard as a minimal reporting guideline for editors of journals publishing cochlear implant research results.
Carthey, Jane; McCormack, Katie; Coombes, Julie; Gilbert, Douglas; Farrar, Daniel
2016-12-01
This article describes an intervention that combined in-situ coaching, observational audits and story-telling to educate theatre teams at University College London Hospitals about the Five steps to safer surgery (NPSA 2010). Our philosophy was to educate theatre teams about 'what goes right' (good catches, exemplary leadership etc) as well as 'what could be improved'. Results showed improvements on 'behavioural reliability' metrics, a 68% increase in near miss reporting and a reduction in surgical harm incidents. Copyright the Association for Perioperative Practice.
Parametric Design and Mechanical Analysis of Beams based on SINOVATION
NASA Astrophysics Data System (ADS)
Xu, Z. G.; Shen, W. D.; Yang, D. Y.; Liu, W. M.
2017-07-01
In engineering practice, engineer needs to carry out complicated calculation when the loads on the beam are complex. The processes of analysis and calculation take a lot of time and the results are unreliable. So VS2005 and ADK are used to develop a software for beams design based on the 3D CAD software SINOVATION with C ++ programming language. The software can realize the mechanical analysis and parameterized design of various types of beams and output the report of design in HTML format. Efficiency and reliability of design of beams are improved.
A practical and scalable manufacturing process for an anti-fungal agent, Nikkomycin Z.
Stenland, Christopher J; Lis, Lev G; Schendel, Frederick J; Hahn, Nicholas J; Smart, Mary A; Miller, Amy L; von Keitz, Marc G; Gurvich, Vadim J
2013-02-15
A scalable and reliable manufacturing process for Nikkomycin Z HCl on a 170 g scale has been developed and optimized. The process is characterized by a 2.3 g/L fermentation yield, 79% purification yield, and >98% relative purity of the final product. This method is suitable for further scale up and cGMP production. The Streptomyces tendae ΔNikQ strain developed during the course of this study is superior to any previously reported strain in terms of higher yield and purity of Nikkomycin Z.
Software Reliability Analysis of NASA Space Flight Software: A Practical Experience
Sukhwani, Harish; Alonso, Javier; Trivedi, Kishor S.; Mcginnis, Issac
2017-01-01
In this paper, we present the software reliability analysis of the flight software of a recently launched space mission. For our analysis, we use the defect reports collected during the flight software development. We find that this software was developed in multiple releases, each release spanning across all software life-cycle phases. We also find that the software releases were developed and tested for four different hardware platforms, spanning from off-the-shelf or emulation hardware to actual flight hardware. For releases that exhibit reliability growth or decay, we fit Software Reliability Growth Models (SRGM); otherwise we fit a distribution function. We find that most releases exhibit reliability growth, with Log-Logistic (NHPP) and S-Shaped (NHPP) as the best-fit SRGMs. For the releases that experience reliability decay, we investigate the causes for the same. We find that such releases were the first software releases to be tested on a new hardware platform, and hence they encountered major hardware integration issues. Also such releases seem to have been developed under time pressure in order to start testing on the new hardware platform sooner. Such releases exhibit poor reliability growth, and hence exhibit high predicted failure rate. Other problems include hardware specification changes and delivery delays from vendors. Thus, our analysis provides critical insights and inputs to the management to improve the software development process. As NASA has moved towards a product line engineering for its flight software development, software for future space missions will be developed in a similar manner and hence the analysis results for this mission can be considered as a baseline for future flight software missions. PMID:29278255
Software Reliability Analysis of NASA Space Flight Software: A Practical Experience.
Sukhwani, Harish; Alonso, Javier; Trivedi, Kishor S; Mcginnis, Issac
2016-01-01
In this paper, we present the software reliability analysis of the flight software of a recently launched space mission. For our analysis, we use the defect reports collected during the flight software development. We find that this software was developed in multiple releases, each release spanning across all software life-cycle phases. We also find that the software releases were developed and tested for four different hardware platforms, spanning from off-the-shelf or emulation hardware to actual flight hardware. For releases that exhibit reliability growth or decay, we fit Software Reliability Growth Models (SRGM); otherwise we fit a distribution function. We find that most releases exhibit reliability growth, with Log-Logistic (NHPP) and S-Shaped (NHPP) as the best-fit SRGMs. For the releases that experience reliability decay, we investigate the causes for the same. We find that such releases were the first software releases to be tested on a new hardware platform, and hence they encountered major hardware integration issues. Also such releases seem to have been developed under time pressure in order to start testing on the new hardware platform sooner. Such releases exhibit poor reliability growth, and hence exhibit high predicted failure rate. Other problems include hardware specification changes and delivery delays from vendors. Thus, our analysis provides critical insights and inputs to the management to improve the software development process. As NASA has moved towards a product line engineering for its flight software development, software for future space missions will be developed in a similar manner and hence the analysis results for this mission can be considered as a baseline for future flight software missions.
Kitchener, Henry C; Gittins, Matthew; Desai, Mina; Smith, John H F; Cook, Gary; Roberts, Chris; Turnbull, Lesley
2015-03-01
Liquid-based cytology (LBC) for cervical screening would benefit from laboratory practice guidelines that define specimen adequacy for reporting of slides. The evidence base required to define cell adequacy should incorporate both ThinPrep™ (TP; Hologic, Inc., Bedford, MA, USA) and SurePath™ (SP; BD Diagnostics, Burlington, NC, USA), the two LBC systems used in the UK cervical screening programmes. The objectives of this study were to determine (1) current practice for reporting LBC in England, Wales and Scotland, (2) a reproducible method for cell counting, (3) the cellularity of slides classified as inadequate, negative or abnormal and (4) the impact of varying cellularity on the likelihood of detecting cytological abnormalities. The study involved four separate arms to pursue each of the four objectives. (1) A questionnaire survey of laboratories was conducted. (2) A standard counting protocol was developed and used by three experienced cytopathologists to determine a reliable and reproducible cell counting method. (3) Slide sets which included a range of cytological abnormalities were each sent to three laboratories for cell counting to study the correlation between cell counts and reported cytological outcomes. (4) Dilution of LBC samples by fluid only (unmixed) or by dilution with a sample containing normal cells (mixed) was performed to study the impact on reporting of reducing either the total cell count or the relative proportion of abnormal to normal cells. The study was conducted within the cervical screening programmes in England, Wales and Scotland, using routinely obtained cervical screening samples, and in 56 participating NHS cervical cytology laboratories. The study involved only routinely obtained cervical screening samples. There was no clinical intervention. The main outcome measures were (1) reliability of counting method, (2) correlation of reported cytology grades with cellularity and (3) levels of detection of abnormal cells in progressively diluted cervical samples. Laboratory practice varied in terms of threshold of cellular adequacy and of morphological markers of adequacy. While SP laboratories generally used a minimum acceptable cell count (MACC) of 15,000, the MACC employed by TP laboratories varied between 5000 and 15,000. The cell counting study showed that a standard protocol achieved moderate to strong inter-rater reproducibility. Analysis of slide reporting from laboratories revealed that a large proportion of the samples reported as inadequate had cell counts above a threshold of 15,000 for SP, and 5000 and 10,000 for TP. Inter-rater unanimity was greater among more cellular preparations. Dilution studies demonstrated greater detection of abnormalities in slides with counts above the MACC and among slides with more than 25 dyskaryotic cells. Variation in laboratory practice demonstrates a requirement for evidence-based standards for designating a MACC. This study has indicated that a MACC of 15,000 and 5000 for SP and TP, respectively, achieves a balance in terms of maintaining sensitivity and low inadequacy rates. The findings of this study should inform the development of laboratory practice guidelines. The National Institute for Health Research Health Technology Assessment programme.
ERIC Educational Resources Information Center
Park, Bitnara Jasmine; Irvin, P. Shawn; Lai, Cheng-Fei; Alonzo, Julie; Tindal, Gerald
2012-01-01
In this technical report, we present the results of a reliability study of the fifth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
ERIC Educational Resources Information Center
Lai, Cheng-Fei; Irvin, P. Shawn; Alonzo, Julie; Park, Bitnara Jasmine; Tindal, Gerald
2012-01-01
In this technical report, we present the results of a reliability study of the second-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
ERIC Educational Resources Information Center
Park, Bitnara Jasmine; Irvin, P. Shawn; Alonzo, Julie; Lai, Cheng-Fei; Tindal, Gerald
2012-01-01
In this technical report, we present the results of a reliability study of the fourth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
ERIC Educational Resources Information Center
Irvin, P. Shawn; Alonzo, Julie; Park, Bitnara Jasmine; Lai, Cheng-Fei; Tindal, Gerald
2012-01-01
In this technical report, we present the results of a reliability study of the sixth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
ERIC Educational Resources Information Center
Irvin, P. Shawn; Alonzo, Julie; Lai, Cheng-Fei; Park, Bitnara Jasmine; Tindal, Gerald
2012-01-01
In this technical report, we present the results of a reliability study of the seventh-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
ERIC Educational Resources Information Center
Lai, Cheng-Fei; Irvin, P. Shawn; Park, Bitnara Jasmine; Alonzo, Julie; Tindal, Gerald
2012-01-01
In this technical report, we present the results of a reliability study of the third-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Rating reliability and representation validity in scenic landscape assessments
James F. Palmer; Robin E. Hoffman
2001-01-01
The US Supreme Court recently determined that experts from all fields of knowledge must demonstrate the reliability and validity of their testimony. While the broader implications of their finding have yet to manifest itself, it clearly has the potential to challenge all manner of professional practices. This paper explores the reliability of visual quality ratings of...
Franklin, Ashley E; Burns, Paulette; Lee, Christopher S
2014-10-01
In 2006, the National League for Nursing published three measures related to novice nurses' beliefs about self-confidence, scenario design, and educational practices associated with simulation. Despite the extensive use of these measures, little is known about their reliability and validity. The psychometric properties of the Student Satisfaction and Self-Confidence in Learning Scale, Simulation Design Scale, and Educational Practices Questionnaire were studied among a sample of 2200 surveys completed by novice nurses from a liberal arts university in the southern United States. Psychometric tests included item analysis, confirmatory and exploratory factor analyses in randomly-split subsamples, concordant and discordant validity, and internal consistency. All three measures have sufficient reliability and validity to be used in education research. There is room for improvement in content validity with the Student Satisfaction and Self-Confidence in Learning and Simulation Design Scale. This work provides robust evidence to ensure that judgments made about self-confidence after simulation, simulation design and educational practices are valid and reliable. Copyright © 2014 Elsevier Ltd. All rights reserved.
Lau, Nathan; Jamieson, Greg A; Skraaning, Gyrd
2016-03-01
The Process Overview Measure is a query-based measure developed to assess operator situation awareness (SA) from monitoring process plants. A companion paper describes how the measure has been developed according to process plant properties and operator cognitive work. The Process Overview Measure demonstrated practicality, sensitivity, validity and reliability in two full-scope simulator experiments investigating dramatically different operational concepts. Practicality was assessed based on qualitative feedback of participants and researchers. The Process Overview Measure demonstrated sensitivity and validity by revealing significant effects of experimental manipulations that corroborated with other empirical results. The measure also demonstrated adequate inter-rater reliability and practicality for measuring SA in full-scope simulator settings based on data collected on process experts. Thus, full-scope simulator studies can employ the Process Overview Measure to reveal the impact of new control room technology and operational concepts on monitoring process plants. Practitioner Summary: The Process Overview Measure is a query-based measure that demonstrated practicality, sensitivity, validity and reliability for assessing operator situation awareness (SA) from monitoring process plants in representative settings.
Seismology software: state of the practice
NASA Astrophysics Data System (ADS)
Smith, W. Spencer; Zeng, Zheng; Carette, Jacques
2018-05-01
We analyzed the state of practice for software development in the seismology domain by comparing 30 software packages on four aspects: product, implementation, design, and process. We found room for improvement in most seismology software packages. The principal areas of concern include a lack of adequate requirements and design specification documents, a lack of test data to assess reliability, a lack of examples to get new users started, and a lack of technological tools to assist with managing the development process. To assist going forward, we provide recommendations for a document-driven development process that includes a problem statement, development plan, requirement specification, verification and validation (V&V) plan, design specification, code, V&V report, and a user manual. We also provide advice on tool use, including issue tracking, version control, code documentation, and testing tools.
Seismology software: state of the practice
NASA Astrophysics Data System (ADS)
Smith, W. Spencer; Zeng, Zheng; Carette, Jacques
2018-02-01
We analyzed the state of practice for software development in the seismology domain by comparing 30 software packages on four aspects: product, implementation, design, and process. We found room for improvement in most seismology software packages. The principal areas of concern include a lack of adequate requirements and design specification documents, a lack of test data to assess reliability, a lack of examples to get new users started, and a lack of technological tools to assist with managing the development process. To assist going forward, we provide recommendations for a document-driven development process that includes a problem statement, development plan, requirement specification, verification and validation (V&V) plan, design specification, code, V&V report, and a user manual. We also provide advice on tool use, including issue tracking, version control, code documentation, and testing tools.
Application of exercise ECG stress test in the current high cost modern-era healthcare system.
Vaidya, Gaurang Nandkishor
Exercise electrocardiogram (ECG) tests boasts of being more widely available, less resource intensive, lower cost and absence of radiation. In the presence of a normal baseline ECG, an exercise ECG test is able to generate a reliable and reproducible result almost comparable to Technitium-99m sestamibi perfusion imaging. Exercise ECG changes when combined with other clinical parameters obtained during the test has the potential to allow effective redistribution of scarce resources by excluding low risk patients with significant accuracy. As we look towards a future of rising healthcare costs, increased prevalence of cardiovascular disease and the need for proper allocation of limited resources; exercise ECG test offers low cost, vital and reliable disease interpretation. This article highlights the physiology of the exercise ECG test, patient selection, effective interpretation, describe previously reported scores and their clinical application in today's clinical practice. Copyright © 2017. Published by Elsevier B.V.
Jeong, Jong Seok; Mkhoyan, K Andre
2016-06-01
Acquiring an atomic-resolution compositional map of crystalline specimens has become routine practice, thus opening possibilities for extracting subatomic information from such maps. A key challenge for achieving subatomic precision is the improvement of signal-to-noise ratio (SNR) of compositional maps. Here, we report a simple and reliable solution for achieving high-SNR energy-dispersive X-ray (EDX) spectroscopy spectrum images for individual atomic columns. The method is based on standard cross-correlation aided by averaging of single-column EDX maps with modifications in the reference image. It produces EDX maps with minimal specimen drift, beam drift, and scan distortions. Step-by-step procedures to determine a self-consistent reference map with a discussion on the reliability, stability, and limitations of the method are presented here.
Final Report of the NASA Office of Safety and Mission Assurance Agile Benchmarking Team
NASA Technical Reports Server (NTRS)
Wetherholt, Martha
2016-01-01
To ensure that the NASA Safety and Mission Assurance (SMA) community remains in a position to perform reliable Software Assurance (SA) on NASAs critical software (SW) systems with the software industry rapidly transitioning from waterfall to Agile processes, Terry Wilcutt, Chief, Safety and Mission Assurance, Office of Safety and Mission Assurance (OSMA) established the Agile Benchmarking Team (ABT). The Team's tasks were: 1. Research background literature on current Agile processes, 2. Perform benchmark activities with other organizations that are involved in software Agile processes to determine best practices, 3. Collect information on Agile-developed systems to enable improvements to the current NASA standards and processes to enhance their ability to perform reliable software assurance on NASA Agile-developed systems, 4. Suggest additional guidance and recommendations for updates to those standards and processes, as needed. The ABT's findings and recommendations for software management, engineering and software assurance are addressed herein.
Turner, Erlanger A
2012-08-01
The purpose of this paper is to provide psychometric data on the Parental Attitudes Toward Psychological Services Inventory (PATPSI), which is a revised measure to assess parents' attitudes toward outpatient mental health services. Using a sample of adults (N = 250), Study 1 supported a 3-factor structure (RMSEA = .05, NNFI = .94, and CFI = .94), adequate internal consistency (ranging from .72 to .92), and test-retest reliability (ranging from .66 to .84). Additionally, results indicated that individuals with previous use of mental health services reported more positive views toward child mental health services. Study 2 provided confirming evidence of the 3-factor structure (NNFI = .94, RMSEA = .08, and the CFI = .95) and adequate reliability (ranging from .70 to .90) using a parent-sample (N = 260). Additionally, discriminate validity of the PATPSI was supported. Implications for research and clinical practice are discussed.
Development and reliability testing of the Worksite and Energy Balance Survey.
Hoehner, Christine M; Budd, Elizabeth L; Marx, Christine M; Dodson, Elizabeth A; Brownson, Ross C
2013-01-01
Worksites represent important venues for health promotion. Development of psychometrically sound measures of worksite environments and policy supports for physical activity and healthy eating are needed for use in public health research and practice. Assess the test-retest reliability of the Worksite and Energy Balance Survey (WEBS), a self-report instrument for assessing perceptions of worksite supports for physical activity and healthy eating. The WEBS included items adapted from existing surveys or new items on the basis of a review of the literature and expert review. Cognitive interviews among 12 individuals were used to test the clarity of items and further refine the instrument. A targeted random-digit-dial telephone survey was administered on 2 occasions to assess test-retest reliability (mean days between time periods = 8; minimum = 5; maximum = 14). Five Missouri census tracts that varied by racial-ethnic composition and walkability. Respondents included 104 employed adults (67% white, 64% women, mean age = 48.6 years). Sixty-three percent were employed at worksites with less than 100 employees, approximately one-third supervised other people, and the majority worked a regular daytime shift (75%). Test-retest reliability was assessed using Spearman correlations for continuous variables, Cohen's κ statistics for nonordinal categorical variables, and 1-way random intraclass correlation coefficients for ordinal categorical variables. Test-retest coefficients ranged from 0.41 to 0.97, with 80% of items having reliability coefficients of more than 0.6. Items that assessed participation in or use of worksite programs/facilities tended to have lower reliability. Reliability of some items varied by gender, obesity status, and worksite size. Test-retest reliability and internal consistency for the 5 scales ranged from 0.84 to 0.94 and 0.63 to 0.84, respectively. The WEBS items and scales exhibited sound test-retest reliability and may be useful for research and surveillance. Further evaluation is needed to document the validity of the WEBS and associations with energy balance outcomes.
2014-01-01
Background Latino preschoolers (3-5 year old children) have among the highest rates of obesity. Low levels of physical activity (PA) are a risk factor for obesity. Characterizing what Latino parents do to encourage or discourage their preschooler to be physically active can help inform interventions to increase their PA. The objective was therefore to develop and assess the psychometrics of a new instrument: the Preschooler Physical Activity Parenting Practices (PPAPP) among a Latino sample, to assess parenting practices used to encourage or discourage PA among preschool-aged children. Methods Cross-sectional study of 240 Latino parents who reported the frequency of using PA parenting practices. 95% of respondents were mothers; 42% had more than a high school education. Child mean age was 4.5 (±0.9) years (52% male). Test-retest reliability was assessed in 20%, 2 weeks later. We assessed the fit of a priori models using Confirmatory factor analyses (CFA). In a separate sub-sample (35%), preschool-aged children wore accelerometers to assess associations with their PA and PPAPP subscales. Results The a-priori models showed poor fit to the data. A modified factor structure for encouraging PPAPP had one multiple-item scale: engagement (15 items), and two single-items (have outdoor toys; not enroll in sport-reverse coded). The final factor structure for discouraging PPAPP had 4 subscales: promote inactive transport (3 items), promote screen time (3 items), psychological control (4 items) and restricting for safety (4 items). Test-retest reliability (ICC) for the two scales ranged from 0.56-0.85. Cronbach’s alphas ranged from 0.5-0.9. Several sub-factors correlated in the expected direction with children’s objectively measured PA. Conclusion The final models for encouraging and discouraging PPAPP had moderate to good fit, with moderate to excellent test-retest reliabilities. The PPAPP should be further evaluated to better assess its associations with children’s PA and offers a new tool for measuring PPAPP among Latino families with preschool-aged children. PMID:24428935
Rail Transit System Maintenance Practices for Automatic Fare Collection Equipment
DOT National Transportation Integrated Search
1984-05-01
A review of rail transit system maintenance practices for automatic fare collection (AFC) equipment was performed. This study supports an UMTA sponsored program to improve the reliability of AFC equipment. The maintenance practices of the transit sys...
NASA preferred reliability-practices for design and test
NASA Technical Reports Server (NTRS)
Lisk, Ronald C.
1992-01-01
NASA HQ established the NASA R&M Steering Committee (R&MSC) comprised of membership from each NASA field center. The primary charter of the R&MSC is to obtain, record, and share the best design practices that NASA has applied to successful space flight programs and current design considerations (guidelines) that should enhance flight reliability on emerging programs. The practices and guidelines are being assembled in a living document for distribution to NASA centers and the aerospace community. The document will be updated annually with additional practices and guidelines as contributions from the centers are reviewed and approved by the R&MSC. Practices and guidelines are not requirements, but rather a means of sharing procedures and techniques that a given center and the R&MSC together feel have strong technical merit and application to the design of space-related equipment.
Development and psychometric evaluation of the Professional Practice Environment (PPE) scale.
Erickson, Jeanette Ives; Duffy, Mary E; Gibbons, M Patricia; Fitzmaurice, Joan; Ditomassi, Marianne; Jones, Dorothy
2004-01-01
To describe the Professional Practice Environment (PPE) scale, its conceptual development and psychometric evaluation, and its uses in measuring eight characteristics of the professional practice environment in an acute care setting. The 38-item PPE Scale was validated on a sample of 849 professional practice staff at the Massachusetts General Hospital in Boston. Psychometric analysis included: item analysis, principal components analysis (PCA) with varimax rotation and Kaiser normalization, and internal consistency reliability using Cronbach's alpha coefficient. Eight components were shown, confirming the original conceptually derived model's structure and accounting for 61% of explained variance. Cronbach's alpha coefficients for the eight PPE subscales ranged from .78 to .88. Findings showed the 38-item PPE Scale was reliable and valid for use in health outcomes research to examine the professional practice environment of staff working in acute care settings.
Predicting Cost/Reliability/Maintainability of Advanced General Aviation Avionics Equipment
NASA Technical Reports Server (NTRS)
Davis, M. R.; Kamins, M.; Mooz, W. E.
1978-01-01
A methodology is provided for assisting NASA in estimating the cost, reliability, and maintenance (CRM) requirements for general avionics equipment operating in the 1980's. Practical problems of predicting these factors are examined. The usefulness and short comings of different approaches for modeling coast and reliability estimates are discussed together with special problems caused by the lack of historical data on the cost of maintaining general aviation avionics. Suggestions are offered on how NASA might proceed in assessing cost reliability CRM implications in the absence of reliable generalized predictive models.
Kimball, Alexa B; Sundaram, Murali; Banderas, Benjamin; Foley, Catherine; Shields, Alan L
2018-03-01
Two patient-reported outcome (PRO) questionnaires, the Hidradenitis Suppurativa Symptom Assessment (HSSA) and Hidradenitis Suppurativa Impact Assessment (HSIA), were developed to measure signs, symptoms and impacts of HS in treatment efficacy studies. In accordance with FDA guidelines and published best practices, four stages of research were conducted to create the questionnaires: concept elicitation, questionnaire construction, content evaluation and psychometric evaluation. Subjects (N = 20) who participated in the concept elicitation stage reported 15 unique HS-related signs and symptoms and 51 impacts. Following this, eight sign and symptom concepts and 21 impacts were selected for construction of the HSSA and HSIA, respectively. During content evaluation, cognitive debriefing interviews with HS subjects (N = 20) confirmed subjects could read, comprehend and meaningfully respond to both questionnaires. Modifications made after this stage of work resulted in a nine-item HSSA and a 17-item HSIA. The HSSA and HSIA were subsequently entered into a US-based observational study (N = 40), and the scores produced by each were found to be reliable, construct valid, and able to distinguish among clinically distinct groups. The HSSA and HSIA are content-valid, HS-specific, PRO questionnaires with demonstrated ability to generate reliable, valid scores when administered to patients with HS in a research setting.
Educational testing validity and reliability in pharmacy and medical education literature.
Hoover, Matthew J; Jung, Rose; Jacobs, David M; Peeters, Michael J
2013-12-16
To evaluate and compare the reliability and validity of educational testing reported in pharmacy education journals to medical education literature. Descriptions of validity evidence sources (content, construct, criterion, and reliability) were extracted from articles that reported educational testing of learners' knowledge, skills, and/or abilities. Using educational testing, the findings of 108 pharmacy education articles were compared to the findings of 198 medical education articles. For pharmacy educational testing, 14 articles (13%) reported more than 1 validity evidence source while 83 articles (77%) reported 1 validity evidence source and 11 articles (10%) did not have evidence. Among validity evidence sources, content validity was reported most frequently. Compared with pharmacy education literature, more medical education articles reported both validity and reliability (59%; p<0.001). While there were more scholarship of teaching and learning (SoTL) articles in pharmacy education compared to medical education, validity, and reliability reporting were limited in the pharmacy education literature.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Peters, Valerie A.; Ogilvie, Alistair B.
2012-01-01
This report addresses the general data requirements for reliability analysis of fielded wind turbines and other wind plant equipment. The report provides a rationale for why this data should be collected, a list of the data needed to support reliability and availability analysis, and specific data recommendations for a Computerized Maintenance Management System (CMMS) to support automated analysis. This data collection recommendations report was written by Sandia National Laboratories to address the general data requirements for reliability analysis of operating wind turbines. This report is intended to help develop a basic understanding of the data needed for reliability analysis frommore » a Computerized Maintenance Management System (CMMS) and other data systems. The report provides a rationale for why this data should be collected, a list of the data needed to support reliability and availability analysis, and specific recommendations for a CMMS to support automated analysis. Though written for reliability analysis of wind turbines, much of the information is applicable to a wider variety of equipment and analysis and reporting needs. The 'Motivation' section of this report provides a rationale for collecting and analyzing field data for reliability analysis. The benefits of this type of effort can include increased energy delivered, decreased operating costs, enhanced preventive maintenance schedules, solutions to issues with the largest payback, and identification of early failure indicators.« less
Sternberg, Scot B; Co, John Patrick T; Homer, Charles J
2011-01-01
To identify, assess, and make recommendations for inclusion of measures that assess the domain of "most integrated health care setting," with a specific focus on measures of the medical home, one particular mechanism for integrating care, to identify gaps in measurement; and to make recommendations for new measure development. We developed a conceptual framework for care integration and reviewed literature on measures assessing the presence and quality of the medical home to determine their validity, reliability, and feasibility as a proxy for care integration. We identified 2 broad approaches to assessing the extent to which patients receive care that fulfills the aims of the medical home: 1) organizational assessment of practice systems and processes thought associated with achieving these desired aims (viz, the National Committee for Quality Assurance Physician Practice Connections-Patient Centered Medical Home measure and the Medical Home Index, and 2) direct assessment by patients/families of their experience of care in targeted dimensions. Based on concerns about the absence of reliability data and the feasibility of applying the practice audit/self-assessment approach on a population level for the purpose of state reporting, as well as the limited data linking performance on the specific measures with important child outcomes, we did not recommend any of the measures of organizational assessments of practice systems for inclusion in the core set as an indicator of care integration. In contrast, measures of the medical home based on items from the National Survey of Child Health on a population level of or the Consumer Assessment of Healthcare Providers and Systems for practice- and state-level assessment are more feasible, have known reliability and performance characteristics, and more closely reflect the aims of the medical home, including care integration. Measures of health care integration as captured by the experience of care in a medical home can best be assessed for state-level performance through patient/family experience surveys. Better measures of care integration, care coordination, and integration of mental, developmental, and physical health into a comprehensive care system are high-priority topics for measure development. Copyright © 2011 Academic Pediatric Association. Published by Elsevier Inc. All rights reserved.
Empirical methods for assessing meaningful neuropsychological change following epilepsy surgery.
Sawrie, S M; Chelune, G J; Naugle, R I; Lüders, H O
1996-11-01
Traditional methods for assessing the neurocognitive effects of epilepsy surgery are confounded by practice effects, test-retest reliability issues, and regression to the mean. This study employs 2 methods for assessing individual change that allow direct comparison of changes across both individuals and test measures. Fifty-one medically intractable epilepsy patients completed a comprehensive neuropsychological battery twice, approximately 8 months apart, prior to any invasive monitoring or surgical intervention. First, a Reliable Change (RC) index score was computed for each test score to take into account the reliability of that measure, and a cutoff score was empirically derived to establish the limits of statistically reliable change. These indices were subsequently adjusted for expected practice effects. The second approach used a regression technique to establish "change norms" along a common metric that models both expected practice effects and regression to the mean. The RC index scores provide the clinician with a statistical means of determining whether a patient's retest performance is "significantly" changed from baseline. The regression norms for change allow the clinician to evaluate the magnitude of a given patient's change on 1 or more variables along a common metric that takes into account the reliability and stability of each test measure. Case data illustrate how these methods provide an empirically grounded means for evaluating neurocognitive outcomes following medical interventions such as epilepsy surgery.
Hulteen, Ryan M; Lander, Natalie J; Morgan, Philip J; Barnett, Lisa M; Robertson, Samuel J; Lubans, David R
2015-10-01
It has been suggested that young people should develop competence in a variety of 'lifelong physical activities' to ensure that they can be active across the lifespan. The primary aim of this systematic review is to report the methodological properties, validity, reliability, and test duration of field-based measures that assess movement skill competency in lifelong physical activities. A secondary aim was to clearly define those characteristics unique to lifelong physical activities. A search of four electronic databases (Scopus, SPORTDiscus, ProQuest, and PubMed) was conducted between June 2014 and April 2015 with no date restrictions. Studies addressing the validity and/or reliability of lifelong physical activity tests were reviewed. Included articles were required to assess lifelong physical activities using process-oriented measures, as well as report either one type of validity or reliability. Assessment criteria for methodological quality were adapted from a checklist used in a previous review of sport skill outcome assessments. Movement skill assessments for eight different lifelong physical activities (badminton, cycling, dance, golf, racquetball, resistance training, swimming, and tennis) in 17 studies were identified for inclusion. Methodological quality, validity, reliability, and test duration (time to assess a single participant), for each article were assessed. Moderate to excellent reliability results were found in 16 of 17 studies, with 71% reporting inter-rater reliability and 41% reporting intra-rater reliability. Only four studies in this review reported test-retest reliability. Ten studies reported validity results; content validity was cited in 41% of these studies. Construct validity was reported in 24% of studies, while criterion validity was only reported in 12% of studies. Numerous assessments for lifelong physical activities may exist, yet only assessments for eight lifelong physical activities were included in this review. Generalizability of results may be more applicable if more heterogeneous samples are used in future research. Moderate to excellent levels of inter- and intra-rater reliability were reported in the majority of studies. However, future work should look to establish test-retest reliability. Validity was less commonly reported than reliability, and further types of validity other than content validity need to be established in future research. Specifically, predictive validity of 'lifelong physical activity' movement skill competency is needed to support the assertion that such activities provide the foundation for a lifetime of activity.
The German Version of the Gaze Anxiety Rating Scale (GARS): Reliability and Validity
Domes, Gregor; Marx, Lisa; Spenthof, Ines; Heinrichs, Markus
2016-01-01
Objective Fear of eye gaze and avoidance of eye contact are core features of social anxiety disorders (SAD). To measure self-reported fear and avoidance of eye gaze, the Gaze Anxiety Rating Scale (GARS) has been developed and validated in recent years in its English version. The main objectives of the present study were to psychometrically evaluate the German translation of the GARS concerning its reliability, factorial structure, and validity. Methods Three samples of participants were enrolled in the study. (1) A non-patient sample (n = 353) completed the GARS and a set of trait questionnaires to assess internal consistency, test-retest reliability, factorial structure, and concurrent and divergent validity. (2) A sample of patients with SAD (n = 33) was compared to a healthy control group (n = 30) regarding their scores on the GARS and the trait measures. Results The German GARS fear and avoidance scales exhibited excellent internal consistency and high stability over 2 and 4 months, as did the original version. The English version’s factorial structure was replicated, yielding two categories of situations: (1) everyday situations and (2) situations involving high evaluative threat. GARS fear and avoidance displayed convergent validity with trait measures of social anxiety and were markedly higher in patients with GSAD than in healthy controls. Fear and avoidance of eye contact in situations involving high levels of evaluative threat related more closely to social anxiety than to gaze anxiety in everyday situations. Conclusions The German version of the GARS has demonstrated reliability and validity similar to the original version, and is thus well suited to capture fear and avoidance of eye contact in different social situations as a valid self-report measure of social anxiety and related disorders in the social domain for use in both clinical practice and research. PMID:26937638
The German Version of the Gaze Anxiety Rating Scale (GARS): Reliability and Validity.
Domes, Gregor; Marx, Lisa; Spenthof, Ines; Heinrichs, Markus
2016-01-01
Fear of eye gaze and avoidance of eye contact are core features of social anxiety disorders (SAD). To measure self-reported fear and avoidance of eye gaze, the Gaze Anxiety Rating Scale (GARS) has been developed and validated in recent years in its English version. The main objectives of the present study were to psychometrically evaluate the German translation of the GARS concerning its reliability, factorial structure, and validity. Three samples of participants were enrolled in the study. (1) A non-patient sample (n = 353) completed the GARS and a set of trait questionnaires to assess internal consistency, test-retest reliability, factorial structure, and concurrent and divergent validity. (2) A sample of patients with SAD (n = 33) was compared to a healthy control group (n = 30) regarding their scores on the GARS and the trait measures. The German GARS fear and avoidance scales exhibited excellent internal consistency and high stability over 2 and 4 months, as did the original version. The English version's factorial structure was replicated, yielding two categories of situations: (1) everyday situations and (2) situations involving high evaluative threat. GARS fear and avoidance displayed convergent validity with trait measures of social anxiety and were markedly higher in patients with GSAD than in healthy controls. Fear and avoidance of eye contact in situations involving high levels of evaluative threat related more closely to social anxiety than to gaze anxiety in everyday situations. The German version of the GARS has demonstrated reliability and validity similar to the original version, and is thus well suited to capture fear and avoidance of eye contact in different social situations as a valid self-report measure of social anxiety and related disorders in the social domain for use in both clinical practice and research.
Verification of adolescent self-reported smoking.
Kentala, Jukka; Utriainen, Pekka; Pahkala, Kimmo; Mattila, Kari
2004-02-01
Smoking and the validity of information obtained on it is often questioned in view of the widespread belief that adolescents tend to under- or over-report the habit. The aim here was to verify smoking habits as reported in a questionnaire given in conjunction with dental examinations by asking participants directly whether they smoked or not and performing biochemical measurements of thiocyanate in the saliva and carbon monoxide in the expired air. The series consisted of 150 pupils in the ninth grade (age 15 years). The reports in the questionnaires seemed to provide a reliable estimate of adolescent smoking, the sensitivity of the method being 81-96%, specificity 77-95%. Biochemical verification or control of smoking proved needless in normal dental practice. Accepting information offered by the patient provides a good starting point for health education and work motivating and supporting of self-directed breaking of the habit.
Haywood, K L; Collin, S M; Crawley, E
2014-11-01
Chronic Fatigue Syndrome or Myalgic Encephalomyelitis (CFS/ME) in children is characterized by persistent or recurrent debilitating fatigue which results in a substantial reduction in activity. There is a growing interest in the use of questionnaires, or patient-reported outcome measures (PROMs), to assess how patients function and feel in relation to their health and associated healthcare. However, guidance for PROM selection for children with CFS/ME does not exist. We reviewed the quality and acceptability of PROMs used with children with CFS/ME to inform recommendations for practice. We conducted a systematic review of PROMs completed by children with CFS/ME. The quality of the evaluative studies and the reviewed measures were assessed against recommended criteria using an appraisal framework and the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist. We sought evidence of measurement (reliability, validity, responsiveness, interpretability, data quality) and practical properties (acceptability, relevance, feasibility). Sixteen articles were included in the review, providing evidence of reliability and/or validity for 13 PROMs. Of these, five were child-specific (one health-related quality-of-life; four emotional well-being) and eight were not (four emotional well-being, three fatigue-specific; and one generic). All measures had limited evidence of measurement properties and no evidence of practical properties. Recommendations for patient-reported assessment are difficult to make because of limited evidence of the quality and acceptability of PROMs for children with CFS/ME. The appraisal method highlighted significant methodological and quality issues which must be addressed in future research. There is a lack of qualitative evidence describing the outcomes of healthcare that are important to children with CFS/ME, and the relevance or appropriateness of available measures. Future PROM development and evaluation in this group must seek to involve children collaboratively to ensure that the outcomes that children care about are assessed in an acceptable way. © 2014 John Wiley & Sons Ltd.
Harris, Deborah L; Weston, Philip J; Battin, Malcolm R; Harding, Jane E
2014-10-01
Neonatal hypoglycaemia is a common problem linked to both brain damage and death. There is controversy regarding both the definition of and best treatment for neonatal hypoglycaemia. To determine current management of neonatal hypoglycaemia within the Australian and New Zealand Neonatal Network (ANZNN). Four questionnaires were sent to the Director of each of the 45 nurseries within the ANZNN. The Director was asked to complete one questionnaire and give the remaining three to other doctors involved with the management of babies with hypoglycaemia in the nursery. One hundred and eighty surveys were sent and 127 were returned (71%), including at least one from each nursery. Almost all respondents (120, 94%) reported using a protocol to treat hypoglycaemia. Only 2 (2%) reported screening all babies for neonatal hypoglycaemia, with the remainder screening babies at risk. Only 67, (53%) reported that blood glucose levels were tested on an analyser generally considered to be reliable at low levels. Most respondents (99, 78%) reported the clinical threshold for treatment was <2.6 mmol/L. However, when provided with clinical scenarios, respondents reported a variety of interventions, including no treatment. Doctors within the ANZNN are consistent about definition and screening for neonatal hypoglycaemia. However, frequently, the diagnosis is made using unreliable analysers. There is also wide variation in treatment, suggesting a lack of reliable evidence on which to base practice. © 2009 The Authors. Journal compilation © 2009 Paediatrics and Child Health Division (Royal Australasian College of Physicians).
Extending the validity of the Feeding Practices and Structure Questionnaire.
Jansen, Elena; Mallan, Kimberley M; Daniels, Lynne A
2015-06-30
Feeding practices are commonly examined as potentially modifiable determinants of children's eating behaviours and weight status. Although a variety of questionnaires exist to assess different feeding aspects, many lack thorough reliability and validity testing. The Feeding Practices and Structure Questionnaire (FPSQ) is a tool designed to measure early feeding practices related to non-responsive feeding and structure of the meal environment. Face validity, factorial validity, internal reliability and cross-sectional correlations with children's eating behaviours have been established in mothers with 2-year-old children. The aim of the present study was to further extend the validity of the FPSQ by examining factorial, construct and predictive validity, and stability. Participants were from the NOURISH randomised controlled trial which evaluated an intervention with first-time mothers designed to promote protective feeding practices. Maternal feeding practices (FP) and child eating behaviours were assessed when children were aged 2 years and 3.7 years (n = 388). Confirmatory Factor analysis, group differences, predictive relationships, and stability were tested. The original 9-factor structure was confirmed when children were aged 3.7 ± 0.3 years. Cronbach's alpha was above the recommended 0.70 cut-off for all factors except Structured Meal Timing, Over Restriction and Distrust in Appetite which were 0.58, 0.67 and 0.66 respectively. Allocated group differences reflected behaviour consistent with intervention content and all feeding practices were stable across both time points (range of r = 0.45-0.70). There was some evidence for the predictive validity of factors with 2 FP showing expected relationships, 2 FP showing expected and unexpected relationships and 5 FP showing no relationship. Reliability and validity was demonstrated for most subscales of the FPSQ. Future validation is warranted with culturally diverse samples and with fathers and other caregivers. The use of additional outcomes to further explore predictive validity is recommended as well as testing test-retest reliability of the questionnaire.
Federal Register 2010, 2011, 2012, 2013, 2014
2010-02-03
..., EPRI/NRC- RES Fire Human Reliability Analysis Guidelines, Draft Report for Comment AGENCY: Nuclear... Human Reliability Analysis Guidelines, Draft Report for Comment'' (December 11, 2009; 74 FR 65810). This... Human Reliability Analysis Guidelines'' is available electronically under ADAMS Accession Number...
18 CFR 39.11 - Reliability reports.
Code of Federal Regulations, 2010 CFR
2010-04-01
... Electric Reliability Organization shall conduct assessments of the adequacy of the Bulk-Power System in... assessments as determined by the Commission of the reliability of the Bulk-Power System in North America and... 18 Conservation of Power and Water Resources 1 2010-04-01 2010-04-01 false Reliability reports. 39...
ERIC Educational Resources Information Center
Thompson, Bruce
Researchers too frequently consider the reliability of the scores they analyze, and this may lead to incorrect conclusions. Practice in this regard may be negatively influenced by telegraphic habits of speech implying that tests possess reliability and other measurement characteristics. Styles of speaking in journal articles, in textbooks, and in…
McGinley, Jennifer L; Goldie, Patricia A; Greenwood, Kenneth M; Olney, Sandra J
2003-02-01
Physical therapists routinely observe gait in clinical practice. The purpose of this study was to determine the accuracy and reliability of observational assessments of push-off in gait after stroke. Eighteen physical therapists and 11 subjects with hemiplegia following a stroke participated in the study. Measurements of ankle power generation were obtained from subjects following stroke using a gait analysis system. Concurrent videotaped gait performances were observed by the physical therapists on 2 occasions. Ankle power generation at push-off was scored as either normal or abnormal using two 11-point rating scales. These observational ratings were correlated with the measurements of peak ankle power generation. A high correlation was obtained between the observational ratings and the measurements of ankle power generation (mean Pearson r=.84). Interobserver reliability was moderately high (mean intraclass correlation coefficient [ICC (2,1)]=.76). Intraobserver reliability also was high, with a mean ICC (2,1) of.89 obtained. Physical therapists were able to make accurate and reliable judgments of push-off in videotaped gait of subjects following stroke using observational assessment. Further research is indicated to explore the accuracy and reliability of data obtained with observational gait analysis as it occurs in clinical practice.
Sittig, Dean F; Gonzalez, Daniel; Singh, Hardeep
2014-11-01
Reliable health information technology (HIT) in general, and electronic health record systems (EHRs) in particular are essential to a high-performing healthcare system. When the availability of EHRs are disrupted, alternative methods must be used to maintain the continuity of healthcare. We developed a survey to assess institutional practices to handle situations when EHRs were unavailable for use (downtime preparedness). We used literature reviews and expert opinion to develop items that assessed the implementation of potentially useful practices. We administered the survey to U.S.-based healthcare institutions that were members of a professional organization that focused on collaboration and sharing of HIT-related best practices among its members. All members were large integrated health systems. We received responses from 50 of the 59 (84%) member institutions. Nearly all (96%) institutions reported at least one unplanned downtime (of any length) in the last 3 years and 70% had at least one unplanned downtime greater than 8h in the last 3 years. Three institutions reported that one or more patients were injured as a result of either a planned or unplanned downtime. The majority of institutions (70-85%) had implemented a portion of the useful practices we identified, but very few practices were followed by all organizations. Unexpected downtimes related to EHRs appear to be fairly common among institutions in our survey. Most institutions had only partially implemented comprehensive contingency plans to maintain safe and effective healthcare during unexpected EHRs downtimes. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Byram, Jessica N; Seifert, Mark F; Brooks, William S; Fraser-Cotlin, Laura; Thorp, Laura E; Williams, James M; Wilson, Adam B
2017-03-01
With integrated curricula and multidisciplinary assessments becoming more prevalent in medical education, there is a continued need for educational research to explore the advantages, consequences, and challenges of integration practices. This retrospective analysis investigated the number of items needed to reliably assess anatomical knowledge in the context of gross anatomy and histology. A generalizability analysis was conducted on gross anatomy and histology written and practical examination items that were administered in a discipline-based format at Indiana University School of Medicine and in an integrated fashion at the University of Alabama School of Medicine and Rush University Medical College. Examination items were analyzed using a partially nested design s×(i:o) in which items were nested within occasions (i:o) and crossed with students (s). A reliability standard of 0.80 was used to determine the minimum number of items needed across examinations (occasions) to make reliable and informed decisions about students' competence in anatomical knowledge. Decision study plots are presented to demonstrate how the number of items per examination influences the reliability of each administered assessment. Using the example of a curriculum that assesses gross anatomy knowledge over five summative written and practical examinations, the results of the decision study estimated that 30 and 25 items would be needed on each written and practical examination to reach a reliability of 0.80, respectively. This study is particularly relevant to educators who may question whether the amount of anatomy content assessed in multidisciplinary evaluations is sufficient for making judgments about the anatomical aptitude of students. Anat Sci Educ 10: 109-119. © 2016 American Association of Anatomists. © 2016 American Association of Anatomists.
Goetz, Katja; Hasse, Philipp; Szecsenyi, Joachim; Campbell, Stephen M
2016-04-01
The consideration of organisational aspects, such as shared goals and clear communication, within the health care team is important to ensure good quality care. In primary health care, the instrument Survey of Organizational Attributes for Primary Care (SOAPC) is available to measure organisational attributes of care. However, there is no instrument available for dental care. The aim of the present study was to investigate psychometric properties and test-retest reliability of the version of SOAPC adapted for dental care, namely the Survey of Organizational Attributes in Dental Care (SOADC). The SOADC consists of 21 items in the following four subscales: communication; decision making; stress/chaos; and history of change. Convergent construct validity was measured using the job satisfaction scale. A total of 287 dental-care practices were asked to participate in the validation study. Psychometric properties and test-retest reliability were observed. A total of 43 dental-care practices responded to the survey. At baseline, 178 dental-care staff completed the questionnaire, and 4 weeks later 138 did so. Internal consistency, measured by Cronbach's alpha, was 0.718 or higher in the subscales. The test-retest reliability for each subscale and the overall SOADC score demonstrated good correlations over the 4-week test-retest interval, except for 'history of change'. A strong correlation with the aggregated job-satisfaction scale showed high convergent construct validity of SOADC. The consideration of organisational aspects from the perspective of dental-care teams is important for providing good quality of care. The SOADC is a reliable instrument with good psychometric properties and is suitable for the evaluation of organisational attributes in dental-care practices. © 2015 FDI World Dental Federation.
Morita, Tatsuya; Murata, Hisayuki; Hirai, Kei; Tamura, Keiko; Kataoka, Jun; Ohnishi, Hideki; Akizuki, Nobuya; Kurihara, Yukie; Akechi, Tatsuo; Uchitomi, Yosuke
2007-08-01
Recent empirical studies revealed that fostering patients' perception of meaning in their life is an essential task for palliative care clinicians. However, few studies have reported the effects of training programs for nurses specifically aimed at improving skills to relieve the meaninglessness of terminally ill cancer patients, and we have had no specific measurement instruments. The primary aims of this study were 1) to validate measurement tools to quantify nurses' self-reported practice and attitudes toward caring for terminally ill cancer patients feeling meaninglessness and 2) to explore the effects of the five-hour educational workshop focusing on meaninglessness on nurses' self-reported practice, attitudes toward caring for such patients, confidence, burnout, death anxiety, and meaning of life. A quasi-experimental pre-post questionnaire survey was performed on 147 nurses. The questionnaire was distributed before the intervention workshop and one and six months after. The workshop consisted of lecture, role-play, and the exercise of assessment and care planning based on two vignette verbatim records. First, using the first questionnaire sample and an additional sample of 20 nurses for the test-retest examination, we validated a six-item Self-Reported Practice scale, and an eight-item Attitudes Toward Caring for Patients Feeling Meaninglessness scale with three subscales (Willingness to Help, Positive Appraisal, and Helplessness). The nurses also completed a scale to assess confidence in caring for terminally ill patients with meaninglessness, the Maslach Burnout Inventory, the Death Attitude Inventory, the Frommelt Attitudes Toward Care of the Dying scale, the Self-Reported Practice Score in General Communication, and the three pain-related items from the Palliative Care Quiz for Nursing. For the Self-Reported Practice scale and the subscales of the Attitudes Toward Caring for Patients Feeling Meaninglessness scale, the Cronbach's alpha coefficients were 0.63-0.91, and the intra-class correlations were 0.89-0.94. The Self-Reported Practice scale significantly, but moderately, correlated with the Self-Reported Practice Score in General Communication (P=0.41). The Willingness to Help and Helplessness subscales significantly but weakly correlated with the Frommelt scale (P=-0.27, 0.21). Both scales did not correlate or minimally correlated with the Palliative Care Quiz for Nursing (P<0.20). The construct validity was confirmed using factor analysis. At the follow-up, of 147 nurses who participated in this workshop, 91 (62%) and 80 (54%) nurses responded. Self-reported practice and confidence significantly improved, whereas helplessness, emotional exhaustion, and death anxiety significantly decreased. The percentages of nurses who evaluated this program as "useful" or "very useful" were 79% (to understand the conceptual framework in caring for terminally ill patients with meaninglessness), 73% (to help in self-disclosing nurses' personal beliefs, values, and life goals), and 80% (to help in learning how to provide care for patients with meaninglessness). The Self-Reported Practice scale and the Attitudes Toward Caring for Patients Feeling Meaninglessness scale are reliable and valid tools to specifically quantify nurses' self-reported practice and attitudes toward caring for terminally ill cancer patients feeling meaninglessness of life. The five-hour workshop appeared to have a modest but significant beneficial effect on nurse-reported practice, attitudes, and confidence in providing care for terminally ill cancer patients feeling meaninglessness. Further educational intervention trials with control groups are promising.
Lima, Viviane Dias; Andia, Irene; Kabakyenga, Jerome; Mbabazi, Pamela; Emenyonu, Nneka; Patterson, Thomas L.; Hogg, Robert S.; Bangsberg, David R.
2013-01-01
The objective of this study was to develop a reliable HAART optimism scale among HIV-positive women in Uganda and to test the scale’s validity against measures of fertility intentions, sexual activity, and unprotected sexual intercourse. We used cross-sectional survey data of 540 women (18–50 years) attending Mbarara University’s HIV clinic in Uganda. Women were asked how much they agreed or disagreed with 23 statements about HAART. Data were subjected to a principal components and factor analyses. Subsequently, we tested the association between the scale and fertility intentions and sexual behaviour using Wilcoxon rank sum test. Factor analysis yielded three factors, one of which was an eight-item HAART optimism scale with moderately high internal consistency (α = 0.70). Women who reported that they intended to have (more) children had significantly higher HAART optimism scores (median = 13.5 [IQR: 12–16]) than women who did not intend to have (more) children (median = 10.5 [IQR: 8–12]; P <0.0001). Similarly, women who were sexually active and who reported practicing unprotected sexual intercourse had significantly higher HAART optimism scores than women who were sexually abstinent or who practiced protected sexual intercourse. Our reliable and valid scale, termed the Women’s HAART Optimism Monitoring and EvaluatioN scale (WHOMEN’s scale), may be valuable to broader studies investigating the role of HAART optimism on reproductive intentions and sexual behaviours of HIV-positive women in high HIV prevalence settings. PMID:19387819
Tsukayama, Hiroshi
2008-01-01
Evidence-based approach on the safety of acupuncture had been lagging behind both in the West and the East, but reliable data based on some prospective surveys were published after the late 1990s. In the present article, we, focusing on ‘Japanese acupuncture’, review relevant case reports and prospective surveys on adverse events in Japan, assess the safety of acupuncture practice in this country, and suggest a strategy for reducing the therapists’ error. Based on the prospective surveys, it seems reasonable to suppose that serious adverse events are rare in standard practice by adequately trained acupuncturists, regardless of countries or modes of practice. Almost all of adverse reactions commonly seen in acupuncture practice—such as fatigue, drowsiness, aggravation, minor bleeding, pain on insertion and subcutaneous hemorrhage—are mild and transient, although we should be cautious of secondary injury following drowsiness and needle fainting. After demonstrating that acupuncture is inherently safe, we have been focusing on how to reduce the risk of negligence in Japan, as well as educating acupuncturists more about safe depth of insertion and infection control. Incident reporting and feedback system is a useful strategy for reducing therapist errors such as forgotten needles. For the benefit of acupuncture patients in Japan, it is important to establish mandatory postgraduate clinical training and continued education system. PMID:18955234
de Jong, Marin J; Huibregtse, Roxanne; Masclee, Ad A M; Jonkers, Daisy M A E; Pierik, Marie J
2018-05-01
Mucosal inflammation must be carefully monitored to improve the long-term outcomes of patients with inflammatory bowel diseases (IBD). Patient-reported outcome measures (PROMs) are used increasingly to monitor disease activity in clinical practice and as endpoints in clinical trials. We performed a systematic review to provide an overview of the available PROMs on IBD activity and to evaluate their diagnostic value. A systematic search of the PubMed, Medline, Cochrane library, and Embase databases using defined keywords, identified 973 articles. These were screened by 2 independent reviewers, and 37 articles on development or validation of PROMs to assess IBD activity were identified for further analysis. Based on the recommendations of the Food and Drug Administration (FDA), the following measurement properties were evaluated: content, construct, and criterion validity; reliability; and responsiveness to change. In addition, data on ease of use in clinical practice were collected. Seventeen articles presenting 20 different PROMs were included the final analysis, although none met all the FDA-recommended criteria. Only 2 PROMs (patient-reported Harvey Bradshaw Index and Simple Clinical Colitis Activity Index scores) reported patient involvement during its development. Only 6 PROMs (patient-reported global assessment, patient assessment of disease activity, mobile health index for Crohn's disease, mobile health index for ulcerative colitis, patient-reported outcome derived from the Mayo score, and the 6-point Mayo score) were validated as markers of IBD activity, using findings from endoscopy as the reference standard; these PROMs identified patients with mucosal inflammation with area under the curve values of 0.63-0.82. The mobile health index for CD and UC scores had the best measurement properties for use in clinical practice and in clinical trials. In a systematic review, we identified more than 20 PROMS that have been developed and tested for their ability to determine IBD activity. Further studies are needed to determine their accuracy and whether they can be used effectively in routine practice, clinical trials, telemedicine systems, and value-based healthcare programs. Copyright © 2018 AGA Institute. Published by Elsevier Inc. All rights reserved.
Is self-reported height or arm span a more accurate alternative measure of height?
Brown, Jean K; Feng, Jui-Ying; Knapp, Thomas R
2002-11-01
The purpose of this study was to determine whether self-reported height or arm span is the more accurate alternative measure of height. A sample of 409 people between the ages of 19 and 67 (M = 35.0) participated in this anthropometric study. Height, self-reported height, and arm span were measured by 82 nursing research students. Mean differences from criterion measures were 0.17 cm for the measuring rules, 0.47 cm for arm span, and 0.85 cm and 0.87 cm for heights. Test-retest reliability was r = .997 for both height and arm span. The relationships of height to self-reported height and arm span were r = .97 and .90, respectively. Mean absolute differences were 1.80 cm and 4.29 cm, respectively. These findings support the practice of using self-reported height as an alternative measure of measured height in clinical settings, but arm span is an accurate alternative when neither measured height nor self-reported height is obtainable.
A conflict management scale for pharmacy.
Austin, Zubin; Gregory, Paul A; Martin, Craig
2009-11-12
To develop and establish the validity and reliability of a conflict management scale specific to pharmacy practice and education. A multistage inventory-item development process was undertaken involving 93 pharmacists and using a previously described explanatory model for conflict in pharmacy practice. A 19-item inventory was developed, field tested, and validated. The conflict management scale (CMS) demonstrated an acceptable degree of reliability and validity for use in educational or practice settings to promote self-reflection and self-awareness regarding individuals' conflict management styles. The CMS provides a unique, pharmacy-specific method for individuals to determine and reflect upon their own conflict management styles. As part of an educational program to facilitate self-reflection and heighten self-awareness, the CMS may be a useful tool to promote discussions related to an important part of pharmacy practice.
The reliability-quality relationship for quality systems and quality risk management.
Claycamp, H Gregg; Rahaman, Faiad; Urban, Jason M
2012-01-01
Engineering reliability typically refers to the probability that a system, or any of its components, will perform a required function for a stated period of time and under specified operating conditions. As such, reliability is inextricably linked with time-dependent quality concepts, such as maintaining a state of control and predicting the chances of losses from failures for quality risk management. Two popular current good manufacturing practice (cGMP) and quality risk management tools, failure mode and effects analysis (FMEA) and root cause analysis (RCA) are examples of engineering reliability evaluations that link reliability with quality and risk. Current concepts in pharmaceutical quality and quality management systems call for more predictive systems for maintaining quality; yet, the current pharmaceutical manufacturing literature and guidelines are curiously silent on engineering quality. This commentary discusses the meaning of engineering reliability while linking the concept to quality systems and quality risk management. The essay also discusses the difference between engineering reliability and statistical (assay) reliability. The assurance of quality in a pharmaceutical product is no longer measured only "after the fact" of manufacturing. Rather, concepts of quality systems and quality risk management call for designing quality assurance into all stages of the pharmaceutical product life cycle. Interestingly, most assays for quality are essentially static and inform product quality over the life cycle only by being repeated over time. Engineering process reliability is the fundamental concept that is meant to anticipate quality failures over the life cycle of the product. Reliability is a well-developed theory and practice for other types of manufactured products and manufacturing processes. Thus, it is well known to be an appropriate index of manufactured product quality. This essay discusses the meaning of reliability and its linkages with quality systems and quality risk management.
Making intelligent systems team players: Overview for designers
NASA Technical Reports Server (NTRS)
Malin, Jane T.; Schreckenghost, Debra L.
1992-01-01
This report is a guide and companion to the NASA Technical Memorandum 104738, 'Making Intelligent Systems Team Players,' Volumes 1 and 2. The first two volumes of this Technical Memorandum provide comprehensive guidance to designers of intelligent systems for real-time fault management of space systems, with the objective of achieving more effective human interaction. This report provides an analysis of the material discussed in the Technical Memorandum. It clarifies what it means for an intelligent system to be a team player, and how such systems are designed. It identifies significant intelligent system design problems and their impacts on reliability and usability. Where common design practice is not effective in solving these problems, we make recommendations for these situations. In this report, we summarize the main points in the Technical Memorandum and identify where to look for further information.
Best Practices for Real Property Management
2016-04-21
organizations and are experienced in creating reliable and actionable solutions to complex management issues guided by proven best business practices...DEFENSE BUSINESS BOARD For the Secretary of Defense Best Practices for Real Property Management DBB...FY16-02 Recommendations on the best practices for management of Department of Defense real property assets Defense Business
Describing the role of Australian community pharmacists in oral healthcare.
Taing, Meng-Wong; Ford, Pauline J; Gartner, Coral E; Freeman, Christopher R
2016-08-01
To investigate community pharmacist's attitudes, beliefs and practices towards oral health in the Australian setting, describe the frequency and nature of consumer enquiries relating to oral health, and gain insight regarding smoking cessation support for people experiencing oral health problems. An online questionnaire was developed based on previous research, validated to ensure accuracy and reliability, and convenience sampling used to advertise across major pharmacy organisational websites and newsletters to maximise community pharmacist responses. One hundred and forty-four valid community pharmacist responses were descriptively analysed. The majority of pharmacists (93%) believed it was their role to deliver oral health advice in the community and almost all (97%) pharmacists believed further education would benefit their practice. The top four consumer enquiries pharmacists reported confidence in handling related to analgesic medication to relieve oral-related pain (95.8%), mouth ulcers (95.1%), oral thrush (94.4%) and toothache (93.8%); and the most frequently reported consumer enquires were those where the majority of pharmacists reported high confidence in handling. A small proportion of pharmacists (8%) always enquired about patient smoking status, and nearly all pharmacists (97%) desired further education and training to benefit their practice in oral healthcare. This study highlights that Australian pharmacists have an important role in oral health and there is opportunity to enhance this role, and address risk factors such as smoking with further training, support and education. The findings from this study can guide future research into the development of appropriate training programmes, standards, and best oral healthcare practices for Australian pharmacists. © 2016 Royal Pharmaceutical Society.
Helfrich, Christian D; Li, Yu-Fang; Sharp, Nancy D; Sales, Anne E
2009-01-01
Background The Promoting Action on Research Implementation in Health Services, or PARIHS, framework is a theoretical framework widely promoted as a guide to implement evidence-based clinical practices. However, it has as yet no pool of validated measurement instruments that operationalize the constructs defined in the framework. The present article introduces an Organizational Readiness to Change Assessment instrument (ORCA), organized according to the core elements and sub-elements of the PARIHS framework, and reports on initial validation. Methods We conducted scale reliability and factor analyses on cross-sectional, secondary data from three quality improvement projects (n = 80) conducted in the Veterans Health Administration. In each project, identical 77-item ORCA instruments were administered to one or more staff from each facility involved in quality improvement projects. Items were organized into 19 subscales and three primary scales corresponding to the core elements of the PARIHS framework: (1) Strength and extent of evidence for the clinical practice changes represented by the QI program, assessed with four subscales, (2) Quality of the organizational context for the QI program, assessed with six subscales, and (3) Capacity for internal facilitation of the QI program, assessed with nine subscales. Results Cronbach's alpha for scale reliability were 0.74, 0.85 and 0.95 for the evidence, context and facilitation scales, respectively. The evidence scale and its three constituent subscales failed to meet the conventional threshold of 0.80 for reliability, and three individual items were eliminated from evidence subscales following reliability testing. In exploratory factor analysis, three factors were retained. Seven of the nine facilitation subscales loaded onto the first factor; five of the six context subscales loaded onto the second factor; and the three evidence subscales loaded on the third factor. Two subscales failed to load significantly on any factor. One measured resources in general (from the context scale), and one clinical champion role (from the facilitation scale). Conclusion We find general support for the reliability and factor structure of the ORCA. However, there was poor reliability among measures of evidence, and factor analysis results for measures of general resources and clinical champion role did not conform to the PARIHS framework. Additional validation is needed, including criterion validation. PMID:19594942
Helfrich, Christian D; Li, Yu-Fang; Sharp, Nancy D; Sales, Anne E
2009-07-14
The Promoting Action on Research Implementation in Health Services, or PARIHS, framework is a theoretical framework widely promoted as a guide to implement evidence-based clinical practices. However, it has as yet no pool of validated measurement instruments that operationalize the constructs defined in the framework. The present article introduces an Organizational Readiness to Change Assessment instrument (ORCA), organized according to the core elements and sub-elements of the PARIHS framework, and reports on initial validation. We conducted scale reliability and factor analyses on cross-sectional, secondary data from three quality improvement projects (n = 80) conducted in the Veterans Health Administration. In each project, identical 77-item ORCA instruments were administered to one or more staff from each facility involved in quality improvement projects. Items were organized into 19 subscales and three primary scales corresponding to the core elements of the PARIHS framework: (1) Strength and extent of evidence for the clinical practice changes represented by the QI program, assessed with four subscales, (2) Quality of the organizational context for the QI program, assessed with six subscales, and (3) Capacity for internal facilitation of the QI program, assessed with nine subscales. Cronbach's alpha for scale reliability were 0.74, 0.85 and 0.95 for the evidence, context and facilitation scales, respectively. The evidence scale and its three constituent subscales failed to meet the conventional threshold of 0.80 for reliability, and three individual items were eliminated from evidence subscales following reliability testing. In exploratory factor analysis, three factors were retained. Seven of the nine facilitation subscales loaded onto the first factor; five of the six context subscales loaded onto the second factor; and the three evidence subscales loaded on the third factor. Two subscales failed to load significantly on any factor. One measured resources in general (from the context scale), and one clinical champion role (from the facilitation scale). We find general support for the reliability and factor structure of the ORCA. However, there was poor reliability among measures of evidence, and factor analysis results for measures of general resources and clinical champion role did not conform to the PARIHS framework. Additional validation is needed, including criterion validation.
Stretchable and high-performance supercapacitors with crumpled graphene papers.
Zang, Jianfeng; Cao, Changyong; Feng, Yaying; Liu, Jie; Zhao, Xuanhe
2014-10-01
Fabrication of unconventional energy storage devices with high stretchability and performance is challenging, but critical to practical operations of fully power-independent stretchable electronics. While supercapacitors represent a promising candidate for unconventional energy-storage devices, existing stretchable supercapacitors are limited by their low stretchability, complicated fabrication process, and high cost. Here, we report a simple and low-cost method to fabricate extremely stretchable and high-performance electrodes for supercapacitors based on new crumpled-graphene papers. Electrolyte-mediated-graphene paper bonded on a compliant substrate can be crumpled into self-organized patterns by harnessing mechanical instabilities in the graphene paper. As the substrate is stretched, the crumpled patterns unfold, maintaining high reliability of the graphene paper under multiple cycles of large deformation. Supercapacitor electrodes based on the crumpled graphene papers exhibit a unique combination of high stretchability (e.g., linear strain ~300%, areal strain ~800%), high electrochemical performance (e.g., specific capacitance ~196 F g(-1)), and high reliability (e.g., over 1000 stretch/relax cycles). An all-solid-state supercapacitor capable of large deformation is further fabricated to demonstrate practical applications of the crumpled-graphene-paper electrodes. Our method and design open a wide range of opportunities for manufacturing future energy-storage devices with desired deformability together with high performance.
Stretchable and High-Performance Supercapacitors with Crumpled Graphene Papers
NASA Astrophysics Data System (ADS)
Zang, Jianfeng; Cao, Changyong; Feng, Yaying; Liu, Jie; Zhao, Xuanhe
2014-10-01
Fabrication of unconventional energy storage devices with high stretchability and performance is challenging, but critical to practical operations of fully power-independent stretchable electronics. While supercapacitors represent a promising candidate for unconventional energy-storage devices, existing stretchable supercapacitors are limited by their low stretchability, complicated fabrication process, and high cost. Here, we report a simple and low-cost method to fabricate extremely stretchable and high-performance electrodes for supercapacitors based on new crumpled-graphene papers. Electrolyte-mediated-graphene paper bonded on a compliant substrate can be crumpled into self-organized patterns by harnessing mechanical instabilities in the graphene paper. As the substrate is stretched, the crumpled patterns unfold, maintaining high reliability of the graphene paper under multiple cycles of large deformation. Supercapacitor electrodes based on the crumpled graphene papers exhibit a unique combination of high stretchability (e.g., linear strain ~300%, areal strain ~800%), high electrochemical performance (e.g., specific capacitance ~196 F g-1), and high reliability (e.g., over 1000 stretch/relax cycles). An all-solid-state supercapacitor capable of large deformation is further fabricated to demonstrate practical applications of the crumpled-graphene-paper electrodes. Our method and design open a wide range of opportunities for manufacturing future energy-storage devices with desired deformability together with high performance.
Development and Psychometric Testing of the Iceland-Family Illness Beliefs Questionnaire.
Gisladottir, Margret; Svavarsdottir, Erla Kolbrun
2016-08-01
Illness beliefs affect how individuals and families deal with illness. A valid and reliable instrument has not yet been developed to measure "illness beliefs" in family nursing research and clinical practice. This article describes the purpose, reliability, validity, and the potential clinical and research applications of a new instrument, the Iceland-Family Illness Beliefs Questionnaire (ICE-FIBQ). The ICE-FIBQ is a short, self-report measure of an individual's beliefs about illness. Drawing from an advanced nursing practice model called the Illness Beliefs Model, the instrument was developed to measure illness beliefs about (a) cause of illness, that is, etiology; (b) control of illness on family and control of family on illness; (c) effect of illness on the individual and family; (d) illness suffering; and (e) support received from health care professionals during illness. The instrument was tested on 139 family caregivers of adolescents/youth with an illness or a disorder. Exploratory factor analysis reduced the original questionnaire from eight to seven items with a one-factor solution (Cronbach's α = .780). Confirmatory factor analysis supported the one-factor solution (Cronbach's α = .789). Further research is needed to determine concurrent validity with other illness belief/illness perception scales and if the instrument is sensitive to capture change in illness beliefs following family nursing intervention. © The Author(s) 2016.
Choi, Keum-Hyeong; Buskey, Wendy; Johnson, Bonita
2010-07-01
The main purpose of this study was to investigate how receiving personal counseling at a university counseling center helps students deal with their personal problems and facilitates academic functioning. To that end, this study used both clinical and academic outcome measures that are relevant to the practice of counseling provided at a counseling center and its unique function in an institution of higher education. In addition, this study used the clinical significance methodology (N. S. Jacobson & P. Truax, 1991) that takes into account clients' differences in making clinically reliable and significant change. Pre-intake and post-termination surveys, including the Outcome Questionnaire (M. J. Lambert, K. Lunnen, V. Umphress, N. Hansen, & G. Burlingame, 1994), were completed by 78 clients, and the responses were analyzed using clinical significance methodology. The results revealed that those who made clinically reliable and significant change (i.e., the recovered group) reported the highest level of improvement in academic commitment to their educational goals and problem resolution, compared with those who did not make clinically significant change. The implications of the findings on practice for counseling at university counseling centers and for administrators in higher education institutions are discussed. (c) 2010 APA, all rights reserved.
Testing the reliability and validity of a measure of safety climate.
Anderson, E; McGovern, P M; Kochevar, L; Vesley, D; Gershon, R
2000-01-01
The lack of compliance with universal precautions (UP) is well documented across a wide variety of healthcare professions and has been reported both before and after the enactment of the Occupational Safety and Health Administration's Bloodborne Pathogens Standard. Gershon, Karkashian, and Felknor (1994) found that several factors correlated significantly with healthcare workers' lack of compliance with UP, including a measure of organizational safety climate (e.g., the employees' perception of their organizational culture and practices regarding safety). We conducted a secondary analysis using data from a cross-sectional survey of a convenience sample of 1,746 healthcare workers at risk of occupational exposure to bloodborne pathogens to assess the validity and reliability of Gershon's measure of safety climate. Findings revealed no relationship between safety climate and employees' gender, age, education, tenure in position, profession, hours worked per day, perceived risk, attitude toward risk, and training. An association was demonstrated between safety climate and (1) healthcare worker compliance with UP and (2) the availability of personal protective equipment, providing support for the construct validity of this measure of safety climate. These findings could be used by occupational health professionals to assess employees' perceptions of the safety culture and practices in the workplace and to guide the institution's risk management efforts in association with U.P.
Stretchable and High-Performance Supercapacitors with Crumpled Graphene Papers
Zang, Jianfeng; Cao, Changyong; Feng, Yaying; Liu, Jie; Zhao, Xuanhe
2014-01-01
Fabrication of unconventional energy storage devices with high stretchability and performance is challenging, but critical to practical operations of fully power-independent stretchable electronics. While supercapacitors represent a promising candidate for unconventional energy-storage devices, existing stretchable supercapacitors are limited by their low stretchability, complicated fabrication process, and high cost. Here, we report a simple and low-cost method to fabricate extremely stretchable and high-performance electrodes for supercapacitors based on new crumpled-graphene papers. Electrolyte-mediated-graphene paper bonded on a compliant substrate can be crumpled into self-organized patterns by harnessing mechanical instabilities in the graphene paper. As the substrate is stretched, the crumpled patterns unfold, maintaining high reliability of the graphene paper under multiple cycles of large deformation. Supercapacitor electrodes based on the crumpled graphene papers exhibit a unique combination of high stretchability (e.g., linear strain ~300%, areal strain ~800%), high electrochemical performance (e.g., specific capacitance ~196 F g−1), and high reliability (e.g., over 1000 stretch/relax cycles). An all-solid-state supercapacitor capable of large deformation is further fabricated to demonstrate practical applications of the crumpled-graphene-paper electrodes. Our method and design open a wide range of opportunities for manufacturing future energy-storage devices with desired deformability together with high performance. PMID:25270673
Park, Heehoon; Ebesutani, Chad K; Chung, Kyong-Mee; Stanick, Cameo
2018-01-01
The objective of this study was to create the Korean version of the Modified Practice Attitudes Scale (K-MPAS) to measure clinicians' attitudes toward evidence-based treatments (EBTs) in the Korean mental health system. Using 189 U.S. therapists and 283 members from the Korean mental health system, we examined the reliability and validity of the MPAS scores. We also conducted the first exploratory and confirmatory factor analysis on the MPAS and compared EBT attitudes across U.S. and Korean therapists. Results revealed that the inclusion of both "reversed-worded" and "non-reversed-worded" items introduced significant method effects that compromised the integrity of the one-factor MPAS model. Problems with the one-factor structure were resolved by eliminating the "non-reversed-worded" items. Reliability and validity were adequate among both Korean and U.S. therapists. Korean therapists also reported significantly more negative attitudes toward EBTs on the MPAS than U.S. therapists. The K-MPAS is the first questionnaire designed to measure Korean service providers' attitudes toward EBTs to help advance the dissemination of EBTs in Korea. The current study also demonstrated the negative impacts that can be introduced by incorporating oppositely worded items into a scale, particularly with respect to factor structure and detecting significant group differences.
The reliability and stability of visual working memory capacity.
Xu, Z; Adam, K C S; Fang, X; Vogel, E K
2018-04-01
Because of the central role of working memory capacity in cognition, many studies have used short measures of working memory capacity to examine its relationship to other domains. Here, we measured the reliability and stability of visual working memory capacity, measured using a single-probe change detection task. In Experiment 1, the participants (N = 135) completed a large number of trials of a change detection task (540 in total, 180 each of set sizes 4, 6, and 8). With large numbers of both trials and participants, reliability estimates were high (α > .9). We then used an iterative down-sampling procedure to create a look-up table for expected reliability in experiments with small sample sizes. In Experiment 2, the participants (N = 79) completed 31 sessions of single-probe change detection. The first 30 sessions took place over 30 consecutive days, and the last session took place 30 days later. This unprecedented number of sessions allowed us to examine the effects of practice on stability and internal reliability. Even after much practice, individual differences were stable over time (average between-session r = .76).
Reliability of concussion history in former professional football players.
Kerr, Zachary Y; Marshall, Stephen W; Guskiewicz, Kevin M
2012-03-01
The reliability of athletes to recall and self-report a concussion history has never been quantified. This study examined the reliability of the self-report concussion history measure and explored determinants of recall in the number of self-reported concussions in a group of retired professional football players. In 2001, a short questionnaire was administered to a cohort of former professional football players to ascertain the number of self-reported concussions they sustained during their professional playing careers. In 2010, the same instrument was readministered to a subset (n = 899) of the original cohort to assess reliability. Overall reliability was moderate (weighted Cohen κ = 0.48). The majority (62.1%) reported the same number of concussions in both administrations (2001 and 2010); 31.4% reported more concussions in the second administration. Compared with the "same number reported" group, the "greater number reported" group had more deficits in the second administration in their Short Form 36 physical health (composite score combining physical functioning, role physical, bodily pain, general health) and mental health (e.g., composite score combining vitality, social functioning, role emotional) scales. The self-reported concussion history had moderate reliability in former professional football players, on the basis of two administrations of the same instrument, 9 yr apart. However, changes in health status may be differentially associated with recall of concussions.
Reducing the Risks of Firearm Violence in High Schools: Principals' Perceptions and Practices.
Price, James H; Khubchandani, Jagdish; Payton, Erica; Thompson, Amy
2016-04-01
This study assessed the perceptions and practices of a national sample of secondary school principals regarding reducing firearm violence in high schools. Data were collected via three-wave postal mailings. A 59-item valid and reliable questionnaire was mailed to a national random sample of 800 secondary school principals. Of the 349 principals (46 %) that responded, 17 % reported a firearm incident at their school in the past 5 years. Principals perceived inadequate parental monitoring (70 %), inadequate mental health services (64 %), peer harassment/bullying (59 %), and easy access to firearms (50 %) as the main causes of firearm violence in schools. The three barriers to implementing firearm violence prevention practices were: lack of expertise as to which practices to implement (33 %), lack of time (30 %), and lack of research as to which practices are most effective (30 %). Less than half of schools trained school personnel regarding firearm violence issues. The findings indicate that firearm incidents at schools may be more common than previously thought. A significant portion of principals are at a loss as to what to implement because of a lack of empirical evidence on what is effective. More research is needed to find the most effective school interventions for reducing firearm violence.
García-Ramos, Amador; Haff, Guy Gregory; Pestaña-Melero, Francisco Luis; Pérez-Castilla, Alejandro; Rojas, Francisco Javier; Balsalobre-Fernández, Carlos; Jaric, Slobodan
2017-09-05
This study compared the concurrent validity and reliability of previously proposed generalized group equations for estimating the bench press (BP) one-repetition maximum (1RM) with the individualized load-velocity relationship modelled with a two-point method. Thirty men (BP 1RM relative to body mass: 1.08 0.18 kg·kg -1 ) performed two incremental loading tests in the concentric-only BP exercise and another two in the eccentric-concentric BP exercise to assess their actual 1RM and load-velocity relationships. A high velocity (≈ 1 m·s -1 ) and a low velocity (≈ 0.5 m·s -1 ) was selected from their load-velocity relationships to estimate the 1RM from generalized group equations and through an individual linear model obtained from the two velocities. The directly measured 1RM was highly correlated with all predicted 1RMs (r range: 0.847-0.977). The generalized group equations systematically underestimated the actual 1RM when predicted from the concentric-only BP (P <0.001; effect size [ES] range: 0.15-0.94), but overestimated it when predicted from the eccentric-concentric BP (P <0.001; ES range: 0.36-0.98). Conversely, a low systematic bias (range: -2.3-0.5 kg) and random errors (range: 3.0-3.8 kg), no heteroscedasticity of errors (r 2 range: 0.053-0.082), and trivial ES (range: -0.17-0.04) were observed when the prediction was based on the two-point method. Although all examined methods reported the 1RM with high reliability (CV≤5.1%; ICC≥0.89), the direct method was the most reliable (CV<2.0%; ICC≥0.98). The quick, fatigue-free, and practical two-point method was able to predict the BP 1RM with high reliability and practically perfect validity, and therefore we recommend its use over generalized group equations.
Structural Reliability Analysis and Optimization: Use of Approximations
NASA Technical Reports Server (NTRS)
Grandhi, Ramana V.; Wang, Liping
1999-01-01
This report is intended for the demonstration of function approximation concepts and their applicability in reliability analysis and design. Particularly, approximations in the calculation of the safety index, failure probability and structural optimization (modification of design variables) are developed. With this scope in mind, extensive details on probability theory are avoided. Definitions relevant to the stated objectives have been taken from standard text books. The idea of function approximations is to minimize the repetitive use of computationally intensive calculations by replacing them with simpler closed-form equations, which could be nonlinear. Typically, the approximations provide good accuracy around the points where they are constructed, and they need to be periodically updated to extend their utility. There are approximations in calculating the failure probability of a limit state function. The first one, which is most commonly discussed, is how the limit state is approximated at the design point. Most of the time this could be a first-order Taylor series expansion, also known as the First Order Reliability Method (FORM), or a second-order Taylor series expansion (paraboloid), also known as the Second Order Reliability Method (SORM). From the computational procedure point of view, this step comes after the design point identification; however, the order of approximation for the probability of failure calculation is discussed first, and it is denoted by either FORM or SORM. The other approximation of interest is how the design point, or the most probable failure point (MPP), is identified. For iteratively finding this point, again the limit state is approximated. The accuracy and efficiency of the approximations make the search process quite practical for analysis intensive approaches such as the finite element methods; therefore, the crux of this research is to develop excellent approximations for MPP identification and also different approximations including the higher-order reliability methods (HORM) for representing the failure surface. This report is divided into several parts to emphasize different segments of the structural reliability analysis and design. Broadly, it consists of mathematical foundations, methods and applications. Chapter I discusses the fundamental definitions of the probability theory, which are mostly available in standard text books. Probability density function descriptions relevant to this work are addressed. In Chapter 2, the concept and utility of function approximation are discussed for a general application in engineering analysis. Various forms of function representations and the latest developments in nonlinear adaptive approximations are presented with comparison studies. Research work accomplished in reliability analysis is presented in Chapter 3. First, the definition of safety index and most probable point of failure are introduced. Efficient ways of computing the safety index with a fewer number of iterations is emphasized. In chapter 4, the probability of failure prediction is presented using first-order, second-order and higher-order methods. System reliability methods are discussed in chapter 5. Chapter 6 presents optimization techniques for the modification and redistribution of structural sizes for improving the structural reliability. The report also contains several appendices on probability parameters.
Taape, Tillmann
2014-08-01
Hieronymus Brunschwig's Liber de arte distillandi, written in German and first published in Strasbourg in 1500, was the first printed manual on the distillation of medicinal waters. Although influential among early modern audiences and well known to modern scholars, its intriguing blend of intellectual and practical traditions has thus far received little attention. This paper identifies these strands in Brunschwig's technical instructions and shows how they intertwine in the production of reliable remedies. Exploring the intellectual dimension of Brunschwig's work, I argue that his concept of distillation is shaped by an alchemical understanding of matter, especially by the writings on 'quintessence' of the fourteenth-century alchemist John of Rupescissa. To realise this concept in the workshop, Brunschwig emphasises the central importance of the body and its senses to ensure true craftsmanship. Brunschwig's printed manual was as much a product of skilled artisanal practices as the distilled waters it describes, and I argue that it was shaped by the same concerns about technical precision and reliability.
Lewis, Matthew S; Maruff, Paul; Silbert, Brendan S; Evered, Lis A; Scott, David A
2007-02-01
The reliable change index (RCI) expresses change relative to its associated error, and is useful in the identification of postoperative cognitive dysfunction (POCD). This paper examines four common RCIs that each account for error in different ways. Three rules incorporate a constant correction for practice effects and are contrasted with the standard RCI that had no correction for practice. These rules are applied to 160 patients undergoing coronary artery bypass graft (CABG) surgery who completed neuropsychological assessments preoperatively and 1 week postoperatively using error and reliability data from a comparable healthy nonsurgical control group. The rules all identify POCD in a similar proportion of patients, but the use of the within-subject standard deviation (WSD), expressing the effects of random error, as an error estimate is a theoretically appropriate denominator when a constant error correction, removing the effects of systematic error, is deducted from the numerator in a RCI.
Valuing long-haul and metropolitan freight travel time and reliability
DOT National Transportation Integrated Search
2000-12-01
Most evaluations and economic assessments of transportation proposal and policies in Australia omit a valuation of time spent in transit for individual items or loads of freight. Knowledge of delays and the practical value of reliability can be usefu...
Incorporating travel-time reliability into the congestion management process : a primer.
DOT National Transportation Integrated Search
2015-02-01
This primer explains the value of incorporating travel-time reliability into the Congestion Management Process (CMP) : and identifies the most current tools available to assist with this effort. It draws from applied research and best practices : fro...
Studies of Physical Education in the United States Using SOFIT: A Review.
McKenzie, Thomas L; Smith, Nicole J
2017-12-01
An objective database for physical education (PE) is important for policy and practice decisions, and the System for Observing Fitness Instruction Time (SOFIT) has been identified as an appropriate surveillance tool for PE across the nation. The purpose of this review was to assess peer-reviewed studies using SOFIT to study K-12 PE in U.S. schools. The Preferred Reporting Items for Systematic Reviews and Meta-Analyses informed the review, and 10 databases were searched for English-language articles published through 2016. A total of 704 records identifying SOFIT were located, and 137 full texts were read. Two authors reviewed full-text articles, and a data extraction tool was developed to select studies and main topics for synthesis. Twenty-nine studies that included direct observations of 12,256 PE lessons met inclusion criteria; 17 were conducted in elementary schools, 9 in secondary schools, and 3 in combined-level schools. Inconsistent reporting among studies was evident, including not all identifying the number of classes and teachers involved. All studies reported student physical activity, but fewer reported observer reliabilities (88%), lesson context (76%), teacher behavior (38%), and PE dosage (34%). The most frequently analyzed independent variables were teacher preparation (48%), lesson location (38%), and student gender (31%). SOFIT can be used reliably in diverse settings. Inconsistent reporting about study procedures and variables analyzed, however, limited comparisons among studies. Adherence to an established protocol and more consistent reporting would more fully enable the development of a viable database for PE in U.S. schools.
Kalmar, Alain F; Absalom, Anthony; Rombouts, Pieter; Roets, Jelle; Dewaele, Frank; Verdonck, Pascal; Stemerdink, Arjanne; Zijlstra, Jan G; Monsieurs, Koenraad G
2016-08-01
Unrecognised endotracheal tube misplacement in emergency intubations has a reported incidence of up to 17%. Current detection methods have many limitations restricting their reliability and availability in these circumstances. There is therefore a clinical need for a device that is small enough to be practical in emergency situations and that can detect oesophageal intubation within seconds. In a first reported evaluation, we demonstrated an algorithm based on pressure waveform analysis, able to determine tube location with high reliability in healthy patients. The aim of this study was to validate the specificity of the algorithm in patients with abnormal pulmonary compliance, and to demonstrate the reliability of a newly developed small device that incorporates the technology. Intubated patients with mild to moderate lung injury, admitted to intensive care were included in the study. The device was connected to the endotracheal tube, and three test ventilations were performed in each patient. All diagnostic data were recorded on PC for subsequent specificity/sensitivity analysis. A total of 105 ventilations in 35 patients with lung injury were analysed. With the threshold D-value of 0.1, the system showed a 100% sensitivity and specificity to diagnose tube location. The algorithm retained its specificity in patients with decreased pulmonary compliance. We also demonstrated the feasibility to integrate sensors and diagnostic hardware in a small, portable hand-held device for convenient use in emergency situations. Copyright © 2016 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.
Sources of medicine information and their reliability evaluated by medicine users.
Närhi, Ulla
2007-12-01
To study the medicine users' sources of medicine information and the perceived reliability of these sources in different age groups. A computer-aided telephone interview (CATI) to Finnish consumers (n = 1,004). Those respondents (n = 714) who reported using any prescription or self-medication medicines more than once a month were included in the study. The respondents were interviewed about their use of sources of medicine information during the previous 6 months. The reliability of sources in different age groups was estimated using a 4-point scale: very reliable, somewhat reliable, somewhat unreliable and very unreliable. The respondents also had the option of being unable to make an appraisal. A proportion of respondents reporting using the source, number of mentioned sources and their reliability evaluated by respondents. About half of the respondents in each age group mentioned two to four sources. The most common sources of information were Patient Information Leaflets (PILs) (74%), doctors (68%) and pharmacists (60%). Next came television (40%), newspapers and magazines (40%), drug advertisements (32%), nurses (28%), drug information leaflets (27%), relatives and friends (24%), medicine guides and books (22%) and the Internet (20%). There was a significant difference between age groups in reporting the Internet as a source of medicine information (15-34-year-old respondents reported the greatest Internet use). The three most reliable sources in every age group were reported to be PILs, doctors and pharmacists. Nurses, drug regulatory authorities, drug information leaflets and medicine guides and books were considered next most reliable. Relatives and friends, television, newspapers and magazines were considered the least reliable. The respondents were most uncertain about the reliability of the Internet, patient organisations and telephone services. There was a significant difference between age groups in evaluating the reliability of telephone services (15-34-year-olds found them more reliable). Medicine users reported receiving medicine information from many sources. The most commonly used sources were perceived as the most reliable, but their reliability did not seem to depend on age. The counsellors should take into account that patients have many sources of medicine information, with varying validity.
Adoption of electronic health records and barriers
Palabindala, Venkataraman; Pamarthy, Amaleswari; Jonnalagadda, Nageshwar Reddy
2016-01-01
Electronic health records (EHR) are not a new idea in the U.S. medical system, but surprisingly there has been very slow adoption of fully integrated EHR systems in practice in both primary care settings and within hospitals. For those who have invested in EHR, physicians report high levels of satisfaction and confidence in the reliability of their system. There is also consensus that EHR can improve patient care, promote safe practice, and enhance communication between patients and multiple providers, reducing the risk of error. As EHR implementation continues in hospitals, administrative and physician leadership must actively investigate all of the potential risks for medical error, system failure, and legal responsibility before moving forward. Ensuring that physicians are aware of their responsibilities in relation to their charting practices and the depth of information available within an EHR system is crucial for minimizing the risk of malpractice and lawsuit. Hospitals must commit to regular system upgrading and corresponding training for all users to reduce the risk of error and adverse events. PMID:27802857
Yao, Min; Yang, Long; Cao, Zuo-Yuan; Cheng, Shao-Dan; Tian, Shuang-Lin; Sun, Yue-Li; Wang, Jing; Xu, Bao-Ping; Hu, Xiao-Chun; Wang, Yong-Jun; Zhang, Ying; Cui, Xue-Jun
2017-09-18
Shoulder pain is a common musculoskeletal disorder in Chinese population, which affects more than 1,3 billion individuals. To the best of our knowledge, there has been no available Chinese-language version of measurements of shoulder pain and disability so far. Moreover, the Constant-Murley score (CMS) questionnaire is a universally recognized patient-reported questionnaire for clinical practice and research. The present study was designed to evaluate a Chinese translational version of CMS and subsequently assess its reliability and validity. The Chinese translational version of CMS was formulated by means of forward-backward translation. Meanwhile, a final review was carried out by an expert committee, followed by conducting a test of the pre-final version. Therefore, the reliability and validity of the Chinese translational version of CMS could be assessed using the internal consistency, construct validity, factor analysis, reliability and floor and ceiling effects. Specifically, the reliability was assessed by testing the internal consistency (Cronbach's α) and test-retest reliability (intraclass coefficient correlation [ICC]), while the construct validity was evaluated via comparison between the Chinese translational version of CMS with visual analog scale (VAS) score and the 36-Item Short Form Health Survey (SF-36, Spearman correlation). The questionnaire was verified to be acceptable after distribution among 120 subjects with unilateral shoulder pain. Factor analysis had revealed a two-factor and 10-item solution. Moreover, the assessment results indicated that the Chinese translational version of CMS questionnaire harbored good internal consistency (Cronbach's α = 0.739) and test-retest reliability (ICC = 0.827). In addition, the Chinese translational version of CMS was moderately correlated with VAS score (r = 0.497) and SF-36 (r = 0.135). No obvious floor and ceiling effects were observed in the Chinese translational version of CMS questionnaire. Chinese translational version of CMS exhibited good reliability, which is relatively acceptable and is likely to be widely used in this population.
ERIC Educational Resources Information Center
Ghazali, Nor Hasnida Md
2016-01-01
A valid, reliable and practical instrument is needed to evaluate the implementation of the school-based assessment (SBA) system. The aim of this study is to develop and assess the validity and reliability of an instrument to measure the perception of teachers towards the SBA implementation in schools. The instrument is developed based on a…
A study on reliability of power customer in distribution network
NASA Astrophysics Data System (ADS)
Liu, Liyuan; Ouyang, Sen; Chen, Danling; Ma, Shaohua; Wang, Xin
2017-05-01
The existing power supply reliability index system is oriented to power system without considering actual electricity availability in customer side. In addition, it is unable to reflect outage or customer’s equipment shutdown caused by instantaneous interruption and power quality problem. This paper thus makes a systematic study on reliability of power customer. By comparing with power supply reliability, reliability of power customer is defined and extracted its evaluation requirements. An indexes system, consisting of seven customer indexes and two contrast indexes, are designed to describe reliability of power customer from continuity and availability. In order to comprehensively and quantitatively evaluate reliability of power customer in distribution networks, reliability evaluation method is proposed based on improved entropy method and the punishment weighting principle. Practical application has proved that reliability index system and evaluation method for power customer is reasonable and effective.
Federal Register 2010, 2011, 2012, 2013, 2014
2010-11-30
... covered at this meeting include: Cybersecurity best practices, ISP network protection practices... to Jeffery Goldthorp, Associate Chief for Cybersecurity and Communications Reliability Public Safety...
Schrimshaw, Eric W.; Rosario, Margaret; Meyer-Bahlburg, Heino F. L.; Scharf-Matlick, Alice A.
2011-01-01
Despite the importance of reliable self-reported sexual information for research on sexuality and sexual health, research has not examined reliability of information provided by gay, lesbian, and bisexual (GLB) youths. Test-retest reliability of self-reported sexual behaviors, sexual orientation, sexual identity, and psychosexual developmental milestones was examined among an ethnically diverse sample of 64 self-identified GLB youths. Two face-to-face interviews were conducted approximately two weeks apart using the Sexual Risk Behavior Assessment Schedule for Homosexual Youths (SERBAS-Y-HM). Overall, the mean of the test-retest reliability coefficients was substantial for 6 of the 7 domains: lifetime sexual behaviors (M = .89), sexual behavior in the past 3 months (M = .96), unprotected sexual behavior in the past 3 months (M = .93), sexual identity (κ = .89), sexual orientation (M = .82), and ages of various psychosexual developmental milestones (M = .77). Inconsistent reliability was found for reports of sexual behaviors while using substances. A small number of gender differences emerged, with lower reliability among female youths in the lifetime number of same-sex partners. The overall findings suggest that a wide range of self-reported sexual information can be reliably assessed among GLB youths by means of interviewer-administered questionnaires, such as the SERBAS-Y-HM. PMID:16752124
Updated Reliability Evaluation of V730 Transmission
DOT National Transportation Integrated Search
1983-11-01
This report culminates a two-year review of factors concerning the reliability of the Detroit Diesel Allison V730 automatic three-speed transmission for urban transit buses. This report is a continuing examination of the transmission's reliability. M...
NASA Astrophysics Data System (ADS)
Ighravwe, D. E.; Oke, S. A.; Adebiyi, K. A.
2016-06-01
The growing interest in technicians' workloads research is probably associated with the recent surge in competition. This was prompted by unprecedented technological development that triggers changes in customer tastes and preferences for industrial goods. In a quest for business improvement, this worldwide intense competition in industries has stimulated theories and practical frameworks that seek to optimise performance in workplaces. In line with this drive, the present paper proposes an optimisation model which considers technicians' reliability that complements factory information obtained. The information used emerged from technicians' productivity and earned-values using the concept of multi-objective modelling approach. Since technicians are expected to carry out routine and stochastic maintenance work, we consider these workloads as constraints. The influence of training, fatigue and experiential knowledge of technicians on workload management was considered. These workloads were combined with maintenance policy in optimising reliability, productivity and earned-values using the goal programming approach. Practical datasets were utilised in studying the applicability of the proposed model in practice. It was observed that our model was able to generate information that practicing maintenance engineers can apply in making more informed decisions on technicians' management.
Broyles, S T; Drazba, K T; Church, T S; Chaput, J-P; Fogelholm, M; Hu, G; Kuriyan, R; Kurpad, A; Lambert, E V; Maher, C; Maia, J; Matsudo, V; Olds, T; Onywera, V; Sarmiento, O L; Standage, M; Tremblay, M S; Tudor-Locke, C; Zhao, P; Katzmarzyk, P T
2015-01-01
Objectives: Schools are an important setting to enable and promote physical activity. Researchers have created a variety of tools to perform objective environmental assessments (or ‘audits') of other settings, such as neighborhoods and parks; yet, methods to assess the school physical activity environment are less common. The purpose of this study is to describe the approach used to objectively measure the school physical activity environment across 12 countries representing all inhabited continents, and to report on the reliability and feasibility of this methodology across these diverse settings. Methods: The International Study of Childhood Obesity, Lifestyle and the Environment (ISCOLE) school audit tool (ISAT) data collection required an in-depth training (including field practice and certification) and was facilitated by various supporting materials. Certified data collectors used the ISAT to assess the environment of all schools enrolled in ISCOLE. Sites completed a reliability audit (simultaneous audits by two independent, certified data collectors) for a minimum of two schools or at least 5% of their school sample. Item-level agreement between data collectors was assessed with both the kappa statistic and percent agreement. Inter-rater reliability of school summary scores was measured using the intraclass correlation coefficient. Results: Across the 12 sites, 256 schools participated in ISCOLE. Reliability audits were conducted at 53 schools (20.7% of the sample). For the assessed environmental features, inter-rater reliability (kappa) ranged from 0.37 to 0.96; 18 items (42%) were assessed with almost perfect reliability (κ=0.80–0.96), and a further 24 items (56%) were assessed with substantial reliability (κ=0.61–0.79). Likewise, scores that summarized a school's support for physical activity were highly reliable, with the exception of scores assessing aesthetics and perceived suitability of the school grounds for sport, informal games and general play. Conclusions: This study suggests that the ISAT can be used to conduct reliable objective audits of the school physical activity environment across diverse, international school settings. PMID:27152183
Bergamin, Marco; Gobbo, Stefano; Bullo, Valentina; Vendramin, Barbara; Duregon, Federica; Frizziero, Antonio; Di Blasio, Andrea; Cugusi, Lucia; Zaccaria, Marco; Ermolao, Andrea
2017-01-01
Summary Background Lower extremity muscle mass, strength, power, and physical performance are critical determinants of independent functioning in later life. Isokinetic dynamometers are becoming very common in assessing different features of muscle strength, in both research and clinical practice; however, reliability studies are still needed to support the extended use of those devices. Objective The purpose of this study is to assess the test-retest reliability of knee and ankle isokinetic and isometric strength testing protocols in a sample of older healthy subjects, using a new and untested isokinetic multi-joint evaluation system. Methods Sixteen male and fourteen female older adults (mean age 65.2 ± 4.6 years) were assessed in two testing sessions. Each participant performed a randomized testing procedure that includes different isometric and isokinetic tests for knee and ankle joints. Results All participants concluded the trial safety and no subject reported any discomfort throughout the overall assessment. Coefficients of correlation between measures were calculated showing moderate to strong effects among all test-retest assessments and paired-sample t test showed only one significant difference (p<0.05) in the maximal isokinetic bilateral knee flexion torque. Conclusions The multi-joint evaluation system for the assessment of knee and ankle isokinetic and isometric strength provided reliable test-retest measures in healthy older adults. Level of evidence Ib. PMID:29264344
van Hooren, Susan; van der Veld, William M.; Hutschemaekers, Giel
2017-01-01
Abstract Despite the use of art therapy in clinical practice, its appreciation and reported beneficial results, no instruments are available to measure specific effects of art therapy among patients with personality disorders cluster B/C in multidisciplinary treatment. In the present study, we described the development and psychometric evaluation of the Self‐expression and Emotion Regulation in Art Therapy Scale (SERATS). Structural validity (exploratory and confirmatory factor analysis), reliability, construct validity and sensitivity to change were examined using two independent databases (n = 335; n = 34) of patients diagnosed with personality disorders cluster B/C. This resulted in a nine‐item effect scale with a single factor with a high internal reliability and high test–retest reliability; it demonstrated discriminant validity and sensitivity to change. In conclusion, the SERATS is brief and content‐valid and offers objective and reliable information on self‐expression and emotion regulation in art therapy among patients with personality disorders cluster B/C. Although more research on construct validity is needed, the SERATS is a promising tool to be applied as an effect scale and as a monitoring tool during art therapy treatment. © 2017 The Authors Personality and Mental Health Published by John Wiley & Sons Ltd PMID:28730717
Reliability and validity of the adolescent health profile-types.
Riley, A W; Forrest, C B; Starfield, B; Green, B; Kang, M; Ensminger, M
1998-08-01
The purpose of this study was to demonstrate the preliminary reliability and validity of a set 13 profiles of adolescent health that describe distinct patterns of health and health service requirements on four domains of health. Reliability and validity were tested in four ethnically diverse population samples of urban and rural youths aged 11 to 17-years-old in public schools (N = 4,066). The reliability of the classification procedure and construct validity were examined in terms of the predicted and actual distributions of age, gender, race, socioeconomic status, and family type. School achievement, medical conditions, and the proportion of youths with a psychiatric disorder also were examined as tests of construct validity. The classification method was shown to produce consistent results across the four populations in terms of proportions of youths assigned with specific sociodemographic characteristics. Variations in health described by specific profiles showed expected relations to sociodemographic characteristics, family structure, school achievement, medical disorders, and psychiatric disorders. This taxonomy of health profile-types appears to effectively describe a set of patterns that characterize adolescent health. The profile-types provide a unique and practical method for identifying subgroups having distinct needs for health services, with potential utility for health policy and planning. Such integrative reporting methods are critical for more effective utilization of health status instruments in health resource planning and policy development.
Measuring eating disorder attitudes and behaviors: a reliability generalization study
2014-01-01
Background Although score reliability is a sample-dependent characteristic, researchers often only report reliability estimates from previous studies as justification for employing particular questionnaires in their research. The present study followed reliability generalization procedures to determine the mean score reliability of the Eating Disorder Inventory and its most commonly employed subscales (Drive for Thinness, Bulimia, and Body Dissatisfaction) and the Eating Attitudes Test as a way to better identify those characteristics that might impact score reliability. Methods Published studies that used these measures were coded based on their reporting of reliability information and additional study characteristics that might influence score reliability. Results Score reliability estimates were included in 26.15% of studies using the EDI and 36.28% of studies using the EAT. Mean Cronbach’s alphas for the EDI (total score = .91; subscales = .75 to .89), EAT-40 (total score = .81) and EAT-26 (total score = .86; subscales = .56 to .80) suggested variability in estimated internal consistency. Whereas some EDI subscales exhibited higher score reliability in clinical eating disorder samples than in nonclinical samples, other subscales did not exhibit these differences. Score reliability information for the EAT was primarily reported for nonclinical samples, making it difficult to characterize the effect of type of sample on these measures. However, there was a tendency for mean score reliability to be higher in the adult (vs. adolescent) samples and in female (vs. male) samples. Conclusions Overall, this study highlights the importance of assessing and reporting internal consistency during every test administration because reliability is affected by characteristics of the participants being examined. PMID:24764530
Neil, Sarah E; Myring, Alec; Peeters, Mon Jef; Pirie, Ian; Jacobs, Rachel; Hunt, Michael A; Garland, S Jayne; Campbell, Kristin L
2013-11-01
Muscular strength is a key parameter of rehabilitation programs and a strong predictor of functional capacity. Traditional methods to measure strength, such as manual muscle testing (MMT) and hand-held dynamometry (HHD), are limited by the strength and experience of the tester. The Performance Recorder 1 (PR1) is a strength assessment tool attached to resistance training equipment and may be a time- and cost-effective tool to measure strength in clinical practice that overcomes some limitations of MMT and HHD. However, reliability and validity of the PR1 have not been reported. Test-retest and inter-rater reliability was assessed using the PR1 in healthy adults (n = 15) during isometric knee flexion and extension. Criterion-related validity was assessed through comparison of values obtained from the PR1 and Biodex® isokinetic dynamometer. Test-retest reliability was excellent for peak knee flexion (intra-class correlation coefficient [ICC] of 0.96, 95% CI: 0.85, 0.99) and knee extension (ICC = 0.96, 95% CI: 0.87, 0.99). Inter-rater reliability was also excellent for peak knee flexion (ICC = 0.95, 95% CI: 0.85, 0.99) and peak knee extension (ICC = 0.97, 95% CI: 0.91, 0.99). Validity was moderate for peak knee flexion (ICC = 0.75, 95% CI: 0.38, 0.92) but poor for peak knee extension (ICC = 0.37, 95% CI: 0, 0.73). The PR1 provides a reliable measure of isometric knee flexor and extensor strength in healthy adults that could be used in the clinical setting, but absolute values may not be comparable to strength assessment by gold-standard measures.
Reliability and validity of the Thai version of the Pediatric Quality of Life Inventory 4.0.
Sritipsukho, Paskorn; Wisai, Matoorada; Thavorncharoensap, Montarat
2013-04-01
The study aimed to evaluate the reliability and validity of the Thai version of the Pediatric Quality of Life Inventory™ 4.0 Core Scales (PedsQL) as a measure of health-related quality of life (HRQOL). The PedsQL items were completed by 2,086 pupils aged 8-15 years and 1,914 parents from four schools, and 100 pediatric outpatients and 100 parents from a University Hospital. Test-retest reliability was conducted in a randomly selected of 150 pupils at a 1-month interval. Internal consistency reliability for the Total Scale score (α = 0.84 self-report, 0.88 proxy-report), Physical Health Summary score (α = 0.76 self-report, 0.79 proxy-report), and Psychosocial Health Summary score (α = 0.74 self-report, 0.85 proxy-report) exceeded the minimum reliability standard of 0.70. School children had significantly higher mean HRQOL scores compared to those with chronic health conditions for all subscales with the mean differences of 3.1-12.4 for self-report (p < 0.03) and 7.7-15.6 for proxy-report (p < 0.001). Test-retest reliability showed intraclass correlation coefficients above 0.60 in all subscales (p < 0.001). The Thai version of PedsQL had adequate reliability and validity and could be used as an outcome measure of HRQOL in Thai children aged 8-15 years.
Bean, Donna L; Rotheram-Borus, Mary Jane; Leibowitz, Arleen; Horwitz, Sarah M; Weidmer, Beverly
2003-02-01
To assess test-retest reliability of the service utilization screening section of the Services Assessment for Children and Adolescents (SACA) interview among Spanish-speaking parents and adolescents, correspondence between parent and adolescent reports, and the correlation between reliability and participants' demographic and service use characteristics. The English SACA was translated and administered from September 1999 through January 2000 in Los Angeles County, California, on two separate occasions to eligible parents with a child (4-17 years old) who was a client of a local public mental health authority. Adolescents of these parents (12-17 years old) were also interviewed. Reliability was measured by the kappa statistic. Adult and adolescent reports about lifetime and previous year service setting use exhibited good reliability, but concordance of parents and adolescents did not. Children's service utilization appears to be correlated with reliability of parent reports, and child gender appears to be correlated with reliability of adolescent reports. The SACA appears to be a useful tool for screening Spanish-speaking families about child and adolescent mental health service use. These findings must be considered preliminary until replicated in a larger sample of culturally diverse Spanish-speaking families.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Maitra, Neepa
2016-07-14
This project investigates the accuracy of currently-used functionals in time-dependent density functional theory, which is today routinely used to predict and design materials and computationally model processes in solar energy conversion. The rigorously-based electron-ion dynamics method developed here sheds light on traditional methods and overcomes challenges those methods have. The fundamental research undertaken here is important for building reliable and practical methods for materials discovery. The ultimate goal is to use these tools for the computational design of new materials for solar cell devices of high efficiency.
Whitford, Heather M; Donnan, Peter T; Symon, Andrew G; Kellett, Gillian; Monteith-Hodge, Ewa; Rauchhaus, Petra; Wyatt, Jeremy C
2012-01-01
To test the reliability, validity, acceptability, and practicality of short message service (SMS) messaging for collection of research data. The studies were carried out in a cohort of recently delivered women in Tayside, Scotland, UK, who were asked about their current infant feeding method and future feeding plans. Reliability was assessed by comparison of their responses to two SMS messages sent 1 day apart. Validity was assessed by comparison of their responses to text questions and the same question administered by phone 1 day later, by comparison with the same data collected from other sources, and by correlation with other related measures. Acceptability was evaluated using quantitative and qualitative questions, and practicality by analysis of a researcher log. Reliability of the factual SMS message gave perfect agreement. Reliabilities for the numerical question were reasonable, with κ between 0.76 (95% CI 0.56 to 0.96) and 0.80 (95% CI 0.59 to 1.00). Validity for data compared with that collected by phone within 24 h (κ =0.92 (95% CI 0.84 to 1.00)) and with health visitor data (κ =0.85 (95% CI 0.73 to 0.97)) was excellent. Correlation validity between the text responses and other related demographic and clinical measures was as expected. Participants found the method a convenient and acceptable way of providing data. For researchers, SMS text messaging provided an easy and functional method of gathering a large volume of data. In this sample and for these questions, SMS was a reliable and valid method for capturing research data.
Busch, Robyn M; Lineweaver, Tara T; Ferguson, Lisa; Haut, Jennifer S
2015-06-01
Reliable change indices (RCIs) and standardized regression-based (SRB) change score norms permit evaluation of meaningful changes in test scores following treatment interventions, like epilepsy surgery, while accounting for test-retest reliability, practice effects, score fluctuations due to error, and relevant clinical and demographic factors. Although these methods are frequently used to assess cognitive change after epilepsy surgery in adults, they have not been widely applied to examine cognitive change in children with epilepsy. The goal of the current study was to develop RCIs and SRB change score norms for use in children with epilepsy. Sixty-three children with epilepsy (age range: 6-16; M=10.19, SD=2.58) underwent comprehensive neuropsychological evaluations at two time points an average of 12 months apart. Practice effect-adjusted RCIs and SRB change score norms were calculated for all cognitive measures in the battery. Practice effects were quite variable across the neuropsychological measures, with the greatest differences observed among older children, particularly on the Children's Memory Scale and Wisconsin Card Sorting Test. There was also notable variability in test-retest reliabilities across measures in the battery, with coefficients ranging from 0.14 to 0.92. Reliable change indices and SRB change score norms for use in assessing meaningful cognitive change in children following epilepsy surgery are provided for measures with reliability coefficients above 0.50. This is the first study to provide RCIs and SRB change score norms for a comprehensive neuropsychological battery based on a large sample of children with epilepsy. Tables to aid in evaluating cognitive changes in children who have undergone epilepsy surgery are provided for clinical use. An Excel sheet to perform all relevant calculations is also available to interested clinicians or researchers. Copyright © 2015 Elsevier Inc. All rights reserved.
Donnan, Peter T; Symon, Andrew G; Kellett, Gillian; Monteith-Hodge, Ewa; Rauchhaus, Petra; Wyatt, Jeremy C
2012-01-01
Objective To test the reliability, validity, acceptability, and practicality of short message service (SMS) messaging for collection of research data. Materials and methods The studies were carried out in a cohort of recently delivered women in Tayside, Scotland, UK, who were asked about their current infant feeding method and future feeding plans. Reliability was assessed by comparison of their responses to two SMS messages sent 1 day apart. Validity was assessed by comparison of their responses to text questions and the same question administered by phone 1 day later, by comparison with the same data collected from other sources, and by correlation with other related measures. Acceptability was evaluated using quantitative and qualitative questions, and practicality by analysis of a researcher log. Results Reliability of the factual SMS message gave perfect agreement. Reliabilities for the numerical question were reasonable, with κ between 0.76 (95% CI 0.56 to 0.96) and 0.80 (95% CI 0.59 to 1.00). Validity for data compared with that collected by phone within 24 h (κ =0.92 (95% CI 0.84 to 1.00)) and with health visitor data (κ =0.85 (95% CI 0.73 to 0.97)) was excellent. Correlation validity between the text responses and other related demographic and clinical measures was as expected. Participants found the method a convenient and acceptable way of providing data. For researchers, SMS text messaging provided an easy and functional method of gathering a large volume of data. Conclusion In this sample and for these questions, SMS was a reliable and valid method for capturing research data. PMID:22539081
A Conflict Management Scale for Pharmacy
Gregory, Paul A.; Martin, Craig
2009-01-01
Objectives To develop and establish the validity and reliability of a conflict management scale specific to pharmacy practice and education. Methods A multistage inventory-item development process was undertaken involving 93 pharmacists and using a previously described explanatory model for conflict in pharmacy practice. A 19-item inventory was developed, field tested, and validated. Results The conflict management scale (CMS) demonstrated an acceptable degree of reliability and validity for use in educational or practice settings to promote self-reflection and self-awareness regarding individuals' conflict management styles. Conclusions The CMS provides a unique, pharmacy-specific method for individuals to determine and reflect upon their own conflict management styles. As part of an educational program to facilitate self-reflection and heighten self-awareness, the CMS may be a useful tool to promote discussions related to an important part of pharmacy practice. PMID:19960081
Software reliability models for critical applications
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pham, H.; Pham, M.
This report presents the results of the first phase of the ongoing EG G Idaho, Inc. Software Reliability Research Program. The program is studying the existing software reliability models and proposes a state-of-the-art software reliability model that is relevant to the nuclear reactor control environment. This report consists of three parts: (1) summaries of the literature review of existing software reliability and fault tolerant software reliability models and their related issues, (2) proposed technique for software reliability enhancement, and (3) general discussion and future research. The development of this proposed state-of-the-art software reliability model will be performed in the secondmore » place. 407 refs., 4 figs., 2 tabs.« less
Software reliability models for critical applications
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pham, H.; Pham, M.
This report presents the results of the first phase of the ongoing EG&G Idaho, Inc. Software Reliability Research Program. The program is studying the existing software reliability models and proposes a state-of-the-art software reliability model that is relevant to the nuclear reactor control environment. This report consists of three parts: (1) summaries of the literature review of existing software reliability and fault tolerant software reliability models and their related issues, (2) proposed technique for software reliability enhancement, and (3) general discussion and future research. The development of this proposed state-of-the-art software reliability model will be performed in the second place.more » 407 refs., 4 figs., 2 tabs.« less
Lucas, Nicholas; Macaskill, Petra; Irwig, Les; Moran, Robert; Bogduk, Nikolai
2009-01-01
Trigger points are promoted as an important cause of musculoskeletal pain. There is no accepted reference standard for the diagnosis of trigger points, and data on the reliability of physical examination for trigger points are conflicting. To systematically review the literature on the reliability of physical examination for the diagnosis of trigger points. MEDLINE, EMBASE, and other sources were searched for articles reporting the reliability of physical examination for trigger points. Included studies were evaluated for their quality and applicability, and reliability estimates were extracted and reported. Nine studies were eligible for inclusion. None satisfied all quality and applicability criteria. No study specifically reported reliability for the identification of the location of active trigger points in the muscles of symptomatic participants. Reliability estimates varied widely for each diagnostic sign, for each muscle, and across each study. Reliability estimates were generally higher for subjective signs such as tenderness (kappa range, 0.22-1.0) and pain reproduction (kappa range, 0.57-1.00), and lower for objective signs such as the taut band (kappa range, -0.08-0.75) and local twitch response (kappa range, -0.05-0.57). No study to date has reported the reliability of trigger point diagnosis according to the currently proposed criteria. On the basis of the limited number of studies available, and significant problems with their design, reporting, statistical integrity, and clinical applicability, physical examination cannot currently be recommended as a reliable test for the diagnosis of trigger points. The reliability of trigger point diagnosis needs to be further investigated with studies of high quality that use current diagnostic criteria in clinically relevant patients.
Fan, Yahui; Zhang, Shaoru; Li, Yan; Li, Yuelu; Zhang, Tianhua; Liu, Weiping; Jiang, Hualin
2018-05-08
TB outbreaking in schools is extremely complex, and presents a major challenge for public health. Understanding the knowledge, attitudes and practices among student TB patients in such settings is fundamental when it comes to decreasing future TB cases. The objective of this study was to develop a Knowledge, Attitudes and Practices Questionnaire among Student Tuberculosis Patients (STBP-KAPQ), and evaluate its psychometric properties. This study was conducted in three stages: item construction, pilot testing in 10 student TB patients and psychometric testing, including reliability and validity. The item pool for the questionnaire was compiled from literature review and early individual interviews. The questionnaire items were evaluated by the Delphi method based on 12 experts. Reliability and validity were assessed using student TB patients (n = 416) and healthy students (n = 208). Reliability was examined with internal consistency reliability and test-retest reliability. Content validity was calculated by content validity index (CVI); Construct validity was examined using exploratory factor analysis (EFA) and confirmatory factor analysis (CFA); The Public Tuberculosis Knowledge, Attitudes and Practices Questionnaire (PTB-KAPQ) was applied to evaluate criterion validity; As concerning discriminant validity, T-test was performed. The final STBP-KAPQ consisted of three dimensions and 25 items. Cronbach's α coefficient and intraclass correlation coefficient (ICC) was 0.817 and 0.765, respectively. Content validity index (CVI) was 0.962. Seven common factors were extracted by principal factor analysis and varimax rotation, with a cumulative contribution of 66.253%. The resulting CFA model of the STBP-KAPQ exhibited an appropriate model fit (χ2/df = 1.74, RMSEA = 0.082, CFI = 0.923, NNFI = 0.962). STBP-KAPQ and PTB-KAPQ had a strong correlation in the knowledge part, and the correlation coefficient was 0.606 (p < 0.05). Discriminant validity was supported through a significant difference between student TB patients and healthy students across all domains (p < 0.05). An instrument, "Knowledge, Attitudes and Practices Questionnaire among Student Tuberculosis Patients (STBP-KAPQ)" was developed. Psychometric testing indicated that it had adequate validity and reliability for use in KAP researches with student TB patients in China. The new tool might help public health researchers evaluate the level of KAP in student TB patients, and it could also be used to examine the effects of TB health education.
Fernández-Domínguez, Juan Carlos; de Pedro-Gómez, Joan Ernest; Morales-Asencio, José Miguel; Sastre-Fullana, Pedro; Sesé-Abad, Albert
2017-01-01
Introduction Most of the EBP measuring instruments available to date present limitations both in the operationalisation of the construct and also in the rigour of their psychometric development, as revealed in the literature review performed. The aim of this paper is to provide rigorous and adequate reliability and validity evidence of the scores of a new transdisciplinary psychometric tool, the Health Sciences Evidence-Based Practice (HS-EBP), for measuring the construct EBP in Health Sciences professionals. Methods A pilot study and a subsequent two-stage validation test sample were conducted to progressively refine the instrument until a reduced 60-item version with a five-factor latent structure. Reliability was analysed through both Cronbach’s alpha coefficient and intraclass correlations (ICC). Latent structure was contrasted using confirmatory factor analysis (CFA) following a model comparison aproach. Evidence of criterion validity of the scores obtained was achieved by considering attitudinal resistance to change, burnout, and quality of professional life as criterion variables; while convergent validity was assessed using the Spanish version of the Evidence-Based Practice Questionnaire (EBPQ-19). Results Adequate evidence of both reliability and ICC was obtained for the five dimensions of the questionnaire. According to the CFA model comparison, the best fit corresponded to the five-factor model (RMSEA = 0.049; CI 90% RMSEA = [0.047; 0.050]; CFI = 0.99). Adequate criterion and convergent validity evidence was also provided. Finally, the HS-EBP showed the capability to find differences between EBP training levels as an important evidence of decision validity. Conclusions Reliability and validity evidence obtained regarding the HS-EBP confirm the adequate operationalisation of the EBP construct as a process put into practice to respond to every clinical situation arising in the daily practice of professionals in health sciences (transprofessional). The tool could be useful for EBP individual assessment and for evaluating the impact of specific interventions to improve EBP. PMID:28486533
Fernández-Domínguez, Juan Carlos; de Pedro-Gómez, Joan Ernest; Morales-Asencio, José Miguel; Bennasar-Veny, Miquel; Sastre-Fullana, Pedro; Sesé-Abad, Albert
2017-01-01
Most of the EBP measuring instruments available to date present limitations both in the operationalisation of the construct and also in the rigour of their psychometric development, as revealed in the literature review performed. The aim of this paper is to provide rigorous and adequate reliability and validity evidence of the scores of a new transdisciplinary psychometric tool, the Health Sciences Evidence-Based Practice (HS-EBP), for measuring the construct EBP in Health Sciences professionals. A pilot study and a subsequent two-stage validation test sample were conducted to progressively refine the instrument until a reduced 60-item version with a five-factor latent structure. Reliability was analysed through both Cronbach's alpha coefficient and intraclass correlations (ICC). Latent structure was contrasted using confirmatory factor analysis (CFA) following a model comparison aproach. Evidence of criterion validity of the scores obtained was achieved by considering attitudinal resistance to change, burnout, and quality of professional life as criterion variables; while convergent validity was assessed using the Spanish version of the Evidence-Based Practice Questionnaire (EBPQ-19). Adequate evidence of both reliability and ICC was obtained for the five dimensions of the questionnaire. According to the CFA model comparison, the best fit corresponded to the five-factor model (RMSEA = 0.049; CI 90% RMSEA = [0.047; 0.050]; CFI = 0.99). Adequate criterion and convergent validity evidence was also provided. Finally, the HS-EBP showed the capability to find differences between EBP training levels as an important evidence of decision validity. Reliability and validity evidence obtained regarding the HS-EBP confirm the adequate operationalisation of the EBP construct as a process put into practice to respond to every clinical situation arising in the daily practice of professionals in health sciences (transprofessional). The tool could be useful for EBP individual assessment and for evaluating the impact of specific interventions to improve EBP.
Siedlecki, Sandra L; Albert, Nancy M
This article will describe how to assess interrater reliability and validity of risk assessment tools, using easy-to-follow formulas, and to provide calculations that demonstrate principles discussed. Clinical nurse specialists should be able to identify risk assessment tools that provide high-quality interrater reliability and the highest validity for predicting true events of importance to clinical settings. Making best practice recommendations for assessment tool use is critical to high-quality patient care and safe practices that impact patient outcomes and nursing resources. Optimal risk assessment tool selection requires knowledge about interrater reliability and tool validity. The clinical nurse specialist will understand the reliability and validity issues associated with risk assessment tools, and be able to evaluate tools using basic calculations. Risk assessment tools are developed to objectively predict quality and safety events and ultimately reduce the risk of event occurrence through preventive interventions. To ensure high-quality tool use, clinical nurse specialists must critically assess tool properties. The better the tool's ability to predict adverse events, the more likely that event risk is mediated. Interrater reliability and validity assessment is relatively an easy skill to master and will result in better decisions when selecting or making recommendations for risk assessment tool use.
A reliable, fast and low cost maximum power point tracker for photovoltaic applications
DOE Office of Scientific and Technical Information (OSTI.GOV)
Enrique, J.M.; Andujar, J.M.; Bohorquez, M.A.
This work presents a new maximum power point tracker system for photovoltaic applications. The developed system is an analog version of the ''P and O-oriented'' algorithm. It maintains its main advantages: simplicity, reliability and easy practical implementation, and avoids its main disadvantages: inaccurateness and relatively slow response. Additionally, the developed system can be implemented in a practical way at a low cost, which means an added value. The system also shows an excellent behavior for very fast variables in incident radiation levels. (author)
A mind you can count on: validating breath counting as a behavioral measure of mindfulness.
Levinson, Daniel B; Stoll, Eli L; Kindy, Sonam D; Merry, Hillary L; Davidson, Richard J
2014-01-01
Mindfulness practice of present moment awareness promises many benefits, but has eluded rigorous behavioral measurement. To date, research has relied on self-reported mindfulness or heterogeneous mindfulness trainings to infer skillful mindfulness practice and its effects. In four independent studies with over 400 total participants, we present the first construct validation of a behavioral measure of mindfulness, breath counting. We found it was reliable, correlated with self-reported mindfulness, differentiated long-term meditators from age-matched controls, and was distinct from sustained attention and working memory measures. In addition, we employed breath counting to test the nomological network of mindfulness. As theorized, we found skill in breath counting associated with more meta-awareness, less mind wandering, better mood, and greater non-attachment (i.e., less attentional capture by distractors formerly paired with reward). We also found in a randomized online training study that 4 weeks of breath counting training improved mindfulness and decreased mind wandering relative to working memory training and no training controls. Together, these findings provide the first evidence for breath counting as a behavioral measure of mindfulness.
Quality improvement in pediatrics: past, present, and future.
Schwartz, Stephanie P; Rehder, Kyle J
2017-01-01
Almost two decades ago, the landmark report "To Err is Human" compelled healthcare to address the large numbers of hospitalized patients experiencing preventable harm. Concurrently, it became clear that the rapidly rising cost of healthcare would be unsustainable in the long-term. As a result, quality improvement methodologies initially rooted in other high-reliability industries have become a primary focus of healthcare. Multiple pediatric studies demonstrate remarkable quality and safety improvements in several domains including handoffs, catheter-associated blood stream infections, and other serious safety events. While both quality improvement and research are data-driven processes, significant differences exist between the two. Research utilizes a hypothesis driven approach to obtain new knowledge while quality improvement often incorporates a cyclic approach to translate existing knowledge into clinical practice. Recent publications have provided guidelines and methods for effectively reporting quality and safety work and improvement implementations. This review examines not only how quality improvement in pediatrics has led to improved outcomes, but also looks to the future of quality improvement in healthcare with focus on education and collaboration to ensure best practice approaches to caring for children.
Solar PV O&M Standards and Best Practices – Existing Gaps and Improvement Efforts
DOE Office of Scientific and Technical Information (OSTI.GOV)
Klise, Geoffrey Taylor; Balfour, John R.; Keating, T. J.
2014-11-01
As greater numbers of photovoltaic (PV) systems are being installed, operations & maintenance (O&M) activities will need to be performed to ensure the PV system is operating as designed over its useful lifetime. To mitigate risks to PV system availability and performance, standardized procedures for O&M activities are needed to ensure high reliability and long-term system bankability. Efforts are just getting underway to address the need for standard O&M procedures as PV gains a larger share of U.S. generation capacity. Due to the existing landscape of how and where PV is installed, including distributed generation from small and medium PVmore » systems, as well as large, centralized utility-scale PV, O&M activities will require different levels of expertise and reporting, making standards even more important. This report summarizes recent efforts made by solar industry stakeholders to identify the existing standards and best practices applied to solar PV O&M activities, and determine the gaps that have yet to be, or are currently being addressed by industry.« less
Solar PV O&M Standards and Best Practices - Existing Gaps and Improvement Efforts
DOE Office of Scientific and Technical Information (OSTI.GOV)
Klise, Geoffrey Taylor; Balfour, John R.; Keating, T. J.
2014-11-01
As greater numbers of photovoltaic (PV) systems are being installed, operations & maintenance (O&M) activities will need to be performed to ensure the PV system is operating as designed over its useful lifetime. To mitigate risks to PV system availability and performance, standardized procedures for O&M activities are needed to ensure high reliability and long-term system bankability. Efforts are just getting underway to address the need for standard O&M procedures as PV gains a larger share of U.S. generation capacity. Due to the existing landscape of how and where PV is installed, including distributed generation from small and medium PVmore » systems, as well as large, centralized utility-scale PV, O&M activities will require different levels of expertise and reporting, making standards even more important. This report summarizes recent efforts made by solar industry stakeholders to identify the existing standards and best practices applied to solar PV O&M activities, and determine the gaps that have yet to be, or are currently being addressed by industry.« less
The acute lethal dose 50 (LD50) of caffeine in albino rats.
Adamson, Richard H
2016-10-01
An acute LD50 is a statistically derived amount of a substance that can be expected to cause death in 50% of the animals when given by a specified route as a single dose and the animals observed for a specified time period. Although conducting routine acute toxicity testing in rodents has been criticized, it can serve useful functions and also have practical implications. Material safety data sheets (MSDS) will reflect the acute toxicity of a substance and may require workers to wear protective gear, if appropriate, based on the LD50. There is no information in the scientific published literature which calculates a mean LD50 and standard deviation for caffeine administered orally to rats, using studies performed under good laboratory practice (GLP) or equivalent. This report does that and should be useful to manufacturers, packagers, transporters and regulators of this material. Using data from studies that are reproducible and reliable, the most accurate estimate of the acute LD50 of caffeine administered orally in male albino rats is hereby reported to be 367/mg/kg. Copyright © 2016 Elsevier Inc. All rights reserved.
RELIABLE COMPUTATION OF HOMOGENEOUS AZEOTROPES. (R824731)
It is important to determine the existence and composition of homogeneous azeotropes in the analysis of phase behavior and in the synthesis and design of separation systems, from both theoretical and practical standpoints. A new method for reliably locating an...
Reference values for the muscle power sprint test in 6- to 12-year-old children.
Douma-van Riet, Danielle; Verschuren, Olaf; Jelsma, Dorothee; Kruitwagen, Cas; Smits-Engelsman, Bouwien; Takken, Tim
2012-01-01
The aims of this study were (1) to develop centile reference values for anaerobic performance of Dutch children tested using the Muscle Power Sprint Test (MPST) and (2) to examine the test-retest reliability of the MPST. Children who were developing typically (178 boys and 201 girls) and aged 6 to 12 years (mean = 8.9 years) were recruited. The MPST was administered to 379 children, and test-retest reliability was examined in 47 children. MPST scores were transformed into centile curves, which were created using generalized additive models for location, scale, and shape. Height-related reference curves were created for both genders. Excellent (intraclass correlation coefficient = 0.98) test-retest reliability was demonstrated. The reference values for the MPST of children who are developing typically and aged 6 to 12 years can serve as a clinical standard in pediatric physical therapy practice. The MPST is a reliable and practical method for determining anaerobic performance in children.
Report on Wind Turbine Subsystem Reliability - A Survey of Various Databases (Presentation)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sheng, S.
2013-07-01
Wind industry has been challenged by premature subsystem/component failures. Various reliability data collection efforts have demonstrated their values in supporting wind turbine reliability and availability research & development and industrial activities. However, most information on these data collection efforts are scattered and not in a centralized place. With the objective of getting updated reliability statistics of wind turbines and/or subsystems so as to benefit future wind reliability and availability activities, this report is put together based on a survey of various reliability databases that are accessible directly or indirectly by NREL. For each database, whenever feasible, a brief description summarizingmore » database population, life span, and data collected is given along with its features & status. Then selective results deemed beneficial to the industry and generated based on the database are highlighted. This report concludes with several observations obtained throughout the survey and several reliability data collection opportunities in the future.« less
Report: Studies Addressing EPA’s Organizational Structure
Report #2006-P-00029, August 16, 2006. The 13 studies, articles, publications, and reports we reviewed identified issues with cross-media management, regional offices, reliable information, and reliable science.
A systematic review of clinical assessment for undergraduate nursing students.
Wu, Xi Vivien; Enskär, Karin; Lee, Cindy Ching Siang; Wang, Wenru
2015-02-01
Consolidated clinical practicum prepares pre-registration nursing students to function as beginning practitioners. The clinical competencies of final-year nursing students provide a key indication of professional standards of practice and patient safety. Thus, clinical assessment of nursing students is a crucial issue for educators and administrators. The aim of this systematic review was to explore the clinical competency assessment for undergraduate nursing students. PubMed, CINAHL, ScienceDirect, Web of Science, and EBSCO were systematically searched from January 2000 to December 2013. The systematic review was in line with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines. Published quantitative and qualitative studies that examined clinical assessment practices and tools used in clinical nursing education were retrieved. Quality assessment, data extraction, and analysis were completed on all included studies. This review screened 2073 titles, abstracts and full-text records, resulting in 33 included studies. Two reviewers assessed the quality of the included studies. Fourteen quantitative and qualitative studies were identified for this evaluation. The evidence was ordered into emergent themes; the overarching themes were current practices in clinical assessment, issues of learning and assessment, development of assessment tools, and reliability and validity of assessment tools. There is a need to develop a holistic clinical assessment tool with reasonable level of validity and reliability. Clinical assessment is a robust activity and requires collaboration between clinical partners and academia to enhance the clinical experiences of students, the professional development of preceptors, and the clinical credibility of academics. Copyright © 2014 Elsevier Ltd. All rights reserved.
Cheung, Yuen Man; Scoones, Gail; Stolker, Robert Jan; Weber, Frank
2018-04-16
To assess the thoughts of practicing anaesthesiologists about the use of depth of hypnosis monitors in children. Members of the European Society for Paediatric Anaesthesiology were invited to participate in an online survey about their thoughts regarding the use, applicability and reliability of hypnosis monitoring in children. The survey achieved a response rate of 30% (N = 168). A total of 138 completed surveys were included for further analysis. Sixty-eight respondents used hypnosis monitoring in children (Users) and 70 did not (Non-users). Sixty-five percent of the Users reported prevention of intra-operative awareness as their main reason to apply hypnosis monitoring. Among the Non-users, the most frequently given reason (43%) not to use hypnosis monitoring in children was the perceived lack or reliability of the devices in children. Hypnosis monitoring is used with a higher frequency during propofol anaesthesia than during inhalation anaesthesia. Hypnosis monitoring is furthermore used more frequently in children > 4 years than in younger children. An ideal hypnosis monitor should be reliable for all age groups and any (combination of) anaesthetic drug. We found no agreement in the interpretation of monitor index values and subsequent anaesthetic interventions following from it. Prevention of intraoperative awareness appears to be the most important reason to use hypnosis monitoring in children. The perceived lack of reliability of hypnosis monitoring in children is the most important reasons not to use it. No consensus currently exists on how to adjust anaesthesia according to hypnosis monitor index values in children.
Larsen, Camilla Marie; Juul-Kristensen, Birgit; Lund, Hans; Søgaard, Karen
2014-10-01
The aims were to compile a schematic overview of clinical scapular assessment methods and critically appraise the methodological quality of the involved studies. A systematic, computer-assisted literature search using Medline, CINAHL, SportDiscus and EMBASE was performed from inception to October 2013. Reference lists in articles were also screened for publications. From 50 articles, 54 method names were identified and categorized into three groups: (1) Static positioning assessment (n = 19); (2) Semi-dynamic (n = 13); and (3) Dynamic functional assessment (n = 22). Fifteen studies were excluded for evaluation due to no/few clinimetric results, leaving 35 studies for evaluation. Graded according to the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN checklist), the methodological quality in the reliability and validity domains was "fair" (57%) to "poor" (43%), with only one study rated as "good". The reliability domain was most often investigated. Few of the assessment methods in the included studies that had "fair" or "good" measurement property ratings demonstrated acceptable results for both reliability and validity. We found a substantially larger number of clinical scapular assessment methods than previously reported. Using the COSMIN checklist the methodological quality of the included measurement properties in the reliability and validity domains were in general "fair" to "poor". None were examined for all three domains: (1) reliability; (2) validity; and (3) responsiveness. Observational evaluation systems and assessment of scapular upward rotation seem suitably evidence-based for clinical use. Future studies should test and improve the clinimetric properties, and especially diagnostic accuracy and responsiveness, to increase utility for clinical practice.
Kosteniuk, Julie G; Wilson, Erin C; Penz, Kelly L; MacLeod, Martha L P; Stewart, Norma J; Kulig, Judith C; Karunanayake, Chandima P; Kilpatrick, Kelley
2016-01-01
To report the development and psychometric evaluation of a scale to measure rural and remote (rural/remote) nurses' perceptions of the engagement of their workplaces in key dimensions of primary health care (PHC). Amidst ongoing PHC reforms, a comprehensive instrument is needed to evaluate the degree to which rural/remote health care settings are involved in the key dimensions that characterize PHC delivery, particularly from the perspective of professionals delivering care. This study followed a three-phase process of instrument development and psychometric evaluation. A literature review and expert consultation informed instrument development in the first phase, followed by an iterative process of content evaluation in the second phase. In the final phase, a pilot survey was undertaken and item discrimination analysis employed to evaluate the internal consistency reliability of each subscale in the preliminary 60-item Primary Health Care Engagement (PHCE) Scale. The 60-item scale was subsequently refined to a 40-item instrument. The pilot survey sample included 89 nurses in current practice who had experience in rural/remote practice settings. Participants completed either a web-based or paper survey from September to December, 2013. Following item discrimination analysis, the 60-item instrument was refined to a 40-item PHCE Scale consisting of 10 subscales, each including three to five items. Alpha estimates of the 10 refined subscales ranged from 0.61 to 0.83, with seven of the subscales demonstrating acceptable reliability (α ⩾ 0.70). The refined 40-item instrument exhibited good internal consistency reliability (α=0.91). The 40-item PHCE Scale may be considered for use in future studies regardless of locale, to measure the extent to which health care professionals perceive their workplaces to be engaged in key dimensions of PHC.
NEPP DDR Device Reliability FY13 Report
NASA Technical Reports Server (NTRS)
Guertin, Steven M.; Armbar, Mehran
2014-01-01
This document reports the status of the NEPP Double Data Rate (DDR) Device Reliability effort for FY2013. The task targeted general reliability of > 100 DDR2 devices from Hynix, Samsung, and Micron. Detailed characterization of some devices when stressed by several data storage patterns was studied, targeting ability of the data cells to store the different data patterns without refresh, highlighting the weakest bits. DDR2, Reliability, Data Retention, Temperature Stress, Test System Evaluation, General Reliability, IDD measurements, electronic parts, parts testing, microcircuits
Dahlborn, K; Bugnon, P; Nevalainen, T; Raspa, M; Verbost, P; Spangenberg, E
2013-01-01
The primary aim of this report is to assist scientists in selecting more reliable/suitable identification (ID) methods for their studies. This is especially true for genetically altered (GA) animals where individual identification is strictly necessary to link samples, research design and genotype. The aim of this Federation of European Laboratory Animal Science Associations working group was to provide an update of the methods used to identify rodents in different situations and to assess their implications for animal welfare. ID procedures are an indispensable prerequisite for conducting good science but the degree of invasiveness differs between the different methods; therefore, one needs to make a good ethical evaluation of the method chosen. Based on the scientific literature the advantages and disadvantages of various methods have been presented comprehensively and this report is intended as a practical guide for researchers. New upcoming methods have been included next to the traditional techniques. Ideally, an ID method should provide reliable identification, be technically easy to apply and not inflict adverse effects on animals while taking into account the type of research. There is no gold standard method because each situation is unique; however, more studies are needed to better evaluate ID systems and the desirable introduction of new and modern approaches will need to be assessed by detailed scientific evaluation.
Development of a practice-based research program.
Hawk, C; Long, C R; Boulanger, K
1998-01-01
To establish an infrastructure to collect accurate data from ambulatory settings. The program was developed through an iterative model governed by a process of formative evaluation. The three iterations were a needs assessment, feasibility study and pilot project. Necessary program components were identified as infrastructure, practitioner-researcher partnership, centralized data management and standardized quality assurance measures. Volunteer chiropractors and their staff collected data on patients in their practices in ambulatory settings in the U.S. and Canada. Evaluative measures were counts of participants, patients and completed forms. Standardized, validated and reliable measures collected by patient self-report were used to assess treatment outcomes. These included the SF-36 or SF-12 Health Survey, the Pain Disability Index, and the Global Well-Being Scale. For characteristics for which appropriate standardized instruments were not available, questionnaires were designed and and pilot-tested before use. Information was gathered on practice and patient characteristics and treatment outcomes, but for this report, only those data concerning process evaluation are reported. Through the three program iterations, 65 DCs collected data on 1360 patients, 663 of whom were new patients. Follow-up data recorded by doctors were obtained for more than 70% of patients; a maximum of 50% of patient-completed follow-up forms were collected in the three iterations. This program is capable of providing data for descriptive epidemiology of ambulatory patients, and, with continued effort to maximize follow-up, may have utility in providing insight into utilization patterns and patient outcomes.
Current State of Test Development, Administration, and Analysis: A Study of Faculty Practices.
Bristol, Timothy J; Nelson, John W; Sherrill, Karin J; Wangerin, Virginia S
Developing valid and reliable test items is a critical skill for nursing faculty. This research analyzed the test item writing practice of 674 nursing faculty. Relationships between faculty characteristics and their test item writing practices were analyzed. Findings reveal variability in practice and a gap in implementation of evidence-based standards when developing and evaluating teacher-made examinations.
Jiang, Hualin; Zhang, Shaoru; Ding, Yi; Li, Yuelu; Zhang, Tianhua; Liu, Weiping; Fan, Yahui; Li, Yan; Zhang, Rongqiang; Ma, Xuexue
2017-12-12
China faces many challenges in controlling tuberculosis (TB). One significant challenge is the control of college students' TB. In particular, cross-sectional studies of college students' knowledge, attitudes and practices (KAP) in regard to TB have attracted substantial attention. However, few measurement tools have been developed to aid processes related to expert consultation, pre-testing, reliability and validity testing. Our study developed the College Students' TB Knowledge Attitudes and Practices Questionnaire (CS-TBKAPQ) following the scale development steps. The construction of the CS-TBKAPQ was based on the Theory of Knowledge, Attitude, Belief, and Practice (KABP or KAP). The item pool was compiled from literature reviews and individual interviews. The reliability validation was assessed by calculating Cronbach's α coefficient, the split-half reliability coefficient, and the test-retest reliability coefficient. Construct validity was assessed using exploratory factor analysis (EFA) and confirmatory factor analysis (CFA). The diagnostic accuracy was evaluated using the World Health Organization Advocacy, Communication and Social Mobilization KAP Survey Questionnaire (WHO-TBKAPQ) as the reference standard. A total of 31 questionnaire items were proposed. Cronbach's α coefficient, the split-half reliability coefficient and the test-retest reliability coefficient were 0.86, 0.78 and 0.91. Four factors that explained 62.52% of the total variance were also identified in EFA and confirmed in CFA. The CFA model fit indices were x 2 /df = 1.82 (p < 0.001), GFI = 0.925, AGFI = 0.900, RMR = 0.068, and RMSEA = 0.049. The CS-TBKAPQ was significantly correlated with the WHO-TBKAPQ and the Chinese Public TB KAP Questionnaire (CDC-TBKAPQ) developed by the Chinese Center for Disease Control and Prevention (r = 0.59, 0.60, p < 0.001). The receiver operating characteristics curve (ROC) analysis suggested a cut-off point of 47.5, with which the CS-TBKAPQ showed a sensitivity of 73.63% and a specificity of 80.51% in identifying students with low-level KAP. The positive and negative predictive values were 83.23% and 69.91%. The findings of this study demonstrate that the CS-TBKAPQ is a reliable and valid tool for measuring the KAP towards TB in college students.
Assessment of patient-reported symptoms of anxiety
Rose, Matthias; Devine, Janine
2014-01-01
Patient self-reported symptoms are of crucial importance to identify anxiety disorders, as well as to monitor their treatment in clinical practice and research. Thus, for evidence-based medicine, a precise, reliable, and valid (ie, “objective”) assessment of the patient's reported “subjective” symptoms is warranted. There is a plethora of instruments available, which can provide psychometrically sound assessments of anxiety, but there are several limitations of current tools that need to be carefully considered for their successful use. Nevertheless, the empirical assessment of mental health status is not as accepted in medicine as is the assessment of biomarkers. One reason for this may be that different instruments assessing the same psychological construct use different scales. In this paper we present some new developments that promise to provide one common metric for the assessment of anxiety, to facilitate the general acceptance of mental health assessments in the future. PMID:25152658
Mahoney, Diane Feeney; Jones, Richard N; Coon, David W; Mendelsohn, Aaron B; Gitlin, Laura N; Ory, Marcia
2003-01-01
This article reports on the measurement properties of Caregiver Vigilance, a four-item caregiver self-report of perceived oversight demand for Alzheimer's disease and related disorders family caregiving. The self-report uses data from the Resources for Enhancing Alzheimer's Caregiver Health (REACH) project, a multisite National Institutes of Health (NIH) sponsored study of over 1,200 family caregivers. Results indicate that the items were clearly understood by the racially/ethnically diverse respondents; and, when responses were transformed into a summary scale and analyzed, unidimensionality was evident and internal consistency reliability favorably demonstrated. We suggest using the Caregiver Vigilance Scale in conjunction with traditional burden measures to systematically include the caregiving time associated with protectively watching over care recipients and the daily duration of this responsibility. We also highlight the practical utility of selected items for potential use in the caregiver assessment process.
Jarvis, Michael F; Williams, Michael
2016-04-01
Concerns regarding the reliability of biomedical research outcomes were precipitated by two independent reports from the pharmaceutical industry that documented a lack of reproducibility in preclinical research in the areas of oncology, endocrinology, and hematology. Given their potential impact on public health, these concerns have been extensively covered in the media. Assessing the magnitude and scope of irreproducibility is limited by the anecdotal nature of the initial reports and a lack of quantitative data on specific failures to reproduce published research. Nevertheless, remediation activities have focused on needed enhancements in transparency and consistency in the reporting of experimental methodologies and results. While such initiatives can effectively bridge knowledge gaps and facilitate best practices across established and emerging research disciplines and therapeutic areas, concerns remain on how these improve on the historical process of independent replication in validating research findings and their potential to inhibit scientific innovation. Copyright © 2015 Elsevier Ltd. All rights reserved.
Civil Forensic Psychiatry - part 3: practical aspects of managing a medico-legal practice.
Samuels, Anthony H
2018-06-01
This is the third in a series of papers on Civil Forensic Psychiatry and provides practical advice for forensic psychiatrists, general psychiatrists and trainees who are expanding or contemplating a medico-legal aspect to their practice. Attention to the practice setting, office layout, recording of information, management of documentation, screening of briefs and proper timetabling can improve safety, quality, reliability and workload manageability.
Food allergy: opportunities and challenges in the clinical practice of allergy and immunology.
James, John M
2004-10-01
Food allergy offers numerous opportunities and challenges for the allergy and clinical immunology specialist. Physicians with board certification in allergy and clinical immunology should be the main source of reliable clinical information to educate patients with food-related disorders. There has been a wealth of reliable information published related to food allergy that can be utilized by health care providers in clinical practice. This includes information about the cross-reactivity of food allergens, the evaluation of potential new therapies, and the practical application of new diagnostic methods and management strategies. This article addresses some of the new developments in food allergy, with an emphasis on cross-reactvity of food allergens, recombinant food allergens, and potential future therapies for food allergy.
Test-Retest Reliability and Practice Effects of the Stability Evaluation Test.
Williams, Richelle M; Corvo, Matthew A; Lam, Kenneth C; Williams, Travis A; Gilmer, Lesley K; McLeod, Tamara C Valovich
2017-01-17
Postural control plays an essential role in concussion evaluation. The Stability Evaluation Test (SET) aims to objectively analyze postural control by measuring sway velocity on the NeuroCom's VSR portable force platform (Natus, San Carlos, CA). To assess the test-retest reliability and practice effects of the SET protocol. Cohort. Research Laboratory. Fifty healthy adults (males=20, females=30, age=25.30±3.60 years, height=166.60±12.80 cm, mass=68.80±13.90 kg). All participants completed four trials of the SET. Each trial consisted of six 20-second balance tests with eyes closed, under the following conditions: double-leg firm (DFi), single-leg firm (SFi), tandem firm (TFi), double-leg foam (DFo), single-leg foam (SFo), and tandem foam (TFo). Each trial was separated by a 5-minute seated rest period. The dependent variable was sway velocity (deg/sec), with lower values indicating better balance. Sway velocity was recorded for each of the six conditions as well as a composite score for each trial. Test-retest reliability was analyzed across four trials with Intraclass Correlation Coefficients. Practice effects analyzed with repeated measures analysis of variance, followed by Tukey post-hoc comparisons for any significant main effects (p<.05). Sway velocity reliability values were good to excellent: DFi (ICC=0.88;95%CI:0.81,0.92), SFi (ICC=0.75;95%CI:0.61,0.85), TFi (ICC=0.84;95%CI:0.75,0.90), DFo (ICC=0.83;95%CI:0.74,0.90), SFo (ICC=0.82;95%CI:0.72,0.89), TFo (ICC=0.81;95%CI:0.69,0.88), and composite score (ICC=0.93;95%CI:0.88,0.95). Significant practice effects (p<.05) were noted on the SFi, DFo, SFo, TFo conditions, and composite scores. Our results suggest the SET has good to excellent reliability for the assessment of postural control in healthy adults. Due to the practice effects noted, a familiarization session is recommended (i.e., all 6 conditions) prior to recording the data. Future studies should evaluate injured patients to determine meaningful change scores during various injuries.
10 CFR 712.12 - HRP implementation.
Code of Federal Regulations, 2012 CFR
2012-01-01
... DEPARTMENT OF ENERGY HUMAN RELIABILITY PROGRAM Establishment of and Procedures for the Human Reliability...) Report any observed or reported behavior or condition of another HRP-certified individual that could indicate a reliability concern, including those behaviors and conditions listed in § 712.13(c), to a...
Study of structural reliability of existing concrete structures
NASA Astrophysics Data System (ADS)
Druķis, P.; Gaile, L.; Valtere, K.; Pakrastiņš, L.; Goremikins, V.
2017-10-01
Structural reliability of buildings has become an important issue after the collapse of a shopping center in Riga 21.11.2013, caused the death of 54 people. The reliability of a building is the practice of designing, constructing, operating, maintaining and removing buildings in ways that ensure maintained health, ward suffered injuries or death due to use of the building. Evaluation and improvement of existing buildings is becoming more and more important. For a large part of existing buildings, the design life has been reached or will be reached in the near future. The structures of these buildings need to be reassessed in order to find out whether the safety requirements are met. The safety requirements provided by the Eurocodes are a starting point for the assessment of safety. However, it would be uneconomical to require all existing buildings and structures to comply fully with these new codes and corresponding safety levels, therefore the assessment of existing buildings differs with each design situation. This case study describes the simple and practical procedure of determination of minimal reliability index β of existing concrete structures designed by different codes than Eurocodes and allows to reassess the actual reliability level of different structural elements of existing buildings under design load.
The quality improvement attitude survey: Development and preliminary psychometric characteristics.
Dunagan, Pamela B
2017-12-01
To report the development of a tool to measure nurse's attitudes about quality improvement in their practice setting and to examine preliminary psychometric characteristics of the Quality Improvement Nursing Attitude Scale. Human factors such as nursing attitudes of complacency have been identified as root causes of sentinel events. Attitudes of nurses concerning use of Quality and Safety Education for nurse's competencies can be most challenging to teach and to change. No tool has been developed measuring attitudes of nurses concerning their role in quality improvement. A descriptive study design with preliminary psychometric evaluation was used to examine the preliminary psychometric characteristics of the Quality Improvement Nursing Attitude Scale. Registered bedside clinical nurses comprised the sample for the study (n = 57). Quantitative data were analysed using descriptive statistics and Cronbach's alpha reliability. Total score and individual item statistics were evaluated. Two open-ended items were used to collect statements about nurses' feelings regarding their experience in quality improvement efforts. Strong support for the internal consistency reliability and face validity of the Quality Improvement Nursing Attitude Scale was found. Total scale scores were high indicating nurse participants valued Quality and Safety Education for Nurse competencies in practice. However, item-level statistics indicated nurses felt powerless when other nurses deviate from care standards. Additionally, the sample indicated they did not consistently report patient safety issues and did not have a feeling of value in efforts to improve care. Findings suggested organisational culture fosters nurses' reporting safety issues and feeling valued in efforts to improve care. Participants' narrative comments and item analysis revealed the need to generate new items for the Quality Improvement Nursing Attitude Scale focused on nurses' perception of their importance in quality and safety and their power to enact principles. The Quality Improvement Nursing Attitude Scale-Revised edition was designed to help in understanding nurses' attitudes and values. It can be used to further explore broad concepts of quality improvement efforts. © 2017 John Wiley & Sons Ltd.
Assuring Electronics Reliability: What Could and Should Be Done Differently
NASA Astrophysics Data System (ADS)
Suhir, E.
The following “ ten commandments” for the predicted and quantified reliability of aerospace electronic, and photonic products are addressed and discussed: 1) The best product is the best compromise between the needs for reliability, cost effectiveness and time-to-market; 2) Reliability cannot be low, need not be higher than necessary, but has to be adequate for a particular product; 3) When reliability is imperative, ability to quantify it is a must, especially if optimization is considered; 4) One cannot design a product with quantified, optimized and assured reliability by limiting the effort to the highly accelerated life testing (HALT) that does not quantify reliability; 5) Reliability is conceived at the design stage and should be taken care of, first of all, at this stage, when a “ genetically healthy” product should be created; reliability evaluations and assurances cannot be delayed until the product is fabricated and shipped to the customer, i.e., cannot be left to the prognostics-and-health-monitoring/managing (PHM) stage; it is too late at this stage to change the design or the materials for improved reliability; that is why, when reliability is imperative, users re-qualify parts to assess their lifetime and use redundancy to build a highly reliable system out of insufficiently reliable components; 6) Design, fabrication, qualification and PHM efforts should consider and be specific for particular products and their most likely actual or at least anticipated application(s); 7) Probabilistic design for reliability (PDfR) is an effective means for improving the state-of-the-art in the field: nothing is perfect, and the difference between an unreliable product and a robust one is “ merely” the probability of failure (PoF); 8) Highly cost-effective and highly focused failure oriented accelerated testing (FOAT) geared to a particular pre-determined reliability model and aimed at understanding the physics of failure- anticipated by this model is an important constituent part of the PDfR effort; 9) Predictive modeling (PM) is another important constituent of the PDfR approach; in combination with FOAT, it is a powerful means to carry out sensitivity analyses (SA), to quantify and nearly eliminate failures (“ principle of practical confidence” ); 10) Consistent, comprehensive and physically meaningful PDfR can effectively contribute to the most feasible and the most effective qualification test (QT) methodologies, practices and specifications. The general concepts addressed in the paper are illustrated by numerical examples. It is concluded that although the suggested concept is promising and fruitful, further research, refinement, and validations are needed before this concept becomes widely accepted by the engineering community and implemented into practice. It is important that this novel approach is introduced gradually, whenever feasible and appropriate, in addition to, and in some situations even instead of, the currently employed various types and modifications of the forty year old HALT.
Trusted Care In The Air Force Medical Service: Practical Recommendations For Transformation
2016-02-16
Organizing for High Reliability: Processes of Collective Mindfulness,” in Research in Organizational Behavior, ed. R.S. Sutton and B.M. Staw...Karl E.Weick, Kathleen M. Sutcliffe, and David Obstfeld, “ Organizing for High Reliability: Processes of Collective Mindfulness,” in Research in...La Porte, “High Reliability Organizations : Unlikely, Demanding and At Risk.” Journal of Contingencies and Crisis Management 4, no. 2 (June 1996): 60
de Vries, Nienke M; Staal, J Bart; Olde Rikkert, Marcel G M; Nijhuis-van der Sanden, Maria W G
2013-04-01
Physical activity is assumed to be important in the prevention and treatment of frailty. It is unclear, however, to what extent frailty can be influenced because instruments designed to assess frailty have not been validated as evaluative outcome instruments in clinical practice. The aims of this study were: (1) to develop a frailty index (i.e., the evaluative frailty index for physical activity [EFIP]) based on the method of deficit accumulation and (2) to test the clinimetric properties of the EFIP. The content of the EFIP was determined using a written Delphi procedure. Intrarater reliability, interrater reliability, and construct validity were determined in an observational study (n=24). Intrarater reliability and interrater reliability were calculated using Cohen kappa and intraclass correlation coefficients (ICCs). Construct validity was determined by correlating the score on the EFIP with those on the timed "up & go" test (TUG), the performance-oriented mobility assessment (POMA), and the Cumulative Illness Rating Scale for Geriatrics (CIRS-G). Fifty items were included in the EFIP. Interrater reliability (Cohen kappa=0.72, ICC=.96) and intrarater reliability (Cohen kappa=0.77 and 0.80, ICC=.93 and .98) were good. As expected, a fair to moderate correlation with the TUG, POMA, and CIRS-G was found (.61, -.70, and .66, respectively). Reliability and validity of the EFIP have been tested in a small sample. These and other clinimetric properties, such as responsiveness, will be assessed or reassessed in a larger study population. The EFIP is a reliable and valid instrument to evaluate the effect of physical activity on frailty in research and in clinical practice.
Lenzlinger-Asprion, Rahel; Keller, Niculina; Meichtry, André; Luomajoki, Hannu
2017-01-31
Hip joint complaints are a problem associated with increasing age and impair the mobility of a large section of the elderly population. Reliable and valid tests are necessary for a thorough investigation of a joint. A fundamental function of the hip joint is movement control and a test of this function forms a part of the standard examination. Until now there have been few scientific studies which specifically investigate the reliability of measurement tests of movement control of the hip joint. The aim of this study was to examine the intratester and intertester reliability of the movement control tests of the hip joint which are in use in current clinical practice. Sixteen participants with hip joint complaints and 14 without hip joint impairment were recruited. All participants performed five active movement control tests for the hip joint and were video filmed whilst performing these tests. These films formed the basis for the evaluation and were assessed by two independent physiotherapists. For the intertester and intratester reliability calculations specially set weighted kappa values and the calculated percentages were used. The intertester reliability of the five examined movement control tests of the hip joint showed good to almost perfect values (weighted kappa (wk) = 0.56-0.87). The intratester reliability of the more experienced evaluator A was better in regards to the less experienced evaluator B (average wk = 0.62 vs 0.38). The visual evaluation of movement control tests of the hip joint is especially reliable when carried out by an experienced evaluator. 4 out of 5 tests also showed good results for intertester reliability and support their use in clinical practice.
Reliability analysis in the Office of Safety, Environmental, and Mission Assurance (OSEMA)
NASA Astrophysics Data System (ADS)
Kauffmann, Paul J.
1994-12-01
The technical personnel in the SEMA office are working to provide the highest degree of value-added activities to their support of the NASA Langley Research Center mission. Management perceives that reliability analysis tools and an understanding of a comprehensive systems approach to reliability will be a foundation of this change process. Since the office is involved in a broad range of activities supporting space mission projects and operating activities (such as wind tunnels and facilities), it was not clear what reliability tools the office should be familiar with and how these tools could serve as a flexible knowledge base for organizational growth. Interviews and discussions with the office personnel (both technicians and engineers) revealed that job responsibilities ranged from incoming inspection to component or system analysis to safety and risk. It was apparent that a broad base in applied probability and reliability along with tools for practical application was required by the office. A series of ten class sessions with a duration of two hours each was organized and scheduled. Hand-out materials were developed and practical examples based on the type of work performed by the office personnel were included. Topics covered were: Reliability Systems - a broad system oriented approach to reliability; Probability Distributions - discrete and continuous distributions; Sampling and Confidence Intervals - random sampling and sampling plans; Data Analysis and Estimation - Model selection and parameter estimates; and Reliability Tools - block diagrams, fault trees, event trees, FMEA. In the future, this information will be used to review and assess existing equipment and processes from a reliability system perspective. An analysis of incoming materials sampling plans was also completed. This study looked at the issues associated with Mil Std 105 and changes for a zero defect acceptance sampling plan.
Reliability analysis in the Office of Safety, Environmental, and Mission Assurance (OSEMA)
NASA Technical Reports Server (NTRS)
Kauffmann, Paul J.
1994-01-01
The technical personnel in the SEMA office are working to provide the highest degree of value-added activities to their support of the NASA Langley Research Center mission. Management perceives that reliability analysis tools and an understanding of a comprehensive systems approach to reliability will be a foundation of this change process. Since the office is involved in a broad range of activities supporting space mission projects and operating activities (such as wind tunnels and facilities), it was not clear what reliability tools the office should be familiar with and how these tools could serve as a flexible knowledge base for organizational growth. Interviews and discussions with the office personnel (both technicians and engineers) revealed that job responsibilities ranged from incoming inspection to component or system analysis to safety and risk. It was apparent that a broad base in applied probability and reliability along with tools for practical application was required by the office. A series of ten class sessions with a duration of two hours each was organized and scheduled. Hand-out materials were developed and practical examples based on the type of work performed by the office personnel were included. Topics covered were: Reliability Systems - a broad system oriented approach to reliability; Probability Distributions - discrete and continuous distributions; Sampling and Confidence Intervals - random sampling and sampling plans; Data Analysis and Estimation - Model selection and parameter estimates; and Reliability Tools - block diagrams, fault trees, event trees, FMEA. In the future, this information will be used to review and assess existing equipment and processes from a reliability system perspective. An analysis of incoming materials sampling plans was also completed. This study looked at the issues associated with Mil Std 105 and changes for a zero defect acceptance sampling plan.
Visual judgements of steadiness in one-legged stance: reliability and validity.
Haupstein, T; Goldie, P
2000-01-01
There is a paucity of information about the validity and reliability of clinicians' visual judgements of steadiness in one-legged stance. Such judgements are used frequently in clinical practice to support decisions about treatment in the fields of neurology, sports medicine, paediatrics and orthopaedics. The aim of the present study was to address the validity and reliability of visual judgements of steadiness in one-legged stance in a group of physiotherapists. A videotape of 20 five-second performances was shown to 14 physiotherapists with median clinical experience of 6.75 years. Validity of visual judgement was established by correlating scores obtained from an 11-point rating scale with criterion scores obtained from a force platform. In addition, partial correlations were used to control for the potential influence of body weight on the relationship between the visual judgements and criterion scores. Inter-observer reliability was quantified between the physiotherapists; intra-observer reliability was quantified between two tests four weeks apart. Mean criterion-related validity was high, regardless of whether body weight was controlled for statistically (Pearson's r = 0.84, 0.83, respectively). The standard error of estimating the criterion score was 3.3 newtons. Inter-observer reliability was high (ICC (2,1) = 0.81 at Test 1 and 0.82 at Test 2). Intra-observer reliability was high (on average ICC (2,1) = 0.88; Pearson's r = 0.90). The standard error of measurement for the 11-point scale was one unit. The finding of higher accuracy of making visual judgements than previously reported may be due to several aspects of design: use of a criterion score derived from the variability of the force signal which is more discriminating than variability of centre of pressure; use of a discriminating visual rating scale; specificity and clear definition of the phenomenon to be rated.
Dontje, Manon L; Dall, Philippa M; Skelton, Dawn A; Gill, Jason M R; Chastin, Sebastien F M
2018-01-01
Prolonged sedentary behaviour (SB) is associated with poor health. It is unclear which SB measure is most appropriate for interventions and population surveillance to measure and interpret change in behaviour in older adults. The aims of this study: to examine the relative and absolute reliability, Minimal Detectable Change (MDC) and responsiveness to change of subjective and objective methods of measuring SB in older adults and give recommendations of use for different study designs. SB of 18 older adults (aged 71 (IQR 7) years) was assessed using a systematic set of six subjective tools, derived from the TAxonomy of Self report Sedentary behaviour Tools (TASST), and one objective tool (activPAL3c), over 14 days. Relative reliability (Intra Class Correlation coefficients-ICC), absolute reliability (SEM), MDC, and the relative responsiveness (Cohen's d effect size (ES) and Guyatt's Responsiveness coefficient (GR)) were calculated for each of the different tools and ranked for different study designs. ICC ranged from 0.414 to 0.946, SEM from 36.03 to 137.01 min, MDC from 1.66 to 8.42 hours, ES from 0.017 to 0.259 and GR from 0.024 to 0.485. Objective average day per week measurement ranked as most responsive in a clinical practice setting, whereas a one day measurement ranked highest in quasi-experimental, longitudinal and controlled trial study designs. TV viewing-Previous Week Recall (PWR) ranked as most responsive subjective measure in all study designs. The reliability, Minimal Detectable Change and responsiveness to change of subjective and objective methods of measuring SB is context dependent. Although TV viewing-PWR is the more reliable and responsive subjective method in most situations, it may have limitations as a reliable measure of total SB. Results of this study can be used to guide choice of tools for detecting change in sedentary behaviour in older adults in the contexts of population surveillance, intervention evaluation and individual care.
Eisenlohr-Moul, Tory A.; Girdler, Susan S.; Schmalenberger, Katja M.; Dawson, Danyelle N.; Surana, Pallavi; Johnson, Jacqueline L.; Rubinow, David R.
2016-01-01
Objective Despite evidence for the validity of premenstrual dysphoric disorder (PMDD) and its recent inclusion in DSM-5, variable diagnostic practices compromise the construct validity of the diagnosis and threaten the clarity of efforts to understand and treat its underlying pathophysiology. In an effort to hasten and streamline the translation of the new DSM-5 criteria for PMDD into terms compatible with existing research practices, we present the development and initial validation of the Carolina Premenstrual Assessment Scoring System (C-PASS). The C-PASS is a standardized scoring system for making DSM-5 PMDD diagnoses using 2 or more menstrual cycles of daily symptom ratings using the Daily Record of Severity of Problems (DRSP). Method Two hundred women recruited for retrospectively-reported premenstrual emotional symptoms provided 2–4 menstrual cycles of daily symptom ratings on the DRSP. Diagnoses were made by expert clinician and the C-PASS. Results Agreement of C-PASS diagnosis with expert clinical diagnosis was excellent; overall correct classification by the C-PASS was estimated at 98%. Consistent with previous evidence, retrospective reports of premenstrual symptom increases were a poor predictor of prospective C-PASS diagnosis. Conclusions The C-PASS (available as a worksheet, Excel macro, and SAS macro) is a reliable and valid companion protocol to the DRSP that standardizes and streamlines the complex, multilevel diagnosis of DSM-5 PMDD. Consistent use of this robust diagnostic method would result in more clearly-defined, homogeneous samples of women with PMDD, thereby improving the clarity of studies seeking to characterize or treat the underlying pathophysiology of the disorder. PMID:27523500
A Statewide Quality Improvement Collaborative to Increase Breastfeeding Rates in Tennessee.
Ware, Julie L; Schetzina, Karen E; Morad, Anna; Barker, Brenda; Scott, Theresa A; Grubb, Peter H
2018-05-01
Tennessee has low breastfeeding rates and has identified opportunities for improvement to enhance maternity practices to support breastfeeding mothers. We sought a 10% relative increase in the aggregate Joint Commission measure of breastfeeding exclusivity at discharge (TJC PC-05) by focusing on high-reliability (≥90%) implementation of processes that promote breastfeeding in the delivery setting. A statewide, multidisciplinary development team reviewed evidence from the WHO-UNICEF "Ten Steps to Successful Breastfeeding" to create a consensus toolkit of process indicators aligned with the Ten Steps. Hospitals submitted monthly TJC PC-05 data for 6 months while studying local implementation of the Ten Steps to identify improvement opportunities, and for an additional 11 months while conducting tests of change to improve Ten Steps implementation using Plan-Do-Study-Act cycles, local process audits, and control charts. Data were aggregated at the state level and presented at 12 monthly webinars, 3 regional learning sessions, and 1 statewide meeting where teams shared their local data and implementation experiences. Thirteen hospitals accounting for 47% of live births in Tennessee submitted data on 31,183 mother-infant dyads from August 1, 2012, to December 31, 2013. Aggregate monthly mean PC-05 demonstrated "special cause" improvement increasing from 37.1% to 41.2%, an 11.1% relative increase. Five hospitals reported implementation of ≥5 of the Ten Steps and two hospitals reported ≥90% reliability on ≥5 of the Ten Steps using locally designed process audits. Using large-scale improvement methodology, a successful statewide collaborative led to >10% relative increase in breastfeeding exclusivity at discharge in participating Tennessee hospitals. Further opportunities for improvement in implementing breastfeeding supportive practices were identified.
German dentists' websites on periodontitis have low quality of information.
Schwendicke, Falk; Stange, Jörg; Stange, Claudia; Graetz, Christian
2017-08-02
The internet is an increasingly relevant source of health information. We aimed to assess the quality of German dentists' websites on periodontitis, hypothesizing that it was significantly associated with a number of practice-specific parameters. We searched four electronic search engines and included pages which were freely accessible, posted by a dental practice in Germany, and mentioned periodontal disease/therapy. Websites were assessed for (1) technical and functional aspects, (2) generic quality and risk of bias, (3) disease-specific information. For 1 and 2, validated tools (LIDA/DISCERN) were used for assessment. For 3, we developed a criterion catalogue encompassing items on etiologic and prognostic factors for periodontitis, the diagnostic and treatment process, and the generic chance of tooth retention in periodontitis patients. Inter- and intra-rater reliabilities were largely moderate. Generalized linear modeling was used to assess the association between the information quality (measured as % of maximally available scores) and practice-specific characteristics. Seventy-one websites were included. Technical and functional aspects were reported in significantly higher quality (median: 71%, 25/75th percentiles: 67/79%) than all other aspects (p < 0.05). Generic risk of bias and most disease-specific aspects showed significantly lower reporting quality (median range was 0-40%), with poorest reporting for prognostic factors (9;0/27%), diagnostic process (0;0/33%) and chances of tooth retention (0;0/2%). We found none of the practice-specific parameters to have significant impact on the overall quality of the websites. Most German dentists' websites on periodontitis are not fully trustworthy and relevant information are not or insufficiently considered. There is great need to improve the information quality from such websites at least with regards to periodontitis.
Tool for Evaluating the Ways Nurses Assess Pain (TENAP): psychometric properties assessment.
Ng, Siok Qi; Brammer, Jillian; Creedy, Debra K; Klainin-Yobas, Piyanee
2014-12-01
Elderly people with cognitive impairment are at risk for under-treatment of pain due to their inability to communicate. Poor knowledge and attitudes of nurses toward pain in this population may result in inadequate pain assessment. This study used a descriptive correlational design to develop and validate a tool to assess nurses' knowledge, attitudes, and reported practice of pain assessment in cognitively impaired elderly patients in acute care settings. The Tool for Evaluating the ways Nurses Assess Pain (TENAP) has two sections: (1) nurses' knowledge and attitudes about pain assessment and management and (2) two vignettes to assess reported practice. Content validity was established by an expert panel of three geriatric-trained nurse clinicians, and pilot tested with a convenience sample of 10 nurses. The psychometric properties were tested with a sample of 263 Registered and Enrolled nurses working in medical wards of two public hospitals in Singapore. The final version of TENAP comprised 29 items. Content validity index ranged from 0.84 to 1.00. The scale took 10 to 15 minutes to complete and items were easily understood. Results from the factor analysis suggested that Section A demonstrated one factor (13 items) while Section B had two distinct factors (16 items), one for each vignette, supporting construct validity of the scale. Cronbach's alphas for all factors were acceptable. TENAP was feasible, valid, and reliable for assessing nurses' knowledge, attitudes, and reported practice of pain assessment in cognitively-impaired elderly patients. Further testing of the tool with a larger sample of nurses in other practice contexts is needed. Copyright © 2014 American Society for Pain Management Nursing. Published by Elsevier Inc. All rights reserved.
A flight test of laminar flow control leading-edge systems
NASA Technical Reports Server (NTRS)
Fischer, M. C.; Wright, A. S., Jr.; Wagner, R. D.
1983-01-01
NASA's program for development of a laminar flow technology base for application to commercial transports has made significant progress since its inception in 1976. Current efforts are focused on development of practical reliable systems for the leading-edge region where the most difficult problems in applying laminar flow exist. Practical solutions to these problems will remove many concerns about the ultimate practicality of laminar flow. To address these issues, two contractors performed studies, conducted development tests, and designed and fabricated fully functional leading-edge test articles for installation on the NASA JetStar aircraft. Systems evaluation and performance testing will be conducted to thoroughly evaluate all system capabilities and characteristics. A simulated airline service flight test program will be performed to obtain the operational sensitivity, maintenance, and reliability data needed to establish that practical solutions exist for the difficult leading-edge area of a future commercial transport employing laminar flow control.
Applications of ultrasensitive magnetic measurement technologies (invited) (abstract)
NASA Astrophysics Data System (ADS)
Hirschkoff, Eugene C.
1993-05-01
The development of reliable, easy-to-use magnetic measurement systems with significantly enhanced levels of sensitivity has opened up a number of broad new areas of application for magnetic sensing. Magnetometers based on optical pumping offer sensitivities at the picotesla level, while those that utilize superconducting quantum interference devices can operate at the femtotesla level. These systems are finding applications in areas as diverse as geophysical exploration, communications, and medical diagnostics. This review briefly surveys the capabilities and application areas for a number of magnetic sensing technologies. The emphasis then focuses on the application of the most sensitive of these to the field of medical diagnostics and functional imaging. Protocols for specific applications to noninvasive presurgical planning and to the noninvasive assay of cortical dysfunction in diseases ranging from epilepsy to migraine and schizophrenia will be described in detail. Data will be presented reporting independent validation of these techniques in ten patients who subsequently underwent surgery. Routine and reliable utilization of this ultrasensitive magnetic sensing technology in the clinic is now feasible and practical.
Reliable and efficient solution of genome-scale models of Metabolism and macromolecular Expression
Ma, Ding; Yang, Laurence; Fleming, Ronan M. T.; ...
2017-01-18
Currently, Constraint-Based Reconstruction and Analysis (COBRA) is the only methodology that permits integrated modeling of Metabolism and macromolecular Expression (ME) at genome-scale. Linear optimization computes steady-state flux solutions to ME models, but flux values are spread over many orders of magnitude. Data values also have greatly varying magnitudes. Furthermore, standard double-precision solvers may return inaccurate solutions or report that no solution exists. Exact simplex solvers based on rational arithmetic require a near-optimal warm start to be practical on large problems (current ME models have 70,000 constraints and variables and will grow larger). We also developed a quadrupleprecision version of ourmore » linear and nonlinear optimizer MINOS, and a solution procedure (DQQ) involving Double and Quad MINOS that achieves reliability and efficiency for ME models and other challenging problems tested here. DQQ will enable extensive use of large linear and nonlinear models in systems biology and other applications involving multiscale data.« less
Chan, Chitat; Fung, Hong Wang; Choi, Tat Ming; Ross, Colin A
2017-01-01
Identifying dissociation is important for mental health services because it could fundamentally affect one's diagnosis and treatment plan. The Dissociative Experiences Scale (DES) is a widely-used self-report scale for measuring dissociative experiences. It has been translated into many languages and used in many countries. However, there is no validated Hong Kong Chinese version of the DES available in the field, and there is no other validated Hong Kong Chinese instrument for assessing dissociative disorders. This pilot study used online methods to translate the DES to Hong Kong Chinese (HKC-DES). The results indicated that the HKC-DES has excellent internal consistency (α = .953) and very good test-retest reliability (r = .797). Bilingual participants' responses to the DES and HKC-DES indicated high similarity, and were significantly correlated (r = .960). These results initially verified the reliability and cross-language equivalence of the scale. Implications for healthcare practice and research are discussed.