Ishman, Stacey L; Benke, James R; Johnson, Kaalan Erik; Zur, Karen B; Jacobs, Ian N; Thorne, Marc C; Brown, David J; Lin, Sandra Y; Bhatti, Nasir; Deutsch, Ellen S
2012-10-01
OBJECTIVES To confirm interrater reliability using blinded evaluation of a skills-assessment instrument to assess the surgical performance of resident and fellow trainees performing pediatric direct laryngoscopy and rigid bronchoscopy in simulated models. DESIGN Prospective, paired, blinded observational validation study. SUBJECTS Paired observers from multiple institutions simultaneously evaluated residents and fellows who were performing surgery in an animal laboratory or using high-fidelity manikins. The evaluators had no previous affiliation with the residents and fellows and did not know their year of training. INTERVENTIONS One- and 2-page versions of an objective structured assessment of technical skills (OSATS) assessment instrument composed of global and a task-specific surgical items were used to evaluate surgical performance. RESULTS Fifty-two evaluations were completed by 17 attending evaluators. The instrument agreement for the 2-page assessment was 71.4% when measured as a binary variable (ie, competent vs not competent) (κ = 0.38; P = .08). Evaluation as a continuous variable revealed a 42.9% percentage agreement (κ = 0.18; P = .14). The intraclass correlation was 0.53, considered substantial/good interrater reliability (69% reliable). For the 1-page instrument, agreement was 77.4% when measured as a binary variable (κ = 0.53, P = .0015). Agreement when evaluated as a continuous measure was 71.0% (κ = 0.54, P < .001). The intraclass correlation was 0.73, considered high interrater reliability (85% reliable). CONCLUSIONS The OSATS assessment instrument is an effective tool for evaluating surgical performance among trainees with acceptable interrater reliability in a simulator setting. Reliability was good for both the 1- and 2-page OSATS checklists, and both serve as excellent tools to provide immediate formative feedback on operational competency.
Urrutia, Julio; Zamora, Tomas; Yurac, Ratko; Campos, Mauricio; Palma, Joaquin; Mobarec, Sebastian; Prada, Carlos
2017-03-01
An agreement study. The aim of this study was to perform an independent interobserver and intraobserver agreement assessment of the AOSpine subaxial cervical spine injury classification system. The AOSpine subaxial cervical spine injury classification system was recently described. It showed substantial inter- and intraobserver agreement in the study describing it; however, an independent evaluation has not been performed. Anteroposterior and lateral radiographs, computed tomography scans, and magnetic resonance imaging of 65 patients with acute traumatic subaxial cervical spine injuries were selected and classified using the morphologic grading of the subaxial cervical spine injury classification system by 6 evaluators (3 spine surgeons and 3 orthopedic surgery residents). After a 6-week interval, the 65 cases were presented to the same evaluators in a random sequence for repeat evaluation. The kappa coefficient (κ) was used to determine the inter- and intraobserver agreement. The interobserver agreement was substantial when considering the fracture main types (A, B, C, or F), with κ = 0.61 (0.57-0.64), but moderate when considering the subtypes: κ = 0.57 (0.54-0.60). The intraobserver agreement was substantial considering the fracture types, with κ = 0.68 (0.62-0.74) and considering subtypes, κ = 0.62 (0.57-0.66). No significant differences were observed between spine surgeons and orthopedic residents in the overall inter- and intraobserver agreement, or in the inter- and intraobserver agreement of specific A, B, C, or F type of injuries. This classification allows adequate agreement among different observers and by the same observer on separate occasions. Future prospective studies should determine whether this classification allows surgeons to decide the best treatment for patients with subaxial cervical spine injuries. 3.
Boguslav, Mayla; Cohen, Kevin Bretonnel
2017-01-01
Human-annotated data is a fundamental part of natural language processing system development and evaluation. The quality of that data is typically assessed by calculating the agreement between the annotators. It is widely assumed that this agreement between annotators is the upper limit on system performance in natural language processing: if humans can't agree with each other about the classification more than some percentage of the time, we don't expect a computer to do any better. We trace the logical positivist roots of the motivation for measuring inter-annotator agreement, demonstrate the prevalence of the widely-held assumption about the relationship between inter-annotator agreement and system performance, and present data that suggest that inter-annotator agreement is not, in fact, an upper bound on language processing system performance.
Vroland-Nordstrand, Kristina; Krumlinde-Sundholm, Lena
2012-11-01
to evaluate the test-retest reliability of children's perceptions of their own competence in performing daily tasks and of their choice of goals for intervention using the Swedish version of the perceived efficacy and goal setting system (PEGS). A second aim was to evaluate agreement between children's and parents' perceptions of the child's competence and choices of intervention goals. Forty-four children with disabilities and their parents completed the Swedish version of the PEGS. Thirty-six of the children completed a retest session allocated into one of two groups: (A) for evaluation of perceived competence and (B) for evaluation of choice of goals. Cohen's kappa, weighted kappa and absolute agreement were calculated. Test-retest reliability for children's perceived competence showed good agreement for the dichotomized scale of competent/non-competent performance; however, using the four-point scale the agreement varied. The children's own goals were relatively stable over time; 78% had an absolute agreement ranging from 50% to 100%. There was poor agreement between the children's and their parents' ratings. Goals identified by the children differed from those identified by their parents, with 48% of the children having no goals identical to those chosen by their parents. These results indicate that the Swedish version of the PEGS produces reliable outcomes comparable to the original version.
78 FR 48726 - Placement of the Georgia Agreement State Program on Probation
Federal Register 2010, 2011, 2012, 2013, 2014
2013-08-09
... the Integrated Materials Performance Evaluation Program (IMPEP). The Management Review Board (MRB), in... CONTACT: Lisa Dimmick, Office of Federal and State Materials and Environmental Management Programs, U.S... Agreement State radiation control programs, using performance indicators, to ensure that public health and...
Weeks, Douglas L; Molsberry, Dianne M
2009-03-01
This study determined inter-rater agreement between skill assessments provided by on-site PALS evaluators with ratings from evaluators at a remote site viewing the same skill performance over a videoconferencing network. Judgments about feasibility of remote evaluation were also obtained from the evaluators and PALS course participants. Two remote and two on-site instructors independently rated performance of 27 course participants who performed cardiac and shock/respiratory emergency core cases. Inter-rater reliability was assessed with the intraclass correlation coefficient (ICC). Feasibility was assessed with surveys of evaluators and course participants. Core cases were under the direction of the remote evaluators. The ICC for overall agreement on pass/fail decisions was 0.997 for the cardiac cases and 0.998 for the shock/respiratory cases. Perfect agreement was reached on 52 of 54 pass/fail decisions. Across all evaluators, all core cases, and all participants, 2584 ratings of individual skill criteria were provided, of which 21 (0.8%) were ratings in which a single evaluator disagreed with the other three evaluators. No trends emerged for location of the disagreeing evaluator. Survey responses indicated that remote evaluation was acceptable and feasible to course participants and to the evaluators. Videoconferencing technology was shown to provide adequate spatial and temporal resolution for PALS evaluators at-a-distance from course participants to agree with ratings of on-site evaluators.
MAP Markers: Research and Evaluation of the Mutual Agreement Program. Resource Document No. 5.
ERIC Educational Resources Information Center
Robison, James O.
This report is an evaluation, using randomized experimental/control design, of a program involving negotiated contractual agreements (MAP) between prisoners and parole boards for specific parole dates contingent on performance in work, training and treatment activities. Contracts were generally for less than six months. Both feasibility and…
Reliability and Validity of the Professional Counseling Performance Evaluation
ERIC Educational Resources Information Center
Shepherd, J. Brad; Britton, Paula J.; Kress, Victoria E.
2008-01-01
The definition and measurement of counsellor trainee competency is an issue that has received increased attention yet lacks quantitative study. This research evaluates item responses, scale reliability and intercorrelations, interrater agreement, and criterion-related validity of the Professional Performance Fitness Evaluation/Professional…
Casartelli, Nicola C; Maffiuletti, Nicola A; Brunner, Romana; Büchi, Marcel; Sutter, Reto; Pfirrmann, Christian W; Naal, Florian D; Leunig, Michael; Bizzini, Mario
2018-04-01
Study Design Cross-sectional study. Objectives To evaluate intrarater and interrater agreement among physical therapists with different clinical experience in performing a visual rating of movement-pattern quality of patients with femoroacetabular impingement (FAI) syndrome using a semi-quantitative scale. Background Visual rating of movement patterns in patients with FAI syndrome is of interest, because poor control of dynamic hip motion is frequently noted. Methods A video camera was used to record the performance of 34 patients with FAI syndrome performing single-limb standing, squat, frontal lunge, hop lunge, bridge, and plank. Visual rating of movement, as recorded on video, was performed by a highly experienced, a moderately experienced, and a novice physical therapist on 2 occasions using a semi-quantitative scale. Hip abductor strength was assessed using dynamometry, and hip pain and function were assessed with a patient-reported questionnaire. Intrarater and interrater agreement among physical therapists was evaluated using Gwet's agreement coefficient 1. Construct validity was evaluated as the association between physical therapists' rating and patients' hip abductor strength, pain, and function. Results Good intrarater and interrater agreement was observed in the highly experienced and moderately experienced physical therapists when rating single-limb standing, bridge, and plank. Poor to moderate intrarater and interrater agreement was found when they rated squat, frontal lunge, and hop lunge. Poor performers, as rated by the highly experienced physical therapist only, demonstrated lower hip abductor strength (P<.05), and similar hip pain and hip function compared to those of good performers. Conclusion Movement-pattern quality of patients with FAI syndrome should be rated by a highly experienced physical therapist. J Orthop Sports Phys Ther 2018;48(4):260-269. doi:10.2519/jospt.2018.7840.
Simonsen, Daniel; Nielsen, Ida F; Spaich, Erika G; Andersen, Ole K
2017-05-02
The present paper describes the design and evaluation of an automated version of the Modified Jebsen Test of Hand Function (MJT) based on the Microsoft Kinect sensor. The MJT was administered twice to 11 chronic stroke subjects with varying degrees of hand function deficits. The test times of the MJT were evaluated manually by a therapist using a stopwatch, and automatically using the Microsoft Kinect sensor. The ground truth times were assessed based on inspection of the video-recordings. The agreement between the methods was evaluated along with the test-retest performance. The results from Bland-Altman analysis showed better agreement between the ground truth times and the automatic MJT time evaluations compared to the agreement between the ground truth times and the times estimated by the therapist. The results from the test-retest performance showed that the subjects significantly improved their performance in several subtests of the MJT, indicating a practice effect. The results from the test showed that the Kinect can be used for automating the MJT.
Benefits and costs of programmatic agreements
DOT National Transportation Integrated Search
2015-02-01
The performing organization, on behalf of the FHWA Office of Planning, Environment, and Realty, conducted a benefit-cost assessment of programmatic agreements and approaches. The assessment consisted of a case study approach that evaluated three agre...
ERIC Educational Resources Information Center
Garman, Barry R.; And Others
1991-01-01
Band, orchestra, and choir festival evaluations are a regular part of many secondary school music programs, and most such festivals engage adjudicators who rate each group's performance. Because music ensemble performance is complex and multi-dimensional, it does not lend itself readily to precise measurement; generally, musical performances are…
Horta-Baas, Gabriel; Pérez Bolde-Hernández, Arturo; Hernández-Cabrera, María Fernanda; Vergara-Sánchez, Imelda; Romero-Figueroa, María Del Socorro
2017-10-11
To achieve control of rheumatoid arthritis (RA) it is necessary to be able to evaluate its activity. The American College of Rheumatology (ACR) recommends for this purpose indexes of activity that can be performed by the patient (PAS-II and RAPID-3) and IA including medical evaluation with laboratory studies (DAS28 and SDAI) or without them (CDAI). The objective was to analyze the concordance between self-rated clinimetric evaluation and clinimetric evaluation performed by the physician. Analytical cross-sectional study in 126 patients with RA. The agreement was evaluated through the weighted κ coefficient and the Krippendorff's α coefficient. The PAS-II and RAPID-3 significantly correlated with all variables included in the core set of measures recommended by the ACR/EULAR. The agreement between PAS-II and CDAI-SDAI was good (κ: 0.6, α: 0.61-0.62), and moderate with DAS28-ESR (κ: 0.53, α: 0.56). The concordance between RAPID-3 and CDAI-SDAI was moderate (κ: 0.55-0.57, α: 0.50-0.51), and moderate with DAS28-ESR (κ: 0.55, α: 0.53). When categorizing the activity in remission/low activity vs. moderate/severe activity, the agreement was greater with the PAS-II (0.59 vs. 0.34; P=.012). The good concordance between PAS-II and SDAI supports their use in clinical practice, especially if biomarkers of inflammation or the possibility of joint count are not available. However, in order to recommend its routine application in clinical practice, it is necessary to perform longitudinal studies that assess its responsiveness. Copyright © 2017 Elsevier España, S.L.U. All rights reserved.
Rutz, Matt A; Clary, Julie M; Kline, Jeffrey A; Russell, Frances M
2017-07-01
Focused cardiac ultrasound (FOCUS) is a useful tool in evaluating patients presenting to the emergency department (ED) with acute dyspnea. Prior work has shown that right ventricular (RV) dilation is associated with repeat hospitalizations and shorter life expectancy. Traditionally, RV assessment has been evaluated by cardiologist-interpreted comprehensive echocardiography. The primary goal of this study was to determine the inter-rater reliability between emergency physicians (EPs) and a cardiologist for determining RV dilation on FOCUS performed on ED patients with acute dyspnea. This was a prospective, observational study at two urban academic EDs; patients were enrolled if they had acute dyspnea and a computed tomographic pulmonary angiogram without acute disease. All patients had an EP-performed FOCUS to assess for RV dilation. RV dilation was defined as an RV to left ventricular ratio greater than 1. FOCUS interpretations were compared to a blinded cardiologist FOCUS interpretation using agreement and kappa statistics. Of 84 FOCUS examinations performed on 83 patients, 17% had RV dilation. Agreement and kappa, for EP-performed FOCUS for RV dilation were 89% (95% confidence interval [CI] 80-95%) and 0.68 (95% CI 0.48-0.88), respectively. Emergency physician sonographers are able to detect RV dilation with good agreement when compared to cardiology. These results support the wider use of EP-performed FOCUS to evaluate for RV dilation in ED patients with dyspnea. © 2017 by the Society for Academic Emergency Medicine.
Jacquot, Amélie; Eskenazi, Terry; Sales-Wuillemin, Edith; Montalan, Benoît; Proust, Joëlle; Grèzes, Julie; Conty, Laurence
2015-01-01
Through metacognitive evaluations, individuals assess their own cognitive operations with respect to their current goals. We have previously shown that non-verbal social cues spontaneously influence these evaluations, even when the cues are unreliable. Here, we explore whether a belief about the reliability of the source can modulate this form of social impact. Participants performed a two-alternative forced choice task that varied in difficulty. The task was followed by a video of a person who was presented as being either competent or incompetent at performing the task. That person provided random feedback to the participant through facial expressions indicating agreement, disagreement or uncertainty. Participants then provided a metacognitive evaluation by rating their confidence in their answer. Results revealed that participants' confidence was higher following agreements. Interestingly, this effect was merely reduced but not canceled for the incompetent individual, even though participants were able to perceive the individual's incompetence. Moreover, perceived agreement induced zygomaticus activity, but only when the feedback was provided for difficult trials by the competent individual. This last result strongly suggests that people implicitly appraise the relevance of social feedback with respect to their current goal. Together, our findings suggest that people always integrate social agreement into their metacognitive evaluations, even when epistemic vigilance mechanisms alert them to the risk of being misinformed.
Dental students' self-assessment of operative preparations using CAD/CAM: a preliminary analysis.
Mays, Keith A; Levine, Eric
2014-12-01
The Commission on Dental Accreditation (CODA)'s accreditation standards for dental schools state that "graduates must demonstrate the ability to self-assess." Therefore, dental schools have developed preclinical and clinical self-assessment (SA) protocols aimed at fostering a reflective process. This study comparing students' visual SA with students' digital SA and with faculty assessment was designed to test the hypothesis that higher agreement would occur when utilizing a digital evaluation. Twenty-five first-year dental students at one dental school participated by preparing a mesial occlusal preparation on tooth #30 and performing both types of SAs. A faculty evaluation was then performed both visually and digitally using the same evaluation criteria. The Kappa statistic was used to measure agreement between evaluators. The results showed statistically significant moderate agreement between the faculty visual and faculty digital modes of evaluation for occlusal shape (K=0.507, p=0.002), proximal shape (K=0.564, p=0.001), orientation (K=0.425, p=0.001), and definition (K=0.480, p=0.001). There was slight to poor agreement between the student visual and faculty visual assessment, except for preparation orientation occlusal shape (K=0.164, p=0.022), proximal shape (K=-0.227, p=0.032), orientation (K=0.253, p=0.041), and definition (K=-0.027, p=0.824). This study showed that the students had challenges in self-assessing even when using CAD/CAM and the digital assessment did not improve the amount of student/faculty agreement.
Code of Federal Regulations, 2010 CFR
2010-10-01
... BUSINESS PROGRAMS The Department of Energy Mentor-Protege Program 919.7013 Reports. (a) Prior to performing an evaluation of a Mentor's performance under its Mentor-Protege Agreement for use in award fee evaluations, the Mentor-Protege Program Manager must consult with the cognizant DOE technical program manager...
Teo, Jeanette W P; Chiang, Donald; Jureen, Roland; Lin, Raymond T P
2015-11-01
In this study, we evaluate the performance of a commercial assay, the AmpliVue HSV 1+2 Assay (Quidel), which employs HDA for the detection of both HSV-1 and HSV-2. The assay was tested on 307 clinical specimens (genital, oral, and dermal). When compared to shell vial virus culture and immunofluorescence typing of HSV, the positive percent agreement and negative percent agreement values were 98.2% and 90.9%, respectively. Excellent assay performance was demonstrated. Copyright © 2015 Elsevier Inc. All rights reserved.
Urrutia, Julio; Besa, Pablo; Campos, Mauricio; Cikutovic, Pablo; Cabezon, Mario; Molina, Marcelo; Cruz, Juan Pablo
2016-09-01
Grading inter-vertebral disc degeneration (IDD) is important in the evaluation of many degenerative conditions, including patients with low back pain. Magnetic resonance imaging (MRI) is considered the best imaging instrument to evaluate IDD. The Pfirrmann classification is commonly used to grade IDD; the authors describing this classification showed an adequate agreement using it; however, there has been a paucity of independent agreement studies using this grading system. The aim of this study was to perform an independent inter- and intra-observer agreement study using the Pfirrmann classification. T2-weighted sagittal images of 79 patients consecutively studied with lumbar spine MRI were classified using the Pfirrmann grading system by six evaluators (three spine surgeons and three radiologists). After a 6-week interval, the 79 cases were presented to the same evaluators in a random sequence for repeat evaluation. The intra-class correlation coefficient (ICC) and the weighted kappa (wκ) were used to determine the inter- and intra-observer agreement. The inter-observer agreement was excellent, with an ICC = 0.94 (0.93-0.95) and wκ = 0.83 (0.74-0.91). There were no differences between spine surgeons and radiologists. Likewise, there were no differences in agreement evaluating the different lumbar discs. Most differences among observers were only of one grade. Intra-observer agreement was also excellent with ICC = 0.86 (0.83-0.89) and wκ = 0.89 (0.85-0.93). In this independent study, the Pfirrmann classification demonstrated an adequate agreement among different observers and by the same observer on separate occasions. Furthermore, it allows communication between radiologists and spine surgeons.
Rater agreement of visual lameness assessment in horses during lungeing
Hammarberg, M.; Egenvall, A.; Pfau, T.
2015-01-01
Summary Reasons for performing study Lungeing is an important part of lameness examinations as the circular path may accentuate low‐grade lameness. Movement asymmetries related to the circular path, to compensatory movements and to pain make the lameness evaluation complex. Scientific studies have shown high inter‐rater variation when assessing lameness during straight line movement. Objectives The aim was to estimate inter‐ and intra‐rater agreement of equine veterinarians evaluating lameness from videos of sound and lame horses during lungeing and to investigate the influence of veterinarians’ experience and the objective degree of movement asymmetry on rater agreement. Study design Cross‐sectional observational study. Methods Video recordings and quantitative gait analysis with inertial sensors were performed in 23 riding horses of various breeds. The horses were examined at trot on a straight line and during lungeing on soft or hard surfaces in both directions. One video sequence was recorded per condition and the horses were classified as forelimb lame, hindlimb lame or sound from objective straight line symmetry measurements. Equine veterinarians (n = 86), including 43 with >5 years of orthopaedic experience, participated in a web‐based survey and were asked to identify the lamest limb on 60 videos, including 10 repeats. The agreements between (inter‐rater) and within (intra‐rater) veterinarians were analysed with κ statistics (Fleiss, Cohen). Results Inter‐rater agreement κ was 0.31 (0.38/0.25 for experienced/less experienced) and higher for forelimb (0.33) than for hindlimb lameness (0.11) or soundness (0.08) evaluation. Median intra‐rater agreement κ was 0.57. Conclusions Inter‐rater agreement was poor for less experienced raters, and for all raters when evaluating hindlimb lameness. Since identification of the lame limb/limbs is a prerequisite for successful diagnosis, treatment and recovery, the high inter‐rater variation when evaluating lameness on the lunge is likely to influence the accuracy and repeatability of lameness examinations and, indirectly, the efficacy of treatment. PMID:25399722
A rater training protocol to assess team performance.
Eppich, Walter; Nannicelli, Anna P; Seivert, Nicholas P; Sohn, Min-Woong; Rozenfeld, Ranna; Woods, Donna M; Holl, Jane L
2015-01-01
Simulation-based methodologies are increasingly used to assess teamwork and communication skills and provide team training. Formative feedback regarding team performance is an essential component. While effective use of simulation for assessment or training requires accurate rating of team performance, examples of rater-training programs in health care are scarce. We describe our rater training program and report interrater reliability during phases of training and independent rating. We selected an assessment tool shown to yield valid and reliable results and developed a rater training protocol with an accompanying rater training handbook. The rater training program was modeled after previously described high-stakes assessments in the setting of 3 facilitated training sessions. Adjacent agreement was used to measure interrater reliability between raters. Nine raters with a background in health care and/or patient safety evaluated team performance of 42 in-situ simulations using post-hoc video review. Adjacent agreement increased from the second training session (83.6%) to the third training session (85.6%) when evaluating the same video segments. Adjacent agreement for the rating of overall team performance was 78.3%, which was added for the third training session. Adjacent agreement was 97% 4 weeks posttraining and 90.6% at the end of independent rating of all simulation videos. Rater training is an important element in team performance assessment, and providing examples of rater training programs is essential. Articulating key rating anchors promotes adequate interrater reliability. In addition, using adjacent agreement as a measure allows differentiation between high- and low-performing teams on video review. © 2015 The Alliance for Continuing Education in the Health Professions, the Society for Academic Continuing Medical Education, and the Council on Continuing Medical Education, Association for Hospital Medical Education.
Evaluation of high fidelity patient simulator in assessment of performance of anaesthetists.
Weller, J M; Bloch, M; Young, S; Maze, M; Oyesola, S; Wyner, J; Dob, D; Haire, K; Durbridge, J; Walker, T; Newble, D
2003-01-01
There is increasing emphasis on performance-based assessment of clinical competence. The High Fidelity Patient Simulator (HPS) may be useful for assessment of clinical practice in anaesthesia, but needs formal evaluation of validity, reliability, feasibility and effect on learning. We set out to assess the reliability of a global rating scale for scoring simulator performance in crisis management. Using a global rating scale, three judges independently rated videotapes of anaesthetists in simulated crises in the operating theatre. Five anaesthetists then independently rated subsets of these videotapes. There was good agreement between raters for medical management, behavioural attributes and overall performance. Agreement was high for both the initial judges and the five additional raters. Using a global scale to assess simulator performance, we found good inter-rater reliability for scoring performance in a crisis. We estimate that two judges should provide a reliable assessment. High fidelity simulation should be studied further for assessing clinical performance.
New agreement measures based on survival processes
Guo, Ying; Li, Ruosha; Peng, Limin; Manatunga, Amita K.
2013-01-01
Summary The need to assess agreement arises in many scenarios in biomedical sciences when measurements were taken by different methods on the same subjects. When the endpoints are survival outcomes, the study of agreement becomes more challenging given the special characteristics of time-to-event data. In this paper, we propose a new framework for assessing agreement based on survival processes that can be viewed as a natural representation of time-to-event outcomes. Our new agreement measure is formulated as the chance-corrected concordance between survival processes. It provides a new perspective for studying the relationship between correlated survival outcomes and offers an appealing interpretation as the agreement between survival times on the absolute distance scale. We provide a multivariate extension of the proposed agreement measure for multiple methods. Furthermore, the new framework enables a natural extension to evaluate time-dependent agreement structure. We develop nonparametric estimation of the proposed new agreement measures. Our estimators are shown to be strongly consistent and asymptotically normal. We evaluate the performance of the proposed estimators through simulation studies and then illustrate the methods using a prostate cancer data example. PMID:23844617
Fixation-free assessment of the Hirschberg ratio.
Model, Dmitri; Eizenman, Moshe; Sturm, Veit
2010-08-01
To describe a novel methodology by which to measure the Hirschberg ratio (HR) in infants. The methodology does not require fixation on specific points, and measurements are made while infants look naturally at a display. The HR is calculated automatically from measurements of the direction of the optical axis, the position of the pupil center, and corneal reflexes in video images from an advanced two-camera eye-tracking system. The performance of the novel fixation-free procedure (FFP) was evaluated in 43 adults by measuring the average difference and the 95% limits of agreement with the standard fixation-based procedure (FBP). Repeatability of the HR measurements was evaluated by assessing the 95% limits of agreement between two independent measurements. Performance of the FFP was also evaluated in five infants. In adults, the average HR was 12.89 +/- 1.22 degrees/mm for FFP and 12.81 +/- 1.22 degrees/mm for FBP. FFP and FBP measurements were highly correlated (r = 0.95; P < 0.001). The 95% limits of agreement between FFP and FBP were +/-0.86 degrees/mm. The 95% limits of agreement of repeated measurements were +/-0.66 degrees/mm for FFP and +/-0.77 degrees/mm for FBP. In infants, the 95% limits of agreement of repeated measurements by FFP were +/-0.63 degrees/mm. In adults, the FFP provides accurate measurements of the HR that are in excellent agreement with measurements by FBP. In infants, measurements of HR by FFP show the same repeatability and consistency.
Urrutia, Julio; Zamora, Tomas; Campos, Mauricio; Yurac, Ratko; Palma, Joaquin; Mobarec, Sebastian; Prada, Carlos
2016-07-01
We performed an agreement study using two subaxial cervical spine classification systems: the AOSpine and the Allen and Ferguson (A&F) classifications. We sought to determine which scheme allows better agreement by different evaluators and by the same evaluator on different occasions. Complete imaging studies of 65 patients with subaxial cervical spine injuries were classified by six evaluators (three spine sub-specialists and three senior orthopaedic surgery residents) using the AOSpine subaxial cervical spine classification system and the A&F scheme. The cases were displayed in a random sequence after a 6-week interval for repeat evaluation. The Kappa coefficient (κ) was used to determine inter- and intra-observer agreement. Inter-observer: considering the main AO injury types, the agreement was substantial for the AOSpine classification [κ = 0.61 (0.57-0.64)]; using AO sub-types, the agreement was moderate [κ = 0.57 (0.54-0.60)]. For the A&F classification, the agreement [κ = 0.46 (0.42-0.49)] was significantly lower than using the AOSpine scheme. Intra-observer: the agreement was substantial considering injury types [κ = 0.68 (0.62-0.74)] and considering sub-types [κ = 0.62 (0.57-0.66)]. Using the A&F classification, the agreement was also substantial [κ = 0.66 (0.61-0.71)]. No significant differences were observed between spine surgeons and orthopaedic residents in the overall inter- and intra-observer agreement, or in the inter- and intra-observer agreement of specific type of injuries. The AOSpine classification (using the four main injury types or at the sub-types level) allows a significantly better agreement than the A&F classification. The A&F scheme does not allow reliable communication between medical professionals.
7 CFR 1709.16 - Performance reviews.
Code of Federal Regulations, 2014 CFR
2014-01-01
... 7 Agriculture 11 2014-01-01 2014-01-01 false Performance reviews. 1709.16 Section 1709.16... AGRICULTURE ASSISTANCE TO HIGH ENERGY COST COMMUNITIES General Requirements § 1709.16 Performance reviews. Each grant agreement shall include performance criteria and RUS will regularly evaluate the progress...
Lau, Elaine; Dolovich, Lisa; Austin, Zubin
2007-03-01
The Family Practice Simulator (FPS) was piloted as a teaching, learning, and assessment opportunity for pharmacists making the transition into primary care practice. During this one-day simulation of a typical day in a family physician's office, nine pharmacists rotated through a series of 13 OSCE stations where they interacted with physicians, patients, nurses and office staff while completing primary care activities and receiving performance evaluations. Pharmacists' performance ratings from self, physician, and standardized patient evaluations were compared using Global Rating Scales (GRS) scores and station-specific key points checklists. The mean (SD) overall GRS scores obtained by pharmacists across all stations in the FPS were 4.56 (SD = 0.60) from standardized patients, 3.95 (SD = 0.63) from physicians, and 3.60 (SD = 0.63) from self-assessment (out of a maximum score of 5). Agreement between pharmacists' and patients' GRS ratings ranged from moderate to good (generalizability coefficient (G) = 0.45 to 0.72) for all except one station. Agreement in GRS scores between pharmacists and physicians was at most fair for every station (G = 0.02 - 0.26). There was fair agreement on key points scores between pharmacists and patients (weighted kappa = 27%; 95% CI 7%, 47%) and moderate agreement between pharmacists and physicians (weighted kappa = 45%; 95% CI 21%, 70%). Although there was at best moderate agreement in rating scores between pharmacists, standardized patients, and physicians, the FPS provided an important opportunity to measure expectations regarding the professional role, responsibilities, and performance of pharmacists from a multi-professional perspective, thus better preparing pharmacists for integration into primary care practice. Differences in agreement may have been due to different preconceptions and expectations among raters.
Current and Developing Conceptions of Use: Evaluation Use TIG Survey Results.
ERIC Educational Resources Information Center
Preskill, Hallie; Caracelli, Valerie
1997-01-01
A survey was sent to members of the Evaluation Use Topical Interest Group (TIG) to determine their perceptions about and experiences with evaluation use. Responses from 282 members show agreement on the major purposes of evaluation and an increased use of performance-results oriented and formative evaluations. (SLD)
Mendicino, Diego; Stafuza, Mariana; Colussi, Carlina; del Barco, Mónica; Streiger, Mirtha; Moretti, Edgardo
2014-01-01
Many patients with Chagas disease live in remote communities that lack both equipment and trained personnel to perform a diagnosis by conventional serology (CS). Thus, reliable tests suitable for use under difficult conditions are required. In this study, we evaluated the ability of personnel with and without laboratory skills to perform immunochromatographic (IC) tests to detect Chagas disease at a primary health care centre (PHCC). We examined whole blood samples from 241 patients and serum samples from 238 patients. Then, we calculated the percentage of overall agreement (POA) between the two groups of operators for the sensitivity (S), specificity (Sp) and positive (PPV) and negative (NPV) predictive values of IC tests compared to CS tests. We also evaluated the level of agreement between ELISAs and indirect haemagglutination (IHA) tests. The readings of the IC test results showed 100% agreement (POA = 1). The IC test on whole blood showed the following values: S = 87.3%; Sp = 98.8%; PPV = 96.9% and NPV = 95.9%. Additionally, the IC test on serum displayed the following results: S = 95.7%; Sp = 100%; PPV = 100% and NPV = 98.2%. Using whole blood, the agreement with ELISA was 96.3% and the agreement with IHA was 94.1%. Using serum, the agreement with ELISA was 97.8% and the agreement with IHA was 96.6%. The IC test performance with serum samples was excellent and demonstrated its usefulness in a PHCC with minimal equipment. If the IC test S value and NPV with whole blood are improved, then this test could also be used in areas lacking laboratories or specialised personnel. PMID:25466624
Huynh, Thien J; Flaherty, Matthew L; Gladstone, David J; Broderick, Joseph P; Demchuk, Andrew M; Dowlatshahi, Dar; Meretoja, Atte; Davis, Stephen M; Mitchell, Peter J; Tomlinson, George A; Chenkin, Jordan; Chia, Tze L; Symons, Sean P; Aviv, Richard I
2014-01-01
Rapid, accurate, and reliable identification of the computed tomography angiography spot sign is required to identify patients with intracerebral hemorrhage for trials of acute hemostatic therapy. We sought to assess the accuracy and interobserver agreement for spot sign identification. A total of 131 neurology, emergency medicine, and neuroradiology staff and fellows underwent imaging certification for spot sign identification before enrolling patients in 3 trials targeting spot-positive intracerebral hemorrhage for hemostatic intervention (STOP-IT, SPOTLIGHT, STOP-AUST). Ten intracerebral hemorrhage cases (spot-positive/negative ratio, 1:1) were presented for evaluation of spot sign presence, number, and mimics. True spot positivity was determined by consensus of 2 experienced neuroradiologists. Diagnostic performance, agreement, and differences by training level were analyzed. Mean accuracy, sensitivity, and specificity for spot sign identification were 87%, 78%, and 96%, respectively. Overall sensitivity was lower than specificity (P<0.001) because of true spot signs incorrectly perceived as spot mimics. Interobserver agreement for spot sign presence was moderate (k=0.60). When true spots were correctly identified, 81% correctly identified the presence of single or multiple spots. Median time needed to evaluate the presence of a spot sign was 1.9 minutes (interquartile range, 1.2-3.1 minutes). Diagnostic performance, interobserver agreement, and time needed for spot sign evaluation were similar among staff physicians and fellows. Accuracy for spot identification is high with opportunity for improvement in spot interpretation sensitivity and interobserver agreement particularly through greater reliance on computed tomography angiography source data and awareness of limitations of multiplanar images. Further prospective study is needed.
Inter-rater Agreement of End-of-shift Evaluations Based on a Single Encounter
Warrington, Steven; Beeson, Michael; Bradford, Amber
2017-01-01
Introduction End-of-shift evaluation (ESE) forms, also known as daily encounter cards, represent a subset of encounter-based assessment forms. Encounter cards have become prevalent for formative evaluation, with some suggesting a potential for summative evaluation. Our objective was to evaluate the inter-rater agreement of ESE forms using a single scripted encounter at a conference of emergency medicine (EM) educators. Methods Following institutional review board exemption, we created a scripted video simulating an encounter between an intern and a patient with an ankle injury. That video was shown during a lecture at the Council of EM Residency Director’s Academic Assembly with attendees asked to evaluate the “resident” using one of eight possible ESE forms randomly distributed. Descriptive statistics were used to analyze the results with Fleiss’ kappa to evaluate inter-rater agreement. Results Most of the 324 respondents were leadership in residency programs (66%), with a range of 29–47 responses per evaluation form. Few individuals (5%) felt they were experts in assessing residents based on EM milestones. Fleiss’ kappa ranged from 0.157 – 0.308 and did not perform much better in two post-hoc subgroup analyses. Conclusion The kappa ranges found show only slight to fair inter-rater agreement and raise concerns about the use of ESE forms in assessment of EM residents. Despite limitations present in this study, these results and a lack of other studies on inter-rater agreement of encounter cards should prompt further studies of such methods of assessment. Additionally, EM educators should focus research on methods to improve inter-rater agreement of ESE forms or other evaluating other methods of assessment of EM residents. PMID:28435505
[Evaluation of an educational website on First Aid].
Mori, Satomi; Whitaker, Iveth Yamaguchi; Marin, Heimar de Fátima
2013-08-01
The aim of this study was to evaluate the structure, quality of information and usability of a website on First Aid. The evaluation was performed by information technology (IT) and health care professionals and by students, using specific and validated instruments. The kappa method was used to evaluate the agreement of the answers, and Cronbach's α coefficient was used to assess the reliability of the instrument. There was no agreement (0.047) among the answers obtained from the IT professionals, indicating that the structure of the website must be reviewed. There was also no agreement in the evaluation by the health care professionals (-0.062); however, the overall positive scores suggest that the quality of the information of the website is adequate. The assessment of reliability of the instrument to evaluate the navigability rendered a value of α=0.974. Although improvement of the website structure is recommended, the quality of the information is good, and its use has contributed to the apprenticeship of students.
Marcos-Garcés, V; Harvat, M; Molina Aguilar, P; Ferrández Izquierdo, A; Ruiz-Saurí, A
2017-08-01
Measurement of collagen bundle orientation in histopathological samples is a widely used and useful technique in many research and clinical scenarios. Fourier analysis is the preferred method for performing this measurement, but the most appropriate staining and microscopy technique remains unclear. Some authors advocate the use of Haematoxylin-Eosin (H&E) and confocal microscopy, but there are no studies comparing this technique with other classical collagen stainings. In our study, 46 human skin samples were collected, processed for histological analysis and stained with Masson's trichrome, Picrosirius red and H&E. Five microphotographs of the reticular dermis were taken with a 200× magnification with light microscopy, polarized microscopy and confocal microscopy, respectively. Two independent observers measured collagen bundle orientation with semiautomated Fourier analysis with the Image-Pro Plus 7.0 software and three independent observers performed a semiquantitative evaluation of the same parameter. The average orientation for each case was calculated with the values of the five pictures. We analyzed the interrater reliability, the consistency between Fourier analysis and average semiquantitative evaluation and the consistency between measurements in Masson's trichrome, Picrosirius red and H&E-confocal. Statistical analysis for reliability and agreement was performed with the SPSS 22.0 software and consisted of intraclass correlation coefficient (ICC), Bland-Altman plots and limits of agreement and coefficient of variation. Interrater reliability was almost perfect (ICC > 0.8) with all three histological and microscopy techniques and always superior in Fourier analysis than in average semiquantitative evaluation. Measurements were consistent between Fourier analysis by one observer and average semiquantitative evaluation by three observers, with an almost perfect agreement with Masson's trichrome and Picrosirius red techniques (ICC > 0.8) and a strong agreement with H&E-confocal (0.7 < ICC < 0.8). Comparison of measurements between the three techniques for the same observer showed an almost perfect agreement (ICC > 0.8), better with Fourier analysis than with semiquantitative evaluation (single and average). These results in nonpathological skin samples were also confirmed in a preliminary analysis in eight scleroderma skin samples. Our results show that Masson's trichrome and Picrosirius red are consistent with H&E-confocal for measuring collagen bundle orientation in histological samples and could thus be used indistinctly for this purpose. Fourier analysis is superior to average semiquantitative evaluation and should keep being used as the preferred method. © 2017 The Authors Journal of Microscopy © 2017 Royal Microscopical Society.
Holistic rubric vs. analytic rubric for measuring clinical performance levels in medical students.
Yune, So Jung; Lee, Sang Yeoup; Im, Sun Ju; Kam, Bee Sung; Baek, Sun Yong
2018-06-05
Task-specific checklists, holistic rubrics, and analytic rubrics are often used for performance assessments. We examined what factors evaluators consider important in holistic scoring of clinical performance assessment, and compared the usefulness of applying holistic and analytic rubrics respectively, and analytic rubrics in addition to task-specific checklists based on traditional standards. We compared the usefulness of a holistic rubric versus an analytic rubric in effectively measuring the clinical skill performances of 126 third-year medical students who participated in a clinical performance assessment conducted by Pusan National University School of Medicine. We conducted a questionnaire survey of 37 evaluators who used all three evaluation methods-holistic rubric, analytic rubric, and task-specific checklist-for each student. The relationship between the scores on the three evaluation methods was analyzed using Pearson's correlation. Inter-rater agreement was analyzed by Kappa index. The effect of holistic and analytic rubric scores on the task-specific checklist score was analyzed using multiple regression analysis. Evaluators perceived accuracy and proficiency to be major factors in objective structured clinical examinations evaluation, and history taking and physical examination to be major factors in clinical performance examinations evaluation. Holistic rubric scores were highly related to the scores of the task-specific checklist and analytic rubric. Relatively low agreement was found in clinical performance examinations compared to objective structured clinical examinations. Meanwhile, the holistic and analytic rubric scores explained 59.1% of the task-specific checklist score in objective structured clinical examinations and 51.6% in clinical performance examinations. The results show the usefulness of holistic and analytic rubrics in clinical performance assessment, which can be used in conjunction with task-specific checklists for more efficient evaluation.
Design and Testing of a Tool for Evaluating the Quality of Diabetes Consumer-Information Web Sites
Steinwachs, Donald; Rubin, Haya R
2003-01-01
Background Most existing tools for measuring the quality of Internet health information focus almost exclusively on structural criteria or other proxies for quality information rather than evaluating actual accuracy and comprehensiveness. Objective This research sought to develop a new performance-measurement tool for evaluating the quality of Internet health information, test the validity and reliability of the tool, and assess the variability in diabetes Web site quality. Methods An objective, systematic tool was developed to evaluate Internet diabetes information based on a quality-of-care measurement framework. The principal investigator developed an abstraction tool and trained an external reviewer on its use. The tool included 7 structural measures and 34 performance measures created by using evidence-based practice guidelines and experts' judgments of accuracy and comprehensiveness. Results Substantial variation existed in all categories, with overall scores following a normal distribution and ranging from 15% to 95% (mean was 50% and median was 51%). Lin's concordance correlation coefficient to assess agreement between raters produced a rho of 0.761 (Pearson's r of 0.769), suggesting moderate to high agreement. The average agreement between raters for the performance measures was 0.80. Conclusions Diabetes Web site quality varies widely. Alpha testing of this new tool suggests that it could become a reliable and valid method for evaluating the quality of Internet health sites. Such an instrument could help lay people distinguish between beneficial and misleading information. PMID:14713658
Wengert, G J; Helbich, T H; Woitek, R; Kapetas, P; Clauser, P; Baltzer, P A; Vogl, W-D; Weber, M; Meyer-Baese, A; Pinker, Katja
2016-11-01
To evaluate the inter-/intra-observer agreement of BI-RADS-based subjective visual estimation of the amount of fibroglandular tissue (FGT) with magnetic resonance imaging (MRI), and to investigate whether FGT assessment benefits from an automated, observer-independent, quantitative MRI measurement by comparing both approaches. Eighty women with no imaging abnormalities (BI-RADS 1 and 2) were included in this institutional review board (IRB)-approved prospective study. All women underwent un-enhanced breast MRI. Four radiologists independently assessed FGT with MRI by subjective visual estimation according to BI-RADS. Automated observer-independent quantitative measurement of FGT with MRI was performed using a previously described measurement system. Inter-/intra-observer agreements of qualitative and quantitative FGT measurements were assessed using Cohen's kappa (k). Inexperienced readers achieved moderate inter-/intra-observer agreement and experienced readers a substantial inter- and perfect intra-observer agreement for subjective visual estimation of FGT. Practice and experience reduced observer-dependency. Automated observer-independent quantitative measurement of FGT was successfully performed and revealed only fair to moderate agreement (k = 0.209-0.497) with subjective visual estimations of FGT. Subjective visual estimation of FGT with MRI shows moderate intra-/inter-observer agreement, which can be improved by practice and experience. Automated observer-independent quantitative measurements of FGT are necessary to allow a standardized risk evaluation. • Subjective FGT estimation with MRI shows moderate intra-/inter-observer agreement in inexperienced readers. • Inter-observer agreement can be improved by practice and experience. • Automated observer-independent quantitative measurements can provide reliable and standardized assessment of FGT with MRI.
Besa, Pablo; Urrutia, Julio; Campos, Mauricio; Mobarec, Sebastián; Cruz, Juan Pablo; Cikutovic, Pablo; Diaz, Gonzalo
2018-04-27
Differentiating osteoporotic vertebral fractures (OVFs) from metastatic vertebral fractures (MVFs) is an important clinical challenge. A novel magnetic resonance imaging (MRI)-based score (the META score) was described, aiming to differentiate OVF from MVF. This score showed an almost perfect agreement by the group developing it, but an independent agreement evaluation is pending. We aimed to perform an independent inter- and intraobserver agreement evaluation of the META score and to test the score's capability of differentiating OVF from MVF. This is an agreement study of the META score. Sixty-four patients with confirmed OVF or MVF were assessed by six independent evaluators (three spine surgeons and three fellowship-trained radiologists) using the META score. We used the intraclass correlation coefficient (ICC) to determine the overall inter-and intraobserver agreement, and the kappa statistic (κ) to express the agreement for each individual score criterion. The score accuracy was determined by calculating the area under the receiver operating characteristic curve. Finally, we used κ to evaluate the agreement among raters to determine whether the fracture was OVF or MVF. The overall interobserver agreement was poor [ICC=0.10 (0.02-0.20)]; spine surgeons [ICC=0.75 (0.66-0.83)] had better agreement than radiologists did [ICC=0.05 (-0.08 to 0.21)]. The intraobserver agreement was poor [ICC=0.17 (0.01-0.32)]; both spine surgeons [ICC=0.21 (0.05-0.41)] and radiologists had a poor agreement [ICC=0.03 (-0.29 to 0.27)]. The agreement for each specific criterion varied from κ=0.24 to κ=0.60. The area under the receiver operating characteristic curve was 0.58 (0.64 for spine surgeons and 0.52 for radiologists, p<.01). The interobserver agreement using the META score was adequate for spine surgeons but not for other potential users (radiologists); the intraobserver agreement was poor. Further studies are thus necessary before the use of this score is recommended. Copyright © 2018 Elsevier Inc. All rights reserved.
House, Joseph B.; Dooley-Hash, Suzanne; Kowalenko, Terry; Sikavitsas, Athina; Seeyave, Desiree M.; Younger, John G.; Hamstra, Stanley J.; Nypaver, Michele M.
2012-01-01
Introduction Real-time assessment of operator performance during procedural simulation is a common practice that requires undivided attention by 1 or more reviewers, potentially over many repetitions of the same case. Objective To determine whether reviewers display better interrater agreement of procedural competency when observing recorded, rather than live, performance; and to develop an assessment tool for pediatric rapid sequence intubation (pRSI). Methods A framework of a previously established Objective Structured Assessment of Technical Skills (OSATS) tool was modified for pRSI. Emergency medicine residents (postgraduate year 1–4) were prospectively enrolled in a pRSI simulation scenario and evaluated by 2 live raters using the modified tool. Sessions were videotaped and reviewed by the same raters at least 4 months later. Raters were blinded to their initial rating. Interrater agreement was determined by using the Krippendorff generalized concordance method. Results Overall interrater agreement for live review was 0.75 (95% confidence interval [CI], 0.72–0.78) and for video was 0.79 (95% CI, 0.73–0.82). Live review was significantly superior to video review in only 1 of the OSATS domains (Preparation) and was equivalent in the other domains. Intrarater agreement between the live and video evaluation was very good, greater than 0.75 for all raters, with a mean of 0.81 (95% CI, 0.76–0.85). Conclusion The modified OSATS assessment tool demonstrated some evidence of validity in discriminating among levels of resident experience and high interreviewer reliability. With this tool, intrareviewer reliability was high between live and 4-months' delayed video review of the simulated procedure, which supports feasibility of delayed video review in resident assessment. PMID:23997874
Reisfeld, S; Assaly, M; Tannous, E; Amarney, K; Stein, M
2018-06-01
Approximately 20-50% of antimicrobial therapy in hospitalized patients is considered inappropriate, which may be associated with increased morbidity and mortality. The best method for evaluation of appropriateness is not well defined. To evaluate the rate of appropriate antimicrobial therapy in a secondary hospital using three different methods, and determine the rate of agreement between the different methods. A point prevalence study included all adult hospitalized patients receiving systemic antimicrobial therapy during 2016, screened on a single day. Clinical, laboratory and therapeutic data were collected from patient files, and appropriateness was rated with a qualitative evaluation by expert opinion. In addition, a quantitative evaluation was performed according to 11 quality indicators (QIs) rated for each patient. A strict definition of appropriateness was fulfilled if six essential QIs were met, and a lenient definition was fulfilled if at least five QIs were met. Agreement between methods was analysed using kappa statistic. Among 106 patients included, rates of appropriateness of antimicrobial therapy ranged from 20% to 75%, depending on the method of evaluation. Very low agreement was found between the strict definition and expert opinion (kappa=0.068), and medium agreement was found between the lenient definition and expert opinion (kappa=0.45). Rates of appropriateness of antimicrobial therapy varied between evaluation methods, with low to moderate agreement between the different methods. Copyright © 2017 The Healthcare Infection Society. Published by Elsevier Ltd. All rights reserved.
Wentzensen, Nicolas; Fetterman, Barbara; Tokugawa, Diane; Schiffman, Mark; Castle, Philip E; Wood, Shannon N; Stiemerling, Eric; Poitras, Nancy; Lorey, Thomas; Kinney, Walter
2014-12-01
Dual-stain cytology for p16 and Ki-67 has been proposed as a biomarker in cervical cancer screening. The authors evaluated the reproducibility and accuracy of dual-stain cytology among 10 newly trained evaluators. In total, 480 p16/Ki-67-stained slides from human papillomavirus-positive women were evaluated in masked fashion by 10 evaluators. None of the evaluators had previous experience with p16 or p16/Ki-67 cytology. All participants underwent p16/Ki-67 training and subsequent proficiency testing. Reproducibility of dual-stain cytology was measured using the percentage agreement, individual and aggregate κ values, as well as McNemar statistics. Clinical performance for the detection of cervical intraepithelial neoplasia grade 2 or greater (CIN2+) was evaluated for each individual evaluator and for all evaluators combined compared with the reference evaluation by a cytotechnologist who had extensive experience with dual-stain cytology. The percentage agreement of individual evaluators with the reference evaluation ranged from 83% to 91%, and the κ values ranged from 0.65 to 0.81. The combined κ value was 0.71 for all evaluators and 0.73 for cytotechnologists. The average sensitivity and specificity for the detection of CIN2+ among novice evaluators was 82% and 64%, respectively; whereas the reference evaluation had 84% sensitivity and 63% specificity, respectively. Agreement on dual-stain positivity increased with greater numbers of p16/Ki-67-positive cells on the slides. Good to excellent reproducibility of p16/Ki-67 dual-stain cytology was observed with almost identical clinical performance of novice evaluators compared with reference evaluations. The current findings suggest that p16/Ki-67 dual-stain evaluation can be implemented in routine cytology practice with limited training. © 2014 American Cancer Society.
Dantas, Roberto Nery; Assuncao, Antonildes Nascimento; Marques, Ismar Aguiar; Fahel, Mateus Guimaraes; Nomura, Cesar Higa; Avila, Luiz Francisco Rodrigues; Giorgi, Maria Clementina Pinto; Soares, Jose; Meneghetti, Jose Claudio; Parga, Jose Rodrigues
2018-06-01
Despite advances in non-invasive myocardial perfusion imaging (MPI) evaluation, computed tomography (CT) multiphase MPI protocols have not yet been compared with the highly accurate rubidium-82 positron emission tomography ( 82 RbPET) MPI. Thus, this study aimed to evaluate agreement between 82 RbPET and 320-detector row CT (320-CT) MPI using a multiphase protocol in suspected CAD patients. Forty-four patients referred for MPI evaluation were prospectively enrolled and underwent dipyridamole stress 82 RbPET and multiphase 320-CT MPI (five consecutive volumetric acquisitions during stress). Statistical analyses were performed using the R software. There was high agreement for recognizing summed stress scores ≥ 4 (kappa 0.77, 95% CI 0.55-0.98, p < 0.001) and moderate for detecting SDS ≥ 2 (kappa 0.51, 95% CI 0.23-0.80, p < 0.001). In a per segment analysis, agreement was high for the presence of perfusion defects during stress and rest (kappa 0.75 and 0.82, respectively) and was moderate for impairment severity (kappa 0.58 and 0.65, respectively). The 320-CT protocol was safe, with low radiation burden (9.3 ± 2.4 mSv). There was a significant agreement between dipyridamole stress 320-CT MPI and 82 RbPET MPI in the evaluation of suspected CAD patients of intermediate risk. The multiphase 320-CT MPI protocol was feasible, diagnostic and with relatively low radiation exposure. • Rubidium-82 PET and 320-MDCT can perform MPI studies for CAD investigation. • There is high agreement between rubidium-82 PET and 320-MDCT for MPI assessment. • Multiphase CT perfusion protocols are feasible and with low radiation. • Multiphase CT perfusion protocols can identify image artefacts.
Schot, Marjolein J C; van Delft, Sanne; Kooijman-Buiting, Antoinette M J; de Wit, Niek J; Hopstaken, Rogier M
2015-01-01
Objective Various point-of-care testing (POCT) urine analysers are commercially available for routine urine analysis in general practice. The present study compares analytical performance, agreement and user-friendliness of six different POCT urine analysers for diagnosing urinary tract infection in general practice. Setting All testing procedures were performed at a diagnostic centre for primary care in the Netherlands. Urine samples were collected at four general practices. Primary and secondary outcome measures Analytical performance and agreement of the POCT analysers regarding nitrite, leucocytes and erythrocytes, with the laboratory reference standard, was the primary outcome measure, and analysed by calculating sensitivity, specificity, positive and negative predictive value, and Cohen's κ coefficient for agreement. Secondary outcome measures were the user-friendliness of the POCT analysers, in addition to other characteristics of the analysers. Results The following six POCT analysers were evaluated: Uryxxon Relax (Macherey Nagel), Urisys 1100 (Roche), Clinitek Status (Siemens), Aution 11 (Menarini), Aution Micro (Menarini) and Urilyzer (Analyticon). Analytical performance was good for all analysers. Compared with laboratory reference standards, overall agreement was good, but differed per parameter and per analyser. Concerning the nitrite test, the most important test for clinical practice, all but one showed perfect agreement with the laboratory standard. For leucocytes and erythrocytes specificity was high, but sensitivity was considerably lower. Agreement for leucocytes varied between good to very good, and for the erythrocyte test between fair and good. First-time users indicated that the analysers were easy to use. They expected higher productivity and accuracy when using these analysers in daily practice. Conclusions The overall performance and user-friendliness of all six commercially available POCT urine analysers was sufficient to justify routine use in suspected urinary tract infections in general practice. PMID:25986635
On Evaluating Brain Tissue Classifiers without a Ground Truth
Martin-Fernandez, Marcos; Ungar, Lida; Nakamura, Motoaki; Koo, Min-Seong; McCarley, Robert W.; Shenton, Martha E.
2009-01-01
In this paper, we present a set of techniques for the evaluation of brain tissue classifiers on a large data set of MR images of the head. Due to the difficulty of establishing a gold standard for this type of data, we focus our attention on methods which do not require a ground truth, but instead rely on a common agreement principle. Three different techniques are presented: the Williams’ index, a measure of common agreement; STAPLE, an Expectation Maximization algorithm which simultaneously estimates performance parameters and constructs an estimated reference standard; and Multidimensional Scaling, a visualization technique to explore similarity data. We apply these different evaluation methodologies to a set eleven different segmentation algorithms on forty MR images. We then validate our evaluation pipeline by building a ground truth based on human expert tracings. The evaluations with and without a ground truth are compared. Our findings show that comparing classifiers without a gold standard can provide a lot of interesting information. In particular, outliers can be easily detected, strongly consistent or highly variable techniques can be readily discriminated, and the overall similarity between different techniques can be assessed. On the other hand, we also find that some information present in the expert segmentations is not captured by the automatic classifiers, suggesting that common agreement alone may not be sufficient for a precise performance evaluation of brain tissue classifiers. PMID:17532646
Abdominal auscultation does not provide clear clinical diagnoses.
Durup-Dickenson, Maja; Christensen, Marie Kirk; Gade, John
2013-05-01
Abdominal auscultation is a part of the clinical examination of patients, but the determining factors in bowel sound evaluation are poorly described. The aim of this study was to assess inter- and intra-observer agreement in physicians' evaluation of pitch, intensity and quantity in abdominal auscultation. A total of 100 physicians were presented with 20 bowel sound recordings in a blinded set-up. Recordings had been made in a mix of healthy volunteers and emergency patients. They evaluated pitch, intensity and quantity of bowel sounds in a questionnaire with three, three and four categories of answers, respectively. Fleiss' multi-rater kappa (κ) coefficients were calculated for inter-observer agreement; for intra-observer agreement, calculation of probability was performed. Inter-observer agreement regarding pitch, intensity and quantity yielded κ-values of 0.19 (p < 0.0001), 0.30 (p < 0.0001) and 0.24 (p < 0.0001), respectively, corresponding to slight, fair and fair agreement. Regarding intra-observer agreement, the probability of agreement was 0.55 (95% confidence interval (CI): 0.51-0.59), 0.45 (95% CI: 0.42-0.49) and 0.41 (95% CI: 0.38-0.45) for pitch, intensity and quantity, respectively. Although relatively poor, observer agreement was slight to fair and thus better than expected by chance. Since the diagnostic value of auscultation increases with addition of history and clinics, and may be further improved by systematic training, it should still be used in the examination of patients with acute abdominal pain. not relevant. not relevant.
Rousseau, Jacqueline; Kaegi, Christine; Boudreault, Renée; Nadeau, Sylvie
2008-01-01
Purpose: The GEM scale is an objective assessment tool, specifically developed for older adults, to evaluate walking safety using standardized tasks. The purpose of this study was to estimate the interrater and test–retest agreement of the GEM scale. Method: Participants (n = 41; ≥ 65 years) were recruited from geriatric units and assessed independently and simultaneously by three raters on two occasions using the GEM scale. Kappa coefficients and percentage agreement were calculated for each item of the scale. Results: A majority of walking items (n = 22) showed fair to substantial interrater agreement (κ ≥ 0.25) and substantial to almost perfect test–retest agreement (κ ≥ 0.60). Mean percentage agreement was high for both interrater and test–retest agreement (79% ± 15% and 83% ± 16% respectively). Moreover, detailed analyses demonstrated that the relatively low agreement of some items resulted from changes in the performance of some participants and the low variability of scores. Although some walking items showed less agreement, the final decision regarding the participants’ ability to walk safely resulted in moderate to substantial interrater and test–retest agreement. Conclusion: The GEM scale is a new assessment tool that can now be used with estimated interrater and test–retest properties to allow therapists to objectively evaluate walking safety among the elderly. PMID:20145760
Cheng, Annie; Kirby, James E
2014-03-01
To evaluate the performance of the Hologic Gen-Probe (San Diego, CA) PANTHER system. The performance of PANTHER was compared with the Hologic Gen-Probe TIGRIS and/or Roche (Indianapolis, IN) COBAS AMPLICOR systems through testing of patient specimens and the spiked-urine matrix. After discrepant resolution, PANTHER demonstrated a 99.3% (95% confidence interval [CI], 96.0%-99.9%) positive and 100% (98.5%-100.0%) negative agreement for Chlamydia trachomatis (CT) and 100% (96.6%-100.0%) positive and 100% (98.6%-100.0%) negative agreement for Neisseria gonorrhoeae (NG) for all male, female, unsexed, and NG-spiked female urine specimens combined. For other specimen types collectively, the PANTHER demonstrated 100% (95% CI, 90.6%-100.0%) positive and 100% (88.3%-100.0%) negative agreement for CT and 90.9% (62.8%-98.4%) positive and 100% (93.5%-100.0%) negative agreement for NG. Analytical sensitivity of the PANTHER in urine matrix was similar to the TIGRIS system. The PANTHER system provides an excellent new addition to options for detecting CT and NG, is appropriate for testing urine samples, and will facilitate high-throughput testing in the clinical laboratory.
The Problem of Limited Inter-rater Agreement in Modelling Music Similarity
Flexer, Arthur; Grill, Thomas
2016-01-01
One of the central goals of Music Information Retrieval (MIR) is the quantification of similarity between or within pieces of music. These quantitative relations should mirror the human perception of music similarity, which is however highly subjective with low inter-rater agreement. Unfortunately this principal problem has been given little attention in MIR so far. Since it is not meaningful to have computational models that go beyond the level of human agreement, these levels of inter-rater agreement present a natural upper bound for any algorithmic approach. We will illustrate this fundamental problem in the evaluation of MIR systems using results from two typical application scenarios: (i) modelling of music similarity between pieces of music; (ii) music structure analysis within pieces of music. For both applications, we derive upper bounds of performance which are due to the limited inter-rater agreement. We compare these upper bounds to the performance of state-of-the-art MIR systems and show how the upper bounds prevent further progress in developing better MIR systems. PMID:28190932
Beaufrère, Hugues; Pariaut, Romain; Rodriguez, Daniel; Nevarez, Javier G; Tully, Thomas N
2012-10-01
To assess the agreement and reliability of cardiac measurements obtained with 3 echocardiographic techniques in anesthetized red-tailed hawks (Buteo jamaicensis). 10 red-tailed hawks. Transcoelomic, contrast transcoelomic, and transesophageal echocardiographic evaluations of the hawks were performed, and cineloops of imaging planes were recorded. Three observers performed echocardiographic measurements of cardiac variables 3 times on 3 days. The order in which hawks were assessed and echocardiographic techniques were used was randomized. Results were analyzed with linear mixed modeling, agreement was assessed with intraclass correlation coefficients, and variation was estimated with coefficients of variation. Significant differences were evident among the 3 echocardiographic methods for most measurements, and the agreement among findings was generally low. Interobserver agreement was generally low to medium. Intraobserver agreement was generally medium to high. Overall, better agreement was achieved for the left ventricular measurements and for the transesophageal approach than for other measurements and techniques. Echocardiographic measurements in hawks were not reliable, except when the left ventricle was measured by the same observer. Furthermore, cardiac morphometric measurements may not be clinically important. When measurements are required, one needs to consider that follow-up measurements should be performed by the same echocardiographer and should show at least a 20% difference from initial measurements to be confident that any difference is genuine.
Repeated Low Teaching Evaluations: A Form of Habitual Behaviour?
ERIC Educational Resources Information Center
Grayson, J. Paul
2015-01-01
Teaching evaluations have become part of life on Canadian campuses; however, there is no agreement among researchers as to their validity. In this article, comparisons were made between first- and third-year collective evaluations of professors' performance at the University of British Columbia, York University, and McGill University. Overall, it…
Development of an interprofessional lean facilitator assessment scale.
Bravo-Sanchez, Cindy; Dorazio, Vincent; Denmark, Robert; Heuer, Albert J; Parrott, J Scott
2018-05-01
High reliability is important for optimising quality and safety in healthcare organisations. Reliability efforts include interprofessional collaborative practice (IPCP) and Lean quality/process improvement strategies, which require skilful facilitation. Currently, no validated Lean facilitator assessment tool for interprofessional collaboration exists. This article describes the development and pilot evaluation of such a tool; the Interprofessional Lean Facilitator Assessment Scale (ILFAS), which measures both technical and 'soft' skills, which have not been measured in other instruments. The ILFAS was developed using methodologies and principles from Lean/Shingo, IPCP, metacognition research and Bloom's Taxonomy of Learning Domains. A panel of experts confirmed the initial face validity of the instrument. Researchers independently assessed five facilitators, during six Lean sessions. Analysis included quantitative evaluation of rater agreement. Overall inter-rater agreement of the assessment of facilitator performance was high (92%), and discrepancies in the agreement statistics were analysed. Face and content validity were further established, and usability was evaluated, through primary stakeholder post-pilot feedback, uncovering minor concerns, leading to tool revision. The ILFAS appears comprehensive in the assessment of facilitator knowledge, skills, abilities, and may be useful in the discrimination between facilitators of different skill levels. Further study is needed to explore instrument performance and validity.
(Dis)agreement on Sight-Singing Assessment of Undergraduate Musicians
Bortz, Graziela; Germano, Nayana G.; Cogo-Moreira, Hugo
2018-01-01
Assessment criteria for sight-singing abilities are similar to those used to judge music performances across music school programs. However, little evidence of agreement among judges has been provided in the literature. Fifty out of 152 participants were randomly selected and blindly assessed by three judges, who evaluated students based on given criteria. Participants were recorded while sight-singing 19 intervals and 10 tonal melodies. Interjudge agreement on melodic sight-singing was tested considering four items in a five-point Likert scale format as follows: (1) Intonation and pitch accuracy; (2) Tonal sense and memory; (3) Rhythmic precision, regularity of pulse and subdivisions; (4) Fluency and music direction. Intervals were scored considering a 3-point Likert scale. Agreement was conducted using weighted kappa. For melodic sight-singing considering the ten tonal melodies, on average, the weighted kappa (κw) were: κ1w = 0.296, κ2w = 0.487, κ3w = 0.224, and κ4w = 0.244, ranging from fair to moderate.. For intervals, the lowest agreement was kappa = 0.406 and the highest was kappa = 0.792 (on average, kappa = 0.637). These findings light up the discussion on the validity and reliability of models that have been taken for granted in assessing music performance in auditions and contests, and illustrate the need to better discuss evaluation criteria. PMID:29896144
von Arx, Thomas; Janner, Simone F M; Hänni, Stefan; Bornstein, Michael M
2016-02-01
Conventional 2-dimensional radiography uses defined criteria for outcome assessment of apical surgery. However, these radiographic healing criteria are not applicable for 3-dimensional radiography. The present study evaluated the repeatability and reproducibility of new cone-beam computed tomographic (CBCT)-based healing criteria for the judgment of periapical healing 1 year after apical surgery. CBCT scans taken 1 year after apical surgery (61 roots of 54 teeth in 54 patients, mean age = 54.4 years) were evaluated by 3 blinded and calibrated observers using 4 different indices. Reformatted buccolingual CBCT sections through the longitudinal axis of the treated roots were analyzed. Radiographic healing was assessed at the resection plane (R index), within the apical area (A index), of the cortical plate (C index), and regarding a combined apical-cortical area (B index). All readings were performed twice to calculate the intraobserver agreement (repeatability). Second-time readings were used for analyzing the interobserver agreement (reproducibility). Various statistical tests (Cohen, kappa, Fisher, and Spearman) were performed to measure the intra- and interobserver concurrence, the variability of score ratios, and the correlation of indices. For all indices, the rates of identical first- and second-time scores were always higher than 80% (intraobserver Cohen κ values ranging from 0.793 to 0.963). The B index (94.0%) showed the highest intraobserver agreement. Regarding interobserver agreement, the highest rate was found for the B index (72.1%). The Fleiss' κ values for R and B indices exhibited substantial agreement (0.626 and 0.717, respectively), whereas the values for A and C indices showed moderate agreement (0.561 and 0.573, respectively). The Spearman correlation coefficients for R, A, C, and B indices all exhibited a moderate to very strong correlation with the highest correlation found between C and B indices (rs = 0.8069). All indices showed an excellent intraobserver agreement (repeatability). With regard to interobserver agreement (reproducibility), the B index (healing of apical and cortical defects combined) and the R index (healing on the resection plane) showed substantial congruence and thus are to be recommended in future studies when using buccolingual CBCT sections for radiographic outcome assessment of apical surgery. Copyright © 2016 American Association of Endodontists. Published by Elsevier Inc. All rights reserved.
Postma, M J; Kwik, J J; Rutten, W J M J; de Jong-van den Berg, L T W; Brouwers, J R B J
2002-06-08
To investigate whether the health-economics research published in Dutch journals is in agreement with the guidelines for pharmaco-economic research as published in 1999 by the Dutch 'College voor zorgverzekeringen' [Healthcare Insurance Board]. Descriptive. A literature search was performed for health-economic evaluations published during 1990-1999 in Dutch journals such as Nederlands Tijdschrift voor Geneeskunde (NTvG), Pharmaceutisch Weekblad and Tijdschrift voor Gezondheidswetenschappen. The resulting 19 studies, primarily published in the NTvG, were compared with the pharmaco-economic guidelines, summarised in 9 criteria. The social perspective was taken in 5 studies (26%), non-related medical costs were excluded in 19 (100%), relevant subgroup analyses were performed in 16 (84%), effectiveness was explicitly differentiated from efficacy in 15 (79%), incremental analysis was performed in 18 (95%), an adequate period of analysis was chosen in 16 (84%), costs, benefits and health gains were discounted in 13 (68%), sensitivity analysis was included in the investigation in 10 (53%), and estimated cost prices were used in 9 (47%). The methodology in recently published Dutch health-economics research was partly in agreement with the later official guidelines for pharmaco-economic studies in the Netherlands.
Scaini, Simona; Ogliari, Anna; De Carolis, Ludovica; Bellodi, Laura; Di Serio, Clelia; Brombin, Chiara
2017-01-01
Background: A great part of the literature has confirmed the importance of both child and parents reports as source of factual information, especially for childhood emotional syndromes. In our study we aimed at: (i) calculating mother-child agreement and (ii) evaluating factorial structure of the Screen for Child Anxiety Related Emotional Disorders (SCARED) questionnaire in an Italian clinical sample. The novelty of this contribution is two-fold: first, from a clinical point of view, we investigated the parent-child agreement level and examined separately the factorial structures of both parent and child versions of the SCARED for the first time in an Italian clinical sample. Second, unlike previous studies, we used statistical approaches specifically suited to account for the ordinal nature of the collected variables. Method: In a clinical sample of 171 children and adolescents aged 8–18 and their mothers we evaluated inter-rater agreement using weighted kappa indices to assess agreement for each item belonging to a certain SCARED subscale. Exploratory factor analysis for ordinal data was then performed on the polychoric correlation matrix calculated on SCARED items. Differences in the numbers of symptoms reported by children and parents were evaluated as well. Results and Conclusions: Our results reveal moderate to strong mother-child agreement. A significant age effect is present. Two different factorial solutions emerged for parent and child SCARED versions (a 5 factor structure for parents and a 6 factor solution in the child version, including a new factor “Worry about Parents”). This study confirmed the importance of evaluating both child and parent reports in assessment protocols for anxiety disorders. Our findings could help clinicians to determine which information, and from which rater, must be accounted for in evaluating treatment decisions. Moreover, we find that patients characteristics, such as gender and age, should be taken into account when assessing agreement. PMID:28286488
48 CFR 5215.605 - Evaluation factors.
Code of Federal Regulations, 2010 CFR
2010-10-01
... appropriate premiums for measured increments of quality. When a cost/benefit approach is used, cost must carry... recent Forward Pricing Rate Agreements (FPRAs)). (iv) Cost realism evaluation generally will be performed... realism data are required, the contracting officer shall not request a formal field pricing report but...
An evaluation of the state of time synchronization on leadership class supercomputers
Jones, Terry; Ostrouchov, George; Koenig, Gregory A.; ...
2017-10-09
We present a detailed examination of time agreement characteristics for nodes within extreme-scale parallel computers. Using a software tool we introduce in this paper, we quantify attributes of clock skew among nodes in three representative high-performance computers sited at three national laboratories. Our measurements detail the statistical properties of time agreement among nodes and how time agreement drifts over typical application execution durations. We discuss the implications of our measurements, why the current state of the field is inadequate, and propose strategies to address observed shortcomings.
An evaluation of the state of time synchronization on leadership class supercomputers
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jones, Terry; Ostrouchov, George; Koenig, Gregory A.
We present a detailed examination of time agreement characteristics for nodes within extreme-scale parallel computers. Using a software tool we introduce in this paper, we quantify attributes of clock skew among nodes in three representative high-performance computers sited at three national laboratories. Our measurements detail the statistical properties of time agreement among nodes and how time agreement drifts over typical application execution durations. We discuss the implications of our measurements, why the current state of the field is inadequate, and propose strategies to address observed shortcomings.
Therapeutic drug monitoring of infliximab: performance evaluation of three commercial ELISA kits.
Schmitz, Ellen M H; van de Kerkhof, Daan; Hamann, Dörte; van Dongen, Joost L J; Kuijper, Philip H M; Brunsveld, Luc; Scharnhorst, Volkher; Broeren, Maarten A C
2016-07-01
Therapeutic drug monitoring (TDM) of infliximab (IFX, Remicade®) can aid to optimize therapy efficacy. Many assays are available for this purpose. However, a reference standard is lacking. Therefore, we evaluated the analytical performance, agreement and clinically relevant differences of three commercially available IFX ELISA kits on an automated processing system. The kits of Theradiag (Lisa Tracker Infliximab), Progenika (Promonitor IFX) and apDia (Infliximab ELISA) were implemented on an automated processing system. Imprecision was determined by triplicate measurements of patient samples on five days. Agreement was evaluated by analysis of 30 patient samples and four spiked samples by the selected ELISA kits and the in-house IFX ELISA of Sanquin Diagnostics (Amsterdam, The Netherlands). Therapeutic consequences were evaluated by dividing patients into four treatment groups using cut-off levels of 1, 3 and 7 μg/mL and determining assay concordance. Within-run and between-run imprecision were acceptable (≤12% and ≤17%, respectively) within the quantification range of the selected ELISA kits. The apDia assay had the best precision and agreement to target values. Statistically significant differences were found between all assays except between Sanquin Diagnostics and the Lisa Tracker assay. The Promonitor assay measured the lowest IFX concentrations, the apDia assay the highest. When patients were classified in four treatment categories, 70% concordance was achieved. Although all assays are suitable for TDM, significant differences were observed in both imprecision and agreement. Therapeutic consequences were acceptable when patients were divided in treatment categories, but this could be improved by assay standardization.
Inter and intra-observer concordance for the diagnosis of portal hypertension gastropathy.
Casas, Meritxell; Vergara, Mercedes; Brullet, Enric; Junquera, Félix; Martínez-Bauer, Eva; Miquel, Mireia; Sánchez-Delgado, Jordi; Dalmau, Blai; Campo, Rafael; Calvet, Xavier
2018-03-01
At present there is no fully accepted endoscopic classification for the assessment of the severity of portal hypertensive gastropathy (PHG). Few studies have evaluated inter and intra-observer concordance or the degree of concordance between different endoscopic classifications. To evaluate inter and intra-observer agreement for the presence of portal hypertensive gastropathy and enteropathy using different endoscopic classifications. Patients with liver cirrhosis were included into the study. Enteroscopy was performed under sedation. The location of lesions and their severity was recorded. Images were videotaped and subsequently evaluated independently by three different endoscopists, one of whom was the initial endoscopist. The agreement between observations was assessed using the kappa index. Seventy-four patients (mean age 63.2 years, 53 males and 21 females) were included. The agreement between the three endoscopists regarding the presence or absence of PHG using the Tanoue and McCormack classifications was very low (kappa scores = 0.16 and 0.27, respectively). The current classifications of portal hypertensive gastropathy have a very low degree of intra and inter-observer agreement for the diagnosis and assessment of gastropathy severity.
A Local Agreement Pattern Measure Based on Hazard Functions for Survival Outcomes
Dai, Tian; Guo, Ying; Peng, Limin; Manatunga, Amita K.
2017-01-01
Summary Assessing agreement is often of interest in biomedical and clinical research when measurements are obtained on the same subjects by different raters or methods. Most classical agreement methods have been focused on global summary statistics, which cannot be used to describe various local agreement patterns. The objective of this work is to study the local agreement pattern between two continuous measurements subject to censoring. In this paper, we propose a new agreement measure based on bivariate hazard functions to characterize the local agreement pattern between two correlated survival outcomes. The proposed measure naturally accommodates censored observations, fully captures the dependence structure between bivariate survival times and provides detailed information on how the strength of agreement evolves over time. We develop a nonparametric estimation method for the proposed local agreement pattern measure and study theoretical properties including strong consistency and asymptotical normality. We then evaluate the performance of the estimator through simulation studies and illustrate the method using a prostate cancer data example. PMID:28724196
A local agreement pattern measure based on hazard functions for survival outcomes.
Dai, Tian; Guo, Ying; Peng, Limin; Manatunga, Amita K
2018-03-01
Assessing agreement is often of interest in biomedical and clinical research when measurements are obtained on the same subjects by different raters or methods. Most classical agreement methods have been focused on global summary statistics, which cannot be used to describe various local agreement patterns. The objective of this work is to study the local agreement pattern between two continuous measurements subject to censoring. In this article, we propose a new agreement measure based on bivariate hazard functions to characterize the local agreement pattern between two correlated survival outcomes. The proposed measure naturally accommodates censored observations, fully captures the dependence structure between bivariate survival times and provides detailed information on how the strength of agreement evolves over time. We develop a nonparametric estimation method for the proposed local agreement pattern measure and study theoretical properties including strong consistency and asymptotical normality. We then evaluate the performance of the estimator through simulation studies and illustrate the method using a prostate cancer data example. © 2017, The International Biometric Society.
Becker, Anton S; Cornelius, Alexander; Reiner, Cäcilia S; Stocker, Daniel; Ulbrich, Erika J; Barth, Borna K; Mortezavi, Ashkan; Eberli, Daniel; Donati, Olivio F
2017-09-01
to simultaneously evaluate interreader agreement and diagnostic accuracy in the of PI-RADS v2 and compare it to v1. A total of 67 patients (median age 65.3 y, range 51.2-78.2 y; PSA 6.8μg/L, 0.2-33μg/L) undergoing MRI of the prostate and subsequent transperineal template biopsy within ≤6 months from MRI were included. Four readers from two institutions evaluated the likelihood of prostate cancer using PI-RADS v1 and v2 in two separate reading sessions ≥3 months apart. Interreader agreement was assessed for each pulse-sequence and for total PI-RADS scores using the intraclass correlation coefficient (ICC). Differences were considered significant for non-overlapping 95%-confidence intervals. Diagnostic accuracy was assessed with the area under the receiver operating characteristic curve (A Z ). A p-value <0.05 was considered statistically significant. Interreader agreement for DCE-scores was good in v2 (ICC 2 =0.70; 95% CI: 0.66-0.74) and slightly lower in v1 (ICC 1 =0.64, 0.59-0.69). Agreement for DWI scores (ICC 1 =0.77, ICC 2 =0.76) as well as final PI-RADS scores per quadrant were nearly identical (ICC 1 =ICC 2 =0.71). Diagnostic accuracy showed no significant differences (p=0.09-0.93) between v1 and v2 in any of the readers (range: A Z =0.78-0.88). PI-RADS scores show similar interreader agreement in v2 and v1 at comparable diagnostic performance. The simplification of the DCE interpretation in v2 might slightly improve agreement while not negatively affecting diagnostic performance. Copyright © 2017 Elsevier B.V. All rights reserved.
Krogh, T P; Fredberg, U; Christensen, R; Stengaard-Pedersen, K; Ellingsen, T
2013-10-01
Tennis elbow, also known as lateral epicondylitis (LE), is a common disorder often assessed by ultrasound. The aim of this study was to evaluate the ultrasonographic outcomes and methods used in LE research and clinical practice. This study was designed as an intra- and interobserver reliability and agreement study. Ultrasonographic examination of the common extensor tendon of the elbow was performed. The intraobserver study examined tendon thickness twice in 20 right elbows from 20 healthy individuals at an interval of 7 to 12 days. The interobserver study examined tendon thickness, color Doppler activity, and bony spurs in 18 right elbows in 9 healthy individuals and 9 patients with LE. Two trained rheumatologists performed the interobserver examinations with the same scanner on the same day. The main outcomes were intra- and interclass correlation (ICC) and agreement. In the intraobserver study, the ICC with regard to tendon thickness ranged from 0.76 to 0.81, depending on the measurement techniques used. The agreement ranged from 0.06 to 0.13 mm. In the interobserver study, the tendon thickness ICC ranged from 0.45 to 0.65 and the agreement ranged from -0.17 to 0.13 mm. The ICC for color Doppler activity was 0.93, with agreement in 14/18 (78 %) of the cases. A perfect reliability was demonstrated for bony spurs, with an ICC of 1 and exact agreement in 18/18 (100 %) of the cases. Good to excellent reliability was obtained for all measurements. The ultrasonographic techniques evaluated in this trial can be recommended for use in both research and clinical practice. © Georg Thieme Verlag KG Stuttgart · New York.
Partnering to Establish and Study Simulation in International Nursing Education.
Garner, Shelby L; Killingsworth, Erin; Raj, Leena
The purpose of this article was to describe an international partnership to establish and study simulation in India. A pilot study was performed to determine interrater reliability among faculty new to simulation when evaluating nursing student competency performance. Interrater reliability was below the ideal agreement level. Findings in this study underscore the need to obtain baseline interrater reliability data before integrating competency evaluation into a simulation program.
ERIC Educational Resources Information Center
McGair, Charles D.
2012-01-01
Many theories, methods, and practices are utilized to evaluate teachers with the intention of determining teacher effectiveness to better inform decisions about retention, tenure, certification and performance-based pay. In the 21st century there has been a renewed emphasis on teacher evaluation in public schools, largely due to federal "Race…
Georgakis, D. Christine; Trace, David A.; Naeymi-Rad, Frank; Evens, Martha
1990-01-01
Medical expert systems require comprehensive evaluation of their diagnostic accuracy. The usefulness of these systems is limited without established evaluation methods. We propose a new methodology for evaluating the diagnostic accuracy and the predictive capacity of a medical expert system. We have adapted to the medical domain measures that have been used in the social sciences to examine the performance of human experts in the decision making process. Thus, in addition to the standard summary measures, we use measures of agreement and disagreement, and Goodman and Kruskal's λ and τ measures of predictive association. This methodology is illustrated by a detailed retrospective evaluation of the diagnostic accuracy of the MEDAS system. In a study using 270 patients admitted to the North Chicago Veterans Administration Hospital, diagnoses produced by MEDAS are compared with the discharge diagnoses of the attending physicians. The results of the analysis confirm the high diagnostic accuracy and predictive capacity of the MEDAS system. Overall, the agreement of the MEDAS system with the “gold standard” diagnosis of the attending physician has reached a 90% level.
7 CFR 636.19 - Access to operating unit.
Code of Federal Regulations, 2014 CFR
2014-01-01
..., DEPARTMENT OF AGRICULTURE LONG TERM CONTRACTING WILDLIFE HABITAT INCENTIVE PROGRAM § 636.19 Access to... information necessary to evaluate the habitat development performance specified in the cost-share agreements...
7 CFR 636.19 - Access to operating unit.
Code of Federal Regulations, 2013 CFR
2013-01-01
..., DEPARTMENT OF AGRICULTURE LONG TERM CONTRACTING WILDLIFE HABITAT INCENTIVE PROGRAM § 636.19 Access to... information necessary to evaluate the habitat development performance specified in the cost-share agreements...
7 CFR 636.19 - Access to operating unit.
Code of Federal Regulations, 2011 CFR
2011-01-01
..., DEPARTMENT OF AGRICULTURE LONG TERM CONTRACTING WILDLIFE HABITAT INCENTIVE PROGRAM § 636.19 Access to... information necessary to evaluate the habitat development performance specified in the cost-share agreements...
7 CFR 636.19 - Access to operating unit.
Code of Federal Regulations, 2012 CFR
2012-01-01
..., DEPARTMENT OF AGRICULTURE LONG TERM CONTRACTING WILDLIFE HABITAT INCENTIVE PROGRAM § 636.19 Access to... information necessary to evaluate the habitat development performance specified in the cost-share agreements...
25 CFR 224.131 - What is a periodic review and evaluation?
Code of Federal Regulations, 2011 CFR
2011-04-01
... 224.131 Indians BUREAU OF INDIAN AFFAIRS, DEPARTMENT OF THE INTERIOR ENERGY AND MINERALS TRIBAL ENERGY RESOURCE AGREEMENTS UNDER THE INDIAN TRIBAL ENERGY DEVELOPMENT AND SELF DETERMINATION ACT Periodic Reviews... the Director performs to monitor a tribe's performance of activities associated with the development...
25 CFR 224.131 - What is a periodic review and evaluation?
Code of Federal Regulations, 2010 CFR
2010-04-01
... 224.131 Indians BUREAU OF INDIAN AFFAIRS, DEPARTMENT OF THE INTERIOR ENERGY AND MINERALS TRIBAL ENERGY RESOURCE AGREEMENTS UNDER THE INDIAN TRIBAL ENERGY DEVELOPMENT AND SELF DETERMINATION ACT Periodic Reviews... the Director performs to monitor a tribe's performance of activities associated with the development...
Schot, Marjolein J C; van Delft, Sanne; Kooijman-Buiting, Antoinette M J; de Wit, Niek J; Hopstaken, Rogier M
2015-05-18
Various point-of-care testing (POCT) urine analysers are commercially available for routine urine analysis in general practice. The present study compares analytical performance, agreement and user-friendliness of six different POCT urine analysers for diagnosing urinary tract infection in general practice. All testing procedures were performed at a diagnostic centre for primary care in the Netherlands. Urine samples were collected at four general practices. Analytical performance and agreement of the POCT analysers regarding nitrite, leucocytes and erythrocytes, with the laboratory reference standard, was the primary outcome measure, and analysed by calculating sensitivity, specificity, positive and negative predictive value, and Cohen's κ coefficient for agreement. Secondary outcome measures were the user-friendliness of the POCT analysers, in addition to other characteristics of the analysers. The following six POCT analysers were evaluated: Uryxxon Relax (Macherey Nagel), Urisys 1100 (Roche), Clinitek Status (Siemens), Aution 11 (Menarini), Aution Micro (Menarini) and Urilyzer (Analyticon). Analytical performance was good for all analysers. Compared with laboratory reference standards, overall agreement was good, but differed per parameter and per analyser. Concerning the nitrite test, the most important test for clinical practice, all but one showed perfect agreement with the laboratory standard. For leucocytes and erythrocytes specificity was high, but sensitivity was considerably lower. Agreement for leucocytes varied between good to very good, and for the erythrocyte test between fair and good. First-time users indicated that the analysers were easy to use. They expected higher productivity and accuracy when using these analysers in daily practice. The overall performance and user-friendliness of all six commercially available POCT urine analysers was sufficient to justify routine use in suspected urinary tract infections in general practice. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Signori, Cácia; Collares, Kauê; Cumerlato, Catarina B F; Correa, Marcos B; Opdam, Niek J M; Cenci, Maximiliano S
2018-04-01
The aim of this study was to investigate the validity of assessment of intraoral digital photography in the evaluation of dental restorations. Intraoral photographs of anterior and posterior restorations were classified based on FDI criteria according to the need for intervention: no intervention, repair and replacement. Evaluations were performed by an experienced expert in restorative dentistry (gold standard evaluator) and 3 trained dentists (consensus). The clinical inspection was the reference standard method. The prevalence of failures was explored. Cohen's kappa statistic was used. Validity was accessed by sensitivity, specificity, likelihood ratio and predictives values. Higher prevalence of failed restorations intervention was identified by the intraoral photography (17.7%) in comparison to the clinical evaluation (14.1%). Moderate agreement in the diagnosis of total failures was shown between the methods for the gold standard evaluator (kappa = 0.51) and consensus of evaluators (kappa = 0.53). Gold standard evaluator and consensus showed substantial and moderate agreement for posterior restorations (kappa = 0.61; 0.59), and fair and moderate agreement for anterior restorations (kappa = 0.36; 0.43), respectively. The accuracy was 84.8% in the assessment by intraoral photographs. Sensitivity and specificity values of 87.5% and 89.3% were found. Under the limits of this study, the assessment of digital photography performed by intraoral camera is an indirect diagnostic method valid for the evaluation of dental restorations, mainly in posterior teeth. This method should be employed taking into account the higher detection of defects provided by the images, which are not always clinically relevant. Copyright © 2018 Elsevier Ltd. All rights reserved.
Souto Filho, João Tadeu D; Loureiro, Monique M; Pulcheri, Wolmar; Morais, José Carlos; Nucci, Marcio; Portugal, Rodrigo D
2015-07-25
Early assessment of response to chemotherapy in acute myeloid leukemia may be performed by examining bone marrow aspirate (BMA) or biopsy (BMB); a hypocellular bone marrow sample indicates adequate anti-leukemic activity. We sought to evaluate the quantitative and qualitative assessment of BMA performed on day 14 (D14) of chemotherapy, to verify the inter-observer agreement, to compare the results of BMA and BMB, and to evaluate the impact of D14 blast clearance on the overall survival (OS). A total of 107 patients who received standard induction chemotherapy and had bone marrow samples were included. BMA evaluation was performed by two observers using two methods: quantitative assessment and a qualitative (Likert) scale. ROC curves were obtained correlating the BMA quantification of blasts and the qualitative scale, by both observers, with BMB result as gold-standard. There was a significant agreement between the two observers in both the qualitative and quantitative assessments (Kw = 0.737, p < 0.001, and rs = 0.798, p < 0.001; ICC = 0.836, p < 0.001, respectively). The areas under the curve (AUC) were 0.924 and 0.946 for observer 1 and 0.867 and 0.870 for observer 2 for assessments of the percentage of blasts and qualitative scale, respectively. The best cutoff for blast percentage in BMA was 6% and 7% for observers 1 and 2, respectively. A similar analysis for the qualitative scale showed the best cutoff as "probably infiltrated". Patients who attained higher grades of cytoreduction on D14 had better OS. Evaluation of D14 BMA using both methods had a significant agreement with BMB and between observers, identifying a population of patients with poor outcome.
NASA Astrophysics Data System (ADS)
Javernick, Luke; Redolfi, Marco; Bertoldi, Walter
2018-05-01
New data collection techniques offer numerical modelers the ability to gather and utilize high quality data sets with high spatial and temporal resolution. Such data sets are currently needed for calibration, verification, and to fuel future model development, particularly morphological simulations. This study explores the use of high quality spatial and temporal data sets of observed bed load transport in braided river flume experiments to evaluate the ability of a two-dimensional model, Delft3D, to predict bed load transport. This study uses a fixed bed model configuration and examines the model's shear stress calculations, which are the foundation to predict the sediment fluxes necessary for morphological simulations. The evaluation is conducted for three flow rates, and model setup used highly accurate Structure-from-Motion (SfM) topography and discharge boundary conditions. The model was hydraulically calibrated using bed roughness, and performance was evaluated based on depth and inundation agreement. Model bed load performance was evaluated in terms of critical shear stress exceedance area compared to maps of observed bed mobility in a flume. Following the standard hydraulic calibration, bed load performance was tested for sensitivity to horizontal eddy viscosity parameterization and bed morphology updating. Simulations produced depth errors equal to the SfM inherent errors, inundation agreement of 77-85%, and critical shear stress exceedance in agreement with 49-68% of the observed active area. This study provides insight into the ability of physically based, two-dimensional simulations to accurately predict bed load as well as the effects of horizontal eddy viscosity and bed updating. Further, this study highlights how using high spatial and temporal data to capture the physical processes at work during flume experiments can help to improve morphological modeling.
Engjom, Trond; Pham, Khahn Do-Chong; Erchinger, Friedemann; Haldorsen, Ingfrid Salvesen; Gilja, Odd Helge; Dimcevski, Georg; Havre, Roald Flesland
2018-03-26
We aimed to evaluate the agreement of single criteria and dedicated scores from transabdominal ultrasound of the pancreas (US) compared to standards by endoscopic ultrasound (EUS) and computed tomography (CT). In this observational cohort study performed in a tertiary care center, US and EUS were performed in 110 patients referred for suspected CP. Based on the Mayo score, 52 patients were diagnosed with CP. The sonographic findings obtained by both methods were registered. The number of criteria was counted and scored according to the Rosemont score. Agreement between the number of detected US and EUS criteria was substantial (ICC = 0.74 [0.61 - 0.83]. Adding Rosemont weighting improved the agreement (ICC = 0.88 [0.81 - 0.92]). Regarding individual criteria, the agreement was substantial for the detection of calcifications (κ = 0.86) and moderate for cysts and irregular or dilated pancreatic duct (κ = 0.42 - 0.58). Agreement for the other criteria was poorer (κ≤ 0.40). The diagnostic performance indices [95 % CI] of US for diagnosing CP (using Mayo score as reference standard) were for the unweighted score: Sensitivity: 0.65 [0.51 - 0.78], specificity: 0.97 [0.87 - 1.00]; and for Rosemont score: Sensitivity: 0.75 [0.61 - 0.86], specificity: 0.95 [0.83 - 0.99]. The agreement between US and EUS for the unweighted and weighted scores was substantial. For the features calcifications, cysts and main pancreatic duct (MPD) changes, agreement was moderate to substantial. For the other detected US criteria, the agreement with EUS was too poor to be clinically relevant. © Georg Thieme Verlag KG Stuttgart · New York.
Performance Evaluation of the United Nations Environment Programme Air Quality Monitoring Unit
A request for technical collaboration between the UNEP and the US EPA resulted in the establishment of a MCRADA. The purpose of this agreement was to evaluate an air quality monitoring system (referred to as the UNEP pod) developed by the UNEP for use in environmental situations ...
One important way to get the best performance from staff.
Goldstein, J M
1999-03-01
Good communication about expectations is essential to encouraging the best performance from staff. An excellent tool for fostering this communication is the performance evaluation. Used correctly, it can open a dialogue that can lead to agreement between the supervisor and staff member on areas that need to be improved and ways to do so.
Rater agreement of visual lameness assessment in horses during lungeing.
Hammarberg, M; Egenvall, A; Pfau, T; Rhodin, M
2016-01-01
Lungeing is an important part of lameness examinations as the circular path may accentuate low-grade lameness. Movement asymmetries related to the circular path, to compensatory movements and to pain make the lameness evaluation complex. Scientific studies have shown high inter-rater variation when assessing lameness during straight line movement. The aim was to estimate inter- and intra-rater agreement of equine veterinarians evaluating lameness from videos of sound and lame horses during lungeing and to investigate the influence of veterinarians' experience and the objective degree of movement asymmetry on rater agreement. Cross-sectional observational study. Video recordings and quantitative gait analysis with inertial sensors were performed in 23 riding horses of various breeds. The horses were examined at trot on a straight line and during lungeing on soft or hard surfaces in both directions. One video sequence was recorded per condition and the horses were classified as forelimb lame, hindlimb lame or sound from objective straight line symmetry measurements. Equine veterinarians (n = 86), including 43 with >5 years of orthopaedic experience, participated in a web-based survey and were asked to identify the lamest limb on 60 videos, including 10 repeats. The agreements between (inter-rater) and within (intra-rater) veterinarians were analysed with κ statistics (Fleiss, Cohen). Inter-rater agreement κ was 0.31 (0.38/0.25 for experienced/less experienced) and higher for forelimb (0.33) than for hindlimb lameness (0.11) or soundness (0.08) evaluation. Median intra-rater agreement κ was 0.57. Inter-rater agreement was poor for less experienced raters, and for all raters when evaluating hindlimb lameness. Since identification of the lame limb/limbs is a prerequisite for successful diagnosis, treatment and recovery, the high inter-rater variation when evaluating lameness on the lunge is likely to influence the accuracy and repeatability of lameness examinations and, indirectly, the efficacy of treatment. © 2015 The Authors. Equine Veterinary Journal published by John Wiley & Sons Ltd on behalf of EVJ Ltd.
Abuhamad, Alfred; Zhao, Yili; Abuhamad, Sharon; Sinkovskaya, Elena; Rao, Rashmi; Kanaan, Camille; Platt, Lawrence
2016-01-01
This study aims to validate the feasibility and accuracy of a new standardized six-step approach to the performance of the focused basic obstetric ultrasound examination, and compare the new approach to the regular approach performed in the scheduled obstetric ultrasound examination. A new standardized six-step approach to the performance of the focused basic obstetric ultrasound examination, to evaluate fetal presentation, fetal cardiac activity, presence of multiple pregnancy, placental localization, amniotic fluid volume evaluation, and biometric measurements, was prospectively performed on 100 pregnant women between 18(+0) and 27(+6) weeks of gestation and another 100 pregnant women between 28(+0) and 36(+6) weeks of gestation. The agreement of findings for each of the six steps of the standardized six-step approach was evaluated against the regular approach. In all ultrasound examinations performed, substantial to perfect agreement (Kappa value between 0.64 and 1.00) was observed between the new standardized six-step approach and the regular approach. The new standardized six-step approach to the focused basic obstetric ultrasound examination can be performed successfully and accurately between 18(+0) and 36(+6) weeks of gestation. This standardized approach can be of significant benefit to limited resource settings and in point of care obstetric ultrasound applications. Thieme Medical Publishers 333 Seventh Avenue, New York, NY 10001, USA.
An Investigation into Specifying Service Level Agreements for Provisioning Cloud Computing Services
2012-12-01
IT .................................................................................................... Information Technology KPI ...the service delivery be measured? 3. Key Performance Indicators ( KPIs ): Describe the KPIs and the responsible party for producing the KPIs . 4...level objectives (SLOs) that are evaluated according to measurable Key Performance Indicators ( KPIs ). Automatic SLA protection enables further
Federal Register 2010, 2011, 2012, 2013, 2014
2010-08-12
... Project Performance Reports, Conversion of Use Provisions, and Grant Agreements and Amendments AGENCY.... 2. Conversion of Use Provisions Title: Urban Park and Recovery Program Conversion of Use Provisions...) All alternatives to the conversion have been evaluated and then rejected on a sound basis; (b...
25 CFR 224.82 - What activities will the Department continue to perform after approval of a TERA?
Code of Federal Regulations, 2010 CFR
2010-04-01
... assist the tribe in evaluating the physical, economic, financial, cultural, social, environmental, and... AND SELF DETERMINATION ACT Implementation of Tribal Energy Resource Agreements Applicable Authorities and Responsibilities § 224.82 What activities will the Department continue to perform after approval...
Quantitative evaluation of patient-specific quality assurance using online dosimetry system
NASA Astrophysics Data System (ADS)
Jung, Jae-Yong; Shin, Young-Ju; Sohn, Seung-Chang; Min, Jung-Whan; Kim, Yon-Lae; Kim, Dong-Su; Choe, Bo-Young; Suh, Tae-Suk
2018-01-01
In this study, we investigated the clinical performance of an online dosimetry system (Mobius FX system, MFX) by 1) dosimetric plan verification using gamma passing rates and dose volume metrics and 2) error-detection capability evaluation by deliberately introduced machine error. Eighteen volumetric modulated arc therapy (VMAT) plans were studied. To evaluate the clinical performance of the MFX, we used gamma analysis and dose volume histogram (DVH) analysis. In addition, to evaluate the error-detection capability, we used gamma analysis and DVH analysis utilizing three types of deliberately introduced errors (Type 1: gantry angle-independent multi-leaf collimator (MLC) error, Type 2: gantry angle-dependent MLC error, and Type 3: gantry angle error). A dosimetric verification comparison of physical dosimetry system (Delt4PT) and online dosimetry system (MFX), gamma passing rates of the two dosimetry systems showed very good agreement with treatment planning system (TPS) calculation. For the average dose difference between the TPS calculation and the MFX measurement, most of the dose metrics showed good agreement within a tolerance of 3%. For the error-detection comparison of Delta4PT and MFX, the gamma passing rates of the two dosimetry systems did not meet the 90% acceptance criterion with the magnitude of error exceeding 2 mm and 1.5 ◦, respectively, for error plans of Types 1, 2, and 3. For delivery with all error types, the average dose difference of PTV due to error magnitude showed good agreement between calculated TPS and measured MFX within 1%. Overall, the results of the online dosimetry system showed very good agreement with those of the physical dosimetry system. Our results suggest that a log file-based online dosimetry system is a very suitable verification tool for accurate and efficient clinical routines for patient-specific quality assurance (QA).
Hadlich, Marcelo Souza; Oliveira, Gláucia Maria Moraes; Feijóo, Raúl A; Azevedo, Clerio F; Tura, Bernardo Rangel; Ziemer, Paulo Gustavo Portela; Blanco, Pablo Javier; Pina, Gustavo; Meira, Márcio; Souza e Silva, Nelson Albuquerque de
2012-10-01
The standardization of images used in Medicine in 1993 was performed using the DICOM (Digital Imaging and Communications in Medicine) standard. Several tests use this standard and it is increasingly necessary to design software applications capable of handling this type of image; however, these software applications are not usually free and open-source, and this fact hinders their adjustment to most diverse interests. To develop and validate a free and open-source software application capable of handling DICOM coronary computed tomography angiography images. We developed and tested the ImageLab software in the evaluation of 100 tests randomly selected from a database. We carried out 600 tests divided between two observers using ImageLab and another software sold with Philips Brilliance computed tomography appliances in the evaluation of coronary lesions and plaques around the left main coronary artery (LMCA) and the anterior descending artery (ADA). To evaluate intraobserver, interobserver and intersoftware agreements, we used simple and kappa statistics agreements. The agreements observed between software applications were generally classified as substantial or almost perfect in most comparisons. The ImageLab software agreed with the Philips software in the evaluation of coronary computed tomography angiography tests, especially in patients without lesions, with lesions < 50% in the LMCA and < 70% in the ADA. The agreement for lesions > 70% in the ADA was lower, but this is also observed when the anatomical reference standard is used.
Sordi, Marina de; Mourão, Lucia Figueiredo; Silva, Ariovaldo Armando da; Flosi, Luciana Claudia Leite
2009-01-01
Patients with dysphagia have impairments in many aspects, and an interdisciplinary approach is fundamental to define diagnosis and treatment. A joint approach in the clinical and videoendoscopy evaluation is paramount. To study the correlation between the clinical assessment (ACD) and the videoendoscopic (VED) assessment of swallowing by classifying the degree of severity and the qualitative/descriptive analyses of the procedures. cross-sectional, descriptive and comparative. held from March to December of 2006, at the Otolaryngology/Dysphagia ward of a hospital in the country side of São Paulo. 30 dysphagic patients with different disorders were assessed by ACD and VED. The data was classified by means of severity scales and qualitative/ descriptive analysis. the correlation between severity ACD and VED scales pointed to a statistically significant low agreement (KAPA = 0.4) (p=0,006). The correlation between the qualitative/descriptive analysis pointed to an excellent and statistically significant agreement (KAPA=0.962) (p<0.001) concerning the entire sample. the low agreement between the severity scales point to a need to perform both procedures, reinforcing VED as a doable procedure. The descriptive qualitative analysis pointed to an excellent agreement, and such data reinforces our need to understand swallowing as a process.
Cañón-Montañez, Wilson; Oróstegui-Arenas, Myriam
2015-01-01
To determine the reliability (internal consistency, inter-rater reproducibility and level of agreement) of nursing outcome: "Knowledge: cardiac disease management (1830)" of the version published in Spanish, in outpatients with heart failure. A reliability study was conducted on 116 outpatients with heart failure. Six indicators of nursing outcome were operationalized. All participants were assessed simultaneously by two evaluators. Three evaluation periods were defined: initial (at baseline), final (a month later), and follow-up (two months later). Internal consistency by Cronbach alpha coefficient, inter-rater reproducibility with intraclass correlation coefficient of reproducibility or agreement and level agreement using the 95% limits of Bland and Altman. Cronbach's alpha was 0.83 (95% CI: 0.77 - 0.89) in the final evaluation, and follow-up values of 0.85 (95% CI: 0.82-0.89) and 0.83 (95% CI: 0.78 - 0.88) were found for the first and second evaluator, respectively. The intraclass correlation coefficient showed values greater 0.9 in the three evaluation periods in both the random and mixed model. The Bland-Altman 95% limits of agreement were close to zero in the three evaluations performed. The questionnaire operationalized to assess the nursing outcome: "Knowledge: cardiac disease management (1830)" in its Spanish version, is a reliable method to measure skills and knowledge in outpatients with heart failure in the Colombian context. Copyright © 2015 Elsevier España, S.L.U. All rights reserved.
Rei, Mariana; Tavares, Sara; Pinto, Pedro; Machado, Ana P; Monteiro, Sofia; Costa, Antónia; Costa-Santos, Cristina; Bernardes, João; Ayres-De-Campos, Diogo
2016-10-01
Visual analysis of cardiotocographic (CTG) tracings has been shown to be prone to poor intra- and interobserver agreement when several interpretation guidelines are used, and this may have an important impact on the technology's performance. The aim of this study was to evaluate agreement in CTG interpretation using the new 2015 FIGO guidelines on intrapartum fetal monitoring. A pre-existing database of intrapartum CTG tracings was used to sequentially select 151 cases acquired with a fetal electrode, with duration exceeding 60minutes, and signal loss less than 15%. These tracings were presented to six clinicians, three with more than 5 years' experience in the labor ward, and three with 5 or less years' experience. Observers were asked to evaluate tracings independently, to assess basic CTG features: baseline, variability, accelerations, decelerations, sinusoidal pattern, tachysystole, and to classify each tracing as normal, suspicious or pathologic, according to the 2015 FIGO guidelines on intrapartum fetal monitoring. Agreement between observers was evaluated using the proportions of agreement (Pa), with 95% confidence intervals (95%CI). A good interobserver agreement was found in the evaluation of most CTG features, but not bradycardia, reduced variability, saltatory pattern, absence of accelerations and absence of decelerations. For baseline classification Pa was 0.85 [0.82-0.90], for variability 0.82 [0.78-0.85], for accelerations 0.72 [0.68-0.75], for tachysystole 0.77 [0.74-0.81], for decelerations 0.92 [0.90-0.95], for variable decelerations 0.62 [0.58-0.65], for late decelerations 0.63 [0.59-0.66], for repetitive decelerations 0.73 [0.69-0.78], and for prolonged decelerations 0.81 [0.77-0.85]. For overall CTG classification, Pa were 0.60 [0.56-0.64], for classification as normal 0.67 [0.61-0.72], for suspicious 0.54 [0.48-0.60] and for pathologic 0.59 [0.51-0.66]. No differences in agreement according to the level of expertise were observed, except in the identification of accelerations, where it was better in the more experienced group. A good interobserver agreement was found in evaluation of most CTG features and in overall tracing classification. Results were better than those reported in previous studies evaluating agreement in overall tracing classification. Observer experience did not appear to play a role in agreement. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Measuring and Promoting Inter-Rater Agreement of Teacher and Principal Performance Ratings
ERIC Educational Resources Information Center
Graham, Matthew; Milanowski, Anthony; Miller, Jackson
2012-01-01
As states, districts, and schools transition toward more rigorous educator evaluation systems, they are placing additional weight on judgments about educator practice. Since teacher and principal observation ratings inherently rely on evaluators' professional judgment, there is always a question of how much the ratings depend on the particular…
An evaluation of classification systems for stillbirth
Flenady, Vicki; Frøen, J Frederik; Pinar, Halit; Torabi, Rozbeh; Saastad, Eli; Guyon, Grace; Russell, Laurie; Charles, Adrian; Harrison, Catherine; Chauke, Lawrence; Pattinson, Robert; Koshy, Rachel; Bahrin, Safiah; Gardener, Glenn; Day, Katie; Petersson, Karin; Gordon, Adrienne; Gilshenan, Kristen
2009-01-01
Background Audit and classification of stillbirths is an essential part of clinical practice and a crucial step towards stillbirth prevention. Due to the limitations of the ICD system and lack of an international approach to an acceptable solution, numerous disparate classification systems have emerged. We assessed the performance of six contemporary systems to inform the development of an internationally accepted approach. Methods We evaluated the following systems: Amended Aberdeen, Extended Wigglesworth; PSANZ-PDC, ReCoDe, Tulip and CODAC. Nine teams from 7 countries applied the classification systems to cohorts of stillbirths from their regions using 857 stillbirth cases. The main outcome measures were: the ability to retain the important information about the death using the InfoKeep rating; the ease of use according to the Ease rating (both measures used a five-point scale with a score <2 considered unsatisfactory); inter-observer agreement and the proportion of unexplained stillbirths. A randomly selected subset of 100 stillbirths was used to assess inter-observer agreement. Results InfoKeep scores were significantly different across the classifications (p ≤ 0.01) due to low scores for Wigglesworth and Aberdeen. CODAC received the highest mean (SD) score of 3.40 (0.73) followed by PSANZ-PDC, ReCoDe and Tulip [2.77 (1.00), 2.36 (1.21), 1.92 (1.24) respectively]. Wigglesworth and Aberdeen resulted in a high proportion of unexplained stillbirths and CODAC and Tulip the lowest. While Ease scores were different (p ≤ 0.01), all systems received satisfactory scores; CODAC received the highest score. Aberdeen and Wigglesworth showed poor agreement with kappas of 0.35 and 0.25 respectively. Tulip performed best with a kappa of 0.74. The remainder had good to fair agreement. Conclusion The Extended Wigglesworth and Amended Aberdeen systems cannot be recommended for classification of stillbirths. Overall, CODAC performed best with PSANZ-PDC and ReCoDe performing well. Tulip was shown to have the best agreement and a low proportion of unexplained stillbirths. The virtues of these systems need to be considered in the development of an international solution to classification of stillbirths. Further studies are required on the performance of classification systems in the context of developing countries. Suboptimal agreement highlights the importance of instituting measures to ensure consistency for any classification system. PMID:19538759
An evaluation of classification systems for stillbirth.
Flenady, Vicki; Frøen, J Frederik; Pinar, Halit; Torabi, Rozbeh; Saastad, Eli; Guyon, Grace; Russell, Laurie; Charles, Adrian; Harrison, Catherine; Chauke, Lawrence; Pattinson, Robert; Koshy, Rachel; Bahrin, Safiah; Gardener, Glenn; Day, Katie; Petersson, Karin; Gordon, Adrienne; Gilshenan, Kristen
2009-06-19
Audit and classification of stillbirths is an essential part of clinical practice and a crucial step towards stillbirth prevention. Due to the limitations of the ICD system and lack of an international approach to an acceptable solution, numerous disparate classification systems have emerged. We assessed the performance of six contemporary systems to inform the development of an internationally accepted approach. We evaluated the following systems: Amended Aberdeen, Extended Wigglesworth; PSANZ-PDC, ReCoDe, Tulip and CODAC. Nine teams from 7 countries applied the classification systems to cohorts of stillbirths from their regions using 857 stillbirth cases. The main outcome measures were: the ability to retain the important information about the death using the InfoKeep rating; the ease of use according to the Ease rating (both measures used a five-point scale with a score <2 considered unsatisfactory); inter-observer agreement and the proportion of unexplained stillbirths. A randomly selected subset of 100 stillbirths was used to assess inter-observer agreement. InfoKeep scores were significantly different across the classifications (p < or = 0.01) due to low scores for Wigglesworth and Aberdeen. CODAC received the highest mean (SD) score of 3.40 (0.73) followed by PSANZ-PDC, ReCoDe and Tulip [2.77 (1.00), 2.36 (1.21), 1.92 (1.24) respectively]. Wigglesworth and Aberdeen resulted in a high proportion of unexplained stillbirths and CODAC and Tulip the lowest. While Ease scores were different (p < or = 0.01), all systems received satisfactory scores; CODAC received the highest score. Aberdeen and Wigglesworth showed poor agreement with kappas of 0.35 and 0.25 respectively. Tulip performed best with a kappa of 0.74. The remainder had good to fair agreement. The Extended Wigglesworth and Amended Aberdeen systems cannot be recommended for classification of stillbirths. Overall, CODAC performed best with PSANZ-PDC and ReCoDe performing well. Tulip was shown to have the best agreement and a low proportion of unexplained stillbirths. The virtues of these systems need to be considered in the development of an international solution to classification of stillbirths. Further studies are required on the performance of classification systems in the context of developing countries. Suboptimal agreement highlights the importance of instituting measures to ensure consistency for any classification system.
Patient and Parent-Reported Signs and Symptoms for Group A Streptococcal Pharyngitis.
Lindgren, Christina; Neuman, Mark I; Monuteaux, Michael C; Mandl, Kenneth D; Fine, Andrew M
2016-07-01
Identifying symptomatic patients who are at low risk for group A streptococcal (GAS) pharyngitis could reduce unnecessary visits and antibiotic use. The accuracy with which patients and parents report signs and symptoms of GAS has not been studied. Our objectives were to measure agreement between patient or parent and physician-reported signs and symptoms of GAS and to evaluate the performance of a modified Centor score, based on patient or parent and physician reports, for identifying patients at low risk for GAS pharyngitis. Children 3 to 21 years old presenting to a single tertiary care emergency department between October 2013 and January 2015 were included if they complained of a sore throat and were tested for GAS. Patients or parents and physicians completed surveys assessing signs and symptoms to determine a modified age-adjusted Centor score for GAS. We evaluated the overall agreement and κ between patient or parent and physician-reported signs and symptoms and compared the performance of the scores based on assessments by patients or parents and physicians and the risk of GAS. Of 320 patients enrolled, 107 (33%) tested GAS positive. Agreement was higher for symptoms (fever [agreement = 82%, κ = 0.64] and cough [72%, 0.45]) than for signs (exudate [80%, 0.41] and tender cervical nodes [73%, 0.18]). Agreement was highest when no signs and symptoms contained in the Centor score were present (94%, κ = 0.61). The proportion of patients testing GAS positive rose as the modified Centor score increased. For identifying GAS pharyngitis, patients or parents and physicians showed moderate to substantial agreement for 3 of 4 key pharyngitis signs and symptoms. Copyright © 2016 by the American Academy of Pediatrics.
Prediction of pump cavitation performance
NASA Technical Reports Server (NTRS)
Moore, R. D.
1974-01-01
A method for predicting pump cavitation performance with various liquids, liquid temperatures, and rotative speeds is presented. Use of the method requires that two sets of test data be available for the pump of interest. Good agreement between predicted and experimental results of cavitation performance was obtained for several pumps operated in liquids which exhibit a wide range of properties. Two cavitation parameters which qualitatively evaluate pump cavitation performance are also presented.
Accuracy of ultrasound-guided nerve blocks of the cervical zygapophysial joints.
Siegenthaler, Andreas; Mlekusch, Sabine; Trelle, Sven; Schliessbach, Juerg; Curatolo, Michele; Eichenberger, Urs
2012-08-01
Cervical zygapophysial joint nerve blocks typically are performed with fluoroscopic needle guidance. Descriptions of ultrasound-guided block of these nerves are available, but only one small study compared ultrasound with fluoroscopy, and only for the third occipital nerve. To evaluate the potential usefulness of ultrasound-guidance in clinical practice, studies that determine the accuracy of this technique using a validated control are essential. The aim of this study was to determine the accuracy of ultrasound-guided nerve blocks of the cervical zygapophysial joints using fluoroscopy as control. Sixty volunteers were studied. Ultrasound-imaging was used to place the needle to the bony target of cervical zygapophysial joint nerve blocks. The levels of needle placement were determined randomly (three levels per volunteer). After ultrasound-guided needle placement and application of 0.2 ml contrast dye, fluoroscopic imaging was performed for later evaluation by a blinded pain physician and considered as gold standard. Raw agreement, chance-corrected agreement κ, and chance-independent agreement Φ between the ultrasound-guided placement and the assessment using fluoroscopy were calculated to quantify accuracy. One hundred eighty needles were placed in 60 volunteers. Raw agreement was 87% (95% CI 81-91%), κ was 0.74 (0.64-0.83), and Φ 0.99 (0.99-0.99). Accuracy varied significantly between the different cervical nerves: it was low for the C7 medial branch, whereas all other levels showed very good accuracy. Ultrasound-imaging is an accurate technique for performing cervical zygapophysial joint nerve blocks in volunteers, except for the medial branch blocks of C7.
Does adding clinical data to administrative data improve agreement among hospital quality measures?
Hanchate, Amresh D; Stolzmann, Kelly L; Rosen, Amy K; Fink, Aaron S; Shwartz, Michael; Ash, Arlene S; Abdulkerim, Hassen; Pugh, Mary Jo V; Shokeen, Priti; Borzecki, Ann
2017-09-01
Hospital performance measures based on patient mortality and readmission have indicated modest rates of agreement. We examined if combining clinical data on laboratory tests and vital signs with administrative data leads to improved agreement with each other, and with other measures of hospital performance in the nation's largest integrated health care system. We used patient-level administrative and clinical data, and hospital-level data on quality indicators, for 2007-2010 from the Veterans Health Administration (VA). For patients admitted for acute myocardial infarction (AMI), heart failure (HF) and pneumonia we examined changes in hospital performance on 30-d mortality and 30-d readmission rates as a result of adding clinical data to administrative data. We evaluated whether this enhancement yielded improved measures of hospital quality, based on concordance with other hospital quality indicators. For 30-d mortality, data enhancement improved model performance, and significantly changed hospital performance profiles; for 30-d readmission, the impact was modest. Concordance between enhanced measures of both outcomes, and with other hospital quality measures - including Joint Commission process measures, VA Surgical Quality Improvement Program (VASQIP) mortality and morbidity, and case volume - remained poor. Adding laboratory tests and vital signs to measure hospital performance on mortality and readmission did not improve the poor rates of agreement across hospital quality indicators in the VA. Efforts to improve risk adjustment models should continue; however, evidence of validation should precede their use as reliable measures of quality. Published by Elsevier Inc.
Agreement studies in radiology research.
Farzin, B; Gentric, J-C; Pham, M; Tremblay-Paquet, S; Brosseau, L; Roy, C; Jamali, S; Chagnon, M; Darsaut, T E; Guilbert, F; Naggara, O; Raymond, J
2017-03-01
The goal of this study was to estimate the frequency and the quality of agreement studies published in diagnostic imaging journals. All studies published between January 2011 and December 2012 in four radiology journals were reviewed. Four trained readers evaluated agreement studies using a 24-item form that included the 15 items of the Guidelines for Reporting Reliability and Agreement Studies criteria. Of 2229 source titles, 280 studies (13%) reported agreement. The mean number of patients per study was 81±99 (SD) (range, 0-180). Justification for sample size was found in 9 studies (3%). The number of raters was≤2 in 226 studies (81%). No intra-observer study was performed in 212 (76%) articles. Confidence intervals and interpretation of statistical estimates were provided in 98 (35%) and 147 (53%) of the studies, respectively. In 168 studies (60%), the agreement study was not mentioned in the discussion section. In 8 studies (3%), reporting of the agreement study was judged to be adequate. Twenty studies (7%) were dedicated to agreement. Agreement studies are preliminary and not adequately reported. Studies dedicated to agreement are infrequent. They are research opportunities that should be promoted. Copyright © 2016 Éditions françaises de radiologie. Published by Elsevier Masson SAS. All rights reserved.
Marshall, Carrie; Mounzer, Rawad; Hall, Matt; Simon, Violette; Centeno, Barbara; Dennis, Katie; Dhillon, Jasreman; Fan, Fang; Khazai, Laila; Klapman, Jason; Komanduri, Srinadh; Lin, Xiaoqi; Lu, David; Mehrotra, Sanjana; Muthusamy, V Raman; Nayar, Ritu; Paintal, Ajit; Rao, Jianyu; Sams, Sharon; Shah, Janak; Watson, Rabindra; Rastogi, Amit; Wani, Sachin
2018-07-01
Despite the widespread use of endoscopic ultrasound-guided fine-needle aspiration (EUS-FNA) to sample pancreatic lesions and the standardization of pancreaticobiliary cytopathologic nomenclature, there are few data on inter-observer agreement among cytopathologists evaluating pancreatic cytologic specimens obtained by EUS-FNA. We developed a scoring system to assess agreement among cytopathologists in overall diagnosis and quantitative and qualitative parameters, and evaluated factors associated with agreement. We performed a prospective study to validate results from our pilot study that demonstrated moderate to substantial inter-observer agreement among cytopathologists for the final cytologic diagnosis. In the first phase, 3 cytopathologists refined criteria for assessment of quantity and quality measures. During phase 2, EUS-FNA specimens of solid pancreatic lesions from 46 patients were evaluated by 11 cytopathologists at 5 tertiary care centers using a standardized scoring tool. Individual quantitative and qualitative measures were scored and an overall cytologic diagnosis was determined. Clinical and EUS parameters were assessed as predictors of unanimous agreement. Inter-observer agreement (IOA) was calculated using multi-rater kappa (κ) statistics and a logistic regression model was created to identify factors associated with unanimous agreement. The IOA for final diagnoses, based on cytologic analysis, was moderate (κ = 0.56; 95% CI, 0.43-0.70). Kappa values did not increase when categories of suspicious for malignancy, malignant, and neoplasm were combined. IOA was slight to moderate for individual quantitative (κ = 0.007; 95% CI, -0.03 to -0.04) and qualitative parameters (κ = 0.5; 95% CI, 0.47-0.53). Jaundice was the only factor associated with agreement among all cytopathologists on multivariate analysis (odds ratio for unanimous agreement, 5.3; 95% CI, 1.1-26.89). There is a suboptimal level of agreement among cytopathologists in the diagnosis of malignancy based on analysis of EUS-FNA specimens obtained from solid pancreatic masses. Strategies are needed to refine the cytologic criteria for diagnosis of malignancy and enhance tissue acquisition techniques to improve diagnostic reproducibility among cytopathologists. Copyright © 2018 AGA Institute. Published by Elsevier Inc. All rights reserved.
Lee, Young Han; Hahn, Seok; Lim, Daekeon; Suh, Jin-Suck
2017-02-01
Background Conventionally, two-dimensional (2D) fast spin-echo (FSE) sequences have been widely used for clinical cartilage imaging as well as gradient (GRE) sequences. Recently, three-dimensional (3D) volumetric magnetic resonance imaging (MRI) has been introduced with one 3D volumetric scan, and this is replacing slice-by-slice 2D MR scans. Purpose To evaluate the image quality and diagnostic performance of two 3D sequences for abnormalities of knee cartilage: fat-suppressed (FS) FSE-based 3D volume isotropic turbo spin-echo acquisition (VISTA) and GRE-based 3D T1 high-resolution isovolumetric examination (THRIVE). Material and Methods The institutional review board approved the protocol of this retrospective review. This study enrolled 40 patients (41 knees) with arthroscopically confirmed abnormalities of cartilage. All patients underwent isovoxel 3D-VISTA and 3D-THRIVE MR sequences on 3T MRI. We assessed the cartilage grade on the two 3D sequences using arthroscopy as a gold standard. Inter-observer agreement for each technique was evaluated with the intraclass correlation coefficient (ICC). Differences in the area under the curve (AUC) were compared between the 3D-THRIVE and 3D-VISTA. Results Although inter-observer agreement for both sequences was excellent, the inter-observer agreement for 3D-VISTA was higher than for 3D-THRIVE for cartilage grading in all regions of the knee. There was no significant difference in the diagnostic performance ( P > 0.05) between the two sequences for detecting cartilage grade. Conclusion FSE-based 3D-VISTA images had good diagnostic performance that was comparable to GRE-based 3D-THRIVE images in the evaluation of knee cartilage, and can be used in routine knee MR protocols for the evaluation of cartilage.
TFM classification and staging of oral submucous fibrosis: A new proposal.
Arakeri, Gururaj; Thomas, Deepak; Aljabab, Abdulsalam S; Hunasgi, Santosh; Rai, Kirthi Kumar; Hale, Beverley; Fonseca, Felipe Paiva; Gomez, Ricardo Santiago; Rahimi, Siavash; Merkx, Matthias A W; Brennan, Peter A
2018-04-01
We have evaluated the rationale of existing grading and staging schemes of oral submucous fibrosis (OSMF) based on how they are categorized. A novel classification and staging scheme is proposed. A total of 300 OSMF patients were evaluated for agreement between functional, clinical, and histopathological staging. Bilateral biopsies were assessed in 25 patients to evaluate for any differences in histopathological staging of OSMF in the same mouth. Extent of clinician agreement for categorized staging data was evaluated using Cohen's weighted kappa analysis. Cross-tabulation was performed on categorical grading data to understand the intercorrelation, and the unweighted kappa analysis was used to assess the bilateral grade agreement. Probabilities of less than 0.05 were considered significant. Data were analyzed using SPSS Statistics (version 25.0, IBM, USA). A low agreement was found between all the stages depicting the independent nature of trismus, clinical features, and histopathological components (K = 0.312, 0.167, 0.152) in OSMF. Following analysis, a three-component classification scheme (TFM classification) was developed that describes the severity of each independently, grouping them using a novel three-tier staging scheme as a guide to the treatment plan. The proposed classification and staging could be useful for effective communication, categorization, and for recording data and prognosis, and for guiding treatment plans. Furthermore, the classification considers OSMF malignant transformation in detail. © 2018 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Hillmann, Elise de Castro; Dos Reis, Ricardo; Monego, Heleusa; Appel, Márcia; Hammes, Luciano Serpa; Rivoire, Waldemar Augusto; Capp, Edison
2013-07-01
This study aims to evaluate and to compare the performance of cervical digital photography (CDP) to the visual inspection with acetic acid (VIA) and visual inspection with Lugol's iodine (VILI) methods for screening the uterine cervix cancer and its precursor lesions in developing countries. A cross-sectional study was performed in Brazil. 176 women were evaluated by VIA, VILI, CDP with acetic acid and CDP with Lugol's iodine. Kappa statistic was used to estimate the interobserver and intermethod agreement. Sensitivity, specificity and diagnostic accuracy of the four methods (VIA, VILI, CDP with acetic acid, CDP with Lugol's iodine) was calculated. Interobserver agreement for CDP with acetic acid was K = 0.441 and for CDP with Lugol's iodine was K = 0.533; intermethod agreement of VIA and CDP with acetic acid, K = 0.559; and of VILI and CDP with Lugol's iodine, K = 0.507. Sensitivity and specificity of CDP with acetic acid were 84.00 and 95.83 %, and of CDP with Lugol's iodine were 88.00 and 97.26 %, respectively. The diagnostic accuracy of CDP with acetic acid and CDP with Lugol's iodine was 92.78 and 94.90 %, respectively. This was the first study to assess the CDP with Lugol's iodine performance, which had similar performance to the CDP with acetic acid. CDP is considered a promising method for screening the uterine cervix cancer and its precursor lesions in developing countries.
Validity of self-assessment in a quality improvement collaborative in Ecuador.
Hermida, Jorge; Broughton, Edward I; Miller Franco, Lynne
2011-12-01
Health care quality improvement (QI) efforts commonly use self-assessment to measure compliance with quality standards. This study investigates the validity of self-assessment of quality indicators. Cross sectional. A maternal and newborn care improvement collaborative intervention conducted in health facilities in Ecuador in 2005. Four external evaluators were trained in abstracting medical records to calculate six indicators reflecting compliance with treatment standards. About 30 medical records per month were examined at 12 participating health facilities for a total of 1875 records. The same records had already been reviewed by QI teams at these facilities (self-assessment). Overall compliance, agreement (using the Kappa statistic), sensitivity and specificity were analyzed. We also examined patterns of disagreement and the effect of facility characteristics on levels of agreement. External evaluators reported compliance of 69-90%, while self-assessors reported 71-92%, with raw agreement of 71-95% and Kappa statistics ranging from fair to almost perfect agreement. Considering external evaluators as the gold standard, sensitivity of self-assessment ranged from 90 to 99% and specificity from 48 to 86%. Simpler indicators had fewer disagreements. When disagreements occurred between self-assessment and external valuators, the former tended to report more positive findings in five of six indicators, but this tendency was not of a magnitude to change program actions. Team leadership, understanding of the tools and facility size had no overall impact on the level of agreement. When compared with external evaluation (gold standard), self-assessment was found to be sufficiently valid for tracking QI team performance. Sensitivity was generally higher than specificity. Simplifying indicators may improve validity.
Boeschoten, K H; Folmer, K B; van der Lee, J H; Nollet, F
2007-02-01
To develop an observational instrument that can be used to evaluate the quality of arm and hand skills in daily functional activities in children with obstetric brachial plexus lesion (OBPL). A set of functional activities was constructed and standardized, and the intra-observer reliability of the assessment of this set of activities was studied. Department of Occupational Therapy and Department of Rehabilitation Medicine, VU University Medical Centre. Twenty-six children with OBPL in the age range of 4 -6 years. The children were asked to perform 47 bimanual activities, which were recorded on videotape. The videotapes were scored twice by the same occupational therapist. The percentage of agreement in scoring 'hand-use', 'speed' and 'assistance' was over 80% for a substantial number of activities, indicating a strong agreement. However, in scoring 'deviations in movements and body posture' the percentage of agreement was insufficient in most activities. This set of activities has good potential for assessment of the performance of functional activities in children with OBPL. This study, however, showed a number of difficulties in observing and scoring the activities that have to be considered when developing a standardized video observation.
Relationships between Contextual and Task Performance and Interrater Agreement: Are There Any?
Díaz-Vilela, Luis F; Delgado Rodríguez, Naira; Isla-Díaz, Rosa; Díaz-Cabrera, Dolores; Hernández-Fernaud, Estefanía; Rosales-Sánchez, Christian
2015-01-01
Work performance is one of the most important dependent variables in Work and Organizational Psychology. The main objective of this paper was to explore the relationships between citizenship performance and task performance measures obtained from different appraisers and their consistency through a seldom-used methodology, intraclass correlation coefficients. Participants were 135 public employees, the total staff in a local government department. Jobs were clustered into job families through a work analysis based on standard questionnaires. A task description technique was used to develop a performance appraisal questionnaire for each job family, with three versions: self-, supervisor-, and peer-evaluation, in addition to a measure of citizenship performance. Only when the self-appraisal bias is controlled, significant correlations appeared between task performance rates. However, intraclass correlations analyses show that only self- (contextual and task) performance measures are consistent, while interrater agreement disappears. These results provide some interesting clues about the procedure of appraisal instrument development, the role of appraisers, and the importance of choosing adequate consistency analysis methods.
Relationships between Contextual and Task Performance and Interrater Agreement: Are There Any?
Díaz-Cabrera, Dolores; Hernández-Fernaud, Estefanía; Rosales-Sánchez, Christian
2015-01-01
Work performance is one of the most important dependent variables in Work and Organizational Psychology. The main objective of this paper was to explore the relationships between citizenship performance and task performance measures obtained from different appraisers and their consistency through a seldom-used methodology, intraclass correlation coefficients. Participants were 135 public employees, the total staff in a local government department. Jobs were clustered into job families through a work analysis based on standard questionnaires. A task description technique was used to develop a performance appraisal questionnaire for each job family, with three versions: self-, supervisor-, and peer-evaluation, in addition to a measure of citizenship performance. Only when the self-appraisal bias is controlled, significant correlations appeared between task performance rates. However, intraclass correlations analyses show that only self- (contextual and task) performance measures are consistent, while interrater agreement disappears. These results provide some interesting clues about the procedure of appraisal instrument development, the role of appraisers, and the importance of choosing adequate consistency analysis methods. PMID:26473956
Fledelius, Joan; Khalil, Azza; Hjorthaug, Karin; Frøkiær, Jørgen
2016-12-01
The purpose of this study is to determine whether a qualitative approach or a semi-quantitative approach provides the most robust method for early response evaluation with 2'-deoxy-2'-[(18)F]fluoro-D-glucose (F-18-FDG) positron emission tomography combined with whole body computed tomography (PET/CT) in non-small cell lung cancer (NSCLC). In this study eight Nuclear Medicine consultants analyzed F-18-FDG PET/CT scans from 35 patients with locally advanced NSCLC. Scans were performed at baseline and after 2 cycles of chemotherapy. Each observer used two different methods for evaluation: (1) PET response criteria in solid tumors (PERCIST) 1.0 and (2) a qualitative approach. Both methods allocate patients into one of four response categories (complete and partial metabolic response (CMR and PMR) and stable and progressive metabolic disease (SMD and PMD)). The inter-observer agreement was evaluated using Fleiss' kappa for multiple raters, Cohens kappa for comparison of the two methods, and intraclass correlation coefficients (ICC) for comparison of lean body mass corrected standardized uptake value (SUL) peak measurements. The agreement between observers when determining the percentage change in SULpeak was "almost perfect", with ICC = 0.959. There was a strong agreement among observers allocating patients to the different response categories with a Fleiss kappa of 0.76 (0.71-0.81). In 22 of the 35 patients, complete agreement was observed with PERCIST 1.0. The agreement was lower when using the qualitative method, moderate, having a Fleiss kappa of 0.60 (0.55-0.64). Complete agreement was achieved in only 10 of the 35 patients. The difference between the two methods was statistically significant (p < 0.005) (chi-squared). Comparing the two methods for each individual observer showed Cohen's kappa values ranging from 0.64 to 0.79, translating into a strong agreement between the two methods. PERCIST 1.0 provides a higher overall agreement between observers than the qualitative approach in categorizing early treatment response in NSCLC patients. The inter-observer agreement is in fact strong when using PERCIST 1.0 even when the level of instruction is purposely kept to a minimum in order to mimic the everyday situation. The variability is largely owing to the subjective elements of the method.
Barlow, Pepita; McKee, Martin; Basu, Sanjay; Stuckler, David
2017-03-08
Regional trade agreements are major international policy instruments that shape macro-economic and political systems. There is widespread debate as to whether and how these agreements pose risks to public health. Here we perform a comprehensive systematic review of quantitative studies of the health impact of trade and investment agreements. We identified studies from searches in PubMed, Web of Science, EMBASE, and Global Health Online. Research articles were eligible for inclusion if they were quantitative studies of the health impacts of trade and investment agreements or policy. We systematically reviewed study findings, evaluated quality using the Quality Assessment Tool from the Effective Public Health Practice Project, and performed network citation analysis to study disciplinary siloes. Seventeen quantitative studies met our inclusion criteria. There was consistent evidence that implementing trade agreements was associated with increased consumption of processed foods and sugar-sweetened beverages. Granting import licenses for patented drugs was associated with increased access to pharmaceuticals. Implementing trade agreements and associated policies was also correlated with higher cardiovascular disease incidence and higher Body Mass Index (BMI), whilst correlations with tobacco consumption, under-five mortality, maternal mortality, and life expectancy were inconclusive. Overall, the quality of studies is weak or moderately weak, and co-citation analysis revealed a relative isolation of public health from economics. We identified limitations in existing studies which preclude definitive conclusions of the health impacts of regional trade and investment agreements. Few address unobserved confounding, and many possible consequences and mechanisms linking trade and investment agreements to health remain poorly understood. Results from our co-citation analysis suggest scope for greater interdisciplinary collaboration. Notwithstanding these limitations, our results find evidence that trade agreements pose some significant health risks. Health protections in trade and investment treaties may mitigate these impacts.
Surriabre, Pedro; Allende, Gustavo; Prado, Marcela; Cáceres, Leyddy; Bellot, Diego; Torrico, Andrea; Ustariz, Karina; Rojas, Shirley; Barriga, Jaime; Calle, Pamela; Villarroel, Ligia; Yañez, Rosse Mary; Baay, Marc; Rodriguez, Patricia; Fontaine, Véronique
2017-12-22
Cervical cancer incidence and mortality rates in Bolivia are among the highest in Latin America. This investigation aims to evaluate the possibility of using simple devices, e.g. a cotton swab and a glass slide, for self-sampling in order to detect human papillomavirus (HPV) DNA by PCR in cervico-vaginal cells. In the first phase of our study we evaluated the use of a glass slide as a transport medium for cervical cells. A physician took paired-cervical samples from 235 women. One sample was transported in Easyfix® solution and the other sample was smeared over a glass slide. Both were further analyzed and compared for human DNA recovery and HPV detection. A kappa value was determined to evaluate the agreement between the HPV DNA detection rates. In the second phase of the study, 222 women from the urban, peri-urban and rural regions of Cochabamba were requested to perform self-sampling using the following devices: a cotton swab combined with a glass slide, and a vaginal tampon. Women gave their opinion about the self-sampling technique. Finally, the agreement for high risk-HPV detection between self- and physician-collected samples was performed in 201 samples in order to evaluate the self-sampling technique. Firstly, the comparison between Easyfix® solution and the glass slide to transport clinical samples gave a good agreement for HPV DNA detection (κ = 0.71, 95% CI 0.60-0.81). Secondly, self-sampling, especially with cotton swab combined with glass slide, would generally be preferred over clinician sampling for a screening program based on HPV detection. Finally, we showed a good agreement between self- and physician collected samples for high risk-HPV detection (κ = 0.71, 95% CI 0.55-0.88). Simple devices such as a cotton swab and a glass slide can be used to perform self-sampling and HPV DNA detection. Furthermore, most Bolivian women preferred self-sampling over clinician-sampling for cervical cancer screening.
7 CFR 2201.25 - Performance Agreement.
Code of Federal Regulations, 2014 CFR
2014-01-01
... 7 Agriculture 15 2014-01-01 2014-01-01 false Performance Agreement. 2201.25 Section 2201.25... Agreement. (a) The Borrower of a Loan guaranteed under the Program shall enter into a Performance Agreement... to meet its stipulated Performance Agreement entered into under paragraph (a) of this section. ...
7 CFR 2201.25 - Performance Agreement.
Code of Federal Regulations, 2011 CFR
2011-01-01
... 7 Agriculture 15 2011-01-01 2011-01-01 false Performance Agreement. 2201.25 Section 2201.25... Agreement. (a) The Borrower of a Loan guaranteed under the Program shall enter into a Performance Agreement... to meet its stipulated Performance Agreement entered into under paragraph (a) of this section. ...
van Zadelhoff, Claudia; Ehrle, Anna; Merle, Roswitha; Jahn, Werner; Lischer, Christoph
2018-05-09
Scintigraphy is a standard diagnostic method for evaluating horses with back pain due to suspected thoracic processus spinosus pathology. Lesion detection is based on subjective or semi-quantitative assessments of increased uptake. This retrospective, analytical study is aimed to compare semi-quantitative and subjective methods in the evaluation of scintigraphic images of the processi spinosi in the equine thoracic spine. Scintigraphic images of 20 Warmblood horses, presented for assessment of orthopedic conditions between 2014 and 2016, were included in the study. Randomized, blinded image evaluation was performed by 11 veterinarians using subjective and semi-quantitative methods. Subjective grading was performed for the analysis of red-green-blue and grayscale scintigraphic images, which were presented in full-size or as masked images. For the semi-quantitative assessment, observers placed regions of interest over each processus spinosus. The uptake ratio of each processus spinosus in comparison to a reference region of interest was determined. Subsequently, a modified semi-quantitative calculation was developed whereby only the highest counts-per-pixel for a specified number of pixels was processed. Inter- and intraobserver agreement was calculated using intraclass correlation coefficients. Inter- and intraobserver intraclass correlation coefficients were 41.65% and 71.39%, respectively, for the subjective image assessment. Additionally, a correlation between intraobserver agreement, experience, and grayscale images was identified. The inter- and intraobserver agreement was significantly increased when using semi-quantitative analysis (97.35% and 98.36%, respectively) or the modified semi-quantitative calculation (98.61% and 98.82%, respectively). The proposed modified semi-quantitative technique showed a higher inter- and intraobserver agreement when compared to other methods, which makes it a useful tool for the analysis of scintigraphic images. The association of the findings from this study with clinical and radiological examinations requires further investigation. © 2018 American College of Veterinary Radiology.
Performance evaluation of the Abbott CELL-DYN Ruby and the Sysmex XT-2000i haematology analysers.
Leers, M P G; Goertz, H; Feller, A; Hoffmann, J J M L
2011-02-01
Two mid-range haematology analysers (Abbott CELL-DYN Ruby and Sysmex XT-2000i) were evaluated to determine their analytical performance and workflow efficiency in the haematology laboratory. In total 418 samples were processed for determining equivalence of complete blood count (CBC) measurements, and 100 for reticulocyte comparison. Blood smears served for assessing the agreement of the differential counts. Inter-instrument agreement for most parameters was good although small numbers of discrepancies were observed. Systematic biases were found for mean cell volume, reticulocytes, platelets and mean platelet volume. CELL-DYN Ruby WBC differentials were obtained with all samples while the XT-2000i suppressed differentials partially or completely in 13 samples (3.1%). WBC subpopulation counts were otherwise in good agreement with no major outliers. Following first-pass CBC/differential analysis, 88 (21%) of XT-2000i samples required further analyser processing compared to 18 (4.3%) for the CELL-DYN Ruby. Smear referrals for suspected WBC/nucleated red blood cells and platelet abnormalities were indicated for 106 (25.4%) and 95 (22.7%) of the XT-2000i and CELL-DYN Ruby samples respectively. Flagging efficiencies for both analysers were found to be similar. The Sysmex XT-2000i and Abbott CELL-DYN Ruby analysers have broadly comparable analytical performance, but the CELL-DYN Ruby showed superior first-pass efficiency. © 2010 Blackwell Publishing Ltd.
Siafarikas, F; Staer-Jensen, J; Braekken, I H; Bø, K; Engh, M Ellström
2013-03-01
To evaluate the learning process for acquiring three- and four-dimensional (3D/4D) transperineal ultrasound volumes of the levator hiatus (LH) dimensions at rest, during pelvic floor muscle (PFM) contraction and on Valsalva maneuver, and for analyzing the ultrasound volumes, as well as to perform an interobserver reliability study between two independent ultrasound examiners. This was a prospective study including 22 women. We monitored the learning process of an inexperienced examiner (IE) performing 3D/4D transperineal ultrasonography and analyzing the volumes. The examination included acquiring volumes during three PFM contractions and three Valsalva maneuvers. LH dimensions were determined in the axial plane. The learning process was documented by estimating agreement between the IE and an experienced examiner (E) using the intraclass correlation coefficient. Agreement was calculated in blocks of 10 ultrasound examinations and analyzed volumes. After the learning process was complete the interobserver reliability for the technique was calculated between these two independent examiners. For offline analysis of the first 10 ultrasound volumes obtained by E, good to very good agreement between E and IE was achieved for all LH measurements except for the left and right levator-urethra gap and pubic arc. For the next 10 analyzed volumes, agreement improved for all LH measurements. Volumes that had been obtained by IE and E were then re-evaluated by IE, and good to very good agreement was found for all LH measurements indicating consistency in volume acquisition. The interobserver reliability study showed excellent ICC values (ICC, 0.81-0.97) for all LH measurements except the pubic arc (ICC = 0.67). 3D/4D transperineal ultrasound is a reliable technique that can be learned in a short period of time. Copyright © 2012 ISUOG. Published by John Wiley & Sons, Ltd.
Pashmakova, Medora B; Piccione, Julie; Bishop, Micah A; Nelson, Whitney R; Lawhon, Sara D
2017-05-01
OBJECTIVE To evaluate the agreement between results of microscopic examination and bacterial culture of bile samples from dogs and cats with hepatobiliary disease for detection of bactibilia. DESIGN Cross-sectional study. ANIMALS 31 dogs and 21 cats with hepatobiliary disease for which subsequent microscopic examination and bacterial culture of bile samples was performed from 2004 through 2014. PROCEDURES Electronic medical records of included dogs and cats were reviewed to extract data regarding diagnosis, antimicrobials administered, and results of microscopic examination and bacterial culture of bile samples. Agreement between these 2 diagnostic tests was assessed by calculation of the Cohen κ value. RESULTS 17 (33%) dogs and cats had bactibilia identified by microscopic examination of bile samples, and 11 (21%) had bactibilia identified via bacterial culture. Agreement between these 2 tests was substantial (percentage agreement [positive and negative results], 85%; κ = 0.62; 95% confidence interval, 0.38 to 0.89) and improved to almost perfect when calculated for only animals that received no antimicrobials within 24 hours prior to sample collection (percentage agreement, 94%; κ = 0.84; 95% confidence interval, 0.61 to 1.00). CONCLUSIONS AND CLINICAL RELEVANCE Results indicated that agreement between microscopic examination and bacterial culture of bile samples for detection of bactibilia is optimized when dogs and cats are not receiving antimicrobials at the time of sample collection. Concurrent bacterial culture and microscopic examination of bile samples are recommended for all cats and dogs evaluated for hepatobiliary disease.
Oliva, Juan; Brosa, Max; Espín, Jaime; Figueras, Montserrat; Trapero, Marta
2015-01-01
Economic evaluation of health care interventions has experienced a strong growth over the past decade and is increasingly present as a support tool in the decisions making process on public funding of health services and pricing in European countries. A necessary element using them is that agents that perform economic evaluations have minimum rules with agreement on methodological aspects. Although there are methodological issues in which there is a high degree of consensus, there are others in which there is no such degree of agreement being closest to the normative field or have experienced significant methodological advances in recent years. In this first article of a series of three, we will discuss on the perspective of analysis and assessment of costs in economic evaluation of health interventions using the technique Metaplan. Finally, research lines are proposed to overcome the identified discrepancies.
Surviving annual performance reviews.
Lazarus, Arthur
2008-01-01
Physicians who work in organizational settings can expect to be evaluated at least twice a year. Yet physicians are accustomed to functioning autonomously, and they may resist having their performance measured or become anxious at the thought of it. Several recommendations are made to help physicians survive the ordeal: (1) establish measurable goals and objectives for the year; (2) perform at your very best at all times; (3) obtain feedback about your performance from your colleagues; (4) ask for a mentor if you lack experience; (5) learn to manage upward; (6) let your boss know when other people have praised your work; (7) insist on face-to-face evaluations; and (8) sign your annual performance review and indicate agreement or disagreement.
Evaluation of Four Diagnostic Tests for Insulin Dysregulation in Adult Light-Breed Horses.
Dunbar, L K; Mielnicki, K A; Dembek, K A; Toribio, R E; Burns, T A
2016-05-01
Several tests have been evaluated in horses for quantifying insulin dysregulation to support a diagnosis of equine metabolic syndrome. Comparing the performance of these tests in the same horses will provide clarification of their accuracy in the diagnosis of equine insulin dysregulation. The aim of this study was to evaluate the agreement between basal serum insulin concentrations (BIC), the oral sugar test (OST), the combined glucose-insulin test (CGIT), and the frequently sampled insulin-modified intravenous glucose tolerance test (FSIGTT). Twelve healthy, light-breed horses. Randomized, prospective study. Each of the above tests was performed on 12 horses. Minimal model analysis of the FSIGTT was considered the reference standard and classified 7 horses as insulin resistant (IR) and 5 as insulin sensitive (IS). In contrast, BIC and OST assessment using conventional cut-off values classified all horses as IS. Kappa coefficients, measuring agreement among BIC, OST, CGIT, and FSIGTT were poor to fair. Sensitivity of the CGIT (positive phase duration of the glucose curve >45 minutes) was 85.7% and specificity was 40%, whereas CGIT ([insulin]45 >100 μIU/mL) sensitivity and specificity were 28.5% and 100%, respectively. Area under the glucose curve (AUCg0-120 ) was significantly correlated among the OST, CGIT, and FSIGTT, but Bland-Altman method and Lin's concordance coefficient showed a lack of agreement. Current criteria for diagnosis of insulin resistance using BIC and the OST are highly specific but lack sensitivity. The CGIT displayed better sensitivity and specificity, but modifications may be necessary to improve agreement with minimal model analysis. Copyright © 2016 The Authors. Journal of Veterinary Internal Medicine published by Wiley Periodicals, Inc. on behalf of the American College of Veterinary Internal Medicine.
Antón, Alfonso; Pazos, Marta; Martín, Belén; Navero, José Manuel; Ayala, Miriam Eleonora; Castany, Marta; Martínez, Patricia; Bardavío, Javier
2013-01-01
To assess sensitivity, specificity, and agreement among automated event analysis, automated trend analysis, and expert evaluation to detect glaucoma progression. This was a prospective study that included 37 eyes with a follow-up of 36 months. All had glaucomatous disks and fields and performed reliable visual fields every 6 months. Each series of fields was assessed with 3 different methods: subjective assessment by 2 independent teams of glaucoma experts, glaucoma/guided progression analysis (GPA) event analysis, and GPA (visual field index-based) trend analysis. Kappa agreement coefficient between methods and sensitivity and specificity for each method using expert opinion as gold standard were calculated. The incidence of glaucoma progression was 16% to 18% in 3 years but only 3 cases showed progression with all 3 methods. Kappa agreement coefficient was high (k=0.82) between subjective expert assessment and GPA event analysis, and only moderate between these two and GPA trend analysis (k=0.57). Sensitivity and specificity for GPA event and GPA trend analysis were 71% and 96%, and 57% and 93%, respectively. The 3 methods detected similar numbers of progressing cases. The GPA event analysis and expert subjective assessment showed high agreement between them and moderate agreement with GPA trend analysis. In a period of 3 years, both methods of GPA analysis offered high specificity, event analysis showed 83% sensitivity, and trend analysis had a 66% sensitivity.
Gritsiouk, Yaroslav; Hegsted, Damian; Gardiner, Stuart; Merriman, Lisa; Gubler, Kelly Dean
2013-05-01
Little is known about the reliability of data collected by abstractors without professional medical training. This investigation sought to determine the level of agreement among untrained volunteer abstractors as part of a study to evaluate the risk assessment of venous thromboembolism in patients who have undergone trauma. Forty-nine paper charts were chosen randomly from a volunteer-reviewed cohort of 2,339 and were compared with those of a single experienced abstractor. Inter-rater agreement was assessed using percent agreement, Cohen's kappa, and prevalence-adjusted bias-adjusted kappa (PABAK). Of the 71 data points, 28 had perfect agreement. The average agreement across all charts was 97%. Data with imperfect agreement had kappa values between .27 and .96 (mean, .75), with one additional value at zero even though it was associated with an agreement of 94%. PABAK values ranged from .67 to .98 (mean, .91), an average increase of .17 compared with kappa values. The performance of volunteers showed outstanding inter-rater reliability; however, limitations of interpretation can influence reliability. Copyright © 2013 Elsevier Inc. All rights reserved.
OrbView-3 Initial On-Orbit Characterization
NASA Technical Reports Server (NTRS)
Ross, Kent; Blonski, Slawomir; Holekamp, Kara; Pagnutti, Mary; Zanoni, Vicki; Carver, David; Fendley, Debbie; Smith, Charles
2004-01-01
NASA at Stennis Space Center (SSC) established a Space Act Agreement with Orbital Sciences Corporation (OSC) and ORBIMAGE Inc. to collaborate on the characterization of the OrbView-3 system and its imagery products and to develop characterization techniques further. In accordance with the agreement, NASA performed an independent radiometric, spatial, and geopositional accuracy assessment of OrbView-3 imagery acquired before completion of the system's initial on-orbit checkout. OSC acquired OrbView-3 imagery over SSC from July 2003 through January 2004, and NASA collected ground reference information coincident with many of these acquisitions. After evaluating all acquisitions, NASA deemed two multispectral images and five panchromatic images useful for characterization. NASA then performed radiometric, spatial, and geopositional characterizations.
Flight motor set 360L001 (STS-26R). Volume 1: System overview, revision A
NASA Technical Reports Server (NTRS)
Garecht, Diane M.
1990-01-01
The NASA space shuttle flight STS-26R, launched at 11:37.00.009 am, EDT on 29 Sep. 1988, used the redesigned solid rocket motors (RSRM) 360L001A and 360L001B. Evaluation of the ground environment instrumentation (GEI) data recorded prior to flight showed no launch commit criteria violations; that the field joint heater and aft skirt thermal conditioning systems performed adequately; and that the GEI data showed good agreement with thermal model predictions. Evaluation of the developmental flight instrumentation (DFI) revealed excellent agreement with both the predicted and required ballistic specifications. All parameters were well within the GEI specification requirements including propellant burn rates, specific impulse values, and thrust imbalance. Recorded strain values also indicated satisfactory radial growth and stress levels, as well as verification of adequate safety factors. Postflight inspection of the insulation, seals, case, and nozzles showed overall excellent performance. Some thermal DFI protective cork was missing, and inoperative field joint vent valves on the thermal protection cork allowed water entry into the field joints upon splashdown. Evaluation of these anomalies, as well as complete evaluation of all Redesigned Solid Rocket Motor components, is contained.
Flight motor set 360L001 (STS-26R), volume 1
NASA Technical Reports Server (NTRS)
Ricks, Glen A.
1988-01-01
The NASA space shuttle flight STS-26R, launched at 11:37.00.009 a.m. EDT on 29 Sep. 1988, used the redesigned solid rocket motors (RSRM) 360LOO1A and 360LOO1B. Evaluation of the ground environment instrumentation (GEI) data recorded prior to flight showed: (1) no launch commit criteria violations, (2) that the field joint heater and aft skirt thermal conditioning systems performed adequately, and (3) that the GEI data showed good agreement with thermal model predictions. Evaluation of the developmental flight instrumentation (DFI) revealed excellent agreement with both the predicted and required ballistic specifications. All parameters were well within the CEI specification requirements including propellant burn rates, specific impulse values, and thrust imbalance. Recorded strain values also indicated satisfactory radial growth and stress levels, as well as verification of adequate safety factors. Postflight inspection of the insulation, seals, case, and nozzles showed overall excellent performance. Some thermal DFI protective cork was missing, and inoperative field joint vent valves on the thermal protection cork allowed water entry into the field joints upon splashdown. Evaluation of these anomalies, as well as complete evaluation of all RSRM components, is presented.
Silva, Cláudia da Costa; Alves, Marta Maria Osório; El Halal, Michel Georges dos Santos; Pinheiro, Sabrina dos Santos; Carvalho, Paulo Roberto Antonacci
2013-01-01
Objective Compare the scores resulting from the Comfort-B scale with the bispectral index in children in an intensive care unit. Methods Eleven children between the ages of 1 month and 16 years requiring mechanical ventilation and sedation were simultaneously classified based on the bispectral index and the Comfort-B scale. Their behavior was recorded using digital photography, and the record was later evaluated by three independent evaluators. Agreement tests (Bland-Altman and Kappa) were then performed. The correlation between the two methods (Pearson correlation) was tested. Results In total, 35 observations were performed on 11 patients. Based on the Kappa coefficient, the agreement among evaluators ranged from 0.56 to 0.75 (p<0.001). There was a positive and consistent association between the bispectral index and the Comfort-B scale [r=0.424 (p=0.011) to r=0.498 (p=0.002)]. Conclusion Due to the strong correlation between the independent evaluators and the consistent correlation between the two methods, the results suggest that the Comfort-B scale is reproducible and useful in classifying the level of sedation in children requiring mechanical ventilation. PMID:24553512
7 CFR 205.505 - Statement of agreement.
Code of Federal Regulations, 2010 CFR
2010-01-01
... Agriculture Regulations of the Department of Agriculture (Continued) AGRICULTURAL MARKETING SERVICE (Standards... qualities of products labeled as organically produced; (3) Conduct an annual performance evaluation of all... certification decisions and implement measures to correct any deficiencies in certification services; (4) Have...
Johannsen, Finn; Jensen, Signe; Stallknecht, Sandra E; Olsen, Lars Otto; Magnusson, S Peter
2016-10-01
To determine intra- and interobserver reliability and precision of sonographic (US) scanning in measuring thickness of the Achilles tendon, plantar fascia, and heel fat pad in patients with heel pain. Seventeen consecutive patients referred with heel pain were included. Two evaluators blinded to the diagnosis performed independently US scanning of both feet without any dialogue with the patient. The examiner left the room, and the next examiner entered. All patients had two US scans performed by each examiner. Two months later, the US images were randomly presented to the evaluators for measurements. Reliability and agreement were assessed by calculation of intraclass correlation coefficient (ICC), 95% limits of agreement (LOA), and typical error (TE). LOA was calculated as a percentage of the mean thickness of each structure to obtain a unitless parameter. We found excellent intratester reliability (ICC 0.78-0.98) and good intertester reliability using one measurement (ICC 0.72-0.91) and excellent (ICC 0.85-0.95) when using average of two measurements. The intratester agreements were good with LOA: 9.5-23.4% and TE: 3.4-8.4%. The intertester agreements were acceptable using one measurement with LOA: 16.1-36.4%, and better using two measurements with LOA: 14.4-33.2%. US is a reliable technique of measurement in the daily clinic, and one single measurement is sufficient. In research, we recommend that the same observer performs the US measurements, if one single scanning is preferred; if more researchers are involved, the average measurement of two US scans is recommended. © 2016 Wiley Periodicals, Inc. J Clin Ultrasound 44:480-486, 2016. © 2016 Wiley Periodicals, Inc.
Zhang, Liping; Zhu, Shaohui
2015-05-01
To protect the transmission of the sensitive medical data, a secure and efficient authenticated key agreement scheme should be deployed when the healthcare delivery session is established via Telecare Medicine Information Systems (TMIS) over the unsecure public network. Recently, Islam and Khan proposed an authenticated key agreement scheme using elliptic curve cryptography for TMIS. They claimed that their proposed scheme is provably secure against various attacks in random oracle model and enjoys some good properties such as user anonymity. In this paper, however, we point out that any legal but malicious patient can reveal other user's identity. Consequently, their scheme suffers from server spoofing attack and off-line password guessing attack. Moreover, if the malicious patient performs the same time of the registration as other users, she can further launch the impersonation attack, man-in-the-middle attack, modification attack, replay attack, and strong replay attack successfully. To eliminate these weaknesses, we propose an improved ECC-based authenticated key agreement scheme. Security analysis demonstrates that the proposed scheme can resist various attacks and enables the patient to enjoy the remote healthcare services with privacy protection. Through the performance evaluation, we show that the proposed scheme achieves a desired balance between security and performance in comparisons with other related schemes.
Akhtar, Moneeb; Van Heukelom, Paul G; Ahmed, Azeemuddin; Tranter, Rachel D; White, Erinn; Shekem, Nathaniel; Walz, David; Fairfield, Catherine; Vakkalanka, J Priyanka; Mohr, Nicholas M
2018-02-22
Telemedicine allows patients to connect with healthcare providers remotely. It has recently expanded to evaluate low-acuity illnesses such as pharyngitis by using patients' personal communication devices. The purpose of our study was to compare the telemedicine-facilitated physical examination with an in-person examination in emergency department (ED) patients with sore throat. This was a prospective, observational, blinded diagnostic concordance study of patients being seen for sore throat in a 60,000-visit Midwestern academic ED. A telemedicine and a face-to-face examination were performed independently by two advanced practice providers (APP), blinded to the results of the other evaluator. The primary outcome was agreement on pharyngeal redness between the evaluators, with secondary outcomes of agreement and inter-rater reliability on 14 other aspects of the pharyngeal physical examination. We also conducted a survey of patients and providers to evaluate perceptions and preferences for sore throat evaluation using telemedicine. Sixty-two patients were enrolled, with a median tonsil size of 1.0. Inter-rater agreement (kappa) for tonsil size was 0.394, which was worse than our predetermined concordance threshold. Other kappa values ranged from 0 to 0.434, and telemedicine was best for detecting abnormal coloration of the palate and tender superficial cervical lymph nodes (anterior structures), but poor for detecting abnormal submandibular lymph nodes or asymmetry of the posterior pharynx (posterior structures). In survey responses, telemedicine was judged easier to use and more comfortable for providers than patients; however, neither patients nor providers preferred in-person to telemedicine evaluation. Telemedicine exhibited poor agreement with the in-person physical examination on the primary outcome of tonsil size, but exhibited moderate agreement on coloration of the palate and cervical lymphadenopathy. Future work should better characterize the importance of the physical examination in treatment decisions for patients with sore throat and the use of telemedicine in avoiding in-person healthcare visits.
Lou, Junyang; Obuchowski, Nancy A; Krishnaswamy, Amar; Popovic, Zoran; Flamm, Scott D; Kapadia, Samir R; Svensson, Lars G; Bolen, Michael A; Desai, Milind Y; Halliburton, Sandra S; Tuzcu, E Murat; Schoenhagen, Paul
2015-01-01
Preprocedural 3-dimensional CT imaging of the aortic annular plane plays a critical role for transcatheter aortic valve replacement (TAVR) planning; however, manual reconstructions are complex. Automated analysis software may improve reproducibility and agreement between readers but is incompletely validated. In 110 TAVR patients (mean age, 81 years; 37% female) undergoing preprocedural multidetector CT, automated reconstruction of the aortic annular plane and planimetry of the annulus was performed with a prototype of now commercially available software (syngo.CT Cardiac Function-Valve Pilot; Siemens Healthcare, Erlangen, Germany). Fully automated, semiautomated, and manual annulus measurements were compared. Intrareader and inter-reader agreement, intermodality agreement, and interchangeability were analyzed. Finally, the impact of these measurements on recommended valve size was evaluated. Semiautomated analysis required major correction in 5 patients (4.5%). In the remaining 95.5%, only minor correction was performed. Mean manual annulus area was significantly smaller than fully automated results (P < .001 for both readers) but similar to semiautomated measurements (5.0 vs 5.4 vs 4.9 cm(2), respectively). The frequency of concordant recommendations for valve size increased if manual analysis was replaced with the semiautomated method (60% agreement was improved to 82.4%; 95% confidence interval for the difference [69.1%-83.4%]). Semiautomated aortic annulus analysis, with minor correction by the user, provides reliable results in the context of TAVR annulus evaluation. Copyright © 2015 Society of Cardiovascular Computed Tomography. Published by Elsevier Inc. All rights reserved.
GEDOS-SECOT consensus on the care process of patients with knee osteoarthritis and arthoplasty.
Ruiz Iban, M A; Tejedor, A; Gil Garay, E; Revenga, C; Hermosa, J C; Montfort, J; Peña, M J; López Millán, J M; Montero Matamala, A; Capa Grasa, A; Navarro, M J; Gobbo, M; Loza, E
To develop recommendations on the evaluation and management procedure in patients undergoing total knee replacement based on best evidence and the experience of a panel of experts. A multidisciplinary group of 12 experts was selected that defined the scope, users and the document parts. Three systematic reviews were performed in patients undergoing knee replacement: (i)efficacy and safety of fast-tracks; (ii)efficacy and safety of cognitive interventions in patients with catastrophic pain, and (iii) efficacy and safety of acute post-surgical pain management on post-surgical outcomes. A narrative review was conducted on the evaluation and management of pain sensitization, and about the efficacy and safety of pre-surgical physiotherapy. The experts generated the recommendations and explicative text. The level of agreement was evaluated in a multidisciplinary group of 85 experts with the Delphi technique. The level of evidence was established as well for each recommendation. A total of 20 recommendations were produced. An agreement higher than 80% was reached in all of them. We found the highest agreement on the need for a full discharge report, on providing proper information about the process and on following available guidelines. There is consensus among professionals involved in the management of patients undergoing total knee replacement, in that it is important to protocolize the replacement process, performing a proper, integrated and coordinated patient evaluation and follow-up, paying special attention to the surgical procedure and postoperative period. Copyright © 2017 SECOT. Publicado por Elsevier España, S.L.U. All rights reserved.
Evaluating the Performance of the ff99SB Force Field Based on NMR Scalar Coupling Data
Wickstrom, Lauren; Okur, Asim; Simmerling, Carlos
2009-01-01
Abstract Force-field validation is essential for the identification of weaknesses in current models and the development of more accurate models of biomolecules. NMR coupling and relaxation methods have been used to effectively diagnose the strengths and weaknesses of many existing force fields. Studies using the ff99SB force field have shown excellent agreement between experimental and calculated order parameters and residual dipolar calculations. However, recent studies have suggested that ff99SB demonstrates poor agreement with J-coupling constants for short polyalanines. We performed extensive replica-exchange molecular-dynamics simulations on Ala3 and Ala5 in TIP3P and TIP4P-Ew solvent models. Our results suggest that the performance of ff99SB is among the best of currently available models. In addition, scalar coupling constants derived from simulations in the TIP4P-Ew model show a slight improvement over those obtained using the TIP3P model. Despite the overall excellent agreement, the data suggest areas for possible improvement. PMID:19651043
Muramatsu, Yukio; Yamamichi, Junta; Gomi, Shiho; Oubel, Estanislao; Moriyama, Noriyuki
2018-01-01
Background This study sought to evaluate the 95% limits of agreement of the volumes of 5-year clinically stable solid nodules for the development of a follow-up system for indeterminate solid nodules. Methods The volumes of 226 solid nodules that had been clinically stable for 5 years were measured in 186 patients (53 female never-smokers, 36 male never-smokers, 51 males with <30 pack-years, and 46 males with ≥30 pack-years) using a three-dimensional semiautomated method. Volume changes were evaluated using three methods: percent change, proportional change and growth rate. The 95% limits of agreement were evaluated using the Bland-Altman method. Results The 95% limits of agreement were as follows: range of percent change, from ±34.5% to ±37.8%; range of proportional change, from ±34.1% to ±36.8%; and range of growth rate, from ±39.2% to ±47.4%. Percent change-based, proportional change-based, and growth rate-based diagnoses of an increase or decrease in ten solid nodules were made at a mean of 302±402, 367±455, and 329±496 days, respectively, compared with a clinical diagnosis made at 809±616 days (P<0.05). Conclusions The 95% limits of agreement for volume change in 5-year stable solid nodules may enable the detection of an increase or decrease in the solid nodule at an earlier stage than that enabled by a clinical diagnosis, possibly contributing to the development of a follow-up system for reducing the number of additional Computed tomography (CT) scans performed during the follow-up period. PMID:29600047
Kakinuma, Ryutaro; Muramatsu, Yukio; Yamamichi, Junta; Gomi, Shiho; Oubel, Estanislao; Moriyama, Noriyuki
2018-01-01
This study sought to evaluate the 95% limits of agreement of the volumes of 5-year clinically stable solid nodules for the development of a follow-up system for indeterminate solid nodules. The volumes of 226 solid nodules that had been clinically stable for 5 years were measured in 186 patients (53 female never-smokers, 36 male never-smokers, 51 males with <30 pack-years, and 46 males with ≥30 pack-years) using a three-dimensional semiautomated method. Volume changes were evaluated using three methods: percent change, proportional change and growth rate. The 95% limits of agreement were evaluated using the Bland-Altman method. The 95% limits of agreement were as follows: range of percent change, from ±34.5% to ±37.8%; range of proportional change, from ±34.1% to ±36.8%; and range of growth rate, from ±39.2% to ±47.4%. Percent change-based, proportional change-based, and growth rate-based diagnoses of an increase or decrease in ten solid nodules were made at a mean of 302±402, 367±455, and 329±496 days, respectively, compared with a clinical diagnosis made at 809±616 days (P<0.05). The 95% limits of agreement for volume change in 5-year stable solid nodules may enable the detection of an increase or decrease in the solid nodule at an earlier stage than that enabled by a clinical diagnosis, possibly contributing to the development of a follow-up system for reducing the number of additional Computed tomography (CT) scans performed during the follow-up period.
Dobbs, Jessica; Krishnamurthy, Savitri; Kyrish, Matthew; Benveniste, Ana Paula; Yang, Wei; Richards-Kortum, Rebecca
2015-01-01
Tissue sampling is a problematic issue for inflammatory breast carcinoma, and immediate evaluation following core needle biopsy is needed to evaluate specimen adequacy. We sought to determine if confocal fluorescence microscopy provides sufficient resolution to evaluate specimen adequacy by comparing invasive tumor cellularity estimated from standard histologic images to invasive tumor cellularity estimated from confocal images of breast core needle biopsy specimens. Grayscale confocal fluorescence images of breast core needle biopsy specimens were acquired following proflavine application. A breast-dedicated pathologist evaluated invasive tumor cellularity in histologic images with hematoxylin and eosin staining and in grayscale and false-colored confocal images of cores. Agreement between cellularity estimates was quantified using a kappa coefficient. 23 cores from 23 patients with suspected inflammatory breast carcinoma were imaged. Confocal images were acquired in an average of less than 2 min per core. Invasive tumor cellularity estimated from histologic and grayscale confocal images showed moderate agreement by kappa coefficient: κ = 0.48 ± 0.09 (p < 0.001). Grayscale confocal images require less than 2 min for acquisition and allow for evaluation of invasive tumor cellularity in breast core needle biopsy specimens with moderate agreement to histologic images. We show that confocal fluorescence microscopy can be performed immediately following specimen acquisition and could indicate the need for additional biopsies at the initial visit.
Lee, Seung Hyun; Lee, Young Han; Song, Ho-Taek; Suh, Jin-Suck
2017-10-01
To evaluate the feasibility of 3D fast spin-echo (FSE) imaging with compressed sensing (CS) for the assessment of shoulder. Twenty-nine patients who underwent shoulder MRI including image sets of axial 3D-FSE sequence without CS and with CS, using an acceleration factor of 1.5, were included. Quantitative assessment was performed by calculating the root mean square error (RMSE) and structural similarity index (SSIM). Two musculoskeletal radiologists compared image quality of 3D-FSE sequences without CS and with CS, and scored the qualitative agreement between sequences, using a five-point scale. Diagnostic agreement for pathologic shoulder lesions between the two sequences was evaluated. The acquisition time of 3D-FSE MRI was reduced using CS (3min 23s vs. 2min 22s). Quantitative evaluations showed a significant correlation between the two sequences (r=0.872-0.993, p<0.05) and SSIM was in an acceptable range (0.940-0.993; mean±standard deviation, 0.968±0.018). Qualitative image quality showed good to excellent agreement between 3D-FSE images without CS and with CS. Diagnostic agreement for pathologic shoulder lesions between the two sequences was very good (κ=0.915-1). The 3D-FSE sequence with CS is feasible in evaluating the shoulder joint with reduced scan time compared to 3D-FSE without CS. Copyright © 2017 Elsevier Inc. All rights reserved.
Evaluation of a new automated microscopy urine sediment analyser - sediMAX conTRUST®.
Bogaert, Laura; Peeters, Bart; Billen, Jaak
2017-04-01
This study evaluated the performance of the stand-alone sediMAX conTRUST (77Elektronika, Budapest, Hungary) analyser as an alternative to microscopic analysis of urine. The validation included a precision, carry-over, categorical correlation and diagnostic performance study with manual phase-contrast microscopy as reference method. A total of 260 routine urine samples were assessed. The within-run precision was much better at higher concentrations than at very low concentrations. The precision met our predefined limits for all the elements at the different concentrations, with the exception of the lowest RBC, the WBC, pathological casts and crystals count. There was no sample carry-over. The analyser showed good categorical agreement with manual microscopy for RBC and WBC counts, moderate agreement for yeast cells, crystals and squamous epithelial cells and bad agreement for non-squamous epithelial cells, bacteria and casts. Diagnostic performance was satisfying only for RBC, WBC and yeast cells. The number of false negative results was acceptable (≤4%) for all elements after connecting the sediMAX conTRUST with an automatic strip reader (AutionMAX) and after implementation of review rules. We conclude that the sediMAX conTRUST should be used as a screening tool in combination with an automatic strip reader, for the identification of normal samples. Therefore, adequate review rules should be defined. Manual microscopy is still required in 'flagged' pathological samples. Despite the poor analytical performance on pathological samples, the images on the screen can be used for interpretation without the microscope and can be stored as PDF-documents for archiving the results.
Tun, Tin A; Baskaran, Mani; Tan, Shayne S; Perera, Shamira A; Aung, Tin; Husain, Rahat
2017-01-01
To evaluate the diagnostic performance of the anterior segment angle-to-angle scan of the Cirrus high-definition optical coherence tomography (HD-OCT) in detecting eyes with closed angles. All subjects underwent dark-room gonioscopy by an ophthalmologist. A technician performed anterior segment imaging with Cirrus (n = 202) and Visante OCT (n = 85) under dark-room conditions. All eyes were categorized by two masked graders as per number of closed quadrants. Each quadrant of anterior chamber angle was categorized as a closed angle if posterior trabecular meshwork could not be seen on gonioscopy or if there was any irido-corneal contact anterior to scleral spur in Cirrus and Visante images. An eye was graded as having a closed angle if two or more quadrants were closed. Agreement and area under the curve (AUC) were performed. There were 50 (24.8%) eyes with closed angles. The agreements of closed-angle diagnosis (by eye) between Cirrus HD-OCT and gonioscopy (k = 0.59; 95% confidence interval (CI) 0.45-0.72; AC1 = 0.76) and between Cirrus and Visante OCT (k = 0.65; 95% CI 0.48-0.82, AC1 = 0.77) were moderate. The AUC for diagnosing the eye with gonioscopic closed angle by Cirrus HD-OCT was good (AUC = 0.86; sensitivity = 83.33; specificity = 77.78). The diagnostic performance of Cirrus HD-OCT in detecting the eyes with closed angles was similar to that of Visante (AUC 0.87 vs. 0.9, respectively; P = 0.51). The anterior segment angle-to-angle scans of Cirrus HD-OCT demonstrated similar diagnostic performance as Visante in detecting gonioscopic closed angles. The agreement between Cirrus and gonioscopy for detecting eyes with closed angles was moderate.
Vaguine, A A; Richelle, J; Wodak, S J
1999-01-01
In this paper we present SFCHECK, a stand-alone software package that features a unified set of procedures for evaluating the structure-factor data obtained from X-ray diffraction experiments and for assessing the agreement of the atomic coordinates with these data. The evaluation is performed completely automatically, and produces a concise PostScript pictorial output similar to that of PROCHECK [Laskowski, MacArthur, Moss & Thornton (1993). J. Appl. Cryst. 26, 283-291], greatly facilitating visual inspection of the results. The required inputs are the structure-factor amplitudes and the atomic coordinates. Having those, the program summarizes relevant information on the deposited structure factors and evaluates their quality using criteria such as data completeness, structure-factor uncertainty and the optical resolution computed from the Patterson origin peak. The dependence of various parameters on the nominal resolution (d spacing) is also given. To evaluate the global agreement of the atomic model with the experimental data, the program recomputes the R factor, the correlation coefficient between observed and calculated structure-factor amplitudes and Rfree (when appropriate). In addition, it gives several estimates of the average error in the atomic coordinates. The local agreement between the model and the electron-density map is evaluated on a per-residue basis, considering separately the macromolecule backbone and side-chain atoms, as well as solvent atoms and heterogroups. Among the criteria are the normalized average atomic displacement, the local density correlation coefficient and the polymer chain connectivity. The possibility of computing these criteria using the omit-map procedure is also provided. The described software should be a valuable tool in monitoring the refinement procedure and in assessing structures deposited in databases.
Mills, Kathryn; Idris, Aula; Pham, Thu-An; Porte, John; Wiggins, Mark; Kavakli, Manolya
2017-12-18
To determine the validity and reliability of the peak frontal plane knee angle evaluated by a virtual reality (VR) netball game when landing from a drop vertical jump (DVJ). Laboratory Methods: Forty participants performed 3 DVJs evaluated by 3-dimensional (3D) motion analysis and 3 DVJs evaluated by the VR game. Limits of agreement for the peak projected frontal plane knee angle and peak knee abduction were determined. Participants were given a consensus category of "Above threshold" or "Below threshold" based on a pre-specified threshold angle of 9˚ during landing. Classification agreement was determined using kappa coefficient and accuracy was determined using specificity and sensitivity. Ten participants returned 1-week later to determine intra-rater reliability, standard error of the measure and typical error. The mean difference in detected frontal plane knee angle was 3.39˚ (1.03˚, 5.74˚). Limits of agreement were -10.27˚ (-14.36˚, -6.19˚) to 17.05˚ (12.97˚, 21.14˚). Substantial agreement, specificity and sensitivity were observed for the threshold classification (ĸ = 0.66, [0.42, 0.88] specificity= 0.96 [0.78, 1.0], sensitivity= 0.75 [0.43, 0.95]). The game exhibited acceptable reliability over time (ICC (3,1) = 0.844) and error was approximately 2˚. The VR game reliably evaluated a projected frontal plane knee angle. While the knee angle detected by the VR game is strongly related peak knee abduction, the accuracy of detecting the exact angle was limited. A threshold approach may be a more accurate approach for gaming technology to evaluate frontal plane knee angles when landing from a jump.
Robust Adaptive Data Encoding and Restoration
NASA Technical Reports Server (NTRS)
Park, Stephen K.; Rahman, Zia-ur; Halyo, Nesim
2000-01-01
This is the final report for NASA cooperative agreement and covers the period from 01 October, 1997 to 11 April, 2000. The research during this period was performed in three primary, but related, areas. 1. Evaluation of integrated information adaptive imaging. 2. Improvements in memory utilization and performance of the multiscale retinex with color restoration (MSRCR). 3. Commencement of a theoretical study to evaluate the non-linear retinex image enhancement technique. The research resulted in several publications, and an intellectual property disclosure to the NASA patent council in May, 1999.
7 CFR 2201.25 - Performance Agreement.
Code of Federal Regulations, 2010 CFR
2010-01-01
... 7 Agriculture 15 2010-01-01 2010-01-01 false Performance Agreement. 2201.25 Section 2201.25... LOCAL TELEVISION LOAN GUARANTEE PROGRAM-PROGRAM REGULATIONS Loan Guarantees § 2201.25 Performance Agreement. (a) The Borrower of a Loan guaranteed under the Program shall enter into a Performance Agreement...
7 CFR 2201.25 - Performance Agreement.
Code of Federal Regulations, 2012 CFR
2012-01-01
... 7 Agriculture 15 2012-01-01 2012-01-01 false Performance Agreement. 2201.25 Section 2201.25... LOCAL TELEVISION LOAN GUARANTEE PROGRAM-PROGRAM REGULATIONS Loan Guarantees § 2201.25 Performance Agreement. (a) The Borrower of a Loan guaranteed under the Program shall enter into a Performance Agreement...
7 CFR 2201.25 - Performance Agreement.
Code of Federal Regulations, 2013 CFR
2013-01-01
... 7 Agriculture 15 2013-01-01 2013-01-01 false Performance Agreement. 2201.25 Section 2201.25... LOCAL TELEVISION LOAN GUARANTEE PROGRAM-PROGRAM REGULATIONS Loan Guarantees § 2201.25 Performance Agreement. (a) The Borrower of a Loan guaranteed under the Program shall enter into a Performance Agreement...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Becker, G.K.
1997-01-01
Bio-Imaging Research Inc. (BIR) and Lockheed Martin Speciality Components (LMSC) are engaged in a Program Research and Development Agreement and a Rapid Commercialization Initiative with the Department of Energy, EM-50. The agreement required BIR and LMSC to develop a data interpretation method that merges nondestructive assay and nondestructive examination (NDA/NDE) data and information sufficient to establish compliance with applicable National TRU Program (Program) waste characterization requirements and associated quality assurance performance criteria. This effort required an objective demonstration of the BIR and LMSC waste characterization systems in their standalone and integrated configurations. The goal of the test plan is tomore » provide a mechanism from which evidence can be derived to substantiate nondestructive assay capability and utility statement for the BIT and LMSC systems. The plan must provide for the acquisition, compilation, and reporting of performance data thereby allowing external independent agencies a basis for an objective evaluation of the standalone BIR and LMSC measurement systems, WIT and APNEA respectively, as well as an expected performance resulting from appropriate integration of the two systems. The evaluation is to be structured such that a statement regarding select INEL RWMC waste forms can be made in terms of compliance with applicable Program requirements and criteria.« less
Dosimetry and image quality assessment in a direct radiography system
Oliveira, Bruno Beraldo; de Oliveira, Marcio Alves; Paixão, Lucas; Teixeira, Maria Helena Araújo; Nogueira, Maria do Socorro
2014-01-01
Objective To evaluate the mean glandular dose with a solid state detector and the image quality in a direct radiography system, utilizing phantoms. Materials and Methods Irradiations were performed with automatic exposure control and polymethyl methacrylate slabs with different thicknesses to calculate glandular dose values. The image quality was evaluated by means of the structures visualized on the images of the phantoms. Results Considering the uncertainty of the measurements, the mean glandular dose results are in agreement with the values provided by the equipment and with internationally adopted reference levels. Results obtained from images of the phantoms were in agreement with the reference values. Conclusion The present study contributes to verify the equipment conformity as regards dose values and image quality. PMID:25741119
2013-01-01
Background This study investigates the reliability of muscle performance tests using cost- and time-effective methods similar to those used in clinical practice. When conducting reliability studies, great effort goes into standardising test procedures to facilitate a stable outcome. Therefore, several test trials are often performed. However, when muscle performance tests are applied in the clinical setting, clinicians often only conduct a muscle performance test once as repeated testing may produce fatigue and pain, thus variation in test results. We aimed to investigate whether cervical muscle performance tests, which have shown promising psychometric properties, would remain reliable when examined under conditions similar to those of daily clinical practice. Methods The intra-rater (between-day) and inter-rater (within-day) reliability was assessed for five cervical muscle performance tests in patients with (n = 33) and without neck pain (n = 30). The five tests were joint position error, the cranio-cervical flexion test, the neck flexor muscle endurance test performed in supine and in a 45°-upright position and a new neck extensor test. Results Intra-rater reliability ranged from moderate to almost perfect agreement for joint position error (ICC ≥ 0.48-0.82), the cranio-cervical flexion test (ICC ≥ 0.69), the neck flexor muscle endurance test performed in supine (ICC ≥ 0.68) and in a 45°-upright position (ICC ≥ 0.41) with the exception of a new test (neck extensor test), which ranged from slight to moderate agreement (ICC = 0.14-0.41). Likewise, inter-rater reliability ranged from moderate to almost perfect agreement for joint position error (ICC ≥ 0.51-0.75), the cranio-cervical flexion test (ICC ≥ 0.85), the neck flexor muscle endurance test performed in supine (ICC ≥ 0.70) and in a 45°-upright position (ICC ≥ 0.56). However, only slight to fair agreement was found for the neck extensor test (ICC = 0.19-0.25). Conclusions Intra- and inter-rater reliability ranged from moderate to almost perfect agreement with the exception of a new test (neck extensor test), which ranged from slight to moderate agreement. The significant variability observed suggests that tests like the neck extensor test and the neck flexor muscle endurance test performed in a 45°-upright position are too unstable to be used when evaluating neck muscle performance. PMID:24299621
Cross-cultural adaptation of the Nordic musculoskeletal questionnaire.
de Barros, E N C; Alexandre, N M C
2003-06-01
Reports in the literature have identified a need for internationally standardized and reliable measurements to analyse musculoskeletal symptoms. Screening of musculoskeletal disorders may serve as a diagnostic tool to evaluate the work environment. The Nordic general questionnaire is a standardized instrument used to analyse musculoskeletal symptoms in an ergonomic or occupational health context. To translate and adapt a version of the Nordic general questionnaire into Brazilian Portuguese and evaluate its reliability. The cross-cultural adaptation was performed according to internationally recommended methodology, using the following guidelines: translation; back-translation; committee review; and pretesting. First, the questionnaire was independently translated into Portuguese by two teachers and one doctor, and a consensus version was generated. Second, two other translators performed a back-translation independently from one another. This version was then submitted to a committee, consisting of six specialists in the area of knowledge of the instrument, to evaluate its equivalence to the original instrument. The final version was pretested on 20 subjects randomly selected in an outpatient clinic. Reliability was assessed by a test-retest procedure at 1-day intervals using the Kappa coefficient in a group of 40 subjects. The Kappa agreement values were calculated for each one of the four questions of the questionnaire. The agreement among the same observers was substantial, varying from 0.88 to 1, according to the Kappa values. these demonstrated strong agreement of the instrument, suggesting that the Brazilian version of the "Standardized Nordic Questionnaire" offers substantial reliability.
Prospective assessment of interobserver agreement for defecography in fecal incontinence.
Dobben, Annette C; Wiersma, Tjeerd G; Janssen, Lucas W M; de Vos, Rien; Terra, Maaike P; Baeten, Cor G; Stoker, Jaap
2005-11-01
The primary aim of our study was to determine the interobserver agreement of defecography in diagnosing enterocele, anterior rectocele, intussusception, and anismus in fecal-incontinent patients. The subsidiary aim was to evaluate the influence of level of experience on interpreting defecography. Defecography was performed in 105 consecutive fecal-incontinent patients. Observers were classified by level of experience and their findings were compared with the findings of an expert radiologist. The quality of the expert radiologist's findings was evaluated by an intraobserver agreement procedure. Intraobserver agreement was good to very good except for anismus: incomplete evacuation after 30 sec (kappa, 0.55) and puborectalis impression (kappa, 0.54). Interobserver agreement for enterocele and rectocele was good (kappa, 0.66 for both) and for intussusception, fair (kappa, 0.29). Interobserver agreement for anismus: incomplete evacuation after 30 sec was moderate (kappa, 0.47), and for anismus: puborectalis impression was fair (kappa, 0.24). Agreement in grading of enterocele and rectocele was good (kappa, 0.64 and 0.72, respectively) and for intussusception, fair (kappa, 0.39). Agreement separated by experience level was very good for rectocele (kappa, 0.83) and grading of rectoceles (kappa, 0.83) and moderate for intussusception (kappa, 0.44) at the most experienced level. For enterocele and grading, experience level did not influence the reproducibility. Reproducibility for enterocele, anterior rectocele, and severity grading is good, but for intussusception is fair to moderate. For anismus, the diagnosis of incomplete evacuation after 30 sec is more reproducible than puborectalis impression. The level of experience seems to play a role in diagnosing anterior rectocele and its grading and in diagnosing intussusception.
Interrater reliability among epilepsy centers: multicenter study of epilepsy surgery.
Haut, Sheryl R; Berg, Anne T; Shinnar, Shlomo; Cohen, Hillel W; Bazil, Carl W; Sperling, Michael R; Langfitt, John T; Pacia, Steven V; Walczak, Thaddeus S; Spencer, Susan S
2002-11-01
To measure the interrater reliability of presurgical testing and surgical decisions among epilepsy centers. Seven centers participating in an ongoing, prospective multicenter study of resective epilepsy surgery agreed to conform to a detailed protocol regarding presurgical evaluation and surgery. To assess quality assurance, each center independently reviewed 21 randomly selected surgical cases for preoperative study lateralization and localization, and surgical decisions. Interrater reliability was assessed by using intraclass correlation coefficients (ICCs), validated for use with multiple raters, and calculated in a two-way random model based on absolute agreement. Agreement for ICC values: > or = 0.75, excellent; 0.60-0.74, good; 0.40-0.59, fair; < or = 0.39, poor. One center was excluded for missing data. Agreement was excellent for extracranial EEG lateralization (0.8039), magnetic resonance imaging (MRI) lateralization (0.9521) and localization (0.9130), Wada lateralization (0.9453), and intracranial EEG localization (0.7905). Agreement was good for extracranial EEG localization (0.7384) and neuropsychological testing lateralization (0.7178) and localization (0.6891). Consensus about the decision to perform intracranial monitoring was fair (0.5397), in part reflecting one center's tendency toward intracranial monitoring. Overall agreements on whether to perform surgery (0.8311) and specific surgery recommended (0.8164) were excellent. High interrater reliability among six epilepsy centers was present for interpretation of most components of presurgical testing. Although consensus for the decision to perform intracranial monitoring was only fair, agreements for the ultimate decision about resective surgery and specific choice of resection were excellent. We believe that this study demonstrates the feasibility of implementing multicenter protocols for neurologic management, especially those involving localization, as well as protocols combining study results with clinical decision making.
A portable Fourier transform infrared gas analyzer with a photoacoustic detector performed reliably during pollution prevention research at two industrial facilities. It exhibited good agreement (within approximately 6%) with other analytical instruments (dispersive infrared and ...
42 CFR 493.959 - Immunohematology.
Code of Federal Regulations, 2014 CFR
2014-10-01
... challenges per testing event a program must provide for each analyte or test procedure is five. Analyte or... Compatibility testing Antibody identification (d) Evaluation of a laboratory's analyte or test performance. HHS... program must compare the laboratory's response for each analyte with the response that reflects agreement...
42 CFR 493.959 - Immunohematology.
Code of Federal Regulations, 2013 CFR
2013-10-01
... challenges per testing event a program must provide for each analyte or test procedure is five. Analyte or... Compatibility testing Antibody identification (d) Evaluation of a laboratory's analyte or test performance. HHS... program must compare the laboratory's response for each analyte with the response that reflects agreement...
42 CFR 493.959 - Immunohematology.
Code of Federal Regulations, 2012 CFR
2012-10-01
... challenges per testing event a program must provide for each analyte or test procedure is five. Analyte or... Compatibility testing Antibody identification (d) Evaluation of a laboratory's analyte or test performance. HHS... program must compare the laboratory's response for each analyte with the response that reflects agreement...
42 CFR 493.959 - Immunohematology.
Code of Federal Regulations, 2011 CFR
2011-10-01
... challenges per testing event a program must provide for each analyte or test procedure is five. Analyte or... Compatibility testing Antibody identification (d) Evaluation of a laboratory's analyte or test performance. HHS... program must compare the laboratory's response for each analyte with the response that reflects agreement...
Armstrong, David; Barkun, Alan; Bridges, Ron; Carter, Rose; de Gara, Chris; Dubé, Catherine; Enns, Robert; Hollingworth, Roger; MacIntosh, Donald; Borgaonkar, Mark; Forget, Sylviane; Leontiadis, Grigorios; Meddings, Jonathan; Cotton, Peter; Kuipers, Ernst J; Valori, Roland
2012-01-01
BACKGROUND: Increasing use of gastrointestinal endoscopy, particularly for colorectal cancer screening, and increasing emphasis on health care quality, highlight the need for clearly defined, evidence-based processes to support quality improvement in endoscopy. OBJECTIVE: To identify processes and indicators of quality and safety relevant to high-quality endoscopy service delivery. METHODS: A multidisciplinary group of 35 voting participants developed recommendation statements and performance indicators. Systematic literature searches generated 50 initial statements that were revised iteratively following a modified Delphi approach using a web-based evaluation and voting tool. Statement development and evidence evaluation followed the AGREE (Appraisal of Guidelines, REsearch and Evaluation) and GRADE (Grading of Recommendations, Assessment, Development and Evaluation) guidelines. At the consensus conference, participants voted anonymously on all statements using a 6-point scale. Subsequent web-based voting evaluated recommendations for specific, individual quality indicators, safety indicators and mandatory endoscopy reporting fields. Consensus was defined a priori as agreement by 80% of participants. RESULTS: Consensus was reached on 23 recommendation statements addressing the following: ethics (statement 1: agreement 100%), facility standards and policies (statements 2 to 9: 90% to 100%), quality assurance (statements 10 to 13: 94% to 100%), training, education, competency and privileges (statements 14 to 19: 97% to 100%), endoscopy reporting standards (statements 20 and 21: 97% to 100%) and patient perceptions (statements 22 and 23: 100%). Additionally, 18 quality indicators (agreement 83% to 100%), 20 safety indicators (agreement 77% to 100%) and 23 recommended endoscopy-reporting elements (agreement 91% to 100%) were identified. DISCUSSION: The consensus process identified a clear need for high-quality clinical and outcomes research to support quality improvement in the delivery of endoscopy services. CONCLUSIONS: The guidelines support quality improvement in endoscopy by providing explicit recommendations on systematic monitoring, assessment and modification of endoscopy service delivery to yield benefits for all patients affected by the practice of gastrointestinal endoscopy. PMID:22308578
Cultural adaptation of an instrument to assess physical fitness in cardiac patients.
Domingues, Gabriela de Barros Leite; Gallani, Maria Cecília; Gobatto, Claudio Alexandre; Miura, Cinthya Tamie Passos; Rodrigues, Roberta Cunha Matheus; Myers, Jonathan
2011-04-01
To validate the content and to evaluate the reliability of the Veterans Specific Activity Questionnaire instrument, culturally adapted for use in the Brazilian population of cardiac patients. The instrument was translated and back-translated and subsequently analyzed by a committee of judges to evaluate its semantic-idiomatic and cultural equivalences. Physical activities were replaced when indicated in the instrument, but uncommon in the daily life of the target population. Another committee of specialists analyzed the metabolic equivalence of replaced activities. The proportion of agreement of evaluation of the judges was quantified by the Content Validity Index. The pre-test was performed in two stages (n1 and n2=15). Reliability was assessed using the test-retest (interval of 7-15 days, n = 50). In the evaluation of semantic-idiomatic and cultural equivalences, items with a Content Validity Index < 1 were reviewed until consensus among the judges was obtained. The second committee found 100% of agreement in the analysis of metabolic equivalence between original and replaced activities. Test-retest analysis indicated a Kappa coefficient of agreement (k = 0.86; (p<0.001), suggesting temporal stability of the instrument. The Brazilian version of the Veterans Specific Activity Questionnaire showed evidence of reliability, according to the temporal stability criterion and adequate cultural content.
Fu, Michael C; Buerba, Rafael A; Long, William D; Blizzard, Daniel J; Lischuk, Andrew W; Haims, Andrew H; Grauer, Jonathan N
2014-10-01
Magnetic resonance imaging (MRI) is frequently used in the evaluation of degenerative conditions in the lumbar spine. The relative interrater and intrarater agreements of MRI findings across different pathologic conditions are underexplored, as most studies are focused on specific findings. The purpose of this study was to characterize the interrater and intrarater agreements of MRI findings used to assess the degenerative lumbar spine. A retrospective diagnostic study at a large academic medical center was undertaken with a panel of orthopedic surgeons and musculoskeletal radiologists to assess lumbar MRIs using standardized criteria. Seventy-five subjects who underwent routine lumbar spine MRI at our institution were included. Each MRI study was assessed for 10 lumbar degenerative findings using standardized criteria. Lumbar vertebral levels were assessed independently, where applicable, for a total of 52 data points collected per study. T2-weighted axial and sagittal MRI sequences were presented in random order to the four reviewers (two orthopedic spine surgeons and two musculoskeletal radiologists) independently to determine interrater agreement. The first 10 studies were reevaluated at the end to determine intrarater agreement. Images were assessed using standardized and pilot-tested criteria to assess disc degeneration, stenosis, and other degenerative changes. Interrater and intrarater absolute percent agreements were calculated. To highlight the most clinically important MRI disagreements, a modified agreement analysis was also performed (in which disagreements between the lowest two severity grades for applicable conditions were ignored). Fleiss kappa coefficients for interrater agreement were determined. The overall absolute and modified interrater agreements were 76.9% and 93.5%, respectively. The absolute and modified intrarater agreements were 81.3% and 92.7%, respectively. Average Fleiss kappa coefficient was 0.431, suggesting moderate overall agreement. However, when stratified by condition, absolute interrater agreement ranged from 65.1% to 92.0%. Disc hydration, disc space height, and bone marrow changes exhibited the lowest absolute interrater agreements. The absolute intrarater agreement had a narrower range, from 74.5% to 91.5%. Fleiss kappa coefficients ranged from fair-to-substantial agreement (0.282-0.618). Even in a study using standardized evaluation criteria, there was significant variability in the interrater and intrarater agreements of MRI in assessing different degenerative conditions of the lumbar spine. Clinicians should be aware of the condition-specific diagnostic limitations of MRI interpretation. Copyright © 2014 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Suiter, Ashley Elizabeth
Multi-spectral imagery provides a robust and low-cost dataset for assessing wetland extent and quality over broad regions and is frequently used for wetland inventories. However in forested wetlands, hydrology is obscured by tree canopy making it difficult to detect with multi-spectral imagery alone. Because of this, classification of forested wetlands often includes greater errors than that of other wetlands types. Elevation and terrain derivatives have been shown to be useful for modelling wetland hydrology. But, few studies have addressed the use of LiDAR intensity data detecting hydrology in forested wetlands. Due the tendency of LiDAR signal to be attenuated by water, this research proposed the fusion of LiDAR intensity data with LiDAR elevation, terrain data, and aerial imagery, for the detection of forested wetland hydrology. We examined the utility of LiDAR intensity data and determined whether the fusion of Lidar derived data with multispectral imagery increased the accuracy of forested wetland classification compared with a classification performed with only multi-spectral image. Four classifications were performed: Classification A -- All Imagery, Classification B -- All LiDAR, Classification C -- LiDAR without Intensity, and Classification D -- Fusion of All Data. These classifications were performed using random forest and each resulted in a 3-foot resolution thematic raster of forested upland and forested wetland locations in Vermilion County, Illinois. The accuracies of these classifications were compared using Kappa Coefficient of Agreement. Importance statistics produced within the random forest classifier were evaluated in order to understand the contribution of individual datasets. Classification D, which used the fusion of LiDAR and multi-spectral imagery as input variables, had moderate to strong agreement between reference data and classification results. It was found that Classification A performed using all the LiDAR data and its derivatives (intensity, elevation, slope, aspect, curvatures, and Topographic Wetness Index) was the most accurate classification with Kappa: 78.04%, indicating moderate to strong agreement. However, Classification C, performed with LiDAR derivative without intensity data had less agreement than would be expected by chance, indicating that LiDAR contributed significantly to the accuracy of Classification B.
The Salaried, Credited Internship in Environmental Studies.
ERIC Educational Resources Information Center
Schultz, Judith M.
1981-01-01
If carefully designed and arranged, the credited, salaried internship can serve as a transition to the real world of environmental problem-solving, performance evaluation, and ability to work with peers and supervisors. Presented are guidelines for operating such programs and samples of the internship agreement, responsibilities, and evaluation…
Evaluation of the IMMULITE® 2000 CMV IgM assay
2012-01-01
Background Diagnosis of cytomegalovirus (CMV) infection is challenging because of the high rate of asymptomatic infection and the low specificity of associated symptoms and signs. As a result, laboratory testing is an essential aid in making an accurate diagnosis. The presence of CMV IgM is indicative of primary CMV infection. In pregnancy, diagnosis of primary infection is important because primary maternal infection increases fetal infection risk substantially. Fetal infection can result in serious sequelae ranging from neurological deficits to death. Diagnosis among the immunocompromised is also critical for the timely initiation of therapy that can reduce morbidity and mortality risk. Methods The IMMULITE® 2000 CMV IgM assay qualitatively detects CMV IgM antibodies in human serum or plasma to aid in the diagnosis of current or recent CMV infection. To determine expected values in apparently healthy subjects, 136 samples were tested. Reproducibility, normal range, and method comparison studies were also performed to evaluate the assay's performance. The assay's reproducibility was evaluated across three sites. Seven hundred and eighteen (n = 718) individual patient serum samples, which included samples from CMV IgM-positive (n = 109, determined by the Abbott IMx CMV or the Diamedix CMV IgM assays), pregnant (n = 210), HIV-positive (n = 30), immunosuppressed (n = 102), and transplant patients (n = 17) and from patients with potentially cross-reacting conditions (n = 136) were evaluated in the method comparison study. The positive, negative, and overall agreement between the IMMULITE 2000 CMV IgM assay and the VIDAS CMV IgM assay (predicate assay) were determined. Results The assay demonstrated excellent reproducibility with a total CV of less than 10%. The positive, negative, and overall agreement between the IMMULITE 2000 assay and the VIDAS assay were > 95% for the method comparison samples. Among potentially cross-reactive samples, the overall agreement between the two assays was 96%. Similarly, among the immunocompromised and pregnant subjects, the overall agreement was ~96% and ~97%, respectively. Conclusions The IMMULITE 2000 CMV IgM assay demonstrated excellent reproducibility, minimal cross-reactivity, and performance comparable to that of the VIDAS CMV IgM assay. It can aid in the diagnosis of acute CMV or recent CMV infection by qualitatively detecting the CMV IgM antibodies in human serum or plasma. PMID:22377002
Evaluation of the IMMULITE® 2000 CMV IgM assay.
Bal, Tricia A; Armstrong, Glenn; Han, Xiang Y
2012-02-29
Diagnosis of cytomegalovirus (CMV) infection is challenging because of the high rate of asymptomatic infection and the low specificity of associated symptoms and signs. As a result, laboratory testing is an essential aid in making an accurate diagnosis. The presence of CMV IgM is indicative of primary CMV infection. In pregnancy, diagnosis of primary infection is important because primary maternal infection increases fetal infection risk substantially. Fetal infection can result in serious sequelae ranging from neurological deficits to death. Diagnosis among the immunocompromised is also critical for the timely initiation of therapy that can reduce morbidity and mortality risk. The IMMULITE® 2000 CMV IgM assay qualitatively detects CMV IgM antibodies in human serum or plasma to aid in the diagnosis of current or recent CMV infection. To determine expected values in apparently healthy subjects, 136 samples were tested. Reproducibility, normal range, and method comparison studies were also performed to evaluate the assay's performance. The assay's reproducibility was evaluated across three sites. Seven hundred and eighteen (n = 718) individual patient serum samples, which included samples from CMV IgM-positive (n = 109, determined by the Abbott IMx CMV or the Diamedix CMV IgM assays), pregnant (n = 210), HIV-positive (n = 30), immunosuppressed (n = 102), and transplant patients (n = 17) and from patients with potentially cross-reacting conditions (n = 136) were evaluated in the method comparison study. The positive, negative, and overall agreement between the IMMULITE 2000 CMV IgM assay and the VIDAS CMV IgM assay (predicate assay) were determined. The assay demonstrated excellent reproducibility with a total CV of less than 10%. The positive, negative, and overall agreement between the IMMULITE 2000 assay and the VIDAS assay were > 95% for the method comparison samples. Among potentially cross-reactive samples, the overall agreement between the two assays was 96%. Similarly, among the immunocompromised and pregnant subjects, the overall agreement was ~96% and ~97%, respectively. The IMMULITE 2000 CMV IgM assay demonstrated excellent reproducibility, minimal cross-reactivity, and performance comparable to that of the VIDAS CMV IgM assay. It can aid in the diagnosis of acute CMV or recent CMV infection by qualitatively detecting the CMV IgM antibodies in human serum or plasma.
Under the Air Quality Model Evaluation International Initiative, Phase 2 (AQMEII-2), three online coupled air quality model simulations, with six different configurations, are analyzed for their performance, inter-model agreement, and responses to emission and meteorological chan...
Evaluation of the Aptima HIV-1 Quant Dx Assay Using Plasma and Dried Blood Spots
Sahoo, Malaya K.; Varghese, Vici; White, Elizabeth; Winslow, Meg; Katzenstein, David A.; Shafer, Robert W.
2016-01-01
HIV-1 RNA quantitation in plasma, or virus load testing, is the primary method by which the response to antiretroviral therapy is monitored. Here we describe evaluation of the Aptima HIV-1 Quant Dx assay (Aptima) performed on the automated Panther system. The clinical performance of Aptima was compared to that of the Cobas AmpliPrep/Cobas TaqMan HIV-1 Test v2.0 (CAP/CTM) using 162 EDTA plasma samples collected from patients undergoing HIV-1 monitoring. Overall agreement was 84.0% (136/162), with a kappa statistic of 0.723 (standard error, 0.047; 95% confidence interval [CI], 0.630 to 0.815), indicating substantial agreement. Using the 86 clinical samples quantifiable by both methods, Passing-Bablok regression revealed a regression line of Y = (1.069 × X) − 0.346 (95% CI of the slope [1.003 to 1.139] and intercept [−0.666 to −0.074]), and Bland-Altman analysis demonstrated a mean difference (Aptima-CAP/CTM) of −0.075 log10 copies/ml (95% limits of agreement of −0.624 to 0.475), consistent with negative bias. Comparison of Aptima results for paired dried blood spot (DBS) and plasma specimens archived from participants in the Peninsula AIDS Research Cohort Study (PARC) demonstrated an overall agreement of 94.7% (90/95) when 1,000 copies/ml was used as the threshold. In conclusion, the Aptima HIV-1 Quant Dx assay provides a suitable alternative for HIV-1 monitoring in plasma and DBS. PMID:27535684
Evaluation of the Aptima HIV-1 Quant Dx Assay Using Plasma and Dried Blood Spots.
Sahoo, Malaya K; Varghese, Vici; White, Elizabeth; Winslow, Meg; Katzenstein, David A; Shafer, Robert W; Pinsky, Benjamin A
2016-10-01
HIV-1 RNA quantitation in plasma, or virus load testing, is the primary method by which the response to antiretroviral therapy is monitored. Here we describe evaluation of the Aptima HIV-1 Quant Dx assay (Aptima) performed on the automated Panther system. The clinical performance of Aptima was compared to that of the Cobas AmpliPrep/Cobas TaqMan HIV-1 Test v2.0 (CAP/CTM) using 162 EDTA plasma samples collected from patients undergoing HIV-1 monitoring. Overall agreement was 84.0% (136/162), with a kappa statistic of 0.723 (standard error, 0.047; 95% confidence interval [CI], 0.630 to 0.815), indicating substantial agreement. Using the 86 clinical samples quantifiable by both methods, Passing-Bablok regression revealed a regression line of Y = (1.069 × X) - 0.346 (95% CI of the slope [1.003 to 1.139] and intercept [-0.666 to -0.074]), and Bland-Altman analysis demonstrated a mean difference (Aptima-CAP/CTM) of -0.075 log10 copies/ml (95% limits of agreement of -0.624 to 0.475), consistent with negative bias. Comparison of Aptima results for paired dried blood spot (DBS) and plasma specimens archived from participants in the Peninsula AIDS Research Cohort Study (PARC) demonstrated an overall agreement of 94.7% (90/95) when 1,000 copies/ml was used as the threshold. In conclusion, the Aptima HIV-1 Quant Dx assay provides a suitable alternative for HIV-1 monitoring in plasma and DBS. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Fitzgerald, Rebecca C; di Pietro, Massimiliano; Ragunath, Krish; Ang, Yeng; Kang, Jin-Yong; Watson, Peter; Trudgill, Nigel; Patel, Praful; Kaye, Philip V; Sanders, Scott; O'Donovan, Maria; Bird-Lieberman, Elizabeth; Bhandari, Pradeep; Jankowski, Janusz A; Attwood, Stephen; Parsons, Simon L; Loft, Duncan; Lagergren, Jesper; Moayyedi, Paul; Lyratzopoulos, Georgios; de Caestecker, John
2014-01-01
These guidelines provide a practical and evidence-based resource for the management of patients with Barrett's oesophagus and related early neoplasia. The Appraisal of Guidelines for Research and Evaluation (AGREE II) instrument was followed to provide a methodological strategy for the guideline development. A systematic review of the literature was performed for English language articles published up until December 2012 in order to address controversial issues in Barrett's oesophagus including definition, screening and diagnosis, surveillance, pathological grading for dysplasia, management of dysplasia, and early cancer including training requirements. The rigour and quality of the studies was evaluated using the SIGN checklist system. Recommendations on each topic were scored by each author using a five-tier system (A+, strong agreement, to D+, strongly disagree). Statements that failed to reach substantial agreement among authors, defined as >80% agreement (A or A+), were revisited and modified until substantial agreement (>80%) was reached. In formulating these guidelines, we took into consideration benefits and risks for the population and national health system, as well as patient perspectives. For the first time, we have suggested stratification of patients according to their estimated cancer risk based on clinical and histopathological criteria. In order to improve communication between clinicians, we recommend the use of minimum datasets for reporting endoscopic and pathological findings. We advocate endoscopic therapy for high-grade dysplasia and early cancer, which should be performed in high-volume centres. We hope that these guidelines will standardise and improve management for patients with Barrett's oesophagus and related neoplasia.
Stahl, Ido; Katsman, Alexander; Zaidman, Michael; Keshet, Doron; Sigal, Amit; Eidelman, Mark
2017-07-11
Smartphones have the ability to capture and send images, and their use has become common in the emergency setting for transmitting radiographic images with the intent to consult an off-site specialist. Our objective was to evaluate the reliability of smartphone-based instant messaging applications for the evaluation of various pediatric limb traumas, as compared with the standard method of viewing images of a workstation-based picture archiving and communication system (PACS). X-ray images of 73 representative cases of pediatric limb trauma were captured and transmitted to 5 pediatric orthopedic surgeons by the Whatsapp instant messaging application on an iPhone 6 smartphone. Evaluators were asked to diagnose, classify, and determine the course of treatment for each case over their personal smartphones. Following a 4-week interval, revaluation was conducted using the PACS. Intraobserver agreement was calculated for overall agreement and per fracture site. The overall results indicate "near perfect agreement" between interpretations of the radiographs on smartphones compared with computer-based PACS, with κ of 0.84, 0.82, and 0.89 for diagnosis, classification, and treatment planning, respectively. Looking at the results per fracture site, we also found substantial to near perfect agreement. Smartphone-based instant messaging applications are reliable for evaluation of a wide range of pediatric limb fractures. This method of obtaining an expert opinion from the off-site specialist is immediately accessible and inexpensive, making smartphones a powerful tool for doctors in the emergency department, primary care clinics, or remote medical centers, enabling timely and appropriate treatment for the injured child. This method is not a substitution for evaluation of the images in the standard method over computer-based PACS, which should be performed before final decision-making.
Chavada, Ruchir; Maley, Michael
2015-01-01
Introduction: Community and healthcare associated infections caused by multi-drug resistant gram negative organisms (MDR GN) represent a worldwide threat. Nucleic Acid Detection tests are becoming more common for their detection; however they can be expensive requiring specialised equipment and local expertise. This study was done to evaluate the utility of a commercial multiplex tandem (MT) PCR for detection of MDR GN. Methods: The study was done on stored laboratory MDR GN isolates from sterile and non-sterile specimens (n=126, out of stored 567 organisms). Laboratory validation of the MT PCR was done to evaluate sensitivity, specificity and agreement with the current phenotypic methods used in the laboratory. Amplicon sequencing was also done on selected isolates for assessing performance characteristics. Workflow and cost implications of the MT PCR were evaluated. Results: The sensitivity and specificity of the MT PCR were calculated to be 95% and 96.7% respectively. Agreement with the phenotypic methods was 80%. Major lack of agreement was seen in detection of AmpC beta lactamase in enterobacteriaceae and carbapenemase in non-fermenters. Agreement of the MT PCR with another multiplex PCR was found to be 87%. Amplicon sequencing confirmed the genotype detected by MT PCR in 94.2 % of cases tested. Time to result was faster for the MT PCR but cost per test was higher. Conclusion: This study shows that with carefully chosen targets for detection of resistance genes in MDR GN, rapid and efficient identification is possible. MT PCR was sensitive and specific and likely more accurate than phenotypic methods. PMID:26464612
Malarial retinopathy in northern Nigerian children.
Mohammed, I; Ibrahim, U Y; Mukhtar, M; Farouq, Z; Obiagwu, P N; Yashua, A H
2010-01-01
This is a prospective study involving two centres. Children younger than 16 years old who had severe malaria underwent funduscopy. In one centre, direct ophthalmoscopy was performed by both an ophthalmologist and a non-ophthalmologist physician. At the other, two ophthalmologists performed a funduscopy on each patient using different methods (one using direct and the other indirect ophthalmoscopy). The objective was to determine the frequency of retinopathy and evaluate the effectiveness of direct ophthalmoscopy in detecting retinopathy in constantly dilated eyes. Of 62 children seen at both centres, 37% were found to have malarial retinopathy. Macula whitening was the most common finding. There was substantial agreement between the ophthalmologists using either the direct or indirect ophthalmoscopy techniques (Kappa value 0.604). However, at the centre using only direct ophthalmoscopy, there was poor agreement between the ophthalmologist and the non-ophthalmologist (kappa value -0.244). Direct ophthalmoscopy seemed to be effective in detecting malarial retinopathy, but only when performed by experienced personnel.
Kwon, S R; Meharry, M; Oyoyo, U; Li, Y
2015-01-01
To evaluate the efficacy of do-it-yourself (DIY) whitening as compared to conventional tooth whitening modalities using different shade assessment tools. Extracted human molars (120) were randomly distributed to six groups (n=20). Whitening was performed according to manufacturer's directions for over-the-counter, dentist-dispensed for home use, and in-office whitening. DIY whitening consisted of a strawberry and baking soda mix. Additionally, negative and positive controls were used. Two evaluators used the Vita Classical (VC) and Vita Bleachedguide 3D-Master with interpolated numbers (BGi) for visual assessment at baseline and one-week, one-month, and three-month postwhitening. Instrumental measurements were performed with a spectrophotometer. Kruskal-Wallis procedure was used to assess color changes among groups and intraclass correlation (ICC) to evaluate agreement between evaluators. DIY exhibited lower color change (ΔSGUVC, ΔSGUBGi, ΔE*, where SGU = shade guide unit and E = overall color change) compared to other whitening groups at all time points (p<0.05). ICC demonstrated very good agreement between evaluators with VC and BGi at each time point. Both shade guides were related with each other and strongly related to instrumental measurements (p<0.05). DIY whitening was the least effective whitening modality. Both VC and BGi are related with each other and have good correlation with instrumental measurements.
DOT National Transportation Integrated Search
1999-04-01
Grain marketing may be defined as "the performance of all business activities that coordinate the flow of goods and services from grain producers to consumers and users." This analysis examines the transportation component of the grain marketing syst...
RADON REMOVAL USING POINT-OF-ENTRY WATER TREATMENT TECHNIQUES
The purpose of the EPA Cooperative Agreement was to evaluate the performance of POE granular activated carbon (GAC), and diffused bubble and bubble place aeration systems treating a ground water supply containing radon (35,620 ±6,717 pCi/L). The pattern of loading to the uni...
NASA Technical Reports Server (NTRS)
Hirshorn, Steven R.
2017-01-01
Historically, most successful NASA projects have depended on effectively blending project management, systems engineering, and technical expertise among NASA, contractors, and third parties. Underlying these successes are a variety of agreements (e.g., contract, memorandum of understanding, grant, cooperative agreement) between NASA organizations or between NASA and other Government agencies, Government organizations, companies, universities, research laboratories, and so on. To simplify the discussions, the term "contract" is used to encompass these agreements. This section focuses on the NASA systems engineering activities pertinent to awarding a contract, managing contract performance, and completing a contract. In particular, NASA systems engineering interfaces to the procurement process are covered, since the NASA engineering technical team plays a key role in the development and evaluation of contract documentation. Contractors and third parties perform activities that supplement (or substitute for) the NASA project technical team accomplishment of the NASA common systems engineering technical process activities and requirements outlined in this guide. Since contractors might be involved in any part of the systems engineering life cycle, the NASA project technical team needs to know how to prepare for, allocate or perform, and implement surveillance of technical activities that are allocated to contractors.
Simulation and performance of brushless dc motor actuators
NASA Astrophysics Data System (ADS)
Gerba, A., Jr.
1985-12-01
The simulation model for a Brushless D.C. Motor and the associated commutation power conditioner transistor model are presented. The necessary conditions for maximum power output while operating at steady-state speed and sinusoidally distributed air-gap flux are developed. Comparison of simulated model with the measured performance of a typical motor are done both on time response waveforms and on average performance characteristics. These preliminary results indicate good agreement. Plans for model improvement and testing of a motor-driven positioning device for model evaluation are outlined.
Evaluation of the 235 U resonance parameters to fit the standard recommended values
Leal, Luiz; Noguere, Gilles; Paradela, Carlos; ...
2017-09-13
A great deal of effort has been dedicated to the revision of the standard values in connection with the neutron interaction for some actinides. While standard data compilation are available for decades nuclear data evaluations included in existing nuclear data libraries (ENDF, JEFF, JENDL, etc.) do not follow the standard recommended values. Indeed, the majority of evaluations for major actinides do not conform to the standards whatsoever. In particular, for the n + 235U interaction the only value in agreement with the standard is the thermal fission cross section. We performed a resonance re-evaluation of the n + 235U interactionmore » in order to address the issues regarding standard values in the energy range from 10-5 eV to 2250 eV. Recently, 235U fission cross-section measurements have been performed at the CERN Neutron Time-o-Flight facility (TOF), known as n_TOF, in the energy range from 0.7 eV to 10 keV. The data were normalized according to the recommended standard of the fission integral in the energy range 7.8 eV to 11 eV. As a result, the n_TOF averaged fission cross sections above 100 eV are in good agreement with the standard recommended values. The n_TOF data were included in the 235U resonance analysis that was performed with the code SAMMY. In addition to the average standard values related to the fission cross section, standard thermal values for fission, capture, and elastic cross sections were also included in the evaluation. Our paper presents the procedure used for re-evaluating the 235U resonance parameters including the recommended standard values as well as new cross section measurements.« less
Evaluation of the 235 U resonance parameters to fit the standard recommended values
DOE Office of Scientific and Technical Information (OSTI.GOV)
Leal, Luiz; Noguere, Gilles; Paradela, Carlos
A great deal of effort has been dedicated to the revision of the standard values in connection with the neutron interaction for some actinides. While standard data compilation are available for decades nuclear data evaluations included in existing nuclear data libraries (ENDF, JEFF, JENDL, etc.) do not follow the standard recommended values. Indeed, the majority of evaluations for major actinides do not conform to the standards whatsoever. In particular, for the n + 235U interaction the only value in agreement with the standard is the thermal fission cross section. We performed a resonance re-evaluation of the n + 235U interactionmore » in order to address the issues regarding standard values in the energy range from 10-5 eV to 2250 eV. Recently, 235U fission cross-section measurements have been performed at the CERN Neutron Time-o-Flight facility (TOF), known as n_TOF, in the energy range from 0.7 eV to 10 keV. The data were normalized according to the recommended standard of the fission integral in the energy range 7.8 eV to 11 eV. As a result, the n_TOF averaged fission cross sections above 100 eV are in good agreement with the standard recommended values. The n_TOF data were included in the 235U resonance analysis that was performed with the code SAMMY. In addition to the average standard values related to the fission cross section, standard thermal values for fission, capture, and elastic cross sections were also included in the evaluation. Our paper presents the procedure used for re-evaluating the 235U resonance parameters including the recommended standard values as well as new cross section measurements.« less
Evaluation of the 235U resonance parameters to fit the standard recommended values
NASA Astrophysics Data System (ADS)
Leal, Luiz; Noguere, Gilles; Paradela, Carlos; Durán, Ignacio; Tassan-Got, Laurent; Danon, Yaron; Jandel, Marian
2017-09-01
A great deal of effort has been dedicated to the revision of the standard values in connection with the neutron interaction for some actinides. While standard data compilation are available for decades nuclear data evaluations included in existing nuclear data libraries (ENDF, JEFF, JENDL, etc.) do not follow the standard recommended values. Indeed, the majority of evaluations for major actinides do not conform to the standards whatsoever. In particular, for the n + 235U interaction the only value in agreement with the standard is the thermal fission cross section. A resonance re-evaluation of the n + 235U interaction has been performed to address the issues regarding standard values in the energy range from 10-5 eV to 2250 eV. Recently, 235U fission cross-section measurements have been performed at the CERN Neutron Time-of-Flight facility (TOF), known as n_TOF, in the energy range from 0.7 eV to 10 keV. The data were normalized according to the recommended standard of the fission integral in the energy range 7.8 eV to 11 eV. As a result, the n_TOF averaged fission cross sections above 100 eV are in good agreement with the standard recommended values. The n_TOF data were included in the 235U resonance analysis that was performed with the code SAMMY. In addition to the average standard values related to the fission cross section, standard thermal values for fission, capture, and elastic cross sections were also included in the evaluation. This paper presents the procedure used for re-evaluating the 235U resonance parameters including the recommended standard values as well as new cross section measurements.
Appropriate Statistics for Determining Chance-Removed Interpractitioner Agreement.
Popplewell, Michael; Reizes, John; Zaslawski, Chris
2018-05-31
Fleiss' Kappa (FK) has been commonly, but incorrectly, employed as the "standard" for evaluating chance-removed inter-rater agreement with ordinal data. This practice may lead to misleading conclusions in inter-rater agreement research. An example is presented that demonstrates the conditions where FK produces inappropriate results, compared with Gwet's AC2, which is proposed as a more appropriate statistic. A novel format for recording a Chinese Medical (CM) diagnoses, called the Diagnostic System of Oriental Medicine (DSOM), was used to record and compare patient diagnostic data, which, unlike the contemporary CM diagnostic format, allows agreement by chance to be considered when evaluating patient data obtained with unrestricted diagnostic options available to diagnosticians. Five CM practitioners diagnosed 42 subjects drawn from an open population. Subjects' diagnoses were recorded using the DSOM format. All the available data were initially used to evaluate agreement. Then, the subjects were sorted into three groups to demonstrate the effects of differing data marginality on the calculated chance-removed agreement. Agreement between the practitioners for each subject was evaluated with linearly weighted simple agreement, FK and Gwet's AC2. In all cases, overall agreement was much lower with FK than Gwet's AC2. Larger differences occurred when the data were more free marginal. Inter-rater agreement determined with FK statistics is unlikely to be correct unless it can be shown that the data from which agreement is determined are, in fact, fixed marginal. It follows that results obtained on agreement between practitioners with FK are probably incorrect. It is shown that inter-rater agreement evaluated with AC2 statistic is an appropriate measure when fixed marginal data are neither expected nor guaranteed. The AC2 statistic should be used as the standard statistical approach for determining agreement between practitioners.
Veillard, Jeremy; Huynh, Tai; Ardal, Sten; Kadandale, Sowmya; Klazinga, Niek S.; Brown, Adalsteinn D.
2010-01-01
This study examined the experience of the Ontario Ministry of Health and Long-Term Care in enhancing its stewardship and performance management role by developing a health system strategy map and a strategy-based scorecard through a process of policy reviews and expert consultations, and linking them to accountability agreements. An evaluation of the implementation and of the effects of the policy intervention has been carried out through direct policy observation over three years, document analysis, interviews with decision-makers and systematic discussion of findings with other authors and external reviewers. Cascading strategies at health and local health system levels were identified, and a core set of health system and local health system performance indicators was selected and incorporated into accountability agreements with the Local Health Integration Networks. despite the persistence of such challenges as measurement limitations and lack of systematic linkage to decision-making processes, these activities helped to strengthen substantially the ministry's performance management function. PMID:21286268
NASA Technical Reports Server (NTRS)
1997-01-01
Under a Space Act Agreement between the Kirby company and Lewis Research Center, NASA technology was applied to a commercial vacuum cleaner product line. Kirby engineers were interested in advanced operational concepts, such as particle flow behavior and vibration, critical factors to improve vacuum cleaner performance. An evaluation of the company 1994 home care system, the Kirby G4, led to the refinement of the new G5 and future models. Under the cooperative agreement, Kirby had access to Lewis' holography equipment, which added insight into how long a vacuum cleaner fan would perform, as well as advanced computer software that can simulate the flow of air through fans. The collaboration resulted in several successes including fan blade redesign and continuing dialogue on how to improve air-flow traits in various nozzle designs.
25 CFR 170.621 - What if a tribe fails to substantially perform work under a contract or agreement?
Code of Federal Regulations, 2014 CFR
2014-04-01
... 25 Indians 1 2014-04-01 2014-04-01 false What if a tribe fails to substantially perform work under... Contracts and Agreements Under Isdeaa § 170.621 What if a tribe fails to substantially perform work under a contract or agreement? If a tribe fails to substantially perform work under a contract or agreement: (a...
25 CFR 170.621 - What if a tribe fails to substantially perform work under a contract or agreement?
Code of Federal Regulations, 2013 CFR
2013-04-01
... 25 Indians 1 2013-04-01 2013-04-01 false What if a tribe fails to substantially perform work under... Contracts and Agreements Under Isdeaa § 170.621 What if a tribe fails to substantially perform work under a contract or agreement? If a tribe fails to substantially perform work under a contract or agreement: (a...
Corbellini, Carlo; Andreoni, Bruno; Ansaloni, Luca; Sgroi, Giovanni; Martinotti, Mario; Scandroglio, Ildo; Carzaniga, Pierluigi; Longoni, Mauro; Foschi, Diego; Dionigi, Paolo; Morandi, Eugenio; Agnello, Mauro
2018-01-01
Measurement and monitoring of the quality of care using a core set of quality measures are increasing in health service research. Although administrative databases include limited clinical data, they offer an attractive source for quality measurement. The purpose of this study, therefore, was to evaluate the completeness of different administrative data sources compared to a clinical survey in evaluating rectal cancer cases. Between May 2012 and November 2014, a clinical survey was done on 498 Lombardy patients who had rectal cancer and underwent surgical resection. These collected data were compared with the information extracted from administrative sources including Hospital Discharge Dataset, drug database, daycare activity data, fee-exemption database, and regional screening program database. The agreement evaluation was performed using a set of 12 quality indicators. Patient complexity was a difficult indicator to measure for lack of clinical data. Preoperative staging was another suboptimal indicator due to the frequent missing administrative registration of tests performed. The agreement between the 2 data sources regarding chemoradiotherapy treatments was high. Screening detection, minimally invasive techniques, length of stay, and unpreventable readmissions were detected as reliable quality indicators. Postoperative morbidity could be a useful indicator but its agreement was lower, as expected. Healthcare administrative databases are large and real-time collected repositories of data useful in measuring quality in a healthcare system. Our investigation reveals that the reliability of indicators varies between them. Ideally, a combination of data from both sources could be used in order to improve usefulness of less reliable indicators.
Blanck, Oliver; Masi, Laura; Chan, Mark K H; Adamczyk, Sebastian; Albrecht, Christian; Damme, Marie-Christin; Loutfi-Krauss, Britta; Alraun, Manfred; Fehr, Roman; Ramm, Ulla; Siebert, Frank-Andre; Stelljes, Tenzin Sonam; Poppinga, Daniela; Poppe, Björn
2016-06-01
High precision radiosurgery demands comprehensive delivery-quality-assurance techniques. The use of a liquid-filled ion-chamber-array for robotic-radiosurgery delivery-quality-assurance was investigated and validated using several test scenarios and routine patient plans. Preliminary evaluation consisted of beam profile validation and analysis of source-detector-distance and beam-incidence-angle response dependence. The delivery-quality-assurance analysis is performed in four steps: (1) Array-to-plan registration, (2) Evaluation with standard Gamma-Index criteria (local-dose-difference⩽2%, distance-to-agreement⩽2mm, pass-rate⩾90%), (3) Dose profile alignment and dose distribution shift until maximum pass-rate is found, and (4) Final evaluation with 1mm distance-to-agreement criterion. Test scenarios consisted of intended phantom misalignments, dose miscalibrations, and undelivered Monitor Units. Preliminary method validation was performed on 55 clinical plans in five institutions. The 1000SRS profile measurements showed sufficient agreement compared with a microDiamond detector for all collimator sizes. The relative response changes can be up to 2.2% per 10cm source-detector-distance change, but remains within 1% for the clinically relevant source-detector-distance range. Planned and measured dose under different beam-incidence-angles showed deviations below 1% for angles between 0° and 80°. Small-intended errors were detected by 1mm distance-to-agreement criterion while 2mm criteria failed to reveal some of these deviations. All analyzed delivery-quality-assurance clinical patient plans were within our tight tolerance criteria. We demonstrated that a high-resolution liquid-filled ion-chamber-array can be suitable for robotic radiosurgery delivery-quality-assurance and that small errors can be detected with tight distance-to-agreement criterion. Further improvement may come from beam specific correction for incidence angle and source-detector-distance response. Copyright © 2016 Associazione Italiana di Fisica Medica. Published by Elsevier Ltd. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kang, S; Suh, T; Chung, J
2015-06-15
Purpose: To verify the dose accuracy of Acuros XB (AXB) dose calculation algorithm at air-tissue interface using inhomogeneous phantom for 6-MV flattening filter-free (FFF) beams. Methods: An inhomogeneous phantom included air cavity was manufactured for verifying dose accuracy at the air-tissue interface. The phantom was composed with 1 and 3 cm thickness of air cavity. To evaluate the central axis doses (CAD) and dose profiles of the interface, the dose calculations were performed for 3 × 3 and 4 × 4 cm{sup 2} fields of 6 MV FFF beams with AAA and AXB in Eclipse treatment plainning system. Measurements inmore » this region were performed with Gafchromic film. The root mean square errors (RMSE) were analyzed with calculated and measured dose profile. Dose profiles were divided into inner-dose profile (>80%) and penumbra (20% to 80%) region for evaluating RMSE. To quantify the distribution difference, gamma evaluation was used and determined the agreement with 3%/3mm criteria. Results: The percentage differences (%Diffs) between measured and calculated CAD in the interface, AXB shows more agreement than AAA. The %Diffs were increased with increasing the thickness of air cavity size and it is similar for both algorithms. In RMSEs of inner-profile, AXB was more accurate than AAA. The difference was up to 6 times due to overestimation by AAA. RMSEs of penumbra appeared to high difference for increasing the measurement depth. Gamma agreement also presented that the passing rates decreased in penumbra. Conclusion: This study demonstrated that the dose calculation with AXB shows more accurate than with AAA for the air-tissue interface. The 2D dose distributions with AXB for both inner-profile and penumbra showed better agreement than with AAA relative to variation of the measurement depths and air cavity sizes.« less
Henderson, Joseph W; Kane, Sarah M; Mangel, Jeffrey M; Kikano, Elias G; Garibay, Jorge A; Pollard, Robert R; Mahajan, Sangeeta T; Debanne, Sara M; Hijaz, Adonis K
2018-06-01
The cough stress test is a common and accepted tool to evaluate stress urinary incontinence but there is no agreement on how the test should be performed. We assessed the diagnostic ability of different cough stress tests performed when varying patient position and bladder volume using urodynamic stress urinary incontinence as the gold standard. The 24-hour pad test was also evaluated. We recruited women who presented to specialty outpatient clinics with the complaint of urinary incontinence and who were recommended to undergo urodynamic testing. A total of 140 patients were randomized to 4 cough stress test groups, including group 1-a comfortably full bladder, group 2-an empty bladder, group 3- a bladder infused with 200 cc saline and group 4-a bladder filled to half functional capacity. The sequence of standing and sitting was randomly assigned. The groups were compared by 1-way ANOVA or the generalized Fisher exact test. The κ statistic was used to evaluate agreement between the sitting and standing positions. The 95% CIs of sensitivity and specificity were calculated using the Wilson method. ROC analysis was done to evaluate the performance of the 24-hour pad test. The cough stress test performed with a bladder filled to half functional capacity was the best performing test with 83% sensitivity and 90% specificity. There was no statistically significant evidence that the sensitivity or specificity of 1 cough stress test differed from that of the others. The pad test had no significant predictive ability to diagnose urodynamic stress urinary incontinence (AUC 0.60, p = 0.08). Cough stress tests were accurate to diagnose urodynamic stress urinary incontinence. The 24-hour pad test was not predictive of urodynamic stress urinary incontinence and not helpful when used in conjunction with the cough stress test. Copyright © 2018 American Urological Association Education and Research, Inc. Published by Elsevier Inc. All rights reserved.
Sam, Soya S; Caliendo, Angela M; Ingersoll, Jessica; Abdul-Ali, Deborah; Kraft, Colleen S
Timely and precise laboratory diagnosis of Herpes simplex viruses (HSV) is required to guide clinical management. The study evaluated limit of detection (LOD) and performance characteristics of the Aptima HSV 1 & 2 assay in comparison to four assays. The multi-center study compared qualitative detection of HSV-1 and 2 by the Aptima HSV-1 and 2 assay (Hologic) to ELVIS culture, Lyra Direct (Quidel), AmpliVue (Quidel) and a laboratory developed test (LDT). LOD was performed using VTM and STM diluted viral concentrations and clinical performance was evaluated using 505 swab specimens. The Aptima LOD studies performed showed a lower detection limit for STM specimens as 1450 copies/mL and 430 copies/mL for HSV1 and HSV-2 respectively; the LOD for VTM specimens was 9370 copies/mL and 8045 copies/mL for HSV-1 and HSV-2 respectively. When the assays were analyzed based on the positive consensus result established the Aptima had 95% of percent positive agreement (PPA) and 100% negative percent agreement (NPA) for the HSV-1. For the HSV-2, the PPA and NPA for Aptima were 96% and 100% respectively. AmpliVue had 1.8% invalid rate, while Lyra had no invalid results but an inhibition rate of 0.8%. Aptima and LDT did not have any invalid or inhibited results. The results indicate that the Aptima HSV-1 & 2 assay is sensitive and the performance characteristics of the Aptima assay is comparable to the assays analyzed for the detection and differentiation of HSV-1 and 2 from cutaneous and mucocutaneous lesions. Copyright © 2017 Elsevier B.V. All rights reserved.
43 CFR 3192.12 - What activities may Tribes or States perform under cooperative agreements?
Code of Federal Regulations, 2012 CFR
2012-10-01
... 43 Public Lands: Interior 2 2012-10-01 2012-10-01 false What activities may Tribes or States perform under cooperative agreements? 3192.12 Section 3192.12 Public Lands: Interior Regulations Relating... Cooperative Agreements § 3192.12 What activities may Tribes or States perform under cooperative agreements...
43 CFR 3192.12 - What activities may Tribes or States perform under cooperative agreements?
Code of Federal Regulations, 2011 CFR
2011-10-01
... 43 Public Lands: Interior 2 2011-10-01 2011-10-01 false What activities may Tribes or States perform under cooperative agreements? 3192.12 Section 3192.12 Public Lands: Interior Regulations Relating... Cooperative Agreements § 3192.12 What activities may Tribes or States perform under cooperative agreements...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Baldini, Elizabeth H., E-mail: ebaldini@partners.org; Abrams, Ross A.; Bosch, Walter
Purpose: The purpose of this study was to evaluate the variability in target volume and organ at risk (OAR) contour delineation for retroperitoneal sarcoma (RPS) among 12 sarcoma radiation oncologists. Methods and Materials: Radiation planning computed tomography (CT) scans for 2 cases of RPS were distributed among 12 sarcoma radiation oncologists with instructions for contouring gross tumor volume (GTV), clinical target volume (CTV), high-risk CTV (HR CTV: area judged to be at high risk of resulting in positive margins after resection), and OARs: bowel bag, small bowel, colon, stomach, and duodenum. Analysis of contour agreement was performed using the simultaneousmore » truth and performance level estimation (STAPLE) algorithm and kappa statistics. Results: Ten radiation oncologists contoured both RPS cases, 1 contoured only RPS1, and 1 contoured only RPS2 such that each case was contoured by 11 radiation oncologists. The first case (RPS 1) was a patient with a de-differentiated (DD) liposarcoma (LPS) with a predominant well-differentiated (WD) component, and the second case (RPS 2) was a patient with DD LPS made up almost entirely of a DD component. Contouring agreement for GTV and CTV contours was high. However, the agreement for HR CTVs was only moderate. For OARs, agreement for stomach, bowel bag, small bowel, and colon was high, but agreement for duodenum (distorted by tumor in one of these cases) was fair to moderate. Conclusions: For preoperative treatment of RPS, sarcoma radiation oncologists contoured GTV, CTV, and most OARs with a high level of agreement. HR CTV contours were more variable. Further clarification of this volume with the help of sarcoma surgical oncologists is necessary to reach consensus. More attention to delineation of the duodenum is also needed.« less
Baldini, Elizabeth H.; Abrams, Ross A.; Bosch, Walter; Roberge, David; Haas, Rick L.M.; Catton, Charles N.; Indelicato, Daniel J.; Olsen, Jeffrey R.; Deville, Curtiland; Chen, Yen-Lin; Finkelstein, Steven E.; DeLaney, Thomas F.; Wang, Dian
2015-01-01
Purpose The purpose of this study was to evaluate the variability in target volume and organ at risk (OAR) contour delineation for retroperitoneal sarcoma (RPS) among 12 sarcoma radiation oncologists. Methods and Materials Radiation planning computed tomography (CT) scans for 2 cases of RPS were distributed among 12 sarcoma radiation oncologists with instructions for contouring gross tumor volume (GTV), clinical target volume (CTV), high-risk CTV (HR CTV: area judged to be at high risk of resulting in positive margins after resection), and OARs: bowel bag, small bowel, colon, stomach, and duodenum. Analysis of contour agreement was performed using the simultaneous truth and performance level estimation (STAPLE) algorithm and kappa statistics. Results Ten radiation oncologists contoured both RPS cases, 1 contoured only RPS1, and 1 contoured only RPS2 such that each case was contoured by 11 radiation oncologists. The first case (RPS 1) was a patient with a de-differentiated (DD) liposarcoma (LPS) with a predominant well-differentiated (WD) component, and the second case (RPS 2) was a patient with DD LPS made up almost entirely of a DD component. Contouring agreement for GTV and CTV contours was high. However, the agreement for HR CTVs was only moderate. For OARs, agreement for stomach, bowel bag, small bowel, and colon was high, but agreement for duodenum (distorted by tumor in one of these cases) was fair to moderate. Conclusions For preoperative treatment of RPS, sarcoma radiation oncologists contoured GTV, CTV, and most OARs with a high level of agreement. HR CTV contours were more variable. Further clarification of this volume with the help of sarcoma surgical oncologists is necessary to reach consensus. More attention to delineation of the duodenum is also needed. PMID:26194680
Kumar, Addepalli U.; Jonnadula, Ganesh B.; Garudadri, Chandrasekhar; Rao, Harsha L.; Senthil, Sirisha; Papas, Eric B.; Sankaridurg, Padmaja; Khanna, Rohit C.
2013-01-01
Purpose To compare the diagnostic performance of glaucoma specialists and experienced optometrists in gonioscopy and optic disc assessment. Methods This study was done to validate the diagnostic performance of two experienced optometrists for using their skills of detecting glaucoma using gonioscopy and optic disc assessment in a major epidemiological study, the L V Prasad Eye Institute Glaucoma Epidemiology and Molecular Genetics Study (LVPEI-GLEAMS). Gonioscopic findings for 150 eyes were categorized as 0, 1 and 2 for open angle, primary angle closure suspect (PACS) and primary angle closure (PAC) respectively. Optic disc findings for 200 eyes were categorized as 0, 1 and 2 for normal, suspects and glaucomatous respectively. Weighted kappa (κ) and diagnostic accuracy parameters were calculated. Two optometrists (#1 and #2) participated in the study. Results Agreement between glaucoma specialists and optometrist for interpretation of gonioscopy to discriminate PACS and PAC from open angles and for interpretation of optic disc to discriminate glaucomatous and suspicious discs from normal, the kappa (κ) was 0.92 and 0.84 and 0.90 and 0.89 for optometrists #1 and #2 respectively. Sensitivities and specificities were above 90% for gonioscopy. Optic disc evaluation had specificities greater than 95% to discriminate normal from glaucomatous discs while the sensitivities were 83% and 93% for optometrists #1 and #2 respectively. Conclusion Agreement between optometrists and glaucoma specialists, in diagnostic performance of gonioscopy and optic assessment was excellent with high sensitivity and specificity. Hence, we conclude that the experienced optometrists can detect glaucoma accurately in the LVPEI-GLEAMS.
External validation of the Society of Thoracic Surgeons General Thoracic Surgery Database.
Magee, Mitchell J; Wright, Cameron D; McDonald, Donna; Fernandez, Felix G; Kozower, Benjamin D
2013-11-01
The Society of Thoracic Surgeons (STS) General Thoracic Surgery Database (GTSD) reports outstanding results for lung and esophageal cancer resection. However, a major weakness of the GTSD has been the lack of validation of this voluntary registry. The purpose of this study was to perform an external, independent audit to assess the accuracy of the data collection process and the quality of the database. An independent firm was contracted to audit 5% of sites randomly selected from the GTDB in 2011. Audits were performed remotely to maximize the number of audits performed and reduce cost. Auditors compared lobectomy cases submitted to the GTSD with the hospital operative logs to evaluate completeness of the data. In addition, 20 lobectomy records from each site were audited in detail. Agreement rates were calculated for 32 individual data elements, 7 data categories pertaining to patient status or care delivery, and an overall agreement rate for each site. Six process variables were also evaluated to assess best practice for data collection and submission. Ten sites were audited from the 222 participants. Comparison of the 559 submitted lobectomy cases with operative logs from each site identified 28 omissions, a 94.6% agreement rate (discrepancies/site range, 2 to 27). Importantly, cases not submitted had no mortality or major morbidity, indicating a lack of purposeful omission. The aggregate agreement rates for all categories were greater than 90%. The overall data accuracy was 94.9%. External audits of the GTSD validate the accuracy and completeness of the data. Careful examination of unreported cases demonstrated no purposeful omission or gaming. Although these preliminary results are quite good, it is imperative that the audit process is refined and continues to expand along with the GTSD to insure reliability of the database. The audit results are currently being incorporated into educational and quality improvement processes to add further value. Copyright © 2013 The Society of Thoracic Surgeons. Published by Elsevier Inc. All rights reserved.
Reliability of plain radiographic parameters for developmental dysplasia of the hip in children.
Upasani, Vidyadhar V; Bomar, James D; Parikh, Gaurav; Hosalkar, Harish
2012-07-01
Few studies have evaluated the reliability and reproducibility of the femoral neck-shaft angle (NSA), center-edge angle (CEA), and acetabular index (AI) in young children with developmental dysplasia of the hip (DDH). We wanted to determine whether these parameters could be used reliably by practitioners. Fifty radiographs from 21 children with DDH were reviewed. Analysis was performed by three observers, at two time periods. The intra- and inter-observer reliability for each measure was assessed. At time period one, we noted a "high" level of agreement between observers when measuring the NSA, a "low" level when measuring the CEA, and a "moderate" level when measuring the AI. At time period two, we noted a "very high" level of agreement between observers when measuring the NSA and a "high" level when measuring the CEA and AI. When comparing the measurements of observer 1 at the two different time periods, we noted nearly "very high" agreement when measuring the NSA, a "moderate" agreement when measuring the CEA, and a "high" agreement for the AI. In comparing the measurements of observer 2, we noted "very high" agreement for the NSA and "high" agreement for the CEA and AI. In comparing the measurements for observer 3, we noted nearly "very high" agreement for the NSA, nearly "high" agreement for the CEA, and "high" agreement for the AI. It is difficult to reliably measure three-dimensional pelvic morphology on a frontal plane radiograph, especially when important pelvic landmarks have yet to ossify.
Kosasih, Agus Susanto; Sugiarto, Christine; Hayuanta, Hubertus Hosti; Juhaendi, Runingsih; Setiawan, Lyana
2017-08-08
Measurement of viral load in human immunodeficiency virus type 1 (HIV-1) infected patients is essential for the establishment of a therapeutic strategy. Several assays based on qPCR are available for the measurement of viral load; they differ in sample volume, technology applied, target gene, sensitivity and dynamic range. The Bioneer AccuPower® HIV-1 Quantitative RT-PCR is a novel commercial kit that has not been evaluated for its performance. This study aimed to evaluate the performance of the Bioneer AccuPower® HIV-1 Quantitative RT-PCR kit. In total, 288 EDTA plasma samples from the Dharmais Cancer Hospital were analyzed with the Bioneer AccuPower® HIV-1 Quantitative RT-PCR kit and the Roche COBAS? AmpliPrep/COBAS® TaqMan® HIV-1 version 2.0 (CAP/CTM v2.0). The performance of the Bioneer assay was then evaluated against the Roche CAP/CTM v2.0. Overall, there was good agreement between the two assays. The Bioneer assay showed significant linear correlation with CAP/CTM v2.0 (R2=0.963, p<0.001) for all samples (N=118) which were quantified by both assays, with high agreement (94.9%, 112/118) according to the Bland-Altman model. The mean difference between the quantitative values measured by Bioneer assay and CAP/CTM v2.0 was 0.11 Log10 IU/mL (SD=0.26). Based on these results, the Bioneer assay can be used to quantify HIV-1 RNA in clinical laboratories.
Schwab, Fabienne; Redling, Katharina; Siebert, Matthias; Schötzau, Andy; Schoenenberger, Cora-Ann; Zanetti-Dällenbach, Rosanna
2016-11-01
Our aim was to prospectively evaluate inter- and intra-observer agreement between Breast Imaging Reporting and Data System (BI-RADS) classifications and Tsukuba elasticity scores (TSs) of breast lesions. The study included 164 breast lesions (63 malignant, 101 benign). The BI-RADS classification and TS of each breast lesion was assessed by the examiner and twice by three reviewers at an interval of 2 months. Weighted κ values for inter-observer agreement ranged from moderate to substantial for BI-RADS classification (κ = 0.585-0.738) and was substantial for TS (κ = 0.608-0.779). Intra-observer agreement was almost perfect for ultrasound (US) BI-RADS (κ = 0.847-0.872) and TS (κ = 0.879-0.914). Overall, individual reviewers are highly self-consistent (almost perfect intra-observer agreement) with respect to BI-RADS classification and TS, whereas inter-observer agreement was moderate to substantial. Comprehensive training is essential for achieving high agreement and minimizing the impact of subjectivity. Our results indicate that breast US and real-time elastography can achieve high diagnostic performance. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
Field evaluation of a prototype paper-based point-of-care fingerstick transaminase test.
Pollock, Nira R; McGray, Sarah; Colby, Donn J; Noubary, Farzad; Nguyen, Huyen; Nguyen, The Anh; Khormaee, Sariah; Jain, Sidhartha; Hawkins, Kenneth; Kumar, Shailendra; Rolland, Jason P; Beattie, Patrick D; Chau, Nguyen V; Quang, Vo M; Barfield, Cori; Tietje, Kathy; Steele, Matt; Weigl, Bernhard H
2013-01-01
Monitoring for drug-induced liver injury (DILI) via serial transaminase measurements in patients on potentially hepatotoxic medications (e.g., for HIV and tuberculosis) is routine in resource-rich nations, but often unavailable in resource-limited settings. Towards enabling universal access to affordable point-of-care (POC) screening for DILI, we have performed the first field evaluation of a paper-based, microfluidic fingerstick test for rapid, semi-quantitative, visual measurement of blood alanine aminotransferase (ALT). Our objectives were to assess operational feasibility, inter-operator variability, lot variability, device failure rate, and accuracy, to inform device modification for further field testing. The paper-based ALT test was performed at POC on fingerstick samples from 600 outpatients receiving HIV treatment in Vietnam. Results, read independently by two clinic nurses, were compared with gold-standard automated (Roche Cobas) results from venipuncture samples obtained in parallel. Two device lots were used sequentially. We demonstrated high inter-operator agreement, with 96.3% (95% C.I., 94.3-97.7%) agreement in placing visual results into clinically-defined "bins" (<3x, 3-5x, and >5x upper limit of normal), >90% agreement in validity determination, and intraclass correlation coefficient of 0.89 (95% C.I., 0.87-0.91). Lot variability was observed in % invalids due to hemolysis (21.1% for Lot 1, 1.6% for Lot 2) and correlated with lots of incorporated plasma separation membranes. Invalid rates <1% were observed for all other device controls. Overall bin placement accuracy for the two readers was 84% (84.3%/83.6%). Our findings of extremely high inter-operator agreement for visual reading-obtained in a target clinical environment, as performed by local practitioners-indicate that the device operation and reading process is feasible and reproducible. Bin placement accuracy and lot-to-lot variability data identified specific targets for device optimization and material quality control. This is the first field study performed with a patterned paper-based microfluidic device and opens the door to development of similar assays for other important analytes.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Buttner, W.; Post, M.; Burgess, R.
This progress report is a programmatic summary of a formal MOA between NREL and the European Union Joint Research Center, Institute for Energy and Transport to be presented at the Steering Committee Meeting, December 3, 2012.
The Effects of Rater Training on Inter-Rater Agreement
ERIC Educational Resources Information Center
Pufpaff, Lisa A.; Clarke, Laura; Jones, Ruth E.
2015-01-01
This paper addresses the effects of rater training on the rubric-based scoring of three preservice teacher candidate performance assessments. This project sought to evaluate the consistency of ratings assigned to student learning outcome measures being used for program accreditation and to explore the need for rater training in order to increase…
George, David L; Johnson, Eric J; O'Neal, Katherine S; Smith, Michael J
2018-04-01
To report student perceived adequacy regarding didactic content and practical experiences of vaccination clinic business operations. Didactic content, a case study, and practical experiences regarding vaccination clinic business operations were implemented in related lectures of a Pharmacy Business and Entrepreneurship (PBE) elective and the college of pharmacy sponsored vaccination clinics. An online survey was used to evaluate student perceived adequacy of didactic content and practical experiences of vaccination clinic business operations. Mean scaled agreement was compared between students in the PBE elective versus those not in the elective. Student confidence in performing business operations was also assessed. Students in the PBE had higher mean confidence than non-elective students regarding staff management (3.23 vs. 2.73, p = 0.04). Success of the interventions may be attributed to students in the PBE elective that reported a higher mean perceived adequacy of content and practical experiences and confidence in performing nearly all business operations. Still, further evaluation of interventions is being considered to assess effectiveness of learning. Published by Elsevier Inc.
Diffuse intrinsic pontine glioma: is MRI surveillance improved by region of interest volumetry?
Riley, Garan T; Armitage, Paul A; Batty, Ruth; Griffiths, Paul D; Lee, Vicki; McMullan, John; Connolly, Daniel J A
2015-02-01
Paediatric diffuse intrinsic pontine glioma (DIPG) is noteworthy for its fibrillary infiltration through neuroparenchyma and its resultant irregular shape. Conventional volumetry methods aim to approximate such irregular tumours to a regular ellipse, which could be less accurate when assessing treatment response on surveillance MRI. Region-of-interest (ROI) volumetry methods, using manually traced tumour profiles on contiguous imaging slices and subsequent computer-aided calculations, may prove more reliable. To evaluate whether the reliability of MRI surveillance of DIPGs can be improved by the use of ROI-based volumetry. We investigated the use of ROI- and ellipsoid-based methods of volumetry for paediatric DIPGs in a retrospective review of 22 MRI examinations. We assessed the inter- and intraobserver variability of the two methods when performed by four observers. ROI- and ellipsoid-based methods strongly correlated for all four observers. The ROI-based volumes showed slightly better agreement both between and within observers than the ellipsoid-based volumes (inter-[intra-]observer agreement 89.8% [92.3%] and 83.1% [88.2%], respectively). Bland-Altman plots show tighter limits of agreement for the ROI-based method. Both methods are reproducible and transferrable among observers. ROI-based volumetry appears to perform better with greater intra- and interobserver agreement for complex-shaped DIPG.
ERIC Educational Resources Information Center
Smith, Carla M.; Smith, Richard G.; Dracobly, Joseph D.; Pace, Amy Peterson
2012-01-01
We evaluated interrater agreement across multiple respondents on anecdotal assessments and compared cases in which agreement was obtained with outcomes of functional analyses. Experiment 1 evaluated agreement among multiple respondents on the function of problem behavior for 27 individuals across 42 target behaviors using the Motivation Assessment…
Raoult, Hélène; Eugène, François; Le Bras, Anthony; Mineur, Géraldine; Carsin-Nicol, Béatrice; Ferré, Jean-Christophe; Gauvrit, Jean-Yves
2018-03-07
The WEB is an innovative flow disruption device for cerebral aneurysm embolization with rapidly expanding indications. Our purpose was to evaluate the diagnostic performance of computed tomography angiography (CTA) at 1-year follow-up of aneurysms treated with the WEB. Between April 2014 and May 2016, the study prospectively included patients treated with the WEB at our institution, and followed up within 24hours by CTA and at 1year by CTA, time-of-flight magnetic resonance angiography (TOF MRA) and digital subtraction angiography (DSA). The diagnostic quality of imaging data was assessed based on the confidence index, artifacts, and WEB shape depiction. The imaging diagnostic performance was assessed using 3 criteria at 1year: aneurysm occlusion status and worsening, and WEB shape compression. Interobserver and intermodality agreement was determined by calculating κ values. The study ultimately included 16 patients (9 women, mean age 53±7.6years). CTA quality confidence was scored as 2/2, artifacts 0.4/2 and WEB shape depiction 1.9/2, superior to TOF MRA for the latter two criteria. Aneurysm occlusion was adequate in 93.7% of patients, with CTA showing excellent interobserver reproducibility and agreement with DSA on a 4-grade scale (κ=1.00), while TOF MRA yielded good reproducibility (κ=0.76) and agreement with DSA (κ=0.69). CTA also identified aneurysm occlusion worsening (43.7%) and WEB compression (81.2%) in excellent agreement with DSA (κ=0.85 and 1.00). CTA is a reproducible and reliable technique for the follow-up of aneurysms treated with the WEB device. Copyright © 2018 Elsevier Masson SAS. All rights reserved.
QNOTE: an instrument for measuring the quality of EHR clinical notes.
Burke, Harry B; Hoang, Albert; Becher, Dorothy; Fontelo, Paul; Liu, Fang; Stephens, Mark; Pangaro, Louis N; Sessums, Laura L; O'Malley, Patrick; Baxi, Nancy S; Bunt, Christopher W; Capaldi, Vincent F; Chen, Julie M; Cooper, Barbara A; Djuric, David A; Hodge, Joshua A; Kane, Shawn; Magee, Charles; Makary, Zizette R; Mallory, Renee M; Miller, Thomas; Saperstein, Adam; Servey, Jessica; Gimbel, Ronald W
2014-01-01
The outpatient clinical note documents the clinician's information collection, problem assessment, and patient management, yet there is currently no validated instrument to measure the quality of the electronic clinical note. This study evaluated the validity of the QNOTE instrument, which assesses 12 elements in the clinical note, for measuring the quality of clinical notes. It also compared its performance with a global instrument that assesses the clinical note as a whole. Retrospective multicenter blinded study of the clinical notes of 100 outpatients with type 2 diabetes mellitus who had been seen in clinic on at least three occasions. The 300 notes were rated by eight general internal medicine and eight family medicine practicing physicians. The QNOTE instrument scored the quality of the note as the sum of a set of 12 note element scores, and its inter-rater agreement was measured by the intraclass correlation coefficient. The Global instrument scored the note in its entirety, and its inter-rater agreement was measured by the Fleiss κ. The overall QNOTE inter-rater agreement was 0.82 (CI 0.80 to 0.84), and its note quality score was 65 (CI 64 to 66). The Global inter-rater agreement was 0.24 (CI 0.19 to 0.29), and its note quality score was 52 (CI 49 to 55). The QNOTE quality scores were consistent, and the overall QNOTE score was significantly higher than the overall Global score (p=0.04). We found the QNOTE to be a valid instrument for evaluating the quality of electronic clinical notes, and its performance was superior to that of the Global instrument. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Statistical Validation for Clinical Measures: Repeatability and Agreement of Kinect™-Based Software.
Lopez, Natalia; Perez, Elisa; Tello, Emanuel; Rodrigo, Alejandro; Valentinuzzi, Max E
2018-01-01
The rehabilitation process is a fundamental stage for recovery of people's capabilities. However, the evaluation of the process is performed by physiatrists and medical doctors, mostly based on their observations, that is, a subjective appreciation of the patient's evolution. This paper proposes a tracking platform of the movement made by an individual's upper limb using Kinect sensor(s) to be applied for the patient during the rehabilitation process. The main contribution is the development of quantifying software and the statistical validation of its performance, repeatability, and clinical use in the rehabilitation process. The software determines joint angles and upper limb trajectories for the construction of a specific rehabilitation protocol and quantifies the treatment evolution. In turn, the information is presented via a graphical interface that allows the recording, storage, and report of the patient's data. For clinical purposes, the software information is statistically validated with three different methodologies, comparing the measures with a goniometer in terms of agreement and repeatability. The agreement of joint angles measured with the proposed software and goniometer is evaluated with Bland-Altman plots; all measurements fell well within the limits of agreement, meaning interchangeability of both techniques. Additionally, the results of Bland-Altman analysis of repeatability show 95% confidence. Finally, the physiotherapists' qualitative assessment shows encouraging results for the clinical use. The main conclusion is that the software is capable of offering a clinical history of the patient and is useful for quantification of the rehabilitation success. The simplicity, low cost, and visualization possibilities enhance the use of the software Kinect for rehabilitation and other applications, and the expert's opinion endorses the choice of our approach for clinical practice. Comparison of the new measurement technique with established goniometric methods determines that the proposed software agrees sufficiently to be used interchangeably.
Validation of tungsten cross sections in the neutron energy region up to 100 keV
NASA Astrophysics Data System (ADS)
Pigni, Marco T.; Žerovnik, Gašper; Leal, Luiz. C.; Trkov, Andrej
2017-09-01
Following a series of recent cross section evaluations on tungsten isotopes performed at Oak Ridge National Laboratory (ORNL), this paper presents the validation work carried out to test the performance of the evaluated cross sections based on lead-slowing-down (LSD) benchmarks conducted in Grenoble. ORNL completed the resonance parameter evaluation of four tungsten isotopes - 182,183,184,186W - in August 2014 and submitted it as an ENDF-compatible file to be part of the next release of the ENDF/B-VIII.0 nuclear data library. The evaluations were performed with support from the US Nuclear Criticality Safety Program in an effort to provide improved tungsten cross section and covariance data for criticality safety sensitivity analyses. The validation analysis based on the LSD benchmarks showed an improved agreement with the experimental response when the ORNL tungsten evaluations were included in the ENDF/B-VII.1 library. Comparison with the results obtained with the JEFF-3.2 nuclear data library are also discussed.
Ciepiela, Olga; Kotuła, Iwona; Kierat, Szymon; Sieczkowska, Sandra; Podsiadłowska, Anna; Jenczelewska, Anna; Księżarczyk, Karolina; Demkow, Urszula
2016-11-01
Modern automated laboratory hematology analyzers allow the measurement of over 30 different hematological parameters useful in the diagnostic and clinical interpretation of patient symptoms. They use different methods to measure the same parameters. Thus, a comparison of complete blood count made by Mindray BC-6800, Sysmex XN-2000 and Beckman Coulter LH750 was performed. A comparison of results obtained by automated analysis of 807 anticoagulated blood samples from children and 125 manual microscopic differentiations were performed. This comparative study included white blood cell count, red blood cell count, and erythrocyte indices, as well as platelet count. The present study showed a poor level of agreement between white blood cell enumeration and differentiation of the three automated hematology analyzers under comparison. A very good agreement was found when comparing manual blood smear and automated granulocytes, monocytes, and lymphocytes differentiation. Red blood cell evaluation showed better agreement than white blood cells between the studied analyzers. To conclude, studied instruments did not ensure satisfactory interchangeability and did not facilitate a substitution of one analyzer by another. © 2016 Wiley Periodicals, Inc.
Fate of Volatile Organic Compounds in Constructed Wastewater Treatment Wetlands
Keefe, S.H.; Barber, L.B.; Runkel, R.L.; Ryan, J.N.
2004-01-01
The fate of volatile organic compounds was evaluated in a wastewater-dependent constructed wetland near Phoenix, AZ, using field measurements and solute transport modeling. Numerically based volatilization rates were determined using inverse modeling techniques and hydraulic parameters established by sodium bromide tracer experiments. Theoretical volatilization rates were calculated from the two-film method incorporating physicochemical properties and environmental conditions. Additional analyses were conducted using graphically determined volatilization rates based on field measurements. Transport (with first-order removal) simulations were performed using a range of volatilization rates and were evaluated with respect to field concentrations. The inverse and two-film reactive transport simulations demonstrated excellent agreement with measured concentrations for 1,4-dichlorobenzene, tetrachloroethene, dichloromethane, and trichloromethane and fair agreement for dibromochloromethane, bromo-dichloromethane, and toluene. Wetland removal efficiencies from inlet to outlet ranged from 63% to 87% for target compounds.
Clauser, Paola; Marcon, Magda; Maieron, Marta; Zuiani, Chiara; Bazzocchi, Massimo; Baltzer, Pascal A T
2016-07-01
To evaluate the influence of post-processing systems, intra- and inter-reader agreement on the variability of apparent diffusion coefficient (ADC) measurements in breast lesions. Forty-one patients with 41 biopsy-proven breast lesions gave their informed consent and were included in this prospective IRB-approved study. Magnetic resonance imaging (MRI) examinations were performed at 1.5 T using an EPI-DWI sequence, with b-values of 0 and 1000 s/mm(2). Two radiologists (R1, R2) reviewed the images in separate sessions and measured the ADC for lesion, using MRI-workstation (S-WS), PACS-workstation (P-WS) and a commercial DICOM viewer (O-SW). Agreement was evaluated using the intraclass correlation coefficient (ICC), Bland-Altman plots and coefficient of variation (CV). Thirty-one malignant, two high-risk and eight benign mass-like lesions were analysed. Intra-reader agreement was almost perfect (ICC-R1 = 0.974; ICC-R2 = 0.990) while inter-reader agreement was substantial (ICC from 0.615 to 0.682). Bland-Altman plots revealed a significant bias in ADC values measured between O-SW and S-WS (P = 0.025), no further systematic differences were identified. CV varied from 6.8 % to 7.9 %. Post-processing systems may have a significant, although minor, impact on ADC measurements in breast lesions. While intra-reader agreement is high, the main source of ADC variability seems to be caused by inter-reader variation. • ADC provides quantitative information on breast lesions independent from the system used. • ADC measurement using different workstations and software systems is generally reliable. • Systematic, but minor, differences may occur between different post-processing systems. • Inter-reader agreement of ADC measurements exceeded intra-reader agreement.
Identifying balance impairments in people with Parkinson's disease using video and wearable sensors.
Stack, Emma; Agarwal, Veena; King, Rachel; Burnett, Malcolm; Tahavori, Fatemeh; Janko, Balazs; Harwin, William; Ashburn, Ann; Kunkel, Dorit
2018-05-01
Falls and near falls are common among people with Parkinson's (PwP). To date, most wearable sensor research focussed on fall detection, few studies explored if wearable sensors can detect instability. Can instability (caution or near-falls) be detected using wearable sensors in comparison to video analysis? Twenty-four people (aged 60-86) with and without Parkinson's were recruited from community groups. Movements (e.g. walking, turning, transfers and reaching) were observed in the gait laboratory and/or at home; recorded using clinical measures, video and five wearable sensors (attached on the waist, ankles and wrists). After defining 'caution' and 'instability', two researchers evaluated video data and a third the raw wearable sensor data; blinded to each other's evaluations. Agreement between video and sensor data was calculated on stability, timing, step count and strategy. Data was available for 117 performances: 82 (70%) appeared stable on video. Ratings agreed in 86/117 cases (74%). Highest agreement was noted for chair transfer, timed up and go test and 3 m walks. Video analysts noted caution (slow, contained movements, safety-enhancing postures and concentration) and/or instability (saving reactions, stopping after stumbling or veering) in 40/134 performances (30%): raw wearable sensor data identified 16/35 performances rated cautious or unstable (sensitivity 46%) and 70/82 rated stable (specificity 85%). There was a 54% chance that a performance identified from wearable sensors as cautious/unstable was so; rising to 80% for stable movements. Agreement between wearable sensor and video data suggested that wearable sensors can detect subtle instability and near-falls. Caution and instability were observed in nearly a third of performances, suggesting that simple, mildly challenging actions, with clearly defined start- and end-points, may be most amenable to monitoring during free-living at home. Using the genuine near-falls recorded, work continues to automatically detect subtle instability using algorithms. Crown Copyright © 2018. Published by Elsevier B.V. All rights reserved.
Lauzier, Sophie; Campbell, H Sharon; Livingston, Patricia M; Maunsell, Elizabeth
2014-10-15
Community-based cancer organizations provide services to support patients. An anticipated benefit of these services is patient empowerment. However, this outcome has not been evaluated because of the lack of validated health-related empowerment questionnaires in the cancer context. In this validation study, the authors assessed the extent to which 16 indicators used by the Canadian Cancer Society (CCS) and the Cancer Council Victoria, Australia (CCV) to evaluate their services were associated with health-related empowerment. Cancer patients/survivors who were diagnosed < 3 years earlier and who used CCS programs completed a questionnaire that included the 16 CCS-CCV indicators and 5 scales from the Health Education Impact Questionnaire (heiQ) measuring key dimensions of empowerment. To determine whether the CCS-CCV indicators captured empowerment, differences in heiQ scores were compared between 2 groups: those with higher levels of agreement (agreeing or agreeing strongly) with an indicator and those with lower levels of agreement (agreeing slightly or disagreeing to any degree). Participation was 72% (207 of 289 eligible CCS users). Compared with participants who had lower levels of agreement on CCS-CCV indicators, those who had higher levels of agreement were more likely to report higher levels of empowerment on the different heiQ scales. For 15 of 16 indicators, these differences were significant (Wilcoxon rank-sum test; P < .10) on ≥ 1 of 5 heiQ scales and for 10 of 16 indicators on ≥ 3 of 5 heiQ scales. Two indicators were associated significantly with all 5 heiQ scales (cope better and feel more in control). Using CCS-CCV indicators to evaluate community-based cancer organizations' services will help determine whether these services are reaching one of their important goals: namely empowering patients. © 2014 American Cancer Society.
Selleri, Paolo; Di Girolamo, Nicola
2014-01-01
Point-of-care testing is an attractive option in rabbit medicine, because it permits rapid analysis of a panel of electrolytes, chemistries, blood gases, hemoglobin, and hematocrit, requiring only 65 μL of blood. The purpose of this study was to evaluate the performance of a portable clinical analyzer for measurement of pH, partial pressure of CO2, Na, chloride, potassium, blood urea nitrogen, glucose, hematocrit, and hemoglobin in healthy and diseased rabbits. Blood samples obtained from 30 pet rabbits were analyzed immediately after collection by the portable clinical analyzer (PCA) and immediately thereafter (time <20 sec) by a reference analyzer. Bland-Altman plots and Passing-Bablok regression analysis were used to compare the results. Limits of agreement were wide for all the variables studied, with the exception of pH. Most variables presented significant proportional and/or constant bias. The current study provides sufficient evidence that the PCA presents reliability for pH, although its low agreement with a reference analyzer for the other variables does not support their interchangeability. Limits of agreement provided for each variable allow researchers to evaluate if the PCA is reliable enough for their scope. To the authors' knowledge, the present is the first report evaluating a PCA in the rabbit.
Tsai, Richard; Raptis, Constantine; Fowler, Kathryn J; Owen, Joseph W; Mellnick, Vincent M
2017-11-01
To determine the degree of interradiologist agreement between the MRI features of appendicitis during pregnancy, the outcomes associated with an indeterminate interpretation and the negative predictive value of non-visualization of the appendix. Our study was approved by the institutional review board at the Washington University in St. Louis, Missouri (WUStL) and was HIPAA (Health Insurance Portability and Accountability Act of 1996)-compliant. The informed consent requirement was waived. Cases of suspected appendicitis during pregnancy evaluated using MRI were retrospectively identified using search queries. Scans were re-reviewed by two radiologists (7 and 9 years experience, respectively) to evaluate the interradiologist agreement of different MRI features of appendicitis during pregnancy (visualization of the appendix, appendiceal diameter, appendiceal wall thickening, periappendiceal fat stranding, fluid-filled appendix and periappendiceal fluid). The radiologists were blinded to patient outcome, patient intervention, laboratory data, demographic data and the original MRI reports. Clinical outcomes were documented by surgical pathology or clinical observation. Interradiologist agreement was analysed using Cohen's κ, while patient demographic and clinical data was analysed using Student's t-testing. 233 females with suspected appendicitis during pregnancy were evaluated using MRI over a 13-year period (mean age, 28.4 years; range, 17-38 years). There were 14 (6%) positive examinations for appendicitis during pregnancy, including 1 patient whose MRI was interpreted as negative, proven by surgical pathology. The presence of periappendiceal soft-tissue stranding and the final overall impression had the most interradiologist agreement (к = 0.81-1). There were no pregnant patients found to have acute appendicitis who had an indeterminate MR interpretation or when the appendix could not be visualized. The final impression by the two retrospectively reviewing radiologists of MR examinations performed for suspected appendicitis during pregnancy had near-perfect agreement. In patients where the appendix could not be visualized or in patients that were interpreted as indeterminate, no patients had acute appendicitis. Advances in knowledge: MR impression for suspected appendicitis in the pregnant patient has high interradiologist agreement, and a non-visualized appendix or lack of inflammatory findings at the time of MR, reliably excludes surgical appendicitis.
Pulmonary tumor measurements from x-ray computed tomography in one, two, and three dimensions.
Villemaire, Lauren; Owrangi, Amir M; Etemad-Rezai, Roya; Wilson, Laura; O'Riordan, Elaine; Keller, Harry; Driscoll, Brandon; Bauman, Glenn; Fenster, Aaron; Parraga, Grace
2011-11-01
We evaluated the accuracy and reproducibility of three-dimensional (3D) measurements of lung phantoms and patient tumors from x-ray computed tomography (CT) and compared these to one-dimensional (1D) and two-dimensional (2D) measurements. CT images of three spherical and three irregularly shaped tumor phantoms were evaluated by three observers who performed five repeated measurements. Additionally, three observers manually segmented 29 patient lung tumors five times each. Follow-up imaging was performed for 23 tumors and response criteria were compared. For a single subject, imaging was performed on nine occasions over 2 years to evaluate multidimensional tumor response. To evaluate measurement accuracy, we compared imaging measurements to ground truth using analysis of variance. For estimates of precision, intraobserver and interobserver coefficients of variation and intraclass correlations (ICC) were used. Linear regression and Pearson correlations were used to evaluate agreement and tumor response was descriptively compared. For spherical shaped phantoms, all measurements were highly accurate, but for irregularly shaped phantoms, only 3D measurements were in high agreement with ground truth measurements. All phantom and patient measurements showed high intra- and interobserver reproducibility (ICC >0.900). Over a 2-year period for a single patient, there was disagreement between tumor response classifications based on 3D measurements and those generated using 1D and 2D measurements. Tumor volume measurements were highly reproducible and accurate for irregular, spherical phantoms and patient tumors with nonuniform dimensions. Response classifications obtained from multidimensional measurements suggest that 3D measurements provide higher sensitivity to tumor response. Copyright © 2011 AUR. Published by Elsevier Inc. All rights reserved.
Arcangeletti, M. C.; Dussaix, E.; Ferraglia, F.; Roque-Afonso, A. M.; Graube, A.; Chezzi, C.
2011-01-01
A multicentric clinical study was conducted on representative sera from 1,738 European and U.S. subjects for the evaluation of new anti-hepatitis A virus enzyme immunoassays from Bio-Rad Laboratories. Comparison with reference DiaSorin S.p.A. tests confirmed the good performance of Bio-Rad assays (99.85% and 99.47% overall agreement in detecting total antibodies and IgM, respectively). PMID:21653739
Royse, C F; Haji, D L; Faris, J G; Veltman, M G; Kumar, A; Royse, A G
2012-05-01
Limited transthoracic echocardiography performed by treating physicians may facilitate assessment of haemodynamic abnormalities in perioperative and critical care patients. The interpretative skills of one hundred participants who completed an education program in limited transthoracic echocardiography were assessed by reporting five pre-recorded case studies. A high level of agreement was observed in ventricular volume assessment (left 95%, right 96%), systolic function (left 99%, right 96%), left atrial pressure (96%) and haemodynamic state (97%). The highest failure to report answers (that is, no answer given) was for right ventricular volume and function. For moderate or severe valve lesions, agreement ranged from 90 to 98%, with failure to report <5% in all cases except for mitral stenosis (18%). For mild valve lesions, the range of agreement was lower (53 to 100%) due to overestimation of severity. Medical practitioners who completed the structured educational program showed good agreement with experts in interpretation of valve and ventricular function.
Horvath, Bela; Zhu, Lei; Allende, Daniela; Xie, Hao; Guirguis, John; Cruise, Michael; Patil, Deepa T.; O’Shea, Robert; Rivas, John; Yordanka, Reyna; Lan, Nan; Liu, Xiuli
2017-01-01
Background Long-standing congestive heart failure can induce a constellation of histopathology changes in the liver that can range from mild sinusoidal dilation to advanced fibrosis and loss of normal perivenular expression of glutamine synthetase (GS). Liver biopsies might be performed to assess the perioperative risk of these patients or to determine the need of synchronous liver transplant. We aimed to assess interobserver agreement in recognizing these liver histologic features in patients undergoing evaluation for heart transplantation and to examine whether immunohistochemistry of GS will aid the diagnosis of cardiac hepatopathy (CH). Methods Hematoxylin-eosin and trichrome-stained slides from 36 liver biopsies from patients undergoing evaluation for heart transplantation were reviewed by four liver pathologists. Histologic features of CH were reviewed and an overall fibrosis (stage) was assessed according to a recently proposed congestive hepatic fibrosis score (CHFS). In addition, 24 liver biopsies with a consensus diagnosis of CH and eight liver biopsies with no significant pathological changes were subjected to immunohistochemistry for GS. The Fleiss’ kappa coefficient (K) analysis was performed to determine the interobserver agreement. Further, histologic features of CH were correlated with the staining pattern of GS. Results Sinusoidal dilation, centrilobular hepatocyte atrophy, centrilobular fibrosis and hemorrhage were the most common findings in this cohort with a substantial-to-fair level of interobserver agreement among four reviewers. The overall agreement on the diagnosis of CH and CHFS was moderate (K = 0.55, 95% confidence interval (CI): 0.32 - 0.73) and fair (K = 0.35, 95% CI: 0.24 - 0.49), respectively. Twelve (of 24, 50%) cases of CH showed loss of the normal perivenular GS staining, while the remaining 12 cases of CH and all eight controls showed retained GS expression. Histologic features of CH (presence of sinusoidal dilation, centrilobular hepatocyte atrophy, hemorrhage, and centrilobular fibrosis) and the stage of fibrosis (CHFS) were not correlated with the loss of GS staining. Conclusion Most common features of CH can be interpreted with fair-to-substantial level of agreement by liver pathologists, with an overall moderate level agreement for the diagnosis and fair agreement for CHFS. Loss of normal perivenular expression of GS only occurs in 50% CH and thus is not a sensitive marker for CH. PMID:28725306
Observers' Agreement on Measurements in Fiberoptic Endoscopic Evaluation of Swallowing.
Pilz, Walmari; Vanbelle, Sophie; Kremer, Bernd; van Hooren, Michel R; van Becelaere, Tine; Roodenburg, Nel; Baijens, Laura W J
2016-04-01
This study analyzed the effect that dysphagia etiology, different observers, and bolus consistency might have on the level of agreement for measurements in FEES images reached by independent versus consensus panel rating. Sixty patients were included and divided into two groups according to dysphagia etiology: neurological or head and neck oncological. All patients underwent standardized FEES examination using thin and thick liquid consistencies. Two observers scored the same exams, first independently and then in a consensus panel. Four ordinal FEES variables were analyzed. Statistical analysis was performed using a linear weighted kappa coefficient and Bayesian multilevel model. Intra- and interobserver agreement on FEES measurements ranged from 0.76 to 0.93 and from 0.61 to 0.88, respectively. Dysphagia etiology did not influence observers' agreement level. However, bolus consistency resulted in decreased interobserver agreement for all measured FEES variables during thin liquid swallows. When rating on the consensus panel, the observers deviated considerably from the scores they had previously given on the independent rating task. Observer agreement on measurements in FEES exams was influenced by bolus consistency, not by dysphagia etiology. Therefore, observer agreement on FEES measurements should be analyzed by taking bolus consistency into account, as it might affect the interpretation of the outcome. Identifying factors that might influence agreement levels could lead to better understanding of the rating process and assist in developing a more precise measurement scale that would ensure higher levels of observer agreement for measurements in FEES exams.
Huber, H; Brambrink, M; Funk, R; Rieger, M
2012-10-01
The purpose of this study was to evaluate differences in the D-DRG results of a hospital case by 2 independently coding MKD raters. Calculation of the 2-inter-rater reliability was performed by examination of the coding of individual hospital cases. The reasons for the non-agreement of the expert evaluations and suggestions to improve the process are discussed. From the expert evaluation pool of the MDK-WL a random sample of 0.7% of the 57,375 expertises was taken. Distribution equality with the basic total was tested by the χ² test or, respectively, Fisher's exact test. For the total of 402 individual hospital cases, the G-DRG case sums of 2 experts of the MDK were determined independently and the results checked for each individual case for agreement or non-agreement. The corresponding confidence intervals with standard errors were analysed to test if certain major diagnosis categories (MDC) were statistically significantly more affected by differing expertise results than others. In 280 of the total 402 tested hospital cases, the 2 MDK raters independently reached the same G-DRG results; in 122 cases the G-DRG case sums determined by the 2 raters differed (agreement 70%; CI 65.2-74.1). Different DRG results between the 2 experts occurred regularly in the entire MDC spectrum. No MDC chapter in which significant differences between the 2 raters arose could be identified. The results of our study demonstrate an almost 70% agreement in the evaluation of hospital cost accounts by 2 independently operating MDK. This result leaves room for improvement. Optimisation potentials can be recognised on the basis of the results. Potential for improvement was established in combination with regular further training and the expansion of binding internal code recommendations as well as exchange of code-relevant information among experts in internal forums. The presented model is in principle suitable for cross-border examinations within the MDK system with the advantage that further trends could be uncovered by more variety and larger numbers of the randomly selected cases. © Georg Thieme Verlag KG Stuttgart · New York.
Nascimento, Alex Rodrigues do; Andrade, Fabíola Bof de; César, Cibele Comini
2016-11-03
This study sought to describe the agreement between self-perception and clinical evaluation of dental treatment needs in adults and analyze associated factors. The sample comprised adult individuals who took part in SBBrazil 2010 and SBMinas Gerais 2012. The study's outcome was agreement between self-perception and clinical evaluation of dental treatment needs. We used multiple Poisson regression in order to determine the factors associated with the outcome. Agreement between self-perception and clinical evaluation was 78.8% in Brazil and 73.8% in Minas Gerais. Clinical and self-reported oral health conditions that affect function and quality of life were associated with a higher agreement, while a recent visit to the dentist was associated with a lower agreement. Identifying associated factors may enable the development of questionnaires that favor correct self-perception regarding treatment needs.
Baldini, Elizabeth H; Bosch, Walter; Kane, John M; Abrams, Ross A; Salerno, Kilian E; Deville, Curtiland; Raut, Chandrajit P; Petersen, Ivy A; Chen, Yen-Lin; Mullen, John T; Millikan, Keith W; Karakousis, Giorgos; Kendrick, Michael L; DeLaney, Thomas F; Wang, Dian
2015-09-01
Curative intent management of retroperitoneal sarcoma (RPS) requires gross total resection. Preoperative radiotherapy (RT) often is used as an adjuvant to surgery, but recurrence rates remain high. To enhance RT efficacy with acceptable tolerance, there is interest in delivering "boost doses" of RT to high-risk areas of gross tumor volume (HR GTV) judged to be at risk for positive resection margins. We sought to evaluate variability in HR GTV boost target volume delineation among collaborating sarcoma radiation and surgical oncologist teams. Radiation planning CT scans for three cases of RPS were distributed to seven paired radiation and surgical oncologist teams at six institutions. Teams contoured HR GTV boost volumes for each case. Analysis of contour agreement was performed using the simultaneous truth and performance level estimation (STAPLE) algorithm and kappa statistics. HRGTV boost volume contour agreement between the seven teams was "substantial" or "moderate" for all cases. Agreement was best on the torso wall posteriorly (abutting posterior chest abdominal wall) and medially (abutting ipsilateral para-vertebral space and great vessels). Contours varied more significantly abutting visceral organs due to differing surgical opinions regarding planned partial organ resection. Agreement of RPS HRGTV boost volumes between sarcoma radiation and surgical oncologist teams was substantial to moderate. Differences were most striking in regions abutting visceral organs, highlighting the importance of collaboration between the radiation and surgical oncologist for "individualized" target delineation on the basis of areas deemed at risk and planned resection.
Constellation X-Ray Mission and Support
NASA Technical Reports Server (NTRS)
Tananbaum, H.; Grady, Jean (Technical Monitor)
2002-01-01
This report is a supplement to the Third Annual Report summarizing work performed by the Smithsonian Astrophysical Observatory (SAO) for NASA Goddard Space Flight Center (GSFC) under Cooperative Agreement NCC5-3681. The Agreement is entitled 'Constellation X-ray Mission Study and Support.' This supplementary report covers the period from October 1, 2001 through January 10, 2002. The report has been prepared and submitted to ensure that the Constellation-X Project Office at GSFC has current performance information needed to evaluate a proposed modified budget for FY02. That proposed budget is being submitted separately. SAO continues to perform work under the overall direction of Dr. Harvey Tananbaum, the SAO Principal Investigator for the program. Mr. Robert Rasche is the SAO Program Manager and is responsible for day-to-day program management at SAO and coordination with GSFC. The report summarizes the main areas of SAO activity. Most of the work has been done jointly with personnel from GSFC and Marshall Space Flight Center (MSFC). We describe SAO participation in these efforts. Under the Agreement, SAO performed work in seven major areas of activity. These areas related to: (1) Constellation X-ray Mission Facility Definition Team and Study Management; (2) Science Support; (3) Spectroscopy X-ray Telescope (SXT); (4) Systems Engineering; (5) Travel in Support of the Work Effort; and (6) In-house Management and Coordination.
Clarençon, Frédéric; Di Maria, Federico; Sourour, Nader-Antoine; Gabrieli, Joseph; Nouet, Aurélien; Shotar, Eimad; Cormier, Evelyne; Fahed, Robert; Cornu, Philippe; Chiras, Jacques
2016-10-01
To evaluate the performances of the CT-angiography by direct intra-aortic contrast media injection (IA-CTA) for spinal vascular malformations (SVMs)' imaging. Thirteen patients (8 males, 5 females, mean age: 56 y) with suspected SVM underwent IA-CTAs by direct intra-aortic iodinated contrast media injection (5 cc/s; 100 cc) via an arterial femoral or humeral access. Two independent observers evaluated the angioarchitecture of the SVMs and the visualisation of both the Adamkiewicz artery and the anterior spinal artery. Then a consensus was obtained between the 2 reviewers; the results of the IA-CTA were finally compared with those of the full spinal DSA evaluated in consensus. The IA-CTA was feasible in all cases and depicted the SVM in all except one case (92 %). Interrater agreement was good for the location of the SVMs' level. Intermodality (IA-CTA/DSA) agreement was excellent for the level and side of the shunt point, as well as for the SVM subtype evaluation. In 77 % of the cases, the Adamkiewicz artery was satisfactorily seen at the same time on IA-CTA. IA-CTA is a new technique that seems helpful to reach a better understanding of SMVs and may help to tailor more precisely their treatment. • IA-CTA is an accurate technique for the SVMs' angioarchitecture analysis • IA-CTA can locate, at the same time, the Adamkiewicz artery (AKA) • IA-CTA may be helpful in elderly patients with troublesome vasculature.
In-vitro model for evaluation of pulse oximetry
NASA Astrophysics Data System (ADS)
Vegfors, Magnus; Lindberg, Lars-Goeran; Lennmarken, Claes; Oberg, P. Ake
1991-06-01
An in vitro model with blood circulating in a silicon tubing system and including an artificial arterial bed is an important tool for evaluation of the pulse oximetry technique. The oxygen saturation was measured on an artificial finger using a pulse oximeter (SpO2) and on blood samples using a hemoximeter (SaO2). Measurements were performed at different blood flows and at different blood hematocrits. An increase in steady as well as in pulsatile blood flow was followed by an increase in pulse oximeter readings and a better agreement between SpO2 and SaO2 readings. After diluting the blood with normal saline (decreased hematocrit) the agreement was further improved. These results indicate that the pulse oximeter signal is related to blood hematocrit and the velocity of blood. The flow-related dependance of SpO2 was also evaluated in a human model. These results provided evidence that the pulse oximeter signal is dependent on vascular changes.
NASA Astrophysics Data System (ADS)
Heinze, Rieke; Moseley, Christopher; Böske, Lennart Nils; Muppa, Shravan Kumar; Maurer, Vera; Raasch, Siegfried; Stevens, Bjorn
2017-06-01
Large-eddy simulations (LESs) of a multi-week period during the HD(CP)2 (High-Definition Clouds and Precipitation for advancing Climate Prediction) Observational Prototype Experiment (HOPE) conducted in Germany are evaluated with respect to mean boundary layer quantities and turbulence statistics. Two LES models are used in a semi-idealized setup through forcing with mesoscale model output to account for the synoptic-scale conditions. Evaluation is performed based on the HOPE observations. The mean boundary layer characteristics like the boundary layer depth are in a principal agreement with observations. Simulating shallow-cumulus layers in agreement with the measurements poses a challenge for both LES models. Variance profiles agree satisfactorily with lidar measurements. The results depend on how the forcing data stemming from mesoscale model output are constructed. The mean boundary layer characteristics become less sensitive if the averaging domain for the forcing is large enough to filter out mesoscale fluctuations.
Log-Linear Modeling of Agreement among Expert Exposure Assessors
Hunt, Phillip R.; Friesen, Melissa C.; Sama, Susan; Ryan, Louise; Milton, Donald
2015-01-01
Background: Evaluation of expert assessment of exposure depends, in the absence of a validation measurement, upon measures of agreement among the expert raters. Agreement is typically measured using Cohen’s Kappa statistic, however, there are some well-known limitations to this approach. We demonstrate an alternate method that uses log-linear models designed to model agreement. These models contain parameters that distinguish between exact agreement (diagonals of agreement matrix) and non-exact associations (off-diagonals). In addition, they can incorporate covariates to examine whether agreement differs across strata. Methods: We applied these models to evaluate agreement among expert ratings of exposure to sensitizers (none, likely, high) in a study of occupational asthma. Results: Traditional analyses using weighted kappa suggested potential differences in agreement by blue/white collar jobs and office/non-office jobs, but not case/control status. However, the evaluation of the covariates and their interaction terms in log-linear models found no differences in agreement with these covariates and provided evidence that the differences observed using kappa were the result of marginal differences in the distribution of ratings rather than differences in agreement. Differences in agreement were predicted across the exposure scale, with the likely moderately exposed category more difficult for the experts to differentiate from the highly exposed category than from the unexposed category. Conclusions: The log-linear models provided valuable information about patterns of agreement and the structure of the data that were not revealed in analyses using kappa. The models’ lack of dependence on marginal distributions and the ease of evaluating covariates allow reliable detection of observational bias in exposure data. PMID:25748517
Zhang, Rui-Fang; Fu, Yu-Chuan; Lu, Yi; Zhang, Xiao-Xia; Hu, Yu-Min; Zhou, Yong-Jin; Tian, Nai-Feng; He, Jia-Wei; Yan, Zhi-Han
2017-02-01
Accurately evaluating the extent of trunk imbalance in the coronal plane is significant for patients before and after treatment. We preliminarily practiced a new method, axis-line-angle technique (ALAT), for evaluating coronal trunk imbalance with excellent intra-observer and interobserver reliability. Radiologists and surgeons were encouraged to use this method in clinical practice. However, the optimal cutoff value of the ALAT for determination of the extent of coronal trunk imbalance has not been calculated up to now. The purpose of this study was to identify the cutoff value of the ALAT that best predicts a positive measurement point to assess coronal balance or imbalance. A retrospective study at a university affiliated hospital was carried out. A total of 130 patients with C7-central sacral vertical line (CSVL) >0 mm and aged 10-18 years were recruited in this study from September 2013 to December 2014. Data were analyzed to determine the optimal cutoff value of the ALAT measurement. The C7-CSVL and ALAT measurements were conducted respectively twice on plain film within a 2-week interval by two radiologists. The optimal cutoff value of the ALAT was analyzed via receiver operating characteristic (ROC) curve. Comparison variables were performed with chi-square test between the C7-CSVL and ALAT measurements for evaluating trunk imbalance. Kappa agreement coefficient method was used to test the intra-observer and interobserver agreement of C7-CSVL and ALAT. The ROC curve area for the ALAT was 0.82 (95% confidence interval: 0.753-0.894, p<.001). The maximum Youden index was 0.51, and the corresponding cutoff point was 2.59°. No statistical difference was found between the C7-CSVL and ALAT measurements for evaluating trunk imbalance (p>.05). Intra-observer agreement values for the C7-CSVL measurements by observers 1 and 2 were 0.79 and 0.91 (p<.001), respectively, whereas intra-observer agreement values for the ALAT measurements were both 0.89 by observers 1 and 2 (p<.001). The interobserver agreement values for the first and second measurements with the C7-CSVL were 0.78 and 0.85 (p<.001), respectively, whereas the interobserver agreement values for the first and second measurements with the ALAT were 0.91 and 0.88 (p<.001), respectively. The newly developed ALAT provided an acceptable optimal cutoff value for evaluating trunk imbalance in the coronal plane with a high level of intra-observer and interobserver agreement, which suggests that the ALAT is suitable for clinical use. Copyright © 2016 Elsevier Inc. All rights reserved.
Guo, Ying; Manatunga, Amita K
2009-03-01
Assessing agreement is often of interest in clinical studies to evaluate the similarity of measurements produced by different raters or methods on the same subjects. We present a modified weighted kappa coefficient to measure agreement between bivariate discrete survival times. The proposed kappa coefficient accommodates censoring by redistributing the mass of censored observations within the grid where the unobserved events may potentially happen. A generalized modified weighted kappa is proposed for multivariate discrete survival times. We estimate the modified kappa coefficients nonparametrically through a multivariate survival function estimator. The asymptotic properties of the kappa estimators are established and the performance of the estimators are examined through simulation studies of bivariate and trivariate survival times. We illustrate the application of the modified kappa coefficient in the presence of censored observations with data from a prostate cancer study.
Martin-Gill, Christian; Higgins, J Stephen; Van Dongen, Hans P A; Buysse, Daniel J; Thackery, Ronald W; Kupas, Douglas F; Becker, David S; Dean, Bradley E; Lindbeck, George H; Guyette, Francis X; Penner, Josef H; Violanti, John M; Lang, Eddy S; Patterson, P Daniel
2018-02-15
Performance measures are a key component of implementation, dissemination, and evaluation of evidence-based guidelines (EBGs). We developed performance measures for Emergency Medical Services (EMS) stakeholders to enable the implementation of guidelines for fatigue risk management in the EMS setting. Panelists associated with the Fatigue in EMS Project, which was supported by the National Highway Traffic Safety Administration (NHTSA), used an iterative process to develop a draft set of performance measures linked to 5 recommendations for fatigue risk management in EMS. We used a cross-sectional survey design and the Content Validity Index (CVI) to quantify agreement among panelists on the wording and content of draft measures. An anonymous web-based tool was used to solicit the panelists' perceptions of clarity and relevance of draft measures. Panelists rated the clarity and relevance separately for each draft measure on a 4-point scale. CVI scores ≥0.78 for clarity and relevance were specified a priori to signify agreement and completion of measurement development. Panelists judged 5 performance measures for fatigue risk management as clear and relevant. These measures address use of fatigue and/or sleepiness survey instruments, optimal duration of shifts, access to caffeine as a fatigue countermeasure, use of napping during shift work, and the delivery of education and training on fatigue risk management for EMS personnel. Panelists complemented performance measures with suggestions for implementation by EMS agencies. Performance measures for fatigue risk management in the EMS setting will facilitate the implementation and evaluation of the EBG for Fatigue in EMS.
Laboratory and clinical evaluation of on-site urine drug testing.
Beck, Olof; Carlsson, Sten; Tusic, Marinela; Olsson, Robert; Franzen, Lisa; Hulten, Peter
2014-11-01
Products for on-site urine drug testing offer the possibility to perform screening for drugs of abuse directly at the point-of-care. This is a well-established routine in emergency and dependency clinics but further evaluation of performance is needed due to inherent limitations with the available products. Urine drug testing by an on-site product was compared with routine laboratory methods. First, on-site testing was performed at the laboratory in addition to the routine method. Second, the on-site testing was performed at a dependency clinic and urine samples were subsequently sent to the laboratory for additional analytical investigation. The on-site testing products did not perform with assigned cut-off levels. The subjective reading between the presence of a spot (i.e. negative test result) being present or no spot (positive result) was difficult in 3.2% of the cases, and occurred for all parameters. The tests performed more accurately in drug negative samples (specificity 96%) but less accurately for detecting positives (sensitivity 79%). Of all incorrect results by the on-site test the proportion of false negatives was 42%. The overall agreement between on-site and laboratory testing was 95% in the laboratory study and 98% in the clinical study. Although a high degree of agreement was observed between on-site and routine laboratory urine drug testing, the performance of on-site testing was not acceptable due to significant number of false negative results. The limited sensitivity of on-site testing compared to laboratory testing reduces the applicability of these tests.
Vasconcelos, Karla de Faria; Rovaris, Karla; Nascimento, Eduarda Helena Leandro; Oliveira, Matheus Lima; Távora, Débora de Melo; Bóscolo, Frab Norberto
2017-11-01
To evaluate the performance of conventional radiography and photostimulable phosphor (PSP) plate in the detection of simulated internal root resorption (IRR) lesions in early stages. Twenty single-rooted teeth were X-rayed before and after having a simulated IRR early lesion. Three imaging systems were used: Kodak InSight dental film and two PSPs digital systems, Digora Optime and VistaScan. The digital images were displayed on a 20.1″ LCD monitor using the native software of each system, and the conventional radiographs were evaluated on a masked light box. Two radiologists were asked to indicate the presence or absence of IRR and, after two weeks, all images were re-evaluated. Cohen's kappa coefficient was calculated to assess intra- and interobserver agreement. The three imaging systems were compared using the Kruskal-Wallis test. For interexaminer agreement, overall kappa values were 0.70, 0.65 and 0.70 for conventional film, Digora Optima and VistaScan, respectively. Both the conventional and digital radiography presented low sensitivity, specificity, accuracy, positive and negative predictive values with no significant difference between imaging systems (p = .0725). The performance of conventional and PSP was similar in the detection of simulated IRR lesions in early stages with low accuracy.
Interrater Agreement Evaluation: A Latent Variable Modeling Approach
ERIC Educational Resources Information Center
Raykov, Tenko; Dimitrov, Dimiter M.; von Eye, Alexander; Marcoulides, George A.
2013-01-01
A latent variable modeling method for evaluation of interrater agreement is outlined. The procedure is useful for point and interval estimation of the degree of agreement among a given set of judges evaluating a group of targets. In addition, the approach allows one to test for identity in underlying thresholds across raters as well as to identify…
Transformation From a Conventional Clinical Microbiology Laboratory to Full Automation.
Moreno-Camacho, José L; Calva-Espinosa, Diana Y; Leal-Leyva, Yoseli Y; Elizalde-Olivas, Dolores C; Campos-Romero, Abraham; Alcántar-Fernández, Jonathan
2017-12-22
To validate the performance, reproducibility, and reliability of BD automated instruments in order to establish a fully automated clinical microbiology laboratory. We used control strains and clinical samples to assess the accuracy, reproducibility, and reliability of the BD Kiestra WCA, the BD Phoenix, and BD Bruker MALDI-Biotyper instruments and compared them to previously established conventional methods. The following processes were evaluated: sample inoculation and spreading, colony counts, sorting of cultures, antibiotic susceptibility test, and microbial identification. The BD Kiestra recovered single colonies in less time than conventional methods (e.g. E. coli, 7h vs 10h, respectively) and agreement between both methodologies was excellent for colony counts (κ=0.824) and sorting cultures (κ=0.821). Antibiotic susceptibility tests performed with BD Phoenix and disk diffusion demonstrated 96.3% agreement with both methods. Finally, we compared microbial identification in BD Phoenix and Bruker MALDI-Biotyper and observed perfect agreement (κ=1) and identification at a species level for control strains. Together these instruments allow us to process clinical urine samples in 36h (effective time). The BD automated technologies have improved performance compared with conventional methods, and are suitable for its implementation in very busy microbiology laboratories. © American Society for Clinical Pathology 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com
Agreement between gonioscopy and ultrasound biomicroscopy in detecting iridotrabecular apposition.
Barkana, Yaniv; Dorairaj, Syril K; Gerber, Yariv; Liebmann, Jeffrey M; Ritch, Robert
2007-10-01
To assess the agreement between findings obtained at dark-room gonioscopy and ultrasound biomicroscopy (UBM) in the diagnosis of iridotrabecular apposition in light and dark conditions. We enrolled patients with appositional angle closure at dark-room gonioscopy performed using a 1-mm slitlamp beam that did not cross the pupil. Ultrasound biomicroscopic images were acquired in normal room light and subsequently with all room lights off. Images were evaluated for the presence or absence of iris-cornea contact. The angle opening distance at 500 microm was calculated. Iridotrabecular apposition in at least 1 angle quadrant was demonstrated in all 18 eyes at dark-room gonioscopy, 17 eyes (94%) at dark-room UBM, and only 10 eyes (56%) at UBM in room light. Of 18 superior angles that were appositionally closed at dark-room gonioscopy, apposition was demonstrated on UBM images in 16 (89%) in a dark room but only 6 (33%) in room light. Angle opening distance was less during dark-room gonioscopy in all but the nasal quadrant. We found high agreement between gonioscopy and UBM when both are performed in a completely dark room. Our findings support the recommendation that, in routine clinical practice, gonioscopy be performed in a dark room to avoid misdiagnosis of treatable iridotrabecular apposition.
Sedaghat, Ahmad R; Kieff, David A; Bergmark, Regan W; Cunnane, Mary E; Busaba, Nicolas Y
2015-03-01
Performance of septoplasty is dependent on objective evidence of nasal septal deviation. Although physical examination including anterior rhinoscopy and endoscopic examination is the gold standard for evaluation of septal deviation, third-party payors' reviews of septoplasty claims are often made on computed tomography (CT) findings. However, the correlation between radiographic evaluation of septal deviation with physical examination findings is unknown. Retrospective, blinded, independent evaluation of septal deviation in 39 consecutive patients from physical examination, including anterior rhinoscopy and endoscopic examination, by an otolaryngologist and radiographic evaluation of sinus CT scan by a neuroradiologist. Four distinct septal locations (nasal valve, cartilaginous, inferior/maxillary crest and osseous septum) were evaluated on a 4-point scale representing (1) 0% to 25%, (2) >25% to 50%, (3) >50% to 75%, and (4) >75% obstruction. Correlation between physical examination and radiographic evaluations was made by Pearson's correlation and quantitative agreement assessed by Krippendorf's alpha. Statistically significant correlation was detected between physical examination including nasal endoscopy and radiographic assessment of septal deviation only at the osseous septum (p = 0.007, r = 0.425) with low quantitative agreement (α = 0.290). No significant correlation was detected at the cartilaginous septum (p = 0.286, r = 0.175), inferior septum (p = 0.117, r = 0.255), or nasal valve (p = 0.174, r = 0.222). Quantitative agreement at the nasal valve suggested a bias in CT to underestimate physical exam findings (α = -0.490). CT is a poor substitute for physical examination, the gold standard, in assessment of septal deviation. Clinical decisions about pursuit of septoplasty or third-party payors' decisions to approve septoplasty should not be made on radiographic evidence. © 2014 ARS-AAOA, LLC.
Development of a reactive-dispersive plume model
NASA Astrophysics Data System (ADS)
Kim, Hyun S.; Kim, Yong H.; Song, Chul H.
2017-04-01
A reactive-dispersive plume model (RDPM) was developed in this study. The RDPM can consider two main components of large-scale point source plume: i) turbulent dispersion and ii) photochemical reactions. In order to evaluate the simulation performance of newly developed RDPM, the comparisons between the model-predicted and observed mixing ratios were made using the TexAQS II 2006 (Texas Air Quality Study II 2006) power-plant experiment data. Statistical analyses show good correlation (0.61≤R≤0.92), and good agreement with the Index of Agreement (0.70≤R≤0.95). The chemical NOx lifetimes for two power-plant plumes (Monticello and Welsh power plants) were also estimated.
Ghobrial, Fady Emil Ibrahim; Eldin, Manal Salah; Razek, Ahmed Abdel Khalek Abdel; Atwan, Nadia Ibrahim; Shamaa, Sameh Sayed Ahmed
2017-01-01
To assess inter-observer agreement of revised RECIST criteria (version 1.1) for computed tomography assessment of hepatic metastases of breast cancer. A prospective study was conducted in 28 female patients with breast cancer and with at least one measurable metastatic lesion in the liver that was treated with 3 cycles of anthracycline-based chemotherapy. All patients underwent computed tomography of the abdomen with 64-row multi- detector CT at baseline and after 3 cycles of chemotherapy for response assessment. Image analysis was performed by 2 observers, based on the RECIST criteria (version 1.1). Computed tomography revealed partial response of hepatic metastases in 7 patients (25%) by one observer and in 10 patients (35.7%) by the other observer, with good inter-observer agreement (k=0.75, percent agreement of 89.29%). Stable disease was detected in 19 patients (67.8%) by one observer and in 16 patients (57.1%) by the other observer, with good agreement (k=0.774, percent agreement of 89.29%). Progressive disease was detected in 2 patients (7.2%) by both observers, with perfect agreement (k=1, percent agreement of 100%). The overall inter-observer agreement in the CT-based response assessment of hepatic metastasis between the two observers was good ( k =0.793, percent agreement of 89.29%). We concluded that computed tomography is a reliable and reproducible imaging modality for response assessment of hepatic metastases of breast cancer according to the RECIST criteria (version 1.1).
Santosh, D; Goel, A; Birchall, I W; Kumar, A; Lee, K H; Patel, V H; Low, G
2017-10-01
To compare magnetic resonance cholangiopancreatography (MRCP) and Gd-EOB-DTPA-enhanced MRI in the evaluation of the biliary anatomy in potential living liver donors (LLDs). A retrospective study was conducted in a tertiary care liver transplant center after obtaining ethics and institutional approvals. A total of 42 potential LLD MRI examinations were performed between November 2013 and March 2016. All patients underwent a standard MRI protocol which included MRCP and Gd-EOB-DTPA-enhanced MRI sequences in a single session. Three abdominal MR radiologists independently reviewed the studies and completed a customized data collection sheet for each MR sequence. The readers subjectively scored the bile duct visualization on each MR sequence on a Likert scale and classified the biliary anatomic configuration. Statistical analysis was performed using intraclass correlation coefficient and the McNemar Chi-square (χ 2 ) test. The 42 potential LLDs included 22 males and 20 females with an age range of 18-60 years. There was 'good' or 'excellent' inter-reader agreement on either MRI examination for the visualization of the first- and second-order ducts and the majority of third-order ducts. 'Good' inter-reader agreement on Gd-EOB-DTPA-enhanced MRI and 'fair' inter-reader agreement on MRCP was noted for the left third-order medial duct. There was significantly better visualization of the cystic duct, left hepatic duct, and right second-order ducts on Gd-EOB-DTPA-enhanced MRI compared with MRCP. A 12.6% improvement in classifying the biliary branch pattern was also observed on Gd-EOB-DTPA-enhanced MRI compared with MRCP (P = 0.03). Gd-EOB-DTPA-enhanced MRI provides additional diagnostic confidence over MRCP in the evaluation of the biliary ductal anatomy in potential LLDs.
Paesani, Daniel A; Guarda-Nardini, Luca; Gelos, Carlota; Salmaso, Luigi; Manfredini, Daniele
2014-03-01
The aim was to answer the clinical research question: is incisal/occlusal tooth wear assessment on dental casts performed by five professionals with expertise in different fields of dentistry reliable? Five examiners with different fields of expertise in the dental profession assessed tooth wear on dental casts of 45 subjects, based on a six-degree rating of incisal/occlusal wear. After a calibration meeting, the examiners evaluated the casts individually and various issues concerning interexaminer agreement and reliability were assessed. A total of 872 teeth were evaluated. The five examiners agreed only for the rating of 6.6% of the teeth. The teeth with the highest percentage of agreement were the premolars. Pairwise comparison of the assessments of the examiners #1 (bruxism expert), #2 (orthodontist), #3 (temporomandibular disorders [TMD] and occlusion expert), #4 (dental nurse) showed fair to moderate agreement, with κ-values ranging from 0.306 to 0.577, whilst the examiner #5 (lab technician) achieved low interexaminer reliability values with all the other four examiners. The interexaminer reliability of tooth wear assessment on dental casts performed by five professionals with expertise in different fields of dentistry is highly variable. General practitioners should keep in mind that consensus decisions by the examiners and assessment by raters belonging to the same dental discipline are recommended strategies to increase the reliability of tooth wear evaluation in the clinical setting. This investigation adds to the literature suggesting that, in a clinical setting, a single examiner's assessment of tooth wear on dental casts does not have optimal reliability and that it may be source of internal validity problems in the research setting.
Numerical Study of Aeroacoustic Sound on Performance of Bladeless Fan
NASA Astrophysics Data System (ADS)
Jafari, Mohammad; Sojoudi, Atta; Hafezisefat, Parinaz
2017-03-01
Aeroacoustic performance of fans is essential due to their widespread application. Therefore, the original aim of this paper is to evaluate the generated noise owing to different geometric parameters. In current study, effect of five geometric parameters was investigated on well performance of a Bladeless fan. Airflow through this fan was analyzed simulating a Bladeless fan within a 2 m×2 m×4 m room. Analysis of the flow field inside the fan and evaluating its performance were obtained by solving conservations of mass and momentum equations for aerodynamic investigations and FW-H noise equations for aeroacoustic analysis. In order to design Bladeless fan Eppler 473 airfoil profile was used as the cross section of this fan. Five distinct parameters, namely height of cross section of the fan, outlet angle of the flow relative to the fan axis, thickness of airflow outlet slit, hydraulic diameter and aspect ratio for circular and quadratic cross sections were considered. Validating acoustic code results, we compared numerical solution of FW-H noise equations for NACA0012 with experimental results. FW-H model was selected to predict the noise generated by the Bladeless fan as the numerical results indicated a good agreement with experimental ones for NACA0012. To validate 3-D numerical results, the experimental results of a round jet showed good agreement with those simulation data. In order to indicate the effect of each mentioned parameter on the fan performance, SPL and OASPL diagrams were illustrated.
Performance of Lung Ultrasound in Detecting Peri-Operative Atelectasis after General Anesthesia.
Yu, Xin; Zhai, Zhenping; Zhao, Yongfeng; Zhu, Zhiming; Tong, Jianbin; Yan, Jianqin; Ouyang, Wen
2016-12-01
The aim of this prospective observational study was to evaluate the performance of lung ultrasound (LUS) in detecting post-operative atelectasis in adult patients under general anesthesia. Forty-six patients without pulmonary comorbidities who were scheduled for elective neurosurgery were enrolled in the study. A total of 552 pairs of LUS clips and thoracic computed tomography (CT) images were ultimately analyzed to determine the presence of atelectasis in 12 prescribed lung regions. The accuracy of LUS in detecting peri-operative atelectasis was evaluated with thoracic CT as gold standard. Levels of agreement between the two observers for LUS and the two observers for thoracic CT were analyzed using the κ reliability test. The quantitative correlation between LUS scores of aeration and the volumetric data of atelectasis in thoracic CT were further evaluated. LUS had reliable performance in post-operative atelectasis, with a sensitivity of 87.7%, specificity of 92.1% and diagnostic accuracy of 90.8%. The levels of agreement between the two observers for LUS and for thoracic CT were both satisfactory, with κ coefficients of 0.87 (p < 0.0001) and 0.93 (p < 0.0001), respectively. In patients in the supine position, LUS scores were highly correlated with the atelectasis volume of CT (r = 0.58, p < 0.0001). Thus, LUS provides a fast, reliable and radiation-free method to identify peri-operative atelectasis in adults. Copyright © 2016. Published by Elsevier Inc.
Lima, Alexandre; López, Alejandra; van Genderen, Michel E; Hurtado, Francisco Javier; Angulo, Martin; Grignola, Juan C; Shono, Atsuko; van Bommel, Jasper
2015-09-01
This was a cross-sectional multicenter study to investigate the ability of physicians and nurses from three different countries to subjectively evaluate sublingual microcirculation images and thereby discriminate normal from abnormal sublingual microcirculation based on flow and density abnormalities. Forty-five physicians and 61 nurses (mean age, 36 ± 10 years; 44 males) from three different centers in The Netherlands (n = 61), Uruguay (n = 12), and Japan (n = 33) were asked to subjectively evaluate a sample of 15 microcirculation videos randomly selected from an experimental model of endotoxic shock in pigs. All videos were first analyzed offline using the A.V.A. software by an independent, experienced investigator and were categorized as good, bad, or very bad microcirculation based on the microvascular flow index, perfused capillary density, and proportion of perfused capillaries. Then, the videos were randomly assigned to the examiners, who were instructed to subjectively categorize each image as good, bad, or very bad. An interrater analysis was performed, and sensitivity and specificity tests were calculated to evaluate the proportion of A.V.A. score abnormalities that the examiners correctly identified. The κ statistics indicated moderate agreement in the evaluation of microcirculation abnormalities using three categories, i.e., good, bad, or very bad (κ = 0.48), and substantial agreement using two categories, i.e., normal (good) and abnormal (bad or very bad) (κ = 0.66). There was no significant difference between the κ three and κ two statistics. We found that the examiner's subjective evaluations had good diagnostic performance and were highly sensitive (84%; 95% confidence interval, 81%-86%) and specific (87%; 95% confidence interval, 84%-90%) for sublingual microcirculatory abnormalities as assessed using the A.V.A. software. The subjective evaluations of sublingual microcirculation by physicians and nurses agreed well with a conventional offline analysis and were highly sensitive and specific for sublingual microcirculatory abnormalities.
Code of Federal Regulations, 2014 CFR
2014-07-01
... to performance or repayment under a scholarship agreement? 386.42 Section 386.42 Education... performance or repayment under a scholarship agreement? To obtain a deferral or exception to performance or repayment under a scholarship agreement, a scholar shall provide the following: (a) Written application. A...
Code of Federal Regulations, 2010 CFR
2010-07-01
... exception to performance or repayment under a scholarship agreement? 386.42 Section 386.42 Education... performance or repayment under a scholarship agreement? To obtain a deferral or exception to performance or repayment under a scholarship agreement, a scholar shall provide the following: (a) Written application. A...
Code of Federal Regulations, 2011 CFR
2011-07-01
... to performance or repayment under a scholarship agreement? 386.42 Section 386.42 Education... performance or repayment under a scholarship agreement? To obtain a deferral or exception to performance or repayment under a scholarship agreement, a scholar shall provide the following: (a) Written application. A...
Code of Federal Regulations, 2012 CFR
2012-07-01
... exception to performance or repayment under a scholarship agreement? 386.42 Section 386.42 Education... performance or repayment under a scholarship agreement? To obtain a deferral or exception to performance or repayment under a scholarship agreement, a scholar shall provide the following: (a) Written application. A...
Code of Federal Regulations, 2013 CFR
2013-07-01
... exception to performance or repayment under a scholarship agreement? 386.42 Section 386.42 Education... performance or repayment under a scholarship agreement? To obtain a deferral or exception to performance or repayment under a scholarship agreement, a scholar shall provide the following: (a) Written application. A...
FY 1999 Annual Self-Evaluation Report of the Pacific Northwest National Laboratory
DOE Office of Scientific and Technical Information (OSTI.GOV)
Randy R. LaBarge
1999-11-05
This is a report of the Pacific Northwest National Laboratory's (Pacific Northwest's) FY1999 Annual Self-Evaluation Report. This report summarizes our progress toward accomplishment of the critical outcomes, objectives, and performance indicators as delineated in the FY1999 Performance Evaluation & Fee Agreement. It also summarizes our analysis of the results of Pacific Northwest's Division and Directorate annual self-assessments, and the implementation of our key operational improvement initiatives. Together, these provide an indication of how well we have used our Integrated Assessment processes to identify and plan improvements for FY2000. As you review the report you will find areas of significantly positivemore » progress; you will also note areas where I believe the Laboratory could make improvements. Overall, however, I believe you will be quite pleased to note that we have maintained, or exceeded, the high standards of performance we have set for the Laboratory.« less
Gas exchange and lactate anaerobic thresholds: inter- and intraevaluator agreement.
Gladden, L B; Yates, J W; Stremel, R W; Stamford, B A
1985-06-01
Twenty-four coded graph sets of gas exchange variables and blood lactate concentration (LA) plotted against time at 15-s intervals were analyzed by nine evaluators who determined the gas exchange (ATGE) and LA (ATLA) anaerobic thresholds. In addition, ATGE and ATLA were determined by a linear regression computer program. Agreement between ATGE and ATLA was poor; the median intraclass correlation coefficient (ri) was 0.53. Among evaluators, ATLA agreement (median ri = 0.81) was better than ATGE agreement (median ri = 0.70). In general, the ability of any evaluator to choose similar values from duplicate plots for either ATGE (median ri = 0.97) or ATLA (median ri = 0.995) was good. There was better agreement between the mean ATLA of the evaluators and the computer ATLA (ComLA) (ri = 0.88) than between the mean ATGE of the evaluators and the computer ATGE (ComVE), (ri = 0.58). Agreement between ComVE and ComLA was poor (ri = 0.29). These results suggest that ATGE does not accurately predict ATLA and that different evaluators choose different thresholds from the same data. Further assessment of the validity and precision of ATGE based on breath-by-breath and minute-by-minute data is needed.
Wee, Leonard; Hackett, Sara Lyons; Jones, Andrew; Lim, Tee Sin; Harper, Christopher Stirling
2013-01-01
This study evaluated the agreement of fiducial marker localization between two modalities — an electronic portal imaging device (EPID) and cone‐beam computed tomography (CBCT) — using a low‐dose, half‐rotation scanning protocol. Twenty‐five prostate cancer patients with implanted fiducial markers were enrolled. Before each daily treatment, EPID and half‐rotation CBCT images were acquired. Translational shifts were computed for each modality and two marker‐matching algorithms, seed‐chamfer and grey‐value, were performed for each set of CBCT images. The localization offsets, and systematic and random errors from both modalities were computed. Localization performances for both modalities were compared using Bland‐Altman limits of agreement (LoA) analysis, Deming regression analysis, and Cohen's kappa inter‐rater analysis. The differences in the systematic and random errors between the modalities were within 0.2 mm in all directions. The LoA analysis revealed a 95% agreement limit of the modalities of 2 to 3.5 mm in any given translational direction. Deming regression analysis demonstrated that constant biases existed in the shifts computed by the modalities in the superior–inferior (SI) direction, but no significant proportional biases were identified in any direction. Cohen's kappa analysis showed good agreement between the modalities in prescribing translational corrections of the couch at 3 and 5 mm action levels. Images obtained from EPID and half‐rotation CBCT showed acceptable agreement for registration of fiducial markers. The seed‐chamfer algorithm for tracking of fiducial markers in CBCT datasets yielded better agreement than the grey‐value matching algorithm with EPID‐based registration. PACS numbers: 87.55.km, 87.55.Qr PMID:23835391
Structured reporting platform improves CAD-RADS assessment.
Szilveszter, Bálint; Kolossváry, Márton; Karády, Júlia; Jermendy, Ádám L; Károlyi, Mihály; Panajotu, Alexisz; Bagyura, Zsolt; Vecsey-Nagy, Milán; Cury, Ricardo C; Leipsic, Jonathon A; Merkely, Béla; Maurovich-Horvat, Pál
2017-11-01
Structured reporting in cardiac imaging is strongly encouraged to improve quality through consistency. The Coronary Artery Disease - Reporting and Data System (CAD-RADS) was recently introduced to facilitate interdisciplinary communication of coronary CT angiography (CTA) results. We aimed to assess the agreement between manual and automated CAD-RADS classification using a structured reporting platform. Five readers prospectively interpreted 500 coronary CT angiographies using a structured reporting platform that automatically calculates the CAD-RADS score based on stenosis and plaque parameters manually entered by the reader. In addition, all readers manually assessed CAD-RADS blinded to the automatically derived results, which was used as the reference standard. We evaluated factors influencing reader performance including CAD-RADS training, clinical load, time of the day and level of expertise. Total agreement between manual and automated classification was 80.2%. Agreement in stenosis categories was 86.7%, whereas the agreement in modifiers was 95.8% for "N", 96.8% for "S", 95.6% for "V" and 99.4% for "G". Agreement for V improved after CAD-RADS training (p = 0.047). Time of the day and clinical load did not influence reader performance (p > 0.05 both). Less experienced readers had a higher total agreement as compared to more experienced readers (87.0% vs 78.0%, respectively; p = 0.011). Even though automated CAD-RADS classification uses data filled in by the readers, it outperforms manual classification by preventing human errors. Structured reporting platforms with automated calculation of the CAD-RADS score might improve data quality and support standardization of clinical decision making. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Rahimy, Ehsan; Reddy, Sahitya; DeCroos, Francis Char; Khan, M Ali; Boyer, David S; Gupta, Omesh P; Regillo, Carl D; Haller, Julia A
2015-08-01
To evaluate the visual acuity agreement between a standard back-illuminated Early Treatment Diabetic Retinopathy Study (ETDRS) chart and a handheld internally illuminated ETDRS chart. Two-center prospective study. Seventy patients (134 eyes) with retinal pathology were enrolled between October 2012 and August 2013. Visual acuity was measured using both the ETDRS chart and the handheld device by masked independent examiners after best protocol refraction. Examination was performed in the same room under identical illumination and testing conditions. The mean number of letters seen was 63.0 (standard deviation: 19.8 letters) and 61.2 letters (standard deviation: 19.1 letters) for the ETDRS chart and handheld device, respectively. Mean difference per eye between the ETDRS and handheld device was 1.8 letters. A correlation coefficient (r) of 0.95 demonstrated a positive linear correlation between ETDRS chart and handheld device measured acuities. Intraclass correlation coefficient was performed to assess the reproducibility of the measurements made by different observers measuring the same quantity and was calculated to be 0.95 (95% confidence interval: 0.93-0.96). Agreement was independent of retinal disease. The strong correlation between measured visual acuity using the ETDRS and handheld equivalent suggests that they may be used interchangeably, with accurate measurements. Potential benefits of this device include convenience and portability, as well as the ability to assess ETDRS visual acuity without a dedicated testing lane.
Pujol, I; Guarro, J; Llop, C; Soler, L; Fernández-Ballart, J
1996-09-01
An evaluation of broth dilution antifungal susceptibility tests was performed by determining both the micro- and macrodilution MICs of amphotericin B, fluconazole, ketoconazole, 5-fluorocytosine, miconazole, and itraconazole against representative species of opportunistic hyphomycetes (Fusarium spp. and Cladosporium [Cladophialophora] spp.) and ascomycetes (Chaetomium spp.). A total of 78 strains were tested, the majority of them twice and some three times on different days. Both methods were performed according to the recommendations of the National Committee for Clinical Laboratory Standards (Document M27-P), with the exception of the temperature of incubation, which was 25 degrees C in our case. A spectrophotometric method for inoculum preparation, RPMI 1640 medium buffered with morpholinepropanesulfonic acid (pH 7.0), and an additive drug dilution procedure were used. The MICs obtained by the two methods were read after 48, 72, and 96 h of incubation for Fusarium spp. and after 72, 96, and 120 h for the remaining isolates. The kappa test was used to calculate the degree of agreement. Considering the three fungal groups together, a good agreement between the results of both tests was observed with almost all the drugs at the different incubation times. There were no cases of poor agreement. The highest level (kappa index = 1) was observed with ketoconazole at the second-day reading. These results support the further evaluation of the broth microdilution test as an alternative to the reference broth macrodilution susceptibility test.
Pujol, I; Guarro, J; Llop, C; Soler, L; Fernández-Ballart, J
1996-01-01
An evaluation of broth dilution antifungal susceptibility tests was performed by determining both the micro- and macrodilution MICs of amphotericin B, fluconazole, ketoconazole, 5-fluorocytosine, miconazole, and itraconazole against representative species of opportunistic hyphomycetes (Fusarium spp. and Cladosporium [Cladophialophora] spp.) and ascomycetes (Chaetomium spp.). A total of 78 strains were tested, the majority of them twice and some three times on different days. Both methods were performed according to the recommendations of the National Committee for Clinical Laboratory Standards (Document M27-P), with the exception of the temperature of incubation, which was 25 degrees C in our case. A spectrophotometric method for inoculum preparation, RPMI 1640 medium buffered with morpholinepropanesulfonic acid (pH 7.0), and an additive drug dilution procedure were used. The MICs obtained by the two methods were read after 48, 72, and 96 h of incubation for Fusarium spp. and after 72, 96, and 120 h for the remaining isolates. The kappa test was used to calculate the degree of agreement. Considering the three fungal groups together, a good agreement between the results of both tests was observed with almost all the drugs at the different incubation times. There were no cases of poor agreement. The highest level (kappa index = 1) was observed with ketoconazole at the second-day reading. These results support the further evaluation of the broth microdilution test as an alternative to the reference broth macrodilution susceptibility test. PMID:8878589
Scales of degree of facial paralysis: analysis of agreement.
Fonseca, Kércia Melo de Oliveira; Mourão, Aline Mansueto; Motta, Andréa Rodrigues; Vicente, Laelia Cristina Caseiro
2015-01-01
It has become common to use scales to measure the degree of involvement of facial paralysis in phonoaudiological clinics. To analyze the inter- and intra-rater agreement of the scales of degree of facial paralysis and to elicit point of view of the appraisers regarding their use. Cross-sectional observational clinical study of the Chevalier and House & Brackmann scales performed by five speech therapists with clinical experience, who analyzed the facial expression of 30 adult subjects with impaired facial movements two times, with a one week interval between evaluations. The kappa analysis was employed. There was excellent inter-rater agreement for both scales (kappa>0.80), and on the Chevalier scale a substantial intra-rater agreement in the first assessment (kappa=0.792) and an excellent agreement in the second assessment (kappa=0.928). The House & Brackmann scale showed excellent agreement at both assessments (kappa=0.850 and 0.857). As for the appraisers' point of view, one appraiser thought prior training is necessary for the Chevalier scale and, four appraisers felt that training is important for the House & Brackmann scale. Both scales have good inter- and intra-rater agreement and most of the appraisers agree on the ease and relevance of the application of these scales. Copyright © 2014 Associação Brasileira de Otorrinolaringologia e Cirurgia Cérvico-Facial. Published by Elsevier Editora Ltda. All rights reserved.
Agreement in the assessment of metastatic spine disease using scoring systems.
Arana, Estanislao; Kovacs, Francisco M; Royuela, Ana; Asenjo, Beatriz; Pérez-Ramírez, Ursula; Zamora, Javier
2015-04-01
To assess variability in the use of Tomita and modified Bauer scores in spine metastases. Clinical data and imaging from 90 patients with biopsy-proven spinal metastases, were provided to 83 specialists from 44 hospitals. Spinal levels involved and the Tomita and modified Bauer scores for each case were determined twice by each clinician, with a minimum of 6-week interval. Clinicians were blinded to every evaluation. Kappa statistic was used to assess intra and inter-observer agreement. Subgroup analyses were performed according to clinicians' specialty (medical oncology, neurosurgery, radiology, orthopedic surgery and radiation oncology), years of experience (⩽7, 8-13, ⩾14), and type of hospital (four levels). For metastases identification, intra-observer agreement was "substantial" (0.60
Lai, Jeffrey K C; Robertson, Patricia L; Goh, Christine; Szer, Jeff
2018-02-01
To evaluate the intraobserver and interobserver agreement for bone marrow burden (BMB) scores for individual examinations and for the change in BMB score over time in the same patient. A total of 119 sets of MR images of the lumbar spine and femora from 60 patients with Gaucher disease were included. Each set of MR images was scored using the BMB score independently by two experienced MSK radiologists. One radiologist performed a second read four weeks later. Intraobserver and interobserver agreement was assessed using Bland-Altman analysis and weighted kappa scores. BMB scores (n=119) demonstrated fair intraobserver agreement (weighted kappa=0.53) with a mean difference of -0.20 and 95% limits of agreement (LOA) of (-3.41, 3.01). Inter observer agreement was poor with weighted kappa 0.28 with mean difference of -0.16 and 95% LOA of (-4.45, 4.11). Change in BMB scores over time (n=59) demonstrated poor/fair intraobserver agreement (weighted kappa 0.41, mean difference-0.20 and 95% LOA (-4.35, 3.94)). Interobserver agreement was poor (weighted kappa 0.25, mean difference -0.12 with wide 95% LOA (-6.23, 5.99)). Significant interobserver, and to a lesser extent intraobserver, variation occurs with blinded BMB scoring of Gaucher disease. Copyright © 2016 Elsevier Inc. All rights reserved.
Evaluation of Multiclass Model Observers in PET LROC Studies
NASA Astrophysics Data System (ADS)
Gifford, H. C.; Kinahan, P. E.; Lartizien, C.; King, M. A.
2007-02-01
A localization ROC (LROC) study was conducted to evaluate nonprewhitening matched-filter (NPW) and channelized NPW (CNPW) versions of a multiclass model observer as predictors of human tumor-detection performance with PET images. Target localization is explicitly performed by these model observers. Tumors were placed in the liver, lungs, and background soft tissue of a mathematical phantom, and the data simulation modeled a full-3D acquisition mode. Reconstructions were performed with the FORE+AWOSEM algorithm. The LROC study measured observer performance with 2D images consisting of either coronal, sagittal, or transverse views of the same set of cases. Versions of the CNPW observer based on two previously published difference-of-Gaussian channel models demonstrated good quantitative agreement with human observers. One interpretation of these results treats the CNPW observer as a channelized Hotelling observer with implicit internal noise
Monte Carlo simulation of Ray-Scan 64 PET system and performance evaluation using GATE toolkit
NASA Astrophysics Data System (ADS)
Li, Suying; Zhang, Qiushi; Vuletic, Ivan; Xie, Zhaoheng; Yang, Kun; Ren, Qiushi
2017-02-01
In this study, we aimed to develop a GATE model for the simulation of Ray-Scan 64 PET scanner and model its performance characteristics. A detailed implementation of system geometry and physical process were included in the simulation model. Then we modeled the performance characteristics of Ray-Scan 64 PET system for the first time, based on National Electrical Manufacturers Association (NEMA) NU-2 2007 protocols and validated the model against experimental measurement, including spatial resolution, sensitivity, counting rates and noise equivalent count rate (NECR). Moreover, an accurate dead time module was investigated to simulate the counting rate performance. Overall results showed reasonable agreement between simulation and experimental data. The validation results showed the reliability and feasibility of the GATE model to evaluate major performance of Ray-Scan 64 PET system. It provided a useful tool for a wide range of research applications.
Automated lobar quantification of emphysema in patients with severe COPD.
Revel, Marie-Pierre; Faivre, Jean-Baptiste; Remy-Jardin, Martine; Deken, Valérie; Duhamel, Alain; Marquette, Charles-Hugo; Tacelli, Nunzia; Bakai, Anne-Marie; Remy, Jacques
2008-12-01
Automated lobar quantification of emphysema has not yet been evaluated. Unenhanced 64-slice MDCT was performed in 47 patients evaluated before bronchoscopic lung-volume reduction. CT images reconstructed with a standard (B20) and high-frequency (B50) kernel were analyzed using a dedicated prototype software (MevisPULMO) allowing lobar quantification of emphysema extent. Lobar quantification was obtained following (a) a fully automatic delineation of the lobar limits by the software and (b) a semiautomatic delineation with manual correction of the lobar limits when necessary and was compared with the visual scoring of emphysema severity per lobe. No statistically significant difference existed between automated and semiautomated lobar quantification (p > 0.05 in the five lobes), with differences ranging from 0.4 to 3.9%. The agreement between the two methods (intraclass correlation coefficient, ICC) was excellent for left upper lobe (ICC = 0.94), left lower lobe (ICC = 0.98), and right lower lobe (ICC = 0.80). The agreement was good for right upper lobe (ICC = 0.68) and moderate for middle lobe (IC = 0.53). The Bland and Altman plots confirmed these results. A good agreement was observed between the software and visually assessed lobar predominance of emphysema (kappa 0.78; 95% CI 0.64-0.92). Automated and semiautomated lobar quantifications of emphysema are concordant and show good agreement with visual scoring.
Kim, Jeeyong; Cho, Chi Hyun; Jung, Bo Kyeung; Nam, Jeonghun; Seo, Hong Seog; Shin, Sehyun; Lim, Chae Seung
2018-04-14
The objective of this study was to comparatively evaluate three commercial whole-blood platelet function analyzer systems: Platelet Function Analyzer-200 (PFA; Siemens Canada, Mississauga, Ontario, Canada), Multiplate analyzer (MP; Roche Diagnostics International Ltd., Rotkreuz, Switzerland), and Plateletworks Combo-25 kit (PLW; Helena Laboratories, Beaumont, TX, USA). Venipuncture was performed on 160 patients who visited a department of cardiology. Pairwise agreement among the three platelet function assays was assessed using Cohen's kappa coefficient and percent agreement within the reference limit. Kappa values with the same agonists were poor between PFA-collagen (COL; agonist)/adenosine diphosphate (ADP) and MP-ADP (-0.147), PFA-COL/ADP and PLW-ADP (0.089), MP-ADP and PLW-ADP (0.039), PFA-COL/ADP and MP-COL (-0.039), and between PFA-COL/ADP and PLW-COL (-0.067). Nonetheless, kappa values for the same assay principle with a different agonist were slightly higher between PFA-COL/ADP and PFA-COL/EPI (0.352), MP-ADP and MP-COL (0.235), and between PLW-ADP and PLW-COL (0.247). The range of percent agreement values was 38.7% to 73.8%. Therefore, various measurements of platelet function by more than one method were needed to obtain a reliable interpretation of platelet function considering low kappa coefficient and modest percent agreement rates among 3 different platelet function tests.
Interpretation of bedside chest X-rays in the ICU: is the radiologist still needed?
Martini, Katharina; Ganter, Christoph; Maggiorini, Marco; Winklehner, Anna; Leupi-Skibinski, Katarzyna E; Frauenfelder, Thomas; Nguyen-Kim, Thi Dan Linh
2015-01-01
To compare diagnostic accuracy of intensivists to radiologists in reading bedside chest X-rays. In a retrospective trial, 33 bedside chest X-rays were evaluated by five radiologists and five intensivists with different experience. Images were evaluated for devices and lung pathologies. Interobserver agreement and diagnostic accuracy were calculated. Computed tomography served as reference standard. Seniors had higher diagnostic accuracy than residents (mean-ExpB(Senior)=1.456; mean-ExpB(Resident)=1.635). Interobserver agreement for installations was more homogenously distributed between radiologists compared to intensivists (ExpB(Rad)=1.204-1.672; ExpB(Int)=1.005-2.368). Seniors had comparable diagnostic accuracy. No significant difference in diagnostic performance was seen between seniors of both disciplines, whereas the resident intensivists might still benefit from an interdisciplinary dialogue. Copyright © 2015 Elsevier Inc. All rights reserved.
2013-08-06
CAPE CANAVERAL, Fla. – In Hangar N at Cape Canaveral Air Force Station, ceramic materials are positioned for Advanced Partial Angle Computed Tomography testing. The activity is part of work performed by PaR Systems, Inc. under a partnership agreement with NASA. NASA's Kennedy Space Center in Florida recently established a partnership agreement with PaR Systems, Inc. of Shoreview, Minn., for operation of the Hangar N facility and its nondestructive testing and evaluation equipment. As the spaceport transitions from a historically government-only launch facility to a multi-user spaceport for both federal and commercial customers, partnerships between the space agency and other organizations will be a key element in that effort. Hangar N is located at Cape Canaveral Air Force Station adjacent to Kennedy and houses a unique inventory of test and evaluation equipment and the capability for current and future mission spaceflight support. Photo credit: NASA/ Dimitri Gerondidakis
Simulation of a 7.7 MW onshore wind farm with the Actuator Line Model
NASA Astrophysics Data System (ADS)
Guggeri, A.; Draper, M.; Usera, G.
2017-05-01
Recently, the Actuator Line Model (ALM) has been evaluated with coarser resolution and larger time steps than what is generally recommended, taking into account an atmospheric sheared and turbulent inflow condition. The aim of the present paper is to continue these studies, assessing the capability of the ALM to represent the wind turbines’ interactions in an onshore wind farm. The ‘Libertad’ wind farm, which consists of four 1.9MW Vestas V100 wind turbines, was simulated considering different wind directions, and the results were compared with the wind farm SCADA data, finding good agreement between them. A sensitivity analysis was performed to evaluate the influence of the spatial resolution, finding acceptable agreement, although some differences were found. It is believed that these differences are due to the characteristics of the different Atmospheric Boundary Layer (ABL) simulations taken as inflow condition (precursor simulations).
Chacón, Jose
2010-07-21
Drug efficacy might differ from clinical trial results when performed in clinical daily conditions. Therefore, it is mandatory to conduct trials about effectiveness to improve external validity. This post-authorization, open-label, noncontrolled, prospective, multicenter, observational, and naturalistic trial was designed to search for factors influencing the racecadotril overall effect on childhood acute watery diarrhea in a real-world setting of Venezuela. There were 3,873 children with acute watery diarrhea treated with racecadotril, an enkephalin breakdown blocker plus oral rehydration therapy by 97 pediatricians. Evaluations were carried out daily until emission of two consecutive formed stools or absence of watery bowel movements for 24 hours. The primary end-point was time-to-relief, defined as the time from first racecadotril dose to the last watery bowel movement time. Age, gender, nursing type, nursing status during diarrhea, diarrhea severity, and co-medication were considered as factors in the statistical analysis. The primary end-point was evaluated by factors using UNIANOVA, and post-hoc tests were done. A multiple regression analysis was carried out to identify factors affecting drug performance, racecadotril effectiveness and tolerability overall assessment was searched by physicians and patients, and inter-observer agreement was evaluated by kappa statistics. The mean time-to-relief was 18.5 +/- 12.5 hours [95% confidence interval 17.9-19.0] and the diarrhea severity was the only variable with significant and independent weight on racecadotril effectiveness explaining 23% of time-to-relief variance, but even in severe diarrhea cases this time was less than 24 hours. High agreement about satisfactory perception on effectiveness and tolerability was reached among physicians and patients. In conclusion, the racecadotril overall effect, evaluated in a real-world setting of Venezuela, was in agreement with results of some earlier controlled trials. It was only influenced by severity of diarrhea episode, as well as being considered an effective and well tolerated treatment by physicians and patients.
England, Matthew R.; Jurcic Smith, Kristen L.; He, Taojun; Wijetunge, Dona Saumya; Chamberland, Robin R.; Menegus, Marilyn; Swierkosz, Ella M.; Jerris, Robert C.; Greene, Wallace
2017-01-01
ABSTRACT The performance of the new ePlex Respiratory Pathogen (RP) panel (GenMark Diagnostics) for the simultaneous detection of 19 viruses (influenza A virus; influenza A H1 virus; influenza A 2009 H1 virus; influenza A H3 virus; influenza B virus; adenovirus; coronaviruses [HKU1, OC43, NL63, and 229E]; human rhinovirus/enterovirus; human metapneumovirus; parainfluenza viruses 1, 2, 3, and 4; and respiratory syncytial virus [RSV] [RSV subtype A and RSV subtype B]) and 2 bacteria (Mycoplasma pneumoniae and Chlamydia pneumoniae) was evaluated. Prospectively and retrospectively collected nasopharyngeal swab (NPS) specimens (n = 2,908) were evaluated by using the ePlex RP panel, with the bioMérieux/BioFire FilmArray Respiratory Panel (BioFire RP) as the comparator method. Discordance analysis was performed by using target-specific PCRs and bidirectional sequencing. The reproducibility of the assay was evaluated by using reproducibility panels comprised of 6 pathogens. The overall agreement between the ePlex RP and BioFire RP results was >95% for all targets. Positive percent agreement with the BioFire RP result for viruses ranged from 85.1% (95% confidence interval [CI], 80.2% to 88.9%) to 95.1% (95% CI, 89.0% to 97.9%), while negative percent agreement values ranged from 99.5% (95% CI, 99.1% to 99.7%) to 99.8% (95% CI, 99.5% to 99.9%). Additional testing of discordant targets (12%; 349/2,908) confirmed the results of ePlex RP for 38% (131/349) of samples tested. Reproducibility was 100% for all targets tested, with the exception of adenovirus, for which reproducibilities were 91.6% at low virus concentrations and 100% at moderate virus concentrations. The ePlex RP panel offers a new, rapid, and sensitive “sample-to-answer” multiplex panel for the detection of the most common viral and bacterial respiratory pathogens. PMID:29212701
Garg, Amit; Biello, Katie; Hoot, Joyce W; Reddy, Shalini B; Wilson, Lindsay; George, Paul; Robinson-Bostom, Leslie; Belazarian, Leah; Domingues, Erik; Powers, Jennifer; Jacob, Reza; Powers, Michael; Besen, Justin; Geller, Alan C
2015-12-01
Assessing medical students on core skills related to melanoma detection is challenging in the absence of a well-developed instrument. We sought to develop an objective structured clinical examination for the detection and evaluation of melanoma among medical students. This was a prospective cohort analysis of student and objective rater agreement on performance of clinical skills and assessment of differences in performance across 3 schools. Kappa coefficients indicated excellent agreement for 3 of 5 core skills including commenting on the presence of the moulage (k = 0.87, 95% confidence interval 0.77-0.96), obtaining a history for the moulage (k = 0.84, 95% confidence interval 0.74-0.94), and making a clinical impression (k = 0.80, 95% confidence interval 0.68-0.92). There were no differences in performance across schools with respect to 3 of 5 core skills: commenting on the presence of the moulage (P = .15), initiating a history (P = .53), and managing the suspicious lesion (P value range .07-.17). Overall, 54.2% and 44.7% of students commented on the presence of the moulage and achieved maximum performance of core skills, respectively, with no difference in performance across schools. Limitations include overall sample size of students and schools. The Skin Cancer Objective Structured Clinical Examination represents a potentially important instrument to measure students' performance on the optimal step-by-step evaluation of a melanoma. Copyright © 2015 American Academy of Dermatology, Inc. Published by Elsevier Inc. All rights reserved.
Decision-directed detector for overlapping PCM/NRZ signals.
NASA Technical Reports Server (NTRS)
Wang, C. D.; Noack, T. L.
1973-01-01
A decision-directed (DD) technique for the detection of overlapping PCM/NRZ signals in the presence of white Gaussian noise is investigated. The performance of the DD detector is represented by probability of error Pe versus input signal-to-noise ratio (SNR). To examine how much improvement in performance can be achieved with this technique, Pe's with and without DD feedback are evaluated in parallel. Further, analytical results are compared with those found by Monte Carlo simulations. The results are in good agreement.
NASA Astrophysics Data System (ADS)
Bhardwaj, Manish; McCaughan, Leon; Olkhovets, Anatoli; Korotky, Steven K.
2006-12-01
We formulate an analytic framework for the restoration performance of path-based restoration schemes in planar mesh networks. We analyze various switch architectures and signaling schemes and model their total restoration interval. We also evaluate the network global expectation value of the time to restore a demand as a function of network parameters. We analyze a wide range of nominally capacity-optimal planar mesh networks and find our analytic model to be in good agreement with numerical simulation data.
van Egdom, L S E; Lagendijk, M; Heijkoop, E H M; Koning, A H J; van Deurzen, C H M; Jager, A; van Lankeren, W; Koppert, L B
2018-07-01
Accurate measurement of tumour response during and after neoadjuvant chemotherapy (NAC) is important and may influence treatment decisions in invasive breast cancer patients. Breast MRI forms the gold standard but is more burdensome, time consuming and costly. In this study response measurement was done with 3-D ultrasound by Automated Breast Volume Scanner (ABVS) and compared to breast MRI. Moreover, patient satisfaction with both techniques was compared. A single-institution, prospective observational pilot study evaluating tumour response by ABVS in addition to breast MRI (standard care) was performed in 25 invasive breast cancer patients receiving NAC. Tumour response was evaluated comparing longest tumour diameters as well as tumour volumes at predefined time points using Bland-Altman analysis. Volume measurements for breast MRI were obtained using a fully immersive virtual reality system (a Barco I-Space) and V-Scope software. Same software was used to obtain ABVS volume measurements using an in-house developed desktop VR system. Inter- and intra-observer agreement was evaluated by Intraclass Correlation Coefficient (ICC). Twenty-five patients were eligible for baseline measurement, 20 for a mid-NAC response evaluation, and five for a post-NAC response evaluation. MRI and ABVS showed absolute concordance in 73% of patients for the mid-NAC evaluation, with a 'good' correlation for the difference in longest diameter measurement (ICC 0.73, p < 0.01) as compared to baseline assessment. Concerning difference in volume measurement in the mid-NAC response evaluation showed a 'fair' correlation (ICC 0.52, p < 0.01) and in the post-NAC response evaluation an 'excellent' correlation (ICC 0.98, p < 0.01). 'Excellent' inter- and intra-observer agreement was found (ICC 0.88, p < 0.01) with comparable limits of agreement (LOA) for observer 1 and 2 in both diameter and volume measurement. Patient satisfaction was higher for ABVS compared to breast MRI, 93% versus 12% respectively. ABVS showed 'good' correlation with MRI tumour response evaluation in breast cancer patients during NAC with 'excellent' inter- and intra-observer agreement. ABVS has patients' preference over breast MRI and could be considered as alternative to breast MRI, in case results on an on-going prospective trial confirm these results (NTR6799). Copyright © 2018 Elsevier B.V. All rights reserved.
O'Connor, Francis G; Johnson, Jeremy D; Chapin, Mark; Oriscello, Ralph G; Taylor, Dean C
2005-05-01
To evaluate the interobserver agreement between physicians regarding a abnormal cardiovascular assessment on athletic preparticipation examinations. Cross-sectional clinical survey. Outpatient Clinic, United States Military Academy, West Point, NY. We randomly selected 101 out of 539 cadet-athletes presenting for a preparticipation examination. Two primary care sports medicine fellows and a cardiologist examined the cadets. After obtaining informed consent from all participants, all 3 physicians separately evaluated all 101 cadets. The physicians recorded their clinical findings and whether they thought further cardiovascular evaluation (echocardiography) was indicated. Rate of referral for further cardiovascular evaluation, clinical agreement between sports medicine fellows, and clinical agreement between sports medicine fellows and the cardiologist. Each fellow referred 6 of the 101 evaluated cadets (5.9%). The cardiologist referred none. Although each fellow referred 6 cadets, only 1 cadet was referred by both. The kappa statistic for clinical agreement between fellows is 0.114 (95% CI, -0.182 to 0.411). There was no clinical agreement between the fellows and the cardiologist. This pilot study reveals a low level of agreement between physicians regarding which athletes with an abnormal examination deserved further testing. It challenges the standard of care and questions whether there is a need for improved technologies or improved training in cardiovascular clinical assessment.
Di Rezze, Briano; Rosenbaum, Peter; Zwaigenbaum, Lonnie; Hidecker, Mary Jo Cooley; Stratford, Paul; Cousins, Martha; Camden, Chantal; Law, Mary
2016-09-01
Impairments in social communication are the hallmark of autism spectrum disorder (ASD). Operationalizing 'severity' in ASD has been challenging; thus, stratifying by functioning has not been possible. The purpose of this study is to describe the development of the Autism Classification System of Functioning: Social Communication (ACSF:SC) and to evaluate its consistency within and between parent and professional ratings. (1) ACSF:SC development based on focus groups and surveys involving parents, educators, and clinicians familiar with preschoolers with ASD; and (2) evaluation of the intra- and interrater agreement of the ACSF:SC using weighted kappa (кw ). Seventy-six participants were involved in the development process. Core characteristics of social communication were ascertained: communicative intent; communicative skills and reciprocity; and impact of environment. Five ACSF:SC levels were created and content-validated across participants. Best capacity and typical performance agreement ratings varied as follows: intrarater agreement on 41 children was кw =0.61 to 0.69 for parents, and кw =0.71 to 0.95 for professionals; interrater agreement between professionals was кw =0.47 to 0.61, and between parents and professionals was кw =0.33 to 0.53. Perspectives from parents and professionals informed ACSF:SC development, providing common descriptions of the levels of everyday communicative abilities of children with ASD to complement the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition. Rater agreement demonstrates that the ACSF:SC can be used with acceptable consistency compared with other functional classification systems. © 2016 Mac Keith Press.
Olofsson, Per; Norén, Håkan; Carlsson, Ann
2018-02-01
The updated intrapartum cardiotocography (CTG) classification system by FIGO in 2015 (FIGO2015) and the FIGO2015-approached classification by the Swedish Society of Obstetricians and Gynecologist in 2017 (SSOG2017) are not harmonized with the fetal ECG ST analysis (STAN) algorithm from 2007 (STAN2007). The study aimed to reveal homogeneity and agreement between the systems in classifying CTG and ST events, and relate them to maternal and perinatal outcomes. Among CTG traces with ST events, 100 traces originally classified as normal, 100 as suspicious and 100 as pathological were randomly selected from a STAN database and classified by two experts in consensus. Homogeneity and agreement statistics between the CTG classifications were performed. Maternal and perinatal outcomes were evaluated in cases with clinically hidden ST data (n = 151). A two-tailed p < 0.05 was regarded as significant. For CTG classes, the heterogeneity was significant between the old and new systems, and agreements were moderate to strong (proportion of agreement, kappa index 0.70-0.86). Between the new classifications, heterogeneity was significant and agreements strong (0.90, 0.92). For significant ST events, heterogeneities were significant and agreements moderate to almost perfect (STAN2007 vs. FIGO2015 0.86, 0.72; STAN2007 vs. SSOG2017 0.92, 0.84; FIGO2015 vs. SSOG2017 0.94, 0.87). Significant ST events occurred more often combined with STAN2007 than with FIGO2015 classification, but not with SSOG2017; correct identification of adverse outcomes was not significantly different between the systems. There are discrepancies in the classification of CTG patterns and significant ST events between the old and new systems. The clinical relevance of the findings remains to be shown. © 2017 The Authors. Acta Obstetricia et Gynecologica Scandinavica published by John Wiley & Sons Ltd on behalf of Nordic Federation of Societies of Obstetrics and Gynecology (NFOG).
Zhang, Liping; Zhu, Shaohui; Tang, Shanyu
2017-03-01
Telecare medicine information systems (TMIS) provide flexible and convenient e-health care. However, the medical records transmitted in TMIS are exposed to unsecured public networks, so TMIS are more vulnerable to various types of security threats and attacks. To provide privacy protection for TMIS, a secure and efficient authenticated key agreement scheme is urgently needed to protect the sensitive medical data. Recently, Mishra et al. proposed a biometrics-based authenticated key agreement scheme for TMIS by using hash function and nonce, they claimed that their scheme could eliminate the security weaknesses of Yan et al.'s scheme and provide dynamic identity protection and user anonymity. In this paper, however, we demonstrate that Mishra et al.'s scheme suffers from replay attacks, man-in-the-middle attacks and fails to provide perfect forward secrecy. To overcome the weaknesses of Mishra et al.'s scheme, we then propose a three-factor authenticated key agreement scheme to enable the patient to enjoy the remote healthcare services via TMIS with privacy protection. The chaotic map-based cryptography is employed in the proposed scheme to achieve a delicate balance of security and performance. Security analysis demonstrates that the proposed scheme resists various attacks and provides several attractive security properties. Performance evaluation shows that the proposed scheme increases efficiency in comparison with other related schemes.
Xu, Xin; Zhu, Ping; Wen, Qiaoyan; Jin, Zhengping; Zhang, Hua; He, Lian
2014-01-01
In the field of the Telecare Medicine Information System, recent researches have focused on consummating more convenient and secure healthcare delivery services for patients. In order to protect the sensitive information, various attempts such as access control have been proposed to safeguard patients' privacy in this system. However, these schemes suffered from some certain security defects and had costly consumption, which were not suitable for the telecare medicine information system. In this paper, based on the elliptic curve cryptography, we propose a secure and efficient two-factor mutual authentication and key agreement scheme to reduce the computational cost. Such a scheme enables to provide the patient anonymity by employing the dynamic identity. Compared with other related protocols, the security analysis and performance evaluation show that our scheme overcomes some well-known attacks and has a better performance in the telecare medicine information system.
OSA severity assessment based on sleep breathing analysis using ambient microphone.
Dafna, E; Tarasiuk, A; Zigel, Y
2013-01-01
In this paper, an audio-based system for severity estimation of obstructive sleep apnea (OSA) is proposed. The system estimates the apnea-hypopnea index (AHI), which is the average number of apneic events per hour of sleep. This system is based on a Gaussian mixture regression algorithm that was trained and validated on full-night audio recordings. Feature selection process using a genetic algorithm was applied to select the best features extracted from time and spectra domains. A total of 155 subjects, referred to in-laboratory polysomnography (PSG) study, were recruited. Using the PSG's AHI score as a gold-standard, the performances of the proposed system were evaluated using a Pearson correlation, AHI error, and diagnostic agreement methods. Correlation of R=0.89, AHI error of 7.35 events/hr, and diagnostic agreement of 77.3% were achieved, showing encouraging performances and a reliable non-contact alternative method for OSA severity estimation.
FY2011 Progress Report: Agreement 8697 - NOx Sensor Development
DOE Office of Scientific and Technical Information (OSTI.GOV)
Woo, L Y; Glass, R S
Objectives are: (1) Develop an inexpensive, rapid-response, high-sensitivity and selective electrochemical sensor for oxides of nitrogen (NO{sub x}) for compression-ignition, direct-injection (CIDI) OBD II systems; (2) Explore and characterize novel, effective sensing methodologies based on impedance measurements and designs and manufacturing methods that are compatible with mass fabrication; and (3) Collaborate with industry in order to (ultimately) transfer the technology to a supplier for commercialization. Approach used is: (1) Use an ionic (O{sup 2-}) conducting ceramic as a solid electrolyte and metal or metal-oxide electrodes; (2) Correlate NO{sub x} concentration with changes in cell impedance; (3) Evaluate sensing mechanisms andmore » aging effects on long-term performance using electrochemical techniques; and (4) Collaborate with Ford Research Center to optimize sensor performance and perform dynamometer and on-vehicle testing. Work in FY2011 focused on using an algorithm developed in FY2010 in a simplified strategy to demonstrate how data from controlled laboratory evaluation could be applied to data from real-world engine testing. The performance of a Au wire prototype sensor was evaluated in the laboratory with controlled gas compositions and in dynamometer testing with diesel exhaust. The laboratory evaluation indicated a nonlinear dependence of the NO{sub x} and O{sub 2} sensitivity with concentration. For both NO{sub x} and O{sub 2}, the prototype sensor had higher sensitivity at concentrations less than {approx}20 ppm and {approx}7%, respectively, compared to lower NO{sub x} and O{sub 2} sensitivity at concentrations greater than {approx}50 ppm and {approx}10.5%, respectively. Results in dynamometer diesel exhaust generally agreed with the laboratory results. Diesel exhaust after-treatment systems will likely require detection levels less than {approx}20 ppm in order to meet emission regulations. The relevant mathematical expressions for sensitivity in different concentration regimes obtained from bench-level laboratory evaluation were used to adjust the sensor signal in dynamometer testing. Both NO{sub x} and O{sub 2} exhibited non-linear responses over the concentration regimes examined (0-100 ppm for NO{sub x} and 4-7% for O{sub 2}). Adjusted sensor signals had better agreement with both a commercial NO{sub x} sensor and FTIR measurements. However, the lack of complete agreement indicated that it was not possible to completely account for the nonlinear sensor behavior in certain concentration regimes. The agreement at lower NO{sub x} levels (less than 20 ppm) was better than at higher levels (50-100 ppm). Other progress in FY2011 included dynamometer testing of sensors with imbedded heaters and protective housings that were mounted directly into the exhaust manifold. Advanced testing protocols were used to evaluate the sensors. These experiments confirmed the potential for sensor robustness and durability. Advanced material processing methods appropriate for mass manufacturing, such as sputtering, are also being evaluated. A major milestone for this past year was the licensing of the LLNL NO{sub x} sensor technology to EmiSense Technologies, LLC. EmiSense has extensive experience and resources for the development of emission control sensors. A CRADA is in development that will allow LLNL to work in partnership with EmiSense to bring the LLNL NO{sub x} sensor technology to commercialization. Ford Motor Company is also a partner in this effort.« less
OAO-3 end of mission power subsystem evaluation
NASA Technical Reports Server (NTRS)
Tasevoli, M.
1982-01-01
End of mission tests were performed on the OAO-3 power subsystem in three component areas: solar array, nickel-cadmium batteries and the On-Board Processor (OBP) power boost operation. Solar array evaluation consisted of analyzing array performance characteristics and comparing them to earlier flight data. Measured solar array degradation of 14.1 to 17.7% after 8 1/3 years is in good agreement with theortical radiation damage losses. Battery discharge characteristics were compared to results of laboratory life cycle tests performed on similar cells. Comparison of cell voltage profils reveals close correlation and confirms the validity of real time life cycle simulation. The successful operation of the system in the OBP/power boost regulation mode demonstrates the excellent life, reliability and greater system utilization of power subsystems using maximum power trackers.
Fosbøl, M; Reving, S; Petersen, E H; Rossing, P; Lajer, M; Zerahn, B
2017-01-01
To investigate whether inclusion of quantitative data on blood flow distribution compared with visual qualitative evaluation improve the reliability and diagnostic performance of 99 m Tc-hydroxymethylene diphosphate three-phase bone scintigraphy (TPBS) in patients suspected for charcot neuropathic osteoarthropathy (CNO) of the foot. A retrospective cohort study of TPBS performed on 148 patients with suspected acute CNO referred from a single specialized diabetes care centre. The quantitative blood flow distribution was calculated based on the method described by Deutsch et al. All scintigraphies were re-evaluated by independent, blinded observers twice with and without quantitative data on blood flow distribution at ankle and focus level, respectively. The diagnostic validity of TPBS was determined by subsequent review of clinical data and radiological examinations. A total of 90 patients (61%) had confirmed diagnosis of CNO. The sensitivity, specificity and accuracy of three-phase bone scintigraphy without/with quantitative data were 89%/88%, 58%/62% and 77%/78%, respectively. The intra-observer agreement improved significantly by adding quantitative data in the evaluation (Kappa value 0·79/0·94). The interobserver agreement was not significantly improved. Adding quantitative data on blood flow distribution in the interpretation of TBPS improves intra-observer variation, whereas no difference in interobserver variation was observed. The sensitivity of TPBS in the diagnosis of CNO is high, but holds limited specificity. Diagnostic performance does not improve using quantitative data in the evaluation. This may be due to the reference intervals applied in the study or the absence of a proper gold standard diagnostic procedure for comparison. © 2015 Scandinavian Society of Clinical Physiology and Nuclear Medicine. Published by John Wiley & Sons Ltd.
Rella, L; Telegrafo, M; Nardone, A; Milella, A; Stabile Ianora, A A; Lioce, M; Angelelli, G; Moschetta, M
2015-09-01
To evaluate the effect of post-mastectomy radiation therapy (RT) on breast implants as detected by magnetic resonance imaging (MRI) searching for short-term complications. One hundred and forty patients (total of 144 implants) were evaluated by MRI; 80 (group 1) had undergone RT, whereas the remaining 60 patients (group 2) underwent mastectomy with implant reconstruction without RT. Two radiologists evaluated MRI images searching for implant rupture signs, sub-capsular seromas, capsular contracture, soft-tissue oedema, peri-implant fluid collections. Implant ruptures were classified as severe complications; seromas and capsular contractures as moderate complications; oedema and fluid collections as mild complications. The prevalence of MRI findings in the two groups was calculated and compared by unpaired t-test. Cohen's kappa statistics was used to assess interobserver agreement. Sixty-nine out of 144 (48%) implants presented pathological findings at MRI with complication rates of 47.5 and 48.4 for groups 1 and 2, respectively. Two (5%) severe complications, 10 (26%) moderate complications, and 26 (69%) mild complications occurred in group 1 and surgical treatment was performed in 10 cases. Two (6%) severe complications, seven (23%) moderate complications, and 22 (71%) mild complications occurred in group 2 and surgical treatment was performed in eight cases. No significant difference between the two groups was found (p>0.1). Almost perfect agreement between the two radiologists was found for MRI image detection (k=0.86). RT does not seem to cause a significant effect on breast implants in terms of complication rate in patients undergoing implant-based breast reconstruction. One-stage immediate implant-based breast reconstruction performed at the same time as mastectomy could be proposed. Copyright © 2015 The Royal College of Radiologists. Published by Elsevier Ltd. All rights reserved.
Staging laparoscopy improves treatment decision-making for advanced gastric cancer.
Hu, Yan-Feng; Deng, Zhen-Wei; Liu, Hao; Mou, Ting-Yu; Chen, Tao; Lu, Xin; Wang, Da; Yu, Jiang; Li, Guo-Xin
2016-02-07
To evaluate the clinical value of staging laparoscopy in treatment decision-making for advanced gastric cancer (GC). Clinical data of 582 patients with advanced GC were retrospectively analyzed. All patients underwent staging laparoscopy. The strength of agreement between computed tomography (CT) stage, endoscopic ultrasound (EUS) stage, laparoscopic stage, and final stage were determined by weighted Kappa statistic (Kw). The number of patients with treatment decision-changes was counted. A χ(2) test was used to analyze the correlation between peritoneal metastasis or positive cytology and clinical characteristics. Among the 582 patients, the distributions of pathological T classifications were T2/3 (153, 26.3%), T4a (262, 45.0%), and T4b (167, 28.7%). Treatment plans for 211 (36.3%) patients were changed after staging laparoscopy was performed. Two (10.5%) of 19 patients in M1 regained the opportunity for potential radical resection by staging laparoscopy. Unnecessary laparotomy was avoided in 71 (12.2%) patients. The strength of agreement between preoperative T stage and final T stage was in almost perfect agreement (Kw = 0.838; 95% confidence interval (CI): 0.803-0.872; P < 0.05) for staging laparoscopy; compared with CT and EUS, which was in fair agreement. The strength of agreement between preoperative M stage and final M stage was in almost perfect agreement (Kw = 0.990; 95% CI: 0.977-1.000; P < 0.05) for staging laparoscopy; compared with CT, which was in slight agreement. Multivariate analysis revealed that tumor size (≥ 40 mm), depth of tumor invasion (T4b), and Borrmann type (III or IV) were significantly correlated with either peritoneal metastasis or positive cytology. The best performance in diagnosing P-positive was obtained when two or three risk factors existed. Staging laparoscopy can improve treatment decision-making for advanced GC and decrease unnecessary exploratory laparotomy.
Perinetti, G; Perillo, L; Franchi, L; Di Lenarda, R; Contardo, L
2014-11-01
Diagnostic agreement on individual basis between the third middle phalanx maturation (MPM) method and the cervical vertebral maturation (CVM) method has conjecturally been based mainly on overall correlation analyses. Herein, the true agreement between methods according to stage and sex has been evaluated through a comprehensive diagnostic performance analysis. Four hundred and fifty-one Caucasian subjects were included in the study, 231 females and 220 males (mean age, 12.2 ± 2.5 years; range, 7.0-17.9 years). The X-rays of the middle phalanx of the third finger and the lateral cephalograms were examined for staging by blinded operators, blinded for MPM stages and subjects' age. The MPM and CVM methods based on six stages, two pre-pubertal (1 and 2), two pubertal (3 and 4), and two post-pubertal (5 and 6), were considered. Specifically, for each MPM stage, the diagnostic performance in the identification of the corresponding CVM stage was described by Bayesian statistics. For both sexes, overall agreement was 77.6%. Most of the disagreement was due to 1 stage apart. Slight disagreement was seen for the stages 5 and 6, where the third middle phalanx shows an earlier maturation. The two maturational methods show an overall satisfactorily diagnostic agreement. However, at post-pubertal stages, the middle phalanx of the third finger appears to mature earlier than the cervical vertebrae. Post-pubertal growth phase should thus be based on the presence of stage 6 in MPM. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Báez-Saldaña, Renata; López-Arteaga, Yesenia; Bizarrón-Muro, Alma; Ferreira-Guerrero, Elizabeth; Ferreyra-Reyes, Leticia; Delgado-Sánchez, Guadalupe; Cruz-Hervert, Luis Pablo; Mongua-Rodríguez, Norma; García-García, Lourdes
2013-01-01
Despite chemotherapy, patients with cured pulmonary tuberculosis may result in lung functional impairment. To evaluate a novel scoring system based on the degree of radiographic abnormalities and related spirometric values in patients with cured pulmonary tuberculosis. One hundred and twenty seven patients with cured pulmonary tuberculosis were prospectively enrolled in a referral hospital specializing in respiratory diseases. Spirometry was performed and the extent of radiographic abnormalities was evaluated twice by each of two readers to generate a novel quantitative score. Scoring reproducibility was analyzed by the intra-class correlation coefficient (ICC) and the Bland-Altman method. Multiple linear regression models were performed to assess the association of the extent of radiographic abnormalities with spirometric values. The intra-observer agreement for scoring of radiographic abnormalities (SRA) showed an ICC of 0.81 (CI:95%, 0.67-0.95) and 0.78 (CI:95%, 0.65-0.92), for reader 1 and 2, respectively. Inter-observer reproducibility for the first measurement was 0.83 (CI:95%, 0.71-0.95), and for the second measurement was 0.74 (CI:95%, 0.58-0.90). The Bland-Altman analysis of the intra-observer agreement showed a mean bias of 0.87% and -0.55% and an inter-observer agreement of -0.35% and -1.78%, indicating a minor average systematic variability. After adjustment for age, gender, height, smoking status, pack-years of smoking, and degree of dyspnea, the scoring degree of radiographic abnormalities was significantly and negatively associated with absolute and percent predicted values of FVC: -0.07 (CI:95%, -0.01 to -0.04); -2.48 (CI:95%, -3.45 to -1.50); and FEV1 -0.07 (CI:95%, -0.10 to -0.05); -2.92 (CI:95%, -3.87 to -1.97) respectively, in the patients studied. The extent of radiographic abnormalities, as evaluated through our novel scoring system, was inversely associated with spirometric values, and exhibited good reliability and reproducibility. As intra-observer and inter-observer agreement of the SRA varied from good to excellent, the use of SRA in this setting appears acceptable.
Báez-Saldaña, Renata; López-Arteaga, Yesenia; Bizarrón-Muro, Alma; Ferreira-Guerrero, Elizabeth; Ferreyra-Reyes, Leticia; Delgado-Sánchez, Guadalupe; Cruz-Hervert, Luis Pablo; Mongua-Rodríguez, Norma; García-García, Lourdes
2013-01-01
Background Despite chemotherapy, patients with cured pulmonary tuberculosis may result in lung functional impairment. Objective To evaluate a novel scoring system based on the degree of radiographic abnormalities and related spirometric values in patients with cured pulmonary tuberculosis. Methods One hundred and twenty seven patients with cured pulmonary tuberculosis were prospectively enrolled in a referral hospital specializing in respiratory diseases. Spirometry was performed and the extent of radiographic abnormalities was evaluated twice by each of two readers to generate a novel quantitative score. Scoring reproducibility was analyzed by the intra-class correlation coefficient (ICC) and the Bland-Altman method. Multiple linear regression models were performed to assess the association of the extent of radiographic abnormalities with spirometric values. Results The intra-observer agreement for scoring of radiographic abnormalities (SRA) showed an ICC of 0.81 (CI:95%, 0.67–0.95) and 0.78 (CI:95%, 0.65–0.92), for reader 1 and 2, respectively. Inter-observer reproducibility for the first measurement was 0.83 (CI:95%, 0.71–0.95), and for the second measurement was 0.74 (CI:95%, 0.58–0.90). The Bland-Altman analysis of the intra-observer agreement showed a mean bias of 0.87% and -0.55% and an inter-observer agreement of -0.35% and -1.78%, indicating a minor average systematic variability. After adjustment for age, gender, height, smoking status, pack-years of smoking, and degree of dyspnea, the scoring degree of radiographic abnormalities was significantly and negatively associated with absolute and percent predicted values of FVC: -0.07 (CI:95%, -0.01 to -0.04); -2.48 (CI:95%, -3.45 to -1.50); and FEV1 -0.07 (CI:95%, -0.10 to -0.05); -2.92 (CI:95%, -3.87 to -1.97) respectively, in the patients studied. Conclusion The extent of radiographic abnormalities, as evaluated through our novel scoring system, was inversely associated with spirometric values, and exhibited good reliability and reproducibility. As intra-observer and inter-observer agreement of the SRA varied from good to excellent, the use of SRA in this setting appears acceptable. PMID:24223865
Galvis-Marín, Juan Camilo; Rodríguez-Bocanegra, María Ximena; Pulido-Villamarín, Adriana Del Pilar; Castañeda-Salazar, Rubiela; Celis-Ramírez, Adriana Marcela; Linares-Linares, Melva Yomary
Malassezia furfur is a human skin commensal yeast that can cause skin and opportunistic systemic infections. Given its lipid dependant status, the reference methods established by the Clinical and Laboratory Standards Institute (CLSI) to evaluate antifungal susceptibility in yeasts are not applicable. To evaluate the in vitro susceptibility of M. furfur isolates from infections in humans to antifungals of clinical use. The susceptibility profile to amphotericin B, itraconazole, ketoconazole and voriconazole of 20 isolates of M. furfur, using the broth microdilution method (CLSI M27-A3) and Etest ® , was evaluated. Itraconazole and voriconazole had the highest antifungal activity against the isolates tested. The essential agreement between the two methods for azoles antifungal activity was in the region of 60-85% and the categorical agreement was around 70-80%, while the essential and categorical agreement for amphotericin B was 10%. The azoles were the compounds that showed the highest antifungal activity against M. furfur, as determined by the two techniques used; however more studies need to be performed to support that Etest ® is a reliable method before its implementation as a routine clinical laboratory test. Copyright © 2016 Asociación Española de Micología. Publicado por Elsevier España, S.L.U. All rights reserved.
Gabriel, Guilherme Paiva; Chiquetto, Letícia; Morcillo, André Moreno; Ferreira, Maria do Carmo; Bazan, Ivan Gilberto M.; Daolio, Luísa Dias; Lemos, Jéssica J. Rocha; Carniel, Emília de Faria
2014-01-01
Objective: To assess the completeness and reliability of the Information System on Live Births (Sinasc) data. Methods: A cross-sectional analysis of the reliability and completeness of Sinasc's data was performed using a sample of Live Birth Certificate (LBC) from 2009, related to births from Campinas, Southeast Brazil. For data analysis, hospitals were grouped according to category of service (Unified National Health System, private or both), 600 LBCs were randomly selected and the data were collected in LBC-copies through mothers and newborns' hospital records and by telephone interviews. The completeness of LBCs was evaluated, calculating the percentage of blank fields, and the LBCs agreement comparing the originals with the copies was evaluated by Kappa and intraclass correlation coefficients. Results: The percentage of completeness of LBCs ranged from 99.8%-100%. For the most items, the agreement was excellent. However, the agreement was acceptable for marital status, maternal education and newborn infants' race/color, low for prenatal visits and presence of birth defects, and very low for the number of deceased children. Conclusion: The results showed that the municipality Sinasc is reliable for most of the studied variables. Investments in training of the professionals are suggested in an attempt to improve system capacity to support planning and implementation of health activities for the benefit of maternal and child population. PMID:25479847
Morphological Deficits of Children with SLI: Evaluation of Number Marking and Agreement.
ERIC Educational Resources Information Center
Rice, Mabel L.; Oetting, Janna B.
1993-01-01
Grammatical deficits (e.g., missing feature, surface account, and missing agreement) reported for children with specific language impairment (SLI) were evaluated in spontaneous language transcripts from 108 preschool children. Results indicated that children with SLI do control number marking but find number agreement across clausal boundaries…
Burggraaff, Coreline N; Cornelisse, Alexander C; Hoekstra, Otto S; Lugtenburg, Pieternella J; de Keizer, Bart; Arens, Anne I J; Celik, Filiz; Huijbregts, Julia E; De Vet, Henrica C W; Zijlstra, Josee M
2018-05-04
We aimed to assess the interobserver agreement of Interim PET (I-PET) and End-of-Treatment PET (EoT-PET) using the Deauville 5-point scale (DS) in first-line DLBCL patients. Methods: I-PET and EoT-PET scans of DLBCL patients were performed in the HOVON84 study (2007-2012), an international multicenter randomized controlled trial. Patients received R-CHOP14 and were randomized to receive rituximab intensification in the first 4 cycles or not. I-PET was made after 4 cycles (for observational purposes), and EoT-PET scan after 6 or 8 cycles. Two independent central reviewers retrospectively scored all scans according to the DS-system, blinded to clinical outcomes. Results were dichotomised as 'negative' (DS: 1-3) or 'positive' (DS: 4-5). Besides percentage overall agreement we calculated agreement for positive and negative scores, expressed as positive agreement (PA) and negative agreement (NA), respectively. Results: 465 I-PET and 457 EoT-PET scans were centrally reviewed; baseline 18 F-FDG PET(/CT) was available in 75-77%, and CT in the remaining cases. Percentage overall agreement for I-PET and EoT-PET were 87.7% and 91.7% ( P =0.049), with NA of 92.0% and 95.0% ( P =0.091), and PA of 73.7% and 76.3% ( P =0.656), respectively. Conclusion: Interobserver agreement using DS in DLBCL patients in I-PET and EoT-PET yields high overall and negative agreement. The lower positive agreement suggests that EoT-PET/CT treatment evaluation in daily practice and I-PET adapted trials may benefit from dual reads and central review, respectively. Copyright © 2018 by the Society of Nuclear Medicine and Molecular Imaging, Inc.
Comparison of LiDAR- and photointerpretation-based estimates of canopy cover
Demetrios Gatziolis
2012-01-01
An evaluation of the agreement between photointerpretation- and LiDARbased estimates of canopy cover was performed using 397 90 x 90 m reference areas in Oregon. It was determined that at low canopy cover levels LiDAR estimates tend to exceed those from photointerpretation and that this tendency reverses at high canopy cover levels. Characteristics of the airborne...
Ruiz, P.; Zerolo, F. J.; Casal, M. J.
2000-01-01
The ESP Culture System II was evaluated for its capacity to test the susceptibility of 389 cultures of Mycobacterium tuberculosis to streptomycin, rifampin, ethambutol, and isoniazid. Good agreement with results with the BACTEC TB 460 was found. ESP II is a reliable, rapid, and automated method for performing susceptibility testing. PMID:11101619
Land, Sally; Zhou, Julian; Cunningham, Philip; Sohn, Annette H; Singtoroj, Thida; Katzenstein, David; Mann, Marita; Sayer, David; Kantor, Rami
2013-01-01
Background The TREAT Asia Quality Assessment Scheme (TAQAS) was developed as a quality assessment programme through expert education and training, for laboratories in the Asia-Pacific and Africa that perform HIV drug-resistance (HIVDR) genotyping. We evaluated the programme performance and factors associated with high-quality HIVDR genotyping. Methods Laboratories used their standard protocols to test panels of human immunodeficiency virus (HIV)-positive plasma samples or electropherograms. Protocols were documented and performance was evaluated according to a newly developed scoring system, agreement with panel-specific consensus sequence, and detection of drug-resistance mutations (DRMs) and mixtures of wild-type and resistant virus (mixtures). High-quality performance was defined as detection of ≥95% DRMs. Results Over 4.5 years, 23 participating laboratories in 13 countries tested 45 samples (30 HIV-1 subtype B; 15 non-B subtypes) in nine panels. Median detection of DRMs was 88–98% in plasma panels and 90–97% in electropherogram panels. Laboratories were supported to amend and improve their test outcomes as appropriate. Three laboratories that detected <80% DRMs in early panels demonstrated subsequent improvement. Sample complexity factors – number of DRMs (p<0.001) and number of DRMs as mixtures (p<0.001); and laboratory performance factors – detection of mixtures (p<0.001) and agreement with consensus sequence (p<0.001), were associated with high performance; sample format (plasma or electropherogram), subtype and genotyping protocol were not. Conclusion High-quality HIVDR genotyping was achieved in the TAQAS collaborative laboratory network. Sample complexity and detection of mixtures were associated with performance quality. Laboratories conducting HIVDR genotyping are encouraged to participate in quality assessment programmes. PMID:23845227
De Meulenaere, Astrid; Vermassen, Tijl; Creytens, David; Aspeslagh, Sandrine; Deron, Philippe; Duprez, Frederic; Rottey, Sylvie; Van Dorpe, Jo; Ferdinande, Liesbeth
2018-05-16
A great deal of research is being conducted into PD-L1 immunohistochemistry (IHC) and tumor infiltrating lymphocytes (TILs) as predictive or prognostic biomarkers for immunotherapy, although several practical issues exist concerning their assessment. The aim of this research was therefore to assess the importance of choice of materials and methods in PD-L1 and TILs scoring in oropharyngeal squamous cell carcinoma (OSCC). IHC for PD-L1 (SP142 and 22C3 clone) and TILs subtyping was performed on formalin-fixed paraffin-embedded tissue slides (biopsy, resection and/or lymph nodes specimens) of 99 patients with OSCC. A comparative analysis of PD-L1 and TILs scoring was made between different types of tissue specimens, between different PD-L1 clones, between TILs and different subsets of TILs, and between the quantitative and semi-quantitative assessment. PD-L1 scoring resulted in fair to moderate agreement for 22C3 and SP142 between various tissue specimens, with higher agreement at higher cut-off values, and in moderate agreement for 22C3 versus SP142. Evaluation by four independent observers proved substantial inter-rater agreement for both clones with high consistency in their ratings. Moderate agreement was observed for TILs and TILs subsets for the comparison between biopsy and resection. Lastly, strong correlations were found between quantitative and semi-quantitative assessment for all PD-L1 and TILs scores. Our results highlight the challenges associated with the evaluation of PD-L1 and TILs in OSCC. Further research is warranted to evaluate the use of these biomarkers in order to allow implementation of PD-L1 and TILs infiltrate as biomarkers in daily clinical practice. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Chang, Shang-Jen; Yang, Stephen S D
2008-12-01
To evaluate the inter-observer and intra-observer agreement on the interpretation of uroflowmetry curves of children. Healthy kindergarten children were enrolled for evaluation of uroflowmetry. Uroflowmetry curves were classified as bell-shaped, tower, plateau, staccato and interrupted. Only the bell-shaped curves were regarded as normal. Two urodynamists evaluated the curves independently after reviewing the definitions of the different types of uroflowmetry curve. The senior urodynamist evaluated the curves twice 3 months apart. The final conclusion was made when consensus was reached. Agreement among observers was analyzed using kappa statistics. Of 190 uroflowmetry curves eligible for analysis, the intra-observer agreement in interpreting each type of curve and interpreting normalcy vs abnormality was good (kappa=0.71 and 0.68, respectively). Very good inter-observer agreement (kappa=0.81) on normalcy and good inter-observer agreement (kappa=0.73) on types of uroflowmetry were observed. Poor inter-observer agreement existed on the classification of specific types of abnormal uroflowmetry curves (kappa=0.07). Uroflowmetry is a good screening tool for normalcy of kindergarten children, while not a good tool to define the specific types of abnormal uroflowmetry.
Occlusion assessment of intracranial aneurysms treated with the WEB device.
Caroff, Jildaz; Mihalea, Cristian; Tuilier, Titien; Barreau, Xavier; Cognard, Christophe; Desal, Hubert; Pierot, Laurent; Arnoux, Armelle; Moret, Jacques; Spelle, Laurent
2016-09-01
The Woven EndoBridge (WEB) system is an innovative device under evaluation for its capacity to treat wide-neck bifurcation intracranial aneurysms. The purpose of this study is to evaluate the use of the different occlusion scales available in clinical practice. Seven WEB-experienced neurointerventionalists were provided with 30 angiographic follow-up data sets and asked to grade each evaluation point according to the Bicêtre Occlusion Scale Score (BOSS), firstly based on DSA images only then using additional C-Arm VasoCT analysis. This BOSS evaluation was then converted into the WEB Occlusion Scale (WOS) and into a dichotomized scale (complete occlusion or not). To estimate the inter-rater agreement among the seven raters, an overall kappa coefficient [1] and its standard error (SE) were computed. Using the five-grade BOSS, raters showed "moderate" agreement (kappa = 0.56). Using the three-grade WOS, agreement appeared slightly better (kappa = 0.59). Strongest inter-rater agreement was observed with a dichotomized version of the scale (complete occlusion or not), which enabled an "almost perfect" agreement (kappa = 0.88). VasoCT consistently enhanced the agreement particularly with regards depicting intra-WEB residual filling. The WOS is a consistent means to angiographically evaluate the WEB device efficiency. But the five-grade BOSS scale allows to identify aneurysm subgroups with differing risks of recurrence and/or rehemorrhage, which needs to be separated especially at the initial phase of evaluation of this innovative device. The additional use of VasoCT allows better inter-rater agreement in evaluating occlusion and specially in depicting intra-WEB persistent filling.
Kang, Hee-Chung; Hong, Jae-Seok
2011-08-16
With a greater emphasis on cost containment in many health care systems, it has become common to evaluate each physician's relative resource use. This study explored the major factors that influence the economic performance rankings of medical clinics in the Korea National Health Insurance (NHI) program by assessing the consistency between cost-efficiency indices constructed using different profiling criteria. Data on medical care benefit costs for outpatient care at medical clinics nationwide were collected from the NHI claims database. We calculated eight types of cost-efficiency index with different profiling criteria for each medical clinic and investigated the agreement between the decile rankings of each index pair using the weighted kappa statistic. The exclusion of pharmacy cost lowered agreement between rankings to the lowest level, and differences in case-mix classification also lowered agreement considerably. A medical clinic may be identified as either cost-efficient or cost-inefficient, even when using the same index, depending on the profiling criteria applied. Whether a country has a single insurance or a multiple-insurer system, it is very important to have standardized profiling criteria for the consolidated management of health care costs.
Developing and evaluating a target-background similarity metric for camouflage detection.
Lin, Chiuhsiang Joe; Chang, Chi-Chan; Liu, Bor-Shong
2014-01-01
Measurement of camouflage performance is of fundamental importance for military stealth applications. The goal of camouflage assessment algorithms is to automatically assess the effect of camouflage in agreement with human detection responses. In a previous study, we found that the Universal Image Quality Index (UIQI) correlated well with the psychophysical measures, and it could be a potentially camouflage assessment tool. In this study, we want to quantify the camouflage similarity index and psychophysical results. We compare several image quality indexes for computational evaluation of camouflage effectiveness, and present the results of an extensive human visual experiment conducted to evaluate the performance of several camouflage assessment algorithms and analyze the strengths and weaknesses of these algorithms. The experimental data demonstrates the effectiveness of the approach, and the correlation coefficient result of the UIQI was higher than those of other methods. This approach was highly correlated with the human target-searching results. It also showed that this method is an objective and effective camouflage performance evaluation method because it considers the human visual system and image structure, which makes it consistent with the subjective evaluation results.
Ellis, Jason D; Arendt, Susan W; Strohbehn, Catherine H; Meyer, Janell; Paez, Paola
2010-11-01
Food safety training has been the primary avenue for ensuring food workers are performing proper food handling practices and thus, serving safe food. Yet, knowledge of safe food handling practices does not necessarily result in actual performance of these practices. This research identified participating food service employees' level of agreement with four factors of motivation (internal motivations, communication, reward-punishment, and resources) and determined if respondents with different demographic characteristics reported different motivating factors. Data were collected from 311 food service employees who did not have any supervisory responsibilities. Intrinsic motivation agreement scores were consistently the highest of all four motivational factors evaluated and did not differ across any of the demographic characteristics considered. In contrast, motivation agreement scores for communication, reward-punishment, and resources did differ based on respondents' gender, age, place of employment, job status, food service experience, completion of food handler course, or possession of a food safety certification. In general, respondents agreed that these motivation factors influenced their likelihood to perform various safe food handling procedures. This research begins to illustrate how employees' demographic characteristics influence their responses to various motivators, helping to clarify the complex situation of ensuring safe food in retail establishments. Future research into why employee willingness to perform varies more for extrinsic motivation than for intrinsic motivation could assist food service managers in structuring employee development programs and the work environment, in a manner that aids in improving external motivation (communication, reward-punishment, and resources) and capitalizing on internal motivation.
NPAC-Nozzle Performance Analysis Code
NASA Technical Reports Server (NTRS)
Barnhart, Paul J.
1997-01-01
A simple and accurate nozzle performance analysis methodology has been developed. The geometry modeling requirements are minimal and very flexible, thus allowing rapid design evaluations. The solution techniques accurately couple: continuity, momentum, energy, state, and other relations which permit fast and accurate calculations of nozzle gross thrust. The control volume and internal flow analyses are capable of accounting for the effects of: over/under expansion, flow divergence, wall friction, heat transfer, and mass addition/loss across surfaces. The results from the nozzle performance methodology are shown to be in excellent agreement with experimental data for a variety of nozzle designs over a range of operating conditions.
Garimella, Sandilya V. B.; Ibrahim, Yehia. M.; Webb, Ian K.; ...
2015-08-19
The process of redirecting ions through 90° turns and ‘tee’ switches utilizing Structures for Lossless Ion Manipulations (SLIM) was evaluated using theoretical and simulation methods at 4 Torr pressure. SIMION simulations were used to optimize and evaluate conditions for performing turns without loss of signal intensity or ion mobility resolving power. Fundamental considerations indicated that the “race track” effect during ion turns may incur only small losses to the ion mobility resolving power at 4 Torr pressure for the typical plume widths predicted in an optimized SLIM ‘tee’ switch design. The dynamic switching of ions into orthogonal channels was alsomore » evaluated using SIMION ion trajectory simulations, and achieved similar performance. Simulation results were in close agreement with experimental results and were used to refine SLIM designs and applied potentials for their use.« less
Mortality Measures to Profile Hospital Performance for Patients With Septic Shock.
Walkey, Allan J; Shieh, Meng-Shiou; Liu, Vincent X; Lindenauer, Peter K
2018-04-30
Sepsis care is becoming a more common target for hospital performance measurement, but few studies have evaluated the acceptability of sepsis or septic shock mortality as a potential performance measure. In the absence of a gold standard to identify septic shock in claims data, we assessed agreement and stability of hospital mortality performance under different case definitions. Retrospective cohort study. U.S. acute care hospitals. Hospitalized with septic shock at admission, identified by either implicit diagnosis criteria (charges for antibiotics, cultures, and vasopressors) or by explicit International Classification of Diseases, 9th revision, codes. None. We used hierarchical logistic regression models to determine hospital risk-standardized mortality rates and hospital performance outliers. We assessed agreement in hospital mortality rankings when septic shock cases were identified by either explicit International Classification of Diseases, 9th revision, codes or implicit diagnosis criteria. Kappa statistics and intraclass correlation coefficients were used to assess agreement in hospital risk-standardized mortality and hospital outlier status, respectively. Fifty-six thousand six-hundred seventy-three patients in 308 hospitals fulfilled at least one case definition for septic shock, whereas 19,136 (33.8%) met both the explicit International Classification of Diseases, 9th revision, and implicit septic shock definition. Hospitals varied widely in risk-standardized septic shock mortality (interquartile range of implicit diagnosis mortality: 25.4-33.5%; International Classification of Diseases, 9th revision, diagnosis: 30.2-38.0%). The median absolute difference in hospital ranking between septic shock cohorts defined by International Classification of Diseases, 9th revision, versus implicit criteria was 37 places (interquartile range, 16-70), with an intraclass correlation coefficient of 0.72, p value of less than 0.001; agreement between case definitions for identification of outlier hospitals was moderate (kappa, 0.44 [95% CI, 0.30-0.58]). Risk-standardized septic shock mortality rates varied considerably between hospitals, suggesting that septic shock is an important performance target. However, efforts to profile hospital performance were sensitive to septic shock case definitions, suggesting that septic shock mortality is not currently ready for widespread use as a hospital quality measure.
Chen, Jian; Zhang, Yan-Ming; Song, Ze-Zhou; Fu, Yan-Fei; Geng, Yu
2018-04-10
The interobserver agreement in the assessment of the grade of carotid plaque neovascularization by contrast-enhanced ultrasonography is poorly established. We examined 140 carotid plaques in 66 patients (all patients had bilateral plaques, and 8 patients had 2 plaques on one side). We performed conventional and contrast-enhanced ultrasonography to analyze the presence of carotid plaque neovascularization, which was graded by two independent observers whose interobserver agreement (κ) was evaluated according to the thickness of carotid plaque. For all carotid plaques, the mean κ was 0.689 (95% confidence interval 0.604-0.774). It was 0.689 (0.569-0.808), 0.637 (0.487-0.787), and 0.740 (0.585-0.896), respectively for carotid plaques with maximal thickness <2 mm, from 2 mm to 3 mm, and >3 mm. The interobserver agreement for assessing carotid plaque neovascularization by using contrast-enhanced ultrasonography is substantial and acceptable for research purposes, regardless of the maximal thickness of the plaque. © 2018 Wiley Periodicals, Inc.
29 CFR 553.23 - Agreement or understanding prior to performance of work.
Code of Federal Regulations, 2010 CFR
2010-07-01
... 29 Labor 3 2010-07-01 2010-07-01 false Agreement or understanding prior to performance of work... understanding prior to performance of work. (a) General. (1) As a condition for use of compensatory time in lieu of overtime payment in cash, section 7(o)(2)(A) of the Act requires an agreement or understanding...
[Evaluation (assessment) of three tests for diagnosis of geohelmints in Colombia].
López, Myriam Consuelo; Moncada, Ligia Inés; Ariza-Araújo, Yoseth; Fernández-Niño, Julián Alfredo; Reyes, Patricia; Nicholls, Rubén Santiago
2013-01-01
Soil-transmitted helminth infections are considered a public health problem in developing countries. The diagnostic tests, both for individual parient diagnosis as for population studies should be evaluated in terms of validity and reliability. To compare the direct examination, the modified Ritchie-Frick method, a Kato-Katz designed by a Brazilian group and one designed by the WHO, for the diagnosis of soil-transmitted helminthes. A diagnostic test reliability study was performed. The same stool sample was analyzed by the same observer using four diagnostic tests. 204 samples were obtained, 194 of those fulfilled the inclusion criteria and were analyzed. The observers did not know the participants' identity neither the other tests results. For the analysis the Kato-Katz (WHO) was considered as the gold standard. For the reliability assessment percent agreement, positive percent agreement, Kappa statistic, and intraclass correlation were performed. The Brazilian Kato-Katz showed a good performance with high sensitivity and specificity for T. trichiura and Hookworm with values of 0.97 and 0.96 respectively, and a high specificity with mild sensitivity for A. lumbricoides (0.95 and 0.79) meanwhile the direct examination and the Ritche-Frick method showed a performance between mild and poor. The differences were higher for hookworm and Trichiuris trichiura than for Ascaris lumbricoides. The Brazilian Kato Katz test could be implemented, but further studies are needed to correlate its operative capacity with its feasibility, availability and cost.
Effect of Auditory-Perceptual Training With Natural Voice Anchors on Vocal Quality Evaluation.
Dos Santos, Priscila Campos Martins; Vieira, Maurílio Nunes; Sansão, João Pedro Hallack; Gama, Ana Cristina Côrtes
2018-01-10
To analyze the effects of auditory-perceptual training with anchor stimuli of natural voices on inter-rater agreement during the assessment of vocal quality. This is a quantitative nature study. An auditory-perceptual training site was developed consisting of Programming Interface A, an auditory training activity, and Programming Interface B, a control activity. Each interface had three stages: pre-training/pre-interval evaluation, training/interval, and post-training/post-interval evaluation. Two experienced evaluators classified 381 voices according to the GRBASI scale (G-grade, R-roughness, B-breathiness, A-asthenia, S-strain, I-instability). Voices were selected that received the same evaluation by both evaluators: 57 voices for evaluation and 56 for training were selected, with varying degrees of deviation across parameters. Fifteen inexperienced evaluators were then selected. In the pre-, post-training, pre-, and postinterval stages, evaluators listened to the voices and classified them via the GRBASI scale. In the stage interval evaluators read a text. In the stage training each parameter was trained separately. Evaluators analyzed the degrees of deviation of the GRBASI parameters based on anchor stimuli, and could only advance after correctly classifying the voices. To quantify inter-rater agreement and provide statistical analyses, the AC1 coefficient, confidence intervals, and percentage variation of agreement were employed. Except for the asthenia parameter, decreased agreement was observed in the control condition. Improved agreement was observed with auditory training, but this improvement did not achieve statistical significance. Training with natural voice anchors suggest an increased inter-rater agreement during perceptual voice analysis, potentially indicating that new internal references were established. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Trust Mines: Legal Documents and Settlements
Legal Documents and Settlements related to the Northern Abandoned Uranium Mines Region including the Phase 1 Settlement Agreement and Environmental Response Trust Agreement, Phase 2 Settlement Agreement Removal Site Evaluation (RSE) Trust Agreement.
Thermoelectric generator testing and RTG degradation mechanisms evaluation. Progress report No. 33
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lockwood, A.; Stapfer, G.
1979-12-01
The n-type selenide legs after 10,600 hours continue to show agreement with the 3M Co. published data. In the ingradient testing after 11,378 hours the n-legs show comparable performance to the reported 3M data. The new design p-type legs were placed on test. The remaining MHW generator on test Q1-A has accumulated 19,567 hours and performance remains stable. Three 18 couple modules S/N-1, 2, and 3 previously tested at RCA were received for JPL test and evaluation. S/N-1 has 1700 hours testing at JPL with results indistinguishable from those at the end of RCA testing in 1977. The performances ofmore » LES 8/9 generators are following DEGRA predications after 31,824 hours. The performance of the Voyager 1 and 2 RTGs is reported after 19,577 hours and 20,122 hours of operation, respectively.« less
Paquet, Anne-Laure; Valli, Vanina; Philippon, Anne-Laure; Devilliers, Catherine; Bloom, Ben; Hausfater, Pierre; Riou, Bruno; Freund, Yonathan
2018-04-01
In the emergency department (ED), lactate is routinely used for risk stratification. Whether venous or arterial lactate measured on blood gas is interchangeable is not known. We hypothesized that venous lactate can be used instead of arterial lactate for the evaluation of acute patients in the ED. This was a prospective single-center study. All patients requiring a lactate measurement were enrolled and we simultaneously drew arterial and venous blood. We followed up all patients to hospital discharge. Our primary aim was to evaluate agreements between the two measurements using Bland and Altman plots with the report of bias (mean difference) and limits of agreements. We also aimed to determine the rate of misclassification (defined as one measurement<1.8 mmol/l and the other>2.2). Our secondary aim was to evaluate their respective prognostic value to predict in-hospital death or admission in the ICU longer than 72 h. The mean age of the 132 analyzed patients was 62 years (SD: 18 years), and 59% were men. The mean difference (bias) between arterial and venous lactate was -0.6 mmol/l (limits of agreement: -1.7 to 0.6 mmol/l). The rate of misclassification was 8% (95% confidence interval: 3-2%). Both methods present similar performances for the prediction of poor outcomes, with an area under the receiving operator characteristic curves of 0.67 for both. Results were similar when focused only on septic patients. Venous and arterial lactates do not agree well, and there is a high misclassification rate. Venous lactate does not appear to be interchangeable with arterial sampling.
The ICI classification for calcaneal injuries: a validation study.
Frima, Herman; Eshuis, Rienk; Mulder, Paul; Leenen, Luke
2012-06-01
The integral classification of injuries (ICI), by Zwipp et al. has been developed as a classification system for injuries of the bones, joints, cartilage and ligaments of the foot. It follows the principles of the comprehensive classification of fractures by Müller et al. The ICI was developed for 'everyday use' and scientific purposes. Our aim was to perform a validation study for this classification system applied to the calcaneal injuries. A panel of five experienced trauma and orthopaedic surgeons evaluated the ICI score in 20 calcaneal injuries. After 2 months, a second classification was performed in a different order. Inter- and intra-observer variability were evaluated by kappa statistics. Panel members were not able to evaluate capsule and ligamental injuries based on X-ray and computed tomography (CT) films. Two injuries were excluded for logistical reasons. The inter-observer agreement based on 18 injuries of bone and joints was slight; kappa 0.14 (90% confidence interval (CI): 0.05-0.22). The intra-observer agreement was fair; kappa 0.31 (90% CI: 0.22-0.41). Overall, the panel rated the system as very complicated and not practical. The ICI is a complicated classification system with slight to fair inter- and intra-observer variabilities. It might not be a practical classification system for calcaneal injuries in 'everyday use' or scientific purposes. Copyright © 2011 Elsevier Ltd. All rights reserved.
Lack of grading agreement among international hemostasis external quality assessment programs
Olson, John D.; Jennings, Ian; Meijer, Piet; Bon, Chantal; Bonar, Roslyn; Favaloro, Emmanuel J.; Higgins, Russell A.; Keeney, Michael; Mammen, Joy; Marlar, Richard A.; Meley, Roland; Nair, Sukesh C.; Nichols, William L.; Raby, Anne; Reverter, Joan C.; Srivastava, Alok; Walker, Isobel
2018-01-01
Laboratory quality programs rely on internal quality control and external quality assessment (EQA). EQA programs provide unknown specimens for the laboratory to test. The laboratory's result is compared with other (peer) laboratories performing the same test. EQA programs assign target values using a variety of methods statistical tools and performance assessment of ‘pass’ or ‘fail’ is made. EQA provider members of the international organization, external quality assurance in thrombosis and hemostasis, took part in a study to compare outcome of performance analysis using the same data set of laboratory results. Eleven EQA organizations using eight different analytical approaches participated. Data for a normal and prolonged activated partial thromboplastin time (aPTT) and a normal and reduced factor VIII (FVIII) from 218 laboratories were sent to the EQA providers who analyzed the data set using their method of evaluation for aPTT and FVIII, determining the performance for each laboratory record in the data set. Providers also summarized their statistical approach to assignment of target values and laboratory performance. Each laboratory record in the data set was graded pass/fail by all EQA providers for each of the four analytes. There was a lack of agreement of pass/fail grading among EQA programs. Discordance in the grading was 17.9 and 11% of normal and prolonged aPTT results, respectively, and 20.2 and 17.4% of normal and reduced FVIII results, respectively. All EQA programs in this study employed statistical methods compliant with the International Standardization Organization (ISO), ISO 13528, yet the evaluation of laboratory results for all four analytes showed remarkable grading discordance. PMID:29232255
Improving agreement in assessment of synovitis in rheumatoid arthritis.
Cheung, Peter P; Dougados, Maxime; Andre, Vincent; Balandraud, Nathalie; Chales, Gérard; Chary-Valckenaere, Isabelle; Chatelus, Emmanuel; Dernis, Emmanuelle; Gill, Ghislaine; Gilson, Mélanie; Guis, Sandrine; Mouterde, Gael; Pavy, Stephan; Pouyol, François; Marhadour, Thierry; Richette, Pascal; Ruyssen-Witrand, Adeline; Soubrier, Martin; Gossec, Laure
2013-03-01
Synovitis assessment through evaluation of swollen joints is integral in steering treatment decisions in rheumatoid arthritis (RA). However, there is high inter-observer variation. The objective was to assess if a short collegiate consensus would improve swollen joint agreement between rheumatologists and whether this was affected by experience. Eighteen rheumatologists from French university rheumatology units participated in three 30 minutes rounds over a half day meeting evaluating joint counts of RA patients in small groups, followed by short consensus discussions. Agreement was evaluated at the end of each round as follows: (i) global agreement of swollen joints (ii) swollen joint agreement according to level of experience of the rheumatologist (iii) swollen joint count and (iv) agreement of disease activity state according to the Disease Activity Score (DAS28). Agreement was calculated using percentage agreement and kappa. Global agreement of swollen joints failed to improve (kappa 0.50 to 0.52) at the joint level. Agreement between seniors did not improve but agreement between newly qualified rheumatologists and their senior peer, which was initially poor (kappa 0.28), improved significantly (to 0.54) at the end of the consensus exercises. Concordance of DAS28 activity states improved from 71% to 87%. Consensus exercises for swollen joint assessment is worthwhile and may potentially improve agreement between clinicians in clinical synovitis and disease activity state, benefit was mostly observed in newly qualified rheumatologists. Copyright © 2012 Société française de rhumatologie. Published by Elsevier SAS. All rights reserved.
San Francisco urban partnership agreement : national evaluation plan.
DOT National Transportation Integrated Search
2009-12-22
This report provides an analytic framework for evaluating the San Francisco Urban Partnership Agreement (UPA) under the United States Department of Transportation (U.S. DOT) UPA Program. The San Francisco UPA projects to be evaluated focus on those r...
[The Brazilian Hospital Information System and the acute myocardial infarction hospital care].
Escosteguy, Claudia Caminha; Portela, Margareth Crisóstomo; Medronho, Roberto de Andrade; de Vasconcellos, Maurício Teixeira Leite
2002-08-01
To analyze the applicability of the Brazilian Unified Health System's national hospital database to evaluate the quality of acute myocardial infarction hospital care. It was evaluated 1,936 hospital admission forms having acute myocardial infarction (AMI) as primary diagnosis in the municipal district of Rio de Janeiro, Brazil, in 1997. Data was collected from the national hospital database. A stratified random sampling of 391 medical records was also evaluated. AMI diagnosis agreement followed the literature criteria. Variable accuracy analysis was performed using kappa index agreement. The quality of AMI diagnosis registered in hospital admission forms was satisfactory according to the gold standard of the literature. In general, the accuracy of the variables demographics (sex, age group), process (medical procedures and interventions), and outcome (hospital death) was satisfactory. The accuracy of demographics and outcome variables was higher than the one of process variables. Under registration of secondary diagnosis was high in the forms and it was the main limiting factor. Given the study findings and the widespread availability of the national hospital database, it is pertinent its use as an instrument in the evaluation of the quality of AMI medical care.
Successful associateship agreements.
Crafton, B C
1997-08-01
When evaluating potential associateship agreements, dentists need to recognize and understand how status, noncompete clauses, scheduling and compensation affect the strength of an associateship agreement. Dentists should not enter an associateship agreement without fully understanding the agreement and its obligations or without the help of an accountant and an attorney.
Fagnani, Francis; Pham, Thao; Claudepierre, Pascal; Berenbaum, Francis; De Chalus, Thibault; Saadoun, Carine; Joubert, Jean-Michel; Fautrel, Bruno
2016-08-01
To evaluate the cost-effectiveness of a Treat-to-Target strategy with certolizumab pegol in patients with rheumatoid arthritis in the context of a pay-for-performance agreement in which medication costs are refunded in case of discontinuation during the first 3 months of treatment. The Treat-to-Target strategy consisted of a systematic switch to second-line tumor necrosis factor (TNF)α inhibitor in case of an unmet ACR50 response at 3 months compared to current routine clinical practice. A reference cohort treated first-line with certolizumab pegol according to current practice without systematic switching was considered as the comparator. A decision-tree model was constructed to estimate clinical outcome (health assessment questionnaire-disability index or HAQ-DI score), time spent in ACR50 response (ACR 50), and direct costs of treatment over a 2-year period. HAQ scores were derived from American College of Rheumatology 50 (ACR50) responses. All TNFα inhibitors were assumed to have equivalent efficacy and tolerability. Costs were estimated at 2013 French retail prices (date of the pay-for-performance agreement). The mean duration of an ACR50 response was 1.23 years in the Treat-to-Target strategy certolizumab pegol cohort vs 0.98 years in the reference cohort, resulting in a mean gain in HAQ at 24 months of 0.117. The Treat-to-Target strategy with a mix of TNFα inhibitors as second-line therapy was more expensive than the reference strategy in absolute terms, but this difference was entirely offset by the pay-for-performance agreement. The Treat-to-Target strategy was, thus, cost-neutral over a 2-year period after the payback of CZP cost for patients not achieving the target at 3 months. In the context of a pay-for-performance agreement, the management of patients with rheumatoid arthritis using a Treat-to-Target strategy with certolizumab pegol in first line is dominant compared to standard use of this drug in the French setting in 2013.
Rossettini, Giacomo; Rondoni, Angie; Lovato, Tommaso; Strobe, Marco; Verzè, Elisa; Vicentini, Marco; Testa, Marco
2016-06-03
Passive Intervertebral Movements (PIVMs) are commonly used to assess and treat patients with nonspecific neck pain. Only very few studies have investigated 3D movements until now. This study assessed intra- and inter-rater reliability of three-dimensional (3D) cervical PIVMs performed by physical therapy students in patients with nonspecific neck pain. Thirty-one patients, mean age 47.2 ± 7.2 years, were independently evaluated by 2 physical therapy students. The raters (A and B) assessed mobility, end-feel and pain provocation performing bilaterally the 3D cervical segmental side-bending test (3D CSSB) from levels C2-C3 to C6-C7. Percentage agreement (raw, positive and negative), Cohen's kappa (95% CI), prevalence index and bias index were calculated to estimate intra- and inter-reliability. Intra-rater reliability showed kappa values ranging between fair and substantial (k 0.29-0.80) for pain provocation, mobility and end-feel, with percentage agreements between 61%-90%. Inter-rater reliability presented kappa values ranging between fair and substantial (k 0.22-0.62) for pain provocation, mobility and end-feel, with percentage agreements between 61% and 80%. Intra-rater reliability of 3D PIVMs was superior to inter-rater reliability in patients with nonspecific neck pain. The most repeatable evaluation parameter was pain. However overall poor reliability suggests avoiding the use of these techniques alone to examine patients and measure their outcome. Further studies are needed to investigate PIVMs reliability in combination with other assessment procedure in symptomatic patients.
Kentel, Behzat B; King, Mark A; Mitchell, Sean R
2011-11-01
A torque-driven, subject-specific 3-D computer simulation model of the impact phase of one-handed tennis backhand strokes was evaluated by comparing performance and simulation results. Backhand strokes of an elite subject were recorded on an artificial tennis court. Over the 50-ms period after impact, good agreement was found with an overall RMS difference of 3.3° between matching simulation and performance in terms of joint and racket angles. Consistent with previous experimental research, the evaluation process showed that grip tightness and ball impact location are important factors that affect postimpact racket and arm kinematics. Associated with these factors, the model can be used for a better understanding of the eccentric contraction of the wrist extensors during one-handed backhand ground strokes, a hypothesized mechanism of tennis elbow.
Analysis of rockbolt performance at the Waste Isolation Pilot Plant
DOE Office of Scientific and Technical Information (OSTI.GOV)
Terrill, L.J.; Francke, C.T.; Saeb, S.
Rockbolt failures at the Waste Isolation Pilot Plant have been recorded since 1990 and are categorized in terms of mode of failure. The failures are evaluated in terms of physical location of installation within the mine, local excavation geometry and stratigraphy, proximity to other excavations or shafts, and excavation age. The database of failures has revealed discrete ares of the mine containing relatively large numbers of failures. The results of metallurgical analyses and standard rockbolt load testing have generally been in agreement with the in situ evaluations.
Nabawy, Mostafa R. A.; Crowther, William J.
2014-01-01
This paper introduces a generic, transparent and compact model for the evaluation of the aerodynamic performance of insect-like flapping wings in hovering flight. The model is generic in that it can be applied to wings of arbitrary morphology and kinematics without the use of experimental data, is transparent in that the aerodynamic components of the model are linked directly to morphology and kinematics via physical relationships and is compact in the sense that it can be efficiently evaluated for use within a design optimization environment. An important aspect of the model is the method by which translational force coefficients for the aerodynamic model are obtained from first principles; however important insights are also provided for the morphological and kinematic treatments that improve the clarity and efficiency of the overall model. A thorough analysis of the leading-edge suction analogy model is provided and comparison of the aerodynamic model with results from application of the leading-edge suction analogy shows good agreement. The full model is evaluated against experimental data for revolving wings and good agreement is obtained for lift and drag up to 90° incidence. Comparison of the model output with data from computational fluid dynamics studies on a range of different insect species also shows good agreement with predicted weight support ratio and specific power. The validated model is used to evaluate the relative impact of different contributors to the induced power factor for the hoverfly and fruitfly. It is shown that the assumption of an ideal induced power factor (k = 1) for a normal hovering hoverfly leads to a 23% overestimation of the generated force owing to flapping. PMID:24554578
Nabawy, Mostafa R A; Crowther, William J
2014-05-06
This paper introduces a generic, transparent and compact model for the evaluation of the aerodynamic performance of insect-like flapping wings in hovering flight. The model is generic in that it can be applied to wings of arbitrary morphology and kinematics without the use of experimental data, is transparent in that the aerodynamic components of the model are linked directly to morphology and kinematics via physical relationships and is compact in the sense that it can be efficiently evaluated for use within a design optimization environment. An important aspect of the model is the method by which translational force coefficients for the aerodynamic model are obtained from first principles; however important insights are also provided for the morphological and kinematic treatments that improve the clarity and efficiency of the overall model. A thorough analysis of the leading-edge suction analogy model is provided and comparison of the aerodynamic model with results from application of the leading-edge suction analogy shows good agreement. The full model is evaluated against experimental data for revolving wings and good agreement is obtained for lift and drag up to 90° incidence. Comparison of the model output with data from computational fluid dynamics studies on a range of different insect species also shows good agreement with predicted weight support ratio and specific power. The validated model is used to evaluate the relative impact of different contributors to the induced power factor for the hoverfly and fruitfly. It is shown that the assumption of an ideal induced power factor (k = 1) for a normal hovering hoverfly leads to a 23% overestimation of the generated force owing to flapping.
Intra- and interobserver agreement for fetal cerebral measurements in 3D-ultrasonography.
Albers, Maria E W A; Buisman, Erato T I A; Kahn, René S; Franx, Arie; Onland-Moret, N Charlotte; de Heus, Roel
2018-04-10
The aim of this study is to evaluate intra- and interobserver agreement for measurement of intracranial, cerebellar, and thalamic volume with the Virtual Organ Computer-aided AnaLysis (VOCAL) technique in three-dimensional ultrasound images, in comparison to two-dimensional measurements of these brain structures. Three-dimensional ultrasound images of the brains of 80 fetuses at 20-24 weeks' gestational age were obtained from YOUth, a Dutch prospective cohort study. Two observers performed offline measurement of the occipitofrontal diameter, intracranial volume, transcerebellar diameter, cerebellar volume, and thalamic width, area, and volume, independently. VOCAL was used for calculation of the volumes. The two-way random, single measures intraclass correlation coefficient (ICC) was used for analysis of agreement and Bland-Altman plots were configured. Intra- and interobserver agreement was almost perfect for occipitofrontal diameter (intra ICC 0.88, 95% CI 0.82-0.92; inter ICC 0.91, 95% CI 0.85-0.94), intracranial volume (intra ICC 0.96, 95% CI 0.91-0.98; inter ICC 0.97, 95% CI 0.96-0.98) and transcerebellar diameter (intra ICC 0.91, 95% CI 0.86-0.94; inter ICC 0.86, 95% CI 0.78-0.910). For cerebellar volume, the intraobserver agreement was almost perfect (0.85, 95% CI 0.76-0.90), whereas the interobserver agreement was substantial (0.75, 95% CI 0.44-0.88). Agreement was only moderate for thalamic measurements. Bland-Altman plots for the volume measurements are normally distributed with acceptable mean differences and 95% limits of agreement. The intra- and interobserver agreement of the measurement of intracranial and cerebellar volume with VOCAL was almost perfect. These measurements are therefore reliable, and can be used to investigate fetal brain development. Thalamic measurements are not reliable enough. © 2018 Wiley Periodicals, Inc.
Statistical Validation for Clinical Measures: Repeatability and Agreement of Kinect™-Based Software
Tello, Emanuel; Rodrigo, Alejandro; Valentinuzzi, Max E.
2018-01-01
Background The rehabilitation process is a fundamental stage for recovery of people's capabilities. However, the evaluation of the process is performed by physiatrists and medical doctors, mostly based on their observations, that is, a subjective appreciation of the patient's evolution. This paper proposes a tracking platform of the movement made by an individual's upper limb using Kinect sensor(s) to be applied for the patient during the rehabilitation process. The main contribution is the development of quantifying software and the statistical validation of its performance, repeatability, and clinical use in the rehabilitation process. Methods The software determines joint angles and upper limb trajectories for the construction of a specific rehabilitation protocol and quantifies the treatment evolution. In turn, the information is presented via a graphical interface that allows the recording, storage, and report of the patient's data. For clinical purposes, the software information is statistically validated with three different methodologies, comparing the measures with a goniometer in terms of agreement and repeatability. Results The agreement of joint angles measured with the proposed software and goniometer is evaluated with Bland-Altman plots; all measurements fell well within the limits of agreement, meaning interchangeability of both techniques. Additionally, the results of Bland-Altman analysis of repeatability show 95% confidence. Finally, the physiotherapists' qualitative assessment shows encouraging results for the clinical use. Conclusion The main conclusion is that the software is capable of offering a clinical history of the patient and is useful for quantification of the rehabilitation success. The simplicity, low cost, and visualization possibilities enhance the use of the software Kinect for rehabilitation and other applications, and the expert's opinion endorses the choice of our approach for clinical practice. Comparison of the new measurement technique with established goniometric methods determines that the proposed software agrees sufficiently to be used interchangeably. PMID:29750166
Silva, Paolo S; Walia, Saloni; Cavallerano, Jerry D; Sun, Jennifer K; Dunn, Cheri; Bursell, Sven-Erik; Aiello, Lloyd M; Aiello, Lloyd Paul
2012-09-01
To compare agreement between diagnosis of clinical level of diabetic retinopathy (DR) and diabetic macular edema (DME) derived from nonmydriatic fundus images using a digital camera back optimized for low-flash image capture (MegaVision) compared with standard seven-field Early Treatment Diabetic Retinopathy Study (ETDRS) photographs and dilated clinical examination. Subject comfort and image acquisition time were also evaluated. In total, 126 eyes from 67 subjects with diabetes underwent Joslin Vision Network nonmydriatic retinal imaging. ETDRS photographs were obtained after pupillary dilation, and fundus examination was performed by a retina specialist. There was near-perfect agreement between MegaVision and ETDRS photographs (κ=0.81, 95% confidence interval [CI] 0.73-0.89) for clinical DR severity levels. Substantial agreement was observed with clinical examination (κ=0.71, 95% CI 0.62-0.80). For DME severity level there was near-perfect agreement with ETDRS photographs (κ=0.92, 95% CI 0.87-0.98) and moderate agreement with clinical examination (κ=0.58, 95% CI 0.46-0.71). The wider MegaVision 45° field led to identification of nonproliferative changes in areas not imaged by the 30° field of ETDRS photos. Field area unique to ETDRS photographs identified proliferative changes not visualized with MegaVision. Mean MegaVision acquisition time was 9:52 min. After imaging, 60% of subjects preferred the MegaVision lower flash settings. When evaluated using a rigorous protocol, images captured using a low-light digital camera compared favorably with ETDRS photography and clinical examination for grading level of DR and DME. Furthermore, these data suggest the importance of more extensive peripheral images and suggest that utilization of wide-field retinal imaging may further improve accuracy of DR assessment.
Tuan, Nguyen T; Wang, Youfa
2014-01-01
Objectives To evaluate performance of anthropometric measures relative to percentage body fat (%BF) measured by dual-energy X-ray absorptiometry (DXA) in children. Design and Methods We used data from 8-19-y-old US children enrolled in a nationally representative cross-sectional survey in 2001-2004 (n=5,355) with measured %BF, body mass index (BMI), waist circumference (WC), waist-to-height ratio (WHtR) and triceps skinfold thickness (TSF). Agreement and prediction were evaluated based on standardized regression coefficients (β), kappa, and the area under the receiver-operating characteristic curves (AUC). Results The association between Z scores for %BF and anthropometric measures was strong (β of ~0.75-0.90, kappa of ~0.60-0.75, and AUC of ~0.87-0.98; P<0.001 for all), with only some variations by race-ethnicity, mostly in girls. In boys, TSF and WHtR Z-scores had stronger agreement with %BF than BMI (β of 0.91 and 0.86 vs. 0.79, kappa of 0.75 and 0.71 vs. 0.59, and AUC of 0.97 and 0.97 vs. 0.91; P<0.05 for all). In boys with BMI < median but %BF ≥ median, β value of TSF Z score was higher than those from BMI. In girls, TSF also provided a higher agreement than BMI, but was only statistically higher for kappa. Conclusions High agreement and small racial-ethnic variations in the association between percentage body fat and anthropometric measures support the use of anthropometric measures, especially waist-to-height ratio and triceps skinfold thickness, as proxy indicators for adiposity. PMID:24415710
Prakash, Gaurav; Srivastava, Dhruv; Choudhuri, Sounak
2015-12-01
The purpose of this study is to analyze the repeatability and agreement of corneal power using a new Hartman type topographer in comparison to Scheimpflug+Placido and autorefractor devices. In this cross sectional, observational study performed at the cornea services of a specialty hospital, 100 normal eyes (100 consecutive candidates) without any previous ocular surgery or morbidity except refractive error were evaluated. All candidates underwent three measurements each on a Full gradient, Hartman type topographer (FG) (iDesign, AMO), Scheimpflug+Placido topographer (SP) (Sirius, CSO) and rotating prism auto-keratorefractor (AR) (KR1, Nidek). The parameters assessed were flat keratometry (K1), steep keratometry (K2), steep axis (K2 axis), mean K, J 0 and J 45. Intra-device repeatability and inter-device agreement were evaluated. On repeatability analysis, the intra-device means were not significantly different (ANOVA, p > 0.05). Intraclass correlations (ICC) were >0.98 except for J 0 and J 45. In terms of intra-measurement standard deviation (Sw), the SP and FG groups fared better than AR group (p < 0.001, ANOVA). On Sw versus Average plots, no significantly predictive fit was seen (p > 0.05, R (2) < 0.1 for all the values). On inter-device agreement analysis, there was no difference in means (ANOVA, p > 0.05). ICC ranged from 0.92 to 0.99 (p < 0.001). Regression fits on Bland-Altman plots suggested no clinically significant effect of average values over difference in means. The repeatability of Hartman type topographer in normal eyes is comparable to SP combination device and better than AR. The agreement between the three devices is good. However, we recommend against interchanging these devices between follow-ups or pooling their data.
Frequency of otitis media based on otoendoscopic evaluation in preterm infants.
Coticchia, James; Shah, Priyanka; Sachdeva, Livjot; Kwong, Kelvin; Cortez, Josef M; Nation, Javan; Rudd, Tracy; Zidan, Marwan; Cepeda, Eugene; Gonik, Bernard
2014-10-01
This study was conducted to determine the frequency of otitis media in preterm neonates using otoendoscopy and tympanometry. Prospective study. Wayne State University, Hutzel Women's Hospital Neonatal Intensive Care Unit. Eighty-six preterm infants were included (gestational age <36 weeks). Otoendoscopy and tympanometry were performed to detect the presence of otitis media. Kappa statistic and logistic regression were used for statistical analysis. Otoendoscopy was performed in 85 patients. The frequency of otoendoscopy-diagnosed otitis media was 72.9% (62/85). Tympanometry could be performed on 69.76% of the ears. There was 73.5% agreement between the findings of tympanometry and those of otoendoscopy. The association between the presence of otitis media and gestational age at birth was statistically significant. The lower the gestational age, the higher the frequency of otoendoscopy-diagnosed otitis media (P = .001). Otoendoscopically diagnosed otitis media is frequent in preterm neonates. There was agreement between the results of tympanometry and those of otoendoscopy. The frequency of otitis media increased with lower gestational age. © American Academy of Otolaryngology—Head and Neck Surgery Foundation 2014.
Evaluation of a pilot workload metric for simulated VTOL landing tasks
NASA Technical Reports Server (NTRS)
North, R. A.; Graffunder, K.
1979-01-01
A methodological approach to measuring workload was investigated for evaluation of new concepts in VTOL aircraft displays. Multivariate discriminant functions were formed from conventional flight performance and/or visual response variables to maximize detection of experimental differences. The flight performance variable discriminant showed maximum differentiation between crosswind conditions. The visual response measure discriminant maximized differences between fixed vs. motion base conditions and experimental displays. Physiological variables were used to attempt to predict the discriminant function values for each subject/condition/trial. The weights of the physiological variables in these equations showed agreement with previous studies. High muscle tension, light but irregular breathing patterns, and higher heart rate with low amplitude all produced higher scores on this scale and thus, represented higher workload levels.
Low agreement between the fitnessgram criterion references for adolescents
Coledam, Diogo Henrique Constantino; Batista, João Pedro; Glaner, Maria Fátima
2015-01-01
OBJECTIVE: To analyze the association and agreement of fitnessgram reference criteria (RC) for cardiorespiratory fitness, body mass index (BMI) and strength in youth. METHODS: The study included 781 youth, 386 females, aged 10 to 18 years of Londrina-PR. It were performed cardiorespiratory fitness and muscular strength tests and was calculated body mass index. The association between the tests was analyzed using Poisson regression to obtain prevalence ratio (PR) and confidence intervals of 95%, while agreement of the reference criteria was tested by Kappa index. RESULTS: Significant associations were found between cardiorespiratory fitness and BMI (PR=1,49, 1,27-1,75), muscle strength and BMI (PR=1,55, 1,17-2,08), cardiorespiratory fitness and muscle strength (PR=1,81, 1,47-2,24). The agreement between reference criteria ranged from weak to fair, 48.8% (k=0.05, p=0.10) for cardiorespiratory fitness and BMI, 52.9% (k=0.09, p=0.001) for muscle strength and BMI and 38.4% (k=0.22, p<0.001) for cardiorespiratory fitness and muscle strength. CONCLUSIONS: Although RC for cardiorespiratory fitness, muscle strength and BMI are associated, the agreement between them ranged from weak to fair. To evaluate health related physical fitness it is suggest the execution of all tests, since each test has specific characteristics. PMID:25649383
32 CFR 37.500 - What must my pre-award business evaluation address?
Code of Federal Regulations, 2010 CFR
2010-07-01
... DoD GRANT AND AGREEMENT REGULATIONS TECHNOLOGY INVESTMENT AGREEMENTS Pre-Award Business Evaluation § 37.500 What must my pre-award business evaluation address? (a) You must determine the qualification... 32 National Defense 1 2010-07-01 2010-07-01 false What must my pre-award business evaluation...
Validation of Heart Rate Monitor Polar RS800 for Heart Rate Variability Analysis During Exercise.
Hernando, David; Garatachea, Nuria; Almeida, Rute; Casajús, Jose A; Bailón, Raquel
2018-03-01
Hernando, D, Garatachea, N, Almeida, R, Casajús, JA, and Bailón, R. Validation of heart rate monitor Polar RS800 for heart rate variability analysis during exercise. J Strength Cond Res 32(3): 716-725, 2018-Heart rate variability (HRV) analysis during exercise is an interesting noninvasive tool to measure the cardiovascular response to the stress of exercise. Wearable heart rate monitors are a comfortable option to measure interbeat (RR) intervals while doing physical activities. It is necessary to evaluate the agreement between HRV parameters derived from the RR series recorded by wearable devices and those derived from an electrocardiogram (ECG) during dynamic exercise of low to high intensity. Twenty-three male volunteers performed an exercise stress test on a cycle ergometer. Subjects wore a Polar RS800 device, whereas ECG was also recorded simultaneously to extract the reference RR intervals. A time-frequency spectral analysis was performed to extract the instantaneous mean heart rate (HRM), and the power of low-frequency (PLF) and high-frequency (PHF) components, the latter centered on the respiratory frequency. Analysis was done in intervals of different exercise intensity based on oxygen consumption. Linear correlation, reliability, and agreement were computed in each interval. The agreement between the RR series obtained from the Polar device and from the ECG is high throughout the whole test although the shorter the RR is, the more differences there are. Both methods are interchangeable when analyzing HRV at rest. At high exercise intensity, HRM and PLF still presented a high correlation (ρ > 0.8) and excellent reliability and agreement indices (above 0.9). However, the PHF measurements from the Polar showed reliability and agreement coefficients around 0.5 or lower when the level of the exercise increases (for levels of O2 above 60%).
NASA Technical Reports Server (NTRS)
Simon, F. F.
1975-01-01
The use of a solar simulator for performance determination permits collector testing under standard conditions of wind, ambient temperature, flow rate and sun. The performance results determined with the simulator have been found to be in good agreement with outdoor performance results. The measured thermal efficiency and evaluation of 23 collectors are reported which differ according to absorber material (copper, aluminum, steel), absorber coating (nonselective black paint, selective copper oxide, selective black nickel, selective black chrome), type of glazing material (glass, Tedlar, Lexan, antireflection glass), the use of honeycomb material and the use of vacuum to prevent thermal convection losses. The collectors were given performance rankings based on noon-hour solar conditions and all-day solar conditions. The determination with the simulator of an all-day collector performance was made possible by tests at different incident angles. The solar performance rankings were made based on whether the collector is to be used for pool heating, hot water, absorption air conditioning, heating, or for a solar Rankine machine.
Quality assurance of lower limb venous duplex scans performed by vascular surgeons.
Kordowicz, A; Ferguson, G; Salaman, R; Onwudike, M
2015-02-01
Duplex scanning is the gold standard for investigating venous reflux; increasingly surgeons perform these scans themselves. There has been no data published analysing the accuracy of Duplex scans performed by vascular surgeons. We aimed to evaluate an objective method of comparing the results of lower limb Duplex scans performed by one consultant vascular surgeon with those performed by a vascular technologist. We assessed 100 legs with symptomatic varicose veins. Each patient underwent two lower limb venous Duplex scans; one performed by a consultant vascular surgeon and one by a vascular technologist. Scan results were randomised and sent to two consultant vascular surgeons blinded to the identity and experience of the sonographer. They were asked to recommend treatment. A k score was calculated in each case to assess the level of agreement between the scans performed by the consultant and the technologist. Eighty-one patients were studied (53 females). The kappa score for assessor 1 was 0.60 (95%CI:0.44-0.75) and for assessor 2 was 0.62 (95%CI:0.48-0.75). k scores >0.60 represent a substantial strength of agreement. Duplex scans performed by this surgeon were comparable to those performed by a vascular technologist. It is possible to quality-assure duplex performed by vascular surgeons without directly observing the scanning process or reviewing digitally recorded images. We propose standardisation of training, assessment and quality assurance for vascular surgeons wishing to perform ultrasound scans.
Krisa, L; Gaughan, J; Vogel, L; Betz, R R; Mulcahey, M J
2013-01-01
A prospective repeated measures multicenter study to determine reliability at individual spinal levels when applied to young persons with spinal cord injury (SCI). To evaluate intra- and inter-rater agreement of repeated motor and sensory scores at individual spinal levels. Shriners Hospitals for Children--Philadelphia and Chicago, USA. A total 189 youth with complete and incomplete SCI underwent four neurological exams by two different raters. Agreement between and within raters for each myotome and dermatome was evaluated for complete and incomplete SCI separately. Intraclass correlation coefficients and 95% confidence intervals were calculated. Overall, both intra- and inter-rater agreement resulted in moderate-to-high agreement among myotomes. Subjects with complete SCI had moderate agreement for light touch (LT) and pin prick (PP) testing, whereas subjects with incomplete SCI had >60.0% of dermatomes resulting in poor agreement for PP testing. Overall, moderate-to-high agreement was found for muscle strength comparisons and moderate-to-poor agreement was found for PP and LT.
Performance, physiological, and oculometer evaluation of VTOL landing displays
NASA Technical Reports Server (NTRS)
North, R. A.; Stackhouse, S. P.; Graffunder, K.
1979-01-01
A methodological approach to measuring workload was investigated for evaluation of new concepts in VTOL aircraft displays. Physiological, visual response, and conventional flight performance measures were recorded for landing approaches performed in the NASA Visual Motion Simulator (VMS). Three displays (two computer graphic and a conventional flight director), three crosswind amplitudes, and two motion base conditions (fixed vs. moving base) were tested in a factorial design. Multivariate discriminant functions were formed from flight performance and/or visual response variables. The flight performance variable discriminant showed maximum differentation between crosswind conditions. The visual response measure discriminant maximized differences between fixed vs. motion base conditions and experimental displays. Physiological variables were used to attempt to predict the discriminant function values for each subject/condition trial. The weights of the physiological variables in these equations showed agreement with previous studies. High muscle tension, light but irregular breathing patterns, and higher heart rate with low amplitude all produced higher scores on this scale and thus represent higher workload levels.
A telehealth approach to conducting clinical swallowing evaluations in children with cerebral palsy.
Kantarcigil, Cagla; Sheppard, Justine Joan; Gordon, Andrew M; Friel, Kathleen M; Malandraki, Georgia A
2016-08-01
Accurate and timely evaluation of dysphagia in children with cerebral palsy (CP) is critical. For children with limited access to quality healthcare, telehealth is an option; however, its reliability needs to be investigated. To test the reliability of an asynchronous telehealth model for evaluating dysphagia in children with CP using a standardized clinical assessment. Nineteen children (age range 6.9-17.5) were assessed at three mealtimes via the Dysphagia Disorder Survey (DDS) by three clinicians (face-to-face evaluations). Mealtimes were video-recorded to allow asynchronous evaluations by a remote clinician who also completed approximately 1/3 of face-to-face evaluations. Agreement was tested on DDS variables and dysphagia severity. Results revealed substantial to excellent agreement between face-to-face and remote assessments by the same rater (78-100%, KW=0.64-1) on all, but two variables (oral transport and oral pharyngeal swallow) and by different raters (69-89%, KW=0.6-0.86) on all but one variable (orienting). For dysphagia severity, intrarater agreement was excellent (100%, KW=1); interrater agreement was substantial (85%; KW=0.76). Asynchronous clinical swallowing evaluations using standardized tools have acceptable levels of agreement with face-to-face evaluations, and can be an alternative for children with limited access to expert swallowing care. Copyright © 2016 Elsevier Ltd. All rights reserved.
Microwave photonic link with improved phase noise using a balanced detection scheme
NASA Astrophysics Data System (ADS)
Hu, Jingjing; Gu, Yiying; Tan, Wengang; Zhu, Wenwu; Wang, Linghua; Zhao, Mingshan
2016-07-01
A microwave photonic link (MPL) with improved phase noise performance using a dual output Mach-Zehnder modulator (DP-MZM) and balanced detection is proposed and experimentally demonstrated. The fundamental concept of the approach is based on the two complementary outputs of DP-MZM and the destructive combination of the photocurrent in balanced photodetector (BPD). Theoretical analysis is performed to numerical evaluate the additive phase noise performance and shows a good agreement with the experiment. Experimental results are presented for 4 GHz, 8 GHz and 12 GHz transmission link and an 11 dB improvement of phase noise performance at 10 MHz offset is achieved compared to the conventional intensity-modulation and direct-detection (IMDD) MPL.
ERIC Educational Resources Information Center
Dressler, William W.; Balieiro, Mauro C.; dos Santos, José Ernesto
2015-01-01
This article reports the replication after 10 years of cultural consensus analyses in four cultural domains in the city of Ribeirão Preto, Brazil. Additionally, two methods for evaluating residual agreement are applied to the data, and a new technique for evaluating how cultural knowledge is represented by residual agreement is introduced. We…
Edwards, Ryan; Alsufyani, Noura; Heo, Giseon; Flores-Mir, Carlos
2015-08-01
In this study, we aimed to assess interrater and intrarater agreement among orthodontic clinicians in their assessments of reported incidental findings in regard to both the need for additional follow-up and the impact on future orthodontic treatment in large-field maxillofacial cone-beam computed tomography (CBCT) imaging. The study sample consisted of 18 nonrandomly selected large-field maxillofacial CBCT volumes containing a reported total of 88 radiographic findings. All scans were associated with formal radiologic reports. However, the suggestions of further follow-up were removed from the radiologic reports so as to not bias the 3 evaluating orthodontists in their subsequent decision making. The evaluators had on average 7.6 years of CBCT usage and self-interpretation experience. Reliability was determined by quantifying the level of agreement between the evaluators' assessments for both research questions for all 88 findings using a binary response (yes/no) as the outcome measure. The Cohen kappa statistic was calculated to quantify intrarater and interrater agreement globally for both statements. Although interrater agreement was considerable, potential decisions with clinical impact were not consistent. This needs to be considered when interpreting maxillofacial incidental findings. Evaluators demonstrated higher levels of agreement for dentoalveolar findings compared with all other extragnathic regions when assessing clinical significance. Among the evaluators who were considered experienced in CBCT, "fair-to-good" interrater agreement and "excellent" intrarater agreement were demonstrated in terms of the need for further follow-up and their potential impact on future orthodontic treatment. Copyright © 2015 American Association of Orthodontists. Published by Elsevier Inc. All rights reserved.
Clinical application of qualitative assessment for breast masses in shear-wave elastography.
Gweon, Hye Mi; Youk, Ji Hyun; Son, Eun Ju; Kim, Jeong-Ah
2013-11-01
To evaluate the interobserver agreement and the diagnostic performance of various qualitative features in shear-wave elastography (SWE) for breast masses. A total of 153 breast lesions in 152 women who underwent B-mode ultrasound and SWE before biopsy were included. Qualitative analysis in SWE was performed using two different classifications: E values (Ecol; 6-point color score, Ehomo; homogeneity score and Esha; shape score) and a four-color pattern classification. Two radiologists reviewed five data sets: B-mode ultrasound, SWE, and combination of both for E values and four-color pattern. The BI-RADS categories were assessed B-mode and combined sets. Interobserver agreement was assessed using weighted κ statistics. Areas under the receiver operating characteristic curve (AUC), sensitivity, and specificity were analyzed. Interobserver agreement was substantial for Ecol (κ=0.79), Ehomo (κ=0.77) and four-color pattern (κ=0.64), and moderate for Esha (κ=0.56). Better-performing qualitative features were Ecol and four-color pattern (AUCs, 0.932 and 0.925) compared with Ehomo and Esha (AUCs, 0.857 and 0.864; P<0.05). The diagnostic performance of B-mode ultrasound (AUC, 0.950) was not significantly different from combined sets with E value and with four color pattern (AUCs, 0.962 and 0.954). When all qualitative values were negative, leading to downgrade the BI-RADS category, the specificity increased significantly from 16.5% to 56.1% (E value) and 57.0% (four-color pattern) (P<0.001) without improvement in sensitivity. The qualitative SWE features were highly reproducible and showed good diagnostic performance in suspicious breast masses. Adding qualitative SWE to B-mode ultrasound increased specificity in decision making for biopsy recommendation. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Gabriel, Guilherme Paiva; Chiquetto, Letícia; Morcillo, André Moreno; Ferreira, Maria do Carmo; Bazan, Ivan Gilberto M; Daolio, Luísa Dias; Lemos, Jéssica J Rocha; Carniel, Emília de Faria
2014-09-01
To assess the completeness and reliability of the Information System on Live Births (Sinasc) data. A cross-sectional analysis of the reliability and completeness of Sinasc's data was performed using a sample of Live Birth Certificate (LBC) from 2009, related to births from Campinas, Southeast Brazil. For data analysis, hospitals were grouped according to category of service (Unified National Health System, private or both), 600 LBCs were randomly selected and the data were collected in LBC-copies through mothers and newborns' hospital records and by telephone interviews. The completeness of LBCs was evaluated, calculating the percentage of blank fields, and the LBCs agreement comparing the originals with the copies was evaluated by Kappa and intraclass correlation coefficients. The percentage of completeness of LBCs ranged from 99.8%-100%. For the most items, the agreement was excellent. However, the agreement was acceptable for marital status, maternal education and newborn infants' race/color, low for prenatal visits and presence of birth defects, and very low for the number of deceased children. The results showed that the municipality Sinasc is reliable for most of the studied variables. Investments in training of the professionals are suggested in an attempt to improve system capacity to support planning and implementation of health activities for the benefit of maternal and child population. Copyright © 2014 Sociedade de Pediatria de São Paulo. Publicado por Elsevier Editora Ltda. All rights reserved.
NASA and Deere to study effects of low-g on iron processing
NASA Technical Reports Server (NTRS)
1981-01-01
A technical exchange agreement to study the effects of processing iron alloys in microgravity is described. Alloy solidification experiments are to be performed in the low-g facilities at Marshall Space Center. Deere is to prepare and evaluate the samples, and perform thermal characterization studies of the furnace used for melting and solidifying the samples. Experiment planning and analysis are to be performed jointly and data shared between the two parties. The technical exchange concept, developed by NASA to involve the private sector in low gravity research programs, is described. Other low gravity processing fields in which industry is invited to participate are listed.
Tramm, Trine; Di Caterino, Tina; Jylling, Anne-Marie B; Lelkaitis, Giedrius; Lænkholm, Anne-Vibeke; Ragó, Péter; Tabor, Tomasz P; Talman, Maj-Lis M; Vouza, Emmanouela
2018-01-01
In breast cancer, there is a growing body of evidence that tumor-infiltrating lymphocytes (TILs) may have clinical utility and may be able to direct clinical decisions for subgroups of patients. Clinical utility is, however, not sufficient for warranting the implementation of a new biomarker in the routine practice, and evaluation of the analytical validity is needed, including testing the reproducibility of decentralized assessment of TILs. The aim of this study was to evaluate the inter-observer agreement of TILs assessment using a standardized method, as proposed by the International TILs Working Group 2014, applied to a cohort of breast cancers reflecting an average breast cancer population. Stromal TILs were assessed using full slide sections from 124 breast cancers with varying histology, malignancy grade and ER- and HER2 status. TILs were estimated by nine dedicated breast pathologists using scanned hematoxylin-eosin stainings. TILs results were categorized using various cutoffs, and the inter-observer agreement was evaluated using the intraclass coefficient (ICC), Kappa statistics as well as individual overall agreements with the median value of TILs. Evaluation of TILs led to an ICC of 0.71 (95% CI: 0.65-0.77) corresponding to an acceptable agreement. Kappa values were in the range of 0.38-0.46 corresponding to a fair to moderate agreement. The individual agreements increased, when using only two categories ('high' vs. 'low' TILs) and a cutoff of 50-60%. The results of the present study are in accordance with previous studies, and shows that the proposed methodology for standardized evaluation of TILs renders an acceptable inter-observer agreement. The findings, however, indicate that assessment of TILs needs further refinement, and is in support of the latest St. Gallen Consensus, that routine reporting of TILs for early breast cancer is not ready for implementation in a clinical setting.
Tang, Zhiteng; Xu, Ye; Song, Najie; Zou, Dongqing; Liao, Yiqun; Li, Qingge; Pan, Chao
2018-03-01
The clinical performance of the newly developed MeltPro ® HPV Test, based on multicolor melting curve analysis, was evaluated and compared with the commercially available Cobas ® HPV Test for detection of HPV and genotyping of HPV-16 and HPV-18. A total of 1647 cervical samples were analyzed with both tests. The agreement values were 96.2% for HPV detection, 99.6% for HPV-16 identification, and 99.7% for HPV-18 identification. All genotyping results from MeltPro ® HPV Test showed that HPV-52, HPV-58, and HPV-16 were the most common types in this study. Intra-laboratory reproducibility studies showed 97.8% agreement while inter-laboratory reproducibility studies showed 96.9% agreement for the MeltPro ® HPV Test. The MeltPro ® HPV Test and Cobas ® HPV Test are highly correlative and are useful for monitoring HPV infection.
NASA Astrophysics Data System (ADS)
Trahmono; Lusiana, N.; Indarti, J.
2017-08-01
The aim of this study was to compare the performance of multimodal hyperspectral spectroscopy (MHS), which combines fluorescence and reflectance spectroscopy, with that of conventional laboratory-based screening tests, such as the Papanicolaou (Pap) smear test and human papilloma virus (HPV) DNA test, for detecting precancerous lesions of the cervix. The study utilized a cross-sectional design, and the kappa test was used in the analytical assessment. MHS scans were obtained from a sample of 70 consecutive patients, followed by sample collection for Pap and HPV DNA analysis and colposcopy referral, if indicated. Of the 70 patients evaluated, the results of cervical spectroscopy were normal in 38 (54.3%) patients, and they were abnormal in 32 (45.7%) patients. Based on the cytology results, 45 (64.3%) samples were normal, and 25 (35.7%) samples were abnormal. According to the results of the HPV DNA test, 47 (67.14%) samples were normal, and 17 (24.28%) samples were abnormal. Based on the results of the kappa test, the agreement between MHS and cytology was 0.793 (p < 0.001). The agreement between MHS and the HPV DNA test was 0.195 (p = 0.086), and the agreement between MHS and colposcopy was 0.479 (p < 0.001).
Schmitz, E M H; Boonen, K; van den Heuvel, D J A; van Dongen, J L J; Schellings, M W M; Emmen, J M A; van der Graaf, F; Brunsveld, L; van de Kerkhof, D
2014-10-01
Three novel direct oral anticoagulants (DOACs) have recently been registered by the Food and Drug Administration and European Medicines Agency Commission: dabigatran, rivaroxaban, and apixaban. To quantify DOACs in plasma, various dedicated coagulation assays have been developed. To develop and validate a reference ultra-performance liquid chromatography - tandem mass spectrometry (UPLC-MS/MS) method and to evaluate the analytical performance of several coagulation assays for quantification of dabigatran, rivaroxaban, and apixaban. The developed UPLC-MS/MS method was validated by determination of precision, accuracy, specificity, matrix effects, lower limits of detection, carry-over, recovery, stability, and robustness. The following coagulation assays were evaluated for accuracy and precision: laboratory-developed (LD) diluted thrombin time (dTT), Hemoclot dTT, Pefakit PiCT, ECA, Liquid anti-Xa, Biophen Heparin (LRT), and Biophen DiXal anti-Xa. Agreement between the various coagulation assays and UPLC-MS/MS was determined with random samples from patients using dabigatran or rivaroxaban. The UPLC-MS/MS method was shown to be accurate, precise, sensitive, stable, and robust. The dabigatran coagulation assay showing the best precision, accuracy and agreement with the UPLC-MS/MS method was the LD dTT test. For rivaroxaban, the anti-factor Xa assays were superior to the PiCT-Xa assay with regard to precision, accuracy, and agreement with the reference method. For apixaban, the Liquid anti-Xa assay was superior to the PiCT-Xa assay. Statistically significant differences were observed between the various coagulation assays as compared with the UPLC-MS/MS reference method. It is currently unknown whether these differences are clinically relevant. When DOACs are quantified with coagulation assays, comparison with a reference method as part of proficiency testing is therefore pivotal. © 2014 International Society on Thrombosis and Haemostasis.
Developing and Evaluating a Target-Background Similarity Metric for Camouflage Detection
Lin, Chiuhsiang Joe; Chang, Chi-Chan; Liu, Bor-Shong
2014-01-01
Background Measurement of camouflage performance is of fundamental importance for military stealth applications. The goal of camouflage assessment algorithms is to automatically assess the effect of camouflage in agreement with human detection responses. In a previous study, we found that the Universal Image Quality Index (UIQI) correlated well with the psychophysical measures, and it could be a potentially camouflage assessment tool. Methodology In this study, we want to quantify the camouflage similarity index and psychophysical results. We compare several image quality indexes for computational evaluation of camouflage effectiveness, and present the results of an extensive human visual experiment conducted to evaluate the performance of several camouflage assessment algorithms and analyze the strengths and weaknesses of these algorithms. Significance The experimental data demonstrates the effectiveness of the approach, and the correlation coefficient result of the UIQI was higher than those of other methods. This approach was highly correlated with the human target-searching results. It also showed that this method is an objective and effective camouflage performance evaluation method because it considers the human visual system and image structure, which makes it consistent with the subjective evaluation results. PMID:24498310
SPSS Macros for Assessing the Reliability and Agreement of Student Evaluations of Teaching
ERIC Educational Resources Information Center
Morley, Donald D.
2009-01-01
This article reports and demonstrates two SPSS macros for calculating Krippendorff's alpha and intraclass reliability coefficients in repetitive situations where numerous coefficients are needed. Specifically, the reported SPSS macros were used to evaluate the interrater agreement and reliability of student evaluations of teaching in thousands of…
48 CFR 515.305-70 - Use of outside evaluators.
Code of Federal Regulations, 2010 CFR
2010-10-01
... CONTRACTING METHODS AND CONTRACT TYPES CONTRACTING BY NEGOTIATION Source Selection 515.305-70 Use of outside... evaluators. (2) Each evaluator will sign and provide to GSA a “Conflict of Interest Acknowledgment and Nondisclosure Agreement.” Figure 515.3-1—Conflict of Interest Acknowledgment and Nondisclosure Agreement...
Salvarani, Carlo; Girolomoni, Giampiero; Di Lernia, Vito; Gisondi, Paolo; Tripepi, Giovanni; Egan, Colin Gerard; Marchesoni, Antonio
2016-12-01
To evaluate the impact of training on the reliability among dermatologists and rheumatologists in the assessment of psoriatic arthritis (PsA) patients. Overall, 9 hospital-based rheumatologists and 8 hospital-based dermatologists met in Reggio Emilia, Italy on October 2015 to assess 17 PsA patients. After 1 month, physicians underwent a 3-h training session by 4 recognized experts and then assessed 19 different PsA patients according to a modified Latin square design. Measures included tender (TJC) and swollen joint count (SJC), dactylitis, enthesitis, Schober test, psoriasis body surface area (BSA), Psoriasis Area and Severity Index (PASI), Nail Psoriasis Severity Index (NAPSI), and static physician's global assessment of PsA disease activity (sPGA). Variance components analyses were performed to estimate the intraclass correlation coefficient (ICC). TJC and enthesitis-measured pre-training by dermatologists or rheumatologists revealed moderate-substantial agreement (ICC: 0.4-0.8). In contrast, SJC and Schober test showed fair (ICC: 0.2-0.4) and moderate agreement, respectively (ICC: 0.4-0.6), while poor agreement (ICC: 0-0.2) was represented by dactylitis. Moderate-substantial (ICC: 0.4-0.8) agreement was observed for most skin measures by dermatologists and rheumatologists, apart from BSA, where fair agreement (ICC: 0.2-0.4) was observed. Agreement levels were similar before and after training for arthritis measures. In contrast, levels of agreement after training for 3 of the 4 skin measures were increased for dermatologists and all 4 skin measures were increased for rheumatologists. Substantial to excellent agreement was observed for TJC, enthesitis, PASI, and sPGA. Rheumatologists benefited from training to a greater extent. Copyright © 2016 Elsevier Inc. All rights reserved.
Caldwell, Michelle; Dickerhoof, Erica; Hall, Anastasia; Odakura, Bryan; Fanchiang, Hsin-Chen
2014-01-01
Objective. To describe and analyze the potential use of games in the commercially available EyeToy Play and EyeToy Play 2 on required/targeted training skills and feedback provided for clinical application. Methods. A summary table including all games was created. Two movement experts naïve to the software validated required/targeted training skills and feedback for 10 randomly selected games. Ten healthy school-aged children played to further validate the required/targeted training skills. Results. All but two (muscular and cardiovascular endurance) had excellent agreement in required/targeted training skills, and there was 100% agreement on feedback. Children's performance in required/targeted training skills (number of unilateral reaches and bilateral reaches, speed, muscular endurance, and cardiovascular endurance) significantly differed between games (P < .05). Conclusion. EyeToy Play games could be used to train children's arm function. However, a careful evaluation of the games is needed since performance might not be consistent between players and therapists' interpretation. PMID:25610652
Validation of a Monte Carlo simulation of the Inveon PET scanner using GATE
NASA Astrophysics Data System (ADS)
Lu, Lijun; Zhang, Houjin; Bian, Zhaoying; Ma, Jianhua; Feng, Qiangjin; Chen, Wufan
2016-08-01
The purpose of this study is to validate the application of GATE (Geant4 Application for Tomographic Emission) Monte Carlo simulation toolkit in order to model the performance characteristics of Siemens Inveon small animal PET system. The simulation results were validated against experimental/published data in accordance with the NEMA NU-4 2008 protocol for standardized evaluation of spatial resolution, sensitivity, scatter fraction (SF) and noise equivalent counting rate (NECR) of a preclinical PET system. An agreement of less than 18% was obtained between the radial, tangential and axial spatial resolutions of the simulated and experimental results. The simulated peak NECR of mouse-size phantom agreed with the experimental result, while for the rat-size phantom simulated value was higher than experimental result. The simulated and experimental SFs of mouse- and rat- size phantom both reached an agreement of less than 2%. It has been shown the feasibility of our GATE model to accurately simulate, within certain limits, all major performance characteristics of Inveon PET system.
Lobb, Eric C
2016-07-08
Version 6.3 of the RITG148+ software package offers eight automated analysis routines for quality assurance of the TomoTherapy platform. A performance evaluation of each routine was performed in order to compare RITG148+ results with traditionally accepted analysis techniques and verify that simulated changes in machine parameters are correctly identified by the software. Reference films were exposed according to AAPM TG-148 methodology for each routine and the RITG148+ results were compared with either alternative software analysis techniques or manual analysis techniques in order to assess baseline agreement. Changes in machine performance were simulated through translational and rotational adjustments to subsequently irradiated films, and these films were analyzed to verify that the applied changes were accurately detected by each of the RITG148+ routines. For the Hounsfield unit routine, an assessment of the "Frame Averaging" functionality and the effects of phantom roll on the routine results are presented. All RITG148+ routines reported acceptable baseline results consistent with alternative analysis techniques, with 9 of the 11 baseline test results showing agreement of 0.1mm/0.1° or better. Simulated changes were correctly identified by the RITG148+ routines within approximately 0.2 mm/0.2° with the exception of the Field Centervs. Jaw Setting routine, which was found to have limited accuracy in cases where field centers were not aligned for all jaw settings due to inaccurate autorotation of the film during analysis. The performance of the RITG148+ software package was found to be acceptable for introduction into our clinical environment as an automated alternative to traditional analysis techniques for routine TomoTherapy quality assurance testing.
Schalasta, Gunnar; Börner, Anna; Speicher, Andrea; Enders, Martin
2018-03-28
Proper management of patients with chronic hepatitis B virus (HBV) infection requires monitoring of plasma or serum HBV DNA levels using a highly sensitive nucleic acid amplification test. Because commercially available assays differ in performance, we compared herein the performance of the Hologic Aptima HBV Quant assay (Aptima) to that of the Roche Cobas TaqMan HBV test for use with the high pure system (HPS/CTM). Assay performance was assessed using HBV reference panels as well as plasma and serum samples from chronically HBV-infected patients. Method correlation, analytical sensitivity, precision/reproducibility, linearity, bias and influence of genotype were evaluated. Data analysis was performed using linear regression, Deming correlation analysis and Bland-Altman analysis. Agreement between the assays for the two reference panels was good, with a difference in assay values vs. target <0.5 log. Qualitative assay results for 159 clinical samples showed good concordance (88.1%; κ=0.75; 95% confidence interval: 0.651-0.845). For the 106 samples quantitated by both assays, viral load results were highly correlated (R=0.92) and differed on average by 0.09 log, with 95.3% of the samples being within the 95% limit of agreement of the assays. Linearity for viral loads 1-7 log was excellent for both assays (R2>0.98). The two assays had similar bias and precision across the different genotypes tested at low viral loads (25-1000 IU/mL). Aptima has a performance comparable with that of HPS/CTM, making it suitable for use for HBV infection monitoring. Aptima runs on a fully automated platform (the Panther system) and therefore offers a significantly improved workflow compared with HPS/CTM.
49 CFR 23.9 - What are the nondiscrimination and assurance requirements of this part for recipients?
Code of Federal Regulations, 2010 CFR
2010-10-01
..., national origin, or sex in connection with the award or performance of any concession agreement, management... Transportation PARTICIPATION OF DISADVANTAGED BUSINESS ENTERPRISE IN AIRPORT CONCESSIONS General § 23.9 What are... performance of any concession agreement, management contract or subcontract, purchase or lease agreement, or...
Hirai, T; Kitajima, M; Nakamura, H; Okuda, T; Sasao, A; Shigematsu, Y; Utsunomiya, D; Oda, S; Uetani, H; Morioka, M; Yamashita, Y
2011-12-01
QUASAR is a particular application of the ASL method and facilitates the user-independent quantification of brain perfusion. The purpose of this study was to assess the intermodality agreement of TBF measurements obtained with ASL and DSC MR imaging and the inter- and intraobserver reproducibility of glioma TBF measurements acquired by ASL at 3T. Two observers independently measured TBF in 24 patients with histologically proved glioma. ASL MR imaging with QUASAR and DSC MR imaging were performed on 3T scanners. The observers placed 5 regions of interest in the solid tumor on rCBF maps derived from ASL and DSC MR images and 1 region of interest in the contralateral brain and recorded the measured values. Maximum and average sTBF values were calculated. Intermodality and intra- and interobsever agreement were determined by using 95% Bland-Altman limits of agreement and ICCs. The intermodality agreement for maximum sTBF was good to excellent on DSC and ASL images; ICCs ranged from 0.718 to 0.884. The 95% limits of agreement ranged from 59.2% to 65.4% of the mean. ICCs for intra- and interobserver agreement for maximum sTBF ranged from 0.843 to 0.850 and from 0.626 to 0.665, respectively. The reproducibility of maximum sTBF measurements obtained by methods was similar. In the evaluation of sTBF in gliomas, ASL with QUASAR at 3T yielded measurements and reproducibility similar to those of DSC perfusion MR imaging.
Uribarri-Gonzalez, Laura; Keane, Margaret G; Pereira, Stephen P; Iglesias-García, Julio; Dominguez-Muñoz, J Enrique; Lariño-Noia, Jose
2018-03-01
To evaluate the agreement between the imaging modalities MRI-MRCP and EUS in cystic lesions of the pancreas which were thought to be a BD-IPMN. Multicenter retrospective study included all patients between 2010 and 2015 with a suspected BD-IPMN who underwent an EUS and MRI-MRCP within 6 months or less of each other. Location, number, size, worrisome features and high-risk stigmata were evaluated. Interobserver agreement was evaluated by Kappa score. 173 patients were included (97 UHSC, 76 UCLH-RFH), mean age 65 (range 25-87 years), 66 males. When comparing both modalities there was good agreement for the location of the cyst. The median lesion size was larger by MRI-MRCP than EUS although it was not significant. With regards to worrisome features, there was moderate agreement for main PD of 5-9 mm and abrupt change (k = 0.45 and 0.52). Fair agreement was seen for the cyst wall thickening (k = 0.25). No agreement was seen between the presence of non-enhanced mural nodules or lymphadenopathy (k < 0). With regards to high-risk stigmata, poor agreement was obtained for the detection of an enhanced solid component (k = 0.12). No agreement was observed for main PD > 10 mm (k < 0). In this multicentre study of patients with a BD-IPMN under active surveillance, most disagreement between these modalities was seen in the proximal pancreas. There was generally only minimal concordance between the imaging findings of EUS and MRI-MRCP for the detection of high-risk stigmata and worrisome features. Copyright © 2018 IAP and EPC. All rights reserved.
Chacón, Jose
2010-01-01
Drug efficacy might differ from clinical trial results when performed in clinical daily conditions. Therefore, it is mandatory to conduct trials about effectiveness to improve external validity. This post-authorization, open-label, noncontrolled, prospective, multicenter, observational, and naturalistic trial was designed to search for factors influencing the racecadotril overall effect on childhood acute watery diarrhea in a real-world setting of Venezuela. There were 3,873 children with acute watery diarrhea treated with racecadotril, an enkephalin breakdown blocker plus oral rehydration therapy by 97 pediatricians. Evaluations were carried out daily until emission of two consecutive formed stools or absence of watery bowel movements for 24 hours. The primary end-point was time-to-relief, defined as the time from first racecadotril dose to the last watery bowel movement time. Age, gender, nursing type, nursing status during diarrhea, diarrhea severity, and co-medication were considered as factors in the statistical analysis. The primary end-point was evaluated by factors using UNIANOVA, and post-hoc tests were done. A multiple regression analysis was carried out to identify factors affecting drug performance, racecadotril effectiveness and tolerability overall assessment was searched by physicians and patients, and inter-observer agreement was evaluated by kappa statistics. The mean time-to-relief was 18.5 ± 12.5 hours [95% confidence interval 17.9–19.0] and the diarrhea severity was the only variable with significant and independent weight on racecadotril effectiveness explaining 23% of time-to-relief variance, but even in severe diarrhea cases this time was less than 24 hours. High agreement about satisfactory perception on effectiveness and tolerability was reached among physicians and patients. In conclusion, the racecadotril overall effect, evaluated in a real-world setting of Venezuela, was in agreement with results of some earlier controlled trials. It was only influenced by severity of diarrhea episode, as well as being considered an effective and well tolerated treatment by physicians and patients. PMID:20668711
Validation of the Italian version of the Coma Recovery Scale-Revised (CRS-R).
Sacco, Simona; Altobelli, Emma; Pistarini, Caterina; Cerone, Davide; Cazzulani, Benedetta; Carolei, Antonio
2011-01-01
To validate the Italian version of the Coma Recovery Scale-Revised (CRS-R). Two observers applied the Italian version of the CRS-R to selected patients. On day 1, observer A and B independently scored each patient; the comparison of their observations was used to evaluate inter-observer agreement. On day 2, observer A completed a second evaluation and the comparison of this observation with that obtained on day 1 by the same observer was used to evaluate test-re-test agreement. For each evaluation, also diagnostic impression (vegetative state/minimally conscious state) was reported. Thirty-eight patients were evaluated (mean age ± SD, 58.9 ± 13.8 years). Inter-observer (ρ = 0.81; p < 0.001) as well as test-re-test agreement (ρ = 0.97; p < 0.001) for the total score was high. Inter-observer agreement was excellent for the communication sub-scale, good for the auditory, visual and motor sub-scales and moderate for the oromotor/verbal and arousal sub-scales. Test-re-test agreement was excellent for the visual, motor, oromotor/verbal and communication sub-scales, good for the auditory sub-scale and moderate for the arousal sub-scale. When considering the diagnostic impression, inter-observer agreement was good (κ = 0.75; p < 0.001) and test-re-test agreement was excellent (κ = 0.92; p < 0.001). The Italian version of the CRS-R can be administered reliably and can be also employed to discriminate patients in vegetative and in minimally conscious state.
Ryoo, Inseon; Suh, Sangil; Lee, Young Hen; Seo, Hyung Suk; Seol, Hae Young; Woo, Jeong-Soo; Kim, Soo Chin
2018-03-01
Pleomorphic adenomas and Warthin tumors are the most common salivary gland tumors. It is important to differentiate between them because at least a partial parotidectomy is necessary for pleomorphic adenomas, whereas enucleation is sufficient for Warthin tumors. This study aimed to evaluate the usefulness of vascular pattern analysis using microvascular sonography to differentiate between the tumors. Sixty-two patients with pathologically proven pleomorphic adenomas (n = 38) and Warthin tumors (n = 24) were included. For all tumors, grayscale, power Doppler, and microvascular sonographic examinations were performed. Differences in vascular patterns (vascular distribution and internal vascularity) on power Doppler and microvascular sonography as well as grayscale sonographic features (size, shape, border, echogenicity, heterogeneity, and cystic change) between pleomorphic adenomas and Warthin tumors were evaluated. A comparison of diagnostic performances of grayscale sonography with power Doppler sonography and grayscale sonography with microvascular sonography was performed. The level of interobserver agreement between 2 reviewers in diagnosing tumors was evaluated. No grayscale sonographic features showed a significant difference between the tumors. Vascular distributions and internal vascularity on power Doppler sonography (P = .01 and .002) and microvascular sonography (both P < .001) were all significantly different. The diagnostic accuracy of grayscale sonography with microvascular sonography (79.0%) was higher than that of grayscale sonography with power Doppler sonography (72.6%). This difference was significant according to the McNemar test (P = .004). Interobserver agreement was excellent in diagnosing tumors on both grayscale sonography with power Doppler sonography (κ = 0.83) and grayscale sonography with microvascular sonography (κ = 0.94). Vascular pattern analysis using microvascular sonography with other sonographic features is helpful for differentiating between pleomorphic adenomas and Warthin tumors. © 2017 by the American Institute of Ultrasound in Medicine.
Magnetic particle imaging for in vivo blood flow velocity measurements in mice
NASA Astrophysics Data System (ADS)
Kaul, Michael G.; Salamon, Johannes; Knopp, Tobias; Ittrich, Harald; Adam, Gerhard; Weller, Horst; Jung, Caroline
2018-03-01
Magnetic particle imaging (MPI) is a new imaging technology. It is a potential candidate to be used for angiographic purposes, to study perfusion and cell migration. The aim of this work was to measure velocities of the flowing blood in the inferior vena cava of mice, using MPI, and to evaluate it in comparison with magnetic resonance imaging (MRI). A phantom mimicking the flow within the inferior vena cava with velocities of up to 21 cm s‑1 was used for the evaluation of the applied analysis techniques. Time–density and distance–density analyses for bolus tracking were performed to calculate flow velocities. These findings were compared with the calibrated velocities set by a flow pump, and it can be concluded that velocities of up to 21 cm s‑1 can be measured by MPI. A time–density analysis using an arrival time estimation algorithm showed the best agreement with the preset velocities. In vivo measurements were performed in healthy FVB mice (n = 10). MRI experiments were performed using phase contrast (PC) for velocity mapping. For MPI measurements, a standardized injection of a superparamagnetic iron oxide tracer was applied. In vivo MPI data were evaluated by a time–density analysis and compared to PC MRI. A Bland–Altman analysis revealed good agreement between the in vivo velocities acquired by MRI of 4.0 ± 1.5 cm s‑1 and those measured by MPI of 4.8 ± 1.1 cm s‑1. Magnetic particle imaging is a new tool with which to measure and quantify flow velocities. It is fast, radiation-free, and produces 3D images. It therefore offers the potential for vascular imaging.
Schlaeger, Sarah; Klupp, Elisabeth; Weidlich, Dominik; Cervantes, Barbara; Foreman, Sarah C; Deschauer, Marcus; Schoser, Benedikt; Katemann, Christoph; Kooijman, Hendrik; Rummeny, Ernst J; Zimmer, Claus; Kirschke, Jan S; Karampinos, Dimitrios C
2018-04-02
The assessment of fatty infiltration and edema in the musculature of patients with neuromuscular diseases (NMDs) typically requires the separate performance of T1-weighted and fat-suppressed T2-weighted sequences. T2-weighted Dixon turbo spin echo (TSE) enables the generation of T2-weighted fat- and water-separated images, which can be used to assess both pathologies simultaneously. The present study examines the diagnostic performance of T2-weighted Dixon TSE compared with the standard sequences in 10 patients with NMDs and 10 healthy subjects. Whole-body magnetic resonance imaging was performed including T1-weighted Dixon fast field echo, T2-weighted short-tau inversion recovery, and T2-weighted Dixon TSE. Fatty infiltration and intramuscular edema were rated by 2 radiologists using visual semiquantitative rating scales. To assess intermethod and interrater agreement, weighted Cohen's κ coefficients were calculated. The ratings of fatty infiltration showed high intermethod and high interrater agreement (T1-weighted Dixon fast field echo vs T2-weighted Dixon TSE fat image). The evaluation of edematous changes showed high intermethod and good interrater agreement (T2-weighted short-tau inversion recovery vs T2-weighted Dixon TSE water image). T2-weighted Dixon TSE imaging is an alternative for accelerated simultaneous grading of whole-body skeletal muscle fat infiltration and edema in patients with NMDs.
Muir-Hunter, Susan W; Graham, Laura; Montero Odasso, Manuel
2015-08-01
To measure test-retest and interrater reliability of the Berg Balance Scale (BBS) in community-dwelling adults with mild to moderate Alzheimer disease (AD). Method : A sample of 15 adults (mean age 80.20 [SD 5.03] years) with AD performed three balance tests: the BBS, timed up-and-go test (TUG), and Functional Reach Test (FRT). Both relative reliability, using the intra-class correlation coefficient (ICC), and absolute reliability, using standard error of measurement (SEM) and minimal detectable change (MDC95) values, were calculated; Bland-Altman plots were constructed to evaluate inter-tester agreement. The test-retest interval was 1 week. Results : For the BBS, relative reliability values were 0.95 (95% CI, 0.85-0.98) for test-retest reliability and 0.72 (95% CI, 0.31-0.91) for interrater reliability; SEM was 6.01 points and MDC95 was 16.66 points; and interrater agreement was 16.62 points. The BBS performed better in test-retest reliability than the TUG and FRT, tests with established reliability in AD. Between 33% and 50% of participants required cueing beyond standardized instructions because they were unable to remember test instructions. Conclusions : The BBS achieved relative reliability values that support its clinical utility, but MDC95 and agreement values indicate the scale has performance limitations in AD. Further research to optimize balance assessment for people with AD is required.
Evaluation of equations that estimate glomerular filtration rate in renal transplant recipients.
De Alencastro, M G; Veronese, F V; Vicari, A R; Gonçalves, L F; Manfro, R C
2014-03-01
The accuracy of equations that estimate the glomerular filtration rate (GFR) in renal transplant patients has not been established; thus their performance was assessed in stable renal transplant patients. Renal transplant patients (N.=213) with stable graft function were enrolled. The Chronic Kidney Disease Epidemiology Collaboration (CKD-EPI) equation was used as the reference method and compared with the Cockcroft-Gault (CG), Modification of Diet in Renal Disease (MDRD), Mayo Clinic (MC) and Nankivell equations. Bias, accuracy and concordance rates were determined for all equation relative to CKD-EPI. Mean estimated GFR values of the equations differed significantly from the CKD-EPI values, though the correlations with the reference method were significant. Values of MDRD differed from the CG, MC and Nankivell estimations. The best agreement to classify the chronic kidney disease (CKD) stages was for the MDRD (Kappa=0.649, P<0.001), and for the other equations the agreement was moderate. The MDRD had less bias and narrower agreement limits but underestimated the GFR at levels above 60 mL/min/1.73 m2. Conversely, the CG, MC and Nankivell equations overestimated the GFR, and the Nankivell equation had the worst performance. The MDRD equation P15 and P30 values were higher than those of the other equations (P<0.001). Despite their correlations, equations estimated the GFR and CKD stage differently. The MDRD equation was the most accurate, but the sub-optimal performance of all the equations precludes their accurate use in clinical practice.
van de Pol, Daan; Zacharian, Tigran; Maas, Mario; Kuijer, P Paul F M
2017-06-01
The Shoulder posterior circumflex humeral artery Pathology and digital Ischemia - questionnaire (SPI-Q) has been developed to enable periodic surveillance of elite volleyball players, who are at risk for digital ischemia. Prior to implementation, assessing reliability is mandatory. Therefore, the test-retest reliability and agreement of the SPI-Q were evaluated among the population at risk. A questionnaire survey was performed with a 2-week interval among 65 elite male volleyball players assessing symptoms of cold, pale and blue digits in the dominant hand during or after practice or competition using a 4-point Likert scale (never, sometimes, often and always). Kappa (κ) and percentage of agreement (POA) were calculated for individual symptoms, and to distinguish symptomatic and asymptomatic players. For the individual symptoms, κ ranged from "poor" (0.25) to "good" (0.63), and POA ranged from "moderate" (78%) to "good" (97%). To classify symptomatic players, the SPI-Q showed "good" reliability (κ = 0.83; 95%CI 0.69-0.97) and "good" agreement (POA = 92%). The current study has proven the SPI-Q to be reliable for detecting elite male indoor volleyball players with symptoms of digital ischemia.
Santana, Jamilly C V; Santos, Victor S; Gurgel, Ricardo Q; Santana, Julianne C V; Reis, Francisco P; Cuevas, Luis E; Feitosa, Vera L C
2016-10-05
Neuropathic pain (NP) often occurs during the course of leprosy, and screening tools to differentiate NP from non-NP are often used. However, their performance varies in different settings. The most frequently used scales are the Douleur Neuropathique in 4 questions (DN4) and the Leeds assessment of neuropathic symptoms and signs (LANSS) questionnaires. Thus, we conducted a study to evaluate the agreement between DN4 and LANSS questionnaires to classify NP in 195 leprosy patients attending two reference centers in Sergipe, Brazil. The DN4 and LANSS classified 166 and 110 patients, respectively, as having NP. One hundred and seven (54.8%) were classified as NP by both questionnaires; 59 (30.2%) solely by the DN4 questionnaire and three (1.5%) solely by the LANSS. The agreement of the questionnaires was 66.2% (weak agreement, Kappa = 0.30). Although both questionnaires identified a high proportion of NP, the development of more robust instruments is necessary to ensure the accuracy of diagnosis of leprosy patients classified as having NP. © The American Society of Tropical Medicine and Hygiene.
Nazerian, Peiman; Vanni, Simone; Morello, Fulvio; Castelli, Matteo; Ottaviani, Maddalena; Casula, Claudia; Petrioli, Alessandra; Bartolucci, Maurizio; Grifoni, Stefano
2015-05-01
The diagnostic performance of transthoracic focused cardiac ultrasound (FoCUS) performed by emergency physicians (EP) to estimate ascending aorta dimensions in the acute setting has not been prospectively studied. The diagnostic accuracy and the interobserver variability of EP-performed FoCUS were investigated to estimate thoracic aortic dilation and aneurysm compared with the results of computed tomography angiography (CTA). This was a prospective single-center cohort study of a convenience sample of patients who underwent CTA in the emergency department for suspected aortic pathology. FoCUS was performed before CTA, and the maximum ascending aorta diameter evaluated in parasternal long-axis view. Aorta diameter < 40 mm by visual estimation or by diameter measurement was considered normal. Measurements were recorded in all patients with aorta diameter ≥ 40 mm. Diagnostic accuracy of FoCUS for detection of aortic dilation (diameter ≥ 40 mm) and aneurysm (diameter ≥ 45 mm) were calculated considering the CTA result as reference standard. In a subgroup of patients, a second EP-sonographer performed FoCUS to evaluate interobserver agreement for the diagnosis of ascending aorta dilation. A total of 140 patients were enrolled in the study. Ascending aorta dilation and aneurysm were detected with FoCUS in 50 (35.7%) and in 27 (17.8%) patients, respectively. Sensitivity and specificity of FoCUS were 78.6% (95% confidence interval [CI] = 65.6% to 88.4%) and 92.9% (95% CI = 85.1% to 97.3%), respectively, for ascending aorta dilation and 64.7% (95% CI = 46.5% to 80.2%) and 95.3% (95% CI = 89.3% to 98.4%), respectively, for ascending aorta aneurysm. Interobserver agreement of FoCUS was k = 0.82. FoCUS performed by EP is specific for ascending aorta dilation and aneurysm when compared to CTA and appears a reproducible technique. © 2015 by the Society for Academic Emergency Medicine.
Marfeo, Elizabeth E; Ni, Pengsheng; Chan, Leighton; Rasch, Elizabeth K; Jette, Alan M
2014-07-01
The goal of this article was to investigate optimal functioning of using frequency vs. agreement rating scales in two subdomains of the newly developed Work Disability Functional Assessment Battery: the Mood & Emotions and Behavioral Control scales. A psychometric study comparing rating scale performance embedded in a cross-sectional survey used for developing a new instrument to measure behavioral health functioning among adults applying for disability benefits in the United States was performed. Within the sample of 1,017 respondents, the range of response category endorsement was similar for both frequency and agreement item types for both scales. There were fewer missing values in the frequency items than the agreement items. Both frequency and agreement items showed acceptable reliability. The frequency items demonstrated optimal effectiveness around the mean ± 1-2 standard deviation score range; the agreement items performed better at the extreme score ranges. Findings suggest an optimal response format requires a mix of both agreement-based and frequency-based items. Frequency items perform better in the normal range of responses, capturing specific behaviors, reactions, or situations that may elicit a specific response. Agreement items do better for those whose scores are more extreme and capture subjective content related to general attitudes, behaviors, or feelings of work-related behavioral health functioning. Copyright © 2014 Elsevier Inc. All rights reserved.
Constellation X-Ray Mission and Support
NASA Technical Reports Server (NTRS)
Tananbaum, H.; Grady, Jean (Technical Monitor)
2005-01-01
This Final Report summarizes work performed by the Smithsonian Astrophysical Observatory (SAO) for NASA Goddard Space Flight Center (GSFC) under Cooperative Agreement NCC5-368. The Agreement is entitled "Constellation X-ray Mission Study and Support." The report covers the full duration of the Agreement which ran from October 1,1998 to October 14,2004. Included in the report is a description of previously unreported work that was performed between October 2003 and the end of the Agreement. For convenience, the previously unreported work is covered first in Section 2.0. Then, an overall summary of all work performed under the Agreement is presented in Section 3. Section 4.0 contains a list of all formal reports that SAO has submitted to GSFC along with publications and presentations at various conferences.
Traveling-wave piezoelectric linear motor part II: experiment and performance evaluation.
Ting, Yung; Li, Chun-Chung; Chen, Liang-Chiang; Yang, Chieh-Min
2007-04-01
This article continues the discussion of a traveling-wave piezoelectric linear motor. Part I of this article dealt with the design and analysis of the stator of a traveling-wave piezoelectric linear motor. In this part, the discussion focuses on the structure and modeling of the contact layer and the carriage. In addition, the performance analysis and evaluation of the linear motor also are dealt with in this study. The traveling wave is created by stator, which is constructed by a series of bimorph actuators arranged in a line and connected to form a meander-line structure. Analytical and experimental results of the performance are presented and shown to be almost in agreement. Power losses due to friction and transmission are studied and found to be significant. Compared with other types of linear motors, the motor in this study is capable of supporting heavier loads and provides a larger thrust force.
Tao, Lingyan; Zhang, Qing; Wu, Yongjiang; Liu, Xuesong
2016-12-01
In this study, a fast and effective high-performance liquid chromatography method was developed to obtain a fingerprint chromatogram and quantitative analysis simultaneously of four indexes including gallic acid, chlorogenic acid, albiflorin and paeoniflorin of the traditional Chinese medicine Moluodan Concentrated Pill. The method was performed by using a Waters X-bridge C 18 reversed phase column on an Agilent 1200S high-performance liquid chromatography system coupled with diode array detection. The mobile phase of the high-performance liquid chromatography method was composed of 20 mmol/L phosphate solution and acetonitrile with a 1 mL/min eluent velocity, under a detection temperature of 30°C and a UV detection wavelength of 254 nm. After the methodology validation, 16 batches of Moluodan Concentrated Pill were analyzed by this high-performance liquid chromatography method and both qualitative and quantitative evaluation results were achieved by similarity analysis, principal component analysis and hierarchical cluster analysis. The results of these three chemometrics were in good agreement and all indicated that batch 10 and batch 16 showed significant differences with the other 14 batches. This suggested that the developed high-performance liquid chromatography method could be applied in the quality evaluation of Moluodan Concentrated Pill. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
41 CFR 60-741.62 - Conciliation agreements.
Code of Federal Regulations, 2010 CFR
2010-07-01
... 41 Public Contracts and Property Management 1 2010-07-01 2010-07-01 true Conciliation agreements... Conciliation agreements. If a compliance evaluation, complaint investigation or other review by OFCCP finds a... consideration of formal enforcement) is appropriate, a written conciliation agreement will be required. The...
41 CFR 60-741.62 - Conciliation agreements.
Code of Federal Regulations, 2011 CFR
2011-07-01
... 41 Public Contracts and Property Management 1 2011-07-01 2009-07-01 true Conciliation agreements... Conciliation agreements. If a compliance evaluation, complaint investigation or other review by OFCCP finds a... consideration of formal enforcement) is appropriate, a written conciliation agreement will be required. The...
Pillai, Roshni; Yathiraj, Asha
2017-09-01
The study evaluated whether there exists a difference/relation in the way four different memory skills (memory score, sequencing score, memory span, & sequencing span) are processed through the auditory modality, visual modality and combined modalities. Four memory skills were evaluated on 30 typically developing children aged 7 years and 8 years across three modality conditions (auditory, visual, & auditory-visual). Analogous auditory and visual stimuli were presented to evaluate the three modality conditions across the two age groups. The children obtained significantly higher memory scores through the auditory modality compared to the visual modality. Likewise, their memory scores were significantly higher through the auditory-visual modality condition than through the visual modality. However, no effect of modality was observed on the sequencing scores as well as for the memory and the sequencing span. A good agreement was seen between the different modality conditions that were studied (auditory, visual, & auditory-visual) for the different memory skills measures (memory scores, sequencing scores, memory span, & sequencing span). A relatively lower agreement was noted only between the auditory and visual modalities as well as between the visual and auditory-visual modality conditions for the memory scores, measured using Bland-Altman plots. The study highlights the efficacy of using analogous stimuli to assess the auditory, visual as well as combined modalities. The study supports the view that the performance of children on different memory skills was better through the auditory modality compared to the visual modality. Copyright © 2017 Elsevier B.V. All rights reserved.
Weidinger, Paolina; Nilsson, J Lars G; Lindblad, Ulf
2009-05-01
To study the clinical evaluation and treatment of patients with asthma and COPD in primary care in Sweden, with a focus on adherence to recommended guidelines and quality indicators. All visits at health care centres in Skaraborg, Sweden, are documented in computerized medical records constituting the Skaraborg Primary Care Database (SPCD). In a register-based retrospective observational study, all patients diagnosed with asthma or COPD during 2000-2005 (n = 12,328) were identified. In a 5% random sample (n = 623), information on performed investigations at initial visits and at follow-up during 2004-2005 was collected. Compliance with procedures as recommended by national guidelines was used for quality assessment. Among 499 patients with asthma, 167 (33%) were investigated with spirometry or Peak Expiratory Flow (PEF) during initial visits in agreement with guidelines. Correspondingly, 40 out of 124 patients with COPD (32%) were investigated with spirometry. During follow-up, evaluation in agreement with guidelines was performed in 130 (60%) of patients with asthma and in 35 patients out of 77 (45%) with COPD. Prescribing of ICS reached quality target, still every second patient made an acute visit during follow-up. Adherence to recommended guidelines in asthma/COPD was low. Acute visits were common and despite the prescribing of ICS according to recommendations, patients still seem uncontrolled in their disease. There is a need for quality improvement in the clinical evaluation and treatment of patients with asthma and COPD.
46 CFR 8.130 - Agreement conditions.
Code of Federal Regulations, 2010 CFR
2010-10-01
... 46 Shipping 1 2010-10-01 2010-10-01 false Agreement conditions. 8.130 Section 8.130 Shipping COAST... ALTERNATIVES General § 8.130 Agreement conditions. (a) Delegated functions performed by, and statutory... provisions of its agreement with the Commandant. Any agreement between the Commandant and a recognized...
46 CFR 8.130 - Agreement conditions.
Code of Federal Regulations, 2012 CFR
2012-10-01
... 46 Shipping 1 2012-10-01 2012-10-01 false Agreement conditions. 8.130 Section 8.130 Shipping COAST... ALTERNATIVES General § 8.130 Agreement conditions. (a) Delegated functions performed by, and statutory... provisions of its agreement with the Commandant. Any agreement between the Commandant and a recognized...
46 CFR 8.130 - Agreement conditions.
Code of Federal Regulations, 2011 CFR
2011-10-01
... 46 Shipping 1 2011-10-01 2011-10-01 false Agreement conditions. 8.130 Section 8.130 Shipping COAST... ALTERNATIVES General § 8.130 Agreement conditions. (a) Delegated functions performed by, and statutory... provisions of its agreement with the Commandant. Any agreement between the Commandant and a recognized...
Bergknut, Niklas; Grinwis, Guy; Pickee, Emile; Auriemma, Edoardo; Lagerstedt, Anne-Sofie; Hagman, Ragnvi; Hazewinkel, Herman A W; Meij, Björn P
2011-07-01
To evaluate the reliability of the Thompson system for use in grading the gross pathological changes of intervertebral disk (IVD) degeneration in dogs and to investigate the agreement between gross pathological findings and low-field (0.2-T) magnetic resonance imaging (MRI) findings. Vertebral columns from cadavers of 19 dogs of various ages, breeds, and origins. 182 intervertebral segments were collected from 19 canine cadavers. Sagittal T2-weighted MRI of the T11 through S1 portion of the vertebral column was performed within 24 hours after the dogs were euthanized. The vertebral columns were subsequently divided in the midsagittal plane, and high-resolution photographs were obtained of each intervertebral segment (end plate-disk-end plate). The MRI images and photographs were graded separately in a blinded manner by 4 observers who used both Pfirrmann and Thompson grading criteria. The interobserver agreement for Thompson scores ranged from 0.76 to 0.88, and the intraobserver agreement ranged from 0.88 to 0.94 (Cohen weighted κ analysis). Agreement between scores for the Pfirrmann and Thompson grading criteria was κ = 0.70. Grading of IVD degeneration in dogs by use of the Thompson system resulted in high interobserver and intraobserver agreement, and scores for the Thompson system had substantial agreement with low-field MRI findings graded by use of the Pfirrmann system. This suggested that low-field MRI can be used to diagnose IVD degeneration in dogs.
Colosi, Ioana A; Faure, Odile; Dessaigne, Bérangére; Bourdon, Cécile; Lebeau, Bernadette; Colosi, Horaţiu A; Pelloux, Hervé
2012-05-01
We compared the E-test method to that of the Neo-Sensitabs tablet diffusion assay for evaluating the in vitro susceptibility of 100 clinical isolates of filamentous fungi (Aspergillus spp., Fusarium spp., Scedosporium spp., zygomycetes and other molds) to amphotericin B, itraconazole, voriconazole, caspofungin, and posaconazole. We determined the categorical agreement level between E-test minimum inhibitory concentrations (MIC) and tablet end-points, as opposed to the following disagreement parameters: very major error - resistant parameter (R) in E-test and susceptible (S) in tablet; major error - S by E-test and R by tablet; minor error - shifts between S and susceptible dose-dependent (S-DD) or S-DD and R. We also performed linear regression analyses and computed Pearson's correlation coefficients (R values) between the log transforms of MICs and the inhibition zone diameters of the five studied antifungal agents. For itraconazole we obtained 97% categorical agreement and R = -0.727. Categorical agreement for caspofungin and voriconazole was 96% and R =-0.821 and R = -0.789, respectively. For posaconazole the categorical agreement was 94% and R =-0.743. Amphotericin B exhibited a lower degree of agreement (76%, R = -0.672), especially in studies of Aspergillus spp. Our results suggest a potential value of the Neo-Sensitabs assay for in vitro susceptibility testing of molds to itraconazole, voriconazole, caspofungin and posaconazole, while amphotericin B exhibited an overall lower degree of agreement.
Simms, Leonard J; Zelazny, Kerry; Yam, Wern How; Gros, Daniel F
2010-05-01
Little attention typically is paid to the way self-report measures are translated for use in self-informant agreement studies. We studied two possible methods for creating informant measures: (a) the traditional method in which self-report items were translated from the first- to the third-person and (b) an alternative meta-perceptual method in which informants were directed to rate their perception of the targets' self-perception. We hypothesized that the latter method would yield stronger self-informant agreement for evaluative personality dimensions measured by indirect item markers. We studied these methods in a sample of 303 undergraduate friendship dyads. Results revealed mean-level differences between methods, similar self-informant agreement across methods, stronger agreement for Big Five dimensions than for evaluative dimensions, and incremental validity for meta-perceptual informant rating methods. Limited power reduced the interpretability of several sparse acquaintanceship effects. We conclude that traditional informant methods are appropriate for most personality traits, but meta-perceptual methods may be more appropriate when personality questionnaire items reflect indirect indicators of the trait being measured, which is particularly likely for evaluative traits.
Simms, Leonard J.; Zelazny, Kerry; Yam, Wern How; Gros, Daniel F.
2011-01-01
Little attention typically is paid to the way self-report measures are translated for use in self-informant agreement studies. We studied two possible methods for creating informant measures: (a) the traditional method in which self-report items were translated from the first- to the third-person and (b) an alternative meta-perceptual method in which informants were directed to rate their perception of the targets’ self-perception. We hypothesized that the latter method would yield stronger self-informant agreement for evaluative personality dimensions measured by indirect item markers. We studied these methods in a sample of 303 undergraduate friendship dyads. Results revealed mean-level differences between methods, similar self-informant agreement across methods, stronger agreement for Big Five dimensions than for evaluative dimensions, and incremental validity for meta-perceptual informant rating methods. Limited power reduced the interpretability of several sparse acquaintanceship effects. We conclude that traditional informant methods are appropriate for most personality traits, but meta-perceptual methods may be more appropriate when personality questionnaire items reflect indirect indicators of the trait being measured, which is particularly likely for evaluative traits. PMID:21541262
Moore, Lynne; Turgeon, Alexis F; Sirois, Marie-Josée; Murat, Valérie; Lavoie, André
2011-09-01
Trauma center performance evaluations generally include adjustment for injury severity, age, and comorbidity. However, disparities across trauma centers may be due to other differences in source populations that are not accounted for, such as socioeconomic status (SES). We aimed to evaluate whether SES influences trauma center performance evaluations in an inclusive trauma system with universal access to health care. The study was based on data collected between 1999 and 2006 in a Canadian trauma system. Patient SES was quantified using an ecologic index of social and material deprivation. Performance evaluations were based on mortality adjusted using the Trauma Risk Adjustment Model. Agreement between performance results with and without additional adjustment for SES was evaluated with correlation coefficients. The study sample comprised a total of 71,784 patients from 48 trauma centers, including 3,828 deaths within 30 days (4.5%) and 5,549 deaths within 6 months (7.7%). The proportion of patients in the highest quintile of social and material deprivation varied from 3% to 43% and from 11% to 90% across hospitals, respectively. The correlation between performance results with or without adjustment for SES was almost perfect (r = 0.997; 95% CI 0.995-0.998) and the same hospital outliers were identified. We observed an important variation in SES across trauma centers but no change in risk-adjusted mortality estimates when SES was added to adjustment models. Results suggest that after adjustment for injury severity, age, comorbidity, and transfer status, disparities in SES across trauma center source populations do not influence trauma center performance evaluations in a system offering universal health coverage. Copyright © 2011 American College of Surgeons. Published by Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Wu, Qiusheng; Liu, Hongxing; Wang, Lei; Deng, Chengbin
2016-03-01
High quality soil moisture datasets are required for various environmental applications. The launch of the Advanced Microwave Scanning Radiometer 2 (AMSR2) on board the Global Change Observation Mission 1-Water (GCOM-W1) in May 2012 has provided global near-surface soil moisture data, with an average revisit frequency of two days. Since AMSR2 is a new passive microwave system in operation, it is very important to evaluate the quality of AMSR2 products before widespread utilization of the data for scientific research. In this paper, we provide a comprehensive evaluation of the AMSR2 soil moisture products retrieved by the Japan Aerospace Exploration Agency (JAXA) algorithm. The evaluation was performed for a three-year period (July 2012-June 2015) over the contiguous United States. The AMSR2 soil moisture products were evaluated by comparing ascending and descending overpass products to each other as well as comparing them to in situ soil moisture observations of 598 monitoring stations obtained from the International Soil Moisture Network (ISMN). The accuracy of AMSR2 soil moisture product was evaluated against several types of monitoring networks, and for different land cover types and ecoregions. Three performance metrics, including mean difference (MD), root mean squared difference (RMSD), and correlation coefficient (R), were used in our accuracy assessment. Our evaluation results revealed that AMSR2 soil moisture retrievals are generally lower than in situ measurements. The AMSR2 soil moisture retrievals showed the best agreement with in situ measurements over the Great Plains and the worst agreement over forested areas. This study offers insights into the suitability and reliability of AMSR2 soil moisture products for different ecoregions. Although AMSR2 soil moisture retrievals represent useful and effective measurements for some regions, further studies are required to improve the data accuracy.
Ammitzbøll-Danielsen, Mads; Østergaard, Mikkel; Naredo, Esperanza; Terslev, Lene
2016-12-01
The aim was to evaluate the metric properties of the semi-quantitative OMERACT US scoring system vs a novel quantitative US scoring system for tenosynovitis, by testing its intra- and inter-reader reliability, sensitivity to change and comparison with clinical tenosynovitis scoring in a 6-month follow-up study. US and clinical assessments of the tendon sheaths of the clinically most affected hand and foot were performed at baseline, 3 and 6 months in 51 patients with RA. Tenosynovitis was assessed using the semi-quantitative scoring system (0-3) proposed by the OMERACT US group and a new quantitative US evaluation (0-100). A sum for US grey scale (GS), colour Doppler (CD) and pixel index (PI), respectively, was calculated for each patient. In 20 patients, intra- and inter-observer agreement was established between two independent investigators. A binary clinical tenosynovitis score was performed, calculating a sum score per patient. The intra- and inter-observer agreements for US tenosynovitis assessments were very good at baseline and for change for GS and CD, but less good for PI. The smallest detectable change was 0.97 for GS, 0.93 for CD and 30.1 for PI. The sensitivity to change from month 0 to 6 was high for GS and CD, and slightly higher than for clinical tenosynovitis score and PI. This study demonstrated an excellent intra- and inter-reader agreement between two investigators for the OMERACT US scoring system for tenosynovitis and a high ability to detect changes over time. Quantitative assessment by PI did not add further information. © The Author 2016. Published by Oxford University Press on behalf of the British Society for Rheumatology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
San Francisco urban partnership agreement : national evaluation report.
DOT National Transportation Integrated Search
2014-11-01
This document presents the final report on the national evaluation of the San Francisco Urban Partnership Agreement (UPA) under the United States Department of Transportation (U.S. DOT) UPA Program. The UPA program targeted congestion reduction throu...
Urban Partnership Agreement and Congestion Reduction Demonstration : National Evaluation Framework
DOT National Transportation Integrated Search
2008-11-21
This report provides an analytical framework for evaluating six deployments under the United States Department of Transportation (U.S. DOT) Urban Partnership Agreement (UPA) and Congestion Reduction Demonstration (CRD) Programs. The six UPA/CRD sites...
Measuring symptoms and functioning of youth with ADHD in middle schools.
Evans, Steven W; Allen, Jessica; Moore, Sheryle; Strauss, Victoria
2005-12-01
The identification of reliable and valid means for evaluating the effectiveness of school-based treatments and completing diagnostic evaluations of middle school aged students are needed. The present study examined the inter-rater agreement of teacher ratings and the relationship between ratings and observational data in a middle school setting. The data are interpreted in the context of differences between a secondary and elementary school setting. Teacher ratings and observational data were collected regularly over the course of two academic years for middle school students diagnosed with ADHD. The results indicate low rates of inter-rater agreement as well as low rates of agreement between teachers and observational data, and between observational data collected in different classrooms. Inter-rater agreement was lowest in late fall and gradually increased over the second half of the year. Implications for conducting treatment outcome evaluations of school-based treatment programs and diagnostic evaluations are discussed.
Questel, E; Durbise, E; Bardy, A-L; Schmitt, A-M; Josse, G
2015-05-01
To assess an objective method evaluating the effects of a retinaldehyde-based cream (RA-cream) on solar lentigines; 29 women randomly applied RA-cream on lentigines of one hand and a control cream on the other, once daily for 3 months. A specific method enabling a reliable visualisation of the lesions was proposed, using high-magnification colour-calibrated camera imaging. Assessment was performed using clinical evaluation by Physician Global Assessment score and image analysis. Luminance determination on the numeric images was performed either on the basis of 5 independent expert's consensus borders or probability map analysis via an algorithm automatically detecting the pigmented area. Both image analysis methods showed a similar lightening of ΔL* = 2 after a 3-month treatment by RA-cream, in agreement with single-blind clinical evaluation. High-magnification colour-calibrated camera imaging combined with probability map analysis is a fast and precise method to follow lentigo depigmentation. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Frame-of-Reference Training: Establishing Reliable Assessment of Teaching Effectiveness.
Newman, Lori R; Brodsky, Dara; Jones, Richard N; Schwartzstein, Richard M; Atkins, Katharyn Meredith; Roberts, David H
2016-01-01
Frame-of-reference (FOR) training has been used successfully to teach faculty how to produce accurate and reliable workplace-based ratings when assessing a performance. We engaged 21 Harvard Medical School faculty members in our pilot and implementation studies to determine the effectiveness of using FOR training to assess health professionals' teaching performances. All faculty were novices at rating their peers' teaching effectiveness. Before FOR training, we asked participants to evaluate a recorded lecture using a criterion-based peer assessment of medical lecturing instrument. At the start of training, we discussed the instrument and emphasized its precise behavioral standards. During training, participants practiced rating lectures and received immediate feedback on how well they categorized and scored performances as compared with expert-derived scores of the same lectures. At the conclusion of the training, we asked participants to rate a post-training recorded lecture to determine agreement with the experts' scores. Participants and experts had greater rating agreement for the post-training lecture compared with the pretraining lecture. Through this investigation, we determined that FOR training is a feasible method to teach faculty how to accurately and reliably assess medical lectures. Medical school instructors and continuing education presenters should have the opportunity to be observed and receive feedback from trained peer observers. Our results show that it is possible to use FOR rater training to teach peer observers how to accurately rate medical lectures. The process is time efficient and offers the prospect for assessment and feedback beyond traditional learner evaluation of instruction.
Detection of Apical Root Cracks Using Spectral Domain and Swept-source Optical Coherence Tomography.
de Oliveira, Bruna Paloma; Câmara, Andréa Cruz; Duarte, Daniel Amancio; Gomes, Anderson Stevens Leonidas; Heck, Richard John; Antonino, Antonio Celso Dantas; Aguiar, Carlos Menezes
2017-07-01
This study aimed to evaluate the ability of 2 optical coherence tomographic (OCT) systems to detect apical dentinal microcracks. Twenty extracted human single-rooted mandibular incisors were selected. After root canal preparation with an R40 Reciproc file (VDW, Munich, Germany), the specimens presenting apical microcracks were identified using micro-computed tomographic (micro-CT) scanning as the gold standard. Then, the apical portions of the roots were imaged with spectral domain OCT (SD-OCT) and swept-source OCT systems, and the resulting images were blindly evaluated by 3 independent examiners to detect microcracks. The diagnostic performance of each OCT device was calculated, and statistical analysis was performed. Based on the micro-CT images, 12 (60%) roots presented dentinal microcracks in the apical region. The images generated by the OCT systems were able to show microcrack lines at the same location as the corresponding micro-CT cross sections. Although the diagnostic performance of the SD-OCT device was superior, there were no statistically significant differences between the 2 OCT devices (P > .05). Interexaminer agreement was substantial to almost perfect for the SD-OCT system and moderate to almost perfect for the swept-source OCT system, whereas intraexaminer agreement was substantial to almost perfect for both OCT devices. The detection ability verified for both OCT systems renders them promising tools for the diagnosis of apical microcracks. Copyright © 2017 American Association of Endodontists. Published by Elsevier Inc. All rights reserved.
Brachial artery stiffness estimation using ARTSENS.
Kiran, V Raj; Nabeel, P M; Joseph, Jayaraj; Sivaprakasam, Mohanasankar
2017-07-01
Central and peripheral arteries stiffening prominently affect hemodynamics thus increasing the risk of coronary heart disease, chronic kidney disease and end stage renal disease. There are several commercially available non-invasive measurement technologies for the evaluation of stiffness that are expensive, demand dedicated expertise and fall short for mass screening. Considering this, we have developed ARTSENS ® , a highly compact and portable image-free ultrasound device for evaluation of arterial stiffness. The capability of the device to perform accurate measurements of carotid artery stiffness has been validated through extensive in-vivo studies. In this paper we demonstrate the feasibility of using ARTSENS ® for measuring brachial artery stiffness. An inter-operator repeatability study was done based on in-vivo experiments on 9 young healthy subjects. The study included measurement of distension, end diastolic lumen diameter, arterial compliance and stiffness index performed both on carotid artery and brachial artery by two operators successively. The degree of agreement between the measurements made by operators has been investigated based on Bland-Altman plots and paired t-test. The measurements were populated within the limits of agreement. No statistically significant difference (p-values from paired t-test for end-diastolic diameter, distension, stiffness index, arterial compliance were 0.36, 0.24, 0.47 and 0.11 respectively) was seen for the brachial artery measurements performed by the two operators. The correlation between the measurement made by the operators was highly significant (r=0.86, p-value=0.003).
Aroussi, Abdelkrim; Vignoles, Philippe; Dalmay, François; Wimel, Laurence; Dardé, Marie-Laure; Mercier, Aurélien; Ajzenberg, Daniel
2015-01-01
In France, some cases of severe toxoplasmosis have been linked to the consumption of horse meat that had been imported from the American continent where atypical strains of Toxoplasma gondii are more common than in Europe. Many seroprevalence studies are presented in the literature but risk assessment of T. gondii infection after horse meat consumption is not possible in the absence of validated serological tests and the unknown correlation between detection of antibodies against T. gondii and presence of tissue cysts. We performed magnetic-capture polymerase chain reaction (MC-PCR) to detect T. gondii DNA in 231 horse meat samples purchased in supermarkets in France and evaluated the performance and level of agreement of the modified agglutination test (MAT) and enzyme-linked immunosorbent assay (ELISA) in the meat juices. The serological tests lacked sensitivity, specificity, and agreement between them, and there was no correlation with the presence of T. gondii DNA in horse meat, raising concerns about the reliability of T. gondii seroprevalence data in horses from the literature. T. gondii DNA was detected in 43% of horse meat samples but the absence of strain isolation in mice following inoculation of more than 100 horse meat samples suggests a low distribution of cysts in skeletal muscles and a low risk of T. gondii infection associated with horse meat consumption. However, to avoid any risk of toxoplasmosis, thorough cooking of horse meat is recommended. © A. Aroussi et al., published by EDP Sciences, 2015.
Aroussi, Abdelkrim; Vignoles, Philippe; Dalmay, François; Wimel, Laurence; Dardé, Marie-Laure; Mercier, Aurélien; Ajzenberg, Daniel
2015-01-01
In France, some cases of severe toxoplasmosis have been linked to the consumption of horse meat that had been imported from the American continent where atypical strains of Toxoplasma gondii are more common than in Europe. Many seroprevalence studies are presented in the literature but risk assessment of T. gondii infection after horse meat consumption is not possible in the absence of validated serological tests and the unknown correlation between detection of antibodies against T. gondii and presence of tissue cysts. We performed magnetic-capture polymerase chain reaction (MC-PCR) to detect T. gondii DNA in 231 horse meat samples purchased in supermarkets in France and evaluated the performance and level of agreement of the modified agglutination test (MAT) and enzyme-linked immunosorbent assay (ELISA) in the meat juices. The serological tests lacked sensitivity, specificity, and agreement between them, and there was no correlation with the presence of T. gondii DNA in horse meat, raising concerns about the reliability of T. gondii seroprevalence data in horses from the literature. T. gondii DNA was detected in 43% of horse meat samples but the absence of strain isolation in mice following inoculation of more than 100 horse meat samples suggests a low distribution of cysts in skeletal muscles and a low risk of T. gondii infection associated with horse meat consumption. However, to avoid any risk of toxoplasmosis, thorough cooking of horse meat is recommended. PMID:25809058
Training less-experienced faculty improves reliability of skills assessment in cardiac surgery.
Lou, Xiaoying; Lee, Richard; Feins, Richard H; Enter, Daniel; Hicks, George L; Verrier, Edward D; Fann, James I
2014-12-01
Previous work has demonstrated high inter-rater reliability in the objective assessment of simulated anastomoses among experienced educators. We evaluated the inter-rater reliability of less-experienced educators and the impact of focused training with a video-embedded coronary anastomosis assessment tool. Nine less-experienced cardiothoracic surgery faculty members from different institutions evaluated 2 videos of simulated coronary anastomoses (1 by a medical student and 1 by a resident) at the Thoracic Surgery Directors Association Boot Camp. They then underwent a 30-minute training session using an assessment tool with embedded videos to anchor rating scores for 10 components of coronary artery anastomosis. Afterward, they evaluated 2 videos of a different student and resident performing the task. Components were scored on a 1 to 5 Likert scale, yielding an average composite score. Inter-rater reliabilities of component and composite scores were assessed using intraclass correlation coefficients (ICCs) and overall pass/fail ratings with kappa. All components of the assessment tool exhibited improvement in reliability, with 4 (bite, needle holder use, needle angles, and hand mechanics) improving the most from poor (ICC range, 0.09-0.48) to strong (ICC range, 0.80-0.90) agreement. After training, inter-rater reliabilities for composite scores improved from moderate (ICC, 0.76) to strong (ICC, 0.90) agreement, and for overall pass/fail ratings, from poor (kappa = 0.20) to moderate (kappa = 0.78) agreement. Focused, video-based anchor training facilitates greater inter-rater reliability in the objective assessment of simulated coronary anastomoses. Among raters with less teaching experience, such training may be needed before objective evaluation of technical skills. Published by Elsevier Inc.
Tins, B; Cassar-Pullicino, V; Haddaway, M; Nachtrab, U
2012-01-01
Objectives The bulk of spinal imaging is still performed with conventional two-dimensional sequences. This study assesses the suitability of three-dimensional sampling perfection with application-optimised contrasts using a different flip angle evolutions (SPACE) sequence for routine spinal imaging. Methods 62 MRI examinations of the spine were evaluated by 2 examiners in consensus for the depiction of anatomy and presence of artefact. We noted pathologies that might be missed using the SPACE sequence only or the SPACE and a sagittal T1 weighted sequence. The reference standards were sagittal and axial T1 weighted and T2 weighted sequences. At a later date the evaluation was repeated by one of the original examiners and an additional examiner. Results There was good agreement of the single evaluations and consensus evaluation for the conventional sequences: κ>0.8, confidence interval (CI)>0.6–1.0. For the SPACE sequence, depiction of anatomy was very good for 84% of cases, with high interobserver agreement, but there was poor interobserver agreement for other cases. For artefact assessment of SPACE, κ=0.92, CI=0.92–1.0. The SPACE sequence was superior to conventional sequences for depiction of anatomy and artefact resistance. The SPACE sequence occasionally missed bone marrow oedema. In conjunction with sagittal T1 weighted sequences, no abnormality was missed. The isotropic SPACE sequence was superior to conventional sequences in imaging difficult anatomy such as in scoliosis and spondylolysis. Conclusion The SPACE sequence allows excellent assessment of anatomy owing to high spatial resolution and resistance to artefact. The sensitivity for bone marrow abnormalities is limited. PMID:22374284
Tins, B; Cassar-Pullicino, V; Haddaway, M; Nachtrab, U
2012-08-01
The bulk of spinal imaging is still performed with conventional two-dimensional sequences. This study assesses the suitability of three-dimensional sampling perfection with application-optimised contrasts using a different flip angle evolutions (SPACE) sequence for routine spinal imaging. 62 MRI examinations of the spine were evaluated by 2 examiners in consensus for the depiction of anatomy and presence of artefact. We noted pathologies that might be missed using the SPACE sequence only or the SPACE and a sagittal T(1) weighted sequence. The reference standards were sagittal and axial T(1) weighted and T(2) weighted sequences. At a later date the evaluation was repeated by one of the original examiners and an additional examiner. There was good agreement of the single evaluations and consensus evaluation for the conventional sequences: κ>0.8, confidence interval (CI)>0.6-1.0. For the SPACE sequence, depiction of anatomy was very good for 84% of cases, with high interobserver agreement, but there was poor interobserver agreement for other cases. For artefact assessment of SPACE, κ=0.92, CI=0.92-1.0. The SPACE sequence was superior to conventional sequences for depiction of anatomy and artefact resistance. The SPACE sequence occasionally missed bone marrow oedema. In conjunction with sagittal T(1) weighted sequences, no abnormality was missed. The isotropic SPACE sequence was superior to conventional sequences in imaging difficult anatomy such as in scoliosis and spondylolysis. The SPACE sequence allows excellent assessment of anatomy owing to high spatial resolution and resistance to artefact. The sensitivity for bone marrow abnormalities is limited.
Teletoxicology: Patient Assessment Using Wearable Audiovisual Streaming Technology.
Skolnik, Aaron B; Chai, Peter R; Dameff, Christian; Gerkin, Richard; Monas, Jessica; Padilla-Jones, Angela; Curry, Steven
2016-12-01
Audiovisual streaming technologies allow detailed remote patient assessment and have been suggested to change management and enhance triage. The advent of wearable, head-mounted devices (HMDs) permits advanced teletoxicology at a relatively low cost. A previously published pilot study supports the feasibility of using the HMD Google Glass® (Google Inc.; Mountain View, CA) for teletoxicology consultation. This study examines the reliability, accuracy, and precision of the poisoned patient assessment when performed remotely via Google Glass®. A prospective observational cohort study was performed on 50 patients admitted to a tertiary care center inpatient toxicology service. Toxicology fellows wore Google Glass® and transmitted secure, real-time video and audio of the initial physical examination to a remote investigator not involved in the subject's care. High-resolution still photos of electrocardiograms (ECGs) were transmitted to the remote investigator. On-site and remote investigators recorded physical examination findings and ECG interpretation. Both investigators completed a brief survey about the acceptability and reliability of the streaming technology for each encounter. Kappa scores and simple agreement were calculated for each examination finding and electrocardiogram parameter. Reliability scores and reliability difference were calculated and compared for each encounter. Data were available for analysis of 17 categories of examination and ECG findings. Simple agreement between on-site and remote investigators ranged from 68 to 100 % (median = 94 %, IQR = 10.5). Kappa scores could be calculated for 11/17 parameters and demonstrated slight to fair agreement for two parameters and moderate to almost perfect agreement for nine parameters (median = 0.653; substantial agreement). The lowest Kappa scores were for pupil size and response to light. On a 100-mm visual analog scale (VAS), mean comfort level was 93 and mean reliability rating was 89 for on-site investigators. For remote users, the mean comfort and reliability ratings were 99 and 86, respectively. The average difference in reliability scores between on-site and remote investigators was 2.6, with the difference increasing as reliability scores decreased. Remote evaluation of poisoned patients via Google Glass® is possible with a high degree of agreement on examination findings and ECG interpretation. Evaluation of pupil size and response to light is limited, likely by the quality of streaming video. Users of Google Glass® for teletoxicology reported high levels of comfort with the technology and found it reliable, though as reported reliability decreased, remote users were most affected. Further study should compare patient-centered outcomes when using HMDs for consultation to those resulting from telephone consultation.
Shawar, R; Paetznick, V; Witte, Z; Ensign, L G; Anaissie, E; LaRocco, M
1992-01-01
A study was performed in two laboratories to evaluate the effect of growth medium and test methodology on inter- and intralaboratory variations in the MICs of amphotericin B (AMB), flucytosine (5FC), fluconazole (FLU), itraconazole (ITRA), and the triazole Sch 39304 (SCH) against 14 isolates of Candida albicans. Testing was performed by broth microdilution and semisolid agar dilution with the following media, buffered to pH 7.0 with morpholinepropanesulfonic acid (MOPS): buffered yeast nitrogen base (BYNB), Eagle's minimal essential medium (EMEM), RPMI 1640 medium (RPMI), and synthetic amino acid medium for fungi (SAAMF). Inocula were standardized spectrophotometrically, and endpoints were defined by the complete absence of growth for AMB and by no more than 25% of the growth in the drug-free control for all other agents. Comparative analyses of median MICs, as determined by each test method, were made for all drug-medium combinations. Both methods yielded similar (+/- 1 twofold dilution) median MICs for AMB in EMEM and RPMI, 5FC in all media, and FLU in EMEM, RPMI, and SAAMF. In contrast, substantial between-method variations in median MICs were seen for AMB in BYNB and SAAMF, FLU In BYNB, and ITRA and SCH in all media. Interlaboratory concordance of median MICs was good for AMB, 5FC, and FLU but poor for ITRA and SCH in all media. Endpoint determinations were analyzed by use of kappa statistical analyses for evaluating the strength of observer agreement. Moderate to almost perfect interlaboratory agreement occurred with AMB and 5FC in all media and with FLU in EMEM, RPMI, and SAAMF, irrespective of the test method. Slight to almost perfect interlaboratory agreement occurred with ITRA and SCH in EMEM, RPMI, and SAAMF when tested by semisolid agar dilution but not broth microdilution. Kappa values assessing intralaboratory agreement between methods were high for 5FC in all media, for AMB in BYNB, ENEM, and RPMI, and for FLU in EMEM, RPMI, and SAAMF. One laboratory, but not the other, reported substantial to almost perfect agreement between methods for ITRA, and SCH in EMEM, RPMI, and SAAMF. Both laboratories reported poor agreement between methods for the azoles in BYNB. Discrepancies noted in azole-BYNB combinations were largely due to the greater inhibitory effect of these agents in BYNB than in other media. These results indicate that the semisolid agar dilution and broth microdilution methods with EMEM or RPMI yield equivalent and reproducible MICs for AMB, 5FC, and FLU but not ITRA and SCH. PMID:1500502
Shawar, R; Paetznick, V; Witte, Z; Ensign, L G; Anaissie, E; LaRocco, M
1992-08-01
A study was performed in two laboratories to evaluate the effect of growth medium and test methodology on inter- and intralaboratory variations in the MICs of amphotericin B (AMB), flucytosine (5FC), fluconazole (FLU), itraconazole (ITRA), and the triazole Sch 39304 (SCH) against 14 isolates of Candida albicans. Testing was performed by broth microdilution and semisolid agar dilution with the following media, buffered to pH 7.0 with morpholinepropanesulfonic acid (MOPS): buffered yeast nitrogen base (BYNB), Eagle's minimal essential medium (EMEM), RPMI 1640 medium (RPMI), and synthetic amino acid medium for fungi (SAAMF). Inocula were standardized spectrophotometrically, and endpoints were defined by the complete absence of growth for AMB and by no more than 25% of the growth in the drug-free control for all other agents. Comparative analyses of median MICs, as determined by each test method, were made for all drug-medium combinations. Both methods yielded similar (+/- 1 twofold dilution) median MICs for AMB in EMEM and RPMI, 5FC in all media, and FLU in EMEM, RPMI, and SAAMF. In contrast, substantial between-method variations in median MICs were seen for AMB in BYNB and SAAMF, FLU In BYNB, and ITRA and SCH in all media. Interlaboratory concordance of median MICs was good for AMB, 5FC, and FLU but poor for ITRA and SCH in all media. Endpoint determinations were analyzed by use of kappa statistical analyses for evaluating the strength of observer agreement. Moderate to almost perfect interlaboratory agreement occurred with AMB and 5FC in all media and with FLU in EMEM, RPMI, and SAAMF, irrespective of the test method. Slight to almost perfect interlaboratory agreement occurred with ITRA and SCH in EMEM, RPMI, and SAAMF when tested by semisolid agar dilution but not broth microdilution. Kappa values assessing intralaboratory agreement between methods were high for 5FC in all media, for AMB in BYNB, ENEM, and RPMI, and for FLU in EMEM, RPMI, and SAAMF. One laboratory, but not the other, reported substantial to almost perfect agreement between methods for ITRA, and SCH in EMEM, RPMI, and SAAMF. Both laboratories reported poor agreement between methods for the azoles in BYNB. Discrepancies noted in azole-BYNB combinations were largely due to the greater inhibitory effect of these agents in BYNB than in other media. These results indicate that the semisolid agar dilution and broth microdilution methods with EMEM or RPMI yield equivalent and reproducible MICs for AMB, 5FC, and FLU but not ITRA and SCH.
NASA Astrophysics Data System (ADS)
Koscheev, Vladimir; Manturov, Gennady; Pronyaev, Vladimir; Rozhikhin, Evgeny; Semenov, Mikhail; Tsibulya, Anatoly
2017-09-01
Several k∞ experiments were performed on the KBR critical facility at the Institute of Physics and Power Engineering (IPPE), Obninsk, Russia during the 1970s and 80s for study of neutron absorption properties of Cr, Mn, Fe, Ni, Zr, and Mo. Calculations of these benchmarks with almost any modern evaluated nuclear data libraries demonstrate bad agreement with the experiment. Neutron capture cross sections of the odd isotopes of Cr, Mn, Fe, and Ni in the ROSFOND-2010 library have been reevaluated and another evaluation of the Zr nuclear data has been adopted. Use of the modified nuclear data for Cr, Mn, Fe, Ni, and Zr leads to significant improvement of the C/E ratio for the KBR assemblies. Also a significant improvement in agreement between calculated and evaluated values for benchmarks with Fe reflectors was observed. C/E results obtained with the modified ROSFOND library for complex benchmark models that are highly sensitive to the cross sections of structural materials are no worse than results obtained with other major evaluated data libraries. Possible improvement in results by decreasing the capture cross section for Zr and Mo at the energies above 1 keV is indicated.
Salerno, Sergio; Tudisca, Chiara; Murmura, Elena; Matranga, Domenica; La Tona, Giuseppe; Lo Re, Giuseppe; Lo Casto, Antonio
2017-05-01
Umbilical Venous Catheter (UVC) are commonly used in neonatal period; they can be not correctly positioned and could be associated with complications. The purpose of this article is to suggest a flow-chart to evaluate the placement of UVC, testing it in young radiologists-in-training. We developed a simple flow-chart to asses, steps by step, UVC placement considering its course and tip location (ideally placed in the atriocaval junction). We tested the flow-chart impact asking to 20 residents to evaluate the placement of 10 UVC before and after they familiarized with the flow-chart and the anatomical findings of a newborn. The agreement among the 20 students was evaluated too. The number of correct characterizations was different due to the administration of the flow-chart. One hundred and six correct UVC assessments at the beginning switched to 196 after the administration of the flow-chart (p = 0.0001). The observed agreement among the twenty radiology residents was statistically significant, both before (kappa = 0.41, p < 0.001) and after (kappa = 0.37, p < 0.001) the flow-chart administration. The developed flow-chart demonstrated to be useful in increasing residents performance in UVC placement assessment.
Royo-Bordonada, Miguel A; Martínez-Huedo, María A
2014-01-01
To evaluate compliance with the self-regulation agreement of the food and drink vending machine sector in primary schools in Madrid, Spain. Cross-sectional study of the prevalence of vending machines in 558 primary schools in 2008. Using the directory of all registered primary schools in Madrid, we identified the presence of machines by telephone interviews and evaluated compliance with the agreement by visiting the schools and assessing accessibility, type of publicity, the products offered and knowledge of the agreement. The prevalence of schools with vending machines was 5.8%. None of the schools reported knowledge of the agreement or of its nutritional guidelines, and most machines were accessible to primary school pupils (79.3%) and packed with high-calorie, low-nutrient-dense foods (58.6%). Compliance with the self-regulation agreement of the vending machines sector was low. Stricter regulation should receive priority in the battle against the obesity epidemic. Copyright © 2013 SESPAS. Published by Elsevier Espana. All rights reserved.
Imputation of adverse drug reactions: Causality assessment in hospitals
Mastroianni, Patricia de Carvalho
2017-01-01
Background & objectives Different algorithms have been developed to standardize the causality assessment of adverse drug reactions (ADR). Although most share common characteristics, the results of the causality assessment are variable depending on the algorithm used. Therefore, using 10 different algorithms, the study aimed to compare inter-rater and multi-rater agreement for ADR causality assessment and identify the most consistent to hospitals. Methods Using ten causality algorithms, four judges independently assessed the first 44 cases of ADRs reported during the first year of implementation of a risk management service in a medium complexity hospital in the state of Sao Paulo (Brazil). Owing to variations in the terminology used for causality, the equivalent imputation terms were grouped into four categories: definite, probable, possible and unlikely. Inter-rater and multi-rater agreement analysis was performed by calculating the Cohen´s and Light´s kappa coefficients, respectively. Results None of the algorithms showed 100% reproducibility in the causal imputation. Fair inter-rater and multi-rater agreement was found. Emanuele (1984) and WHO-UMC (2010) algorithms showed a fair rate of agreement between the judges (k = 0.36). Interpretation & conclusions Although the ADR causality assessment algorithms were poorly reproducible, our data suggest that WHO-UMC algorithm is the most consistent for imputation in hospitals, since it allows evaluating the quality of the report. However, to improve the ability of assessing the causality using algorithms, it is necessary to include criteria for the evaluation of drug-related problems, which may be related to confounding variables that underestimate the causal association. PMID:28166274
Clinical evaluation of the 3M Littmann Electronic Stethoscope Model 3200 in 150 cats.
Blass, Keith A; Schober, Karsten E; Bonagura, John D; Scansen, Brian A; Visser, Lance C; Lu, Jennifer; Smith, Danielle N; Ward, Jessica L
2013-10-01
Detection of murmurs and gallops may help to identify cats with heart disease. However, auscultatory findings may be subject to clinically relevant observer variation. The objective of this study was to evaluate an electronic stethoscope (ES) in cats. We hypothesized that the ES would perform at least as well as a conventional stethoscope (CS) in the detection of abnormal heart sounds. One hundred and fifty consecutive cats undergoing echocardiography were enrolled prospectively. Cats were ausculted with a CS (WA Tycos Harvey Elite) by two observers, and heart sounds were recorded digitally using an ES (3M Littmann Stethoscope Model 3200) for off-line analysis. Echocardiography was used as the clinical standard method for validation of auscultatory findings. Additionally, digital recordings (DRs) were assessed by eight independent observers with various levels of expertise, and compared using interclass correlation and Cohen's weighted kappa analyses. Using the CS, a heart murmur (n = 88 cats) or gallop sound (n = 17) was identified in 105 cats, whereas 45 cats lacked abnormal heart sounds. There was good total agreement (83-90%) between the two observers using the CS. In contrast, there was only moderate agreement (P <0.001) between results from the CS and the DRs for murmurs, and poor agreement for gallops. The CS was more sensitive compared with the DRs with regard to murmurs and gallops. Agreement among the eight observers was good-to-excellent for murmur detection (81%). In conclusion, DRs made with the ES are less sensitive but comparably specific to a CS at detecting abnormal heart sounds in cats.
Augusto, Renan Maloni; Albuquerque, André Luis Pereira; Jaeger, Thomas; de Carvalho, Carlos Roberto Ribeiro; Caruso, Pedro
2017-02-01
The use of esophageal catheters with microtransducer promises advantages over traditional catheters with air-filled balloons. However, performance comparisons between these 2 types of catheters are scarce and incomplete. A catheter with a 9.5-cm air-filled balloon at the distal tip and a catheter with a microtransducer mounted within a flexible silicone rubber were tested in vitro and in vivo. In vitro, the response times of both catheters were compared, and the drift of the baseline pressure of the microtransducer catheter was evaluated over a 6-h period. In vivo, 11 healthy volunteers had both catheters inserted, and the drift of the baseline esophageal pressure was measured over a 3-h period. Also, the correlation and agreement of the baseline and changes in the esophageal pressure of both catheters were evaluated. In vitro, the microtransducer catheter had a response time significantly higher (262 × 114 Hz, P < .01) and a good pressure stability, with a mean baseline pressure drift of 1.4 cm H 2 O. In vivo, both catheters presented a small and similar baseline esophageal pressure drift (P = 0.08). For measurements of baseline and changes in esophageal pressure, the correlation and agreement between the catheters were poor, with a large bias between them. The catheter with the microtransducer had a small baseline pressure drift, similar to the air-filled balloon catheter. The low agreement between the catheters does not allow the microtransducer catheter to be used as a surrogate for the traditional air-filled balloon catheter. Copyright © 2017 by Daedalus Enterprises.
ERIC Educational Resources Information Center
Haselschwerdt, Megan L.; Hardesty, Jennifer L.; Hans, Jason D.
2011-01-01
Approximately, 20% of divorcing couples in the United States require judicial intervention to reach a custody agreement. In such cases, courts often call on child custody evaluators to conduct comprehensive evaluations and recommend custody agreements and services that meet children's best interests. Estimates suggest that allegations of domestic…
DOE Office of Scientific and Technical Information (OSTI.GOV)
Brundage, Aaron L.; Nicolette, Vernon F.; Donaldson, A. Burl
2005-09-01
A joint experimental and computational study was performed to evaluate the capability of the Sandia Fire Code VULCAN to predict thermocouple response temperature. Thermocouple temperatures recorded by an Inconel-sheathed thermocouple inserted into a near-adiabatic flat flame were predicted by companion VULCAN simulations. The predicted thermocouple temperatures were within 6% of the measured values, with the error primarily attributable to uncertainty in Inconel 600 emissivity and axial conduction losses along the length of the thermocouple assembly. Hence, it is recommended that future thermocouple models (for Inconel-sheathed designs) include a correction for axial conduction. Given the remarkable agreement between experiment and simulation,more » it is recommended that the analysis be repeated for thermocouples in flames with pollutants such as soot.« less
Finite-temperature interatomic exchange and magnon softening in Fe overlayers on Ir(001)
NASA Astrophysics Data System (ADS)
Rodrigues, D. C. M.; Szilva, A.; Klautau, A. B.; Bergman, A.; Eriksson, O.; Etz, C.
2016-07-01
We evaluate how thermal effects soften the magnon dispersion in 6 layers of Fe(001) on top of Ir(001). We perform a systematic study considering noncollinear spin arrangement and calculate configuration-dependent exchange parameters Jij n c following the methodology described by Szilva et al. [Phys. Rev. Lett. 111, 127204 (2013)], 10.1103/PhysRevLett.111.127204. In addition, Monte Carlo simulations were performed in order to estimate the noncollinear spin arrangement as a function of temperature. Hence the Jij n c's related to these configurations were calculated and used in an atomistic spin dynamics approach to evaluate the magnon spectra. Our results show good agreement with recent room-temperature measurements, and highlights how thermal effects produce magnon softening in this, and similar, systems.
Estimating learning outcomes from pre- and posttest student self-assessments: a longitudinal study.
Schiekirka, Sarah; Reinhardt, Deborah; Beißbarth, Tim; Anders, Sven; Pukrop, Tobias; Raupach, Tobias
2013-03-01
Learning outcome is an important measure for overall teaching quality and should be addressed by comprehensive evaluation tools. The authors evaluated the validity of a novel evaluation tool based on student self-assessments, which may help identify specific strengths and weaknesses of a particular course. In 2011, the authors asked 145 fourth-year students at Göttingen Medical School to self-assess their knowledge on 33 specific learning objectives in a pretest and posttest as part of a cardiorespiratory module. The authors compared performance gain calculated from self-assessments with performance gain derived from formative examinations that were closely matched to these 33 learning objectives. Eighty-three students (57.2%) completed the assessment. There was good agreement between performance gain derived from subjective data and performance gain derived from objective examinations (Pearson r=0.78; P<.0001) on the group level. The association between the two measures was much weaker when data were analyzed on the individual level. Further analysis determined a quality cutoff for performance gain derived from aggregated student self-assessments. When using this cutoff, the evaluation tool was highly sensitive in identifying specific learning objectives with favorable or suboptimal objective performance gains. The tool is easy to implement, takes initial performance levels into account, and does not require extensive pre-post testing. By providing valid estimates of actual performance gain obtained during a teaching module, it may assist medical teachers in identifying strengths and weaknesses of a particular course on the level of specific learning objectives.
2011-01-01
Introduction The human epidermal growth factor receptor 2 (HER2) is an established biomarker for management of patients with breast cancer. While conventional testing of HER2 protein expression is based on semi-quantitative visual scoring of the immunohistochemistry (IHC) result, efforts to reduce inter-observer variation and to produce continuous estimates of the IHC data are potentiated by digital image analysis technologies. Methods HER2 IHC was performed on the tissue microarrays (TMAs) of 195 patients with an early ductal carcinoma of the breast. Digital images of the IHC slides were obtained by Aperio ScanScope GL Slide Scanner. Membrane connectivity algorithm (HER2-CONNECT™, Visiopharm) was used for digital image analysis (DA). A pathologist evaluated the images on the screen twice (visual evaluations: VE1 and VE2). HER2 fluorescence in situ hybridization (FISH) was performed on the corresponding sections of the TMAs. The agreement between the IHC HER2 scores, obtained by VE1, VE2, and DA was tested for individual TMA spots and patient's maximum TMA spot values (VE1max, VE2max, DAmax). The latter were compared with the FISH data. Correlation of the continuous variable of the membrane connectivity estimate with the FISH data was tested. Results The pathologist intra-observer agreement (VE1 and VE2) on HER2 IHC score was almost perfect: kappa 0.91 (by spot) and 0.88 (by patient). The agreement between visual evaluation and digital image analysis was almost perfect at the spot level (kappa 0.86 and 0.87, with VE1 and VE2 respectively) and at the patient level (kappa 0.80 and 0.86, with VE1max and VE2max, respectively). The DA was more accurate than VE in detection of FISH-positive patients by recruiting 3 or 2 additional FISH-positive patients to the IHC score 2+ category from the IHC 0/1+ category by VE1max or VE2max, respectively. The DA continuous variable of the membrane connectivity correlated with the FISH data (HER2 and CEP17 copy numbers, and HER2/CEP17 ratio). Conclusion HER2 IHC digital image analysis based on membrane connectivity estimate was in almost perfect agreement with the visual evaluation of the pathologist and more accurate in detection of HER2 FISH-positive patients. Most immediate benefit of integrating the DA algorithm into the routine pathology HER2 testing may be obtained by alerting/reassuring pathologists of potentially misinterpreted IHC 0/1+ versus 2+ cases. PMID:21943197
Laurinaviciene, Aida; Dasevicius, Darius; Ostapenko, Valerijus; Jarmalaite, Sonata; Lazutka, Juozas; Laurinavicius, Arvydas
2011-09-23
The human epidermal growth factor receptor 2 (HER2) is an established biomarker for management of patients with breast cancer. While conventional testing of HER2 protein expression is based on semi-quantitative visual scoring of the immunohistochemistry (IHC) result, efforts to reduce inter-observer variation and to produce continuous estimates of the IHC data are potentiated by digital image analysis technologies. HER2 IHC was performed on the tissue microarrays (TMAs) of 195 patients with an early ductal carcinoma of the breast. Digital images of the IHC slides were obtained by Aperio ScanScope GL Slide Scanner. Membrane connectivity algorithm (HER2-CONNECT, Visiopharm) was used for digital image analysis (DA). A pathologist evaluated the images on the screen twice (visual evaluations: VE1 and VE2). HER2 fluorescence in situ hybridization (FISH) was performed on the corresponding sections of the TMAs. The agreement between the IHC HER2 scores, obtained by VE1, VE2, and DA was tested for individual TMA spots and patient's maximum TMA spot values (VE1max, VE2max, DAmax). The latter were compared with the FISH data. Correlation of the continuous variable of the membrane connectivity estimate with the FISH data was tested. The pathologist intra-observer agreement (VE1 and VE2) on HER2 IHC score was almost perfect: kappa 0.91 (by spot) and 0.88 (by patient). The agreement between visual evaluation and digital image analysis was almost perfect at the spot level (kappa 0.86 and 0.87, with VE1 and VE2 respectively) and at the patient level (kappa 0.80 and 0.86, with VE1max and VE2max, respectively). The DA was more accurate than VE in detection of FISH-positive patients by recruiting 3 or 2 additional FISH-positive patients to the IHC score 2+ category from the IHC 0/1+ category by VE1max or VE2max, respectively. The DA continuous variable of the membrane connectivity correlated with the FISH data (HER2 and CEP17 copy numbers, and HER2/CEP17 ratio). HER2 IHC digital image analysis based on membrane connectivity estimate was in almost perfect agreement with the visual evaluation of the pathologist and more accurate in detection of HER2 FISH-positive patients. Most immediate benefit of integrating the DA algorithm into the routine pathology HER2 testing may be obtained by alerting/reassuring pathologists of potentially misinterpreted IHC 0/1+ versus 2+ cases.
Ros, C; Martínez-Franco, E; Wozniak, M M; Cassado, J; Santoro, G A; Elías, N; López, M; Palacio, M; Wieczorek, A P; Espuña-Pons, M
2017-04-01
To compare the sensitivity and specificity of two- (2D) and three- (3D) dimensional transperineal ultrasound (TPUS) and 3D endovaginal ultrasound (EVUS) with the gold standard 3D endoanal ultrasound (EAUS) in detecting residual defects after primary repair of obstetric anal sphincter injuries (OASIS). External (EAS) and internal (IAS) anal sphincters were evaluated by the four ultrasound modalities in women with repaired OASIS. 2D-TPUS was evaluated in real-time, whereas 3D-TPUS, 3D-EVUS and 3D-EAUS volumes were evaluated offline by six blinded readers. The presence/absence of any tear in EAS or IAS was recorded and defects were scored according to the Starck system. Sensitivity, specificity and predictive values were calculated, using 3D-EAUS as reference standard. Inter- and intraobserver analyses were performed for all 3D imaging modalities. Association between patients' symptoms (Wexner score) and ultrasound findings (Starck score) was calculated. Images from 55 patients were analyzed. Compared with findings on 3D-EAUS, the agreement for EAS evaluation was poor for 3D-EVUS (κ = 0.01), fair for 2D-TPUS (κ = 0.30) and good for 3D-TPUS (κ = 0.73). The agreement for IAS evaluation was moderate for both 3D-EVUS (κ = 0.41) and 2D-TPUS (κ = 0.52) and good for 3D-TPUS (κ = 0.66). Good intraobserver (3D-EAUS, κ = 0.73; 3D-TPUS, κ = 0.78) and interobserver (3D-EAUS, κ = 0.68; 3D-TPUS, κ = 0.60) agreement was reported. Significant association between Starck and Wexner scores was found only for 3D-EAUS (Spearman's rho = 0.277, P = 0.04). 2D-TPUS and 3D-EVUS are not accurate modalities for the assessment of anal sphincters after repair of OASIS. 3D-TPUS shows good agreement with the gold standard 3D-EAUS and a high sensitivity in detecting residual defects. It, thus, has potential as a screening tool after primary repair of OASIS. Copyright © 2016 ISUOG. Published by John Wiley & Sons Ltd. Copyright © 2016 ISUOG. Published by John Wiley & Sons Ltd.
Kaji, Amy H; Langford, Vinette; Lewis, Roger J
2008-09-01
There is currently no validated method for assessing hospital disaster preparedness. We determine the degree of correlation between the results of 3 methods for assessing hospital disaster preparedness: administration of an on-site survey, drill observation using a structured evaluation tool, and video analysis of team performance in the hospital incident command center. This was a prospective, observational study conducted during a regional disaster drill, comparing the results from an on-site survey, a structured disaster drill evaluation tool, and a video analysis of teamwork, performed at 6 911-receiving hospitals in Los Angeles County, CA. The on-site survey was conducted separately from the drill and assessed hospital disaster plan structure, vendor agreements, modes of communication, medical and surgical supplies, involvement of law enforcement, mutual aid agreements with other facilities, drills and training, surge capacity, decontamination capability, and pharmaceutical stockpiles. The drill evaluation tool, developed by Johns Hopkins University under contract from the Agency for Healthcare Research and Quality, was used to assess various aspects of drill performance, such as the availability of the hospital disaster plan, the geographic configuration of the incident command center, whether drill participants were identifiable, whether the noise level interfered with effective communication, and how often key information (eg, number of available staffed floor, intensive care, and isolation beds; number of arriving victims; expected triage level of victims; number of potential discharges) was received by the incident command center. Teamwork behaviors in the incident command center were quantitatively assessed, using the MedTeams analysis of the video recordings obtained during the disaster drill. Spearman rank correlations of the results between pair-wise groupings of the 3 assessment methods were calculated. The 3 evaluation methods demonstrated qualitatively different results with respect to each hospital's level of disaster preparedness. The Spearman rank correlation coefficient between the results of the on-site survey and the video analysis of teamwork was -0.34; between the results of the on-site survey and the structured drill evaluation tool, 0.15; and between the results of the video analysis and the drill evaluation tool, 0.82. The disparate results obtained from the 3 methods suggest that each measures distinct aspects of disaster preparedness, and perhaps no single method adequately characterizes overall hospital preparedness.
Dankerl, Peter; Hammon, Matthias; Seuss, Hannes; Tröbs, Monique; Schuhbaeck, Annika; Hell, Michaela M; Cavallaro, Alexander; Achenbach, Stephan; Uder, Michael; Marwan, Mohamed
2017-05-01
To evaluate the performance of computer-aided evaluation software for a comprehensive workup of patients prior to transcatheter aortic valve implantation (TAVI) using low-contrast agent and low radiation dose third-generation dual-source CT angiography. We evaluated 30 consecutive patients scheduled for TAVI. All patients underwent ECG-triggered high-pitch dual-source CT angiography of the aortic root and aorta with a standardized contrast agent volume (30 ml Imeron350, flow rate 4 ml/s) and low-dose (100 kv/350 mAs) protocol. An expert (10 years of experience) manually evaluated aortic root and iliac access dimensions (distance between coronary ostia and aortic annulus, minimal/maximal diameters and area-derived diameter of the aortic annulus) and best CT-predicted fluoroscopic projection angle as the reference standard. Utilizing computer-aided software (syngo.via), the same pre-TAVI workup was performed and compared to the reference standard. Mean CTDI[Formula: see text] was 3.46 mGy and mean DLP 217.6 ± 12.1 mGy cm, corresponding to a mean effective dose of 3.7 ± 0.2 mSv. Computer-aided evaluation was successful in all but one patient. Compared to the reference standard, Bland-Altman analysis indicated very good agreement for the distances between aortic annulus and coronary ostia (RCA: mean difference 0.8 mm; 95 % CI 0.4-1.2 mm; LM: mean difference 0.9 mm; 95 % CI 0.5-1.3 mm); however, we demonstrated a systematic overestimation of annulus- derived diameter using the software (mean difference 44.4 mm[Formula: see text]; 95 % CI 30.4-58.3 mm[Formula: see text]). Based on respective annulus dimensions, the recommended prosthesis size (Edwards SAPIEN 3) matched in 26 out of the 29 patients (90 %). CT-derived fluoroscopic projection angles showed an excellent agreement for both methods. Out of 58 iliac arteries, 15 (25 %) arteries could not be segmented by the software. Preprocessing time of the software was 71 ± 11 s (range 51-96 s), and reading time with the software was 118 ± 31 s (range 68-201 s). In the workup of pre-TAVI CT angiography, computer-aided evaluation of low-contrast, low-dose examinations is feasible with good agreement and quick reading time. However, a systematic overestimation of the aortic annulus area is observed.
Morrow, A M; Hayen, A; Quine, S; Scheinberg, A; Craig, J C
2012-03-01
Health-related quality of life is an important outcome. Self-report is the gold standard, but in the paediatric setting we often rely on proxy reporting. Our understanding of the differences between self- and proxy reports and the factors that influence them is limited. These differences can impact on treatment choices and the patient-doctor relationship. To evaluate differences between children's, parents' and doctors' perceptions of health states and health-related quality of life in children with chronic illness and explore factors which explain these differences. Consecutive families attending eligible clinics at a tertiary paediatric centre were invited to complete the Health Utilities Index (HUI) 23 questionnaire. Percentage agreement and kappas were calculated as a measure of the agreement between pairs. Chi-squared tests or Fisher's exact test, if appropriate, were performed to determine if there was an association between level of agreement and participant variables. Data were collected for 130 parent-doctor pairs, 59 child-parent pairs and 59 child-doctor pairs. Overall health-related quality of life scores did not differ between responders, but there was poorer agreement for subjective domains. Doctor-child agreement was lower than parent-child agreement. Children with a diagnosis of cerebral palsy or chronic neurological condition were more likely to have lower inter-rater agreement for both subjective and objective domains. On the HUI2, agreement was lower for parent-child pairs when the father was the respondent. For child-doctor pairs, an increased frequency of patient-doctor visits and doctors' seniority were predictors of poorer agreement on the HUI3 and HUI2 respectively. We identified factors associated with level of agreement for self- and proxy reporting on the HUI23. Parent-child agreement was higher than doctor-child agreement. Patients with significant pain or emotional distress and patients with a diagnosis of severe cerebral palsy or chronic neurological conditions were more susceptible to under-reporting of subjective aspects of well-being by doctors and parents and may benefit from formal assessment of health-related quality of life in the clinical setting. © 2011 Blackwell Publishing Ltd.
Telehealth Stroke Dysphagia Evaluation Is Safe and Effective.
Morrell, Kate; Hyers, Megan; Stuchiner, Tamela; Lucas, Lindsay; Schwartz, Karissa; Mako, Jenniffer; Spinelli, Kateri J; Yanase, Lisa
2017-01-01
Rapid evaluation of dysphagia poststroke significantly lowers rates of aspiration pneumonia. Logistical barriers often significantly delay in-person dysphagia evaluation by speech language pathologists (SLPs) in remote and rural hospitals. Clinical swallow evaluations delivered via telehealth have been validated in a number of clinical contexts, yet no one has specifically validated a teleswallow evaluation for in-hospital post-stroke dysphagia assessment. A team of 6 SLPs experienced in stroke care and a telestroke neurologist designed, implemented, and tested a teleswallow evaluation for acute stroke patients, in which 100 patients across 2 affiliated, urban certified stroke centers were sequentially evaluated by a bedside and telehealth SLP. Inter-rater reliability was analyzed using percent agreement, Cohen's kappa, Kendall's tau-b, and Wilcoxon matched-pairs signed rank tests. Logistic regression models accounting for age and gender were used to test the impact of stroke severity and stroke location on agreement. We found excellent agreement for both liquid (91% agreement; kappa = 0.808; Kendall's tau-b = 0.813, p < 0.001; Wilcoxon signed rank = -0.818, p = 0.417) and solid (87% agreement; kappa = 0.792; Kendall's tau-b = 0.844, p < 0.001; Wilcoxon signed rank = 0.243, p = 0.808) dietary textures. From regression modeling, there is suggestive but inconclusive evidence that higher National Institute of Health Stroke Scale (NIHSS) scores correlate with lower levels of agreement for liquid diet recommendations (OR [95% CI] 0.895 [0.793-1.01]; p = 0.07). There was no impact of NIHSS score for solid diet recommendations and no impact of stroke location on solid or liquid diet recommendations. Qualitatively, we identified professional, logistical, technical, and patient barriers to implementation, many of which resolved with experience over time. Dysphagia evaluation by a remote SLP via telehealth is safe and effective following stroke. We plan to implement teleswallow across our multistate telestroke network as standard practice for poststroke dysphagia evaluation. © 2017 S. Karger AG, Basel.
ERIC Educational Resources Information Center
Mudford, Oliver C.; Hogg, James; Roberts, Jessica
1997-01-01
Continuous observational recording over 57 hours evaluated behavior states of three adults with profound and multiple disabilities. Two independent observers also recorded for 22 hours. Although overall percentage agreement was satisfactory (above 80%), agreement on occurrence was unsatisfactory (mean of 65%). Agreement data were superimposed on…
Regadas, F Sergio P; Haas, Eric M; Abbas, Maher A; Marcio Jorge, J; Habr-Gama, Angelita; Sands, Dana; Wexner, Steven D; Melo-Amaral, Ingrid; Sardiñas, Carlos; Lima, Doryane M; Sagae, Univaldo E; Sagae, Evaldo U; Murad-Regadas, Sthela M
2011-06-01
Defecography is the gold standard for assessing functional anorectal disorders but is limited by the need for a specific radiologic environment, exposure of patients to radiation, and inability to show all anatomic structures involved in defecation. Echodefecography is a 3-dimensional dynamic ultrasound technique developed to overcome these limitations. This study was designed to validate the effectiveness of echodefecography compared with defecography in the assessment of anorectal dysfunctions related to obstructed defecation. Multicenter, prospective observational study. Women with symptoms of obstructed defecation. Six centers for colorectal surgery (3 in Brazil, 1 in Texas, 1 in Florida, and 1 in Venezuela). Defecography was performed after inserting 150 mL of barium paste in the rectum. Echodefecography was performed with a 2050 endoprobe through 3 automatic scans. The κ statistic was used to assess agreement between echodefecography and defecography in the evaluation of rectocele, intussusception, anismus, and grade III enterocele. Eighty-six women were evaluated: median Wexner constipation score, 13.4 (range, 6-23); median age, 53.4 (range, 26-77) years. Rectocele was identified with substantial agreement between the 2 methods (defecography, 80 patients; echodefecography, 76 patients; κ = 0.61; 95% CI = 0.48-0.73). The 2 techniques demonstrated identical findings in 6 patients without rectocele, and in 9 patients with grade I, 29 with grade II, and 19 patients with grade III rectoceles. Defecography identified rectal intussusception in 42 patients, with echodefecography identifying 37 of these cases, plus 4 additional cases, yielding substantial agreement (κ = 0.79; 95% CI = 0.57-1.0). Intussusception was associated with rectocele in 28 patients for both methods (κ = 0.62; 95% CI = 0.41-0.83). There was substantial agreement for anismus (κ = 0.61; 95% CI = 0.40-0.81) and for rectocele combined with anismus (κ = 0.61; 95% CI = 0.40-0.82). Agreement for grade III enterocele was classified as almost perfect (κ = 0.87; 95% CI = 0.66-1.0). Echodefecography had limited use in identification of grade I and II enteroceles because of the type of probe used. Echodefecography may be used to assess patients with obstructed defecation, as it is able to detect the same anorectal dysfunctions found by defecography. It is minimally invasive and well tolerated, avoids exposure to radiation, and clearly demonstrates all the anatomic structures involved in defecation.
Porfyridis, Ilias; Georgiadis, Georgios; Michael, Michalis; Frangopoulos, Frangiskos; Vogazianos, Paris; Papadopoulos, Alexis; Kara, Panayiota; Charalampous, Charis; Georgiou, Andreas
2016-08-01
Medical thoracoscopy (MT) is useful for the management of pleural disease. Rapid on-site evaluation (ROSE) of transbronchial needle aspirates proved to be useful during bronchoscopy. We aimed to evaluate the diagnostic performance of ROSE of MT biopsy specimens and thoracoscopists' impression of the macroscopic appearance and assess the intermodality agreement between ROSE and final histopathologic diagnosis. Sixty two patients with exudative pleural effusions further investigated with MT were enrolled. MT was performed under local anaesthesia and conscious sedation, using the rigid pleuroscope. ROSE with the Hemacolor rapid staining method of the biopsy specimens was performed. Thoracoscopists' impression of the macroscopic appearance was recorded. The final diagnosis was established following histopathological examination. Thoracoscopic pleural biopsies were diagnosed in 61 patients (98.4%). Group A (n = 25) consisted of patients with malignancy and group B (n = 37) with benign disorders. Area under the curve of ROSE for the diagnosis of malignancy was 0.86 (95% CI: 0.76-0.96, P < 0.001), with a sensitivity of 79.17%, specificity of 94.59%, diagnostic accuracy of 88.5%, positive predictive value of 90.5% and negative predictive value of 87.5%. Intermodality agreement between ROSE and histopathology was good (κ ± SE = 0.615 ± 0.084, P < 0.001). Area under the curve of the thoracoscopists' impression of macroscopic appearance was 0.72 (95% CI: 0.58-0.85, P = 0.001), with a sensitivity of 100%, specificity of 44.7%, positive predictive value of 53.33% and negative predictive value of 100%. Rapid on-site evaluation during MT was found to have high accuracy for predicting malignancy. ROSE can provide the thoracoscopist with an on-site preliminary diagnosis, especially in cases with inconclusive macroscopic appearance. © 2016 Asian Pacific Society of Respirology.
Langarika-Rocafort, Argia; Emparanza, José Ignacio; Aramendi, José F; Castellano, Julen; Calleja-González, Julio
2017-01-01
To examine the intra-observer reliability and agreement between five methods of measurement for dorsiflexion during Weight Bearing Dorsiflexion Lunge Test and to assess the degree of agreement between three methods in female athletes. Repeated measurements study design. Volleyball club. Twenty-five volleyball players. Dorsiflexion was evaluated using five methods: heel-wall distance, first toe-wall distance, inclinometer at tibia, inclinometer at Achilles tendon and the dorsiflexion angle obtained by a simple trigonometric function. For the statistical analysis, agreement was studied using the Bland-Altman method, the Standard Error of Measurement and the Minimum Detectable Change. Reliability analysis was performed using the Intraclass Correlation Coefficient. Measurement methods using the inclinometer had more than 6° of measurement error. The angle calculated by trigonometric function had 3.28° error. The reliability of inclinometer based methods had ICC values < 0.90. Distance based methods and trigonometric angle measurement had an ICC values > 0.90. Concerning the agreement between methods, there was from 1.93° to 14.42° bias, and from 4.24° to 7.96° random error. To assess DF angle in WBLT, the angle calculated by a trigonometric function is the most repeatable method. The methods of measurement cannot be used interchangeably. Copyright © 2016 Elsevier Ltd. All rights reserved.
Cross-cultural adaptation and validation of the Behcet's Disease Current Activity Form in Korea.
Choi, Hyo Jin; Seo, Mi Ryoung; Ryu, Hee Jung; Baek, Han Joo
2015-09-01
This study was undertaken to perform a cross-cultural adaptation of the Behcet's Disease Current Activity Form (BDCAF, version 2006) questionnaire to the Korean language and to evaluate its reliability and validity in a population of Korean patients with Behcet's disease (BD). A cross-cultural study was conducted among patients with BD who attended our rheumatology clinic between November 2012 and March 2013. There were 11 males and 35 females in the group. The mean age of the participants was 48.5 years and the mean disease duration was 6.4 years. The first BDCAF questionnaire was completed on arrival and the second assessment was performed 20 minutes later by a different physician. The test-retest reliability was analyzed by computing κ statistics. Kappa scores of > 0.6 indicated a good agreement. To assess the validity, we compared the total BDCAF score with the patient's/clinician's perception of disease activity and the Korean version of the Behcet's Disease Quality of Life (BDQOL). For the test-retest reliability, good agreements were achieved on items such as headache, oral/genital ulceration, erythema, skin pustules, arthralgia, nausea/vomiting/abdominal pain, and diarrhea with altered/frank blood per rectum. Moderate agreement was observed for eye and nervous system involvement. We achieved a fair agreement for arthritis and major vessel involvement. Significant correlations were obtained between the total BDCAF score with the BDQOL and the patient's/clinician's perception of disease activity p < 0.05). The Korean version of the BDCAF is a reliable and valid instrument for measuring current disease activity in Korean BD patients.
Does clinical experience affect the reproducibility of cervical vertebrae maturation method?
Rongo, Roberto; Valleta, Rosa; Bucci, Rosaria; Bonetti, Giulio Alessandri; Michelotti, Ambrosina; D'Antò, Vincenzo
2015-09-01
To assess interobserver and intraobserver reproducibility of the cervical vertebrae maturation method (CVMM) among three panels of judges with different levels of orthodontic experience (OE). Fifty individual lateral cephalograms of good quality with complete visualization of cervical vertebrae 1 to 4 were selected. Thirty clinicians, divided according to their OE into three groups (junior group, JU, OE ≤ 1 year; postgraduate group, PG, 2 ≤ OE ≤ 4 years; specialist group, SP, OE ≥ 7 years), evaluated the cephalograms in two sessions (T1 and T2) at 3 weeks apart. Kendall's W and weighted Cohen's kappa (κ) coefficients were performed to assess interobserver and intraobserver agreement. The level of significance was set as P < .05. For both the interobserver and the intraobserver datasets, the percentage of perfect agreement (PPA) and the number of stages apart for each disagreement were calculated. Kendall's W at T1 was SP = 0.61, PG = 0.70, and JU = 0.87; at T2 it was SP = 0.78, PG = 0.85, and JU = 0.86. The percentage of total interobserver perfect agreement (Inter-PPA) was 42.3% at T1 and 46.3% at T2. The JU group had the highest Cohen's κ coefficient at 0.78, while the PG and SP had coefficients of 0.64 each. The percentage of total intraobserver perfect agreement (Intra-PPA) was 54.2%. The reproducibility of the method was not improved by the level of orthodontic experience. The group with the lowest level of orthodontic experience had the best performance.
Campos, Fernanda Magalhães Freire; Repoles, Laura Cotta; de Araújo, Fernanda Fortes; Peruhype-Magalhães, Vanessa; Xavier, Marcelo Antônio Pascoal; Sabino, Ester Cerdeira; de Freitas Carneiro Proietti, Anna Bárbara; Andrade, Mariléia Chaves; Teixeira-Carvalho, Andréa; Martins-Filho, Olindo Assis; Gontijo, Célia Maria Ferreira
2018-04-01
A relevant issue in Chagas disease serological diagnosis regards the requirement of using several confirmatory methods to elucidate the status of non-negative results from blood bank screening. The development of a single reliable method may potentially contribute to distinguish true and false positive results. Our aim was to evaluate the performance of the multiplexed flow-cytometry anti-T. cruzi/Leishmania IgG1 serology/(FC-TRIPLEX Chagas/Leish IgG1) with three conventional confirmatory criteria (ELISA-EIA, Immunofluorescence assay-IIF and EIA/IIF consensus criterion) to define the final status of samples with actual/previous non-negative results during anti-T. cruzi ELISA-screening in blood banks. Apart from inconclusive results, the FC-TRIPLEX presented a weak agreement index with EIA, while a strong agreement was observed when either IIF or EIA/IIF consensus criteria were applied. Discriminant analysis and Spearman's correlation further corroborates the agreement scores. ROC curve analysis showed that FC-TRIPLEX performance indexes were higher when IIF and EIA/IIF consensus were used as a confirmatory criterion. Logistic regression analysis further demonstrated that the probability of FC-TRIPLEX to yield positive results was higher for inconclusive results from IIF and EIA/IIF consensus. Machine learning tools illustrated the high level of categorical agreement between FC-TRIPLEX versus IIF or EIA/IIF consensus. Together, these findings demonstrated the usefulness of FC-TRIPLEX as a tool to elucidate the status of non-negative results in blood bank screening of Chagas disease. Copyright © 2018. Published by Elsevier B.V.
The consistency of experts' evaluation of obstetric claims for compensation.
Andreasen, S; Backe, B; Lydersen, S; Øvrebø, K; Øian, P
2015-06-01
The aim of this study was to investigate the consistency of experts' evaluation of different types of obstetric claims for compensation. Inter-rater reliability study of obstetric claims for compensation. Medical experts' evaluation in The Norwegian System of Compensation to Patients, a no-blame system. The 15 most frequently used medical experts were asked to evaluate 12 obstetric claims applied for compensation. Inter-rater agreement was assessed by absolute agreement, Fleiss' kappa statistic and Gwet's AC1. Consistency in the evaluation of negligence (carelessness without intention to harm) and causality (relation between care and injury) between negligence and patient injury. The experts demonstrated moderate consistency in their evaluation of negligence (Fleiss' kappa = 0.53/AC1 = 0.54) and causality (Fleiss' kappa = 0.41/AC1 = 0.54). There was a higher level of agreement in clinical scenarios with well-documented diagnostic criteria and guidelines, including shoulder dystocia and asphyxia with low Apgar score and metabolic acidosis. We found a moderate level of agreement in experts' evaluation of negligence and causality between the injury and provided health care, the two most important questions to be answered in obstetric claims for compensation. © 2014 Royal College of Obstetricians and Gynaecologists.
Sabiani, Laura; Le Dû, Renaud; Loundou, Anderson; d'Ercole, Claude; Bretelle, Florence; Boubli, Léon; Carcopino, Xavier
2015-12-01
The objective of the study was to evaluate the intra- and interobserver agreement among obstetric experts in court regarding the retrospective review of abnormal fetal heart rate tracings and obstetrical management of patients with abnormal fetal heart rate during labor. A total of 22 French obstetric experts in court reviewed 30 cases of term deliveries of singleton pregnancies diagnosed with at least 1 hour of abnormal fetal heart rate, including 10 cases with adverse neonatal outcome. The experts reviewed all cases twice within a 3-month interval, with the first review being blinded to neonatal outcome. For each case reviewed, the experts were provided with the obstetric data and copies of the complete fetal heart rate recording and the partogram. The experts were asked to classify the abnormal fetal heart rate tracing and to express whether they agreed with the obstetrical management performed. When they disagreed, the experts were asked whether they concluded that an error had been made and whether they considered the obstetrical management as the cause of cerebral palsy in children if any. Compared with blinded review, the experts were significantly more likely to agree with the obstetric management performed (P < .001) and with the mode of delivery (P < .001) when informed about the neonatal outcome and were less likely to conclude that an error had been made (P < .001) or to establish a link with potential cerebral palsy (P = .003). The experts' intraobserver agreement for the review of abnormal fetal heart rate tracing and obstetrical management were both mediocre (kappa = 0.46-0.51 and kappa = 0.48-0.53, respectively). The interobserver agreement for the review of abnormal fetal heart rate tracing was low and was not improved by knowledge of the neonatal outcome (kappa = 0.11-0.18). The interobserver agreement for the interpretation of obstetrical management was also low (kappa = 0.08-0.19) but appeared to be improved by knowledge of the neonatal outcome (kappa = 0.15-0.32). The intra- and interobserver agreement among obstetric experts in court for the review of abnormal fetal heart rate tracing and the appropriateness of obstetrical care is poor, suggesting a lack of objectivity of obstetrical expertise as currently performed in court. Copyright © 2015 Elsevier Inc. All rights reserved.
Ramezani, Alireza; Ahmadieh, Hamid; Azarmina, Mohsen; Soheilian, Masoud; Dehghan, Mohammad H; Mohebbi, Mohammad R
2009-12-01
To evaluate the validity of a new method for the quantitative analysis of fundus or angiographic images using Photoshop 7.0 (Adobe, USA) software by comparing with clinical evaluation. Four hundred and eighteen fundus and angiographic images of diabetic patients were evaluated by three retina specialists and then by computing using Photoshop 7.0 software. Four variables were selected for comparison: amount of hard exudates (HE) on color pictures, amount of HE on red-free pictures, severity of leakage, and the size of the foveal avascular zone (FAZ). The coefficient of agreement (Kappa) between the two methods in the amount of HE on color and red-free photographs were 85% (0.69) and 79% (0.59), respectively. The agreement for severity of leakage was 72% (0.46). In the two methods for the evaluation of the FAZ size using the magic and lasso software tools, the agreement was 54% (0.09) and 89% (0.77), respectively. Agreement in the estimation of the FAZ size by the lasso magnetic tool was excellent and was almost as good in the quantification of HE on color and on red-free images. Considering the agreement of this new technique for the measurement of variables in fundus images using Photoshop software with the clinical evaluation, this method seems to have sufficient validity to be used for the quantitative analysis of HE, leakage, and FAZ size on the angiograms of diabetic patients.
Aydogan, Nevres Hurriyet; Tosun, Kursad
2018-01-01
Background The aim of this study was to evaluate the agreement between the fracture-risk assessment tool (FRAX®)-based intervention strategy in Turkey and the recommendations published in the Healthcare Practices Statement (HPS). Methods This descriptive cross-sectional study included individuals aged 40 to 90 years who were previously diagnosed as having osteoporosis but had not received any treatment. The intervention thresholds recommended by the National Osteoporosis Foundation for treatment were used. The criteria necessary for the start of administration of pharmacological agents in osteoporosis treatment were evaluated on the basis of the HPS guidelines. Results Of the 1,255 patients evaluated, 161 (12.8%) were male and 1,094 (87.2%) were female. In the evaluation, according to HPS, treatment was recommended for 783 patients (62.4%; HPS+) and not recommended for 472 (37.6%; HPS−). Of the 783 HPS+ patients, 391 (49.9%) were FRAX+, and of the 472 HPS− patients, 449 (95.1%) were FRAX−. A statistically significant difference was observed between the treatment recommendations of HPS and FRAX® (P<0.001). In the age group of 75 to 90 years, excellent agreement was found between the two strategies (Gwet's agreement coefficient 1=0.94). As age increased, the agreement between the two treatment strategies also increased. Conclusions The FRAX® model has different treatment recommendation rates from the HPS. The agreement between the two is at a minimal level. However, as age increased, so did the agreement between the FRAX® and the HPS treatment recommendations. In the recommendation to start pharmacological treatment primarily based on age, non-medical interventions that preserve bone density should be evaluated. PMID:29900157
The performance evaluation of WinOSPM model for urban street canyons of Nantes in France.
Gokhale, Sharad B; Rebours, Arnaud; Pavageau, Michel
2005-01-01
Air quality modelling is primarily the quantative approach. It is more difficult as it demands input data accuracy, uncertainties and the efficient methodologies to judge the extent of models accuracy. As a result, model validation has to be regarded as an integral part of the modelling process. Furthermore, models are often validated on a limited number of testcases therefore, appropriate evaluation procedure must be implemented to ensure these models will be applicable for various conditions. The study presented here was carried out to evaluate the WinOSPM (Preliminary version of windows based Operational Street Pollution Model) for air pollutants viz. CO, NO, NO2, NOx and C6H6 for three street canyons of Nantes (France) and for the three base years 1999, 2000, and 2001. Each street canyon selected for this study has typical and unidentical features. The rue de Strasbourg and Boulevard Victor Hugo have many building exceptions whereas rue Crébillon has not any. Application of the model above to the three street canyons revealed that WinOSPM could be used in the case when measurements are not available. This was justified from the results at rue Crébillon. The special interest was in the benzene modelled values as its content in fuel has been targeted to reduce to 1% for the years 2000 and onwards (from its 5% until the year 1999). The 50 to 70% reduction in the benzene concentrations is found for both the years i.e. in 2000 and 2001. This has further justified that air quality models are useful and interesting tools in optimising emission reduction strategies. Moreover, it is also the new pollutant added to the measurement campaign of Air Pays de la Loire (APL) for the city of Nantes. For benzene weekly averages are estimated from the hourly-modelled values for all the streets and compared with that of measurements. They are found in excellent agreement with each other's. For other pollutants annual means and percentiles were compared. The statistical analysis was done to evaluate the models performance using index of agreement and correlation coefficient. The index of agreement (d) and correlation coefficient (r) for all the streets show that estimated concentration levels are in good agreement with that of measurements. From the index of agreements, it can be inferred that model has very less potential for errors. The model's sensitivity to building-exceptions was also tested for the rue de Strasbourg. Results did not reflect this feature very well. It is perceived that the influence of this feature might have been suppressed in averaging the annual hourly values. This influence is apparently seen in hourly average time series variations. Finally, WinOSPM model was found a simple but very useful model. It could very well represent the detailed flow and dispersion conditions in urban streets.
NASA Technical Reports Server (NTRS)
Vane, Gregg; Porter, Wallace M.; Reimer, John H.; Chrien, Thomas G.; Green, Robert O.
1988-01-01
Results are presented of the assessment of AVIRIS performance during the 1987 flight season by the AVIRIS project and the earth scientists who were chartered by NASA to conduct an independent data quality and sensor performance evaluation. The AVIRIS evaluation program began in late June 1987 with the sensor meeting most of its design requirements except for signal-to-noise ratio in the fourth spectrometer, which was about half of the required level. Several events related to parts failures and design flaws further reduced sensor performance over the flight season. Substantial agreement was found between the assessments by the project and the independent investigators of the effects of these various factors. A summary of the engineering work that is being done to raise AVIRIS performance to its required level is given. In spite of degrading data quality over the flight season, several exciting scientific results were obtained from the data. These include the mapping of the spatial variation of atmospheric precipitable water, detection of environmentally-induced shifts in the spectral red edge of stressed vegetation, detection of spectral features related to pigment, leaf water and ligno-cellulose absorptions in plants, and the identification of many diagnostic mineral absorption features in a variety of geological settings.
7 CFR 1494.801 - Enforcement and termination of agreements with CCC.
Code of Federal Regulations, 2014 CFR
2014-01-01
... 7 Agriculture 10 2014-01-01 2014-01-01 false Enforcement and termination of agreements with CCC... Enhancement Program Operations § 1494.801 Enforcement and termination of agreements with CCC. (a) Performance in accordance with an Agreement with CCC. (1) An exporter which enters into an Agreement with CCC...
7 CFR 1494.801 - Enforcement and termination of agreements with CCC.
Code of Federal Regulations, 2013 CFR
2013-01-01
... 7 Agriculture 10 2013-01-01 2013-01-01 false Enforcement and termination of agreements with CCC... Enhancement Program Operations § 1494.801 Enforcement and termination of agreements with CCC. (a) Performance in accordance with an Agreement with CCC. (1) An exporter which enters into an Agreement with CCC...
7 CFR 1494.801 - Enforcement and termination of agreements with CCC.
Code of Federal Regulations, 2012 CFR
2012-01-01
... 7 Agriculture 10 2012-01-01 2012-01-01 false Enforcement and termination of agreements with CCC... Enhancement Program Operations § 1494.801 Enforcement and termination of agreements with CCC. (a) Performance in accordance with an Agreement with CCC. (1) An exporter which enters into an Agreement with CCC...
Urrutia, Julio; Zamora, Tomas; Klaber, Ianiv; Carmona, Maximiliano; Palma, Joaquin; Campos, Mauricio; Yurac, Ratko
2016-04-01
It has been postulated that the complex patterns of spinal injuries have prevented adequate agreement using thoraco-lumbar spinal injuries (TLSI) classifications; however, limb fracture classifications have also shown variable agreements. This study compared agreement using two TLSI classifications with agreement using two classifications of fractures of the trochanteric area of the proximal femur (FTAPF). Six evaluators classified the radiographs and computed tomography scans of 70 patients with acute TLSI using the Denis and the new AO Spine thoraco-lumbar injury classifications. Additionally, six evaluators classified the radiographs of 70 patients with FTAPF using the Tronzo and the AO schemes. Six weeks later, all cases were presented in a random sequence for repeat assessment. The Kappa coefficient (κ) was used to determine agreement. Inter-observer agreement: For TLSI, using the AOSpine classification, the mean κ was 0.62 (0.57-0.66) considering fracture types, and 0.55 (0.52-0.57) considering sub-types; using the Denis classification, κ was 0.62 (0.59-0.65). For FTAPF, with the AO scheme, the mean κ was 0.58 (0.54-0.63) considering fracture types and 0.31 (0.28-0.33) considering sub-types; for the Tronzo classification, κ was 0.54 (0.50-0.57). Intra-observer agreement: For TLSI, using the AOSpine scheme, the mean κ was 0.77 (0.72-0.83) considering fracture types, and 0.71 (0.67-0.76) considering sub-types; for the Denis classification, κ was 0.76 (0.71-0.81). For FTAPF, with the AO scheme, the mean κ was 0.75 (0.69-0.81) considering fracture types and 0.45 (0.39-0.51) considering sub-types; for the Tronzo classification, κ was 0.64 (0.58-0.70). Using the main types of AO classifications, inter- and intra-observer agreement of TLSI were comparable to agreement evaluating FTAPF; including sub-types, inter- and intra-observer agreement evaluating TLSI were significantly better than assessing FTAPF. Inter- and intra-observer agreements using the Denis classification were also significantly better than agreement using the Tronzo scheme. Copyright © 2015 Elsevier Ltd. All rights reserved.
Moreno-Montañés, Javier; Antón, Vanesa; Antón, Alfonso; Larrosa, José M; Martinez-de-la-Casa, José María; Rebolleda, Gema; Ussa, Fernando; García-Granero, Marta
2017-04-01
It is important to evaluate intraobserver and interobserver agreement using visual field (VF) testing and optical coherence tomography (OCT) software in order to understand whether the use of this software is sufficient to detect glaucoma progression and to make decisions regarding its treatment. To evaluate agreement in VF and OCT software among 5 glaucoma specialists. The printout pages from VF progression software and OCT progression software from 100 patients were randomized, and the 5 glaucoma specialists subjectively and independently evaluated them for glaucoma. Each image was classified as having no progression, questionable progression, or progression. The principal investigator classified the patients previously as without variability (normal) or with high variability among tests (difficult). Using both software, the specialists also evaluated whether the glaucoma damage had progressed and if treatment change was needed. One month later, the same observers reevaluated the patients in a different order to determine intraobserver reproducibility. Intraobserver and interobserver agreement was estimated using κ statistics and Gwet second-order agreement coefficient. The agreement was compared with other factors. Of the 100 observed patients, half were male and all were white; the mean (SD) age was 69.7 (14.1) years. Intraobserver agreement was substantial to almost perfect for VF software (overall κ [95% CI], 0.59 [0.46-0.72] to 0.87 [0.79-0.96]) and similar for OCT software (overall κ [95% CI], 0.59 [0.46-0.71] to 0.85 [0.76-0.94]). Interobserver agreement among the 5 glaucoma specialists with the VF progression software was moderate (κ, 0.48; 95% CI, 0.41-0.55) and similar to OCT progression software (κ, 0.52; 95% CI, 0.44-0.59). Interobserver agreement was substantial in images classified as having no progression but only fair in those classified as having questionable glaucoma progression or glaucoma progression. Interobserver agreement was fair regarding questions about glaucoma progression (κ, 0.39; 95% CI, 0.32-0.48) and consideration about treatment changes (κ, 0.39; 95% CI, 0.32-0.48). The factors associated with agreement were the glaucoma stage and case difficulty. There was substantial intraobserver agreement but moderate interobserver agreement among glaucoma specialists using 2 glaucoma progression software packages. These data suggest that these glaucoma progression software packages are insufficient to obtain high interobserver agreement in both devices except in patients with no progression. The low agreement regarding progression or treatment changes suggests that both software programs used in isolation are insufficient for decision making.
Minnesota urban partnership agreement national evaluation : exogenous factors test plan.
DOT National Transportation Integrated Search
2009-11-17
This report presents the exogenous factors test plan for the national evaluation of the Minnesota Urban Partnership Agreement (UPA) under the United States Department of Transportation (U.S. DOT) UPA Program. The Minnesota UPA projects focus on reduc...
Minnesota urban partnership agreement national evaluation : transit system data test plan.
DOT National Transportation Integrated Search
2009-11-17
This report presents the test plan for collecting and analyzing transit system data for the Minnesota Urban Partnership Agreement (UPA) National Evaluation under the United States Department of Transportation (U.S. DOT) UPA Program. The Minnesota UPA...
DOT National Transportation Integrated Search
2009-11-17
This report presents the test plan for developing, conducting, and analyzing surveys, interviews, and focus groups for evaluating the Minnesota Urban Partnership Agreement (UPA) under the United States Department of Transportation (U.S. DOT) UPA Prog...
Seattle/Lake Washington corridor urban partnership agreement : national evaluation report.
DOT National Transportation Integrated Search
2014-12-01
This document presents the final report on the national evaluation of the Seattle/Lake Washington Corridor (LWC) Urban Partnership Agreement (UPA) under the United States Department of Transportation (U.S. DOT) UPA Program. The Seattle UPA projects f...
32 CFR 37.505 - What resources are available to assist me during the pre-award business evaluation?
Code of Federal Regulations, 2013 CFR
2013-07-01
... the pre-award business evaluation? 37.505 Section 37.505 National Defense Department of Defense OFFICE OF THE SECRETARY OF DEFENSE DoD GRANT AND AGREEMENT REGULATIONS TECHNOLOGY INVESTMENT AGREEMENTS Pre-Award Business Evaluation § 37.505 What resources are available to assist me during the pre-award...
32 CFR 37.505 - What resources are available to assist me during the pre-award business evaluation?
Code of Federal Regulations, 2012 CFR
2012-07-01
... the pre-award business evaluation? 37.505 Section 37.505 National Defense Department of Defense OFFICE OF THE SECRETARY OF DEFENSE DoD GRANT AND AGREEMENT REGULATIONS TECHNOLOGY INVESTMENT AGREEMENTS Pre-Award Business Evaluation § 37.505 What resources are available to assist me during the pre-award...
32 CFR 37.505 - What resources are available to assist me during the pre-award business evaluation?
Code of Federal Regulations, 2011 CFR
2011-07-01
... the pre-award business evaluation? 37.505 Section 37.505 National Defense Department of Defense OFFICE OF THE SECRETARY OF DEFENSE DoD GRANT AND AGREEMENT REGULATIONS TECHNOLOGY INVESTMENT AGREEMENTS Pre-Award Business Evaluation § 37.505 What resources are available to assist me during the pre-award...
32 CFR 37.505 - What resources are available to assist me during the pre-award business evaluation?
Code of Federal Regulations, 2014 CFR
2014-07-01
... the pre-award business evaluation? 37.505 Section 37.505 National Defense Department of Defense OFFICE OF THE SECRETARY OF DEFENSE DoD GRANT AND AGREEMENT REGULATIONS TECHNOLOGY INVESTMENT AGREEMENTS Pre-Award Business Evaluation § 37.505 What resources are available to assist me during the pre-award...
Personality as a Determinate of Response Dimension Scaling for Likert Rating Categories.
ERIC Educational Resources Information Center
Hensley, Wayne E.; Sanford, David L.
A study was designed to evaluate the use of summated rating (Likert) scales of agreement, evaluation, and frequency. The subjects, 58 female and 45 male college students, rank ordered the descriptive adjectives for the areas of agreement, evaluation, and frequency on a scale of 1 to 100. They also completed the Personal Report of Communication…
7 CFR 1709.12 - Reporting requirements.
Code of Federal Regulations, 2010 CFR
2010-01-01
.... To support Agency monitoring of project performance and use of grant funds, Grantees shall file... such times as provided in the grant agreement. (b) Project performance reports in such form and at such intervals as provided in the grant agreement. The project performance report shall compare accomplishments...
44 CFR 208.22 - Preparedness Cooperative Agreement process.
Code of Federal Regulations, 2010 CFR
2010-10-01
... and supporting documentation to be submitted with the application. (2) Period of performance. Absent special circumstances, the period of performance for Preparedness Cooperative Agreements will be 1 year from the date of award. The Assistance Officer may allow for an alternate period of performance with...
Belote, R Travis; Carroll, Carlos; Martinuzzi, Sebastián; Michalak, Julia; Williams, John W; Williamson, Matthew A; Aplet, Gregory H
2018-06-21
Addressing uncertainties in climate vulnerability remains a challenge for conservation planning. We evaluate how confidence in conservation recommendations may change with agreement among alternative climate projections and metrics of climate exposure. We assessed agreement among three multivariate estimates of climate exposure (forward velocity, backward velocity, and climate dissimilarity) using 18 alternative climate projections for the contiguous United States. For each metric, we classified maps into quartiles for each alternative climate projections, and calculated the frequency of quartiles assigned for each gridded location (high quartile frequency = more agreement among climate projections). We evaluated recommendations using a recent climate adaptation heuristic framework that recommends emphasizing various conservation strategies to land based on current conservation value and expected climate exposure. We found that areas where conservation strategies would be confidently assigned based on high agreement among climate projections varied substantially across regions. In general, there was more agreement in forward and backward velocity estimates among alternative projections than agreement in estimates of local dissimilarity. Consensus of climate predictions resulted in the same conservation recommendation assignments in a few areas, but patterns varied by climate exposure metric. This work demonstrates an approach for explicitly evaluating alternative predictions in geographic patterns of climate change.
Individual and Joint Expert Judgments as Reference Standards in Artifact Detection
Verduijn, Marion; Peek, Niels; de Keizer, Nicolette F.; van Lieshout, Erik-Jan; de Pont, Anne-Cornelie J.M.; Schultz, Marcus J.; de Jonge, Evert; de Mol, Bas A.J.M.
2008-01-01
Objective To investigate the agreement among clinical experts in their judgments of monitoring data with respect to artifacts, and to examine the effect of reference standards that consist of individual and joint expert judgments on the performance of artifact filters. Design Individual judgments of four physicians, a majority vote judgment, and a consensus judgment were obtained for 30 time series of three monitoring variables: mean arterial blood pressure (ABPm), central venous pressure (CVP), and heart rate (HR). The individual and joint judgments were used to tune three existing automated filtering methods and to evaluate the performance of the resulting filters. Measurements The interrater agreement was calculated in terms of positive specific agreement (PSA). The performance of the artifact filters was quantified in terms of sensitivity and positive predictive value (PPV). Results PSA values between 0.33 and 0.85 were observed among clinical experts in their selection of artifacts, with relatively high values for CVP data. Artifact filters developed using judgments of individual experts were found to moderately generalize to new time series and other experts; sensitivity values ranged from 0.40 to 0.60 for ABPm and HR filters (PPV: 0.57–0.84), and from 0.63 to 0.80 for CVP filters (PPV: 0.71–0.86). A higher performance value for the filters was found for the three variable types when joint judgments were used for tuning the filtering methods. Conclusion Given the disagreement among experts in their individual judgment of monitoring data with respect to artifacts, the use of joint reference standards obtained from multiple experts is recommended for development of automatic artifact filters. PMID:18096912
Gomes, Luciano T; Tada, Mauro S; Katsuragawa, Tony H; Povoa, Marinete M; Viana, Giselle Mr; Alecrim, Maria das Gracas C; De Santana-Filho, Frankllin S; Arcanjo, Ana Ruth L; Couto, Alvaro A R A; Calvosa, Vanja S P; Nery, Andreia F; Fontes, Cor J F
2013-03-14
In remote areas of the Amazon Region, diagnosis of malaria by microscopy is practically impossible. This study aimed to evaluate the performance of two rapid diagnostic tests (RDTs) targeting different malaria antigens stored at room temperature in the Brazilian Amazon Region. Performance of the OptiMal Pf/Pan test and ICT-Now Pf/Pan test was analyzed retrospectively in 1,627 and 1,602 blood samples, respectively. Tests were performed over a 15-month period. Kits were stored at room temperature in five community health centres located in the Brazilian Amazon Region. RDT results were compared with thick blood smear (TBS) results to determine sensitivity, specificity, and accuracy of the RDT. The sensitivities of the OptiMal Pf/Pan test were 79.7% for Plasmodium falciparum malaria diagnosis and 85.7% for non-P. falciparum infections. The results showed a crude agreement of 88.5% for P. falciparum, and 88.3% for non-P. falciparum infections (Kappa index = 0.74 and 0.75, respectively). For the ICT-Now Pf/Pan test (CI 95%), the sensitivities were 87.9% for P. falciparum malaria diagnosis and 72.5% for non-P. falciparum infection. Crude agreement between the ICT-Now Pf/Pan test and TBS was 91.4% for P. falciparum and 79.7% for non-P. falciparum infection. The Kappa index was 0.81 and 0.59 for the final diagnosis of P. falciparum and non-P. falciparum, respectively. Higher levels of parasitaemia were associated with higher crude agreement between RDT and TBS. The sensitivities of RDTs stored at room temperature over a 15-month period and performed in field conditions were lower than those previously reported.
Bobenchik, April M.; Deak, Eszter; Hindler, Janet A.; Charlton, Carmen L.
2016-01-01
ABSTRACT The performances of Vitek 2 AST-GN69 and AST-XN06 cards were compared to Clinical and Laboratory Standards Institute (CLSI) reference broth microdilution (BMD) for 99 isolates of Pseudomonas aeruginosa, 26 Acinetobacter baumannii isolates, and 11 Stenotrophomonas maltophilia isolates. In total, 15 antimicrobials were evaluated, with 11 for P. aeruginosa, 14 for A. baumannii, and 2 for S. maltophilia. Categorical agreement (CA) was assessed using both Vitek 2 breakpoints and 2016 CLSI M100S 26th edition breakpoints. The essential agreement values for P. aeruginosa, A. baumannii, and S. maltophilia were 99.5%, 99.2%, and 100%, respectively. The CA values for P. aeruginosa, A. baumannii, and S. maltophilia were 94.1%, 92.7%, and 95.5%, respectively, by the Vitek 2 breakpoints, and 93.4%, 92.3%, and 95.5%, respectively, by the CLSI breakpoints. Overall, the Vitek 2 performance was comparable to that of BMD using both Vitek 2 breakpoints and 2016 CLSI M100S 26th edition breakpoints. Improved performance was noted for the reformulated piperacillin-tazobactam and imipenem found on the AST-GN69 card, with no very major or major errors noted when using the CLSI breakpoints. PMID:27881616
Dalhuijsen, J; Zwaard, A M; Grol, R P; Mokkink, H M
1993-10-16
To establish to which degree the guidelines of the Otitis media acuta standard of the Netherlands college of general practitioners (NHG) are followed in practice and what is GPs' opinion of them. Survey. 65 general practitioners settled throughout the Netherlands. The GPs were recruited for the study by means of an advertisement and a personal approach. Opinions and performances were inventoried using a questionnaire and self-completion forms for successive doctor-patient contacts, respectively. Virtually all participants were familiar with the NHG standard and most were familiar with three central guidelines. Of these three, the recommendation of an active follow-up policy for children up to the age of 2 was known to the fewest GPs, and this recommendation also received the least support. In the 327 consultations recorded there was good agreement between actual performance and the guidelines, especially as regards history-taking, physical examination and diagnosing. Regarding management and patient information the agreement was less. Concerning the follow-up policy for young children and the prescribing of antibiotics, actual performance clearly differed from the NHG standard, possibly because the relevant guidelines were less well known. The results of this study are relevant to the planning of GP education and the evaluation of the guidelines themselves.
Bennett, Rebecca J; Taljaard, Dunay S; Olaithe, Michelle; Brennan-Jones, Chris; Eikelboom, Robert H
2017-09-18
The purpose of this study is to raise awareness of interobserver concordance and the differences between interobserver reliability and agreement when evaluating the responsiveness of a clinician-administered survey and, specifically, to demonstrate the clinical implications of data types (nominal/categorical, ordinal, interval, or ratio) and statistical index selection (for example, Cohen's kappa, Krippendorff's alpha, or interclass correlation). In this prospective cohort study, 3 clinical audiologists, who were masked to each other's scores, administered the Practical Hearing Aid Skills Test-Revised to 18 adult owners of hearing aids. Interobserver concordance was examined using a range of reliability and agreement statistical indices. The importance of selecting statistical measures of concordance was demonstrated with a worked example, wherein the level of interobserver concordance achieved varied from "no agreement" to "almost perfect agreement" depending on data types and statistical index selected. This study demonstrates that the methodology used to evaluate survey score concordance can influence the statistical results obtained and thus affect clinical interpretations.
Endodontic radiography: who is reading the digital radiograph?
Tewary, Shalini; Luzzo, Joseph; Hartwell, Gary
2011-07-01
Digital radiographic imaging systems have undergone tremendous improvements since their introduction. Advantages of digital radiographs over conventional films include lower radiation doses compared with conventional films, instantaneous images, archiving and sharing images easily, and manipulation of several radiographic properties that might help in diagnosis. A total of 6 observers including 2 endodontic residents, 3 endodontists, and 1 oral radiologist evaluated 150 molar digital periapical radiographs to determine which of the following conditions existed: normal periapical tissue, widened periodontal ligament, or presence of periapical radiolucency. The evaluators had full control over the radiograph's parameters of the Planmeca Dimaxis software program. All images were viewed on the same computer monitor with ideal vie-wing conditions. The same 6 observers evaluated the same 150 digital images 3 months later. The data were analyzed to determine how well the evaluators agreed with each other (interobserver agreement) for 2 rounds of observations and with themselves (intraobserver agreement). Fleiss kappa statistical analysis was used to measure the level of agreement among multiple raters. The overall Fleiss kappa value for interobserver agreement for the first round of interpretation was 0.34 (P < .001). The overall Fleiss kappa value for interobserver agreement for the second round of interpretation was 0.35 (P < .001). This resulted in fair (0.2-0.4) agreement among the 6 raters at both observation periods. A weighted kappa analysis was used to determine intraobserver agreement, which showed on average a moderate agreement. The results indicate that the interpretation of a dental radiograph is subjective, irrespective of whether conventional or digital radiographs are used. The factors that appeared to have the most impact were the years of experience of the examiner and familiarity of the operator with a given digital system. Copyright © 2011 American Association of Endodontists. Published by Elsevier Inc. All rights reserved.
Conlin, Paul R; Asefzadeh, Baharak; Pasquale, Louis R; Selvin, Gerald; Lamkin, Rebecca; Cavallerano, Anthony A
2015-12-01
Digital retinal imaging using store-and-forward technology is used to screen for diabetic retinopathy (DR). Its usefulness in detecting non-diabetic eye diseases is uncertain. We determined the level of agreement between teleretinal imaging supplemented with visual acuity and intraocular pressure (IOP) measurements (ie, technology-assisted eye (TAE) exam) and a comprehensive eye exam in evaluation for DR and non-diabetic ocular conditions. We conducted a prospective, observational study with two parallel evaluations. Patients with diabetes (n=317) had a TAE exam and a comprehensive eye exam on the same day. A subset of participants with normal baseline exams (n=72) had follow-up exams 1 year later. We measured the level of agreement for referable ocular findings. Agreement for referable ocular findings was moderate (n=389, agreement: 77%; κ: 0.55), due in part to ungradable exams (22%). However, about half of the ungradable exams had findings that warranted referral. There was substantial agreement for follow-up exams (n=72, agreement: 93%; κ: 0.63). Among all gradable exams (n=303), the TAE exam had 86% sensitivity and 84% specificity for referable ocular findings, with high agreement (≥94%) for DR and other major ocular diagnoses. There was moderate-to-substantial agreement between a TAE exam and a comprehensive eye exam for referable ocular findings in patients with diabetes. Ungradable exams were a frequent marker of ocular pathology. Teleretinal imaging may be a useful evaluation for both diabetic and non-diabetic ocular conditions. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/
Yi, Ji Sook; Han, Jong Kyu; Kim, Hyun-Joo
2015-01-01
Objective To assess inter-modality variability when evaluating cervical intervertebral disc herniation using 64-slice multidetector-row computed tomography (MDCT) and magnetic resonance imaging (MRI). Materials and Methods Three musculoskeletal radiologists independently reviewed cervical spine 1.5-T MRI and 64-slice MDCT data on C2-3 though C6-7 of 51 patients in the context of intervertebral disc herniation. Interobserver and inter-modality agreements were expressed as unweighted kappa values. Weighted kappa statistics were used to assess the extents of agreement in terms of the number of involved segments (NIS) in disc herniation and epicenter measurements collected using MDCT and MRI. Results The interobserver agreement rates upon evaluation of disc morphology by the three radiologists were in fair to moderate agreement (k = 0.39-0.53 for MDCT images; k = 0.45-0.56 for MRIs). When the disc morphology was categorized into two and four grades, the inter-modality agreement rates were moderate (k-value, 0.59) and substantial (k-value, 0.66), respectively. The inter-modality agreements for evaluations of the NIS (k-value, 0.78) and the epicenter (k-value, 0.79) were substantial. Also, the interobserver agreements for the NIS (CT; k-value, 0.85 and MRI; k-value, 0.88) and epicenter (CT; k-value, 0.74 and MRI; k-value, 0.70) evaluations by two readers were substantial. MDCT tended to underestimate the extent of herniated disc lesions compared with MRI. Conclusion Multidetector-row computed tomography and MRI showed a moderate-to-substantial degree of inter-modality agreement for the assessment of herniated cervical discs. MDCT images have a tendency to underestimate the anterior/posterior extent of the herniated disc compared with MRI. PMID:26175589
Antifungal susceptibility testing of Malassezia yeast: comparison of two different methodologies.
Rojas, Florencia D; Córdoba, Susana B; de Los Ángeles Sosa, María; Zalazar, Laura C; Fernández, Mariana S; Cattana, María E; Alegre, Liliana R; Carrillo-Muñoz, Alfonso J; Giusiano, Gustavo E
2017-02-01
All Malassezia species are lipophilic; thus, modifications are required in susceptibility testing methods to ensure their growth. Antifungal susceptibility of Malassezia species using agar and broth dilution methods has been studied. Currently, few tests using disc diffusion methods are being performed. The aim was to evaluate the in vitro susceptibility of Malassezia yeast against antifungal agents using broth microdilution and disc diffusion methods, then to compare both methodologies. Fifty Malassezia isolates were studied. Microdilution method was performed as described in reference document and agar diffusion test was performed using antifungal tablets and discs. To support growth, culture media were supplemented. To correlate methods, linear regression analysis and categorical agreement was determined. The strongest linear association was observed for fluconazole and miconazole. The highest agreement between both methods was observed for itraconazole and voriconazole and the lowest for amphotericin B and fluconazole. Although modifications made to disc diffusion method allowed to obtain susceptibility data for Malassezia yeast, variables cannot be associated through a linear correlation model, indicating that inhibition zone values cannot predict MIC value. According to the results, disc diffusion assay may not represent an alternative to determine antifungal susceptibility of Malassezia yeast. © 2016 Blackwell Verlag GmbH.
NASA Technical Reports Server (NTRS)
Phillips, M. A.
1973-01-01
Results are presented of an analysis which compares the performance predictions of a thermal model of a multi-panel modular radiator system with thermal vacuum test data. Comparisons between measured and predicted individual panel outlet temperatures and pressure drops and system outlet temperatures have been made over the full range of heat loads, environments and plumbing arrangements expected for the shuttle radiators. Both two sided and one sided radiation have been included. The model predictions show excellent agreement with the test data for the maximum design conditions of high load and hot environment. Predictions under minimum design conditions of low load-cold environments indicate good agreement with the measured data, but evaluation of low load predictions should consider the possibility of parallel flow instabilities due to main system freezing. Performance predictions under intermediate conditions in which the majority of the flow is not in either the main or prime system are adequate although model improvements in this area may be desired. The primary modeling objective of providing an analytical technique for performance predictions of a multi-panel radiator system under the design conditions has been met.
2017-01-01
Background Machine learning techniques may be an effective and efficient way to classify open-text reports on doctor’s activity for the purposes of quality assurance, safety, and continuing professional development. Objective The objective of the study was to evaluate the accuracy of machine learning algorithms trained to classify open-text reports of doctor performance and to assess the potential for classifications to identify significant differences in doctors’ professional performance in the United Kingdom. Methods We used 1636 open-text comments (34,283 words) relating to the performance of 548 doctors collected from a survey of clinicians’ colleagues using the General Medical Council Colleague Questionnaire (GMC-CQ). We coded 77.75% (1272/1636) of the comments into 5 global themes (innovation, interpersonal skills, popularity, professionalism, and respect) using a qualitative framework. We trained 8 machine learning algorithms to classify comments and assessed their performance using several training samples. We evaluated doctor performance using the GMC-CQ and compared scores between doctors with different classifications using t tests. Results Individual algorithm performance was high (range F score=.68 to .83). Interrater agreement between the algorithms and the human coder was highest for codes relating to “popular” (recall=.97), “innovator” (recall=.98), and “respected” (recall=.87) codes and was lower for the “interpersonal” (recall=.80) and “professional” (recall=.82) codes. A 10-fold cross-validation demonstrated similar performance in each analysis. When combined together into an ensemble of multiple algorithms, mean human-computer interrater agreement was .88. Comments that were classified as “respected,” “professional,” and “interpersonal” related to higher doctor scores on the GMC-CQ compared with comments that were not classified (P<.05). Scores did not vary between doctors who were rated as popular or innovative and those who were not rated at all (P>.05). Conclusions Machine learning algorithms can classify open-text feedback of doctor performance into multiple themes derived by human raters with high performance. Colleague open-text comments that signal respect, professionalism, and being interpersonal may be key indicators of doctor’s performance. PMID:28298265
Gibbons, Chris; Richards, Suzanne; Valderas, Jose Maria; Campbell, John
2017-03-15
Machine learning techniques may be an effective and efficient way to classify open-text reports on doctor's activity for the purposes of quality assurance, safety, and continuing professional development. The objective of the study was to evaluate the accuracy of machine learning algorithms trained to classify open-text reports of doctor performance and to assess the potential for classifications to identify significant differences in doctors' professional performance in the United Kingdom. We used 1636 open-text comments (34,283 words) relating to the performance of 548 doctors collected from a survey of clinicians' colleagues using the General Medical Council Colleague Questionnaire (GMC-CQ). We coded 77.75% (1272/1636) of the comments into 5 global themes (innovation, interpersonal skills, popularity, professionalism, and respect) using a qualitative framework. We trained 8 machine learning algorithms to classify comments and assessed their performance using several training samples. We evaluated doctor performance using the GMC-CQ and compared scores between doctors with different classifications using t tests. Individual algorithm performance was high (range F score=.68 to .83). Interrater agreement between the algorithms and the human coder was highest for codes relating to "popular" (recall=.97), "innovator" (recall=.98), and "respected" (recall=.87) codes and was lower for the "interpersonal" (recall=.80) and "professional" (recall=.82) codes. A 10-fold cross-validation demonstrated similar performance in each analysis. When combined together into an ensemble of multiple algorithms, mean human-computer interrater agreement was .88. Comments that were classified as "respected," "professional," and "interpersonal" related to higher doctor scores on the GMC-CQ compared with comments that were not classified (P<.05). Scores did not vary between doctors who were rated as popular or innovative and those who were not rated at all (P>.05). Machine learning algorithms can classify open-text feedback of doctor performance into multiple themes derived by human raters with high performance. Colleague open-text comments that signal respect, professionalism, and being interpersonal may be key indicators of doctor's performance. ©Chris Gibbons, Suzanne Richards, Jose Maria Valderas, John Campbell. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 15.03.2017.
Barros, Zoraya Medeiros; de Alencar Ximenes, Ricardo Arraes; Miranda-Filho, Demócrito Barros; de Albuquerque, Maria de Fátima Pessoa Militão; Melo, Heloísa Ramos Lacerda; Carvalho, Erico Higino; Gelenske, Thais; Diniz, George; Bandeira, Francisco
2010-12-01
The Framingham score is used in most studies on human immunodeficiency virus (HIV)-positive patients to estimate the risk for coronary heart disease; however, it may have some limitations for detecting risk among these individuals. The aim of this study was to evaluate the agreement between the Framingham and Prospective Cardiovascular of Münster (PROCAM) scores among HIV-positive individuals and to investigate the factors associated with disagreement between the two scores. A cross-sectional study was conducted in a population of HIV/acquired immunodeficiency syndrome (AIDS) patients attending the outpatient's clinics of two reference centers for HIV/AIDS in Pernambuco, Brazil. Agreement between the Framingham and PROCAM scores was evaluated using the kappa index. From this analysis, a variable called "disagreement between scores" was created, and univariate and multivariate analysis were performed to investigate the factors associated with this variable. The prevalence of low, moderate, and high risk were, respectively, 78.7%, 13.5%, and 7.8% by Framingham score and 88.5%, 4.3%, and 7.2% by PROCAM (kappa = 0.64, P ≤ 0.0001). Agreement in the subgroup with metabolic syndrome by the International Diabetes Federation (IDF) (kappa = 0.51, P ≤ 0.0001) and the National Cholesterol Education Program (NCEP) (kappa = 0.59, P ≤ 0.0001) criteria was moderate. The Framingham score identified greater proportion of women with moderate risk. Factors independently associated with disagreement were: smoking, sex, age, low-density lipoprotein cholesterol, diastolic blood pressure, and metabolic syndrome. There was a good agreement between the Framingham and PROCAM scores in HIV-positive patients, but a higher proportion of moderate-high risk was identified by the Framingham score. This disagreement should be evaluated in cohort studies to observe clinical outcomes over the course of time.
Manevich-Mazor, Mirra; Weissmann-Brenner, Alina; Bar Yosef, Omer; Hoffmann, Chen; Mazor, Roei David; Mosheva, Mariela; Achiron, Reuven Ryszard; Katorza, Eldad
2018-06-07
To evaluate the added value of fetal MRI to ultrasound in detecting and specifying callosal anomalies, and its impact on clinical decision making. Fetuses with a sonographic diagnosis of an anomalous corpus callosum (CC) who underwent a subsequent fetal brain MRI between 2010 and 2015 were retrospectively evaluated and classified according to the severity of the findings. The findings detected on ultrasound were compared to those detected on MRI. An analysis was performed to assess whether fetal MRI altered the group classification, and thus the management of these pregnancies. 78 women were recruited following sonographic diagnoses of either complete or partial callosal agenesis, short, thin or thick CC. Normal MRI studies were obtained inµ19 cases (24 %). Among these, all children available for follow-up received an adequate adaptive score in their Vineland II adaptive behavior scale assessment. Analysis of the concordance between US and MRI demonstrated a substantial level of agreement for complete callosal agenesis (kappa: 0.742), moderate agreement for thin CC (kappa: 0.418) and fair agreement for all other callosal anomalies. Comparison between US and MRI-based mild/severe findings classifications revealed that MRI contributed to a change in the management for 28 fetuses (35.9 %), mostly (25 fetuses, 32.1 %) in favor of pregnancy preservation. Fetal MRI effectively detects callosal anomalies and enables satisfactory validation of the presence or absence of callosal anomalies identified by ultrasound and adds valuable data that improves clinical decision making. © Georg Thieme Verlag KG Stuttgart · New York.
ERIC Educational Resources Information Center
Stratis, Elizabeth A.; Lecavalier, Luc
2017-01-01
This study evaluated the magnitude of informant agreement and predictors of agreement on behavior and emotional problems and autism symptoms in 403 children with autism and their typically developing siblings. Parent-teacher agreement was investigated on the "Child Behavior Checklist" (CBCL) and "Social Responsiveness Scale"…
Fox, M R; Pandolfino, J E; Sweis, R; Sauter, M; Abreu Y Abreu, A T; Anggiansah, A; Bogte, A; Bredenoord, A J; Dengler, W; Elvevi, A; Fruehauf, H; Gellersen, S; Ghosh, S; Gyawali, C P; Heinrich, H; Hemmink, M; Jafari, J; Kaufman, E; Kessing, K; Kwiatek, M; Lubomyr, B; Banasiuk, M; Mion, F; Pérez-de-la-Serna, J; Remes-Troche, J M; Rohof, W; Roman, S; Ruiz-de-León, A; Tutuian, R; Uscinowicz, M; Valdovinos, M A; Vardar, R; Velosa, M; Waśko-Czopnik, D; Weijenborg, P; Wilshire, C; Wright, J; Zerbib, F; Menne, D
2015-01-01
High-resolution esophageal manometry (HRM) is a recent development used in the evaluation of esophageal function. Our aim was to assess the inter-observer agreement for diagnosis of esophageal motility disorders using this technology. Practitioners registered on the HRM Working Group website were invited to review and classify (i) 147 individual water swallows and (ii) 40 diagnostic studies comprising 10 swallows using a drop-down menu that followed the Chicago Classification system. Data were presented using a standardized format with pressure contours without a summary of HRM metrics. The sequence of swallows was fixed for each user but randomized between users to avoid sequence bias. Participants were blinded to other entries. (i) Individual swallows were assessed by 18 practitioners (13 institutions). Consensus agreement (≤ 2/18 dissenters) was present for most cases of normal peristalsis and achalasia but not for cases of peristaltic dysmotility. (ii) Diagnostic studies were assessed by 36 practitioners (28 institutions). Overall inter-observer agreement was 'moderate' (kappa 0.51) being 'substantial' (kappa > 0.7) for achalasia type I/II and no lower than 'fair-moderate' (kappa >0.34) for any diagnosis. Overall agreement was somewhat higher among those that had performed >400 studies (n = 9; kappa 0.55) and 'substantial' among experts involved in development of the Chicago Classification system (n = 4; kappa 0.66). This prospective, randomized, and blinded study reports an acceptable level of inter-observer agreement for HRM diagnoses across the full spectrum of esophageal motility disorders for a large group of clinicians working in a range of medical institutions. Suboptimal agreement for diagnosis of peristaltic motility disorders highlights contribution of objective HRM metrics. © 2014 International Society for Diseases of the Esophagus.
Defrance, Carine; Bollache, Emilie; Kachenoura, Nadjia; Perdrix, Ludivine; Hrynchyshyn, Nataliya; Bruguière, Eric; Redheuil, Alban; Diebold, Benoit; Mousseaux, Elie
2012-09-01
Accurate quantification of aortic valve stenosis (AVS) is needed for relevant management decisions. However, transthoracic Doppler echocardiography (TTE) remains inconclusive in a significant number of patients. Previous studies demonstrated the usefulness of phase-contrast cardiovascular magnetic resonance (PC-CMR) in noninvasive AVS evaluation. We hypothesized that semiautomated analysis of aortic hemodynamics from PC-CMR might provide reproducible and accurate evaluation of aortic valve area (AVA), aortic velocities, and gradients in agreement with TTE. We studied 53 AVS patients (AVA(TTE)=0.87±0.44 cm(2)) and 21 controls (AVA(TTE)=2.96±0.59 cm(2)) who had TTE and PC-CMR of aortic valve and left ventricular outflow tract on the same day. PC-CMR data analysis included left ventricular outflow tract and aortic valve segmentation, and extraction of velocities, gradients, and flow rates. Three AVA measures were performed: AVA(CMR1) based on Hakki formula, AVA(CMR2) based on continuity equation, AVA(CMR3) simplified continuity equation=left ventricular outflow tract peak flow rate/aortic peak velocity. Our analysis was reproducible, as reflected by low interoperator variability (<4.56±4.40%). Comparison of PC-CMR and TTE aortic peak velocities and mean gradients resulted in good agreement (r=0.92 with mean bias=-29±62 cm/s and r=0.86 with mean bias=-12±15 mm Hg, respectively). Although good agreement was found between TTE and continuity equation-based CMR-AVA (r>0.94 and mean bias=-0.01±0.38 cm(2) for AVA(CMR2), -0.09±0.28 cm(2) for AVA(CMR3)), AVA(CMR1) values were lower than AVA(TTE) especially for higher AVA (mean bias=-0.45±0.52 cm(2)). Besides, ability of PC-CMR to detect severe AVS, defined by TTE, provided the best results for continuity equation-based methods (accuracy >94%). Our PC-CMR semiautomated AVS evaluation provided reproducible measurements that accurately detected severe AVS and were in good agreement with TTE.
NASA Astrophysics Data System (ADS)
Camp, H. A.; Moyer, Steven; Moore, Richard K.
2010-04-01
The Night Vision and Electronic Sensors Directorate's current time-limited search (TLS) model, which makes use of the targeting task performance (TTP) metric to describe image quality, does not explicitly account for the effects of visual clutter on observer performance. The TLS model is currently based on empirical fits to describe human performance for a time of day, spectrum and environment. Incorporating a clutter metric into the TLS model may reduce the number of these empirical fits needed. The masked target transform volume (MTTV) clutter metric has been previously presented and compared to other clutter metrics. Using real infrared imagery of rural images with varying levels of clutter, NVESD is currently evaluating the appropriateness of the MTTV metric. NVESD had twenty subject matter experts (SME) rank the amount of clutter in each scene in a series of pair-wise comparisons. MTTV metric values were calculated and then compared to the SME observers rankings. The MTTV metric ranked the clutter in a similar manner to the SME evaluation, suggesting that the MTTV metric may emulate SME response. This paper is a first step in quantifying clutter and measuring the agreement to subjective human evaluation.
Tian, Bing; Xu, Bing; Lu, Jianping; Liu, Qi; Wang, Li; Wang, Minjie
2015-06-01
This study aimed to evaluate the usefulness of four-dimensional CTA before and after embolization treatment with ONYX-18 in eleven patients with cranial dural arteriovenous fistulas, and to compare the results with those of the reference standard DSA. Eleven patients with cranial dural arteriovenous fistulas detected on DSA underwent transarterial embolization with ONYX-18. Four-dimensional CTA was performed an average of 2 days before and 4 days after DSA. Four-dimensional CTA and DSA images were reviewed by two neuroradiologists for identification of feeding arteries and drainage veins and for determining treatment effects. Interobserver and intermodality agreement between four-dimensional CTA and DSA were assessed. Forty-two feeding arteries were identified for 14 fistulas in the 11 patients. Of these, 36 (85.71%) were detected on four-dimensional CTA. After transarterial embolization, one patient got partly embolized, and the fistulas in the remaining 10 patients were completely occluded. The interobserver agreement for four-dimensional CTA and intermodality agreement between four-dimensional CTA and DSA were excellent (κ=1) for shunt location, identification of drainage veins, and fistula occlusion after treatment. Four-dimensional CTA images are highly accurate when compared with DSA images both before and after transarterial embolization treatment. Four-dimensional CTA can be used for diagnosis as well as follow-up of cranial dural arteriovenous fistulas in clinical settings. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
2018-01-01
This study aimed to assess and validate the repeatability and agreement of quantitative elastography of novel shear wave methods on four individual tissue-mimicking liver fibrosis phantoms with different known Young’s modulus. We used GE Logiq E9 2D-SWE, Philips iU22 ARFI (pSWE), Samsung TS80A SWE (pSWE), Hitachi Ascendus (SWM) and Transient Elastography (TE). Two individual investigators performed all measurements non-continued and in parallel. The methods were evaluated for inter- and intraobserver variability by intraclass correlation, coefficient of variation and limits of agreement using the median elastography value. All systems used in this study provided high repeatability in quantitative measurements in a liver fibrosis phantom and excellent inter- and intraclass correlations. All four elastography platforms showed excellent intra-and interobserver agreement (interclass correlation 0.981–1.000 and intraclass correlation 0.987–1.000) and no significant difference in mean elasticity measurements for all systems, except for TE on phantom 4. All four liver fibrosis phantoms could be differentiated by quantitative elastography, by all platforms (p<0.001). In the Bland-Altman analysis the differences in measurements were larger for the phantoms with higher Young’s modulus. All platforms had a coefficient of variation in the range 0.00–0.21 for all four phantoms, equivalent to low variance and high repeatability. PMID:29293527
Cordova, James S; Schreibmann, Eduard; Hadjipanayis, Costas G; Guo, Ying; Shu, Hui-Kuo G; Shim, Hyunsuk; Holder, Chad A
2014-01-01
Standard-of-care therapy for glioblastomas, the most common and aggressive primary adult brain neoplasm, is maximal safe resection, followed by radiation and chemotherapy. Because maximizing resection may be beneficial for these patients, improving tumor extent of resection (EOR) with methods such as intraoperative 5-aminolevulinic acid fluorescence-guided surgery (FGS) is currently under evaluation. However, it is difficult to reproducibly judge EOR in these studies due to the lack of reliable tumor segmentation methods, especially for postoperative magnetic resonance imaging (MRI) scans. Therefore, a reliable, easily distributable segmentation method is needed to permit valid comparison, especially across multiple sites. We report a segmentation method that combines versatile region-of-interest blob generation with automated clustering methods. We applied this to glioblastoma cases undergoing FGS and matched controls to illustrate the method's reliability and accuracy. Agreement and interrater variability between segmentations were assessed using the concordance correlation coefficient, and spatial accuracy was determined using the Dice similarity index and mean Euclidean distance. Fuzzy C-means clustering with three classes was the best performing method, generating volumes with high agreement with manual contouring and high interrater agreement preoperatively and postoperatively. The proposed segmentation method allows tumor volume measurements of contrast-enhanced T1-weighted images in the unbiased, reproducible fashion necessary for quantifying EOR in multicenter trials. PMID:24772206
Mai, Hang-Nga; Lee, Kyeong Eun; Lee, Kyu-Bok; Jeong, Seung-Mi; Lee, Seok-Jae; Lee, Cheong-Hee; An, Seo-Young; Lee, Du-Hyeong
2017-10-01
The purpose of this study was to evaluate the reliability of computer-aided replica technique (CART) by calculating its agreement with the replica technique (RT), using statistical agreement analysis. A prepared metal die and a metal crown were fabricated. The gap between the restoration and abutment was replicated using silicone indicator paste (n = 25). Gap measurements differed in the control (RT) and experimental (CART) groups. In the RT group, the silicone replica was manually sectioned, and the marginal and occlusal gaps were measured using a microscope. In the CART group, the gap was digitized using optical scanning and image superimposition, and the gaps were measured using a software program. The agreement between the measurement techniques was evaluated by using the 95% Bland-Altman limits of agreement and concordance correlation coefficients (CCC). The least acceptable CCC was 0.90. The RT and CART groups showed linear association, with a strong positive correlation in gap measurements, but without significant differences. The 95% limits of agreement between the paired gap measurements were 3.84% and 7.08% of the mean. The lower 95% confidence limits of CCC were 0.9676 and 0.9188 for the marginal and occlusal gap measurements, respectively, and the values were greater than the allowed limit. The CART is a reliable digital approach for evaluating the fit accuracy of fixed dental prostheses.
Winkler, Peter; Zurl, Brigitte; Guss, Helmuth; Kindl, Peter; Stuecklschweiger, Georg
2005-02-21
A system for dosimetric verification of intensity-modulated radiotherapy (IMRT) treatment plans using absolute calibrated radiographic films is presented. At our institution this verification procedure is performed for all IMRT treatment plans prior to patient irradiation. Therefore clinical treatment plans are transferred to a phantom and recalculated. Composite treatment plans are irradiated to a single film. Film density to absolute dose conversion is performed automatically based on a single calibration film. A software application encompassing film calibration, 2D registration of measurement and calculated distributions, image fusion, and a number of visual and quantitative evaluation utilities was developed. The main topic of this paper is a performance analysis for this quality assurance procedure, with regard to the specification of tolerance levels for quantitative evaluations. Spatial and dosimetric precision and accuracy were determined for the entire procedure, comprising all possible sources of error. The overall dosimetric and spatial measurement uncertainties obtained thereby were 1.9% and 0.8 mm respectively. Based on these results, we specified 5% dose difference and 3 mm distance-to-agreement as our tolerance levels for patient-specific quality assurance for IMRT treatments.
Zeng, Qing T; Kogan, Sandra; Ngo, Long; Greenes, Robert A
2004-01-01
Millions of consumers perform health information retrieval (HIR) online. To better understand the consumers' perspective on HIR performance, we conducted an observation and interview study of 97 health information consumers. Consumers were asked to perform HIR tasks and we recorded their view regarding performance using several differ-ent subjective measurements: finding the desired information, usefulness of the information found, satisfaction with the information, and intention to continue searching. Statistical analysis was applied to verify if the multiple subjective measurements were redundant. The measurements ranged from slight agreement to no agreement among them. A number of reasons were identified for this lack of agreement. Although related, the four subjective measurements of HIR performance are distinct from each other and carried different useful information
Sandia National Laboratories: Agreements
Technology Partnerships Business, Industry, & Non-Profits Government Universities Center for Development Agreement (CRADA) Strategic Partnership Projects, Non-Federal Entity (SPP/NFE) Agreements New Projects, Non-Federal Entity (SPP/NFE) Agreements Sandia performs work on a reimbursable basis for a non
n+235U resonance parameters and neutron multiplicities in the energy region below 100 eV
NASA Astrophysics Data System (ADS)
Pigni, Marco T.; Capote, Roberto; Trkov, Andrej; Pronyaev, Vladimir G.
2017-09-01
In August 2016, following the recent effort within the Collaborative International Evaluated Library Organization (CIELO) pilot project to improve the neutron cross sections of 235U, Oak Ridge National Laboratory (ORNL) collaborated with the International Atomic Energy Agency (IAEA) to release a resonance parameter evaluation. This evaluation restores the performance of the evaluated cross sections for the thermal- and above-thermal-solution benchmarks on the basis of newly evaluated thermal neutron constants (TNCs) and thermal prompt fission neutron spectra (PFNS). Performed with support from the US Nuclear Criticality Safety Program (NCSP) in an effort to provide the highest fidelity general purpose nuclear database for nuclear criticality applications, the resonance parameter evaluation was submitted as an ENDF-compatible file to be part of the next release of the ENDF/B-VIII.0 nuclear data library. The resonance parameter evaluation methodology used the Reich-Moore approximation of the R-matrix formalism implemented in the code SAMMY to fit the available time-of-flight (TOF) measured data for the thermal induced cross section of n+235U up to 100 eV. While maintaining reasonably good agreement with the experimental data, the validation analysis focused on restoring the benchmark performance for 235U solutions by combining changes to the resonance parameters and to the prompt resonance v̅ below 100 eV.
Fahlenkamp, U L; Lembcke, A; Roesler, R; Schwenke, C; Huppertz, A; Streitparth, F; Taupitz, M; Hamm, B; Wagner, M
2013-10-01
To compare electrocardiography (ECG)-gated computed tomography angiography (CTA) with ECG-gated magnetic resonance angiography (MRA) for assessment of the left atrium (LA) and pulmonary veins (PVs). Twenty-nine consecutive patients who underwent both cardiac CTA and MRA were evaluated. Contrast-enhanced CTA was performed with prospective ECG-gating using a 320 detector row CT system. Contrast-enhanced MRA was performed with prospective ECG-gating using a 1.5 T MRI system equipped with a 32 channel cardiac coil. MRA was acquired during free-breathing with a navigator-gated inversion-recovery prepared steady-state free precession sequence. Two readers independently assessed the CTA and MRA images for vascular definition of the PVs (from 0, not visualized, to 4, excellent definition) and ostial PV diameters. Variants of LA anatomy were assessed in consensus. CTA was successfully performed in all patients with a mean radiation exposure of 5.1 ± 2.2 mSv. MRA was successfully performed in 27 of 29 patients (93 %). Visual definition of PVs was rated significantly higher on CTA compared to MRA (p < 0.0001; reader 1: excellent/good ratings of CTA versus MRA: 100% versus 86%; reader 2: excellent/good ratings of CTA versus MRA: 99% versus 89%). Assessment of ostial PV diameters showed good correlation between CTA and MRA (reader 1: Pearson r = 0.91; reader 2: Pearson r = 0.82). Moreover, agreement between both imaging methods for evaluation of variants of LA anatomy was high (agreement rate of 95% (95% CI: 92-99%). ECG-gated CTA provides higher image quality compared to ECG-gated MRA. Nevertheless, both CTA and MRA provided similar information of LA anatomy and ostial PV diameters. Copyright © 2013 The Royal College of Radiologists. Published by Elsevier Ltd. All rights reserved.
De Maio, Peter; White, Lawrence M; Bleakney, Robert; Menezes, Ravi J; Theodoropoulos, John
2014-07-01
To evaluate the diagnostic performance of viewing magnetic resonance (MR) images on a handheld mobile device compared with a conventional radiology workstation for the diagnosis of intra-articular knee pathology. Prospective comparison study. Tertiary care center. Fifty consecutive subjects who had MR imaging of the knee followed by knee arthroscopy were prospectively evaluated. Two musculoskeletal radiologists independently reviewed each MR study using 2 different viewers: the OsiriX DICOM viewer software on an Apple iPhone 3GS device and eFilm Workstation software on a conventional picture archiving and communications system workstation. Sensitivity and specificity of the iPhone and workstation interpretations was performed using knee arthroscopy as the reference standard. Intraobserver concordance and agreement between the iPhone and workstation interpretations were determined. There was no statistically significant difference between the 2 devices for each paired comparison of diagnostic performance. For the iPhone interpretations, sensitivity ranged from 77% (13 of 17) for the lateral meniscus to 100% (17 of 17) for the anterior cruciate ligament. Specificity ranged from 74% (14 of 19) for cartilage to 100% (50 of 50) for the posterior cruciate ligament. There was a very high level of interobserver and intraobserver agreement between devices and readers. The iPhone reads took longer than the corresponding workstation reads, with a significant mean difference between the iPhone and workstation reads of 3.98 minutes (P < 0.001). The diagnostic performance of interpreting MR images on a handheld mobile device for the assessment of intra-articular knee pathology is similar to that of a conventional radiology workstation, however, requires a longer viewing time. Timely and accurate interpretation of complex medical images using mobile device solutions could result in new workflow efficiencies and ultimately improve patient care.
Detecting ‘Wrong Blood in Tube’ Errors: Evaluation of a Bayesian Network Approach
Doctor, Jason N.; Strylewicz, Greg
2010-01-01
Objective In an effort to address the problem of laboratory errors, we develop and evaluate a method to detect mismatched specimens from nationally collected blood laboratory data in two experiments. Methods In Experiment 1 and 2 using blood labs from National Health and Nutrition Examination Survey (NHANES) and values derived from the Diabetes Prevention Program (DPP) respectively, a proportion of glucose and HbA1c specimens were randomly mismatched. A Bayesian network that encoded probabilistic relationships among analytes was used to predict mismatches. In Experiment 1 the performance of the network was compared against existing error detection software. In Experiment 2 the network was compared against 11 human experts recruited from the American Academy of Clinical Chemists. Results were compared via area under the receiver-operating characteristics curves (AUCs) and with agreement statistics. Results In Experiment 1 the network was most predictive of mismatches that produced clinically significant discrepancies between true and mismatched scores ((AUC of 0.87 (±0.04) for HbA1c and 0.83 (±0.02) for glucose), performed well in identifying errors among those self-reporting diabetes (N = 329) (AUC = 0.79 (± 0.02)) and performed significantly better than the established approach it was tested against (in all cases p < .0.05). In Experiment 2 it performed better (and in no case worse) than 7 of the 11 human experts. Average percent agreement was 0.79. and Kappa (κ) was 0.59, between experts and the Bayesian network. Conclusions Bayesian network can accurately identify mismatched specimens. The algorithm is best at identifying mismatches that result in a clinically significant magnitude of error. PMID:20566275
Inter- and intra-rater reliability and agreement in determining subcutaneous tumour margins in dogs.
Ranganathan, B; Milovancev, M; Leeper, H; Townsend, K L; Bracha, S; Curran, K
2018-03-01
The objective of this prospective study was to evaluate agreement and reliability of calliper-based measurements of locally invasive subcutaneous malignant tumours in dogs. Four raters measured the longest diameter of 12 subcutaneous tumours (7 soft tissue sarcomas and 5 mast cell tumours) from 11 client-owned dogs during 3 randomized, blinded measurement trials, both pre- and post-sedation. Inter- and intra-rater reliability was evaluated using intra-class correlation coefficient (ICC) and agreement was evaluated using Bland-Altman plots. Inter- and intra-rater reliability was good (ICC range of 0.8694-0.89520) and excellent (ICC range of 0.9720-0.9966), respectively. For agreement calculations, an a priori clinically relevant limit of agreement of 10 mm was set. Inter- and intra-rater agreement was unacceptable with inter-rater limits of agreement ranging from 15.9 to 55.6 mm and intra-rater limit of agreement ranging from 11.9 to 28.1 mm. Review of the measurement trial photographs revealed that calliper orientation changes were frequent, occurring in 9/12 (75%) and 8/12 (67%) pre- and post-sedation cases. No significant correlation was found between inter-rater measurement standard deviations and calliper orientation changes or dog body condition score. These findings suggest veterinarians may have poor agreement in determining the gross edge of tumours, which is expected to introduce bias and inconsistency in tumour staging, assessing response to therapy, and surgical margin planning. Due to the potential consequences for veterinary cancer patients, future studies are needed to validate the present findings. © 2018 John Wiley & Sons Ltd.
De Tobel, J; Phlypo, I; Fieuws, S; Politis, C; Verstraete, K L; Thevissen, P W
2017-12-01
The development of third molars can be evaluated with medical imaging to estimate age in subadults. The appearance of third molars on magnetic resonance imaging (MRI) differs greatly from that on radiographs. Therefore a specific staging technique is necessary to classify third molar development on MRI and to apply it for age estimation. To develop a specific staging technique to register third molar development on MRI and to evaluate its performance for age estimation in subadults. Using 3T MRI in three planes, all third molars were evaluated in 309 healthy Caucasian participants from 14 to 26 years old. According to the appearance of the developing third molars on MRI, descriptive criteria and schematic representations were established to define a specific staging technique. Two observers, with different levels of experience, staged all third molars independently with the developed technique. Intra- and inter-observer agreement were calculated. The data were imported in a Bayesian model for age estimation as described by Fieuws et al. (2016). This approach adequately handles correlation between age indicators and missing age indicators. It was used to calculate a point estimate and a prediction interval of the estimated age. Observed age minus predicted age was calculated, reflecting the error of the estimate. One-hundred and sixty-six third molars were agenetic. Five percent (51/1096) of upper third molars and 7% (70/1044) of lower third molars were not assessable. Kappa for inter-observer agreement ranged from 0.76 to 0.80. For intra-observer agreement kappa ranged from 0.80 to 0.89. However, two stage differences between observers or between staging sessions occurred in up to 2.2% (20/899) of assessments, probably due to a learning effect. Using the Bayesian model for age estimation, a mean absolute error of 2.0 years in females and 1.7 years in males was obtained. Root mean squared error equalled 2.38 years and 2.06 years respectively. The performance to discern minors from adults was better for males than for females, with specificities of 96% and 73% respectively. Age estimations based on the proposed staging method for third molars on MRI showed comparable reproducibility and performance as the established methods based on radiographs.
Warren, Ruth M L; Thompson, Deborah; Pointon, Linda J; Hoff, Rebecca; Gilbert, Fiona J; Padhani, Anwar R; Easton, Douglas F; Lakhani, Sunil R; Leach, Martin O
2006-06-01
To evaluate prospectively the accuracy of a lesion classification system designed for use in a magnetic resonance (MR) imaging high-breast-cancer-risk screening study. All participating patients provided written informed consent. Ethics committee approval was obtained. The results of 1541 contrast material-enhanced breast MR imaging examinations were analyzed; 1441 screening examinations were performed in 638 women aged 24-51 years at high risk for breast cancer, and 100 examinations were performed in 100 women aged 23-81 years. Lesion analysis was performed in 991 breasts, which were divided into design (491 breasts) and testing (500 breasts) sets. The reference standard was histologic analysis of biopsy samples, fine-needle aspiration cytology, or minimal follow-up of 24 months. The scoring system involved the use of five features: morphology (MOR), pattern of enhancement (POE), percentage of maximal focal enhancement (PMFE), maximal signal intensity-time ratio (MITR), and pattern of contrast material washout (POCW). The system was evaluated by means of (a) assessment of interreader agreement, as expressed in kappa statistics, for 315 breasts in which both readers analyzed the same lesion, (b) assessment of the diagnostic accuracy of the scored components with receiver operating characteristic curve analysis, and (c) logistic regression analysis to determine which components of the scoring system were critical to the final score. A new simplified scoring system developed with the design set was applied to the testing set. There was moderate reader agreement regarding overall lesion outcome (ie, malignant, suspicious, or benign) (kappa=0.58) and less agreement regarding the scored components. The area under the receiver operating characteristic curve (AUC) for the overall lesion score, 0.88, was higher than the AUC for any one component. The components MOR, POE, and POCW yielded the best overall result. PMFE and MITR did not contribute to diagnostic utility. Applying a simplified scoring system to the testing set yielded a nonsignificantly (P=.2) higher AUC than did applying the original scoring system (sensitivity, 84%; specificity, 86.0%). Good diagnostic accuracy can be achieved by using simple qualitative descriptors of lesion enhancement, including POCW. In the context of screening, quantitative enhancement parameters appear to be less useful for lesion characterization. Copyright (c) RSNA, 2006.
Mu, Yajun; Bi, Hua; Ekure, Edgar; Ding, Gang; Wei, Nan; Hua, Ning; Qian, Xuehan; Li, Xiaorong
2016-01-01
Purpose To evaluate the effectiveness of Spot photoscreener in detecting amblyopia risk factors meeting 2013 the American Association of Pediatric Ophthalmology and Strabismus (AAPOS) criteria in Chinese preschool and school-age children. Methods One hundred and fifty-five children (310 eyes), aged between 4 to 7 years (5.74 ± 1.2 years) underwent complete ophthalmologic examination, photoscreening, and cycloplegic retinoscopy refraction. The agreement of the results obtained with the photoscreening and retinoscopy was evaluated by linear regression and Bland-Altman plots. The sensitivity and specificity of detecting amblyopia risk factors were calculated based on the AAPOS 2013 guidelines. The overall effectiveness of detecting amblyopia risk factors was analyzed with Receiver Operating Characteristic (ROC) curves. Result The mean refractive errors measured with the Spot were: spherical equivalent (SE) = 0.70 ± 1.99 D, J0 = 0.87 ± 1.01 D, J45 = 0.09 ± 0.60 D. The mean results from retinoscopy were: SE = 1.19 ± 2.22 D, J0 = 0.77 ± 1.00 D, J45 = -0.02 ± 0.45 D. There was a strong linear agreement between results obtained from those two methods (R2 = 0.88, P<0.01). Bland–Altman plot indicated a moderate agreement of cylinder values between the two methods. Based on the criteria specified by the AAPOS 2013 guidelines, the sensitivity and specificity (in respective order) for detecting hyperopia were 98.31% and 97.14%; for detecting myopia were 78.50% and 88.64%; for detecting astigmatism were 90.91% and 80.37%; for detecting anisometropia were 93.10% and 85.25%; and for detection of strabismus was 77.55% and 88.18%. Conclusion The refractive values measured from Spot photoscreener showed a moderate agreement with the results from cycloplegic retinoscopy refraction, however there was an overall myopic shift of -0.49D. The performance in detecting individual amblyopia risk factors was satisfactory, but could be further improved by optimizing criteria based on ROC curves. PMID:26882106
McConnell, Kelly A; Krisher, Lyndsay K; Lenssen, Maureen; Bunik, Maya; Bunge Montes, Saskia; Domek, Gretchen J
2017-01-01
Telehealth education has the potential to serve as an important, low-cost method of expanding healthcare worker education and support, especially in rural settings of low- and middle-income countries. We describe an innovative educational strategy to strengthen a long-term health professional capacity building partnership between Guatemalan and US-based partners. In this pilot evaluation, community health nurses in rural Guatemala received customized, interactive education via telehealth from faculty at the supporting US-based institution. Program evaluation of this 10 lecture series demonstrated high levels of satisfaction among learners and instructors as well as knowledge gain by learners. An average of 5.5 learners and 2 instructors attended the 10 lectures and completed surveys using a Likert scale to rate statements regarding lecture content, technology, and personal connection. Positive statements about lecture content and the applicability to daily work had 98% or greater agreement as did statements regarding ease of technology and convenience. The learners agreed with feeling connected to the instructors 100% of the time, while instructors had 86.4% agreement with connection related statements. Instructors, joining at their respective work locations, rated convenience statements at 100% agreement. This evaluation also demonstrated effectiveness with an average 10.7% increase in pre- to posttest knowledge scores by learners. As the global health community considers efficiency in time, money, and our environment, telehealth education is a critical method to consider and develop for health worker education. Our pilot program evaluation shows that telehealth may be an effective method of delivering education to frontline health workers in rural Guatemala. While larger studies are needed to quantify the duration and benefits of specific knowledge gains and to perform a cost-effectiveness analysis of the program, our initial pilot results are encouraging and show that a telehealth program between a US-based university and a rural community health program in a low- and middle-income country is both feasible and acceptable.
McConnell, Kelly A.; Krisher, Lyndsay K.; Lenssen, Maureen; Bunik, Maya; Bunge Montes, Saskia; Domek, Gretchen J.
2017-01-01
Telehealth education has the potential to serve as an important, low-cost method of expanding healthcare worker education and support, especially in rural settings of low- and middle-income countries. We describe an innovative educational strategy to strengthen a long-term health professional capacity building partnership between Guatemalan and US-based partners. In this pilot evaluation, community health nurses in rural Guatemala received customized, interactive education via telehealth from faculty at the supporting US-based institution. Program evaluation of this 10 lecture series demonstrated high levels of satisfaction among learners and instructors as well as knowledge gain by learners. An average of 5.5 learners and 2 instructors attended the 10 lectures and completed surveys using a Likert scale to rate statements regarding lecture content, technology, and personal connection. Positive statements about lecture content and the applicability to daily work had 98% or greater agreement as did statements regarding ease of technology and convenience. The learners agreed with feeling connected to the instructors 100% of the time, while instructors had 86.4% agreement with connection related statements. Instructors, joining at their respective work locations, rated convenience statements at 100% agreement. This evaluation also demonstrated effectiveness with an average 10.7% increase in pre- to posttest knowledge scores by learners. As the global health community considers efficiency in time, money, and our environment, telehealth education is a critical method to consider and develop for health worker education. Our pilot program evaluation shows that telehealth may be an effective method of delivering education to frontline health workers in rural Guatemala. While larger studies are needed to quantify the duration and benefits of specific knowledge gains and to perform a cost-effectiveness analysis of the program, our initial pilot results are encouraging and show that a telehealth program between a US-based university and a rural community health program in a low- and middle-income country is both feasible and acceptable. PMID:28405582
Evaluation of the LIA-ANA-Profile-17S for the detection of autoantibodies to nuclear antigens.
Yi, Ahram; Lee, Chang-Hoon; Moon, Hee-Won; Kim, Hanah; Hur, Mina; Yun, Yeo-Min
2018-05-01
The diagnostic tests for autoimmune disease include screening for autoantibodies for nuclear antigens (ANA) and antibodies against extractable nuclear antigens (ENA). Using the line immunoassay (LIA) method, various kinds of ENA antibodies can be detected simultaneously. We evaluated the performance of the newly launched LIA-ANA-Profile-17S (Shenzhen YHLO Biotech, Shenzhen, China) as compared to a conventional LIA kit. Residual samples were collected from 200 patients who had been tested for ANA using indirect immunofluorescence. The LIA-ANA-Profile-17S was compared to the EuroLine ANA (Euroimmun, Oberlausitz, Germany) for the analysis of 17 different autoantibodies. The concordance rate and agreement between assays were determined. Samples showing discrepancies between the LIA-ANA-Profile-17S and EuroLine tests were further examined through additional analysis. The overall agreement was moderate (kappa = 0.759, 95% CI = 0.712-0.805). Agreement between assays ranged from weak to almost perfect, except for those tests targeting nucleosomes, histones, and PM-Scl. Of the 57 disparate results between LIA-ANA-Profile-17S and EuroLine, 38 (66.7%) samples tested positive under an additional assay, showing variable patterns between types of autoantibodies. The positive rate of each autoantibody between LIA-ANA-Profile-17S and EuroLine did not differ significantly, except for anti-nucleosome and anti-histone assays in samples from patients diagnosed with systemic lupus erythematosus (P = 0.004 and 0.001, respectively). Compared to those from the conventional EuroLine assay, the LIA-ANA-Profile-17S results showed variable agreement in samples showing different prevalence of each autoantibody. The most frequently detected antibodies showed almost perfect agreement. The LIA-ANA-Profile-17S could play a role in the diagnosis of systemic autoimmune disease in ANA-positive samples. Copyright © 2018 The Canadian Society of Clinical Chemists. Published by Elsevier Inc. All rights reserved.
Schoenmakers, Daphne A L; Feczko, Peter Z; Boonen, Bert; Schotanus, Martijn G M; Kort, Nanne P; Emans, Pieter J
2017-11-01
Previous studies have compared weight-bearing mechanical leg axis (MLA) measurements to non-weight-bearing measurement modalities. Most of these studies compared mean or median values and did not analyse within-person differences between measurements. This study evaluates the within-person agreement of MLA measurements between weight-bearing full-length radiographs (FLR) and non-weight-bearing measurement modalities (computer-assisted surgery (CAS) navigation or MRI). Two independent observers measured the MLA on pre- and postoperative weight-bearing FLR in 168 patients. These measurements were compared to non-weight-bearing measurements obtained by CAS navigation or MRI. Absolute differences in individual subjects were calculated to determine the agreement between measurement modalities. Linear regression was used to evaluate the possibility that other independent variables impact the differences in measurements. A difference was found in preoperative measurements between FLR and CAS navigation (mean of 2.5° with limit of agreement (1.96 SD) of 6.4°), as well as between FLR and MRI measurements (mean of 2.4° with limit of agreement (1.96 SD) of 6.9°). Postoperatively, the mean difference between MLA measured on FLR compared to CAS navigation was 1.5° (limit of agreement (1.96 SD) of 4.6°). Linear regression analysis showed that weight-bearing MLA measurements vary significantly from non-weight-bearing MLA measurements. Differences were more severe in patients with mediolateral instability (p = 0.010), age (p = 0.049) and ≥3° varus or valgus alignment (p = 0.008). The clinical importance of this study lies in the finding that there are within-person differences between weight-bearing and non-weight-bearing measurement modalities. This has implications for preoperative planning, performing total knee arthroplasty (TKA), and clinical follow-up after TKA surgery using CAS navigation or patient-specific instrumentation. III.
Quantitative MR Imaging of Hepatic Steatosis: Validation in Ex Vivo Human Livers
Bannas, Peter; Kramer, Harald; Hernando, Diego; Agni, Rashmi; Cunningham, Ashley M.; Mandal, Rakesh; Motosugi, Utaroh; Sharma, Samir D.; del Rio, Alejandro Munoz; Fernandez, Luis; Reeder, Scott B.
2015-01-01
Emerging magnetic resonance imaging (MRI) biomarkers of hepatic steatosis have demonstrated tremendous promise for accurate quantification of hepatic triglyceride concentration. These methods quantify the “proton density fat-fraction” (PDFF), which reflects the concentration of triglycerides in tissue. Previous in vivo studies have compared MRI-PDFF with histologic steatosis grading for assessment of hepatic steatosis. However, the correlation of MRI-PDFF with the underlying hepatic triglyceride content remained unknown. The aim of this ex vivo study was to validate the accuracy of MRI-PDFF as an imaging biomarker of hepatic steatosis. Using ex vivo human livers, we compared MRI-PDFF with magnetic resonance spectroscopy-PDFF (MRS-PDFF), biochemical triglyceride extraction and histology as three independent reference standards. A secondary aim was to compare the precision of MRI-PDFF relative to biopsy for the quantification of hepatic steatosis. MRI-PDFF was prospectively performed at 1.5T in 13 explanted human livers. We performed co-localized paired evaluation of liver fat content in all nine Couinaud segments using single-voxel MRS-PDFF (n=117), tissue wedges for biochemical triglyceride extraction (n=117), and five core biopsies performed in each segment for histologic grading (n=585). Accuracy of MRI-PDFF was assessed through linear regression with MRS-PDFF, triglyceride extraction and histology. Intra-observer agreement, inter-observer agreement and repeatability of MRI-PDFF and histologic grading were assessed through Bland-Altman analyses. MRI-PDFF showed an excellent correlation with MRS-PDFF (r=0.984; CI: 0.978–0.989) and strong correlation with histology (r=0.850; CI: 0.791–0.894) and triglyceride extraction (r=0.871; CI: 0.818–0.909). Intra-observer agreement, inter-observer agreement and repeatability showed a significantly smaller variance for MRI-PDFF than for histologic steatosis grading (all p<0.001). Conclusion MRI-PDFF is an accurate, precise and reader-independent non-invasive imaging biomarker of liver triglyceride content, capable of steatosis quantification over the entire liver. PMID:26224591
Research: Comparison of the Accuracy of a Pocket versus Standard Pulse Oximeter.
da Costa, João Cordeiro; Faustino, Paula; Lima, Ricardo; Ladeira, Inês; Guimarães, Miguel
2016-01-01
Pulse oximetry has become an essential tool in clinical practice. With patient self-management becoming more prevalent, pulse oximetry self-monitoring has the potential to become common practice in the near future. This study sought to compare the accuracy of two pulse oximeters, a high-quality standard pulse oximeter and an inexpensive pocket pulse oximeter, and to compare both devices with arterial blood co-oximetry oxygen saturation. A total of 95 patients (35.8% women; mean [±SD] age 63.1 ± 13.9 years; mean arterial pressure was 92 ± 12.0 mmHg; mean axillar temperature 36.3 ± 0.4°C) presenting to our hospital for blood gas analysis was evaluated. The Bland-Altman technique was performed to calculate bias and precision, as well as agreement limits. Student's t test was performed. Standard oximeter presented 1.84% bias and a precision error of 1.80%. Pocket oximeter presented a bias of 1.85% and a precision error of 2.21%. Agreement limits were -1.69% to 5.37% (standard oximeter) and -2.48% to 6.18% (pocket oximeter). Both oximeters presented bias, which was expected given previous research. The pocket oximeter was less precise but had agreement limits that were comparable with current evidence. Pocket oximeters can be powerful allies in clinical monitoring of patients based on a self-monitoring/efficacy strategy.
Land, Sally; Cunningham, Philip; Zhou, Jialun; Frost, Kevin; Katzenstein, David; Kantor, Rami; Chen, Yi-Ming Arthur; Oka, Shinichi; DeLong, Allison; Sayer, David; Smith, Jeffery; Dax, Elizabeth M.; Law, Matthew
2010-01-01
The TREAT Asia (Therapeutics, Research, Education, and AIDS Training in Asia) Network is building capacity for Human Immunodeficiency Virus Type-1 (HIV-1) drug resistance testing in the region. The objective of the TREAT Asia Quality Assessment Scheme – designated TAQAS – is to standardize HIV-1 genotypic resistance testing (HIV genotyping) among laboratories to permit rigorous comparison of results from different clinics and testing centres. TAQAS has evaluated three panels of HIV-1-positive plasma from clinical material or low-passage, culture supernatant for up to 10 Asian laboratories. Laboratory participants used their standard protocols to perform HIV genotyping. Assessment was in comparison to a target genotype derived from all participants and the reference laboratory’s result. Agreement between most participants at the edited nucleotide sequence level was high (>98%). Most participants performed to the reference laboratory standard in detection of drug resistance mutations (DRMs). However, there was variation in the detection of nucleotide mixtures (0–83%) and a significant correlation with the detection of DRMs (p < 0.01). Interpretation of antiretroviral resistance showed ~70% agreement among participants when different interpretation systems were used but >90% agreement with a common interpretation system, within the Stanford University Drug Resistance Database. Using the principles of external quality assessment and a reference laboratory, TAQAS has demonstrated high quality HIV genotyping results from Asian laboratories. PMID:19490972
The teamwork in assertive community treatment (TACT) scale: development and validation.
Wholey, Douglas R; Zhu, Xi; Knoke, David; Shah, Pri; Zellmer-Bruhn, Mary; Witheridge, Thomas F
2012-11-01
Team design is meticulously specified for assertive community treatment (ACT) teams, yet performance can vary across ACT teams, even those with high fidelity. By developing and validating the Teamwork in Assertive Community Treatment (TACT) scale, investigators examined the role of team processes in ACT performance. The TACT scale measuring ACT teamwork was developed from a conceptual model grounded in organizational research and adapted for the ACT and mental health context. TACT subscales were constructed after exploratory and confirmatory factor analyses. The reliability, discriminant validity, predictive validity, temporal stability, internal consistency, and within-team agreement were established with surveys from approximately 300 members of 26 Minnesota ACT teams who completed the questionnaire three times, at six-month intervals. Nine TACT subscales emerged from the analyses: exploration, exploitation of new and existing knowledge, psychological safety, goal agreement, conflict, constructive controversy, information accessibility, encounter preparedness, and consumer-centered care. These nine subscales demonstrated fit and temporal stability (confirmatory factor analysis), high internal consistency (Cronbach's alpha), and within-team agreement and between-team differences (rwg and intraclass correlations). Correlational analyses of the subscales revealed that they measure related yet distinctive aspects of ACT team processes, and regression analyses demonstrated predictive validity (encounter preparedness is related to staff outcomes). The TACT scale demonstrated high reliability and validity and can be included in research and evaluation of teamwork in ACT and mental health teams.
Stern, Robin L; Heaton, Robert; Fraser, Martin W; Goddu, S Murty; Kirby, Thomas H; Lam, Kwok Leung; Molineu, Andrea; Zhu, Timothy C
2011-01-01
The requirement of an independent verification of the monitor units (MU) or time calculated to deliver the prescribed dose to a patient has been a mainstay of radiation oncology quality assurance. The need for and value of such a verification was obvious when calculations were performed by hand using look-up tables, and the verification was achieved by a second person independently repeating the calculation. However, in a modern clinic using CT/MR/PET simulation, computerized 3D treatment planning, heterogeneity corrections, and complex calculation algorithms such as convolution/superposition and Monte Carlo, the purpose of and methodology for the MU verification have come into question. In addition, since the verification is often performed using a simpler geometrical model and calculation algorithm than the primary calculation, exact or almost exact agreement between the two can no longer be expected. Guidelines are needed to help the physicist set clinically reasonable action levels for agreement. This report addresses the following charges of the task group: (1) To re-evaluate the purpose and methods of the "independent second check" for monitor unit calculations for non-IMRT radiation treatment in light of the complexities of modern-day treatment planning. (2) To present recommendations on how to perform verification of monitor unit calculations in a modern clinic. (3) To provide recommendations on establishing action levels for agreement between primary calculations and verification, and to provide guidance in addressing discrepancies outside the action levels. These recommendations are to be used as guidelines only and shall not be interpreted as requirements.
48 CFR 242.002 - Interagency agreements.
Code of Federal Regulations, 2012 CFR
2012-10-01
... 48 Federal Acquisition Regulations System 3 2012-10-01 2012-10-01 false Interagency agreements... agreements. (b)(i) DoD requires reimbursement, at a rate set by the Under Secretary of Defense (Comptroller... administration, and audit services provided under a no-charge reciprocal agreement; (B) Services performed under...
22 CFR 120.22 - Technical assistance agreement.
Code of Federal Regulations, 2014 CFR
2014-04-01
... 22 Foreign Relations 1 2014-04-01 2014-04-01 false Technical assistance agreement. 120.22 Section 120.22 Foreign Relations DEPARTMENT OF STATE INTERNATIONAL TRAFFIC IN ARMS REGULATIONS PURPOSE AND DEFINITIONS § 120.22 Technical assistance agreement. An agreement (e.g., contract) for the performance of a...
22 CFR 120.22 - Technical assistance agreement.
Code of Federal Regulations, 2010 CFR
2010-04-01
... 22 Foreign Relations 1 2010-04-01 2010-04-01 false Technical assistance agreement. 120.22 Section 120.22 Foreign Relations DEPARTMENT OF STATE INTERNATIONAL TRAFFIC IN ARMS REGULATIONS PURPOSE AND DEFINITIONS § 120.22 Technical assistance agreement. An agreement (e.g., contract) for the performance of a...
22 CFR 120.22 - Technical assistance agreement.
Code of Federal Regulations, 2012 CFR
2012-04-01
... 22 Foreign Relations 1 2012-04-01 2012-04-01 false Technical assistance agreement. 120.22 Section 120.22 Foreign Relations DEPARTMENT OF STATE INTERNATIONAL TRAFFIC IN ARMS REGULATIONS PURPOSE AND DEFINITIONS § 120.22 Technical assistance agreement. An agreement (e.g., contract) for the performance of a...
22 CFR 120.22 - Technical assistance agreement.
Code of Federal Regulations, 2013 CFR
2013-04-01
... 22 Foreign Relations 1 2013-04-01 2013-04-01 false Technical assistance agreement. 120.22 Section 120.22 Foreign Relations DEPARTMENT OF STATE INTERNATIONAL TRAFFIC IN ARMS REGULATIONS PURPOSE AND DEFINITIONS § 120.22 Technical assistance agreement. An agreement (e.g., contract) for the performance of a...
22 CFR 120.22 - Technical assistance agreement.
Code of Federal Regulations, 2011 CFR
2011-04-01
... 22 Foreign Relations 1 2011-04-01 2011-04-01 false Technical assistance agreement. 120.22 Section 120.22 Foreign Relations DEPARTMENT OF STATE INTERNATIONAL TRAFFIC IN ARMS REGULATIONS PURPOSE AND DEFINITIONS § 120.22 Technical assistance agreement. An agreement (e.g., contract) for the performance of a...
Hong, Theodore S; Bosch, Walter R; Krishnan, Sunil; Kim, Tae K; Mamon, Harvey J; Shyn, Paul; Ben-Josef, Edgar; Seong, Jinsil; Haddock, Michael G; Cheng, Jason C; Feng, Mary U; Stephans, Kevin L; Roberge, David; Crane, Christopher; Dawson, Laura A
2014-07-15
Defining hepatocellular carcinoma (HCC) gross tumor volume (GTV) requires multimodal imaging, acquired in different perfusion phases. The purposes of this study were to evaluate the variability in contouring and to establish guidelines and educational recommendations for reproducible HCC contouring for treatment planning. Anonymous, multiphasic planning computed tomography scans obtained from 3 patients with HCC were identified and distributed to a panel of 11 gastrointestinal radiation oncologists. Panelists were asked the number of HCC cases they treated in the past year. Case 1 had no vascular involvement, case 2 had extensive portal vein involvement, and case 3 had minor branched portal vein involvement. The agreement between the contoured total GTVs (primary + vascular GTV) was assessed using the generalized kappa statistic. Agreement interpretation was evaluated using Landis and Koch's interpretation of strength of agreement. The S95 contour, defined using the simultaneous truth and performance level estimation (STAPLE) algorithm consensus at the 95% confidence level, was created for each case. Of the 11 panelists, 3 had treated >25 cases in the past year, 2 had treated 10 to 25 cases, 2 had treated 5 to 10 cases, 2 had treated 1 to 5 cases, 1 had treated 0 cases, and 1 did not respond. Near perfect agreement was seen for case 1, and substantial agreement was seen for cases 2 and 3. For case 2, there was significant heterogeneity in the volume identified as tumor thrombus (range 0.58-40.45 cc). For case 3, 2 panelists did not include the branched portal vein thrombus, and 7 panelists contoured thrombus separately from the primary tumor, also showing significant heterogeneity in volume of tumor thrombus (range 4.52-34.27 cc). In a group of experts, excellent agreement was seen in contouring total GTV. Heterogeneity exists in the definition of portal vein thrombus that may impact treatment planning, especially if differential dosing is contemplated. Guidelines for HCC GTV contouring are recommended. Copyright © 2014. Published by Elsevier Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hong, Theodore S., E-mail: tshong1@mgh.harvard.edu; Bosch, Walter R.; Krishnan, Sunil
2014-07-15
Purpose: Defining hepatocellular carcinoma (HCC) gross tumor volume (GTV) requires multimodal imaging, acquired in different perfusion phases. The purposes of this study were to evaluate the variability in contouring and to establish guidelines and educational recommendations for reproducible HCC contouring for treatment planning. Methods and Materials: Anonymous, multiphasic planning computed tomography scans obtained from 3 patients with HCC were identified and distributed to a panel of 11 gastrointestinal radiation oncologists. Panelists were asked the number of HCC cases they treated in the past year. Case 1 had no vascular involvement, case 2 had extensive portal vein involvement, and case 3more » had minor branched portal vein involvement. The agreement between the contoured total GTVs (primary + vascular GTV) was assessed using the generalized kappa statistic. Agreement interpretation was evaluated using Landis and Koch's interpretation of strength of agreement. The S95 contour, defined using the simultaneous truth and performance level estimation (STAPLE) algorithm consensus at the 95% confidence level, was created for each case. Results: Of the 11 panelists, 3 had treated >25 cases in the past year, 2 had treated 10 to 25 cases, 2 had treated 5 to 10 cases, 2 had treated 1 to 5 cases, 1 had treated 0 cases, and 1 did not respond. Near perfect agreement was seen for case 1, and substantial agreement was seen for cases 2 and 3. For case 2, there was significant heterogeneity in the volume identified as tumor thrombus (range 0.58-40.45 cc). For case 3, 2 panelists did not include the branched portal vein thrombus, and 7 panelists contoured thrombus separately from the primary tumor, also showing significant heterogeneity in volume of tumor thrombus (range 4.52-34.27 cc). Conclusions: In a group of experts, excellent agreement was seen in contouring total GTV. Heterogeneity exists in the definition of portal vein thrombus that may impact treatment planning, especially if differential dosing is contemplated. Guidelines for HCC GTV contouring are recommended.« less
2012-01-01
Background We introduce the linguistic annotation of a corpus of 97 full-text biomedical publications, known as the Colorado Richly Annotated Full Text (CRAFT) corpus. We further assess the performance of existing tools for performing sentence splitting, tokenization, syntactic parsing, and named entity recognition on this corpus. Results Many biomedical natural language processing systems demonstrated large differences between their previously published results and their performance on the CRAFT corpus when tested with the publicly available models or rule sets. Trainable systems differed widely with respect to their ability to build high-performing models based on this data. Conclusions The finding that some systems were able to train high-performing models based on this corpus is additional evidence, beyond high inter-annotator agreement, that the quality of the CRAFT corpus is high. The overall poor performance of various systems indicates that considerable work needs to be done to enable natural language processing systems to work well when the input is full-text journal articles. The CRAFT corpus provides a valuable resource to the biomedical natural language processing community for evaluation and training of new models for biomedical full text publications. PMID:22901054
Hong, Sun; Woo, Ok Hee; Shin, Hye Seon; Hwang, Soon-Young; Cho, Kyu Ran; Seo, Bo Kyoung
Shear wave elastography (SWE) was performed independently by two radiologists in 264 solid breast masses. The images were reviewed for color overlay pattern (COP) classification by the two radiologists, double blinded to any information. The interobserver agreement of the COP was almost perfect (κ=0.908) and high in E max (ICC=0.89). The AUC value of the COP (0.954) was significantly higher than that of E max (0.915) (p=0.002) but not significantly different from that of E max combined with COP (0.957) (p=0.098). The SWE color overlay pattern and E max of breast masses were highly reproducible. The COP had better diagnostic ability than E max , suggesting that COP may be a more reliable parameter for solid breast mass evaluation. Copyright © 2017 Elsevier Inc. All rights reserved.
NASA Technical Reports Server (NTRS)
Lawson, John W.; Murray, Daw S.; Bauschlicher, Charles W., Jr.
2011-01-01
Atomistic Green-Kubo simulations are performed to evaluate the lattice thermal conductivity for single crystals of the ultra high temperature ceramics ZrB2 and HfB2 for a range of temperatures. Recently developed interatomic potentials are used for these simulations. Heat current correlation functions show rapid oscillations which can be identified with mixed metal-Boron optical phonon modes. Agreement with available experimental data is good.
Instrumental resolution of the chopper spectrometer 4SEASONS evaluated by Monte Carlo simulation
NASA Astrophysics Data System (ADS)
Kajimoto, Ryoichi; Sato, Kentaro; Inamura, Yasuhiro; Fujita, Masaki
2018-05-01
We performed simulations of the resolution function of the 4SEASONS spectrometer at J-PARC by using the Monte Carlo simulation package McStas. The simulations showed reasonably good agreement with analytical calculations of energy and momentum resolutions by using a simplified description. We implemented new functionalities in Utsusemi, the standard data analysis tool used in 4SEASONS, to enable visualization of the simulated resolution function and predict its shape for specific experimental configurations.
Chen, Yi; Pouillot, Régis; S Burall, Laurel; Strain, Errol A; Van Doren, Jane M; De Jesus, Antonio J; Laasri, Anna; Wang, Hua; Ali, Laila; Tatavarthy, Aparna; Zhang, Guodong; Hu, Lijun; Day, James; Sheth, Ishani; Kang, Jihun; Sahu, Surasri; Srinivasan, Devayani; Brown, Eric W; Parish, Mickey; Zink, Donald L; Datta, Atin R; Hammack, Thomas S; Macarisin, Dumitru
2017-01-16
A precise and accurate method for enumeration of low level of Listeria monocytogenes in foods is critical to a variety of studies. In this study, paired comparison of most probable number (MPN) and direct plating enumeration of L. monocytogenes was conducted on a total of 1730 outbreak-associated ice cream samples that were naturally contaminated with low level of L. monocytogenes. MPN was performed on all 1730 samples. Direct plating was performed on all samples using the RAPID'L.mono (RLM) agar (1600 samples) and agar Listeria Ottaviani and Agosti (ALOA; 130 samples). Probabilistic analysis with Bayesian inference model was used to compare paired direct plating and MPN estimates of L. monocytogenes in ice cream samples because assumptions implicit in ordinary least squares (OLS) linear regression analyses were not met for such a comparison. The probabilistic analysis revealed good agreement between the MPN and direct plating estimates, and this agreement showed that the MPN schemes and direct plating schemes using ALOA or RLM evaluated in the present study were suitable for enumerating low levels of L. monocytogenes in these ice cream samples. The statistical analysis further revealed that OLS linear regression analyses of direct plating and MPN data did introduce bias that incorrectly characterized systematic differences between estimates from the two methods. Published by Elsevier B.V.
Niesta Kayser, Daniela; Graupmann, Verena; Fryer, James W.; Frey, Dieter
2016-01-01
Two experiments examined how individuals respond to a restriction presented within an approach versus an avoidance frame. In Study 1, working on a problem-solving task, participants were initially free to choose their strategy, but for a second task were told to change their strategy. The message to change was embedded in either an approach or avoidance frame. When confronted with an avoidance compared to an approach frame, the participants’ reactance toward the request was greater and, in turn, led to impaired performance. The role of reactance as a response to threat to freedom was explicitly examined in Study 2, in which participants evaluated a potential change in policy affecting their program of study herein explicitly varying whether a restriction was present or absent and whether the message was embedded in an approach versus avoidance frame. When communicated with an avoidance frame and as a restriction, participants showed the highest resistance in terms of reactance, message agreement and evaluation of the communicator. The difference in agreement with the change was mediated by reactance only when a restriction was present. Overall, avoidance goal frames were associated with more resistance to change on different levels of experience (reactance, performance, and person perception). Reactance mediated the effect of goal frame on other outcomes only when a restriction was present. PMID:27242572
Martín-Martín, G P; García-Armengol, J; Roig-Vila, J V; Espí-Macías, A; Martínez-Sanjuán, V; Mínguez-Pérez, M; Lorenzo-Liñán, M Á; Mulas-Fernández, C; González-Argenté, F X
2017-10-01
The aim of the present study was to evaluate the diagnostic accuracy of magnetic resonance (MR) defecography and compare it with videodefecography in the evaluation of obstructed defecation syndrome. This was a prospective cohort test accuracy study conducted at one major tertiary referral center on patients with a diagnosis of obstructed defecation syndrome who were referred to the colorectal surgery clinic in a consecutive series from 2009 to 2012. All patients underwent a clinical examination, videodefecography, and MR defecography in the supine position. We analyzed diagnostic accuracy for MR defecography and performed an agreement analysis using Cohen's kappa index (κ) for each diagnostic imaging examination performed with videodefecography and MR defecography. We included 40 patients with Rome III diagnostic criteria of obstructed defecation syndrome. The degree of agreement between the two tests was as follows: almost perfect for anismus (κ = 0.88) and rectal prolapse (κ = 0.83), substantial for enterocele (κ = 0.80) and rectocele grade III (κ = 0.65), moderate for intussusception (κ = 0.50) and rectocele grade II (κ = 0.49), and slight for rectocele grade I (κ = 0.30) and excessive perineal descent (κ = 0.22). Eighteen cystoceles and 11 colpoceles were diagnosed only by MR defecography. Most patients (54%) stated that videodefecography was the more uncomfortable test. MR defecography could become the imaging test of choice for evaluating obstructed defecation syndrome.