Evaluating Comparability in the Scoring of Performance Assessments for Accountability Purposes
ERIC Educational Resources Information Center
Lyons, Susan; Evans, Carla
2017-01-01
This brief summarizes "Comparability in Balanced Assessment Systems for State Accountability," published in "Educational Measurement: Issues and Practice" (Evans & Lyons 2017). The study evaluated comparability claims in local scoring of performance assessments across districts participating in New Hampshire's Performance…
Proposed evaluation framework for assessing operator performance with multisensor displays
NASA Technical Reports Server (NTRS)
Foyle, David C.
1992-01-01
Despite aggressive work on the development of sensor fusion algorithms and techniques, no formal evaluation procedures have been proposed. Based on existing integration models in the literature, an evaluation framework is developed to assess an operator's ability to use multisensor, or sensor fusion, displays. The proposed evaluation framework for evaluating the operator's ability to use such systems is a normative approach: The operator's performance with the sensor fusion display can be compared to the models' predictions based on the operator's performance when viewing the original sensor displays prior to fusion. This allows for the determination as to when a sensor fusion system leads to: 1) poorer performance than one of the original sensor displays (clearly an undesirable system in which the fused sensor system causes some distortion or interference); 2) better performance than with either single sensor system alone, but at a sub-optimal (compared to the model predictions) level; 3) optimal performance (compared to model predictions); or, 4) super-optimal performance, which may occur if the operator were able to use some highly diagnostic 'emergent features' in the sensor fusion display, which were unavailable in the original sensor displays. An experiment demonstrating the usefulness of the proposed evaluation framework is discussed.
Boysen-Osborn, Megan; Yanuck, Justin; Mattson, James; Toohey, Shannon; Wray, Alisa; Wiechmann, Warren; Lahham, Shadi; Langdorf, Mark I
2017-01-01
The Medical Student Performance Evaluation (MSPE) appendices provide a program director with comparative performance for a student's academic and professional attributes, but they are frequently absent or incomplete. We reviewed MSPEs from applicants to our emergency medicine residency program from 134 of 136 (99%) U.S. allopathic medical schools, over two application cycles (2012-13, 2014-15). We determined the degree of compliance with each of the five recommended MSPE appendices. Only three (2%) medical schools were compliant with all five appendices. The medical school information page (MSIP, appendix E) was present most commonly (85%), followed by comparative clerkship performance (appendix B, 82%), overall performance (appendix D, 59%), preclinical performance (appendix A, 57%), and professional attributes (appendix C, 18%). Few schools (7%) provided student-specific, comparative professionalism assessments. Medical schools inconsistently provide graphic, comparative data for their students in the MSPE. Although program directors (PD) value evidence of an applicant's professionalism when selecting residents, medical schools rarely provide such useful, comparative professionalism data in their MSPEs. As PDs seek to evaluate applicants based on academic performance and professionalism, rather than standardized testing alone, medical schools must make MSPEs more consistent, objective, and comparative.
Results of the 1980 NACUBO Comparative Performance Study and Investment Questionnaire.
ERIC Educational Resources Information Center
Dresner, Bruce M.
The purpose of the annual National Association of College and University Business Officers' (NACUBO) Comparative Performance Study is to aid administrators in evaluating the performance of their investment pools. The 1980 study contains two parts: (1) comparative performance information and related investment performance statistics; and (2) other…
Results of the 1979 NACUBO Comparative Performance Study and Investment Questionnaire.
ERIC Educational Resources Information Center
Dresner, Bruce M.
Results of the 1979 Comparative Performance Study of the National Association of College and Business Officers are presented. The study is designed to aid administrators in evaluating the performance of their investment pools. The report covers comparative performance information and related investment performance statistics and other endowment…
NASA Technical Reports Server (NTRS)
Foyle, David C.
1993-01-01
Based on existing integration models in the psychological literature, an evaluation framework is developed to assess sensor fusion displays as might be implemented in an enhanced/synthetic vision system. The proposed evaluation framework for evaluating the operator's ability to use such systems is a normative approach: The pilot's performance with the sensor fusion image is compared to models' predictions based on the pilot's performance when viewing the original component sensor images prior to fusion. This allows for the determination as to when a sensor fusion system leads to: poorer performance than one of the original sensor displays, clearly an undesirable system in which the fused sensor system causes some distortion or interference; better performance than with either single sensor system alone, but at a sub-optimal level compared to model predictions; optimal performance compared to model predictions; or, super-optimal performance, which may occur if the operator were able to use some highly diagnostic 'emergent features' in the sensor fusion display, which were unavailable in the original sensor displays.
Yang, Chenxi; Tavassolian, Negar
2018-05-01
This work proposes a novel method of pulse transit time (PTT) measurement. The proximal arterial location data are collected from seismocardiogram (SCG) recordings by placing a micro-electromechanical accelerometer on the chest wall. The distal arterial location data are recorded using an acoustic sensor placed inside the ear. The performance of distal location recordings is evaluated by comparing SCG-acoustic and SCG-photoplethysmogram (PPG) measurements. PPG and acoustic performances under motion noise are also compared. Experimental results suggest comparable performances for the acoustic-based and PPG-based devices. The feasibility of each PTT measurement method is validated for blood pressure evaluations and its limitations are analyzed.
Performance evaluation of TDT soil water content and watermark soil water potential sensors
USDA-ARS?s Scientific Manuscript database
This study evaluated the performance of digitized Time Domain Transmissometry (TDT) soil water content sensors (Acclima, Inc., Meridian, ID) and resistance-based soil water potential sensors (Watermark 200, Irrometer Company, Inc., Riverside, CA) in two soils. The evaluation was performed by compar...
Holistic rubric vs. analytic rubric for measuring clinical performance levels in medical students.
Yune, So Jung; Lee, Sang Yeoup; Im, Sun Ju; Kam, Bee Sung; Baek, Sun Yong
2018-06-05
Task-specific checklists, holistic rubrics, and analytic rubrics are often used for performance assessments. We examined what factors evaluators consider important in holistic scoring of clinical performance assessment, and compared the usefulness of applying holistic and analytic rubrics respectively, and analytic rubrics in addition to task-specific checklists based on traditional standards. We compared the usefulness of a holistic rubric versus an analytic rubric in effectively measuring the clinical skill performances of 126 third-year medical students who participated in a clinical performance assessment conducted by Pusan National University School of Medicine. We conducted a questionnaire survey of 37 evaluators who used all three evaluation methods-holistic rubric, analytic rubric, and task-specific checklist-for each student. The relationship between the scores on the three evaluation methods was analyzed using Pearson's correlation. Inter-rater agreement was analyzed by Kappa index. The effect of holistic and analytic rubric scores on the task-specific checklist score was analyzed using multiple regression analysis. Evaluators perceived accuracy and proficiency to be major factors in objective structured clinical examinations evaluation, and history taking and physical examination to be major factors in clinical performance examinations evaluation. Holistic rubric scores were highly related to the scores of the task-specific checklist and analytic rubric. Relatively low agreement was found in clinical performance examinations compared to objective structured clinical examinations. Meanwhile, the holistic and analytic rubric scores explained 59.1% of the task-specific checklist score in objective structured clinical examinations and 51.6% in clinical performance examinations. The results show the usefulness of holistic and analytic rubrics in clinical performance assessment, which can be used in conjunction with task-specific checklists for more efficient evaluation.
Vadapalli, Sriharsha Babu; Atluri, Kaleswararao; Putcha, Madhu Sudhan; Kondreddi, Sirisha; Kumar, N. Suman; Tadi, Durga Prasad
2016-01-01
Objectives: This in vitro study was designed to compare polyvinyl-siloxane (PVS) monophase and polyether (PE) monophase materials under dry and moist conditions for properties such as surface detail reproduction, dimensional stability, and gypsum compatibility. Materials and Methods: Surface detail reproduction was evaluated using two criteria. Dimensional stability was evaluated according to American Dental Association (ADA) specification no. 19. Gypsum compatibility was assessed by two criteria. All the samples were evaluated, and the data obtained were analyzed by a two-way analysis of variance (ANOVA) and Pearson's Chi-square tests. Results: When surface detail reproduction was evaluated with modification of ADA specification no. 19, both the groups under the two conditions showed no significant difference statistically. When evaluated macroscopically both the groups showed statistically significant difference. Results for dimensional stability showed that the deviation from standard was significant among the two groups, where Aquasil group showed significantly more deviation compared to Impregum group (P < 0.001). Two conditions also showed significant difference, with moist conditions showing significantly more deviation compared to dry condition (P < 0.001). The results of gypsum compatibility when evaluated with modification of ADA specification no. 19 and by giving grades to the casts for both the groups and under two conditions showed no significant difference statistically. Conclusion: Regarding dimensional stability, both impregum and aquasil performed better in dry condition than in moist; impregum performed better than aquasil in both the conditions. When tested for surface detail reproduction according to ADA specification, under dry and moist conditions both of them performed almost equally. When tested according to macroscopic evaluation, impregum and aquasil performed significantly better in dry condition compared to moist condition. In dry condition, both the materials performed almost equally. In moist condition, aquasil performed significantly better than impregum. Regarding gypsum compatibility according to ADA specification, in dry condition both the materials performed almost equally, and in moist condition aquasil performed better than impregum. When tested by macroscopic evaluation, impregum performed better than aquasil in both the conditions. PMID:27583217
Vadapalli, Sriharsha Babu; Atluri, Kaleswararao; Putcha, Madhu Sudhan; Kondreddi, Sirisha; Kumar, N Suman; Tadi, Durga Prasad
2016-01-01
This in vitro study was designed to compare polyvinyl-siloxane (PVS) monophase and polyether (PE) monophase materials under dry and moist conditions for properties such as surface detail reproduction, dimensional stability, and gypsum compatibility. Surface detail reproduction was evaluated using two criteria. Dimensional stability was evaluated according to American Dental Association (ADA) specification no. 19. Gypsum compatibility was assessed by two criteria. All the samples were evaluated, and the data obtained were analyzed by a two-way analysis of variance (ANOVA) and Pearson's Chi-square tests. When surface detail reproduction was evaluated with modification of ADA specification no. 19, both the groups under the two conditions showed no significant difference statistically. When evaluated macroscopically both the groups showed statistically significant difference. Results for dimensional stability showed that the deviation from standard was significant among the two groups, where Aquasil group showed significantly more deviation compared to Impregum group (P < 0.001). Two conditions also showed significant difference, with moist conditions showing significantly more deviation compared to dry condition (P < 0.001). The results of gypsum compatibility when evaluated with modification of ADA specification no. 19 and by giving grades to the casts for both the groups and under two conditions showed no significant difference statistically. Regarding dimensional stability, both impregum and aquasil performed better in dry condition than in moist; impregum performed better than aquasil in both the conditions. When tested for surface detail reproduction according to ADA specification, under dry and moist conditions both of them performed almost equally. When tested according to macroscopic evaluation, impregum and aquasil performed significantly better in dry condition compared to moist condition. In dry condition, both the materials performed almost equally. In moist condition, aquasil performed significantly better than impregum. Regarding gypsum compatibility according to ADA specification, in dry condition both the materials performed almost equally, and in moist condition aquasil performed better than impregum. When tested by macroscopic evaluation, impregum performed better than aquasil in both the conditions.
Li, Chunhui; Yu, Chuanhua
2013-01-01
To provide a reference for evaluating public non-profit hospitals in the new environment of medical reform, we established a performance evaluation system for public non-profit hospitals. The new “input-output” performance model for public non-profit hospitals is based on four primary indexes (input, process, output and effect) that include 11 sub-indexes and 41 items. The indicator weights were determined using the analytic hierarchy process (AHP) and entropy weight method. The BP neural network was applied to evaluate the performance of 14 level-3 public non-profit hospitals located in Hubei Province. The most stable BP neural network was produced by comparing different numbers of neurons in the hidden layer and using the “Leave-one-out” Cross Validation method. The performance evaluation system we established for public non-profit hospitals could reflect the basic goal of the new medical health system reform in China. Compared with PLSR, the result indicated that the BP neural network could be used effectively for evaluating the performance public non-profit hospitals. PMID:23955238
Spreco, A; Eriksson, O; Dahlström, Ö; Timpka, T
2017-07-01
Methods for the detection of influenza epidemics and prediction of their progress have seldom been comparatively evaluated using prospective designs. This study aimed to perform a prospective comparative trial of algorithms for the detection and prediction of increased local influenza activity. Data on clinical influenza diagnoses recorded by physicians and syndromic data from a telenursing service were used. Five detection and three prediction algorithms previously evaluated in public health settings were calibrated and then evaluated over 3 years. When applied on diagnostic data, only detection using the Serfling regression method and prediction using the non-adaptive log-linear regression method showed acceptable performances during winter influenza seasons. For the syndromic data, none of the detection algorithms displayed a satisfactory performance, while non-adaptive log-linear regression was the best performing prediction method. We conclude that evidence was found for that available algorithms for influenza detection and prediction display satisfactory performance when applied on local diagnostic data during winter influenza seasons. When applied on local syndromic data, the evaluated algorithms did not display consistent performance. Further evaluations and research on combination of methods of these types in public health information infrastructures for 'nowcasting' (integrated detection and prediction) of influenza activity are warranted.
Methods for comparative evaluation of propulsion system designs for supersonic aircraft
NASA Technical Reports Server (NTRS)
Tyson, R. M.; Mairs, R. Y.; Halferty, F. D., Jr.; Moore, B. E.; Chaloff, D.; Knudsen, A. W.
1976-01-01
The propulsion system comparative evaluation study was conducted to define a rapid, approximate method for evaluating the effects of propulsion system changes for an advanced supersonic cruise airplane, and to verify the approximate method by comparing its mission performance results with those from a more detailed analysis. A table look up computer program was developed to determine nacelle drag increments for a range of parametric nacelle shapes and sizes. Aircraft sensitivities to propulsion parameters were defined. Nacelle shapes, installed weights, and installed performance was determined for four study engines selected from the NASA supersonic cruise aircraft research (SCAR) engine studies program. Both rapid evaluation method (using sensitivities) and traditional preliminary design methods were then used to assess the four engines. The method was found to compare well with the more detailed analyses.
Hecht, Silke; Adams, W H; Cunningham, M A; Lane, I F; Howell, N E
2013-01-01
Effective teaching of veterinary radiology can be challenging in a traditional classroom environment. Audience response systems, colloquially known as "clickers," provide a means of encouraging student interaction. The purpose of this study was to compare student performance and course evaluations before and after using the Classroom Performance System™ in the third-year (fifth semester) didactic radiology course at the University of Tennessee College of Veterinary Medicine. Overall student performance was assessed by comparing median numeric final course grades (%) between years without and with use of the Classroom Performance System™. Grades of students were determined for individual instructors' sections. Student evaluations of the radiology course were compared for the years available (2007-2010). Student interactions were also evaluated subjectively by instructors who used the Classroom Performance System™. There was a significant difference (p = 0.009) between the median student grade before (2005 - 2008, median 82.2%; interquartile range 77.6-85.7%; range 61.9-95.5%) and after use of the classroom performance system (2009-2010, median 83.6%; interquartile range 79.9-87.9%; range 68.2-93.2%). There was no statistically significant difference in median student grades for individual instructors over the study period. The radiology course student evaluation scores were significantly higher in years where the Classroom Performance System™ was used in comparison to previous years (P = 0.019). Subjectively, students appeared more involved when using clickers. Findings indicated that the Classroom Performance System™ may be a useful tool for enhancing veterinary radiology education. © 2012 Veterinary Radiology & Ultrasound.
A PERFORMANCE EVALUATION OF THE 2004 RELEASE OF MODELS-3 CMAQ
This performance evaluation compares a full annual simulation (2001) of CMAQ (Version4.4) covering the contiguous United States against monitoring data from four nationwide networks. This effort, which represents one of the most spatially and temporally comprehensive performance...
Evaluation of EIT system performance.
Yasin, Mamatjan; Böhm, Stephan; Gaggero, Pascal O; Adler, Andy
2011-07-01
An electrical impedance tomography (EIT) system images internal conductivity from surface electrical stimulation and measurement. Such systems necessarily comprise multiple design choices from cables and hardware design to calibration and image reconstruction. In order to compare EIT systems and study the consequences of changes in system performance, this paper describes a systematic approach to evaluate the performance of the EIT systems. The system to be tested is connected to a saline phantom in which calibrated contrasting test objects are systematically positioned using a position controller. A set of evaluation parameters are proposed which characterize (i) data and image noise, (ii) data accuracy, (iii) detectability of single contrasts and distinguishability of multiple contrasts, and (iv) accuracy of reconstructed image (amplitude, resolution, position and ringing). Using this approach, we evaluate three different EIT systems and illustrate the use of these tools to evaluate and compare performance. In order to facilitate the use of this approach, all details of the phantom, test objects and position controller design are made publicly available including the source code of the evaluation and reporting software.
Greene, Krista L; Tonjes, David J
2014-04-01
The primary objective of waste management technologies and policies in the United States is to reduce the harmful environmental impacts of waste, particularly those relating to energy consumption and climate change. Performance indicators are frequently used to evaluate the environmental quality of municipal waste systems, as well as to compare and rank programs relative to each other in terms of environmental performance. However, there currently is no consensus on the best indicator for performing these environmental evaluations. The purpose of this study is to examine the common performance indicators used to assess the environmental benefits of municipal waste systems to determine if there is agreement between them regarding which system performs best environmentally. Focus is placed on how indicator selection influences comparisons between municipal waste management programs and subsequent system rankings. The waste systems of ten municipalities in the state of New York, USA, were evaluated using each common performance indicator and Spearman correlations were calculated to see if there was a significant association between system rank orderings. Analyses showed that rank orders of waste systems differ substantially when different indicators are used. Therefore, comparative system assessments based on indicators should be considered carefully, especially those intended to gauge environmental quality. Insight was also gained into specific factors which may lead to one system achieving higher rankings than another. However, despite the insufficiencies of indicators for comparative quality assessments, they do provide important information for waste managers and they can assist in evaluating internal programmatic performance and progress. To enhance these types of assessments, a framework for scoring indicators based on criteria that evaluate their utility and value for system evaluations was developed. This framework was used to construct an improved model for waste system performance assessments. Copyright © 2014 Elsevier Ltd. All rights reserved.
Evaluation Of The Diagnostic Performance Of A Multimedia Medical Communications System.
NASA Astrophysics Data System (ADS)
Robertson, John G.; Coristine, Marjorie; Goldberg, Morris; Beeton, Carolyn; Belanger, Garry; Tombaugh, Jo W.; Hickey, Nancy M.; Millward, Steven F.; Davis, Michael; Whittingham, David
1989-05-01
The central concern of radiologists when evaluating Picture Archiving Communication System (PACS) is the diagnostic performance of digital images compared to the original analog versions of the same images. Considerable work has been done comparing the ROC curves of various types of digital systems to the corresponding analog systems for the detection of specific phantoms or diseases. Although the studies may notify the radiologists that for a specific lesion a digital system may perform as well as the analog system, it tells the radiologists very little about the impact on diagnostic performance of a digital system in the general practice of radiology. We describe in this paper an alternative method for evaluating the diagnostic performance of a digital system and a preliminary experiment we conducted to test the methodology.
Tian, Jing; Varga, Boglarka; Tatrai, Erika; Fanni, Palya; Somfai, Gabor Mark; Smiddy, William E.
2016-01-01
Over the past two decades a significant number of OCT segmentation approaches have been proposed in the literature. Each methodology has been conceived for and/or evaluated using specific datasets that do not reflect the complexities of the majority of widely available retinal features observed in clinical settings. In addition, there does not exist an appropriate OCT dataset with ground truth that reflects the realities of everyday retinal features observed in clinical settings. While the need for unbiased performance evaluation of automated segmentation algorithms is obvious, the validation process of segmentation algorithms have been usually performed by comparing with manual labelings from each study and there has been a lack of common ground truth. Therefore, a performance comparison of different algorithms using the same ground truth has never been performed. This paper reviews research-oriented tools for automated segmentation of the retinal tissue on OCT images. It also evaluates and compares the performance of these software tools with a common ground truth. PMID:27159849
Azizi, Kourosh; Aghamolaei, Teamur; Parsa, Nader; Dabbaghmanesh, Tahereh
2014-07-01
The present study aimed to compare self-assessment forms of coursework taught in the school of public health at undergraduate, graduate, and postgraduate levels and students' evaluation of the performance of the faculty members at these levels. The subjects in this cross-sectional study were the faculty members and students of the School of Public Health and Nutrition, Shiraz University of Medical Sciences, Shiraz, Iran. The data were collected using a socio-demographic information form and evaluation forms of professors prepared by the Educational Development Center (EDC). The faculty members were assessed by the students in undergraduate and graduate classes. Among the study subjects, 23 faculty members filled out the self-assessment forms which were then evaluated by 23 students. Then, the data were analyzed using the SPSS statistical 14. Paired t-test was used to compare the students' evaluation of the faculty members' performance and the professors' self-assessment. The mean score of self-assessment of the faculty members who taught undergraduate courses was 289.7±8.3, while that of the students' evaluation was 281.3±16.1; the difference was statistically significant (t=3.56, p=0.001). Besides, the mean score of the self-assessment of the faculty members who taught graduate courses was 269.0±9.7, while that of the students' evaluation was 265.7±14.6 but the difference was not statistically significant (t=1.09, p=0.28). Teaching performance perceptions of the faculty were similar to those of the graduate students as compared to the undergraduate ones. This may reflect better understanding of coursework at this level compared to the undergraduate students. Faculty members may need to adjust teaching methods to improve students' performance and understanding especially in the undergraduate level.
Statistically Comparing the Performance of Multiple Automated Raters across Multiple Items
ERIC Educational Resources Information Center
Kieftenbeld, Vincent; Boyer, Michelle
2017-01-01
Automated scoring systems are typically evaluated by comparing the performance of a single automated rater item-by-item to human raters. This presents a challenge when the performance of multiple raters needs to be compared across multiple items. Rankings could depend on specifics of the ranking procedure; observed differences could be due to…
ERIC Educational Resources Information Center
Moore, Don A.; Klein, William M. P.
2008-01-01
Which matters more--beliefs about absolute ability or ability relative to others? This study set out to compare the effects of such beliefs on satisfaction with performance, self-evaluations, and bets on future performance. In Experiment 1, undergraduate participants were told they had answered 20% correct, 80% correct, or were not given their…
Wu, G; Li, J
1999-01-01
Identifying and accessing reliable, relevant consumer health information rapidly on the Internet may challenge the health sciences librarian and layperson alike. In this study, seven search engines are compared using representative consumer health topics for their content relevancy, system features, and attributes. The paper discusses evaluation criteria; systematically compares relevant results; analyzes performance in terms of the strengths and weaknesses of the search engines; and illustrates effective search engine selection, search formulation, and strategies. PMID:10550031
NASA Astrophysics Data System (ADS)
Mao, Chao; Chen, Shou
2017-01-01
According to the traditional entropy value method still have low evaluation accuracy when evaluating the performance of mining projects, a performance evaluation model of mineral project founded on improved entropy is proposed. First establish a new weight assignment model founded on compatible matrix analysis of analytic hierarchy process (AHP) and entropy value method, when the compatibility matrix analysis to achieve consistency requirements, if it has differences between subjective weights and objective weights, moderately adjust both proportions, then on this basis, the fuzzy evaluation matrix for performance evaluation. The simulation experiments show that, compared with traditional entropy and compatible matrix analysis method, the proposed performance evaluation model of mining project based on improved entropy value method has higher accuracy assessment.
Fatone, Stefania; Caldwell, Ryan
2017-06-01
Current transfemoral prosthetic sockets are problematic as they restrict function, lack comfort, and cause residual limb problems. Development of a subischial socket with lower proximal trim lines is an appealing way to address this problem and may contribute to improving quality of life of persons with transfemoral amputation. The purpose of this study was to illustrate the use of a new subischial socket in two subjects. Case series. Two unilateral transfemoral prosthesis users participated in preliminary socket evaluations comparing functional performance of the new subischial socket to ischial containment sockets. Testing included gait analysis, socket comfort score, and performance-based clinical outcome measures (Rapid-Sit-To-Stand, Four-Square-Step-Test, and Agility T-Test). For both subjects, comfort was better in the subischial socket, while gait and clinical outcomes were generally comparable between sockets. While these evaluations are promising regarding the ability to function in this new socket design, more definitive evaluation is needed. Clinical relevance Using gait analysis, socket comfort score and performance-based outcome measures, use of the Northwestern University Flexible Subischial Vaccum Socket was evaluated in two transfemoral prosthesis users. Socket comfort improved for both subjects with comparable function compared to ischial containment sockets.
We present an application of the online coupled WRF-CMAQ modeling system to two annual simulations over North America performed under Phase 2 of the Air Quality Model Evaluation International Initiative (AQMEII). Operational evaluation shows that model performance is comparable t...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hart, W.E.
1999-02-10
Evolutionary programs (EPs) and evolutionary pattern search algorithms (EPSAS) are two general classes of evolutionary methods for optimizing on continuous domains. The relative performance of these methods has been evaluated on standard global optimization test functions, and these results suggest that EPSAs more robustly converge to near-optimal solutions than EPs. In this paper we evaluate the relative performance of EPSAs and EPs on a real-world application: flexible ligand binding in the Autodock docking software. We compare the performance of these methods on a suite of docking test problems. Our results confirm that EPSAs and EPs have comparable performance, and theymore » suggest that EPSAs may be more robust on larger, more complex problems.« less
Performance of the SEAPROG prognosis variant of the forest vegetation simulator.
Michael H. McClellan; Frances E. Biles
2003-01-01
This paper reports the first phase of a recent effort to evaluate the performance and use of the FVS-SEAPROG vegetation growth model. In this paper, we present our evaluation of SEAPROGâs performance in modeling the growth of even-aged stands regenerated by clearcutting, windthrow, or fire. We evaluated the model by comparing model predictions to observed values from...
ERIC Educational Resources Information Center
Van Matre, Nicholas H.; Harrigan, Robert J.
A followup performance evaluation was conducted on a sample of Group 4 (low ability) personnel who had served 14 months aboard the mine contermeasures support ship U.S.S. Catskill (MCS-1). Shipboard assessments were made of the Group 4 sample and the non-Group 4 comparison sample in terms of performance test proficiency, supervisors' ratings, and…
Comparative Evaluation of Quantitative Test Methods for Gases on a Hard Surface
2017-02-01
COMPARATIVE EVALUATION OF QUANTITATIVE TEST METHODS FOR GASES ON A HARD SURFACE ECBC-TR-1426 Vipin Rastogi...1 COMPARATIVE EVALUATION OF QUANTITATIVE TEST METHODS FOR GASES ON A HARD SURFACE 1. INTRODUCTION Members of the U.S. Environmental...Generator 4 2.4 Experimental Design Each quantitative method was performed three times on three consecutive days. For the CD runs, three
Promotion Factors For Enlisted Infantry Marines
2017-06-01
description , billet accomplishments, mission accomplishment, individual character, leadership, intellect and wisdom, fulfillment of evaluation , RS...staff sergeant. To assess which ranks proportionally promote more high-quality Marines, we compare two performance evaluation methods: proficiency and...adverse fitness reports. From the two performance evaluation methods we find that the Marine Corps promotes proportionally more high-quality Marines
Comparing IndexedHBase and Riak for Serving Truthy: Performance of Data Loading and Query Evaluation
2013-08-01
Research Triangle Park, NC 27709-2211 15. SUBJECT TERMS performance evaluation, distributed database, noSQL , HBase, indexing Xiaoming Gao, Judy Qiu...common hashtags created during a given time window. With the purpose of finding a solution for these challenges, we evaluate NoSQL databases such as
Building China's municipal healthcare performance evaluation system: a Tuscan perspective.
Li, Hao; Barsanti, Sara; Bonini, Anna
2012-08-01
Regional healthcare performance evaluation systems can help optimize healthcare resources on regional basis and improve the performance of healthcare services provided. The Tuscany region in Italy is a good example of an institution which meets these requirements. China has yet to build such a system based on international experience. In this paper, based on comparative studies between Tuscany and China, we propose that the managing institutions in China's experimental cities can select and commission a third-party agency to, respectively, evaluate the performance of their affiliated hospitals and community health service centers. Following some features of the Tuscan experience, the Chinese municipal healthcare performance evaluation system can be built by focusing on the selection of an appropriate performance evaluation agency, the design of an adequate performance evaluation mechanism and the formulation of a complete set of laws, rules and regulations. When a performance evaluation system at city level is formed, the provincial government can extend the successful experience to other cities.
Fatone, Stefania; Caldwell, Ryan
2017-01-01
Background: Current transfemoral prosthetic sockets are problematic as they restrict function, lack comfort, and cause residual limb problems. Development of a subischial socket with lower proximal trim lines is an appealing way to address this problem and may contribute to improving quality of life of persons with transfemoral amputation. Objectives: The purpose of this study was to illustrate the use of a new subischial socket in two subjects. Study design: Case series. Methods: Two unilateral transfemoral prosthesis users participated in preliminary socket evaluations comparing functional performance of the new subischial socket to ischial containment sockets. Testing included gait analysis, socket comfort score, and performance-based clinical outcome measures (Rapid-Sit-To-Stand, Four-Square-Step-Test, and Agility T-Test). Results: For both subjects, comfort was better in the subischial socket, while gait and clinical outcomes were generally comparable between sockets. Conclusion: While these evaluations are promising regarding the ability to function in this new socket design, more definitive evaluation is needed. Clinical relevance Using gait analysis, socket comfort score and performance-based outcome measures, use of the Northwestern University Flexible Subischial Vaccum Socket was evaluated in two transfemoral prosthesis users. Socket comfort improved for both subjects with comparable function compared to ischial containment sockets. PMID:28132589
Evaluating the Effect of Learning Style and Student Background on Self-Assessment Accuracy
ERIC Educational Resources Information Center
Alaoutinen, Satu
2012-01-01
This study evaluates a new taxonomy-based self-assessment scale and examines factors that affect assessment accuracy and course performance. The scale is based on Bloom's Revised Taxonomy and is evaluated by comparing students' self-assessment results with course performance in a programming course. Correlation has been used to reveal possible…
Grading Standards and Student Performance in Community College and University Courses
ERIC Educational Resources Information Center
Friedl, John; Pittenger, David J.; Sherman, Michael
2012-01-01
Research was undertaken to determine whether comparable grading standards are used in evaluating student performance at two-year community colleges and four-year universities. Examination of academic records of 417 students who took college level math at the University of Tennessee at Chattanooga in fall 2009 compared the performance of those who…
Evaluation of urethane and carbide-tipped blades on wheel-supported snow plows.
DOT National Transportation Integrated Search
1997-01-01
The objective of this study was to evaluate the performance of urethane and carbide-tipped snow plow blades on wheel supported plows. Their performance was compared to that of VDOT's standard blade arrangement: carbide-tipped blades on plows without ...
Multiple performance measures are needed to evaluate triage systems in the emergency department.
Zachariasse, Joany M; Nieboer, Daan; Oostenbrink, Rianne; Moll, Henriëtte A; Steyerberg, Ewout W
2018-02-01
Emergency department triage systems can be considered prediction rules with an ordinal outcome, where different directions of misclassification have different clinical consequences. We evaluated strategies to compare the performance of triage systems and aimed to propose a set of performance measures that should be used in future studies. We identified performance measures based on literature review and expert knowledge. Their properties are illustrated in a case study evaluating two triage modifications in a cohort of 14,485 pediatric emergency department visits. Strengths and weaknesses of the performance measures were systematically appraised. Commonly reported performance measures are measures of statistical association (34/60 studies) and diagnostic accuracy (17/60 studies). The case study illustrates that none of the performance measures fulfills all criteria for triage evaluation. Decision curves are the performance measures with the most attractive features but require dichotomization. In addition, paired diagnostic accuracy measures can be recommended for dichotomized analysis, and the triage-weighted kappa and Nagelkerke's R 2 for ordinal analyses. Other performance measures provide limited additional information. When comparing modifications of triage systems, decision curves and diagnostic accuracy measures should be used in a dichotomized analysis, and the triage-weighted kappa and Nagelkerke's R 2 in an ordinal approach. Copyright © 2017 Elsevier Inc. All rights reserved.
Liu, L; Kan, A; Leckie, C; Hodgkin, P D
2017-04-01
Time-lapse fluorescence microscopy is a valuable technology in cell biology, but it suffers from the inherent problem of intensity inhomogeneity due to uneven illumination or camera nonlinearity, known as shading artefacts. This will lead to inaccurate estimates of single-cell features such as average and total intensity. Numerous shading correction methods have been proposed to remove this effect. In order to compare the performance of different methods, many quantitative performance measures have been developed. However, there is little discussion about which performance measure should be generally applied for evaluation on real data, where the ground truth is absent. In this paper, the state-of-the-art shading correction methods and performance evaluation methods are reviewed. We implement 10 popular shading correction methods on two artificial datasets and four real ones. In order to make an objective comparison between those methods, we employ a number of quantitative performance measures. Extensive validation demonstrates that the coefficient of joint variation (CJV) is the most applicable measure in time-lapse fluorescence images. Based on this measure, we have proposed a novel shading correction method that performs better compared to well-established methods for a range of real data tested. © 2016 The Authors Journal of Microscopy © 2016 Royal Microscopical Society.
Comparative analysis of techniques for evaluating the effectiveness of aircraft computing systems
NASA Technical Reports Server (NTRS)
Hitt, E. F.; Bridgman, M. S.; Robinson, A. C.
1981-01-01
Performability analysis is a technique developed for evaluating the effectiveness of fault-tolerant computing systems in multiphase missions. Performability was evaluated for its accuracy, practical usefulness, and relative cost. The evaluation was performed by applying performability and the fault tree method to a set of sample problems ranging from simple to moderately complex. The problems involved as many as five outcomes, two to five mission phases, permanent faults, and some functional dependencies. Transient faults and software errors were not considered. A different analyst was responsible for each technique. Significantly more time and effort were required to learn performability analysis than the fault tree method. Performability is inherently as accurate as fault tree analysis. For the sample problems, fault trees were more practical and less time consuming to apply, while performability required less ingenuity and was more checkable. Performability offers some advantages for evaluating very complex problems.
Economic evaluation of DSS 13 unattended operations demonstration
NASA Technical Reports Server (NTRS)
Remer, D. S.; Eisenberger, I.; Lorden, G.
1978-01-01
The goals and data collection requirements to be used for the economic and performance evaluation indexes and life cycle cost parameters for the upcoming operations demonstration of an automated Deep Space Station (DSS) run unattended and controlled remotely from JPL are presented. These evaluation indexes compare the remote operation of telemetry at DSS 13 with the cost and performance of a comparable manned operation at DSS 11. A description is presented of the data that needs to be collected, how the data will be analyzed, and what can and cannot be learned from this operations demonstration.
COMPARING THE SOLID PHASE AND SALINE EXTRACT MICROTOX(R) ASSAYS FOR TWO PAH CONTAMINATED SOILS
The performance of remedial treatments is typically evaluated by measuring the concentration of specific chemicals. By adding toxicity bioassays to treatment evaluations, a fuller understanding of treatment performance is obtained. The solid phase Microtox assay is one potenti...
Long, H. Keith; Daddow, Richard L.; Farrar, Jerry W.
1998-01-01
Since 1962, the U.S. Geological Survey (USGS) has operated the Standard Reference Sample Project to evaluate the performance of USGS, cooperator, and contractor analytical laboratories that analyze chemical constituents of environmental samples. The laboratories are evaluated by using performance evaluation samples, called Standard Reference Samples (SRSs). SRSs are submitted to laboratories semi-annually for round-robin laboratory performance comparison purposes. Currently, approximately 100 laboratories are evaluated for their analytical performance on six SRSs for inorganic and nutrient constituents. As part of the SRS Project, a surplus of homogeneous, stable SRSs is maintained for purchase by USGS offices and participating laboratories for use in continuing quality-assurance and quality-control activities. Statistical evaluation of the laboratories results provides information to compare the analytical performance of the laboratories and to determine possible analytical deficiences and problems. SRS results also provide information on the bias and variability of different analytical methods used in the SRS analyses.
Use of the Marshall Space Flight Center solar simulator in collector performance evaluation
NASA Technical Reports Server (NTRS)
Humphries, W. R.
1978-01-01
Actual measured values from simulator checkout tests are detailed. Problems encountered during initial startup are discussed and solutions described. Techniques utilized to evaluate collector performance from simulator test data are given. Performance data generated in the simulator are compared to equivalent data generated during natural outdoor testing. Finally, a summary of collector performance parameters generated to date as a result of simulator testing are given.
NASA Technical Reports Server (NTRS)
Moog, R. D.; Bacchus, D. L.; Utreja, L. R.
1979-01-01
The aerodynamic performance characteristics have been determined for the Space Shuttle Solid Rocket Booster drogue, main, and pilot parachutes. The performance evaluation on the 20-degree conical ribbon parachutes is based primarily on air drop tests of full scale prototype parachutes. In addition, parametric wind tunnel tests were performed and used in parachute configuration development and preliminary performance assessments. The wind tunnel test data are compared to the drop test results and both sets of data are used to determine the predicted performance of the Solid Rocket Booster flight parachutes. Data from other drop tests of large ribbon parachutes are also compared with the Solid Rocket Booster parachute performance characteristics. Parameters assessed include full open terminal drag coefficients, reefed drag area, opening characteristics, clustering effects, and forebody interference.
Glodt, Stephen R.; Pirkey, Kimberly D.
1998-01-01
Performance-evaluation studies provide customers of the U.S. Geological Survey National Water Quality Laboratory (NWQL) with data needed to evaluate performance and to compare of select laboratories for analytical work. The NWQL participates in national and international performance-evaluation (PE) studies that consist of samples of water, sediment, and aquatic biological materials for the analysis of inorganic constituents, organic compounds, and radionuclides. This Fact Sheet provides a summary of PE study results from January 1993 through April 1997. It should be of particular interest to USGS customers and potential customers of the NWQL, water-quality specialists, cooperators, and agencies of the Federal Government.
Wolfe, Jace; Morais Duke, Mila; Schafer, Erin; Cire, George; Menapace, Christine; O'Neill, Lori
2016-01-01
The objective of this study was to evaluate the potential improvement in word recognition in quiet and in noise obtained with use of a Bluetooth-compatible wireless hearing assistance technology (HAT) relative to the acoustic mobile telephone condition (e.g. the mobile telephone receiver held to the microphone of the sound processor). A two-way repeated measures design was used to evaluate differences in telephone word recognition obtained in quiet and in competing noise in the acoustic mobile telephone condition compared to performance obtained with use of the CI sound processor and a telephone HAT. Sixteen adult users of Nucleus cochlear implants and the Nucleus 6 sound processor were included in this study. Word recognition over the mobile telephone in quiet and in noise was significantly better with use of the wireless HAT compared to performance in the acoustic mobile telephone condition. Word recognition over the mobile telephone was better in quiet when compared to performance in noise. The results of this study indicate that use of a wireless HAT improves word recognition over the mobile telephone in quiet and in noise relative to performance in the acoustic mobile telephone condition for a group of adult cochlear implant recipients.
Evaluation of Eco-Efficiency and Performance of Retrofit Materials
NASA Astrophysics Data System (ADS)
Gopinath, Smitha; Rama Chandra Murthy, A.; Iyer, Nagesh R.; Kokila, S.
2015-12-01
In this work three materials namely Fiber Reinforced Polymer (FRP), ferrocement and Textile Reinforced Concrete (TRC) have been evaluated towards their performance efficiency and eco-effectiveness for sustainable retrofitting applications. Investigations have been carried out for flexural strengthening of RC beams with FRP, ferrocement and TRC. It is observed that in the case of FRP, it is not possible to tailor the material according to design requirements and most of the time strengthened structure becomes over stiff. Eco-effectiveness of these retrofitting materials has been evaluated by computing the embodied energy. It is observed that the amount of CO2 emitted by TRC is less compared to other retrofit materials. Further, the performance point of retrofitted RC frames has been evaluated and damage index has been calculated to find out the effective retrofit material. It is concluded that, if RC frame is retrofitted with FRP and TRC, it undergoes less damage compared to ferrocement.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Turner, S.L.
1996-02-01
The report gives results of an evaluation of the feasibility of using alternatives to high volatile organic compound/hazardous air pollutant (VOC/HAP) solvent-based, chemical strippers that are currently used in the furniture repair and refinishing industry to remove both traditional high-VOC lacquer and emerging, low-VOC, wood furniture coatings. Objectives of the research were to: (1) conduct a laboratory evaluation of the performance of five alternative chemical stripper formulations and compare their performance to that of a traditional solvent-based chemical stripper formulation on three coatings types found on wood furniture substrates, and (2) assess, in a furniture refinishing facility, the use ofmore » the best performing alternative chemical stripper on traditional furniture coatings and new emerging low-VOC furniture coatings. Alternative chemical strippers were evaluated based on their stripping effectiveness compare to a methylene-chloride-based stripper.« less
La Fata, Concetta Manuela; Lupo, Toni; Piazza, Tommaso
2017-11-21
A novel fuzzy-based approach which combines ELECTRE III along with the Importance-Performance Analysis (IPA) is proposed in the present work to comparatively evaluate the service quality in the public healthcare context. Specifically, ELECTRE III is firstly considered to compare the service performance of examined hospitals in a noncompensatory manner. Afterwards, IPA is employed to support the service quality management to point out improvement needs and their priorities. The proposed approach also incorporates features of the Fuzzy Set Theory so as to address the possible uncertainty, subjectivity and vagueness of involved experts in evaluating the service quality. The model is applied to five major Sicilian public hospitals, and strengths and criticalities of the delivered service are finally highlighted and discussed. Although several approaches combining multi-criteria methods have already been proposed in the literature to evaluate the service performance in the healthcare field, to the best of the authors' knowledge the present work represents the first attempt at comparing service performance of alternatives in a noncompensatory manner in the investigated context.
Fakih, Mohamad G; Skierczynski, Boguslow; Bufalino, Angelo; Groves, Clariecia; Roberts, Phillip; Heavens, Michelle; Hendrich, Ann; Haydar, Ziad
2016-12-01
The standardized infection ratio (SIR) evaluates individual publicly reported health care-associated infections, but it may not assess overall performance. We piloted an infection composite score (ICS) in 82 hospitals of a single health system. The ICS is a combined score for central line-associated bloodstream infections, catheter-associated urinary tract infections, colon and abdominal hysterectomy surgical site infections, and hospital-onset methicillin-resistant Staphylococcus aureus bacteremia and Clostridium difficile infections. Individual facility ICSs were calculated by normalizing each of the 6 SIR events to the system SIR for baseline and performance periods (ICS ib and ICS ip , respectively). A hospital ICS ib reflected its baseline performance compared with system baseline, whereas a ICS ip provided information of its outcome changes compared with system baseline. Both the ICS ib (baseline 2013) and ICS ip (performance 2014) were calculated for 63 hospitals (reporting at least 4 of the 6 event types). The ICS ip improved in 36 of 63 (57.1%) hospitals in 2014 when compared with the ICS ib in 2013. The ICS ib 2013 median was 0.96 (range, 0.13-2.94) versus the 2014 ICS ip median of 0.92 (range, 0-6.55). Variation was more evident in hospitals with ≤100 beds. The system performance score (ICS sp ) in 2014 was 0.95, a 5% improvement compared with 2013. The proposed ICS may help large health systems and state hospital associations better evaluate key infectious outcomes, comparing them with historic and concurrent performance of peers. Copyright © 2016 Association for Professionals in Infection Control and Epidemiology, Inc. Published by Elsevier Inc. All rights reserved.
Comparative performance of rubber modified hot mix asphalt under ALF loading : technical summary.
DOT National Transportation Integrated Search
2004-07-01
The objectives of this study were to evaluate the overall performance under accelerated loading of hot mix asphalt mixtures containing powdered rubber modifier (PRM) as compared to similar mixes with conventional HMA and to optimize the use of these ...
NASA Astrophysics Data System (ADS)
Rahman, Tasneem; Tahtali, Murat; Pickering, Mark R.
2014-09-01
The purpose of this study is to derive optimized parameters for a detector module employing an off-the-shelf X-ray camera and a pinhole array collimator applicable for a range of different SPECT systems. Monte Carlo simulations using the Geant4 application for tomographic emission (GATE) were performed to estimate the performance of the pinhole array collimators and were compared to that of low energy high resolution (LEHR) parallel-hole collimator in a four head SPECT system. A detector module was simulated to have 48 mm by 48 mm active area along with 1mm, 1.6mm and 2 mm pinhole aperture sizes at 0.48 mm pitch on a tungsten plate. Perpendicular lead septa were employed to verify overlapping and non-overlapping projections against a proper acceptance angle without lead septa. A uniform shape cylindrical water phantom was used to evaluate the performance of the proposed four head SPECT system of the pinhole array detector module. For each head, 100 pinhole configurations were evaluated based on sensitivity and detection efficiency for 140 keV γ-rays, and compared to LEHR parallel-hole collimator. SPECT images were reconstructed based on filtered back projection (FBP) algorithm where neither scatter nor attenuation corrections were performed. A better reconstruction algorithm development for this specific system is in progress. Nevertheless, activity distribution was well visualized using the backprojection algorithm. In this study, we have evaluated several quantitative and comparative analyses for a pinhole array imaging system providing high detection efficiency and better system sensitivity over a large FOV, comparing to the conventional four head SPECT system. The proposed detector module is expected to provide improved performance in various SPECT imaging.
Evaluation of Asphalt Mixture Low-Temperature Performance in Bending Beam Creep Test.
Pszczola, Marek; Jaczewski, Mariusz; Rys, Dawid; Jaskula, Piotr; Szydlowski, Cezary
2018-01-10
Low-temperature cracking is one of the most common road pavement distress types in Poland. While bitumen performance can be evaluated in detail using bending beam rheometer (BBR) or dynamic shear rheometer (DSR) tests, none of the normalized test methods gives a comprehensive representation of low-temperature performance of the asphalt mixtures. This article presents the Bending Beam Creep test performed at temperatures from -20 °C to +10 °C in order to evaluate the low-temperature performance of asphalt mixtures. Both validation of the method and its utilization for the assessment of eight types of wearing courses commonly used in Poland were described. The performed test indicated that the source of bitumen and its production process (and not necessarily only bitumen penetration) had a significant impact on the low-temperature performance of the asphalt mixtures, comparable to the impact of binder modification (neat, polymer-modified, highly modified) and the aggregate skeleton used in the mixture (Stone Mastic Asphalt (SMA) vs. Asphalt Concrete (AC)). Obtained Bending Beam Creep test results were compared with the BBR bitumen test. Regression analysis confirmed that performing solely bitumen tests is insufficient for comprehensive low-temperature performance analysis.
Evaluation of Asphalt Mixture Low-Temperature Performance in Bending Beam Creep Test
Rys, Dawid; Jaskula, Piotr; Szydlowski, Cezary
2018-01-01
Low-temperature cracking is one of the most common road pavement distress types in Poland. While bitumen performance can be evaluated in detail using bending beam rheometer (BBR) or dynamic shear rheometer (DSR) tests, none of the normalized test methods gives a comprehensive representation of low-temperature performance of the asphalt mixtures. This article presents the Bending Beam Creep test performed at temperatures from −20 °C to +10 °C in order to evaluate the low-temperature performance of asphalt mixtures. Both validation of the method and its utilization for the assessment of eight types of wearing courses commonly used in Poland were described. The performed test indicated that the source of bitumen and its production process (and not necessarily only bitumen penetration) had a significant impact on the low-temperature performance of the asphalt mixtures, comparable to the impact of binder modification (neat, polymer-modified, highly modified) and the aggregate skeleton used in the mixture (Stone Mastic Asphalt (SMA) vs. Asphalt Concrete (AC)). Obtained Bending Beam Creep test results were compared with the BBR bitumen test. Regression analysis confirmed that performing solely bitumen tests is insufficient for comprehensive low-temperature performance analysis. PMID:29320443
VERIFICATION OF THE HYDROLOGIC EVALUATION OF LANDFILL PERFORMANCE (HELP) MODEL USING FIELD DATA
The report describes a study conducted to verify the Hydrologic Evaluation of Landfill Performance (HELP) computer model using existing field data from a total of 20 landfill cells at 7 sites in the United States. Simulations using the HELP model were run to compare the predicted...
Performance Outcomes of an Online First Aid and CPR Course for Laypersons
ERIC Educational Resources Information Center
Cason, Carolyn L.; Stiller, Janeth
2011-01-01
Objective: The study evaluated the effectiveness of an online first aid course by comparing it with the traditional instructor-led course. An effective online course increases course accessibility and mitigates the major deterrent to widespread layperson training. Design: A comparison group design evaluated performances among 25 laypersons…
Campagne, Fabien
2008-02-29
The evaluation of information retrieval techniques has traditionally relied on human judges to determine which documents are relevant to a query and which are not. This protocol is used in the Text Retrieval Evaluation Conference (TREC), organized annually for the past 15 years, to support the unbiased evaluation of novel information retrieval approaches. The TREC Genomics Track has recently been introduced to measure the performance of information retrieval for biomedical applications. We describe two protocols for evaluating biomedical information retrieval techniques without human relevance judgments. We call these protocols No Title Evaluation (NT Evaluation). The first protocol measures performance for focused searches, where only one relevant document exists for each query. The second protocol measures performance for queries expected to have potentially many relevant documents per query (high-recall searches). Both protocols take advantage of the clear separation of titles and abstracts found in Medline. We compare the performance obtained with these evaluation protocols to results obtained by reusing the relevance judgments produced in the 2004 and 2005 TREC Genomics Track and observe significant correlations between performance rankings generated by our approach and TREC. Spearman's correlation coefficients in the range of 0.79-0.92 are observed comparing bpref measured with NT Evaluation or with TREC evaluations. For comparison, coefficients in the range 0.86-0.94 can be observed when evaluating the same set of methods with data from two independent TREC Genomics Track evaluations. We discuss the advantages of NT Evaluation over the TRels and the data fusion evaluation protocols introduced recently. Our results suggest that the NT Evaluation protocols described here could be used to optimize some search engine parameters before human evaluation. Further research is needed to determine if NT Evaluation or variants of these protocols can fully substitute for human evaluations.
ERIC Educational Resources Information Center
Brown, William R.; And Others
An evaluation was conducted of minority (Black, Hispanic, Asian, American Indian) and female performance in the Army's Initial Entry Rotary Wing flight training program. Each minority group was compared to a matched sample of majority students. The groups were matched on several test scores, education level, age, rank, and source of entry. The…
Google Scholar Search Performance: Comparative Recall and Precision
ERIC Educational Resources Information Center
Walters, William H.
2009-01-01
This paper presents a comparative evaluation of Google Scholar and 11 other bibliographic databases (Academic Search Elite, AgeLine, ArticleFirst, EconLit, GEOBASE, MEDLINE, PAIS International, POPLINE, Social Sciences Abstracts, Social Sciences Citation Index, and SocINDEX), focusing on search performance within the multidisciplinary field of…
Passion, self-esteem, and the role of comparative performance evaluation.
Stenseng, Frode; Dalskau, Lina Harvold
2010-12-01
Two studies were conducted to investigate the paradoxical behavior of obsessively passionate individuals: they tend to continue involvement in their passion activity despite reporting the activity as a source of ill-being. We suggested that elevated self-esteem in activity engagement could be one such persistence-promoting factor. In Study 1, we found that obsessively passionate individuals reported lower levels of global self-esteem compared with harmoniously passionate individuals, whereas they reported similar levels of activity-related self-esteem. We suggest that this indicates that obsessively passionate individuals try to compensate for low global self-esteem by utilizing self-esteem contingencies in their passion activity. Study 2 showed that activity-related self-esteem among obsessively passionate individuals was found to be strongly related to comparative performance evaluations, whereas no such relationship was found among harmoniously passionate individuals. We suggest that self-esteem contingencies related to comparative performance criteria represent a persistence-promoting factor among obsessively passionate individuals.
Performance Evaluation of Fusing Protected Fingerprint Minutiae Templates on the Decision Level
Yang, Bian; Busch, Christoph; de Groot, Koen; Xu, Haiyun; Veldhuis, Raymond N. J.
2012-01-01
In a biometric authentication system using protected templates, a pseudonymous identifier is the part of a protected template that can be directly compared. Each compared pair of pseudonymous identifiers results in a decision testing whether both identifiers are derived from the same biometric characteristic. Compared to an unprotected system, most existing biometric template protection methods cause to a certain extent degradation in biometric performance. Fusion is therefore a promising way to enhance the biometric performance in template-protected biometric systems. Compared to feature level fusion and score level fusion, decision level fusion has not only the least fusion complexity, but also the maximum interoperability across different biometric features, template protection and recognition algorithms, templates formats, and comparison score rules. However, performance improvement via decision level fusion is not obvious. It is influenced by both the dependency and the performance gap among the conducted tests for fusion. We investigate in this paper several fusion scenarios (multi-sample, multi-instance, multi-sensor, multi-algorithm, and their combinations) on the binary decision level, and evaluate their biometric performance and fusion efficiency on a multi-sensor fingerprint database with 71,994 samples. PMID:22778583
MSFC Skylab thermal and environmental control system mission evaluation
NASA Technical Reports Server (NTRS)
Hopson, G. D.; Littles, J. W.; Patterson, W. C.
1974-01-01
An evaluation of the performance of the Skylab thermal and environmental control system is presented. Actual performance is compared to design and functional requirements and anomalies and discrepancies and their resolution are discussed. The thermal and environmental control systems performed their intended role. Based on the experience gained in design, development and flight, recommendations are provided which may be beneficial to future system designs.
Performance evaluation of two black nickel and two black chrome solar collectors
NASA Technical Reports Server (NTRS)
Losey, R.
1977-01-01
The test program was based on the evaluation of four unique solar collectors described below: (1) black nickel collector surface with a desiccant drying bed, (2) black nickel collector surface without a desiccant drying bed, (3) black chrome collector surface with a dessicant drying bed, and (4) black chrome collector surface without a desiccant drying bed. The test program included three distinct phases: Initial performance evaluation, natural environmental aging, and post-aging performance evaluation. Results of Phase III testing conclusively indicated a higher normalized efficiency for Black Chrome surfaces when compared to Black Nickel.
Evaluation of SAPHIRE: an automated approach to indexing and retrieving medical literature.
Hersh, W.; Hickam, D. H.; Haynes, R. B.; McKibbon, K. A.
1991-01-01
An analysis of SAPHIRE, an experimental information retrieval system featuring automated indexing and natural language retrieval, was performed on MEDLINE references using data previously generated for a MEDLINE evaluation. Compared with searches performed by novice and expert physicians using MEDLINE, SAPHIRE achieved comparable recall and precision. While its combined recall and precision performance did not equal the level of librarians, SAPHIRE did achieve a significantly higher level of absolute recall. SAPHIRE has other potential advantages over existing MEDLINE systems. Its natural language interface does not require knowledge of MeSH, and it provides relevance ranking of retrieved references. PMID:1807718
Comparative performance between compressed and uncompressed airborne imagery
NASA Astrophysics Data System (ADS)
Phan, Chung; Rupp, Ronald; Agarwal, Sanjeev; Trang, Anh; Nair, Sumesh
2008-04-01
The US Army's RDECOM CERDEC Night Vision and Electronic Sensors Directorate (NVESD), Countermine Division is evaluating the compressibility of airborne multi-spectral imagery for mine and minefield detection application. Of particular interest is to assess the highest image data compression rate that can be afforded without the loss of image quality for war fighters in the loop and performance of near real time mine detection algorithm. The JPEG-2000 compression standard is used to perform data compression. Both lossless and lossy compressions are considered. A multi-spectral anomaly detector such as RX (Reed & Xiaoli), which is widely used as a core algorithm baseline in airborne mine and minefield detection on different mine types, minefields, and terrains to identify potential individual targets, is used to compare the mine detection performance. This paper presents the compression scheme and compares detection performance results between compressed and uncompressed imagery for various level of compressions. The compression efficiency is evaluated and its dependence upon different backgrounds and other factors are documented and presented using multi-spectral data.
Evaluation, Language, and Untranslatables
ERIC Educational Resources Information Center
Dahler-Larsen, Peter; Abma, Tineke; Bustelo, María; Irimia, Roxana; Kosunen, Sonja; Kravchuk, Iryna; Minina, Elena; Segerholm, Christina; Shiroma, Eneida; Stame, Nicoletta; Tshali, Charlie Kabanga
2017-01-01
The issue of translatability is pressing in international evaluation, in global transfer of evaluative instruments, in comparative performance management, and in culturally responsive evaluation. Terms that are never fully understood, digested, or accepted may continue to influence issues, problems, and social interactions in and around and after…
Source term model evaluations for the low-level waste facility performance assessment
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yim, M.S.; Su, S.I.
1995-12-31
The estimation of release of radionuclides from various waste forms to the bottom boundary of the waste disposal facility (source term) is one of the most important aspects of LLW facility performance assessment. In this work, several currently used source term models are comparatively evaluated for the release of carbon-14 based on a test case problem. The models compared include PRESTO-EPA-CPG, IMPACTS, DUST and NEFTRAN-II. Major differences in assumptions and approaches between the models are described and key parameters are identified through sensitivity analysis. The source term results from different models are compared and other concerns or suggestions are discussed.
Clare, Linda; Whitaker, Christopher J; Roberts, Judith L; Nelis, Sharon M; Martyr, Anthony; Marková, Ivana S; Roth, Ilona; Woods, Robert T; Morris, Robin G
2013-01-01
Measures of memory awareness based on evaluative judgement and performance monitoring are often regarded as equivalent, but the Levels of Awareness Framework suggests they reflect different awareness phenomena. Examination of memory awareness among groups with differing degrees of impairment provides a test of this proposition. Ninety-nine people with dementia (PwD), 30 people with mild cognitive impairment (PwMCI), and their relatives completed isomorphic performance monitoring and evaluative judgement measures of memory awareness and were followed up at 12 and (PwD only) 20 months. In addition to the resulting awareness indices, comparative accuracy scores were calculated using the relatives' data to establish whether any inaccuracy was specific to self-ratings. When making evaluative judgements about their memory in general, both PwD and PwMCI tended to overestimate their own functioning relative to informant ratings made by relatives. When monitoring performance on memory tests, PwD again overestimated performance relative to test scores, but PwMCI were much more accurate. Comparative accuracy scores indicated that, unlike PwD, PwMCI do not show a specific inaccuracy in self-related appraisals. The results support the proposition that awareness indices at the levels of evaluative judgement and performance monitoring should be regarded as reflecting distinct awareness phenomena. Copyright © 2013 S. Karger AG, Basel.
NASA Astrophysics Data System (ADS)
Fanjul-Vélez, Félix; de la Torre-Hernández, José María; Ortega-Quijano, Noé; Zueco-Gil, José Javier; Arce-Diego, José Luis
2009-07-01
In this work, we present clinical images of IVUS and OCT in the evaluation of pharmacological stent endothelization. These preliminary imaging results are analyzed and compared in order to determine the ability of these technologies to visualize relevant intravascular features of interest in interventional cardiology. The results enable to compare the performance of both techniques and to evaluate their potential for clinical purposes.
ERIC Educational Resources Information Center
Christensen, William Howard
2013-01-01
In 2010, the federal government increased accountability expectations by placing more emphasis on monitoring teacher performance. Using a model that focuses on the New York State teacher evaluation system, that is comprised of a rubric for observation, local student assessment scores, and student state assessment scores, this…
DOT National Transportation Integrated Search
2009-01-01
Lightweight high performance concrete (LWHPC) is expected to provide high strength and high durability along with reduced weight. The purpose of this research was to evaluate and compare the prestressed LWHPC bulb-T beams and decks in two bridge stru...
An Evaluation of the IntelliMetric[SM] Essay Scoring System
ERIC Educational Resources Information Center
Rudner, Lawrence M.; Garcia, Veronica; Welch, Catherine
2006-01-01
This report provides a two-part evaluation of the IntelliMetric[SM] automated essay scoring system based on its performance scoring essays from the Analytic Writing Assessment of the Graduate Management Admission Test[TM] (GMAT[TM]). The IntelliMetric system performance is first compared to that of individual human raters, a Bayesian system…
NASA Technical Reports Server (NTRS)
Cook, M.
1989-01-01
The procedures, performance, and results obtained from the Thiokol Corporation/Wasatch Redesigned Field Joint Protection System (FJPS) Installation Evaluation are documented. The purpose of the evaluation was to demonstrate and develop the procedures required to install two different concepts (referred to as Concepts 1 and 3) of the redesigned FJPS. The processing capability of each configuration was then evaluated and compared. The FJPS is installed on redesigned solid rocket motors (RSRM) to protect the field joints from rain intrusion and to maintain the joint temperature sensor measurement between 85 and 122 F while the boosters are on the launch pad. The FJPS is being redesigned to reduce installation timelines at KSC and to simplify or eliminate installation processing problems related to the present design of an EPDM moisture seal/extruded cork combination. Several installation techniques were evaluated, and a preferred method of application was developed for each concept. The installations were performed with the test article in the vertical (flight) position. Comparative timelines between the two concepts were also developed. An additional evaluation of the Concept 3 configuration was performed with the test article in the horizontal position, to simulate an overhead installation on a technical evaluation motor (TEM).
NASA Technical Reports Server (NTRS)
Saini, Subhash; Hood, Robert T.; Chang, Johnny; Baron, John
2016-01-01
We present a performance evaluation conducted on a production supercomputer of the Intel Xeon Processor E5- 2680v3, a twelve-core implementation of the fourth-generation Haswell architecture, and compare it with Intel Xeon Processor E5-2680v2, an Ivy Bridge implementation of the third-generation Sandy Bridge architecture. Several new architectural features have been incorporated in Haswell including improvements in all levels of the memory hierarchy as well as improvements to vector instructions and power management. We critically evaluate these new features of Haswell and compare with Ivy Bridge using several low-level benchmarks including subset of HPCC, HPCG and four full-scale scientific and engineering applications. We also present a model to predict the performance of HPCG and Cart3D within 5%, and Overflow within 10% accuracy.
Ensemble of trees approaches to risk adjustment for evaluating a hospital's performance.
Liu, Yang; Traskin, Mikhail; Lorch, Scott A; George, Edward I; Small, Dylan
2015-03-01
A commonly used method for evaluating a hospital's performance on an outcome is to compare the hospital's observed outcome rate to the hospital's expected outcome rate given its patient (case) mix and service. The process of calculating the hospital's expected outcome rate given its patient mix and service is called risk adjustment (Iezzoni 1997). Risk adjustment is critical for accurately evaluating and comparing hospitals' performances since we would not want to unfairly penalize a hospital just because it treats sicker patients. The key to risk adjustment is accurately estimating the probability of an Outcome given patient characteristics. For cases with binary outcomes, the method that is commonly used in risk adjustment is logistic regression. In this paper, we consider ensemble of trees methods as alternatives for risk adjustment, including random forests and Bayesian additive regression trees (BART). Both random forests and BART are modern machine learning methods that have been shown recently to have excellent performance for prediction of outcomes in many settings. We apply these methods to carry out risk adjustment for the performance of neonatal intensive care units (NICU). We show that these ensemble of trees methods outperform logistic regression in predicting mortality among babies treated in NICU, and provide a superior method of risk adjustment compared to logistic regression.
Social status determines how we monitor and evaluate our performance
Kostermans, Evelien; Milivojevic, Branka; De Cremer, David
2012-01-01
Since people with low status are more likely to experience social evaluative threat and are therefore more inclined to monitor for these threats and inhibit approach behaviour, we expected that low-status subjects would be more engaged in evaluating their own performance, compared with high-status subjects. We created a highly salient social hierarchy based on the performance of a simple time estimation task. Subjects could achieve high, middle or low status while performing this task simultaneously with other two players who were either higher or lower in status. Subjects received feedback on their own performance, as well as on the performance of the other two players simultaneously. Electroencephalography (EEG) was recorded from all three participants. The results showed that medial frontal negativity (an event-related potential reflecting performance evaluation) was significantly enhanced for low-status subjects. Implications for status-related differences in goal-directed behaviour are discussed. PMID:21421733
Automatic evidence quality prediction to support evidence-based decision making.
Sarker, Abeed; Mollá, Diego; Paris, Cécile
2015-06-01
Evidence-based medicine practice requires practitioners to obtain the best available medical evidence, and appraise the quality of the evidence when making clinical decisions. Primarily due to the plethora of electronically available data from the medical literature, the manual appraisal of the quality of evidence is a time-consuming process. We present a fully automatic approach for predicting the quality of medical evidence in order to aid practitioners at point-of-care. Our approach extracts relevant information from medical article abstracts and utilises data from a specialised corpus to apply supervised machine learning for the prediction of the quality grades. Following an in-depth analysis of the usefulness of features (e.g., publication types of articles), they are extracted from the text via rule-based approaches and from the meta-data associated with the articles, and then applied in the supervised classification model. We propose the use of a highly scalable and portable approach using a sequence of high precision classifiers, and introduce a simple evaluation metric called average error distance (AED) that simplifies the comparison of systems. We also perform elaborate human evaluations to compare the performance of our system against human judgments. We test and evaluate our approaches on a publicly available, specialised, annotated corpus containing 1132 evidence-based recommendations. Our rule-based approach performs exceptionally well at the automatic extraction of publication types of articles, with F-scores of up to 0.99 for high-quality publication types. For evidence quality classification, our approach obtains an accuracy of 63.84% and an AED of 0.271. The human evaluations show that the performance of our system, in terms of AED and accuracy, is comparable to the performance of humans on the same data. The experiments suggest that our structured text classification framework achieves evaluation results comparable to those of human performance. Our overall classification approach and evaluation technique are also highly portable and can be used for various evidence grading scales. Copyright © 2015 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Camp, H. A.; Moyer, Steven; Moore, Richard K.
2010-04-01
The Night Vision and Electronic Sensors Directorate's current time-limited search (TLS) model, which makes use of the targeting task performance (TTP) metric to describe image quality, does not explicitly account for the effects of visual clutter on observer performance. The TLS model is currently based on empirical fits to describe human performance for a time of day, spectrum and environment. Incorporating a clutter metric into the TLS model may reduce the number of these empirical fits needed. The masked target transform volume (MTTV) clutter metric has been previously presented and compared to other clutter metrics. Using real infrared imagery of rural images with varying levels of clutter, NVESD is currently evaluating the appropriateness of the MTTV metric. NVESD had twenty subject matter experts (SME) rank the amount of clutter in each scene in a series of pair-wise comparisons. MTTV metric values were calculated and then compared to the SME observers rankings. The MTTV metric ranked the clutter in a similar manner to the SME evaluation, suggesting that the MTTV metric may emulate SME response. This paper is a first step in quantifying clutter and measuring the agreement to subjective human evaluation.
Key results of battery performance and life tests at Argonne National Laboratory
NASA Astrophysics Data System (ADS)
Deluca, W. H.; Gillie, K. R.; Kulaga, J. E.; Smaga, J. A.; Tummillo, A. F.; Webster, C. E.
1991-12-01
Advanced battery technology evaluations are performed under simulated electric vehicle operating conditions at Argonne National Laboratory's & Diagnostic Laboratory (ADL). The ADL provide a common basis for both performance characterization and life evaluation with unbiased application of tests and analyses. This paper summarizes the performance characterizations and life evaluations conducted in 1991 on twelve single cells and eight 3- to 360-cell modules that encompass six battery technologies (Na/S, Li/MS, Ni/MH, Zn/Br, Ni/Fe, and Pb-Acid). These evaluations were performed for the Department of Energy, Office of Transportation Technologies, Electric and Hybrid Propulsion Division. The results measure progress in battery R & D programs, compare battery technologies, and provide basic data for modeling and continuing R & D to battery users, developers, and program managers.
Ouro-Koura, Abdou-Rahim; Sopoh, Emmanuel Ghislain; Sossa, Jerôme Charles; Glèlè-Ahanhanzo, Yolaine; Agueh, Victoire; Ouendo, Edgard-Marius; Ouedraogo, Laurent
2018-01-01
This study aimed to evaluate the performance of the logistics management system (LMS) of malaria control (MC) resources in the Littoral Department, Benin, in 2017. In June 2017, we conducted a cross-sectional evaluative study focusing on the structures for the storage and the disposal of MC resources as well as on staff involved in their management. The performance of the the logistics management system was evaluated on the basis of the observed compliance of the components and sub-components of the "Structure", the "Process" and the "Results" with the norms and standards defined by the Ministry of Health. A total of 36 structures were investigated and secondary target was surveyed. It followed that 52,78% of the structures for the storage and the disposal of MC resources met the requirements for resources storage while only 33.33% of MC resources management staff were trained in logistics management. The performance of the logistics management system of MC resources was inadequate (compliance 59,13 % compared to the expected score). The structure, as well as the process were non-compliant with the standards ( 60,20% and 73.22% compared to the expected score respectively), leading to negative results (41.53% compared to the expected score). The most inadequate sub-component was the logistics management information system (LMIS). This study highlights the role of LMS for better performance of MC resources management. Particular attention should be given to this component.
Natt, Neena; Chang, Alice Y; Berbari, Elie F; Kennel, Kurt A; Kearns, Ann E
2016-01-01
To determine which residency characteristics are associated with performance during endocrinology fellowship training as measured by competency-based faculty evaluation scores and faculty global ratings of trainee performance. We performed a retrospective review of interview applications from endocrinology fellows who graduated from a single academic institution between 2006 and 2013. Performance measures included competency-based faculty evaluation scores and faculty global ratings. The association between applicant characteristics and measures of performance during fellowship was examined by linear regression. The presence of a laudatory comparative statement in the residency program director's letter of recommendation (LoR) or experience as a chief resident was significantly associated with competency-based faculty evaluation scores (β = 0.22, P = .001; and β = 0.24, P = .009, respectively) and faculty global ratings (β = 0.85, P = .006; and β = 0.96, P = .015, respectively). The presence of a laudatory comparative statement in the residency program director's LoR or experience as a chief resident were significantly associated with overall performance during subspecialty fellowship training. Future studies are needed in other cohorts to determine the broader implications of these findings in the application and selection process.
Comparative evaluation of e-beam sensitive chemically amplified resists for mask making
NASA Astrophysics Data System (ADS)
Irmscher, Mathias; Beyer, Dirk; Butschke, Joerg; Constantine, Chris; Hoffmann, Thomas; Koepernik, Corinna; Krauss, Christian; Leibold, Bernd; Letzkus, Florian; Mueller, Dietmar; Springer, Reinhard; Voehringer, Peter
2002-07-01
Positive tone chemically amplified resists CAP209, EP012M (TOK), KRS-XE (JSR) and FEP171 (Fuji) were evaluated for mask making. The investigations were performed on an advanced tool set comprising of a Steag coater ASR5000, Steag developer ASP5000, 50kV e-beam writer Leica SB350, UNAXIS MASK ETCHER III , STS ICP silicon etcher and a CD-SEM KLA8100. We investigated and compared resolution, sensitivity, resist slope, dark field loss, CD-uniformity, line edge roughness, and etch resistance of the evaluated resists. Furthermore, the influence of post coating delay, post exposure delay and other process parameters on the resist performance was determined.
Editorial: Cognitive Architectures, Model Comparison and AGI
NASA Astrophysics Data System (ADS)
Lebiere, Christian; Gonzalez, Cleotilde; Warwick, Walter
2010-12-01
Cognitive Science and Artificial Intelligence share compatible goals of understanding and possibly generating broadly intelligent behavior. In order to determine if progress is made, it is essential to be able to evaluate the behavior of complex computational models, especially those built on general cognitive architectures, and compare it to benchmarks of intelligent behavior such as human performance. Significant methodological challenges arise, however, when trying to extend approaches used to compare model and human performance from tightly controlled laboratory tasks to complex tasks involving more open-ended behavior. This paper describes a model comparison challenge built around a dynamic control task, the Dynamic Stocks and Flows. We present and discuss distinct approaches to evaluating performance and comparing models. Lessons drawn from this challenge are discussed in light of the challenge of using cognitive architectures to achieve Artificial General Intelligence.
Evaluating the Performance of Online K-12 Schools
ERIC Educational Resources Information Center
Carpenter, Dick; Kafer, Krista; Reeser, Kelly; Shafer, Sheryl
2015-01-01
This article examines K-12 online student and school performance across an entire state (Colorado) in the United States through two comparisons. First, state assessment scores of students in online schools are compared to those in traditional brick and mortar schools. Second, the accountability scores of online schools are compared to those of…
Comparing Performance of Methods to Deal with Differential Attrition in Lottery Based Evaluations
ERIC Educational Resources Information Center
Zamarro, Gema; Anderson, Kaitlin; Steele, Jennifer; Miller, Trey
2016-01-01
The purpose of this study is to study the performance of different methods (inverse probability weighting and estimation of informative bounds) to control for differential attrition by comparing the results of different methods using two datasets: an original dataset from Portland Public Schools (PPS) subject to high rates of differential…
Comparative Life Cycle Assessment between Warm SMA and Conventional SMA
DOT National Transportation Integrated Search
2011-09-01
This report presents the comparative life cycle assessment (LCA) between warm stone mastic asphalt (SMA) and conventional : SMA. Specifically, the study evaluated and compared the life cycle environmental and economic performances of two mixtures: a ...
Development and evaluation of automatic landing control laws for light wing loading STOL aircraft
NASA Technical Reports Server (NTRS)
Feinreich, B.; Degani, O.; Gevaert, G.
1981-01-01
Automatic flare and decrab control laws were developed for NASA's experimental Twin Otter. This light wing loading STOL aircraft was equipped with direct lift control (DLC) wing spoilers to enhance flight path control. Automatic landing control laws that made use of the spoilers were developed, evaluated in a simulation and the results compared with these obtained for configurations that did not use DLC. The spoilers produced a significant improvement in performance. A simulation that could be operated faster than real time in order to provide statistical landing data for a large number of landings over a wide spectrum of disturbances in a short time was constructed and used in the evaluation and refinement of control law configurations. A longitudinal control law that had been previously developed and evaluated in flight was also simulated and its performance compared with that of the control laws developed. Runway alignment control laws were also defined, evaluated, and refined to result in a final recommended configuration. Good landing performance, compatible with Category 3 operation into STOL runways, was obtained.
NASA Astrophysics Data System (ADS)
Lee, Donghoon; Choi, Sunghoon; Kim, Hee-Joung
2018-03-01
When processing medical images, image denoising is an important pre-processing step. Various image denoising algorithms have been developed in the past few decades. Recently, image denoising using the deep learning method has shown excellent performance compared to conventional image denoising algorithms. In this study, we introduce an image denoising technique based on a convolutional denoising autoencoder (CDAE) and evaluate clinical applications by comparing existing image denoising algorithms. We train the proposed CDAE model using 3000 chest radiograms training data. To evaluate the performance of the developed CDAE model, we compare it with conventional denoising algorithms including median filter, total variation (TV) minimization, and non-local mean (NLM) algorithms. Furthermore, to verify the clinical effectiveness of the developed denoising model with CDAE, we investigate the performance of the developed denoising algorithm on chest radiograms acquired from real patients. The results demonstrate that the proposed denoising algorithm developed using CDAE achieves a superior noise-reduction effect in chest radiograms compared to TV minimization and NLM algorithms, which are state-of-the-art algorithms for image noise reduction. For example, the peak signal-to-noise ratio and structure similarity index measure of CDAE were at least 10% higher compared to conventional denoising algorithms. In conclusion, the image denoising algorithm developed using CDAE effectively eliminated noise without loss of information on anatomical structures in chest radiograms. It is expected that the proposed denoising algorithm developed using CDAE will be effective for medical images with microscopic anatomical structures, such as terminal bronchioles.
Evaluating the effectiveness of performance based pavement marking maintenance contracts in Texas.
DOT National Transportation Integrated Search
2014-01-01
Performance-based pavement marking maintenance contracts (PBPMMCs) are one of the latest : mechanisms used to maintain adequate pavement marking performance levels. TxDOT has issued two : PBPMMCs, but the effectiveness of these contracts as compared ...
Yoon, Hee Mang; Suh, Chong Hyun; Cho, Young Ah; Kim, Jeong Rye; Lee, Jin Seong; Jung, Ah Young; Kim, Jung Heon; Lee, Jeong-Yong; Kim, So Yeon
2018-06-01
To evaluate the diagnostic performance of reduced-dose CT for suspected appendicitis. A systematic search of the MEDLINE and EMBASE databases was carried out through to 10 January 2017. Studies evaluating the diagnostic performance of reduced-dose CT for suspected appendicitis in paediatric and adult patients were selected. Pooled summary estimates of sensitivity and specificity were calculated using hierarchical logistic regression modelling. Meta-regression was performed. Fourteen original articles with a total of 3,262 patients were included. For all studies using reduced-dose CT, the summary sensitivity was 96 % (95 % CI 93-98) with a summary specificity of 94 % (95 % CI 92-95). For the 11 studies providing a head-to-head comparison between reduced-dose CT and standard-dose CT, reduced-dose CT demonstrated a comparable summary sensitivity of 96 % (95 % CI 91-98) and specificity of 94 % (95 % CI 93-96) without any significant differences (p=.41). In meta-regression, there were no significant factors affecting the heterogeneity. The median effective radiation dose of the reduced-dose CT was 1.8 mSv (1.46-4.16 mSv), which was a 78 % reduction in effective radiation dose compared to the standard-dose CT. Reduced-dose CT shows excellent diagnostic performance for suspected appendicitis. • Reduced-dose CT shows excellent diagnostic performance for evaluating suspected appendicitis. • Reduced-dose CT has a comparable diagnostic performance to standard-dose CT. • Median effective radiation dose of reduced-dose CT was 1.8 mSv (1.46-4.16). • Reduced-dose CT achieved a 78 % dose reduction compared to standard-dose CT.
Strom, Suzanne L; Anderson, Craig L; Yang, Luanna; Canales, Cecilia; Amin, Alpesh; Lotfipour, Shahram; McCoy, C Eric; Osborn, Megan Boysen; Langdorf, Mark I
2015-11-01
Traditional Advanced Cardiac Life Support (ACLS) courses are evaluated using written multiple-choice tests. High-fidelity simulation is a widely used adjunct to didactic content, and has been used in many specialties as a training resource as well as an evaluative tool. There are no data to our knowledge that compare simulation examination scores with written test scores for ACLS courses. To compare and correlate a novel high-fidelity simulation-based evaluation with traditional written testing for senior medical students in an ACLS course. We performed a prospective cohort study to determine the correlation between simulation-based evaluation and traditional written testing in a medical school simulation center. Students were tested on a standard acute coronary syndrome/ventricular fibrillation cardiac arrest scenario. Our primary outcome measure was correlation of exam results for 19 volunteer fourth-year medical students after a 32-hour ACLS-based Resuscitation Boot Camp course. Our secondary outcome was comparison of simulation-based vs. written outcome scores. The composite average score on the written evaluation was substantially higher (93.6%) than the simulation performance score (81.3%, absolute difference 12.3%, 95% CI [10.6-14.0%], p<0.00005). We found a statistically significant moderate correlation between simulation scenario test performance and traditional written testing (Pearson r=0.48, p=0.04), validating the new evaluation method. Simulation-based ACLS evaluation methods correlate with traditional written testing and demonstrate resuscitation knowledge and skills. Simulation may be a more discriminating and challenging testing method, as students scored higher on written evaluation methods compared to simulation.
Van Roey, Karel; Sokny, Mao; Denis, Leen; Van den Broeck, Nick; Heng, Somony; Siv, Sovannaroth; Sluydts, Vincent; Sochantha, Tho; Coosemans, Marc; Durnez, Lies
2014-12-01
Scaling up of insecticide treated nets has contributed to a substantial malaria decline. However, some malaria vectors, and most arbovirus vectors, bite outdoors and in the early evening. Therefore, topically applied insect repellents may provide crucial additional protection against mosquito-borne pathogens. Among topical repellents, DEET is the most commonly used, followed by others such as picaridin. The protective efficacy of two formulated picaridin repellents against mosquito bites, including arbovirus and malaria vectors, was evaluated in a field study in Cambodia. Over a period of two years, human landing collections were performed on repellent treated persons, with rotation to account for the effect of collection place, time and individual collector. Based on a total of 4996 mosquitoes collected on negative control persons, the overall five hour protection rate was 97.4% [95%CI: 97.1-97.8%], not decreasing over time. Picaridin 20% performed equally well as DEET 20% and better than picaridin 10%. Repellents performed better against Mansonia and Culex spp. as compared to aedines and anophelines. A lower performance was observed against Aedes albopictus as compared to Aedes aegypti, and against Anopheles barbirostris as compared to several vector species. Parity rates were higher in vectors collected on repellent treated person as compared to control persons. As such, field evaluation shows that repellents can provide additional personal protection against early and outdoor biting malaria and arbovirus vectors, with excellent protection up to five hours after application. The heterogeneity in repellent sensitivity between mosquito genera and vector species could however impact the efficacy of repellents in public health programs. Considering its excellent performance and potential to protect against early and outdoor biting vectors, as well as its higher acceptability as compared to DEET, picaridin is an appropriate product to evaluate the epidemiological impact of large scale use of topical repellents on arthropod borne diseases.
Denis, Leen; Van den Broeck, Nick; Heng, Somony; Siv, Sovannaroth; Sluydts, Vincent; Sochantha, Tho; Coosemans, Marc; Durnez, Lies
2014-01-01
Scaling up of insecticide treated nets has contributed to a substantial malaria decline. However, some malaria vectors, and most arbovirus vectors, bite outdoors and in the early evening. Therefore, topically applied insect repellents may provide crucial additional protection against mosquito-borne pathogens. Among topical repellents, DEET is the most commonly used, followed by others such as picaridin. The protective efficacy of two formulated picaridin repellents against mosquito bites, including arbovirus and malaria vectors, was evaluated in a field study in Cambodia. Over a period of two years, human landing collections were performed on repellent treated persons, with rotation to account for the effect of collection place, time and individual collector. Based on a total of 4996 mosquitoes collected on negative control persons, the overall five hour protection rate was 97.4% [95%CI: 97.1–97.8%], not decreasing over time. Picaridin 20% performed equally well as DEET 20% and better than picaridin 10%. Repellents performed better against Mansonia and Culex spp. as compared to aedines and anophelines. A lower performance was observed against Aedes albopictus as compared to Aedes aegypti, and against Anopheles barbirostris as compared to several vector species. Parity rates were higher in vectors collected on repellent treated person as compared to control persons. As such, field evaluation shows that repellents can provide additional personal protection against early and outdoor biting malaria and arbovirus vectors, with excellent protection up to five hours after application. The heterogeneity in repellent sensitivity between mosquito genera and vector species could however impact the efficacy of repellents in public health programs. Considering its excellent performance and potential to protect against early and outdoor biting vectors, as well as its higher acceptability as compared to DEET, picaridin is an appropriate product to evaluate the epidemiological impact of large scale use of topical repellents on arthropod borne diseases. PMID:25522134
Assessment of Anatomical Knowledge and Core Trauma Competency Vascular Skills.
Granite, Guinevere; Pugh, Kristy; Chen, Hegang; Longinaker, Nyaradzo; Garofalo, Evan; Shackelford, Stacy; Shalin, Valerie; Puche, Adam; Pasley, Jason; Sarani, Babak; Henry, Sharon; Bowyer, Mark; Mackenzie, Colin
2018-03-01
Surgical residents express confidence in performing specific vascular exposures before training, but such self-reported confidence did not correlate with co-located evaluator ratings. This study reports residents' self-confidence evaluated before and after Advanced Surgical Skills for Exposure in Trauma (ASSET) cadaver-based training, and 12-18 mo later. We hypothesize that residents will better judge their own skill after ASSET than before when compared with evaluator ratings. Forty PGY2-7 surgical residents performed four procedures: axillary artery (AA), brachial artery (BA), femoral artery exposure and control (FA), and lower extremity fasciotomy (FAS) at the three evaluations. Using 5-point Likert scales, surgeons self-assessed their confidence in anatomical understanding and procedure performance after each procedure and evaluators rated each surgeon accordingly. For all the three evaluations, residents consistently rated their anatomical understanding (p < 0.04) and surgical performance (p < 0.03) higher than evaluators for both FA and FAS. Residents rated their anatomical understanding and surgical performance higher (p < 0.005) than evaluators for BA after training and up to 18 mo later. Only for third AA evaluation were there no rating differences. Residents overrate their anatomical understanding and performance abilities for BA, FA, and FAS even after performing the procedures and being debriefed three times in 18 mo.
Vanniasegaram, Iyngaram; Cohen, Mazal; Rosen, Stuart
2004-12-01
To compare the auditory function of normal-hearing children attending mainstream schools who were referred for an auditory evaluation because of listening/hearing problems (suspected auditory processing disorders [susAPD]) with that of normal-hearing control children. Sixty-five children with a normal standard audiometric evaluation, ages 6-14 yr (32 of whom were referred for susAPD, with the rest age-matched control children), completed a battery of four auditory tests: a dichotic test of competing sentences; a simple discrimination of short tone pairs differing in fundamental frequency at varying interstimulus intervals (TDT); a discrimination task using consonant cluster minimal pairs of real words (CCMP), and an adaptive threshold task for detecting a brief tone presented either simultaneously with a masker (simultaneous masking) or immediately preceding it (backward masking). Regression analyses, including age as a covariate, were performed to determine the extent to which the performance of the two groups differed on each task. Age-corrected z-scores were calculated to evaluate the effectiveness of the complete battery in discriminating the groups. The performance of the susAPD group was significantly poorer than the control group on all but the masking tasks, which failed to differentiate the two groups. The CCMP discriminated the groups most effectively, as it yielded the lowest number of control children with abnormal scores, and performance in both groups was independent of age. By contrast, the proportion of control children who performed poorly on the competing sentences test was unacceptably high. Together, the CCMP (verbal) and TDT (nonverbal) tasks detected impaired listening skills in 56% of the children who were referred to the clinic, compared with 6% of the control children. Performance on the two tasks was not correlated. Two of the four tests evaluated, the CCMP and TDT, proved effective in differentiating the two groups of children of this study. The application of both tests increased the proportion of susAPD children who performed poorly compared with the application of each test alone, while reducing the proportion of control subjects who performed poorly. The findings highlight the importance of carrying out a complete auditory evaluation in children referred for medical attention, even if their standard audiometric evaluation is unremarkable.
Evaluation of a hyperspectral image database for demosaicking purposes
NASA Astrophysics Data System (ADS)
Larabi, Mohamed-Chaker; Süsstrunk, Sabine
2011-01-01
We present a study on the the applicability of hyperspectral images to evaluate color filter array (CFA) design and the performance of demosaicking algorithms. The aim is to simulate a typical digital still camera processing pipe-line and to compare two different scenarios: evaluate the performance of demosaicking algorithms applied to raw camera RGB values before color rendering to sRGB, and evaluate the performance of demosaicking algorithms applied on the final sRGB color rendered image. The second scenario is the most frequently used one in literature because CFA design and algorithms are usually tested on a set of existing images that are already rendered, such as the Kodak Photo CD set containing the well-known lighthouse image. We simulate the camera processing pipe-line with measured spectral sensitivity functions of a real camera. Modeling a Bayer CFA, we select three linear demosaicking techniques in order to perform the tests. The evaluation is done using CMSE, CPSNR, s-CIELAB and MSSIM metrics to compare demosaicking results. We find that the performance, and especially the difference between demosaicking algorithms, is indeed significant depending if the mosaicking/demosaicking is applied to camera raw values as opposed to already rendered sRGB images. We argue that evaluating the former gives a better indication how a CFA/demosaicking combination will work in practice, and that it is in the interest of the community to create a hyperspectral image dataset dedicated to that effect.
Lima, Ellen Marise; Rzezak, Patricia; Dos Santos, Bernardo; Gentil, Letícia; Montenegro, Maria A; Guerreiro, Marilisa M; Valente, Kette D
2018-05-01
In this study, we aimed to evaluate the attentional and executive functions in patients with benign childhood epilepsy with centrotemporal spikes (BCECTS) with and without attention-deficit hyperactivity disorder (ADHD) compared with controls and compared with patients with ADHD without epilepsy. We evaluated 12 patients with BCECTS and ADHD (66.7% boys; mean age of 9.67years); 11 children with non-ADHD BCECTS (63.6% boys; mean age of 11.91years); 20 healthy children (75% boys; mean age of 10.15years); and 20 subjects with ADHD without epilepsy (60% boys; mean age of 10.9years). We used a comprehensive battery of neuropsychological tests to evaluate attentional and executive functions in their broad domains. Patients with BCECTS and ADHD had worse performance in Conners' Continuous Performance Test II (reaction time standard error [p=0.008], variability [p=0.033], perseverations [p=0.044] and in reaction time interstimuli interval [p=0.016]). Patients with ADHD showed worse performance in Trail Making Test B errors [p=0.012]. In conclusion, patients with BCECTS and ADHD had worse executive and attentional performance compared with controls than non-ADHD patients with BCECTS. Regardless of the presence of epilepsy, ADHD also negatively impacted executive and attentional functions but in different executive subdomains compared with patients with epilepsy. Copyright © 2018 Elsevier Inc. All rights reserved.
[New trends in the evaluation of mathematics learning disabilities. The role of metacognition].
Miranda-Casas, A; Acosta-Escareño, G; Tarraga-Minguez, R; Fernández, M I; Rosel-Remírez, J
2005-01-15
The current trends in the evaluation of mathematics learning disabilities (MLD), based on cognitive and empirical models, are oriented towards combining procedures involving the criteria and the evaluation of cognitive and metacognitive processes, associated to performance in mathematical tasks. The objective of this study is to analyse the metacognitive skills of prediction and evaluation in performing maths tasks and to compare metacognitive performance among pupils with MLD and younger pupils without MLD, who have the same level of mathematical performance. Likewise, we analyse these pupils' desire to learn. Subjects and methods. We compare a total of 44 pupils from the second cycle of primary education (8-10 years old) with and without mathematics learning disabilities. Significant differences are observed between pupils with and without mathematics learning disabilities in their capacity to predict and assess all of the tasks evaluated. As regards their 'desire to learn', no significant differences were found between pupils with and without MLD, which indicated that those with MLD assess their chances of successfully performing maths tasks in the same way as those without MLD. Finally, the findings reveal a similar metacognitive profile in pupils with MLD and the younger pupils with no mathematics learning disabilities. In future studies we consider it important to analyse the influence of the socio-affective belief system in the use of metacognitive skills.
NASA Astrophysics Data System (ADS)
Jeon, Hosang; Kim, Hyunduk; Cha, Bo Kyung; Kim, Jong Yul; Cho, Gyuseong; Chung, Yong Hyun; Yun, Jong-Il
2009-06-01
Presently, the gamma camera system is widely used in various medical diagnostic, industrial and environmental fields. Hence, the quantitative and effective evaluation of its imaging performance is essential for design and quality assurance. The National Electrical Manufacturers Association (NEMA) standards for gamma camera evaluation are insufficient to perform sensitive evaluation. In this study, modulation transfer function (MTF) and normalized noise power spectrum (NNPS) will be suggested to evaluate the performance of small gamma camera with changeable pinhole collimators using Monte Carlo simulation. We simulated the system with a cylinder and a disk source, and seven different pinhole collimators from 1- to 4-mm-diameter pinhole with lead. The MTF and NNPS data were obtained from output images and were compared with full-width at half-maximum (FWHM), sensitivity and differential uniformity. In the result, we found that MTF and NNPS are effective and novel standards to evaluate imaging performance of gamma cameras instead of conventional NEMA standards.
ERIC Educational Resources Information Center
Jenkins, Craig
2015-01-01
This paper is a comparative quantitative evaluation of an approach to teaching poetry in the subject domain of English that employs a "guided discovery" pedagogy using computer-based microworlds. It uses a quasi-experimental design in order to measure performance gains in computational thinking and poetic thinking following a…
DOE Office of Scientific and Technical Information (OSTI.GOV)
Johnson, N.; Davis, S.
1979-07-01
Performance and handling tests on the Calspan RSV were performed in Italy by the Istituto Sperimentale Auto E Motori (ISAM) and in West Germany by Volkswagenwerk AG Wolfsburg. The ISAM tests evaluated the Calspan RSV in the areas of fuel economy, vehicle response, braking and handling, and driver environment. The Volkswagen tests evaluated the Calspan RSV in the areas of braking, steering, handling, and overturning immunity. The ISAM tests are unlike any previously used to evaluate American vehicles. Therefore, the Calspan RSV results are compared to those of ten European cars which had undergone identical tests. The Volkswagen test proceduresmore » were identical to those specified in the Research Safety Vehicle program. The Calspan RSV results are compared to the RSV specifications for these tests.« less
Anesthesiology resident personality type correlates with faculty assessment of resident performance.
Schell, Randall M; Dilorenzo, Amy N; Li, Hsin-Fang; Fragneto, Regina Y; Bowe, Edwin A; Hessel, Eugene A
2012-11-01
To study the association between anesthesiology residents' personality preference types, faculty evaluations of residents' performance, and knowledge. Convenience sample and prospective study. Academic department of anesthesiology. Consenting anesthesiology residents (n = 36). All participants completed the Myers Briggs Type Indicator® (MBTI®). All residents' 6-month summation of daily focal evaluations completed by faculty [daily performance score (DPS); 1 = unsatisfactory, 2 = needs improvement, 3 = meets expectations, 4 = exceeds expectations], as well as a global assessment of performance (GAP) score based on placement of each resident into perceived quartile compared with their peers (ie,1 = first, or top, quartile) by senior faculty (n = 7) who also completed the MBTI, were obtained. The resident MBTI personality preferences were compared with the DPS and GAP scores, the United States Medical Licensing Examination (USMLE) I and II scores, and faculty MBTI personality type. There was no association between personality preference type and performance on standardized examinations (USMLE I, II). The mean GAP score was better (higher quartile score) for Extraverts than Introverts (median 2.0 vs 2.6, P = 0.0047) and for Sensing versus Intuition (median 2.0 vs 2.6, P = 0.0206) preference. Faculty evaluator MBTI preference type did not influence the GAP scores they assigned residents. Like GAP, the DPS was better for residents with Sensing versus Intuition preference (median 3.5 vs 3.3, P = 0.0111). No difference in DPS was noted between Extraverts and Introverts. Personality preference type was not associated with resident performance on standardized examinations, but it was associated with faculty evaluations of resident performance. Residents with Sensing personality preference were evaluated more favorably on global and focal faculty evaluations than those residents who chose the Intuition preference. Extraverted residents were evaluated more favorably on global but not focal assessment of performance. Copyright © 2012 Elsevier Inc. All rights reserved.
Smrkovski, O A; Koo, Y; Kazemi, R; Lembcke, L M; Fathy, A; Liu, Q; Phillips, J C
2013-03-01
Performance and clinical characteristics of a novel hyperthermia antenna operating at 434 MHz were evaluated for the adjuvant treatment of locally advanced superficial tumours in cats, dogs and horses. Electromagnetic simulations were performed to determine electric field characteristics and compared to simulations for a flat microwave antenna with similar dimensions. Simulation results show a reduced skin surface and backfield irradiation and improved directional irradiation (at broadside) compared to a flat antenna. Radiated power and penetration is notably increased with a penetration depth of 4.59 cm compared to 2.74 cm for the flat antenna. Clinical use of the antenna was then evaluated in six animals with locoregionally advanced solid tumours receiving adjuvant chemotherapy. During clinical applications, therapeutic temperatures were achieved at depths ≥4 cm. Objective responses were seen in all patients; tissue toxicity in one case limited further therapy. This antenna provides compact, efficient, focused and deep-penetrating clinical hyperthermia for the treatment of solid tumours in veterinary patients. © 2011 Blackwell Publishing Ltd.
Duke, Mila Morais; Wolfe, Jace; Schafer, Erin
2016-05-01
Cochlear implant (CI) recipients often experience difficulty understanding speech in noise and speech that originates from a distance. Many CI recipients also experience difficulty understanding speech originating from a television. Use of hearing assistance technology (HAT) may improve speech recognition in noise and for signals that originate from more than a few feet from the listener; however, there are no published studies evaluating the potential benefits of a wireless HAT designed to deliver audio signals from a television directly to a CI sound processor. The objective of this study was to compare speech recognition in quiet and in noise of CI recipients with the use of their CI alone and with the use of their CI and a wireless HAT (Cochlear Wireless TV Streamer). A two-way repeated measures design was used to evaluate performance differences obtained in quiet and in competing noise (65 dBA) with the CI sound processor alone and with the sound processor coupled to the Cochlear Wireless TV Streamer. Sixteen users of Cochlear Nucleus 24 Freedom, CI512, and CI422 implants were included in the study. Participants were evaluated in four conditions including use of the sound processor alone and use of the sound processor with the wireless streamer in quiet and in the presence of competing noise at 65 dBA. Speech recognition was evaluated in each condition with two full lists of Computer-Assisted Speech Perception Testing and Training Sentence-Level Test sentences presented from a light-emitting diode television. Speech recognition in noise was significantly better with use of the wireless streamer compared to participants' performance with their CI sound processor alone. There was also a nonsignificant trend toward better performance in quiet with use of the TV Streamer. Performance was significantly poorer when evaluated in noise compared to performance in quiet when the TV Streamer was not used. Use of the Cochlear Wireless TV Streamer designed to stream audio from a television directly to a CI sound processor provides better speech recognition in quiet and in noise when compared to performance obtained with use of the CI sound processor alone. American Academy of Audiology.
Broeckhoven, K.; Cabooter, D.; Desmet, G.
2012-01-01
The reintroduction of superficially porous particles has resulted in a leap forward for the separation performance in liquid chromatography. The underlying reasons for the higher efficiency of columns packed with these particles are discussed. The performance of the newly introduced 5 μm superficially porous particles is evaluated and compared to 2.7 μm superficially porous and 3.5 and 5 μm fully porous columns using typical test compounds (alkylphenones) and a relevant pharmaceutical compound (impurity of amoxicillin). The 5 μm superficially porous particles provide a superior kinetic performance compared to both the 3.5 and 5 μm fully porous particles over the entire relevant range of separation conditions. The performance of the superficially porous particles, however, appears to depend strongly on retention and analyte properties, emphasizing the importance of comparing different columns under realistic conditions (high enough k) and using the compound of interest. PMID:29403833
Piloting an outcome-based programme evaluation tool in undergraduate medical education.
Raupach, Tobias; Schiekirka, Sarah; Münscher, Christian; Beißbarth, Tim; Himmel, Wolfgang; Burckhardt, Gerhard; Pukrop, Tobias
2012-01-01
Different approaches to performance-oriented allocation of resources according to teaching quality are currently being discussed within German medical schools. The implementation of these programmes is impeded by a lack of valid criteria to measure teaching quality. An assessment of teaching quality should include structural and procedural aspects but focus on learning outcome itself. The aim of this study was to implement a novel, outcome-based evaluation tool within the clinical phase of a medical curriculum and address differences between the novel tool and traditional evaluation methods. Student self-assessments before and after completion of a teaching module were used to compute performance gains for specific learning objectives. Mean performance gains in each module were compared to student expectations before the module and data derived from a traditional evaluation tool using overall course ratings at the end of the module. A ranking of the 21 modules according to computed performance gains yielded entirely different results than module rankings based on overall course ratings. There was no significant correlation between performance gain and overall ratings. However, the latter were significantly correlated to student expectations before entering the module as well as structural and procedural parameters (Pearson's r 0.7-0.9). Performance gain computed from comparative self-assessments adds an important new dimension to course evaluation in medical education. In contrast to overall course ratings, the novel tool is less heavily confounded by construct-irrelevant factors. Thus, it appears to be more appropriate than overall course ratings in determining teaching quality and developing algorithms to guide performance-oriented resource allocation in medical education.
Performance of high intensity fed-batch mammalian cell cultures in disposable bioreactor systems.
Smelko, John Paul; Wiltberger, Kelly Rae; Hickman, Eric Francis; Morris, Beverly Janey; Blackburn, Tobias James; Ryll, Thomas
2011-01-01
The adoption of disposable bioreactor technology as an alternate to traditional nondisposable technology is gaining momentum in the biotechnology industry. Evaluation of current disposable bioreactors systems to sustain high intensity fed-batch mammalian cell culture processes needs to be explored. In this study, an assessment was performed comparing single-use bioreactors (SUBs) systems of 50-, 250-, and 1,000-L operating scales with traditional stainless steel (SS) and glass vessels using four distinct mammalian cell culture processes. This comparison focuses on expansion and production stage performance. The SUB performance was evaluated based on three main areas: operability, process scalability, and process performance. The process performance and operability aspects were assessed over time and product quality performance was compared at the day of harvest. Expansion stage results showed disposable bioreactors mirror traditional bioreactors in terms of cellular growth and metabolism. Set-up and disposal times were dramatically reduced using the SUB systems when compared with traditional systems. Production stage runs for both Chinese hamster ovary and NS0 cell lines in the SUB system were able to model SS bioreactors runs at 100-, 200-, 2,000-, and 15,000-L scales. A single 1,000-L SUB run applying a high intensity fed-batch process was able to generate 7.5 kg of antibody with comparable product quality. Copyright © 2011 American Institute of Chemical Engineers (AIChE).
Impact of hybrid delivery of education on student academic performance and the student experience.
Congdon, Heather Brennan; Nutter, Douglas A; Charneski, Lisa; Butko, Peter
2009-11-12
To compare student academic performance and the student experience in the first-year doctor of pharmacy (PharmD) program between the main and newly opened satellite campuses of the University of Maryland. Student performance indicators including graded assessments, course averages, cumulative first-year grade point average (GPA), and introductory pharmacy practice experience (IPPE) evaluations were analyzed retrospectively. Student experience indicators were obtained via an online survey instrument and included involvement in student organizations; time-budgeting practices; and stress levels and their perceived effect on performance. Graded assessments, course averages, GPA, and IPPE evaluations were indistinguishable between campuses. Students' time allocation was not different between campuses, except for time spent attending class and watching lecture videos. There was no difference between students' stress levels at each campus. The implementation of a satellite campus to expand pharmacy education yielded academic performance and student engagement comparable to those from traditional delivery methods.
The use of medical quality indices as a performance-enhancement tool for community clinics.
Elhayany, A
2001-12-01
One of the most important issues for a country, its population and doctors is the effective use of its health system. The extensive variation in doctors' performance leads to a tremendous waste of resources. To combat this, and at the same time ensure that medical quality plays a role when making decisions on interventions, it is essential to equip doctors and clinic directors with information on the quality of the medical care they are providing. In order to assist clinic directors in maintaining medical quality, Clalit Health Services has developed comparative medical indices enabling doctors to compare their performance to that of their colleagues, as well as to the standard and their performance over time. The development of an index to evaluate the quality of medical treatment offered in clinics provides doctors and the health system with an essential tool to lessen the existing variation among doctors and to enhance and evaluate performance.
Rinewalt, Daniel; Williams, Betsy W; Reeves, Anthony P; Shah, Palmi; Hong, Edward; Mulshine, James L
2015-03-01
Higher resolution medical imaging platforms are rapidly emerging, but there is a challenge in applying these tools in a clinically meaningful way. The purpose of the current study was to evaluate a novel three-dimensional (3D) software imaging environment, known as interactive science publishing (ISP), in appraising 3D computed tomography images and to compare this approach with traditional planar (2D) imaging in a series of lung cancer cases. Twenty-four physician volunteers at different levels of training across multiple specialties were recruited to evaluate eight lung cancer-related clinical vignettes. The volunteers were asked to compare the performance of traditional 2D versus the ISP 3D imaging in assessing different visualization environments for diagnostic and measurement processes and to further evaluate the ISP tool in terms of general satisfaction, usability, and probable applicability. Volunteers were satisfied with both imaging methods; however, the 3D environment had significantly higher ratings. Measurement performance was comparable using both traditional 2D and 3D image evaluation. Physicians not trained in 2D measurement approaches versus those with such training demonstrated better performance with ISP and preferred working in the ISP environment. Recent postgraduates with only modest self-administered training performed equally well on 3D and 2D cases. This suggests that the 3D environment has no reduction in accuracy over the conventional 2D approach, while providing the advantage of a digital environment for cross-disciplinary interaction for shared problem solving. Exploration of more effective, efficient, self-directed training could potentially result in further improvement in image evaluation proficiency and potentially decrease training costs. Copyright © 2015. Published by Elsevier Inc.
POCO-MOEA: Using Evolutionary Algorithms to Solve the Controller Placement Problem
2016-03-24
to gather data on POCO-MOEA performance to a series of iv model networks. The algorithm’s behavior is then evaluated and compared to ex- haustive... evaluation of a third heuristic based on a Multi 3 Objective Evolutionary Algorithm (MOEA). This heuristic is modeled after one of the most well known MOEAs...researchers to extend into more realistic evaluations of the performance characteristics of SDN controllers, such as the use of simulators or live
Performance Funding in Higher Education: Panacea or Peril?
ERIC Educational Resources Information Center
Pickens, William H.
The use of performance funding, which provides institutions with income for educational results, was tested in Tennessee. Traditionally, the budget has been separated from performance evaluation, and state formulas have evolved from the need for funding to be objective, comparable, and predictable. The Performance Funding Project in Tennessee,…
The Linear Programming to evaluate the performance of Oral Health in Primary Care.
Colussi, Claudia Flemming; Calvo, Maria Cristina Marino; Freitas, Sergio Fernando Torres de
2013-01-01
To show the use of Linear Programming to evaluate the performance of Oral Health in Primary Care. This study used data from 19 municipalities of Santa Catarina city that participated of the state evaluation in 2009 and have more than 50,000 habitants. A total of 40 indicators were evaluated, calculated using the Microsoft Excel 2007, and converted to the interval [0, 1] in ascending order (one indicating the best situation and zero indicating the worst situation). Applying the Linear Programming technique municipalities were assessed and compared among them according to performance curve named "quality estimated frontier". Municipalities included in the frontier were classified as excellent. Indicators were gathered, and became synthetic indicators. The majority of municipalities not included in the quality frontier (values different of 1.0) had lower values than 0.5, indicating poor performance. The model applied to the municipalities of Santa Catarina city assessed municipal management and local priorities rather than the goals imposed by pre-defined parameters. In the final analysis three municipalities were included in the "perceived quality frontier". The Linear Programming technique allowed to identify gaps that must be addressed by city managers to enhance actions taken. It also enabled to observe each municipal performance and compare results among similar municipalities.
Physical evaluation of color and monochrome medical displays using an imaging colorimeter
NASA Astrophysics Data System (ADS)
Roehrig, Hans; Gu, Xiliang; Fan, Jiahua
2013-03-01
This paper presents an approach to physical evaluation of color and monochrome medical grade displays using an imaging colorimeter. The purpose of this study was to examine the influence of medical display types, monochrome or color at the same maximum luminance settings, on diagnostic performance. The focus was on the measurements of physical characteristics including spatial resolution and noise performance, which we believed could affect the clinical performance. Specifically, Modulation Transfer Function (MTF) and Noise Power Spectrum (NPS) were evaluated and compared at different digital driving levels (DDL) between two EIZO displays.
Pine, P S; Boedigheimer, M; Rosenzweig, B A; Turpaz, Y; He, Y D; Delenstarr, G; Ganter, B; Jarnagin, K; Jones, W D; Reid, L H; Thompson, K L
2008-11-01
Effective use of microarray technology in clinical and regulatory settings is contingent on the adoption of standard methods for assessing performance. The MicroArray Quality Control project evaluated the repeatability and comparability of microarray data on the major commercial platforms and laid the groundwork for the application of microarray technology to regulatory assessments. However, methods for assessing performance that are commonly applied to diagnostic assays used in laboratory medicine remain to be developed for microarray assays. A reference system for microarray performance evaluation and process improvement was developed that includes reference samples, metrics and reference datasets. The reference material is composed of two mixes of four different rat tissue RNAs that allow defined target ratios to be assayed using a set of tissue-selective analytes that are distributed along the dynamic range of measurement. The diagnostic accuracy of detected changes in expression ratios, measured as the area under the curve from receiver operating characteristic plots, provides a single commutable value for comparing assay specificity and sensitivity. The utility of this system for assessing overall performance was evaluated for relevant applications like multi-laboratory proficiency testing programs and single-laboratory process drift monitoring. The diagnostic accuracy of detection of a 1.5-fold change in signal level was found to be a sensitive metric for comparing overall performance. This test approaches the technical limit for reliable discrimination of differences between two samples using this technology. We describe a reference system that provides a mechanism for internal and external assessment of laboratory proficiency with microarray technology and is translatable to performance assessments on other whole-genome expression arrays used for basic and clinical research.
USDA-ARS?s Scientific Manuscript database
To compare the validity of direct pediatric developmental evaluation with developmental screening by parent report, parents completed a developmental screen (the Child Development Review), a pediatrician performed a direct developmental evaluation (Capute Scales), and a psychologist administered the...
Experimental Evaluation of a Water Shield for a Surface Power Reactor
NASA Technical Reports Server (NTRS)
Pearson, J. B.; Reid, R.; Sadasivan, P.; Stewart, E.
2007-01-01
A water based shielding system is being investigated for use on initial lunar surface power systems. The use of water may lower overall cost (as compared to development cost for other materials) and simplify operations in the setup and handling. The thermal hydraulic performance of the shield is of significant interest. The mechanism for transferring heat through the shield is natural convection. A representative lunar surface reactor design is evaluated at various power levels in the Water Shield Testbed (WST) at the NASA Marshall Space Flight Center. The evaluation compares the experimental data from the WST to CFD models. Performance of a water shield on the lunar surface is predicted by CFD models anchored to test data, and by matching relevant dimensionless parameters.
NASA Astrophysics Data System (ADS)
Huynh, Trong-Phuoc; Hwang, Chao-Lung; Yang, Shu-Ti
2017-12-01
This experimental study evaluated the performance of normal ordinary Portland cement (OPC) concrete and high-performance concrete (HPC) that were designed by the conventional method (ACI) and densified mixture design algorithm (DMDA) method, respectively. Engineering properties and durability performance of both the OPC and HPC samples were studied using the tests of workability, compressive strength, water absorption, ultrasonic pulse velocity, and electrical surface resistivity. Test results show that the HPC performed good fresh property and further showed better performance in terms of strength and durability as compared to the OPC.
Evaluating supplier quality performance using analytical hierarchy process
NASA Astrophysics Data System (ADS)
Kalimuthu Rajoo, Shanmugam Sundram; Kasim, Maznah Mat; Ahmad, Nazihah
2013-09-01
This paper elaborates the importance of evaluating supplier quality performance to an organization. Supplier quality performance evaluation reflects the actual performance of the supplier exhibited at customer's end. It is critical in enabling the organization to determine the area of improvement and thereafter works with supplier to close the gaps. Success of the customer partly depends on supplier's quality performance. Key criteria as quality, cost, delivery, technology support and customer service are categorized as main factors in contributing to supplier's quality performance. 18 suppliers' who were manufacturing automotive application parts evaluated in year 2010 using weight point system. There were few suppliers with common rating which led to common ranking observed by few suppliers'. Analytical Hierarchy Process (AHP), a user friendly decision making tool for complex and multi criteria problems was used to evaluate the supplier's quality performance challenging the weight point system that was used for 18 suppliers'. The consistency ratio was checked for criteria and sub-criteria. Final results of AHP obtained with no overlap ratings, therefore yielded a better decision making methodology as compared to weight point rating system.
Prefrontal Neural Activity When Feedback Is Not Relevant to Adjust Performance
Özyurt, Jale; Rietze, Mareike; Thiel, Christiane M.
2012-01-01
It has been shown that the rostral cingulate zone (RCZ) in humans uses both positive and negative feedback to evaluate performance and to flexibly adjust behaviour. Less is known on how the feedback types are processed by the RCZ and other prefrontal brain areas, when feedback can only be used to evaluate performance, but cannot be used to adjust behaviour. The present fMRI study aimed at investigating feedback that can only be used to evaluate performance in a word-learning paradigm. One group of volunteers (N = 17) received informative, performance-dependent positive or negative feedback after each trial. Since new words had to be learnt in each trial, the feedback could not be used for task-specific adaptations. The other group (N = 17) always received non-informative feedback, providing neither information about performance nor about possible task-specific adaptations. Effects of the informational value of feedback were assessed between-subjects, comparing trials with positive and negative informative feedback to non-informative feedback. Effects of feedback valence were assessed by comparing neural activity to positive and negative feedback within the informative-feedback group. Our results show that several prefrontal regions, including the pre-SMA, the inferior frontal cortex and the insula were sensitive to both, the informational value and the valence aspect of the feedback with stronger activations to informative as compared to non-informative feedback and to informative negative compared to informative positive feedback. The only exception was RCZ which was sensitive to the informational value of the feedback, but not to feedback valence. The findings indicate that outcome information per se is sufficient to activate prefrontal brain regions, with the RCZ being the only prefrontal brain region which is equally sensitive to positive and negative feedback. PMID:22615774
NASA Astrophysics Data System (ADS)
Zhou, Xiangrong; Yamada, Kazuma; Kojima, Takuya; Takayama, Ryosuke; Wang, Song; Zhou, Xinxin; Hara, Takeshi; Fujita, Hiroshi
2018-02-01
The purpose of this study is to evaluate and compare the performance of modern deep learning techniques for automatically recognizing and segmenting multiple organ regions on 3D CT images. CT image segmentation is one of the important task in medical image analysis and is still very challenging. Deep learning approaches have demonstrated the capability of scene recognition and semantic segmentation on nature images and have been used to address segmentation problems of medical images. Although several works showed promising results of CT image segmentation by using deep learning approaches, there is no comprehensive evaluation of segmentation performance of the deep learning on segmenting multiple organs on different portions of CT scans. In this paper, we evaluated and compared the segmentation performance of two different deep learning approaches that used 2D- and 3D deep convolutional neural networks (CNN) without- and with a pre-processing step. A conventional approach that presents the state-of-the-art performance of CT image segmentation without deep learning was also used for comparison. A dataset that includes 240 CT images scanned on different portions of human bodies was used for performance evaluation. The maximum number of 17 types of organ regions in each CT scan were segmented automatically and compared to the human annotations by using ratio of intersection over union (IU) as the criterion. The experimental results demonstrated the IUs of the segmentation results had a mean value of 79% and 67% by averaging 17 types of organs that segmented by a 3D- and 2D deep CNN, respectively. All the results of the deep learning approaches showed a better accuracy and robustness than the conventional segmentation method that used probabilistic atlas and graph-cut methods. The effectiveness and the usefulness of deep learning approaches were demonstrated for solving multiple organs segmentation problem on 3D CT images.
ERIC Educational Resources Information Center
Liu, Shujie; Xu, Xianxuan; Grant, Leslie; Strong, James; Fang, Zheng
2017-01-01
This article presents the results of an interpretive policy analysis of China's Ministry of Education Standards (2013) for the professional practice of principals. In addition to revealing the evolution of the evaluation of principals in China and the processes by which this policy is formulated, a comparative analysis was conducted to compare it…
ERIC Educational Resources Information Center
Ishimoto, Michi; Thornton, Ronald K.; Sokoloff, David R.
2014-01-01
This study assesses the Japanese translation of the Force and Motion Conceptual Evaluation (FMCE). Researchers are often interested in comparing the conceptual ideas of students with different cultural backgrounds. The FMCE has been useful in identifying the concepts of English-speaking students from different backgrounds. To identify effectively…
Natt, Neena; Chang, Alice Y.; Berbari, Elie F.; Kennel, Kurt A.; Kearns, Ann E.
2016-01-01
Objective To determine which residency characteristics are associated with performance during endocrinology fellowship training as measured by competency-based faculty evaluation scores and faculty global ratings of trainee performance. Method We performed a retrospective review of interview applications from endocrinology fellows who graduated from a single academic institution between 2006 and 2013. Performance measures included competency-based faculty evaluation scores and faculty global ratings. The association between applicant characteristics and measures of performance during fellowship was examined by linear regression. Results The presence of a laudatory comparative statement in the residency program director’s letter of recommendation (LoR) or experience as a chief resident was significantly associated with competency-based faculty evaluation scores (β = 0.22, P = 0.001; and β = 0.24, P = 0.009, respectively) and faculty global ratings (β = 0.85, P = 0.006; and β = 0.96, P = 0.015, respectively). Conclusion The presence of a laudatory comparative statement in the residency program director’s LoR or experience as a chief resident were significantly associated with overall performance during subspecialty fellowship training. Future studies are needed in other cohorts to determine the broader implications of these findings in the application and selection process. PMID:26437219
NASA Astrophysics Data System (ADS)
Scaduto, David A.; Lubinsky, Anthony R.; Rowlands, John A.; Kenmotsu, Hidenori; Nishimoto, Norihito; Nishino, Takeshi; Tanioka, Kenkichi; Zhao, Wei
2014-03-01
We have previously proposed SAPHIRE (scintillator avalanche photoconductor with high resolution emitter readout), a novel detector concept with potentially superior spatial resolution and low-dose performance compared with existing flat-panel imagers. The detector comprises a scintillator that is optically coupled to an amorphous selenium photoconductor operated with avalanche gain, known as high-gain avalanche rushing photoconductor (HARP). High resolution electron beam readout is achieved using a field emitter array (FEA). This combination of avalanche gain, allowing for very low-dose imaging, and electron emitter readout, providing high spatial resolution, offers potentially superior image quality compared with existing flat-panel imagers, with specific applications to fluoroscopy and breast imaging. Through the present collaboration, a prototype HARP sensor with integrated electrostatic focusing and nano- Spindt FEA readout technology has been fabricated. The integrated electron-optic focusing approach is more suitable for fabricating large-area detectors. We investigate the dependence of spatial resolution on sensor structure and operating conditions, and compare the performance of electrostatic focusing with previous technologies. Our results show a clear dependence of spatial resolution on electrostatic focusing potential, with performance approaching that of the previous design with external mesh-electrode. Further, temporal performance (lag) of the detector is evaluated and the results show that the integrated electrostatic focusing design exhibits comparable or better performance compared with the mesh-electrode design. This study represents the first technical evaluation and characterization of the SAPHIRE concept with integrated electrostatic focusing.
Comparative analysis of on-orbit dynamic performance of several large antenna concepts
NASA Technical Reports Server (NTRS)
Andersen, G. C.; Garrett, L. B.; Calleson, R. E.
1985-01-01
A comparative analysis of the on-orbit dynamic performance of four large anetanna concepts is presented. Among the antenna concepts evaluated are: the box truss; tetrahedral truss; warp-radial rib; and the hoop and column antenna designs. The characteristics and magnitudes of the antennas' dynamic response were evaluated in terms of structural displacements and member loads incurred during various slew-rate maneuvers. The results of the dynamic response analysis are compared to the design requirements of the Land Mobile Satellite System (LMSS) with respect to surface accuracy, decenter, defocus, and angular rocking. Comments are made on the effectiveness of structural damping and the application of active controls for vibrational response reduction. Schematic illustrations of the antenna design concepts are provided.
Improving the performances of autofocus based on adaptive retina-like sampling model
NASA Astrophysics Data System (ADS)
Hao, Qun; Xiao, Yuqing; Cao, Jie; Cheng, Yang; Sun, Ce
2018-03-01
An adaptive retina-like sampling model (ARSM) is proposed to balance autofocusing accuracy and efficiency. Based on the model, we carry out comparative experiments between the proposed method and the traditional method in terms of accuracy, the full width of the half maxima (FWHM) and time consumption. Results show that the performances of our method are better than that of the traditional method. Meanwhile, typical autofocus functions, including sum-modified-Laplacian (SML), Laplacian (LAP), Midfrequency-DCT (MDCT) and Absolute Tenengrad (ATEN) are compared through comparative experiments. The smallest FWHM is obtained by the use of LAP, which is more suitable for evaluating accuracy than other autofocus functions. The autofocus function of MDCT is most suitable to evaluate the real-time ability.
Employee Performance in the Context of the Problems of Measurement and Evaluation in Practice
NASA Astrophysics Data System (ADS)
Szabó, Peter; Mĺkva, Miroslava; Vaňová, Jaromíra; Marková, Petra
2017-09-01
Employee performance is a condition and an assumption for the performance and success of a company on the market. In order to ensure competitive ability, the quality of human resources, their management, and related measurement and performance assessment are at the forefront of company interest. Employee assessment affects the performance, development and motivation of people and also provides the necessary information about the employees. It allows the organization to monitor employee performance and compare their work with other collaborators. Many companies have the problem of setting up evaluation system so that it carried itself elements of responsibility and objectivity. The result of conceptual work in this area is the ultimate use of tools whose deployment, if possible, motivates employees to perform better. The aim of the paper is to refer to problems that arise in companies in evaluating the performance of employees.
ERIC Educational Resources Information Center
Scherer, Paul R.
1978-01-01
Observations of a departmental and non-departmental outpatient service organization are compared regarding student attitude and evaluation, patient care, clinical exposure control, and faculty attitude and performance. Tables show the evaluation topics of each rotation in the departmental system. (LBH)
Kramer, Andrew A; Higgins, Thomas L; Zimmerman, Jack E
2015-02-01
To compare ICU performance using standardized mortality ratios generated by the Acute Physiology and Chronic Health Evaluation IVa and a National Quality Forum-endorsed methodology and examine potential reasons for model-based standardized mortality ratio differences. Retrospective analysis of day 1 hospital mortality predictions at the ICU level using Acute Physiology and Chronic Health Evaluation IVa and National Quality Forum models on the same patient cohort. Forty-seven ICUs at 36 U.S. hospitals from January 2008 to May 2013. Eighty-nine thousand three hundred fifty-three consecutive unselected ICU admissions. None. We assessed standardized mortality ratios for each ICU using data for patients eligible for Acute Physiology and Chronic Health Evaluation IVa and National Quality Forum predictions in order to compare unit-level model performance, differences in ICU rankings, and how case-mix adjustment might explain standardized mortality ratio differences. Hospital mortality was 11.5%. Overall standardized mortality ratio was 0.89 using Acute Physiology and Chronic Health Evaluation IVa and 1.07 using National Quality Forum, the latter having a widely dispersed and multimodal standardized mortality ratio distribution. Model exclusion criteria eliminated mortality predictions for 10.6% of patients for Acute Physiology and Chronic Health Evaluation IVa and 27.9% for National Quality Forum. The two models agreed on the significance and direction of standardized mortality ratio only 45% of the time. Four ICUs had standardized mortality ratios significantly less than 1.0 using Acute Physiology and Chronic Health Evaluation IVa, but significantly greater than 1.0 using National Quality Forum. Two ICUs had standardized mortality ratios exceeding 1.75 using National Quality Forum, but nonsignificant performance using Acute Physiology and Chronic Health Evaluation IVa. Stratification by patient and institutional characteristics indicated that units caring for more severely ill patients and those with a higher percentage of patients on mechanical ventilation had the most discordant standardized mortality ratios between the two predictive models. Acute Physiology and Chronic Health Evaluation IVa and National Quality Forum models yield different ICU performance assessments due to differences in case-mix adjustment. Given the growing role of outcomes in driving prospective payment patient referral and public reporting, performance should be assessed by models with fewer exclusions, superior accuracy, and better case-mix adjustment.
EVALUATIONS ON ASR DAMAGE OF CONCRETE STRUCTURE AND ITS STRUCTURAL PERFORMANCE
NASA Astrophysics Data System (ADS)
Ueda, Naoshi; Nakamura, Hikaru; Kunieda, Minoru; Maeno, Hirofumi; Morishit, Noriaki; Asai, Hiroshi
In this paper, experiments and finite element analyses were conducted in order to evaluate effects of ASR on structural performance of RC and PC structures. From the experimental results, it was confirmed that the ASR expansion was affected by the restraint of reinforcement and the magnitude of prestress. The material properties of concrete damaged by ASR had anisotropic characteristics depending on the degree of ASR expansion. Therefore, when the structural performance of RC and PC structures were evaluated by using the material properties of core concrete, the direction and place where cylinder specimens were cored should be considered. On the other hand, by means of proposed analytical method, ASR expansion behaviors of RC and PC beams and changing of their structural performance were evaluated. As the results, it was confirmed that PC structure had much advantage comparing with RC structure regarding the structural performance under ASR damage because of restraint by prestress against the ASR.
ERIC Educational Resources Information Center
Gong, Yue; Beck, Joseph E.; Heffernan, Neil T.
2011-01-01
Student modeling is a fundamental concept applicable to a variety of intelligent tutoring systems (ITS). However, there is not a lot of practical guidance on how to construct and train such models. This paper compares two approaches for student modeling, Knowledge Tracing (KT) and Performance Factors Analysis (PFA), by evaluating their predictive…
PrismTech Data Distribution Service Java API Evaluation
NASA Technical Reports Server (NTRS)
Riggs, Cortney
2008-01-01
My internship duties with Launch Control Systems required me to start performance testing of an Object Management Group's (OMG) Data Distribution Service (DDS) specification implementation by PrismTech Limited through the Java programming language application programming interface (API). DDS is a networking middleware for Real-Time Data Distribution. The performance testing involves latency, redundant publishers, extended duration, redundant failover, and read performance. Time constraints allowed only for a data throughput test. I have designed the testing applications to perform all performance tests when time is allowed. Performance evaluation data such as megabits per second and central processing unit (CPU) time consumption were not easily attainable through the Java programming language; they required new methods and classes created in the test applications. Evaluation of this product showed the rate that data can be sent across the network. Performance rates are better on Linux platforms than AIX and Sun platforms. Compared to previous C++ programming language API, the performance evaluation also shows the language differences for the implementation. The Java API of the DDS has a lower throughput performance than the C++ API.
The Performance of Fundamental Gross Motor Skills by Children Enrolled in Head Start.
ERIC Educational Resources Information Center
Woodard, Rebecca J.; Yun, Joonkoo
2001-01-01
This study sought to descriptively evaluate the performance of fundamental gross motor skills among Head Start children. Levels of performance were compared and contrasted with performance profiles of the Test of Gross Motor Development. Findings suggest that Head Start curriculum should focus on the importance of developing fundamental gross…
Business School's Performance Management System Standards Design
ERIC Educational Resources Information Center
Azis, Anton Mulyono; Simatupang, Togar M.; Wibisono, Dermawan; Basri, Mursyid Hasan
2014-01-01
This paper aims to compare various Performance Management Systems (PMS) for business school in order to find the strengths of each standard as inputs to design new model of PMS. There are many critical aspects and gaps notified for new model to improve performance and even recognized that self evaluation performance management is not well…
Ryan, James G; Barlas, David; Pollack, Simcha
2013-12-01
Medical knowledge (MK) in residents is commonly assessed by the in-training examination (ITE) and faculty evaluations of resident performance. We assessed the reliability of clinical evaluations of residents by faculty and the relationship between faculty assessments of resident performance and ITE scores. We conducted a cross-sectional, observational study at an academic emergency department with a postgraduate year (PGY)-1 to PGY-3 emergency medicine residency program, comparing summative, quarterly, faculty evaluation data for MK and overall clinical competency (OC) with annual ITE scores, accounting for PGY level. We also assessed the reliability of faculty evaluations using a random effects, intraclass correlation analysis. We analyzed data for 59 emergency medicine residents during a 6-year period. Faculty evaluations of MK and OC were highly reliable (κ = 0.99) and remained reliable after stratification by year of training (mean κ = 0.68-0.84). Assessments of resident performance (MK and OC) and the ITE increased with PGY level. The MK and OC results had high correlations with PGY level, and ITE scores correlated moderately with PGY. The OC and MK results had a moderate correlation with ITE score. When residents were grouped by PGY level, there was no significant correlation between MK as assessed by the faculty and the ITE score. Resident clinical performance and ITE scores both increase with resident PGY level, but ITE scores do not predict resident clinical performance compared with peers at their PGY level.
Ryan, James G.; Barlas, David; Pollack, Simcha
2013-01-01
Background Medical knowledge (MK) in residents is commonly assessed by the in-training examination (ITE) and faculty evaluations of resident performance. Objective We assessed the reliability of clinical evaluations of residents by faculty and the relationship between faculty assessments of resident performance and ITE scores. Methods We conducted a cross-sectional, observational study at an academic emergency department with a postgraduate year (PGY)-1 to PGY-3 emergency medicine residency program, comparing summative, quarterly, faculty evaluation data for MK and overall clinical competency (OC) with annual ITE scores, accounting for PGY level. We also assessed the reliability of faculty evaluations using a random effects, intraclass correlation analysis. Results We analyzed data for 59 emergency medicine residents during a 6-year period. Faculty evaluations of MK and OC were highly reliable (κ = 0.99) and remained reliable after stratification by year of training (mean κ = 0.68–0.84). Assessments of resident performance (MK and OC) and the ITE increased with PGY level. The MK and OC results had high correlations with PGY level, and ITE scores correlated moderately with PGY. The OC and MK results had a moderate correlation with ITE score. When residents were grouped by PGY level, there was no significant correlation between MK as assessed by the faculty and the ITE score. Conclusions Resident clinical performance and ITE scores both increase with resident PGY level, but ITE scores do not predict resident clinical performance compared with peers at their PGY level. PMID:24455005
Isolated cognitive relapses in multiple sclerosis.
Pardini, Matteo; Uccelli, Antonio; Grafman, Jordan; Yaldizli, Özgür; Mancardi, Gianluigi; Roccatagliata, Luca
2014-09-01
While cognition can be affected during sensorimotor multiple sclerosis (MS) relapses, the relevance of isolated cognitive relapses (ICRs ie, those occurring in absence of new sensorimotor symptoms) remain poorly characterised. Here, we decided to explore the relationship between ICR, subjective evaluation of cognitive performance and long-term cognitive decline in a group of subjects with relapsing-remitting MS. We analysed the cognitive performance of 99 clinically stable relapsing-remitting MS for whom data from four consequent clinical and cognitive evaluations were available, that is, a baseline evaluation (t₀), followed in the subsequent 6 months by a second evaluation performed not later than 2 weeks after a routine brain scan positive for at least one area of gadolinium enhancement (t₁) and two gadolinium enhancement-negative follow-up evaluations after 6 months (t₂) and 1 year (t₃) from t₁. Based on published literature, we defined as a meaningful change in cognition a transient reduction of Symbol Digit Modalities Test score of at least four points at t₁ compared with t₀ and t₂. ICRs were found in 17 patients and were not associated with subjective cognitive deficits or depression. Subjects who presented with an ICR at t₁ presented with a significantly reduced cognitive performance at the follow-up evaluations compared with patients without ICR. We showed that ICRs were not associated with changes in mood, fatigue levels or cognitive performance self-evaluations. Our study introduces an operational definition of ICRs and suggests to their role as a factor for cognitive decline in MS. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Bringing a transgenic crop to market: where compositional analysis fits.
Privalle, Laura S; Gillikin, Nancy; Wandelt, Christine
2013-09-04
In the process of developing a biotechnology product, thousands of genes and transformation events are evaluated to select the event that will be commercialized. The ideal event is identified on the basis of multiple characteristics including trait efficacy, the molecular characteristics of the insert, and agronomic performance. Once selected, the commercial event is subjected to a rigorous safety evaluation taking a multipronged approach including examination of the safety of the gene and gene product - the protein, plant performance, impact of cultivating the crop on the environment, agronomic performance, and equivalence of the crop/food to conventional crops/food - by compositional analysis. The compositional analysis is composed of a comparison of the nutrient and antinutrient composition of the crop containing the event, its parental line (variety), and other conventional lines (varieties). Different geographies have different requirements for the compositional analysis studies. Parameters that vary include the number of years (seasons) and locations (environments) to be evaluated, the appropriate comparator(s), analytes to be evaluated, and statistical analysis. Specific examples of compositional analysis results will be presented.
Statistical analysis for validating ACO-KNN algorithm as feature selection in sentiment analysis
NASA Astrophysics Data System (ADS)
Ahmad, Siti Rohaidah; Yusop, Nurhafizah Moziyana Mohd; Bakar, Azuraliza Abu; Yaakub, Mohd Ridzwan
2017-10-01
This research paper aims to propose a hybrid of ant colony optimization (ACO) and k-nearest neighbor (KNN) algorithms as feature selections for selecting and choosing relevant features from customer review datasets. Information gain (IG), genetic algorithm (GA), and rough set attribute reduction (RSAR) were used as baseline algorithms in a performance comparison with the proposed algorithm. This paper will also discuss the significance test, which was used to evaluate the performance differences between the ACO-KNN, IG-GA, and IG-RSAR algorithms. This study evaluated the performance of the ACO-KNN algorithm using precision, recall, and F-score, which were validated using the parametric statistical significance tests. The evaluation process has statistically proven that this ACO-KNN algorithm has been significantly improved compared to the baseline algorithms. The evaluation process has statistically proven that this ACO-KNN algorithm has been significantly improved compared to the baseline algorithms. In addition, the experimental results have proven that the ACO-KNN can be used as a feature selection technique in sentiment analysis to obtain quality, optimal feature subset that can represent the actual data in customer review data.
Thermal Performance of Aircraft Polyurethane Seat Cushions
NASA Technical Reports Server (NTRS)
Kourtides, D. A.; Parker, J. A.
1982-01-01
Aircraft seat materials were evaluated in terms of their thermal performance. The materials were evaluated using (a) thermogravimetric analysis, (b) differential scanning calorimetry, (c) a modified NBS smoke chamber to determine the rate of mass loss and (d) the NASA T-3 apparatus to determine the thermal efficiency. In this paper, the modified NBS smoke chamber will be described in detail since it provided the most conclusive results. The NBS smoke chamber was modified to measure the weight loss of material when exposed to a radiant heat source over the range of 2.5 to 7.5 W/sq cm. This chamber has been utilized to evaluate the thermal performance of various heat blocking layers utilized to protect the polyurethane cushioning foam used in aircraft seats. Various kinds of heat blocking layers were evaluated by monitoring the weight loss of miniature seat cushions when exposed to the radiant heat. The effectiveness of aluminized heat blocking systems was demonstrated when compared to conventional heat blocking layers such as neoprene. All heat blocking systems showed good fire protection capabilities when compared to the state-of-the-art, i.e., wool-nylon over polyurethane foam.
Smith, Jennifer L.; Sturrock, Hugh J. W.; Assefa, Liya; Nikolay, Birgit; Njenga, Sammy M.; Kihara, Jimmy; Mwandawiro, Charles S.; Brooker, Simon J.
2015-01-01
Transmission assessment surveys (TAS) for lymphatic filariasis have been proposed as a platform to assess the impact of mass drug administration (MDA) on soil-transmitted helminths (STHs). This study used computer simulation and field data from pre- and post-MDA settings across Kenya to evaluate the performance and cost-effectiveness of the TAS design for STH assessment compared with alternative survey designs. Variations in the TAS design and different sample sizes and diagnostic methods were also evaluated. The district-level TAS design correctly classified more districts compared with standard STH designs in pre-MDA settings. Aggregating districts into larger evaluation units in a TAS design decreased performance, whereas age group sampled and sample size had minimal impact. The low diagnostic sensitivity of Kato-Katz and mini-FLOTAC methods was found to increase misclassification. We recommend using a district-level TAS among children 8–10 years of age to assess STH but suggest that key consideration is given to evaluation unit size. PMID:25487730
Evaluation of heat sink materials for thermal management of lithium batteries
NASA Astrophysics Data System (ADS)
Dimpault-Darcy, E. C.; Miller, K.
Aluminum, neopentyl glycol (NPG), and resins FT and KT are evaluated theoretically and experimentally as heat sink materials for lithium battery packs. The thermal performances of the two resins are compared in a thermal vacuum experiment. As solutions to the sublimation property were not immediately apparent, a theoretical comparison of the thermal performance of NPG versus KT, Al, and no material, is presented.
Evaluation of heat sink materials for thermal management of lithium batteries
NASA Technical Reports Server (NTRS)
Dimpault-Darcy, E. C.; Miller, K.
1988-01-01
Aluminum, neopentyl glycol (NPG), and resins FT and KT are evaluated theoretically and experimentally as heat sink materials for lithium battery packs. The thermal performances of the two resins are compared in a thermal vacuum experiment. As solutions to the sublimation property were not immediately apparent, a theoretical comparison of the thermal performance of NPG versus KT, Al, and no material, is presented.
Transient multivariable sensor evaluation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Vilim, Richard B.; Heifetz, Alexander
A method and system for performing transient multivariable sensor evaluation. The method and system includes a computer system for identifying a model form, providing training measurement data, generating a basis vector, monitoring system data from sensor, loading the system data in a non-transient memory, performing an estimation to provide desired data and comparing the system data to the desired data and outputting an alarm for a defective sensor.
Comparative assessment of three standardized robotic surgery training methods.
Hung, Andrew J; Jayaratna, Isuru S; Teruya, Kara; Desai, Mihir M; Gill, Inderbir S; Goh, Alvin C
2013-10-01
To evaluate three standardized robotic surgery training methods, inanimate, virtual reality and in vivo, for their construct validity. To explore the concept of cross-method validity, where the relative performance of each method is compared. Robotic surgical skills were prospectively assessed in 49 participating surgeons who were classified as follows: 'novice/trainee': urology residents, previous experience <30 cases (n = 38) and 'experts': faculty surgeons, previous experience ≥30 cases (n = 11). Three standardized, validated training methods were used: (i) structured inanimate tasks; (ii) virtual reality exercises on the da Vinci Skills Simulator (Intuitive Surgical, Sunnyvale, CA, USA); and (iii) a standardized robotic surgical task in a live porcine model with performance graded by the Global Evaluative Assessment of Robotic Skills (GEARS) tool. A Kruskal-Wallis test was used to evaluate performance differences between novices and experts (construct validity). Spearman's correlation coefficient (ρ) was used to measure the association of performance across inanimate, simulation and in vivo methods (cross-method validity). Novice and expert surgeons had previously performed a median (range) of 0 (0-20) and 300 (30-2000) robotic cases, respectively (P < 0.001). Construct validity: experts consistently outperformed residents with all three methods (P < 0.001). Cross-method validity: overall performance of inanimate tasks significantly correlated with virtual reality robotic performance (ρ = -0.7, P < 0.001) and in vivo robotic performance based on GEARS (ρ = -0.8, P < 0.0001). Virtual reality performance and in vivo tissue performance were also found to be strongly correlated (ρ = 0.6, P < 0.001). We propose the novel concept of cross-method validity, which may provide a method of evaluating the relative value of various forms of skills education and assessment. We externally confirmed the construct validity of each featured training tool. © 2013 BJU International.
A follow-up study of adolescent attempted suicide in Israel.
Farbstein, Ilana; Dycian, Anat; Gothelf, Doron; King, Robert A; Cohen, Donald J; Kron, Shmuel; Apter, Alan
2002-11-01
(1) To compare the outcome of adolescent subjects who have made a suicide attempt with the outcome of matched controls, using their psychological and psychometric screening tests for military service at age 16.5 years. Their subsequent performance during military service between ages 18 and 21 was also evaluated. (2) To compare the prognosis of those attempters who received intensive psychiatric inpatient evaluation in a general hospital with the prognosis of those who received emergency room treatment only. The computerized military records of 216 adolescents, who had been treated between 1987 and 1988 for attempted suicide in a general hospital emergency room, prior to their induction into the army, were evaluated. They were rated on the following tests: cognitive/educational performance and psychosocial adaptation, psychiatric and psychological health diagnoses, and performance during their military service between 1989 and 1992. Although the female attempters had slightly more problems in the military than the controls, their overall prognosis was surprisingly good. The male suicide attempters did very poorly in their subsequent military service. There was no long-term advantage in having had a psychiatric evaluation performed in a hospital over a brief emergency room evaluation. Most differences between attempters and controls were in service performance, rather than in cognitive and psychometric tests. There may be marked differences between the sexes in the significance of attempted suicide and in the indications for intervention. The policy of mandatory general hospitalization for suicide attempters may need reevaluation.
Cost effective prevention of reflective cracking of composite pavement.
DOT National Transportation Integrated Search
2011-09-01
Louisiana experimented with various techniques and treatments to control reflective cracking since the : 1970s. The objective of this study is to evaluate and compare different reflective cracking control : treatments by evaluating the performance, c...
Hybrid Electric Vehicle Testing | Transportation Research | NREL
Hybrid Electric Vehicle Evaluations Hybrid Electric Vehicle Evaluations How Hybrid Electric Vehicles Work Hybrid electric vehicles combine a primary power source, an energy storage system, and an performance evaluations of hybrid electric vehicles compared to similar conventional vehicles. Learn about
Connecting Theory to Practice: Evaluating a Brain-Based Writing Curriculum
ERIC Educational Resources Information Center
Griffee, Dale T.
2007-01-01
This 10 week longitudinal evaluation study evaluated a brain-based learning curriculum proposed by Smilkstein (2003) by comparing student performance in a traditional basic writing curriculum with NHLP-oriented basic writing curriculum. The study included two classes each of experimental and traditional methods. Results of the data, gathered by…
Accuracy, reliability, and timing of visual evaluations of decay in fresh-cut lettuce
USDA-ARS?s Scientific Manuscript database
Visual assessments are used for evaluating the quality of food products, such as fresh-cut lettuce packaged in bags with modified atmosphere. We have compared the accuracy and the reliability of visual evaluations of decay on fresh-cut lettuce performed with experienced and inexperienced raters. In ...
Gratton, D G; Kwon, S R; Blanchette, D R; Aquilino, S A
2017-11-01
Proper integration of newly emerging digital assessment tools is a central issue in dental education in an effort to provide more accurate and objective feedback to students. The study examined how the outcomes of students' tooth preparation were correlated when evaluated using traditional faculty assessment and two types of digital assessment approaches. Specifically, incorporation of the Romexis Compare 2.0 (Compare) and Sirona prepCheck 1.1 (prepCheck) systems was evaluated. Additionally, satisfaction of students based on the type of software was evaluated through a survey. Students in a second-year pre-clinical prosthodontics course were allocated to either Compare (n = 42) or prepCheck (n = 37) systems. All students received conventional instruction and used their assigned digital system as an additional evaluation tool to aid in assessing their work. Examinations assessed crown preparations of the maxillary right central incisor (#8) and the mandibular left first molar (#19). All submissions were graded by faculty, Compare and prepCheck. Technical scores did not differ between student groups for any of the assessment approaches. Compare and prepCheck had modest, statistically significant correlations with faculty scores with a minimum correlation of 0.3944 (P = 0.0011) and strong, statistically significant correlations with each other with a minimum correlation of 0.8203 (P < 0.0001). A post-course student survey found that 55.26% of the students felt unfavourably about learning the digital evaluation protocols. A total of 62.31% felt favourably about the integration of these digital tools into the curriculum. Comparison of Compare and prepCheck showed no evidence of significant difference in students' prosthodontics technical performance and perception. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Extra-Vehicular Activity (EVA) glove evaluation test protocol
NASA Technical Reports Server (NTRS)
Hinman-Sweeney, E. M.
1994-01-01
One of the most critical components of a space suit is the gloves, yet gloves have traditionally presented significant design challenges. With continued efforts at glove development, a method for evaluating glove performance is needed. This paper presents a pressure-glove evaluation protocol. A description of this evaluation protocol, and its development is provided. The protocol allows comparison of one glove design to another, or any one design to bare-handed performance. Gloves for higher pressure suits may be evaluated at current and future design pressures to drive out differences in performance due to pressure effects. Using this protocol, gloves may be evaluated during design to drive out design problems and determine areas for improvement, or fully mature designs may be evaluated with respect to mission requirements. Several different test configurations are presented to handle these cases. This protocol was run on a prototype glove. The prototype was evaluated at two operating pressures and in the unpressurized state, with results compared to bare-handed performance. Results and analysis from this test series are provided, as is a description of the configuration used for this test.
Wolfe, Jace; Neumann, Sara; Schafer, Erin; Marsh, Megan; Wood, Mark; Baker, R Stanley
2017-02-01
A number of published studies have demonstrated the benefits of electric-acoustic stimulation (EAS) over conventional electric stimulation for adults with functional low-frequency acoustic hearing and severe-to-profound high-frequency hearing loss. These benefits potentially include better speech recognition in quiet and in noise, better localization, improvements in sound quality, better music appreciation and aptitude, and better pitch recognition. There is, however, a paucity of published reports describing the potential benefits and limitations of EAS for children with functional low-frequency acoustic hearing and severe-to-profound high-frequency hearing loss. The objective of this study was to explore the potential benefits of EAS for children. A repeated measures design was used to evaluate performance differences obtained with EAS stimulation versus acoustic- and electric-only stimulation. Seven users of Cochlear Nucleus Hybrid, Nucleus 24 Freedom, CI512, and CI422 implants were included in the study. Sentence recognition (assayed using the pediatric version of the AzBio sentence recognition test) was evaluated in quiet and at three fixed signal-to-noise ratios (SNR) (0, +5, and +10 dB). Functional hearing performance was also evaluated with the use of questionnaires, including the comparative version of the Speech, Spatial, and Qualities, the Listening Inventory for Education Revised, and the Children's Home Inventory for Listening Difficulties. Speech recognition in noise was typically better with EAS compared to participants' performance with acoustic- and electric-only stimulation, particularly when evaluated at the less favorable SNR. Additionally, in real-world situations, children generally preferred to use EAS compared to electric-only stimulation. Also, the participants' classroom teachers observed better hearing performance in the classroom with the use of EAS. Use of EAS provided better speech recognition in quiet and in noise when compared to performance obtained with use of acoustic- and electric-only stimulation, and children responded favorably to the use of EAS implemented in an integrated sound processor for real-world use. American Academy of Audiology
Investigation of warm-mix asphalt for Iowa roadways.
DOT National Transportation Integrated Search
2013-09-01
Phase II of this study further evaluated the performance of plant-produced warm-mix asphalt (WMA) mixes by conducting : additional mixture performance tests at a broader range of temperatures, adding additional pavements to the study, comparing : vir...
Jorritsma, Wiard; Cnossen, Fokie; Dierckx, Rudi A; Oudkerk, Matthijs; van Ooijen, Peter M A
2016-01-01
To perform a post-deployment usability evaluation of a radiology Picture Archiving and Communication System (PACS) client based on pattern mining of user interaction log data, and to assess the usefulness of this approach compared to a field study. All user actions performed on the PACS client were logged for four months. A data mining technique called closed sequential pattern mining was used to automatically extract frequently occurring interaction patterns from the log data. These patterns were used to identify usability issues with the PACS. The results of this evaluation were compared to the results of a field study based usability evaluation of the same PACS client. The interaction patterns revealed four usability issues: (1) the display protocols do not function properly, (2) the line measurement tool stays active until another tool is selected, rather than being deactivated after one use, (3) the PACS's built-in 3D functionality does not allow users to effectively perform certain 3D-related tasks, (4) users underuse the PACS's customization possibilities. All usability issues identified based on the log data were also found in the field study, which identified 48 issues in total. Post-deployment usability evaluation based on pattern mining of user interaction log data provides useful insights into the way users interact with the radiology PACS client. However, it reveals few usability issues compared to a field study and should therefore not be used as the sole method of usability evaluation. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Van Herpe, Tom; De Brabanter, Jos; Beullens, Martine; De Moor, Bart; Van den Berghe, Greet
2008-01-01
Introduction Blood glucose (BG) control performed by intensive care unit (ICU) nurses is becoming standard practice for critically ill patients. New (semi-automated) 'BG control' algorithms (or 'insulin titration' algorithms) are under development, but these require stringent validation before they can replace the currently used algorithms. Existing methods for objectively comparing different insulin titration algorithms show weaknesses. In the current study, a new approach for appropriately assessing the adequacy of different algorithms is proposed. Methods Two ICU patient populations (with different baseline characteristics) were studied, both treated with a similar 'nurse-driven' insulin titration algorithm targeting BG levels of 80 to 110 mg/dl. A new method for objectively evaluating BG deviations from normoglycemia was founded on a smooth penalty function. Next, the performance of this new evaluation tool was compared with the current standard assessment methods, on an individual as well as a population basis. Finally, the impact of four selected parameters (the average BG sampling frequency, the duration of algorithm application, the severity of disease, and the type of illness) on the performance of an insulin titration algorithm was determined by multiple regression analysis. Results The glycemic penalty index (GPI) was proposed as a tool for assessing the overall glycemic control behavior in ICU patients. The GPI of a patient is the average of all penalties that are individually assigned to each measured BG value based on the optimized smooth penalty function. The computation of this index returns a number between 0 (no penalty) and 100 (the highest penalty). For some patients, the assessment of the BG control behavior using the traditional standard evaluation methods was different from the evaluation with GPI. Two parameters were found to have a significant impact on GPI: the BG sampling frequency and the duration of algorithm application. A higher BG sampling frequency and a longer algorithm application duration resulted in an apparently better performance, as indicated by a lower GPI. Conclusion The GPI is an alternative method for evaluating the performance of BG control algorithms. The blood glucose sampling frequency and the duration of algorithm application should be similar when comparing algorithms. PMID:18302732
Performance evaluation of the time delay digital tanlock loop architectures
NASA Astrophysics Data System (ADS)
Al-Kharji Al-Ali, Omar; Anani, Nader; Al-Qutayri, Mahmoud; Al-Araji, Saleh; Ponnapalli, Prasad
2016-01-01
This article presents the architectures, theoretical analyses and testing results of modified time delay digital tanlock loop (TDTLs) system. The modifications to the original TDTL architecture were introduced to overcome some of the limitations of the original TDTL and to enhance the overall performance of the particular systems. The limitations addressed in this article include the non-linearity of the phase detector, the restricted width of the locking range and the overall system acquisition speed. Each of the modified architectures was tested by subjecting the system to sudden positive and negative frequency steps and comparing its response with that of the original TDTL. In addition, the performance of all the architectures was evaluated under noise-free as well as noisy environments. The extensive simulation results using MATLAB/SIMULINK demonstrate that the new architectures overcome the limitations they addressed and the overall results confirmed significant improvements in performance compared to the conventional TDTL system.
NASA Technical Reports Server (NTRS)
Poeschel, R. L.; Hawthorne, E. I.; Weisman, Y. C.; Frisman, M.; Benson, G. C.; Mcgrath, R. J.; Martinelli, R. M.; Linsenbardt, T. L.; Beattie, J. R.
1977-01-01
Several thrust system design concepts were evaluated and compared using the specifications of the most advanced 30 cm engineering model thruster as the technology base. Emphasis was placed on relatively high power missions (60 to 100 kW) such as a Halley's comet rendezvous. The extensions in thruster performance required for the Halley's comet mission were defined and alternative thrust system concepts were designed in sufficient detail for comparing mass, efficiency, reliability, structure, and thermal characteristics. Confirmation testing and analysis of thruster and power processing components were performed, and the feasibility of satisfying extended performance requirements was verified. A baseline design was selected from the alternatives considered, and the design analysis and documentation were refined. The baseline thrust system design features modular construction, conventional power processing, and a concentrator solar array concept and is designed to interface with the Space Shuttle.
A comparison of distal canal restrictors in primary cemented femoral hip arthroplasty.
Smith, Eric L; Wohlrab, Kurt P; Matzkin, Elizabeth G; Providence, Bertram C
2004-08-01
A retrospective study evaluated 75 total hip arthroplasties performed over a 4-year period using 4 different cement restrictors. A harvested bone restrictor, polyethylene restrictor (Smith & Nephew, Richards Inc, Memphis, Tenn), Biostop G (Depuy Orthopaedics, Warsaw, Ind), and polymethylmethacrylate (PMMA) (Wright Medical Technology, Arlington, Tex) were compared for the percentage of failures, the average length of the cement mantle, and the width of the femoral canal compared to the cement grade. Patient age, sex, and cement type were also evaluated for their influence on cement grade. The PMMA restrictor and bone performed better than the Richards plug and Biostop G restrictor.
Mackenzie, Colin F; Pasley, Jason; Garofalo, Evan; Shackelford, Stacy; Chen, Hegang; Longinaker, Nyaradzo; Granite, Guinevere; Pugh, Kristy; Hagegeorge, George; Tisherman, Samuel A
2017-07-01
Unbiased evaluation of trauma core competency procedures is necessary to determine if residency and predeployment training courses are useful. We tested whether a previously validated individual procedure score (IPS) for individual procedure vascular exposure and fasciotomy (FAS) performance skills could discriminate training status by comparing IPS of evaluators colocated with surgeons to blind video evaluations. Performance of axillary artery (AA), brachial artery (BA), and femoral artery (FA) vascular exposures and lower extremity FAS on fresh cadavers by 40 PGY-2 to PGY-6 residents was video-recorded from head-mounted cameras. Two colocated trained evaluators assessed IPS before and after training. One surgeon in each pretraining tertile of IPS for each procedure was randomly identified for blind video review. The same 12 surgeons were video-recorded repeating the procedures less than 4 weeks after training. Five evaluators independently reviewed all 96 randomly arranged deidentified videos. Inter-rater reliability/consistency, intraclass correlation coefficients were compared by colocated versus video review of IPS, and errors. Study methodology and bias were judged by Medical Education Research Study Quality Instrument and the Quality Assessment of Diagnostic Accuracy Studies criteria. There were no differences (p ≥ 0.5) in IPS for AA, FA, FAS, whether evaluators were colocated or reviewed video recordings. Evaluator consistency was 0.29 (BA) - 0.77 (FA). Video and colocated evaluators were in total agreement (p = 1.0) for error recognition. Intraclass correlation coefficient was 0.73 to 0.92, dependent on procedure. Correlations video versus colocated evaluations were 0.5 to 0.9. Except for BA, blinded video evaluators discriminated (p < 0.002) whether procedures were performed before training versus after training. Study methodology by Medical Education Research Study Quality Instrument criteria scored 15.5/19, Quality Assessment of Diagnostic Accuracy Studies 2 showed low bias risk. Video evaluations of AA, FA, and FAS procedures with IPS are unbiased, valid, and have potential for formative assessments of competency. Prognostic study, level II.
Li, Bo; Tang, Jing; Yang, Qingxia; Cui, Xuejiao; Li, Shuang; Chen, Sijie; Cao, Quanxing; Xue, Weiwei; Chen, Na; Zhu, Feng
2016-12-13
In untargeted metabolomics analysis, several factors (e.g., unwanted experimental &biological variations and technical errors) may hamper the identification of differential metabolic features, which requires the data-driven normalization approaches before feature selection. So far, ≥16 normalization methods have been widely applied for processing the LC/MS based metabolomics data. However, the performance and the sample size dependence of those methods have not yet been exhaustively compared and no online tool for comparatively and comprehensively evaluating the performance of all 16 normalization methods has been provided. In this study, a comprehensive comparison on these methods was conducted. As a result, 16 methods were categorized into three groups based on their normalization performances across various sample sizes. The VSN, the Log Transformation and the PQN were identified as methods of the best normalization performance, while the Contrast consistently underperformed across all sub-datasets of different benchmark data. Moreover, an interactive web tool comprehensively evaluating the performance of 16 methods specifically for normalizing LC/MS based metabolomics data was constructed and hosted at http://server.idrb.cqu.edu.cn/MetaPre/. In summary, this study could serve as a useful guidance to the selection of suitable normalization methods in analyzing the LC/MS based metabolomics data.
Li, Bo; Tang, Jing; Yang, Qingxia; Cui, Xuejiao; Li, Shuang; Chen, Sijie; Cao, Quanxing; Xue, Weiwei; Chen, Na; Zhu, Feng
2016-01-01
In untargeted metabolomics analysis, several factors (e.g., unwanted experimental & biological variations and technical errors) may hamper the identification of differential metabolic features, which requires the data-driven normalization approaches before feature selection. So far, ≥16 normalization methods have been widely applied for processing the LC/MS based metabolomics data. However, the performance and the sample size dependence of those methods have not yet been exhaustively compared and no online tool for comparatively and comprehensively evaluating the performance of all 16 normalization methods has been provided. In this study, a comprehensive comparison on these methods was conducted. As a result, 16 methods were categorized into three groups based on their normalization performances across various sample sizes. The VSN, the Log Transformation and the PQN were identified as methods of the best normalization performance, while the Contrast consistently underperformed across all sub-datasets of different benchmark data. Moreover, an interactive web tool comprehensively evaluating the performance of 16 methods specifically for normalizing LC/MS based metabolomics data was constructed and hosted at http://server.idrb.cqu.edu.cn/MetaPre/. In summary, this study could serve as a useful guidance to the selection of suitable normalization methods in analyzing the LC/MS based metabolomics data. PMID:27958387
Osoga, Joseph; Waitumbi, John; Guyah, Bernard; Sande, James; Arima, Cornel; Ayaya, Michael; Moseti, Caroline; Morang'a, Collins; Wahome, Martin; Achilla, Rachel; Awinda, George; Nyakoe, Nancy; Wanja, Elizabeth
2017-07-24
Early and accurate diagnosis of malaria is important in treatment as well as in the clinical evaluation of drugs and vaccines. Evaluation of Giemsa-stained smears remains the gold standard for malaria diagnosis, although diagnostic errors and potential bias estimates of protective efficacy have been reported in practice. Plasmodium genus fluorescent in situ hybridization (P-Genus FISH) is a microscopy-based method that uses fluorescent labelled oligonucleotide probes targeted to pathogen specific ribosomal RNA fragments to detect malaria parasites in whole blood. This study sought to evaluate the diagnostic performance of P-Genus FISH alongside Giemsa microscopy compared to quantitative reverse transcription polymerase chain reaction (qRT-PCR) in a clinical setting. Five hundred study participants were recruited prospectively and screened for Plasmodium parasites by P-Genus FISH assay, and Giemsa microscopy. The microscopic methods were performed by two trained personnel and were blinded, and if the results were discordant a third reading was performed as a tie breaker. The diagnostic performance of both methods was evaluated against qRT-PCR as a more sensitive method. The number of Plasmodium positive cases was 26.8% by P-Genus FISH, 33.2% by Giemsa microscopy, and 51.2% by qRT-PCR. The three methods had 46.8% concordant results with 61 positive cases and 173 negative cases. Compared to qRT-PCR the sensitivity and specificity of P-Genus FISH assay was 29.3 and 75.8%, respectively, while microscopy had 58.2 and 93.0% respectively. Microscopy had a higher positive and negative predictive values (89.8 and 68.0% respectively) compared to P-Genus FISH (56.0 and 50.5%). In overall, microscopy had a good measure of agreement (76%, k = 0.51) compared to P-Genus FISH (52%, k = 0.05). The diagnostic performance of P-Genus FISH was shown to be inferior to Giemsa microscopy in the clinical samples. This hinders the possible application of the method in the field despite the many advantages of the method especially diagnosis of low parasite density infections. The P-Genus assay has great potential but application of the method in clinical setting would rely on extensive training of microscopist and continuous proficiency testing.
OAO-3 end of mission power subsystem evaluation
NASA Technical Reports Server (NTRS)
Tasevoli, M.
1982-01-01
End of mission tests were performed on the OAO-3 power subsystem in three component areas: solar array, nickel-cadmium batteries and the On-Board Processor (OBP) power boost operation. Solar array evaluation consisted of analyzing array performance characteristics and comparing them to earlier flight data. Measured solar array degradation of 14.1 to 17.7% after 8 1/3 years is in good agreement with theortical radiation damage losses. Battery discharge characteristics were compared to results of laboratory life cycle tests performed on similar cells. Comparison of cell voltage profils reveals close correlation and confirms the validity of real time life cycle simulation. The successful operation of the system in the OBP/power boost regulation mode demonstrates the excellent life, reliability and greater system utilization of power subsystems using maximum power trackers.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fan, Chengguang; Drinkwater, Bruce W.
In this paper the performance of total focusing method is compared with the widely used time-reversal MUSIC super resolution technique. The algorithms are tested with simulated and experimental ultrasonic array data, each containing different noise levels. The simulated time domain signals allow the effects of array geometry, frequency, scatterer location, scatterer size, scatterer separation and random noise to be carefully controlled. The performance of the imaging algorithms is evaluated in terms of resolution and sensitivity to random noise. It is shown that for the low noise situation, time-reversal MUSIC provides enhanced lateral resolution when compared to the total focusing method.more » However, for higher noise levels, the total focusing method shows robustness, whilst the performance of time-reversal MUSIC is significantly degraded.« less
A multifaceted independent performance analysis of facial subspace recognition algorithms.
Bajwa, Usama Ijaz; Taj, Imtiaz Ahmad; Anwar, Muhammad Waqas; Wang, Xuan
2013-01-01
Face recognition has emerged as the fastest growing biometric technology and has expanded a lot in the last few years. Many new algorithms and commercial systems have been proposed and developed. Most of them use Principal Component Analysis (PCA) as a base for their techniques. Different and even conflicting results have been reported by researchers comparing these algorithms. The purpose of this study is to have an independent comparative analysis considering both performance and computational complexity of six appearance based face recognition algorithms namely PCA, 2DPCA, A2DPCA, (2D)(2)PCA, LPP and 2DLPP under equal working conditions. This study was motivated due to the lack of unbiased comprehensive comparative analysis of some recent subspace methods with diverse distance metric combinations. For comparison with other studies, FERET, ORL and YALE databases have been used with evaluation criteria as of FERET evaluations which closely simulate real life scenarios. A comparison of results with previous studies is performed and anomalies are reported. An important contribution of this study is that it presents the suitable performance conditions for each of the algorithms under consideration.
New developments in supra-threshold perimetry.
Henson, David B; Artes, Paul H
2002-09-01
To describe a series of recent enhancements to supra-threshold perimetry. Computer simulations were used to develop an improved algorithm (HEART) for the setting of the supra-threshold test intensity at the beginning of a field test, and to evaluate the relationship between various pass/fail criteria and the test's performance (sensitivity and specificity) and how they compare with modern threshold perimetry. Data were collected in optometric practices to evaluate HEART and to assess how the patient's response times can be analysed to detect false positive response errors in visual field test results. The HEART algorithm shows improved performance (reduced between-eye differences) over current algorithms. A pass/fail criterion of '3 stimuli seen of 3-5 presentations' at each test location reduces test/retest variability and combines high sensitivity and specificity. A large percentage of false positive responses can be detected by comparing their latencies to the average response time of a patient. Optimised supra-threshold visual field tests can perform as well as modern threshold techniques. Such tests may be easier to perform for novice patients, compared with the more demanding threshold tests.
Kao, Ying-Chia; Kramer, Jessica M; Liljenquist, Kendra; Tian, Feng; Coster, Wendy J
2012-01-01
OBJECTIVE. We compared the functional performance of children with autism spectrum disorders (ASD), intellectual and developmental disabilities (IDD), and without disabilities using the revised Pediatric Evaluation of Disability Inventory-Computer Adaptive Test (PEDI-CAT) Social/Cognitive, Daily Activities, and Responsibility domains. METHOD. A nationally representative sample of parents of children ages 0-21 without disabilities (n = 2,205), with ASD (n = 108), or with IDD (n = 150) completed an online survey. We obtained predicted PEDI-CAT scaled scores for three reference ages (5, 10, 15) from a modified analysis of covariance model and compared each group's scores using contrasts of the regression parameters. RESULTS. We found no significant differences between the ASD and IDD groups. The group with ASD demonstrated significantly lower performance than the group without disabilities across the three domains at ages 10 and 15. CONCLUSION. Scores on the PEDI-CAT differentiated the group with ASD from the group without disabilities. Children with ASD and IDD did not demonstrate different performance profiles. Copyright © 2012 by the American Occupational Therapy Association, Inc.
Evaluation of winter pothole patching methods.
DOT National Transportation Integrated Search
2014-01-01
The main objective of this study was to evaluate the performance and cost-effectiveness of the tow-behind combination : infrared asphalt heater/reclaimer patching method and compare it to the throw and roll and spray injection methods. To : achieve t...
Pulmonary tumor measurements from x-ray computed tomography in one, two, and three dimensions.
Villemaire, Lauren; Owrangi, Amir M; Etemad-Rezai, Roya; Wilson, Laura; O'Riordan, Elaine; Keller, Harry; Driscoll, Brandon; Bauman, Glenn; Fenster, Aaron; Parraga, Grace
2011-11-01
We evaluated the accuracy and reproducibility of three-dimensional (3D) measurements of lung phantoms and patient tumors from x-ray computed tomography (CT) and compared these to one-dimensional (1D) and two-dimensional (2D) measurements. CT images of three spherical and three irregularly shaped tumor phantoms were evaluated by three observers who performed five repeated measurements. Additionally, three observers manually segmented 29 patient lung tumors five times each. Follow-up imaging was performed for 23 tumors and response criteria were compared. For a single subject, imaging was performed on nine occasions over 2 years to evaluate multidimensional tumor response. To evaluate measurement accuracy, we compared imaging measurements to ground truth using analysis of variance. For estimates of precision, intraobserver and interobserver coefficients of variation and intraclass correlations (ICC) were used. Linear regression and Pearson correlations were used to evaluate agreement and tumor response was descriptively compared. For spherical shaped phantoms, all measurements were highly accurate, but for irregularly shaped phantoms, only 3D measurements were in high agreement with ground truth measurements. All phantom and patient measurements showed high intra- and interobserver reproducibility (ICC >0.900). Over a 2-year period for a single patient, there was disagreement between tumor response classifications based on 3D measurements and those generated using 1D and 2D measurements. Tumor volume measurements were highly reproducible and accurate for irregular, spherical phantoms and patient tumors with nonuniform dimensions. Response classifications obtained from multidimensional measurements suggest that 3D measurements provide higher sensitivity to tumor response. Copyright © 2011 AUR. Published by Elsevier Inc. All rights reserved.
ERIC Educational Resources Information Center
Lenkeit, Jenny; Caro, Daniel H.
2014-01-01
Reports of international large-scale assessments tend to evaluate and compare education system performance based on absolute scores. And policymakers refer to high-performing and economically prosperous education systems to enhance their own systemic features. But socioeconomic differences between systems compromise the plausibility of those…
PAINT ADHESION AND CORROSION PERFORMANCE OF CHROMIUM-FREE PRETREATMENTS OF 55% AL-ZN-COATED STEEL
The adhesion and corrosion performances for several pretreatments of 55% Al-Zn-coated steels which were coil-coated with polyester paint systems were determined. The objective of this study was to evaluate new, silane-based metal pretreatments and to compare their performance wit...
Evaluating Performances of Solar-Energy Systems
NASA Technical Reports Server (NTRS)
Jaffe, L. D.
1987-01-01
CONC11 computer program calculates performances of dish-type solar thermal collectors and power systems. Solar thermal power system consists of one or more collectors, power-conversion subsystems, and powerprocessing subsystems. CONC11 intended to aid system designer in comparing performance of various design alternatives. Written in Athena FORTRAN and Assembler.
A Data-Driven Design Evaluation Tool for Handheld Device Soft Keyboards
Trudeau, Matthieu B.; Sunderland, Elsie M.; Jindrich, Devin L.; Dennerlein, Jack T.
2014-01-01
Thumb interaction is a primary technique used to operate small handheld devices such as smartphones. Despite the different techniques involved in operating a handheld device compared to a personal computer, the keyboard layouts for both devices are similar. A handheld device keyboard that considers the physical capabilities of the thumb may improve user experience. We developed and applied a design evaluation tool for different geometries of the QWERTY keyboard using a performance evaluation model. The model utilizes previously collected data on thumb motor performance and posture for different tap locations and thumb movement directions. We calculated a performance index (PITOT, 0 is worst and 2 is best) for 663 designs consisting in different combinations of three variables: the keyboard's radius of curvature (R) (mm), orientation (O) (°), and vertical location on the screen (L). The current standard keyboard performed poorly (PITOT = 0.28) compared to other designs considered. Keyboard location (L) contributed to the greatest variability in performance out of the three design variables, suggesting that designers should modify this variable first. Performance was greatest for designs in the middle keyboard location. In addition, having a slightly upward curve (R = −20 mm) and orientated perpendicular to the thumb's long axis (O = −20°) improved performance to PITOT = 1.97. Poorest performances were associated with placement of the keyboard's spacebar in the bottom right corner of the screen (e.g., the worst was for R = 20 mm, O = 40°, L = Bottom (PITOT = 0.09)). While this evaluation tool can be used in the design process as an ergonomic reference to promote user motor performance, other design variables such as visual access and usability still remain unexplored. PMID:25211465
Airola, Antti; Pyysalo, Sampo; Björne, Jari; Pahikkala, Tapio; Ginter, Filip; Salakoski, Tapio
2008-11-19
Automated extraction of protein-protein interactions (PPI) is an important and widely studied task in biomedical text mining. We propose a graph kernel based approach for this task. In contrast to earlier approaches to PPI extraction, the introduced all-paths graph kernel has the capability to make use of full, general dependency graphs representing the sentence structure. We evaluate the proposed method on five publicly available PPI corpora, providing the most comprehensive evaluation done for a machine learning based PPI-extraction system. We additionally perform a detailed evaluation of the effects of training and testing on different resources, providing insight into the challenges involved in applying a system beyond the data it was trained on. Our method is shown to achieve state-of-the-art performance with respect to comparable evaluations, with 56.4 F-score and 84.8 AUC on the AImed corpus. We show that the graph kernel approach performs on state-of-the-art level in PPI extraction, and note the possible extension to the task of extracting complex interactions. Cross-corpus results provide further insight into how the learning generalizes beyond individual corpora. Further, we identify several pitfalls that can make evaluations of PPI-extraction systems incomparable, or even invalid. These include incorrect cross-validation strategies and problems related to comparing F-score results achieved on different evaluation resources. Recommendations for avoiding these pitfalls are provided.
Performance of universal health insurance: lessons from South Korea.
Moon, Sangho; Shin, Jaeun
2007-04-01
The aim of this study was to assess the maturity of the South Korean healthcare system in comparison with those of the 30 countries of the Organization for Economic Co-operation and Development (OECD) and to provide a foundation to evaluate the performance of the South Korean healthcare system. Using OECD Health Data 2005, we evaluated the performance of the healthcare system of the 30 industrialized countries. The evaluation focused on three dimensions that have remained central to healthcare debates internationally for years: access, cost and outcomes. Although South Korea has successfully implemented its universal health insurance scheme in a very short period of time and possesses highly advanced medical technologies, we found that South Koreans incurred more out-of-pocket expenditures on healthcare. Health outcomes were of relatively low quality compared with those of other OECD countries, but compared relatively well with the four countries (Greece, New Zealand, Portugal and Spain) with similar per capita gross domestic product (GDP).
Performance of universal health insurance: lessons from South Korea.
Moon, Sangho; Shin, Jaeun
2007-01-01
The aim of this study was to assess the maturity of the South Korean healthcare system in comparison with those of the 30 countries of the Organization for Economic Co-operation and Development (OECD) and to provide a foundation to evaluate the performance of the South Korean healthcare system. Using OECD Health Data 2005, we evaluated the performance of the healthcare system of the 30 industrialized countries. The evaluation focused on three dimensions that have remained central to healthcare debates internationally for years: access, cost and outcomes. Although South Korea has successfully implemented its universal health insurance scheme in a very short period of time and possesses highly advanced medical technologies, we found that South Koreans incurred more out-of-pocket expenditures on healthcare. Health outcomes were of relatively low quality compared with those of other OECD countries, but compared relatively well with the four countries (Greece, New Zealand, Portugal and Spain) with similar per capita gross domestic product (GDP).
Hummel, J M Marjan; Snoek, Govert J; van Til, Janine A; van Rossum, Wouter; Ijzerman, Maarten J
2005-01-01
This study supported the evaluation by a rehabilitation team of the performance of two treatment options that improve the arm-hand function in subjects with sixth cervical vertebra (C6) level Motor Group 2 tetraplegia. The analytic hierarchy process, a technique for multicriteria decision analysis, was used by a rehabilitation team and potential recipients to quantitatively compare a new technology, Functional Elec trical Stimulation (FES), with conventional surgery. Perform-ance was measured by functional improvement, treatment load, risks, user-friendliness, and social outcomes. Functional improvement after FES was considered better than that after conventional surgery. However, the rehabilitation team's overall rating for conventional surgery was slightly higher than that for FES (57% vs 44%). Compared with the rehabilitation team, potential recipients gave greater weight to burden of treatment and less weight to functional improvement. This study shows that evaluation of new technology must be more comprehensive than the evaluation of functional improvement alone, and that patient preferences may differ from those of the rehabilitation team.
ERIC Educational Resources Information Center
Chae, Ho-Chang
2009-01-01
This study empirically examines the impact of IT capability on firms' performance and evaluates whether firms' IT capabilities play a role in improving employee capability, customer value, customer satisfaction, and ultimately business performance. The results were based on comparing the business performance of the IT leader companies with that of…
Rutz, Matt A; Clary, Julie M; Kline, Jeffrey A; Russell, Frances M
2017-07-01
Focused cardiac ultrasound (FOCUS) is a useful tool in evaluating patients presenting to the emergency department (ED) with acute dyspnea. Prior work has shown that right ventricular (RV) dilation is associated with repeat hospitalizations and shorter life expectancy. Traditionally, RV assessment has been evaluated by cardiologist-interpreted comprehensive echocardiography. The primary goal of this study was to determine the inter-rater reliability between emergency physicians (EPs) and a cardiologist for determining RV dilation on FOCUS performed on ED patients with acute dyspnea. This was a prospective, observational study at two urban academic EDs; patients were enrolled if they had acute dyspnea and a computed tomographic pulmonary angiogram without acute disease. All patients had an EP-performed FOCUS to assess for RV dilation. RV dilation was defined as an RV to left ventricular ratio greater than 1. FOCUS interpretations were compared to a blinded cardiologist FOCUS interpretation using agreement and kappa statistics. Of 84 FOCUS examinations performed on 83 patients, 17% had RV dilation. Agreement and kappa, for EP-performed FOCUS for RV dilation were 89% (95% confidence interval [CI] 80-95%) and 0.68 (95% CI 0.48-0.88), respectively. Emergency physician sonographers are able to detect RV dilation with good agreement when compared to cardiology. These results support the wider use of EP-performed FOCUS to evaluate for RV dilation in ED patients with dyspnea. © 2017 by the Society for Academic Emergency Medicine.
Comparative performance evaluation of a new a-Si EPID that exceeds quad high-definition resolution.
McConnell, Kristen A; Alexandrian, Ara; Papanikolaou, Niko; Stathakis, Sotiri
2018-01-01
Electronic portal imaging devices (EPIDs) are an integral part of the radiation oncology workflow for treatment setup verification. Several commercial EPID implementations are currently available, each with varying capabilities. To standardize performance evaluation, Task Group Report 58 (TG-58) and TG-142 outline specific image quality metrics to be measured. A LinaTech Image Viewing System (IVS), with the highest commercially available pixel matrix (2688x2688 pixels), was independently evaluated and compared to an Elekta iViewGT (1024x1024 pixels) and a Varian aSi-1000 (1024x768 pixels) using a PTW EPID QC Phantom. The IVS, iViewGT, and aSi-1000 were each used to acquire 20 images of the PTW QC Phantom. The QC phantom was placed on the couch and aligned at isocenter. The images were exported and analyzed using the epidSoft image quality assurance (QA) software. The reported metrics were signal linearity, isotropy of signal linearity, signal-tonoise ratio (SNR), low contrast resolution, and high-contrast resolution. These values were compared between the three EPID solutions. Computed metrics demonstrated comparable results between the EPID solutions with the IVS outperforming the aSi-1000 and iViewGT in the low and high-contrast resolution analysis. The performance of three commercial EPID solutions have been quantified, evaluated, and compared using results from the PTW QC Phantom. The IVS outperformed the other panels in low and high-contrast resolution, but to fully realize the benefits of the IVS, the selection of the monitor on which to view the high-resolution images is important to prevent down sampling and visual of resolution.
ERIC Educational Resources Information Center
Zorrilla-Silvestre, Lorena; Presentación-Herrero, María Jesús; Gil-Gómez, Jesús
2016-01-01
Introduction: This study explored the variables of executive functioning (EF) that permitted the evaluation of EF both at home and at school. The objective was to compare the results of the evaluations of these functions in children aged 5 to 6 years, and see to what extent these variables predicted mathematics performance best. Method: Sixty-six…
Project Startup: Evaluating the Performance of Electric Buses
DOE Office of Scientific and Technical Information (OSTI.GOV)
2016-04-01
The National Renewable Energy Laboratory (NREL) is evaluating the in-service performance of fast-charge battery electric buses compared to compressed natural gas (CNG) buses operated by Foothill Transit in West Covina, California. Launched in 2015 in collaboration with the California Air Resources Board, this study aims to improve understanding of the overall use and effectiveness of fast-charge electric buses and associated charging infrastructure in transit operation.
Impact of Hybrid Delivery of Education on Student Academic Performance and the Student Experience
Nutter, Douglas A.; Charneski, Lisa; Butko, Peter
2009-01-01
Objectives To compare student academic performance and the student experience in the first-year doctor of pharmacy (PharmD) program between the main and newly opened satellite campuses of the University of Maryland. Methods Student performance indicators including graded assessments, course averages, cumulative first-year grade point average (GPA), and introductory pharmacy practice experience (IPPE) evaluations were analyzed retrospectively. Student experience indicators were obtained via an online survey instrument and included involvement in student organizations; time-budgeting practices; and stress levels and their perceived effect on performance. Results Graded assessments, course averages, GPA, and IPPE evaluations were indistinguishable between campuses. Students' time allocation was not different between campuses, except for time spent attending class and watching lecture videos. There was no difference between students' stress levels at each campus. Conclusions The implementation of a satellite campus to expand pharmacy education yielded academic performance and student engagement comparable to those from traditional delivery methods. PMID:19960080
Reading performance with large fonts on high-resolution displays
NASA Astrophysics Data System (ADS)
Powers, Maureen K.; Larimer, James O.; Gille, Jennifer; Liu, Hsien-Chang
2004-06-01
Reading is a fundamental task and skill in many environments including business, education, and the home. Today, reading often occurs on electronic displays in addition to traditional hard copy media such as books and magazines, presenting issues of legibility and other factors that can affect human performance [1]. In fact, the transition to soft copy media for text images is often met with worker complaints about their vision and comfort while reading [2-6]. Careful comparative evaluations of reading performance across hard and soft copy device types are rare, even though they are clearly important given the rapid and substantial improvements in soft copy devices available in the marketplace over the last 5 years. To begin to fill this evaluation gap, we compared reading performance on three different soft copy devices and traditional paper. This study does not investigate comfort factors such as display location, seating comfort, and more general issues of lighting, rather we focus instead on a narrow examination of reading performance differences across display types when font sizes are large.
Gowd, Snigdha; Shankar, T; Dash, Samarendra; Sahoo, Nivedita; Chatterjee, Suravi; Mohanty, Pritam
2017-01-01
The aim of the study was to evaluate the reliability of cone beam computed tomography (CBCT) obtained image over plaster model for the assessment of mixed dentition analysis. Thirty CBCT-derived images and thirty plaster models were derived from the dental archives, and Moyer's and Tanaka-Johnston analyses were performed. The data obtained were interpreted and analyzed statistically using SPSS 10.0/PC (SPSS Inc., Chicago, IL, USA). Descriptive and analytical analysis along with Student's t -test was performed to qualitatively evaluate the data and P < 0.05 was considered statistically significant. Statistically, significant results were obtained on data comparison between CBCT-derived images and plaster model; the mean for Moyer's analysis in the left and right lower arch for CBCT and plaster model was 21.2 mm, 21.1 mm and 22.5 mm, 22.5 mm, respectively. CBCT-derived images were less reliable as compared to data obtained directly from plaster model for mixed dentition analysis.
Performance of hepatitis B assays on the Bayer ADVIA Centaur Immunoassay System.
van Helden, Josef; Denoyel, Gérard; Karwowska, Sylwia; Reamer, Randy; Schmalz, John; Wright, Ted; Preisel-Simmons, Barbara
2004-01-01
Bayer HealthCare LLC, Diagnostics Division, has developed several new assays on the ADVIA Centaur immunoassay system for the detection of markers of hepatitis B virus infection in human serum and plasma. This panel includes assays for: hepatitis B surface antigen (HBsAg), a confirmatory test method for HBsAg, antibodies to hepatitis B surface antigen (anti-HBs), IgM and IgG antibodies to hepatitis B core antigen (anti-HBc Total) and IgM antibodies to hepatitis B core antigen (anti-HBc IgM). These assays employ magnetic particle separation technology with direct chemiluminescence for optimal assay performance. All of the assays are fully automated, require sample volumes ranging from 15 microl to 100 microl (with the exception of the ADVIA Centaur HBsAg Confirmatory Assay, which requires 2 x 100 microl), and have throughputs of up to 240 tests per hour. The five ADVIA Centaur HBV assays were tested in extensive performance evaluations conducted at two sites in Europe. The performance evaluations, which included samples from HBV-infected individuals, blood donors, hospitalized/clinical patients, and HBV vaccinees (for Anti-HBs evaluation), generated performance data in support of obtaining the Communautés Européennes (CE) mark for European market distribution. The HBV performance evaluations resulted in an overall diagnostic specificity > 99%, i.e. 99.94% for the ADVIA Centaur HBsAg Assay, 100% for the ADVIA Centaur Anti-HBs Assay, 100% for the ADVIA Centaur HBc IgM Assay and 99.94% for the ADVIA Centaur HBc Total Assay. All of the ADVIA Centaur assays showed a very good diagnostic sensitivity on these populations with 100% for the ADVIA Centaur HBsAg Assay, 99.0% for the ADVIA Centaur Anti-HBs Assay, 98.53% for the ADVIA Centaur HBc IgM Assay and 100% for the ADVIA Centaur HBc Total Assay. The ADVIA Centaur HBsAg Confirmatory Test confirmed 100% of the positive HBsAg samples. Testing of interfering substances and potential cross-reacting samples for all ADVIA Centaur HBV assays resulted in no change in interpretation of the results. Assay performance was further evaluated using HBV seroconversion panels with comparable or better results when compared to the comparison assays. The performance evaluation data demonstrate that the ADVIA Centaur HBV assays are specific and sensitive automated immunoassays for detection of antigens and antibodies to hepatitis B virus with performance that is comparable to those of currently marketed assays. Additionally, these assays have the advantage of being available on the ADVIA Centaur immunoassay system, which provides for the flexibility of high throughput and full automation.
Gadelha, A; Vendramini, A M; Yonamine, C M; Nering, M; Berberian, A; Suiama, M A; Oliveira, V; Lima-Landman, M T; Breen, G; Bressan, R A; Abílio, V; Hayashi, M A F
2015-12-08
In schizophrenia (SCZ), higher angiotensin I-converting enzyme (ACE) levels have been reported in patient's blood and cerebrospinal fluid (CSF). Hereby, we propose to explore whether the ACE activity levels are associated to cognitive performance in SCZ. Seventy-two patients with SCZ or schizoaffective disorder diagnosis, and 69 healthy controls (HCs) underwent a cognitive battery with parallel collection of peripheral blood samples to measure ACE activity. Significant higher ACE activity levels were confirmed in the plasma of SCZ patients compared with HCs (Student's t=-5.216; P<0.001). ACE activity significantly correlated to Hopkins delayed recall measures (r=-0.247; P=0.004) and Hopkins total (r=-0.214; P=0.012). Subjects grouped as high ACE activity (above average) had worse performance compared with low ACE activity level group for Hopkins delayed recall measure, even after correction for clinical condition, age, gender and years of education (P=0.029). The adjusted R squared for this final model was 0.343. This result was evident only comparing extreme groups for ACE activity, when splitting the sample in three groups with similar number of subjects. To clarify this finding, we performed an evaluation of the cognitive performance of transgenic mice with three copies of ACE gene in novel object recognition (NOR) test, which showed that such animals presented impairment in NOR (P<0.05) compared with two copies of wild-type animals. The results observed in SCZ patients and animal model suggest both the association of ACE to cognitive deficits in SCZ. This finding may support the evaluation of novel treatment protocols and/or of innovative drugs for specific intervention of cognitive deficits in SCZ envisioning concomitant ACE activity and behavior evaluations.
Aggregate Interview Method of ranking orthopedic applicants predicts future performance.
Geissler, Jacqueline; VanHeest, Ann; Tatman, Penny; Gioe, Terence
2013-07-01
This article evaluates and describes a process of ranking orthopedic applicants using what the authors term the Aggregate Interview Method. The authors hypothesized that higher-ranking applicants using this method at their institution would perform better than those ranked lower using multiple measures of resident performance. A retrospective review of 115 orthopedic residents was performed at the authors' institution. Residents were grouped into 3 categories by matching rank numbers: 1-5, 6-14, and 15 or higher. Each rank group was compared with resident performance as measured by faculty evaluations, the Orthopaedic In-Training Examination (OITE), and American Board of Orthopaedic Surgery (ABOS) test results. Residents ranked 1-5 scored significantly better on patient care, behavior, and overall competence by faculty evaluation (P<.05). Residents ranked 1-5 scored higher on the OITE compared with those ranked 6-14 during postgraduate years 2 and 3 (P⩽.5). Graduates who had been ranked 1-5 had a 100% pass rate on the ABOS part 1 examination on the first attempt. The most favorably ranked residents performed at or above the level of other residents in the program; they did not score inferiorly on any measure. These results support the authors' method of ranking residents. The rigorous Aggregate Interview Method for ranking applicants consistently identified orthopedic resident candidates who scored highly on the Accreditation Council for Graduate Medical Education resident core competencies as measured by faculty evaluations, performed above the national average on the OITE, and passed the ABOS part 1 examination at rates exceeding the national average. Copyright 2013, SLACK Incorporated.
A revision of the gamma-evaluation concept for the comparison of dose distributions.
Bakai, Annemarie; Alber, Markus; Nüsslin, Fridtjof
2003-11-07
A method for the quantitative four-dimensional (4D) evaluation of discrete dose data based on gradient-dependent local acceptance thresholds is presented. The method takes into account the local dose gradients of a reference distribution for critical appraisal of misalignment and collimation errors. These contribute to the maximum tolerable dose error at each evaluation point to which the local dose differences between comparison and reference data are compared. As shown, the presented concept is analogous to the gamma-concept of Low et al (1998a Med. Phys. 25 656-61) if extended to (3+1) dimensions. The pointwise dose comparisons of the reformulated concept are easier to perform and speed up the evaluation process considerably, especially for fine-grid evaluations of 3D dose distributions. The occurrences of false negative indications due to the discrete nature of the data are reduced with the method. The presented method was applied to film-measured, clinical data and compared with gamma-evaluations. 4D and 3D evaluations were performed. Comparisons prove that 4D evaluations have to be given priority, especially if complex treatment situations are verified, e.g., non-coplanar beam configurations.
Statistical Evaluation of Time Series Analysis Techniques
NASA Technical Reports Server (NTRS)
Benignus, V. A.
1973-01-01
The performance of a modified version of NASA's multivariate spectrum analysis program is discussed. A multiple regression model was used to make the revisions. Performance improvements were documented and compared to the standard fast Fourier transform by Monte Carlo techniques.
Performance and cost effectiveness of permeable friction course (PFC) pavements.
DOT National Transportation Integrated Search
2013-02-01
In this project, the research team evaluated the performance of Permeable Friction Courses (PFC) over time and compared it against other types of wearing surface pavement layers. Several pavement sections including Asphalt Rubber (AR) PFCs, Performan...
2012-01-01
Background Disaster medicine is a fairly young scientific discipline and there is a need for the development of new methods for evaluation and research. This includes full-scale disaster exercisers. A standardized concept on how to evaluate these exercises, could lead to easier identification of pitfalls caused by system-errors in the organization. The aim of this study was to demonstrate the feasibility of using a combination of performance and outcome indicators so that results can be compared in standardized full-scale exercises. Methods Two multidisciplinary, full-scale exercises were studied in 2008 and 2010. The panorama had the same setup. Sets of performance indicators combined with indicators for unfavorable patient outcome were recorded in predesigned templates. Evaluators, all trained in a standardized way at a national disaster medicine centre, scored the results on predetermined locations; at the scene, at hospital and at the regional command and control. Results All data regarding the performance indicators of the participants during the exercises were obtained as well as all data regarding indicators for patient outcome. Both exercises could therefore be compared regarding performance (processes) as well as outcome indicators. The data from the performance indicators during the exercises showed higher scores for the prehospital command in the second exercise 15 points and 3 points respectively. Results from the outcome indicators, patient survival and patient complications, demonstrated a higher number of preventable deaths and a lower number of preventable complications in the exercise 2010. In the exercise 2008 the number of preventable deaths was lower and the number of preventable complications was higher. Conclusions Standardized multidisciplinary, full-scale exercises in different settings can be conducted and evaluated with performance indicators combined with outcome indicators enabling results from exercises to be compared. If exercises are performed in a standardized way, results may serve as a basis for lessons learned. Future use of the same concept using the combination of performance indicators and patient outcome indicators may demonstrate new and important evidence that could lead to new and better knowledge that also may be applied during real incidents. PMID:22929479
Rådestad, Monica; Nilsson, Heléne; Castrén, Maaret; Svensson, Leif; Rüter, Anders; Gryth, Dan
2012-08-28
Disaster medicine is a fairly young scientific discipline and there is a need for the development of new methods for evaluation and research. This includes full-scale disaster exercisers. A standardized concept on how to evaluate these exercises, could lead to easier identification of pitfalls caused by system-errors in the organization. The aim of this study was to demonstrate the feasibility of using a combination of performance and outcome indicators so that results can be compared in standardized full-scale exercises. Two multidisciplinary, full-scale exercises were studied in 2008 and 2010. The panorama had the same setup. Sets of performance indicators combined with indicators for unfavorable patient outcome were recorded in predesigned templates. Evaluators, all trained in a standardized way at a national disaster medicine centre, scored the results on predetermined locations; at the scene, at hospital and at the regional command and control. All data regarding the performance indicators of the participants during the exercises were obtained as well as all data regarding indicators for patient outcome. Both exercises could therefore be compared regarding performance (processes) as well as outcome indicators. The data from the performance indicators during the exercises showed higher scores for the prehospital command in the second exercise 15 points and 3 points respectively. Results from the outcome indicators, patient survival and patient complications, demonstrated a higher number of preventable deaths and a lower number of preventable complications in the exercise 2010. In the exercise 2008 the number of preventable deaths was lower and the number of preventable complications was higher. Standardized multidisciplinary, full-scale exercises in different settings can be conducted and evaluated with performance indicators combined with outcome indicators enabling results from exercises to be compared. If exercises are performed in a standardized way, results may serve as a basis for lessons learned. Future use of the same concept using the combination of performance indicators and patient outcome indicators may demonstrate new and important evidence that could lead to new and better knowledge that also may be applied during real incidents.
Leithner, Doris; Mahmoudi, Scherwin; Wichmann, Julian L; Martin, Simon S; Lenga, Lukas; Albrecht, Moritz H; Booz, Christian; Arendt, Christophe T; Beeres, Martin; D'Angelo, Tommaso; Bodelle, Boris; Vogl, Thomas J; Scholtz, Jan-Erik
2018-02-01
To investigate the impact of traditional (VMI) and noise-optimized virtual monoenergetic imaging (VMI+) algorithms on quantitative and qualitative image quality, and the assessment of stenosis in carotid and intracranial dual-energy CTA (DE-CTA). DE-CTA studies of 40 patients performed on a third-generation 192-slice dual-source CT scanner were included in this retrospective study. 120-kVp image-equivalent linearly-blended, VMI and VMI+ series were reconstructed. Quantitative analysis included evaluation of contrast-to-noise ratios (CNR) of the aorta, common carotid artery, internal carotid artery, middle cerebral artery, and basilar artery. VMI and VMI+ with highest CNR, and linearly-blended series were rated qualitatively. Three radiologists assessed artefacts and suitability for evaluation at shoulder height, carotid bifurcation, siphon, and intracranial using 5-point Likert scales. Detection and grading of stenosis were performed at carotid bifurcation and siphon. Highest CNR values were observed for 40-keV VMI+ compared to 65-keV VMI and linearly-blended images (P < 0.001). Artefacts were low in all qualitatively assessed series with excellent suitability for supraaortic artery evaluation at shoulder and bifurcation height. Suitability was significantly higher in VMI+ and VMI compared to linearly-blended images for intracranial and ICA assessment (P < 0.002). VMI and VMI+ showed excellent accordance for detection and grading of stenosis at carotid bifurcation and siphon with no differences in diagnostic performance. 40-keV VMI+ showed improved quantitative image quality compared to 65-keV VMI and linearly-blended series in supraaortic DE-CTA. VMI and VMI+ provided increased suitability for carotid and intracranial artery evaluation with excellent assessment of stenosis, but did not translate into increased diagnostic performance. Copyright © 2017 Elsevier B.V. All rights reserved.
Evaluating Curriculum-Based Measurement from a Behavioral Assessment Perspective
ERIC Educational Resources Information Center
Ardoin, Scott P.; Roof, Claire M.; Klubnick, Cynthia; Carfolite, Jessica
2008-01-01
Curriculum-based measurement Reading (CBM-R) is an assessment procedure used to evaluate students' relative performance compared to peers and to evaluate their growth in reading. Within the response to intervention (RtI) model, CBM-R data are plotted in time series fashion as a means modeling individual students' response to varying levels of…
Lai, Fu-Jou; Chang, Hong-Tsun; Wu, Wei-Sheng
2015-01-01
Computational identification of cooperative transcription factor (TF) pairs helps understand the combinatorial regulation of gene expression in eukaryotic cells. Many advanced algorithms have been proposed to predict cooperative TF pairs in yeast. However, it is still difficult to conduct a comprehensive and objective performance comparison of different algorithms because of lacking sufficient performance indices and adequate overall performance scores. To solve this problem, in our previous study (published in BMC Systems Biology 2014), we adopted/proposed eight performance indices and designed two overall performance scores to compare the performance of 14 existing algorithms for predicting cooperative TF pairs in yeast. Most importantly, our performance comparison framework can be applied to comprehensively and objectively evaluate the performance of a newly developed algorithm. However, to use our framework, researchers have to put a lot of effort to construct it first. To save researchers time and effort, here we develop a web tool to implement our performance comparison framework, featuring fast data processing, a comprehensive performance comparison and an easy-to-use web interface. The developed tool is called PCTFPeval (Predicted Cooperative TF Pair evaluator), written in PHP and Python programming languages. The friendly web interface allows users to input a list of predicted cooperative TF pairs from their algorithm and select (i) the compared algorithms among the 15 existing algorithms, (ii) the performance indices among the eight existing indices, and (iii) the overall performance scores from two possible choices. The comprehensive performance comparison results are then generated in tens of seconds and shown as both bar charts and tables. The original comparison results of each compared algorithm and each selected performance index can be downloaded as text files for further analyses. Allowing users to select eight existing performance indices and 15 existing algorithms for comparison, our web tool benefits researchers who are eager to comprehensively and objectively evaluate the performance of their newly developed algorithm. Thus, our tool greatly expedites the progress in the research of computational identification of cooperative TF pairs.
2015-01-01
Background Computational identification of cooperative transcription factor (TF) pairs helps understand the combinatorial regulation of gene expression in eukaryotic cells. Many advanced algorithms have been proposed to predict cooperative TF pairs in yeast. However, it is still difficult to conduct a comprehensive and objective performance comparison of different algorithms because of lacking sufficient performance indices and adequate overall performance scores. To solve this problem, in our previous study (published in BMC Systems Biology 2014), we adopted/proposed eight performance indices and designed two overall performance scores to compare the performance of 14 existing algorithms for predicting cooperative TF pairs in yeast. Most importantly, our performance comparison framework can be applied to comprehensively and objectively evaluate the performance of a newly developed algorithm. However, to use our framework, researchers have to put a lot of effort to construct it first. To save researchers time and effort, here we develop a web tool to implement our performance comparison framework, featuring fast data processing, a comprehensive performance comparison and an easy-to-use web interface. Results The developed tool is called PCTFPeval (Predicted Cooperative TF Pair evaluator), written in PHP and Python programming languages. The friendly web interface allows users to input a list of predicted cooperative TF pairs from their algorithm and select (i) the compared algorithms among the 15 existing algorithms, (ii) the performance indices among the eight existing indices, and (iii) the overall performance scores from two possible choices. The comprehensive performance comparison results are then generated in tens of seconds and shown as both bar charts and tables. The original comparison results of each compared algorithm and each selected performance index can be downloaded as text files for further analyses. Conclusions Allowing users to select eight existing performance indices and 15 existing algorithms for comparison, our web tool benefits researchers who are eager to comprehensively and objectively evaluate the performance of their newly developed algorithm. Thus, our tool greatly expedites the progress in the research of computational identification of cooperative TF pairs. PMID:26677932
Computer-aided detection (CAD) of breast cancer on full field digital and screening film mammograms
NASA Astrophysics Data System (ADS)
Sun, Xuejun; Qian, Wei; Song, Xiaoshan; Qian, Yuyan; Song, Dansheng; Clark, Robert A.
2003-05-01
Full-field digital mammography (FFDM) as a new breast imaging modality has potential to detect more breast cancers or to detect them at smaller sizes and earlier stages compared with screening film mammography (SFM). However, its performance needs verification, and it would pose new problems for the development of CAD methods for breast cancer detection and diagnosis. Performance evaluation of CAD systems on FFDM and SFM has been conducted in this study, respectively. First, an adaptive CAD system employing a series of advanced modules has been developed on FFDM. Second, a standardization approach has been developed to make the CAD system independent of characteristics of digitizer or imaging modalities for mammography. CAD systems developed previously for SFM and developed in this study for FFDM have been evaluated on FFDM and SFM images without and with standardization, respectively, to examine the performance improvement of the CAD system developed in this study. Computerized free-response receiver operating characteristic (FROC) analysis has been adopted as performance evaluation method. Compared with previous one, the CAD system developed in this study demonstrated significantly performance improvements. However, the comparison results have shown that the performances of final CAD system in this study are not significantly different on FFDM and on SFM after standardization. It needs further study on the assessment of CAD system performance on FFDM and SFM modalities.
Development of a test protocol for evaluating EVA glove performance
NASA Technical Reports Server (NTRS)
Hinman, Elaine M.
1992-01-01
Testing gloved hand performance involves work from several disciplines. Evaluations performed in the course of reenabling a disabled hand, designing a robotic end effector or master controller, or hard-suit design have all yielded relevant information, and, in most cases, produced performance test methods. Most times, these test methods have been primarily oriented toward their parent discipline. For space operations, a comparative test which would provide a way to quantify pressure glove and end effector performance would be useful in dividing tasks between humans and robots. Such a test would have to rely heavily on sensored measurement, as opposed to questionnaires, to produce relevant data. However, at some point human preference would have to be taken into account. This paper presents a methodology for evaluating gloved hand performance which attempts to respond to these issues. Glove testing of a prototype glove design using this method is described.
Shrestha, Namita; Chilkoor, Govinda; Wilder, Joseph; Ren, Zhiyong Jason; Gadhamshetty, Venkataramana
2018-06-01
This study evaluates and compares the performance of microbial fuel cells (MFCs) and microbial capacitive deionization cells (MCDCs) fed with wastewater produced from the Bakken shale. The produced water was characterized by high levels of dissolved solids and chemical oxygen demand (COD). Two-compartment MFCs and three-compartment MCDCs were evaluated under batch-fed mode using mixed microbial consortia in the anode, ferricyanide in the cathode, and produced water as the electrolyte in the anode and capacitive deionization units. COD removal in the MFCs was 88%, while that in the MCDCs was limited to 76%. The lower performance of the MCDCs was due to the large impedance (6600 Ω cm 2 ) compared with the MFCs (870 Ω cm 2 ). However, the MCDCs achieved two-fold higher removal of dissolved solids. Both the MFCs and MCDCs suffered from a higher impedance induced by fouling in the latter stages of the operation. Copyright © 2018 Elsevier B.V. All rights reserved.
Islam, Naz Niamul; Hannan, M A; Shareef, Hussain; Mohamed, Azah; Salam, M A
2014-01-01
Power oscillation damping controller is designed in linearized model with heuristic optimization techniques. Selection of the objective function is very crucial for damping controller design by optimization algorithms. In this research, comparative analysis has been carried out to evaluate the effectiveness of popular objective functions used in power system oscillation damping. Two-stage lead-lag damping controller by means of power system stabilizers is optimized using differential search algorithm for different objective functions. Linearized model simulations are performed to compare the dominant mode's performance and then the nonlinear model is continued to evaluate the damping performance over power system oscillations. All the simulations are conducted in two-area four-machine power system to bring a detailed analysis. Investigated results proved that multiobjective D-shaped function is an effective objective function in terms of moving unstable and lightly damped electromechanical modes into stable region. Thus, D-shape function ultimately improves overall system damping and concurrently enhances power system reliability.
Evaluating distance learning in health informatics education.
Russell, Barbara L; Barefield, Amanda C; Turnbull, Diane; Leibach, Elizabeth; Pretlow, Lester
2008-04-24
The purpose of this study was to compare academic performance between distance-learning and on-campus health informatics students. A quantitative causal-comparative research design was utilized, and academic performance was measured by final GPA scores and Registered Health Information Administrator certification exam scores. Differences in previous academic performance between the two groups were also determined by comparing overall admission GPA and math/science admission GPA. The researchers found no difference in academic performance between the two groups when final GPA scores and total certification scores were compared. However, there were statistically significant differences between the two groups in 4 of the 17 sub-domains of the certification examination, with the on-campus students scoring slightly higher than the distance students. Correlation studies were also performed, and the researchers found significant correlations between overall admission GPA, math/science admission GPA, final GPA, and certification scores.
Comparative evaluation of concrete bridge deck sealers.
DOT National Transportation Integrated Search
2015-08-01
The main objective of this research was to compare the performance of five bridge deck sealer products using a : synthesis of two testing methods: NCHRP Report 244 Series II tests and standards developed by the Alberta : Ministry of Transportation (B...
Performance evaluation and specification of trackless tack : [project summary].
DOT National Transportation Integrated Search
2016-09-01
Researchers compared two tracking resistance tests for tack: a track-free time test and a modified dynamic shear rheometer (DSR) tackiness test. : Researchers compared four bond strength tests: interface shear, pull-off, torque, and Arcan. Then, usin...
NASA Astrophysics Data System (ADS)
Wunderlich, Adam; Goossens, Bart
2014-03-01
The majority of the literature on task-based image quality assessment has focused on lesion detection tasks, using the receiver operating characteristic (ROC) curve, or related variants, to measure performance. However, since many clinical image evaluation tasks involve both detection and estimation (e.g., estimation of kidney stone composition, estimation of tumor size), there is a growing interest in performance evaluation for joint detection and estimation tasks. To evaluate observer performance on such tasks, Clarkson introduced the estimation ROC (EROC) curve, and the area under the EROC curve as a summary figure of merit. In the present work, we propose nonparametric estimators for practical EROC analysis from experimental data, including estimators for the area under the EROC curve and its variance. The estimators are illustrated with a practical example comparing MRI images reconstructed from different k-space sampling trajectories.
MSFC Skylab instrumentation and communication system mission evaluation
NASA Technical Reports Server (NTRS)
Adair, B. M.
1974-01-01
An evaluation of the in-orbit performance of the instrumentation and communications systems installed on Skylab is presented. Performance is compared with functional requirements and the fidelity of communications. In-orbit performance includes processing engineering, scientific, experiment, and biomedical data, implementing ground-generated commands, audio and video communication, generating rendezvous ranging information, and radio frequency transmission and reception. A history of the system evolution based on the functional requirements and a physical description of the launch configuration is included. The report affirms that the instrumentation and communication system satisfied all imposed requirements.
Jeon, Jin-Hun; Kim, Hae-Young; Kim, Ji-Hwan; Kim, Woong-Chul
2014-12-01
This study aimed to evaluate the accuracy of digitizing dental impressions of abutment teeth using a white light scanner and to compare the findings among teeth types. To assess precision, impressions of the canine, premolar, and molar prepared to receive all-ceramic crowns were repeatedly scanned to obtain five sets of 3-D data (STL files). Point clouds were compared and error sizes were measured (n=10 per type). Next, to evaluate trueness, impressions of teeth were rotated by 10°-20° and scanned. The obtained data were compared with the first set of data for precision assessment, and the error sizes were measured (n=5 per type). The Kruskal-Wallis test was performed to evaluate precision and trueness among three teeth types, and post-hoc comparisons were performed using the Mann-Whitney U test with Bonferroni correction (α=.05). Precision discrepancies for the canine, premolar, and molar were 3.7 µm, 3.2 µm, and 7.3 µm, respectively, indicating the poorest precision for the molar (P<.001). Trueness discrepancies for teeth types were 6.2 µm, 11.2 µm, and 21.8 µm, respectively, indicating the poorest trueness for the molar (P=.007). In respect to accuracy the molar showed the largest discrepancies compared with the canine and premolar. Digitizing of dental impressions of abutment teeth using a white light scanner was assessed to be a highly accurate method and provided discrepancy values in a clinically acceptable range. Further study is needed to improve digitizing performance of white light scanning in axial wall.
Thermal Performance Testing of EMU and CSAFE Liquid Cooling Garments
NASA Technical Reports Server (NTRS)
Rhodes, Richard; Bue, Grant; Hakam, Mark; Radford, Tamara
2013-01-01
Future exploration missions require the development of a new liquid cooling garment (LCG) that offers greater system reliability, is more comfortable, and maximizes thermal performance. To inform the development of a future LCG a thermal performance test was conducted to evaluate three factors: (1) the effect of the thermal comfort undergarment (TCU) on tactile and thermal comfort, (2) the comparable thermal performance of an CSAFE developed engineering evaluation unit (EEU) LCG, which uses a commercial-off-the-shelf (COTS) wicking garment as the base, and (3) the performance of a torso or upper body only LCG configuration to evaluate a proposed auxiliary loop configuration. To evaluate the thermal performance of each configuration a metabolic suit test was conducted, utilizing suited subjects to generate metabolic heat by walking on a treadmill at various speeds. Three (3) test subjects of similar height and weight produced a metabolic load for five tests by either resting (300-600 BTU/hr), walking at a slow pace (1200 BTU/hr), and walking at a brisk pace (2200 BTU/hr). During the test, data was collected that would allow us to track the heat transfer to the LCG and ventilation system to determine the thermal performance of the LCG configurations. Four different test configurations were tested, with one configuration tested twice. The test results show that the CSAFE EEU LCG and EMU LCG had comparable performance. The testing also showed that an auxiliary loop LCG, sized similarly to the shirt-only configuration, should provide adequate cooling for contingency scenarios. Finally, the testing showed the previous analysis that assumed a UA deterioration from the TCU was too conservative and the TCU may prove to be acceptable for future development with additional analysis and testing.
Duan, Junbo; Zhang, Ji-Gang; Deng, Hong-Wen; Wang, Yu-Ping
2013-01-01
Copy number variation (CNV) has played an important role in studies of susceptibility or resistance to complex diseases. Traditional methods such as fluorescence in situ hybridization (FISH) and array comparative genomic hybridization (aCGH) suffer from low resolution of genomic regions. Following the emergence of next generation sequencing (NGS) technologies, CNV detection methods based on the short read data have recently been developed. However, due to the relatively young age of the procedures, their performance is not fully understood. To help investigators choose suitable methods to detect CNVs, comparative studies are needed. We compared six publicly available CNV detection methods: CNV-seq, FREEC, readDepth, CNVnator, SegSeq and event-wise testing (EWT). They are evaluated both on simulated and real data with different experiment settings. The receiver operating characteristic (ROC) curve is employed to demonstrate the detection performance in terms of sensitivity and specificity, box plot is employed to compare their performances in terms of breakpoint and copy number estimation, Venn diagram is employed to show the consistency among these methods, and F-score is employed to show the overlapping quality of detected CNVs. The computational demands are also studied. The results of our work provide a comprehensive evaluation on the performances of the selected CNV detection methods, which will help biological investigators choose the best possible method.
An Evaluation Method for PV Systems by using Limited Data Item
NASA Astrophysics Data System (ADS)
Oozeki, Takashi; Izawa, Toshiyasu; Otani, Kenji; Tsuzuku, Ken; Koike, Hisafumi; Kurokawa, Kosuke
Beside photovoltaic (PV) systems are recently expected to introduce around Japan, almost all of them have not been taken care after established since PV systems are called maintenance free. In fact, there are few troubles about PV operations behind owners of PV systems because characteristics of them cannot be identified completely such as the ideal output energy. Therefore, it is very important to evaluate the characteristics of them. For evaluating them, equipments of measuring are required, and they, especially Pyrheliometer, are expensive as much as owners of the PV system cannot equip usually. Consequently, An evaluation method which can reveal the performance of operation such as the performance ratio with a very few kinds of data is necessary. In this paper, proposed method can evaluate performance ratio, shading losses, inverter efficiency losses by using only system output data items. The adequacies of the method are indicated by comparing with actual data and field survey results. As a result, the method is intended to be checking tool of PV system performance.
Escandón-García, Santiago; Ribeiro, Filomena J.; McAlinden, Colm
2018-01-01
Purpose To compare the through-focus visual performance in a clinical population of pseudophakic patients implanted with two new trifocal intraocular lenses (IOLs) and one extended depth of focus IOL. Methods Prospective, nonrandomized, examiner-masked case series. Twenty-three patients received the FineVision® and seven patients received the PanOptix™ trifocal IOLs. Fifteen patients received the Symfony extended depth of focus IOL. Mean age of patients was 63 ± 8 years. Through-focus visual acuity was measured from –3.00 to +1.00 D vergences. Contrast sensitivity was measured with and without a source of glare. Light disturbances were evaluated with the Light Distortion Analyzer. Results Though-focus evaluation showed that trifocal IOLs performed significantly better at near distance (33 and 40 cm), and extended depth of focus performed significantly better at intermediate distance (1.0 m). Contrast sensitivity function with glare and dysphotopsia was similar between the three IOLs and subjective response to questionnaire showed a significantly higher score (worse performance) for the extended depth of focus IOL compared to both trifocal IOLs in the bothersome subscale (p < 0.05). Conclusions Trifocal IOLs grant better performance at near distance while extended depth of focus IOL performs better at intermediate distance. Objective dysphotopsia measured with the Light Distortion Analyzer is not reduced in extended depth of focus IOL compared to trifocal IOLs. PMID:29651343
Kinoshita, Manabu; Taniguchi, Mai; Takagaki, Masatoshi; Seike, Nobuhisa; Hashimoto, Naoya; Yoshimine, Toshiki
2015-05-01
Neurosurgical patties are the most frequently used instruments during neurosurgical procedures, and their high performance is required to ensure safe operations. They must offer cushioning, water-absorbing, water-retaining, and non-tissue adherent characteristics. Here, the authors describe a revised neurosurgical patty that is superior in all respects to the conventional patty available in Japan. Patty characteristics were critically and scientifically evaluated using various in vitro assays. Moreover, a novel ex vivo evaluation system focusing on the adherent characteristics of the neurosurgical patty was developed. The proposed assay could provide benchmark data for comparing different neurosurgical patties, offering neurosurgeons objective data on the performance of patties. The newly developed patty was also evaluated in real neurosurgical settings and showed superb performance during various neurosurgical procedures.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Im, Piljae; Malhotra, Mini; Munk, Jeffrey D.
This report provides second-year cooling season test results for the multi-year project titled “Evaluation of Variable Refrigeration Flow (VRF) System on Oak Ridge National Laboratory (ORNL)’s Flexible Research Platform (FRP).” The purpose of the second-year project was to (1) evaluate the full- and partload performance of VRF systems compared with an existing baseline heating, ventilation, and airconditioning (HVAC) system, which is a conventional rooftop unit (RTU) variable-air-volume (VAV) system with electric resistance heating and (2) use hourly building energy simulation to evaluate the energy savings potential of using VRF systems in major US cities. The second-year project performance period wasmore » from July 2015 through June 2016.« less
Situation Awareness and Workload Measures for SAFOR
NASA Technical Reports Server (NTRS)
DeMaio, Joe; Hart, Sandra G.; Allen, Ed (Technical Monitor)
1999-01-01
The present research was conducted in support of the NASA Safe All-Weather Flight Operations for Rotorcraft (SAFOR) program. The purpose of the work was to investigate the utility of two measurement tools developed by the British Defense Evaluation Research Agency. These tools were a subjective workload assessment scale, the DRA Workload Scale (DRAWS), and a situation awareness measurement tool in which the crews self-evaluation of performance is compared against actual performance. These two measurement tools were evaluated in the context of a test of an innovative approach to alerting the crew by way of a helmet mounted display. The DRAWS was found to be usable, but it offered no advantages over extant scales, and it had only limited resolution. The performance self-evaluation metric of situation awareness was found to be highly effective.
Standard test evaluation of graphite fiber/resin matrix composite materials for improved toughness
NASA Technical Reports Server (NTRS)
Chapman, Andrew J.
1984-01-01
Programs sponsored by NASA with the commercial transport manufacturers to develop a technology data base are required to design and build composite wing and fuselage structures. To realize the full potential of composite structures in these strength critical designs, material systems having improved ductility and interlaminar toughness are being sought. To promote systematic evaluation of new materials, NASA and the commercial transport manufacturers have selected and standardized a set of five common tests. These tests evaluate open hole tension and compression performance, compression performance after impact at an energy level of 20 ft-lb, and resistance to delamination. Ten toughened resin matrix/graphite fiber composites were evaluated using this series of tests, and their performance is compared with a widely used composite system.
ERIC Educational Resources Information Center
Olive, G.; And Others
A selective dissemination of information service based on computer scanning of Nuclear Science Abstracts tapes has operated at the Atomic Energy Research Establishment, Harwell, England since October, 1968. The performance of the mechanized SDI service has been compared with that of the pre-existing current awareness service which is based on…
Evaluation of Sc-Bearing Aluminum Alloy C557 for Aerospace Applications
NASA Technical Reports Server (NTRS)
Domack, Marcia S.; Dicus, Dennis L.
2002-01-01
The performance of the Al-Mg-Sc alloy C557 was evaluated to assess its potential for a broad range of aerospace applications, including airframe and launch vehicle structures. Of specific interest were mechanical properties at anticipated service temperatures and thermal stability of the alloy. Performance was compared with conventional airframe aluminum alloys and with other emerging aluminum alloys developed for specific service environments. Mechanical properties and metallurgical structure were evaluated for commercially rolled sheet in the as-received H116 condition and after thermal exposures at 107 C. Metallurgical analyses were performed to de.ne grain morphology and texture, strengthening precipitates, and to assess the effect of thermal exposure.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Diaz, Aaron A.; Baldwin, David L.; Cinson, Anthony D.
2014-08-06
This Technical Letter Report satisfies the M3AR-14PN2301022 milestone, and is focused on identifying and quantifying the mechanistic sources of sensor performance variation between individual 22-element, linear phased-array sensor prototypes, SN1 and SN2. This effort constitutes an iterative evolution that supports the longer term goal of producing and demonstrating a pre-manufacturing prototype ultrasonic probe that possesses the fundamental performance characteristics necessary to enable the development of a high-temperature sodium-cooled fast reactor inspection system. The scope of the work for this portion of the PNNL effort conducted in FY14 includes performing a comparative evaluation and assessment of the performance characteristics of themore » SN1 and SN2 22 element PA-UT probes manufactured at PNNL. Key transducer performance parameters, such as sound field dimensions, resolution capabilities, frequency response, and bandwidth are used as a metric for the comparative evaluation and assessment of the SN1 and SN2 engineering test units.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hart, Darren M.
Sandia National Laboratories has tested and evaluated Geotech Smart24 data acquisition system with active Fortezza crypto card data signing and authentication. The test results included in this report were in response to static and tonal-dynamic input signals. Most test methodologies used were based on IEEE Standards 1057 for Digitizing Waveform Recorders and 1241 for Analog to Digital Converters; others were designed by Sandia specifically for infrasound application evaluation and for supplementary criteria not addressed in the IEEE standards. The objective of this work was to evaluate the overall technical performance of the Geotech Smart24 digitizer with a Fortezza PCMCIA cryptomore » card actively implementing the signing of data packets. The results of this evaluation were compared to relevant specifications provided within manufacturer's documentation notes. The tests performed were chosen to demonstrate different performance aspects of the digitizer under test. The performance aspects tested include determining noise floor, least significant bit (LSB), dynamic range, cross-talk, relative channel-to-channel timing, time-tag accuracy, analog bandwidth and calibrator performance.« less
Metrics for Offline Evaluation of Prognostic Performance
NASA Technical Reports Server (NTRS)
Saxena, Abhinav; Celaya, Jose; Saha, Bhaskar; Saha, Sankalita; Goebel, Kai
2010-01-01
Prognostic performance evaluation has gained significant attention in the past few years. Currently, prognostics concepts lack standard definitions and suffer from ambiguous and inconsistent interpretations. This lack of standards is in part due to the varied end-user requirements for different applications, time scales, available information, domain dynamics, etc. to name a few. The research community has used a variety of metrics largely based on convenience and their respective requirements. Very little attention has been focused on establishing a standardized approach to compare different efforts. This paper presents several new evaluation metrics tailored for prognostics that were recently introduced and were shown to effectively evaluate various algorithms as compared to other conventional metrics. Specifically, this paper presents a detailed discussion on how these metrics should be interpreted and used. These metrics have the capability of incorporating probabilistic uncertainty estimates from prognostic algorithms. In addition to quantitative assessment they also offer a comprehensive visual perspective that can be used in designing the prognostic system. Several methods are suggested to customize these metrics for different applications. Guidelines are provided to help choose one method over another based on distribution characteristics. Various issues faced by prognostics and its performance evaluation are discussed followed by a formal notational framework to help standardize subsequent developments.
Field evaluation of Fourier transform infrared continuous emissions monitoring (FTIR CEM) systems
NASA Astrophysics Data System (ADS)
Dunder, Thomas A.; Geyer, Thomas J.; Kinner, Laura L.; Plummer, Grant M.
1995-02-01
Recent environmental regulations, including the Clean Air Act and the Enhanced Monitoring Regulations, may require continuous emissions monitoring (CEM) of hazardous air pollutants (HAPs). A promising technique for this application is Fourier transform infrared spectroscopy (FTIR). FTIR spectroscopy can, in principle, be used to monitor virtually any gas phase species. Two evaluations of FTIR CEM systems are discussed. The first study, performed in 1993 - 94, compared two FTIR CEM systems on a side-by-side basis in an extended field test at two coal-fired electric power plants. The FTIR CEM systems monitored the legally mandated criteria pollutants and diluents (CO, CO2, NO, NO2, and SO2) as well as H2O. In addition, one system monitored two HAPs (HCl and HF) and NH3. The FTIR CEM measurements were compared with those from the compliance CEM systems at the facilities. Several relative accuracy test audits were also performed to verify the FTIR CEM accuracy. The second evaluation was recently commenced on behalf of the Environmental Protection Agency. In this study, FTIR CEM systems are evaluated specifically for the monitoring of HAP species by conducting laboratory and field tests. The evaluation culminates in the development of proposed performance specifications and protocols for FTIR CEM systems.
Evaluation of microwave traffic detector at the Chemawa Road/Interstate 5 interchange: final report.
DOT National Transportation Integrated Search
2002-04-01
In 2001, the Oregon Department of Transportation installed a microwave traffic detection sensor, and compared its performance to conventional inductive traffic loops. The objective of the study was to evaluate the capabilities of the microwave tra...
Performance of concrete joint sealants in hot climates
DOT National Transportation Integrated Search
1987-11-01
A research project was initiated in 1979 on project IR-17-1(126) to compare and evaluate several joint sealants. In 1982 another test section on project F 028-1-506 was established to evaluate several other sealants. This report documents the placeme...
Evaluation of the locations of Kentucky's traffic crash data.
DOT National Transportation Integrated Search
2010-11-01
An evaluation of a random sample of crashes from 2009 was performed to assess the current accuracy of the crash data's location information.The location of the crash was compared to the presumed location using several report data elements such as nea...
Evaluation of alternative dowel bar materials and coatings.
DOT National Transportation Integrated Search
2011-11-01
This study provided for a continuation of the long-term performance evaluation of 1.5-in (38-mm) diameter FRP dowels : and Type 304 stainless steel solid dowels or mortar-filled tubes compared to epoxy-coated dowels. This primarily : included an eval...
Halimi, C; Montembault, A; Guerry, A; Delair, T; Viguier, E; Fulchiron, R; David, L
2015-01-01
A new generation of dermal filler for wrinkle filler based on chitosan was compared to current hyaluronic acid-based dermal fillers by using a new rheological performance criterion based on viscosity during injection related to Newtonian viscosity. In addition an in vivo evaluation was performed for preclinical evidence of chitosan use as dermal filler. In this way, biocompatibility and dermis reconstruction was evaluated on a pig model.
NASA Astrophysics Data System (ADS)
Vikhoff-Baaz, Barbro
2000-10-01
The doctoral thesis concerns development, evaluation and performance of quality assessment methods for volume- selection methods in 31P and 1H MR spectroscopy (MRS). It also contains different aspects of the measurement procedure for 1H MR spectroscopic imaging (MRSI) with application on the human brain, image reconstruction of the MRSI images and evaluation methods for lateralization of temporal lobe epilepsy (TLE). Two complementary two-compartment phantoms and evaluation methods for quality assessment of 31P MRS in small-bore MR systems were presented. The first phantom consisted of an inner cube inside a sphere phantom where measurements with and without volume selection where compared for various VOI sizes. The multi-centre showed that the evaluated parameters provide useful information of the performance of volume-selective MRS at the MR system. The second phantom consisted of two compartments divided by a very thin wall and was found useful for measurements of the appearance and position of the VOI profile in specific gradient directions. The second part concerned 1H MRS and MRSI of whole-body MR systems. Different factors that may degrade or complicate the measurement procedure like for MRSI were evaluated, e.g. the volume selection performance, contamination, susceptibility and motion. Two interpolation methods for reconstruction of MRSI images were compared. Measurements and computer simulations showed that Fourier interpolation correctly visualizes the information inherent in the data set, while the results were dependent on the position of the object relative the original matrix using Cubic spline interpolation. Application of spatial filtering may improve the image representation of the data. Finally, 1H MRSI was performed on healthy volunteers and patients with temporal lobe epilepsy (TLE). Metabolite concentration images were used for lateralization of TLE, where the signal intensity in the two hemispheres were compared. Visual analysis of the metabolite concentration images can, with high accuracy, be used for lateralization in routine examinations. Analysis from measurements with region-of-interests (ROI) in different locations gives quantitative information about the degree of signal loss and the spatial distribution.
Rosato, Stefano; D'Errigo, Paola; Badoni, Gabriella; Fusco, Danilo; Perucci, Carlo A; Seccareccia, Fulvia
2008-08-01
The availability of two contemporary sources of information about coronary artery bypass graft (CABG) interventions, allowed 1) to verify the feasibility of performing outcome evaluation studies using administrative data sources, and 2) to compare hospital performance obtainable using the CABG Project clinical database with hospital performance derived from the use of current administrative data. Interventions recorded in the CABG Project were linked to the hospital discharge record (HDR) administrative database. Only the linked records were considered for subsequent analyses (46% of the total CABG Project). A new selected population "clinical card-HDR" was then defined. Two independent risk-adjustment models were applied, each of them using information derived from one of the two different sources. Then, HDR information was supplemented with some patient preoperative conditions from the CABG clinical database. The two models were compared in terms of their adaptability to data. Hospital performances identified by the two different models and significantly different from the mean was compared. In only 4 of the 13 hospitals considered for analysis, the results obtained using the HDR model did not completely overlap with those obtained by the CABG model. When comparing statistical parameters of the HDR model and the HDR model + patient preoperative conditions, the latter showed the best adaptability to data. In this "clinical card-HDR" population, hospital performance assessment obtained using information from the clinical database is similar to that derived from the use of current administrative data. However, when risk-adjustment models built on administrative databases are supplemented with a few clinical variables, their statistical parameters improve and hospital performance assessment becomes more accurate.
The relationship of post-event processing to self-evaluation of performance in social anxiety.
Brozovich, Faith; Heimberg, Richard G
2011-06-01
Socially anxious and control participants engaged in a social interaction with a confederate and then wrote about themselves or the other person (i.e., self-focused post-event processing [SF-PEP] vs. other-focused post-event processing [OF-PEP]) and completed several questionnaires. One week later, participants completed measures concerning their evaluation of their performance in the social interaction and the degree to which they engaged in post-event processing (PEP) during the week. Socially anxious individuals evaluated their performance in the social interaction more poorly than control participants, both immediately after and 1 week later. Socially anxious individuals assigned to the SF-PEP condition displayed fewer positive feelings about their performance compared to the socially anxious individuals in the OF-PEP condition as well as controls in either condition. Also, the trait tendency to engage in PEP moderated the effect of social anxiety on participants' evaluation of their performance in the interaction, such that high socially anxious individuals with high trait PEP scores evaluated themselves in the interaction more negatively at the later assessment. These results suggest that PEP and other self-evaluative processes may perpetuate the cycle of social anxiety. Copyright © 2011. Published by Elsevier Ltd.
Widjaja, Susana; Cohen, Surekha; Brady, William E.; O'reilly, Kevin; Susanto; Wibowo, Ajar; Cahyono; Graham, Robert R.; Porter, Kevin R.
1999-01-01
A multicenter cross-sectional survey was conducted comparing a commercially available chlamydial optical immunoassay (OIA) to the chlamydial ligase chain reaction (LCR). Endocervical samples from 415 outpatients visiting clinics from three hospitals in South Kalimantan, Indonesia, were evaluated. Relative to the LCR, the overall sensitivity and specificity of the OIA were 31.6 and 98.9%, respectively. The sensitivity of the OIA varied among the three hospital laboratories, ranging from 20 to 50%. The OIA performance was slightly lower on samples from patients attending dermatovenereology clinics than on samples from nondermatovenereology clinic patients. The results indicate that the OIA did not perform well compared to LCR. PMID:10565960
Widjaja, S; Cohen, S; Brady, W E; O'reilly, K; Susanto; Wibowo, A; Cahyono; Graham, R R; Porter, K R
1999-12-01
A multicenter cross-sectional survey was conducted comparing a commercially available chlamydial optical immunoassay (OIA) to the chlamydial ligase chain reaction (LCR). Endocervical samples from 415 outpatients visiting clinics from three hospitals in South Kalimantan, Indonesia, were evaluated. Relative to the LCR, the overall sensitivity and specificity of the OIA were 31.6 and 98.9%, respectively. The sensitivity of the OIA varied among the three hospital laboratories, ranging from 20 to 50%. The OIA performance was slightly lower on samples from patients attending dermatovenereology clinics than on samples from nondermatovenereology clinic patients. The results indicate that the OIA did not perform well compared to LCR.
Performance comparison of optical interference cancellation system architectures.
Lu, Maddie; Chang, Matt; Deng, Yanhua; Prucnal, Paul R
2013-04-10
The performance of three optics-based interference cancellation systems are compared and contrasted with each other, and with traditional electronic techniques for interference cancellation. The comparison is based on a set of common performance metrics that we have developed for this purpose. It is shown that thorough evaluation of our optical approaches takes into account the traditional notions of depth of cancellation and dynamic range, along with notions of link loss and uniformity of cancellation. Our evaluation shows that our use of optical components affords performance that surpasses traditional electronic approaches, and that the optimal choice for an optical interference canceller requires taking into account the performance metrics discussed in this paper.
Evaluation of Sub Query Performance in SQL Server
NASA Astrophysics Data System (ADS)
Oktavia, Tanty; Sujarwo, Surya
2014-03-01
The paper explores several sub query methods used in a query and their impact on the query performance. The study uses experimental approach to evaluate the performance of each sub query methods combined with indexing strategy. The sub query methods consist of in, exists, relational operator and relational operator combined with top operator. The experimental shows that using relational operator combined with indexing strategy in sub query has greater performance compared with using same method without indexing strategy and also other methods. In summary, for application that emphasized on the performance of retrieving data from database, it better to use relational operator combined with indexing strategy. This study is done on Microsoft SQL Server 2012.
Sakurai, Hiroaki; Kanada, Yoshikiyo; Sugiura, Yoshito; Motoya, Ikuo; Yamada, Masayuki; Tomita, Masao; Naka, Toru; Teranishi, Toshio; Tanabe, Shigeo; Tsujimura, Toru; Okanishi, Tetsuo
2013-01-01
[Purpose] A major issue in physical/occupational therapist education is the improvement of students' clinical techniques. In this study, we introduced an education system using an Objective Structured Clinical Examination (OSCE), and made an attempt at standardization of its evaluation. [Subjects] The subjects were 227 students in the classes of 2008 to 2010 who enrolled at our university between 2004 and 2006, before the introduction of the education system using OSCE, and 221 students in the classes of 2011 to 2013 who enrolled between 2007 and 2009, after the introduction. [Methods] Performances in attitude and skills (performance in clinical training and OSCE) were compared between before and after the introduction of OSCE. OSCE results were compared between before and after clinical trainings at each OSCE Level; and the correlation of between performances in clinical training and OSCE was examined. [Results] Performances in OSCE and clinical training (attitude, skills) were improved by the introduction of the education system using OSCE, but no significant correlation was observed in the relationship between performances in OSCE and clinical training. [Conclusion] Further studies should be conducted aiming at the standardization of clinical skill evaluation in postgraduate education to establish an education system using OSCE. PMID:24259918
NASA Astrophysics Data System (ADS)
Gupta, Arun; Kim, Kyeong Yun; Hwang, Donghwi; Lee, Min Sun; Lee, Dong Soo; Lee, Jae Sung
2018-06-01
SPECT plays important role in peptide receptor targeted radionuclide therapy using theranostic radionuclides such as Lu-177 for the treatment of various cancers. However, SPECT studies must be quantitatively accurate because the reliable assessment of tumor uptake and tumor-to-normal tissue ratios can only be performed using quantitatively accurate images. Hence, it is important to evaluate performance parameters and quantitative accuracy of preclinical SPECT systems for therapeutic radioisotopes before conducting pre- and post-therapy SPECT imaging or dosimetry studies. In this study, we evaluated system performance and quantitative accuracy of NanoSPECT/CT scanner for Lu-177 imaging using point source and uniform phantom studies. We measured recovery coefficient, uniformity, spatial resolution, system sensitivity and calibration factor for mouse whole body standard aperture. We also performed the experiments using Tc-99m to compare the results with that of Lu-177. We found that the recovery coefficient of more than 70% for Lu-177 at the optimum noise level when nine iterations were used. The spatial resolutions of Lu-177 with and without adding uniform background was comparable to that of Tc-99m in axial, radial and tangential directions. System sensitivity measured for Lu-177 was almost three times less than that of Tc-99m.
On Applying the Prognostic Performance Metrics
NASA Technical Reports Server (NTRS)
Saxena, Abhinav; Celaya, Jose; Saha, Bhaskar; Saha, Sankalita; Goebel, Kai
2009-01-01
Prognostics performance evaluation has gained significant attention in the past few years. As prognostics technology matures and more sophisticated methods for prognostic uncertainty management are developed, a standardized methodology for performance evaluation becomes extremely important to guide improvement efforts in a constructive manner. This paper is in continuation of previous efforts where several new evaluation metrics tailored for prognostics were introduced and were shown to effectively evaluate various algorithms as compared to other conventional metrics. Specifically, this paper presents a detailed discussion on how these metrics should be interpreted and used. Several shortcomings identified, while applying these metrics to a variety of real applications, are also summarized along with discussions that attempt to alleviate these problems. Further, these metrics have been enhanced to include the capability of incorporating probability distribution information from prognostic algorithms as opposed to evaluation based on point estimates only. Several methods have been suggested and guidelines have been provided to help choose one method over another based on probability distribution characteristics. These approaches also offer a convenient and intuitive visualization of algorithm performance with respect to some of these new metrics like prognostic horizon and alpha-lambda performance, and also quantify the corresponding performance while incorporating the uncertainty information.
Evaluation of a New Mean Scaled and Moment Adjusted Test Statistic for SEM
ERIC Educational Resources Information Center
Tong, Xiaoxiao; Bentler, Peter M.
2013-01-01
Recently a new mean scaled and skewness adjusted test statistic was developed for evaluating structural equation models in small samples and with potentially nonnormal data, but this statistic has received only limited evaluation. The performance of this statistic is compared to normal theory maximum likelihood and 2 well-known robust test…
Technology Evaluation Tools and Teacher Performance in Public Schools
ERIC Educational Resources Information Center
Stonehouse, Pauline; Keengwe, Jared
2013-01-01
The purpose of this study was, (a) to describe the introduction of mVAL software and Charlotte Danielson Rubrics (CDR) as teacher evaluation tools; (b) to compare the process and outcomes of the new initiative with traditional systems; and (c) to evaluate the software from the perspective of participants in the system. This study highlights the…
Systems Analysis Of Advanced Coal-Based Power Plants
NASA Technical Reports Server (NTRS)
Ferrall, Joseph F.; Jennings, Charles N.; Pappano, Alfred W.
1988-01-01
Report presents appraisal of integrated coal-gasification/fuel-cell power plants. Based on study comparing fuel-cell technologies with each other and with coal-based alternatives and recommends most promising ones for research and development. Evaluates capital cost, cost of electricity, fuel consumption, and conformance with environmental standards. Analyzes sensitivity of cost of electricity to changes in fuel cost, to economic assumptions, and to level of technology. Recommends further evaluation of integrated coal-gasification/fuel-cell integrated coal-gasification/combined-cycle, and pulverized-coal-fired plants. Concludes with appendixes detailing plant-performance models, subsystem-performance parameters, performance goals, cost bases, plant-cost data sheets, and plant sensitivity to fuel-cell performance.
[Multifactorial method for assessing the physical work capacity of mice].
Dubovik, B V; Bogomazov, S D
1987-01-01
Based on the swimming test according to Kiplinger, in experiments on (CBA X C57BL)F1 mice there were elaborated criteria for animal performance evaluation in the process of repeated swimming of a standard distance thus measuring power, volume of work and rate of the fatigue development in relative units. From the study of effects of sydnocarb, bemethyl and phenazepam on various parameters of physical performance of mice a conclusion was made that the proposed method provides a more informative evaluation of the pharmacological effect on physical performance of animals as compared to the methods based on the record of time of performing the load.
Guo-Qiang, Zhang; Yan, Huang; Licong, Cui
2017-01-01
We introduce RGT, Retrospective Ground-Truthing, as a surrogate reference standard for evaluating the performance of automated Ontology Quality Assurance (OQA) methods. The key idea of RGT is to use cumulative SNOMED CT changes derived from its regular longitudinal distributions by the official SNOMED CT editorial board as a partial, surrogate reference standard. The contributions of this paper are twofold: (1) to construct an RGT reference set for SNOMED CT relational changes; and (2) to perform a comparative evaluation of the performances of lattice, non-lattice, and randomized relational error detection methods using the standard precision, recall, and geometric measures. An RGT relational-change reference set of 32,241 IS-A changes were constructed from 5 U.S. editions of SNOMED CT from September 2014 to September 2016, with reversals and changes due to deletion or addition of new concepts excluded. 68,849 independent non-lattice fragments, 118,587 independent lattice fragments, and 446,603 relations were extracted from the SNOMED CT March 2014 distribution. Comparative performance analysis of smaller (less than 15) lattice vs. non-lattice fragments was also given to approach the more realistic setting in which such methods may be applied. Among the 32,241 IS-A changes, independent non-lattice fragments covered 52.8% changes with 26.4% precision with a G-score of 0.373. Even though this G-score is significantly lower in comparison to those in information retrieval, it breaks new ground in that such evaluations have never performed before in the highly discovery-oriented setting of OQA. PMID:29854262
Guo-Qiang, Zhang; Yan, Huang; Licong, Cui
2017-01-01
We introduce RGT, Retrospective Ground-Truthing, as a surrogate reference standard for evaluating the performance of automated Ontology Quality Assurance (OQA) methods. The key idea of RGT is to use cumulative SNOMED CT changes derived from its regular longitudinal distributions by the official SNOMED CT editorial board as a partial, surrogate reference standard. The contributions of this paper are twofold: (1) to construct an RGT reference set for SNOMED CT relational changes; and (2) to perform a comparative evaluation of the performances of lattice, non-lattice, and randomized relational error detection methods using the standard precision, recall, and geometric measures. An RGT relational-change reference set of 32,241 IS-A changes were constructed from 5 U.S. editions of SNOMED CT from September 2014 to September 2016, with reversals and changes due to deletion or addition of new concepts excluded. 68,849 independent non-lattice fragments, 118,587 independent lattice fragments, and 446,603 relations were extracted from the SNOMED CT March 2014 distribution. Comparative performance analysis of smaller (less than 15) lattice vs. non-lattice fragments was also given to approach the more realistic setting in which such methods may be applied. Among the 32,241 IS-A changes, independent non-lattice fragments covered 52.8% changes with 26.4% precision with a G-score of 0.373. Even though this G-score is significantly lower in comparison to those in information retrieval, it breaks new ground in that such evaluations have never performed before in the highly discovery-oriented setting of OQA.
Double Density Dual Tree Discrete Wavelet Transform implementation for Degraded Image Enhancement
NASA Astrophysics Data System (ADS)
Vimala, C.; Aruna Priya, P.
2018-04-01
Wavelet transform is a main tool for image processing applications in modern existence. A Double Density Dual Tree Discrete Wavelet Transform is used and investigated for image denoising. Images are considered for the analysis and the performance is compared with discrete wavelet transform and the Double Density DWT. Peak Signal to Noise Ratio values and Root Means Square error are calculated in all the three wavelet techniques for denoised images and the performance has evaluated. The proposed techniques give the better performance when comparing other two wavelet techniques.
Evaluative-feedback stimuli selectively activate the self-related brain area: an fMRI study.
Pan, Xiaohong; Hu, Yang; Li, Lei; Li, Jianqi
2009-11-06
Evaluative-feedback, occurring in our daily life, generally contains subjective appraisal of one's specific abilities and personality characteristics besides objective right-or-wrong information. Traditional psychological researches have proved it to be important in building up one's self-concept; however, the neural basis underlying its cognitive processing remains unclear. The present neuroimaging study revealed the mechanism of evaluative-feedback processing at the neural level. 19 healthy Chinese subjects participated in this experiment, and completed the time-estimation task to better their performance according to four types of feedback, namely positive evaluative- and performance-feedback as well as negative evaluative- and performance-feedback. Neuroimaging findings showed that evaluative- rather than performance-feedback can induce increased activities mainly distributed in the cortical midline structures (CMS), including medial prefrontal cortex (BA 8/9)/anterior cigulate cortex (ACC, BA 20), precuneus (BA 7/31) adjacent to posterior cingulate gyrus (PCC, BA 23) of both hemispheres, as well as right inferior lobule (BA 40). This phenomenon can provide evidence that evaluative-feedback may significantly elicit the self-related processing in our brain. In addition, our results also revealed that more brain areas, particularly some self-related neural substrates were activated by the positive evaluative-feedback, in comparative with the negative one. In sum, this study suggested that evaluative-feedback was closely correlated with the self-concept processing, which distinguished it from the performance-feedback.
Data Container Study for Handling Array-based Data Using Rasdaman, Hive, Spark, and MongoDB
NASA Astrophysics Data System (ADS)
Xu, M.; Hu, F.; Yu, M.; Scheele, C.; Liu, K.; Huang, Q.; Yang, C. P.; Little, M. M.
2016-12-01
Geoscience communities have come up with various big data storage solutions, such as Rasdaman and Hive, to address the grand challenges for massive Earth observation data management and processing. To examine the readiness of current solutions in supporting big Earth observation, we propose to investigate and compare four popular data container solutions, including Rasdaman, Hive, Spark, and MongoDB. Using different types of spatial and non-spatial queries, datasets stored in common scientific data formats (e.g., NetCDF and HDF), and two applications (i.e. dust storm simulation data mining and MERRA data analytics), we systematically compare and evaluate the feature and performance of these four data containers in terms of data discover and access. The computing resources (e.g. CPU, memory, hard drive, network) consumed while performing various queries and operations are monitored and recorded for the performance evaluation. The initial results show that 1) Rasdaman has the best performance for queries on statistical and operational functions, and supports NetCDF data format better than HDF; 2) Rasdaman clustering configuration is more complex than the others; 3) Hive performs better on single pixel extraction from multiple images; and 4) Except for the single pixel extractions, Spark performs better than Hive and its performance is close to Rasdaman. A comprehensive report will detail the experimental results, and compare their pros and cons regarding system performance, ease of use, accessibility, scalability, compatibility, and flexibility.
Evaluation of grapevine as a host for the glassy-winged sharpshooter
USDA-ARS?s Scientific Manuscript database
Grapevine was evaluated as a feeding and oviposition host for the glassy-winged sharpshooter. Two sets of experiments were conducted. The first set compared performance and preference of glassy-winged sharpshooter females for grapevine (cv. Chardonnay) versus cowpea (Vigna unguiculata cultivar black...
COMPUTERIZED NEEDS-ORIENTED QUALITY MEASUREMENT EVALUATION SYSTEM (CONQUEST)
CONQUEST is an easy-to-use quality improvement software tool that uses a common structure and language to help users identity, understand, compare, evaluate, and select among 1,200 clinical performance measures that can be used to assess and improve quality of care. CONQUEST's in...
Comparative Evaluation of Alternative Disinfectants for Drinking Water and Wastewater Treatment
The Water Supply and Water Resources Division (WSWRD) of the U.S. Environmental Protection Agency’s (EPA) National Risk Management Research Laboratory (NRMRL) initiated a research program to evaluate the performance of various disinfectants that could potentially be used in drink...
Optical performance of multifocal soft contact lenses via a single-pass method.
Bakaraju, Ravi C; Ehrmann, Klaus; Falk, Darrin; Ho, Arthur; Papas, Eric
2012-08-01
A physical model eye capable of carrying soft contact lenses (CLs) was used as a platform to evaluate optical performance of several commercial multifocals (MFCLs) with high- and low-add powers and a single-vision control. Optical performance was evaluated at three pupil sizes, six target vergences, and five CL-correcting positions using a spatially filtered monochromatic (632.8 nm) light source. The various target vergences were achieved by using negative trial lenses. A photosensor in the retinal plane recorded the image point-spread that enabled the computation of visual Strehl ratios. The centration of CLs was monitored by an additional integrated en face camera. Hydration of the correcting lens was maintained using a humidity chamber and repeated instillations of rewetting saline drops. All the MFCLs reduced performance for distance but considerably improved performance along the range of distance to near target vergences, relative to the single-vision CL. Performance was dependent on add power, design, pupil, and centration of the correcting CLs. Proclear (D) design produced good performance for intermediate vision, whereas Proclear (N) design performed well at near vision (p < 0.05). AirOptix design exhibited good performance for distance and intermediate vision. PureVision design showed improved performance across the test vergences, but only for pupils ≥4 mm in diameter. Performance of Acuvue bifocal was comparable with other MFCLs, but only for pupils >4 mm in diameter. Acuvue Oasys bifocal produced performance comparable with single-vision CL for most vergences. Direct measurement of single-pass images at the retinal plane of a physical model eye used in conjunction with various MFCLs is demonstrated. This method may have utility in evaluating the relative effectiveness of commercial and prototype designs.
El-Wakeel, Amr S; Mohammed, Nazmi A; Aly, Moustafa H
2016-09-10
In this work, a free space optical communication (FSO) link is proposed and utilized to explore and evaluate the FSO link performance under the joint occurrence of the atmospheric scattering and turbulence phenomena for 850 and 1550 nm operation. Diffraction and nondiffraction-limited systems are presented and evaluated for both wavelengths' operation, considering far-field conditions under different link distances. Bit error rate, pointing error angles, beam divergence angles, and link distance are the main performance indicators that are used to evaluate and compare the link performance under different system configurations and atmospheric phenomena combinations. A detailed study is performed to provide the merits of this work. For both far-field diffraction-limited and nondiffraction-limited systems, it is concluded that 1550 nm system operation is better than 850 nm for the whole presented joint occurrences of atmospheric scattering and turbulence.
Extended performance electric propulsion power processor design study. Volume 2: Technical summary
NASA Technical Reports Server (NTRS)
Biess, J. J.; Inouye, L. Y.; Schoenfeld, A. D.
1977-01-01
Electric propulsion power processor technology has processed during the past decade to the point that it is considered ready for application. Several power processor design concepts were evaluated and compared. Emphasis was placed on a 30 cm ion thruster power processor with a beam power rating supply of 2.2KW to 10KW for the main propulsion power stage. Extension in power processor performance were defined and were designed in sufficient detail to determine efficiency, component weight, part count, reliability and thermal control. A detail design was performed on a microprocessor as the thyristor power processor controller. A reliability analysis was performed to evaluate the effect of the control electronics redesign. Preliminary electrical design, mechanical design and thermal analysis were performed on a 6KW power transformer for the beam supply. Bi-Mod mechanical, structural and thermal control configurations were evaluated for the power processor and preliminary estimates of mechanical weight were determined.
Gaspar, Pâmela Cristina; Wohlke, Bruna Lovizutto Protti; Brunialti, Milena Karina Coló; Pires, Ana Flávia; Kohiyama, Igor Massaki; Salomão, Reinaldo; Alonso Neto, José Boullosa; Júnior, Orlando da Costa Ferreira; Franchini, Miriam; Bazzo, Maria Luiza; Benzaken, Adele Schwartz
2018-05-01
The National Network for CD4+ T-lymphocyte counting of Brazil comprises 93 laboratories. This study reports the laboratory performances achieved in external quality assessment (EQA) rounds provides by Ministry of Health to evaluate the quality of the kits used and the performance of test by the technicians.Ten EQA rounds were analyzed according the EQA criteria aimed to evaluate individual laboratory performance on the basis of the accuracy of their results compared to the general mean obtained by all participating laboratories and the reproducibility of the results obtained between 2 samples from the same donor.The percentage of approved and failed laboratories in the EQAs tends to follow a uniform pattern. Since 2011, approval has remained above 80% and the failure rate has never exceeded 15%.EQA is very important to evaluate the performance of the laboratories, to identify monitor, and to resolve errors as quickly as possible.
Lithographic performance comparison with various RET for 45-nm node with hyper NA
NASA Astrophysics Data System (ADS)
Adachi, Takashi; Inazuki, Yuichi; Sutou, Takanori; Kitahata, Yasuhisa; Morikawa, Yasutaka; Toyama, Nobuhito; Mohri, Hiroshi; Hayashi, Naoya
2006-05-01
In order to realize 45 nm node lithography, strong resolution enhancement technology (RET) and water immersion will be needed. In this research, we discussed about various RET performance comparison for 45 nm node using 3D rigorous simulation. As a candidate, we chose binary mask (BIN), several kinds of attenuated phase-shifting mask (att-PSM) and chrome-less phase-shifting lithography mask (CPL). The printing performance was evaluated and compared for each RET options, after the optimizing illumination conditions, mask structure and optical proximity correction (OPC). The evaluation items of printing performance were CD-DOF, contrast-DOF, conventional ED-window and MEEF, etc. It's expected that effect of mask 3D topography becomes important at 45 nm node, so we argued about not only the case of ideal structures, but also the mask topography error effects. Several kinds of mask topography error were evaluated and we confirmed how these errors affect to printing performance.
Schacht, M J; Toustrup, C B; Madsen, L B; Martiny, M S; Larsen, B B; Simonsen, J T
2016-10-01
Rapid on-site evaluation (ROSE) of endobronchial ultrasound-guided transbronchial needle aspiration (EBUS-TBNA) followed by a subsequent preliminary adequacy assessment and a preliminary diagnosis, was performed at Aarhus University Hospital by biomedical scientists (BMS). The aim of this study was to evaluate the BMS accuracy of ROSE adequacy assessment, the preliminary adequacy assessment and the preliminary diagnosis as compared with the cytopathologist-rendered final adequacy assessment and final diagnosis. The BMS-rendered assessments for 717 sites from 319 consecutive patients over a 4-month period were compared with the cytopathologist-rendered assessments. Comparisons of adequacy and preliminary diagnoses were based on inter-observer Cohen's Kappa coefficient with a 95% confidence interval (CI). Strong correlations between ROSE and final adequacy assessments [Kappa coefficient of 0.90 (CI: 0.85-0.96)] and between the preliminary and final adequacy assessments [Kappa coefficient of 0.93 (CI: 0.87-0.99)] were found. As for the correlation between the preliminary and final diagnoses, the Kappa coefficient was 0.99 (CI: 0.98-1). Both ROSE and preliminary adequacy assessments as well as preliminary diagnoses, all performed by BMS, were highly accurate when compared with the final assessment by the cytopathologist. © 2016 John Wiley & Sons Ltd.
NASA Astrophysics Data System (ADS)
Klomp, Sander; van der Sommen, Fons; Swager, Anne-Fré; Zinger, Svitlana; Schoon, Erik J.; Curvers, Wouter L.; Bergman, Jacques J.; de With, Peter H. N.
2017-03-01
Volumetric Laser Endomicroscopy (VLE) is a promising technique for the detection of early neoplasia in Barrett's Esophagus (BE). VLE generates hundreds of high resolution, grayscale, cross-sectional images of the esophagus. However, at present, classifying these images is a time consuming and cumbersome effort performed by an expert using a clinical prediction model. This paper explores the feasibility of using computer vision techniques to accurately predict the presence of dysplastic tissue in VLE BE images. Our contribution is threefold. First, a benchmarking is performed for widely applied machine learning techniques and feature extraction methods. Second, three new features based on the clinical detection model are proposed, having superior classification accuracy and speed, compared to earlier work. Third, we evaluate automated parameter tuning by applying simple grid search and feature selection methods. The results are evaluated on a clinically validated dataset of 30 dysplastic and 30 non-dysplastic VLE images. Optimal classification accuracy is obtained by applying a support vector machine and using our modified Haralick features and optimal image cropping, obtaining an area under the receiver operating characteristic of 0.95 compared to the clinical prediction model at 0.81. Optimal execution time is achieved using a proposed mean and median feature, which is extracted at least factor 2.5 faster than alternative features with comparable performance.
Comparison of parameter-adapted segmentation methods for fluorescence micrographs.
Held, Christian; Palmisano, Ralf; Häberle, Lothar; Hensel, Michael; Wittenberg, Thomas
2011-11-01
Interpreting images from fluorescence microscopy is often a time-consuming task with poor reproducibility. Various image processing routines that can help investigators evaluate the images are therefore useful. The critical aspect for a reliable automatic image analysis system is a robust segmentation algorithm that can perform accurate segmentation for different cell types. In this study, several image segmentation methods were therefore compared and evaluated in order to identify the most appropriate segmentation schemes that are usable with little new parameterization and robustly with different types of fluorescence-stained cells for various biological and biomedical tasks. The study investigated, compared, and enhanced four different methods for segmentation of cultured epithelial cells. The maximum-intensity linking (MIL) method, an improved MIL, a watershed method, and an improved watershed method based on morphological reconstruction were used. Three manually annotated datasets consisting of 261, 817, and 1,333 HeLa or L929 cells were used to compare the different algorithms. The comparisons and evaluations showed that the segmentation performance of methods based on the watershed transform was significantly superior to the performance of the MIL method. The results also indicate that using morphological opening by reconstruction can improve the segmentation of cells stained with a marker that exhibits the dotted surface of cells. Copyright © 2011 International Society for Advancement of Cytometry.
Diagnostic Accuracy of Rating Scales for Attention-Deficit/Hyperactivity Disorder: A Meta-analysis.
Chang, Ling-Yin; Wang, Mei-Yeh; Tsai, Pei-Shan
2016-03-01
The Child Behavior Checklist-Attention Problem (CBCL-AP) scale and Conners Rating Scale-Revised (CRS-R) are commonly used behavioral rating scales for diagnosing attention-deficit/hyperactivity disorder (ADHD) in children and adolescents. To evaluate and compare the diagnostic performance of CBCL-AP and CRS-R in diagnosing ADHD in children and adolescents. PubMed, Ovid Medline, and other relevant electronic databases were searched for articles published up to May 2015. We included studies evaluating the diagnostic performance of either CBCL-AP scale or CRS-R for diagnosing ADHD in pediatric populations in comparison with a defined reference standard. Bivariate random effects models were used for pooling and comparing diagnostic performance. We identified and evaluated 14 and 11 articles on CBCL-AP and CRS-R, respectively. The results revealed pooled sensitivities of 0.77, 0.75, 0.72, and 0.83 and pooled specificities of 0.73, 0.75, 0.84, and 0.84 for CBCL-AP, Conners Parent Rating Scale-Revised, Conners Teacher Rating Scale-Revised, and Conners Abbreviated Symptom Questionnaire (ASQ), respectively. No difference was observed in the diagnostic performance of the various scales. Study location, age of participants, and percentage of female participants explained the heterogeneity in the specificity of the CBCL-AP. CBCL-AP and CRS-R both yielded moderate sensitivity and specificity in diagnosing ADHD. According to the comparable diagnostic performance of all examined scales, ASQ may be the most effective diagnostic tool in assessing ADHD because of its brevity and high diagnostic accuracy. CBCL is recommended for more comprehensive assessments. Copyright © 2016 by the American Academy of Pediatrics.
On-Road Driving Performance by Persons with Hemianopia and Quadrantanopia
Wood, Joanne M.; McGwin, Gerald; Elgin, Jennifer; Vaphiades, Michael S.; Braswell, Ronald A.; DeCarlo, Dawn K.; Kline, Lanning B.; Meek, G. Christine; Searcey, Karen; Owsley, Cynthia
2009-01-01
Purpose This study was designed to examine the on-road driving performance of drivers with hemianopia and quadrantanopia compared with age-matched controls. Methods Participants included persons with hemianopia or quadrantanopia and those with normal visual fields. Visual and cognitive function tests were administered, including confirmation of hemianopia and quadrantanopia through visual field testing. Driving performance was assessed using a dual-brake vehicle and monitored by a certified driving rehabilitation specialist. The route was 14.1 miles of city and interstate driving. Two “back-seat” evaluators masked to drivers’ clinical characteristics independently assessed driving performance using a standard scoring system. Results Participants were 22 persons with hemianopia and 8 with quadrantanopia (mean age, 53 ± 20 years) and 30 participants with normal fields (mean age, 52 ± 19 years). Inter-rater agreement for back-seat evaluators was 96%. All drivers with normal fields were rated as safe to drive, while 73% (16/22) of hemianopic and 88% (7/8) of quadrantanopic drivers received safe ratings. Drivers with hemianopia or quadrantanopia who displayed on-road performance problems tended to have difficulty with lane position, steering steadiness, and gap judgment compared to controls. Clinical characteristics associated with unsafe driving were slowed visual processing speed, reduced contrast sensitivity and visual field sensitivity. Conclusions Some drivers with hemianopia or quadrantanopia are fit to drive compared with age-matched control drivers. Results call into question the fairness of governmental policies that categorically deny licensure to persons with hemianopia or quadrantanopia without the opportunity for on-road evaluation. PMID:18936138
NASA Astrophysics Data System (ADS)
Vorotnikov, A. A.; Klimov, D. D.; Romash, E. V.; Bashevskaya, O. S.; Poduraev, Yu. V.; Bazykyan, E. A.; Chunihin, A. A.
2018-03-01
Industrial robots perform technological operations, such as spot and arc welding, machining and laser cutting along different trajectories within their performance characteristics. The evaluation of these characteristics is carried out according to the criteria of the standard ISO 9283. The criteria of this standard are applicable in industrial manufacturing, but not in the medical industry, as they are not developed in the framework of medical tasks. Therefore, it is necessary to evaluate according to criteria built on different principles. In this article, the question of comparative evaluation of trajectories from program movements of a robot and manual movements of a surgeon, arising during the development of robotic medical complexes using industrial robots, is considered. A comparative evaluation is required to prove the expediency of automating medical operations in maxillofacial surgery. This study focuses on the estimation of velocity accuracy of a medical instrument. To obtain the velocity of the medical instrument, coordinates of the trajectory points from the program movements of the robot KUKA LWR4+ and trajectories from the manual movements of a professional surgeon have been measured. The measurement was carried out using a coordinate measuring machine, the laser tracker Leica LTD800. The accuracy estimation was carried out by two criteria: the criterion set out in the ISO 9283 standard, and the developed alternative criterion, the description of which is presented in this article. A quantitative comparative evaluation of the trajectories of a robot and a surgeon was obtained.
Berteau, Cecile; Schwarzenbach, Florence; Donazzolo, Yves; Latreille, Mathilde; Berube, Julie; Abry, Herve; Cotten, Joël; Feger, Celine; Laurent, Philippe E
2010-10-05
A disposable autoinjector was developed for subcutaneous (SC) self-injection by patients with chronic diseases. To verify its performance and evaluate its acceptance, a clinical study was conducted in healthy volunteers, comparing SC injections performed by subjects using the autoinjector with SC injections performed by nurses using a syringe. This was a randomized, single-center, crossover study comparing SC self-injection using an autoinjector with SC nurse-administered injection using a syringe. Two volumes (0.2 mL and 1 mL) were injected into healthy volunteers. Study objectives included assessment of the accuracy and consistency of the volume injected by the injection systems, and skin reaction and pain associated with the injection. The fluid depot in the SC tissue layer was evaluated by ultrasound. Subject acceptance was evaluated using questionnaires on attitudes and emotions towards the injection technique, and challenged by seeking the subjects' preferred system for a final study injection or future treatment. A total of 960 injections (480 with autoinjector, 480 with syringe) were performed in 40 subjects. There were no significant differences in mean fluid leakage and injected volumes between the systems. Pain associated with the injection was significantly lower with the auto-injector than with the syringe. Local skin reaction at the injection site was overall satisfactory. Injections were appropriately performed by all subjects. At study end, all 40 subjects preferred the autoinjector for a final study injection and for future treatment. This study indicated that the autoinjector used by the subject was similar to a syringe used by a nurse in terms of performance and safety in administering the injections, and better in terms of pain, overall acceptance, and preference.
Kahsay, Getu; Broeckhoven, Ken; Adams, Erwin; Desmet, Gert; Cabooter, Deirdre
2014-05-01
After the great commercial success of sub-3 µm superficially porous particles, vendors are now also starting to commercialize 5 µm superficially porous particles, as an alternative to their fully porous counterparts which are routinely used in pharmaceutical analysis. In this study, the performance of 5 µm superficially porous particles was compared to that of fully porous 5 µm particles in terms of efficiency, separation performance and loadability on a conventional HPLC instrument. Van Deemter and kinetic plots were first used to evaluate the efficiency and performance of both particle types using alkylphenones as a test mixture. The van Deemter and kinetic plots showed that the superficially porous particles provide a superior kinetic performance compared to the fully porous particles over the entire relevant range of separation conditions, when both support types were evaluated at the same operating pressure. The same observations were made both for isocratic and gradient analysis. The superior performance was further demonstrated for the separation of a pharmaceutical compound (griseofulvin) and its impurities, where a gain in analysis time of around 2 could be obtained using the superficially porous particles. Finally, both particle types were evaluated in terms of loadability by plotting the resolution of the active pharmaceutical ingredient and its closest impurity as a function of the signal-to-noise ratio obtained for the smallest impurity. It was demonstrated that the superficially porous particles show better separation performance for griseofulvin and its impurities without significantly compromising sensitivity due to loadability issues in comparison with their fully porous counterparts. Moreover these columns can be used on conventional equipment without modifications to obtain a significant improvement in analysis time. Copyright © 2014 Elsevier B.V. All rights reserved.
Israely, Sharon; Carmeli, Eli
2017-01-01
Evidence-based studies regarding deficits in handwriting performance relative to hand reaching and grasping after a stroke are lacking. To evaluate the extent of damage to handwriting skills compared to arm reach and grasp task among post-stroke patients. Eighteen patients and 19 healthy subjects were recruited to this case-control study. Patients were evaluated 15.2 days (±6.5) after the stroke using a Computerized Penmanship Evaluation Tool, surface Electromyography and Fugl-Meyer assessment. This study compared motor deficits in hand reaching and grasping and in handwriting between stroke patients and healthy subjects. Damage to handwriting performance relative to hand reaching and grasping skills was also evaluated. Significant differences were found between groups in handwriting performance (p < 0.05). The performance of the trapezius, biceps, and triceps muscles can predict 63.5% of the variance in the ability to write a short sentence (p < 0.023). Pen pressure can predict 74.9% of the hand motor performance from Fugl-Meyer assessment (p < 0.05). Handwriting was more damaged than was the pattern of activation of the proximal muscles of the shoulder and arm (p < 0.05). FM scores were highly, negatively correlated with the in-air writing time across tasks (r = -0.819, p < 0.004). This study confirms the clinical observation that dexterity skills are more damaged than are arm forward reach after a stroke. However, these differences in motor performance were not significant in mildly disabled patients, demonstrating the feasibility of handwriting rehabilitation in these patients. Therefore, we modestly recommend focusing on handwriting rehabilitation of the hemiparetic upper extremity in mildly impaired patients after a stroke.
Castillo, Jordi; Gallart, Aberto; Rodríguez, Encarnación; Castillo, Jorge; Gomar, Carmen
2018-06-01
The objective of this study was to compare the immediate and 6-month efficacy of basic life support (BLS) and automatic external defibrillation (AED) training using standard or blended methods. First-year students of medicine and nursing (n = 129) were randomly assigned to a control group (face-to-face training based on the European Resuscitation Council [ERC] Guidelines) or to an experimental group that trained with a self-training video, a new website, a Moodle platform, an intelligent manikin, and 45 min of instructor presence. Both groups were homogeneous and were evaluated identically. Theoretical knowledge was evaluated using a multi-choice questionnaire (MCQ). Skill performance was evaluated by the instructor's rubric and on a high-fidelity Resusci Anne QCPR manikin. Immediately after the course, there were no statistically significant differences in knowledge between the two groups. The median score of practical evaluation assessed by the instructor was significantly better in the experimental group (8.15, SD 0.93 vs 7.7, SD 1.18; P = 0.02). No differences between groups were found when using a high-fidelity manikin to evaluate chest compressions and lung inflations. At six months, the scores in knowledge and skill performance were significantly lower compared to the evaluations at the end of the instruction, but they remained still higher compared to baseline. The experimental group had higher scores in practical skills evaluated by the instructor than the control group (7.44, SD 1.85 vs 6.10, SD 2.6; P = 0.01). The blended method provides the same or even higher levels of knowledge and skills than standard instruction both immediately after the course and six months later. Copyright © 2018 Elsevier Ltd. All rights reserved.
ERIC Educational Resources Information Center
Cieslar, Whitney; McLaughlin, T. F.; Derby, K. Mark
2008-01-01
The purpose of this study was to evaluate the effects of the copy, cover, and compare (CCC) procedure on improving the mathematics and spelling performance of a freshman attending a high school special education class. The participant was a 16-year-old high school student enrolled in special education classes for 3 periods of the school day. Math…
NASA Astrophysics Data System (ADS)
Dar, Aasif Bashir; Jha, Rakesh Kumar
2017-03-01
Various dispersion compensation units are presented and evaluated in this paper. These dispersion compensation units include dispersion compensation fiber (DCF), DCF merged with fiber Bragg grating (FBG) (joint technique), and linear, square root, and cube root chirped tanh apodized FBG. For the performance evaluation 10 Gb/s NRZ transmission system over 100-km-long single-mode fiber is used. The three chirped FBGs are optimized individually to yield pulse width reduction percentage (PWRP) of 86.66, 79.96, 62.42% for linear, square root, and cube root, respectively. The DCF and Joint technique both provide a remarkable PWRP of 94.45 and 96.96%, respectively. The performance of optimized linear chirped tanh apodized FBG and DCF is compared for long-haul transmission system on the basis of quality factor of received signal. For both the systems maximum transmission distance is calculated such that quality factor is ≥ 6 at the receiver and result shows that performance of FBG is comparable to that of DCF with advantages of very low cost, small size and reduced nonlinear effects.
Evaluation of alternative phase change materials for energy storage in solar dynamic applications
NASA Technical Reports Server (NTRS)
Crane, R. A.; Dustin, M. O.
1988-01-01
The performance of fluoride salt and metallic thermal energy storage materials are compared in terms of basic performance as applied to solar dynamic power generation. Specific performance considerations include uniformity of cycle inlet temperature, peak cavity temperature, TES utilization, and system weights. Also investigated were means of enhancing the thermal conductivity of the salts and its effect on the system performance.
ERIC Educational Resources Information Center
White, Mona E.; And Others
Comprehensive Achievement Monitoring (CAM) is a system designed to provide a curriculum defined in terms of performance objectives, test items to measure student performance on each objective, a set of comparable test forms to evaluate performance, testing throughout the period of the course, computerized analysis and reporting of results after…
Directly Comparing Computer and Human Performance in Language Understanding and Visual Reasoning.
ERIC Educational Resources Information Center
Baker, Eva L.; And Others
Evaluation models are being developed for assessing artificial intelligence (AI) systems in terms of similar performance by groups of people. Natural language understanding and vision systems are the areas of concentration. In simplest terms, the goal is to norm a given natural language system's performance on a sample of people. The specific…
ERIC Educational Resources Information Center
Chapman, Diane D.
2004-01-01
This survey-based study addressed a perceived gap between training performance evaluation practice and decision-making criteria required in business. Training professionals and non-training managers in North Carolina were surveyed. The study found that the groups differ in the performance measures that motivate them to act on training issues.…
A Comparative Assessment of the Performance of Select Higher Education Institutes in India
ERIC Educational Resources Information Center
Sahney, Sangeeta; Thakkar, Jitesh
2016-01-01
Purpose: The purpose of this paper is to evaluate the performance of select technical higher education institutes of national importance in India. This helps to judge the efficiency and effectiveness of an institute to provide valuable insights on performance measurement and effectiveness not only to the respective institute but also to…
Brewin, James; Tang, Jessica; Dasgupta, Prokar; Khan, Muhammad S; Ahmed, Kamran; Bello, Fernando; Kneebone, Roger; Jaye, Peter
2015-07-01
To evaluate the face, content and construct validity of the distributed simulation (DS) environment for technical and non-technical skills training in endourology. To evaluate the educational impact of DS for urology training. DS offers a portable, low-cost simulated operating room environment that can be set up in any open space. A prospective mixed methods design using established validation methodology was conducted in this simulated environment with 10 experienced and 10 trainee urologists. All participants performed a simulated prostate resection in the DS environment. Outcome measures included surveys to evaluate the DS, as well as comparative analyses of experienced and trainee urologist's performance using real-time and 'blinded' video analysis and validated performance metrics. Non-parametric statistical methods were used to compare differences between groups. The DS environment demonstrated face, content and construct validity for both non-technical and technical skills. Kirkpatrick level 1 evidence for the educational impact of the DS environment was shown. Further studies are needed to evaluate the effect of simulated operating room training on real operating room performance. This study has shown the validity of the DS environment for non-technical, as well as technical skills training. DS-based simulation appears to be a valuable addition to traditional classroom-based simulation training. © 2014 The Authors BJU International © 2014 BJU International Published by John Wiley & Sons Ltd.
Mitchell, Elizabeth O; Stewart, Greg; Bajzik, Olivier; Ferret, Mathieu; Bentsen, Christopher; Shriver, M Kathleen
2013-12-01
A multisite study was conducted to evaluate the performance of the Bio-Rad 4th generation GS HIV Combo Ag/Ab EIA versus Abbott 4th generation ARCHITECT HIV Ag/Ab Combo. The performance of two 3rd generation EIAs, Ortho Diagnostics Anti-HIV 1+2 EIA and Siemens HIV 1/O/2 was also evaluated. Study objective was comparison of analytical HIV-1 p24 antigen detection, sensitivity in HIV-1 seroconversion panels, specificity in blood donors and two HIV false reactive panels. Analytical sensitivity was evaluated with International HIV-1 p24 antigen standards, the AFFSAPS (pg/mL) and WHO 90/636 (IU/mL) standards; sensitivity in acute infection was compared on 55 seroconversion samples, and specificity was evaluated on 1000 negative blood donors and two false reactive panels. GS HIV Combo Ag/Ab demonstrated better analytical HIV antigen sensitivity compared to ARCHITECT HIV Ag/Ab Combo: 0.41 IU/mL versus 1.2 IU/mL (WHO) and 12.7 pg/mL versus 20.1 pg/mL (AFSSAPS); GS HIV Combo Ag/Ab EIA also demonstrated slightly better specificity compared to ARCHITECT HIV Ag/Ab Combo (100% versus 99.7%). The 4th generation HIV Combo tests detected seroconversion 7-11 days earlier than the 3rd generation HIV antibody only EIAs. Both 4th generation immunoassays demonstrated excellent performance in sensitivity, with the reduction of the serological window period (7-11 days earlier detection than the 3rd generation HIV tests). However, GS HIV Combo Ag/Ab demonstrated improved HIV antigen analytical sensitivity and slightly better specificity when compared to ARCHITECT HIV Ag/Ab Combo assay, with higher positive predictive values (PPV) for low prevalence populations. Copyright © 2013 Elsevier B.V. All rights reserved.
Lasalvia, Pieralessandro; Barahona-Correa, Julián Esteban; Romero-Alvernia, Diana Marcela; Gil-Tamayo, Sebastián; Castañeda-Cardona, Camilo; Bayona, Juan Gabriel; Triana, Juan José; Laserna, Andrés Felipe; Mejía-Torres, Miguel; Restrepo-Jimenez, Paula; Jimenez-Zapata, Juliana; Rosselli, Diego
2016-07-01
Pen devices offer advantages compared with vial and syringe (VaS). The purpose of this article was to evaluate efficacy of pen devices compared to VaS. A systematic review of literature was performed in 8 different databases. References were independently screened and selected. Primary observational or experimental studies comparing pen devices with VaS for insulin administrations were included. Studies on specific populations were excluded. Risk of bias was evaluated using appropriate tools. Data on glycosylated hemoglobin (HbA1c), hypoglycemia, adherence, persistence, patient preference, and quality of life (QOL) were collected. Meta-analysis was performed when appropriate. Heterogeneity and risk of publication bias were evaluated. Otherwise, descriptive analyses of the available data was done. In all, 10 348 articles were screened. A total of 17 studies were finally selected: 7 experimental and 10 analytical. The populations of the included articles were mainly composed of adults with type 2 diabetes mellitus. Important risk of bias was found in all of the articles, particularly experimental studies. Meta-analyses were performed for HbA1c, hypoglycemia, adherence and persistence. Pen device showed better results in mean HbA1c change, patients with hypoglycemia, adherence and persistence compared to VaS. No difference was observed in number of patients achieving <7% HbA1c. Preference studies showed a tendency favoring pen devices, however nonvalidated tools were used. One QoL study showed improvements in some subscales of SF-36. There is evidence that pen devices offer benefits in clinical and, less clearly, patient-reported outcomes compared to VaS for insulin administration. However, these results should be taken with caution. © 2016 Diabetes Technology Society.
Kinematic Analysis and Performance Evaluation of Novel PRS Parallel Mechanism
NASA Astrophysics Data System (ADS)
Balaji, K.; Khan, B. Shahul Hamid
2018-02-01
In this paper, a 3 DoF (Degree of Freedom) novel PRS (Prismatic-Revolute- Spherical) type parallel mechanisms has been designed and presented. The combination of striaght and arc type linkages for 3 DOF parallel mechanism is introduced for the first time. The performances of the mechanisms are evaluated based on the indices such as Minimum Singular Value (MSV), Condition Number (CN), Local Conditioning Index (LCI), Kinematic Configuration Index (KCI) and Global Conditioning Index (GCI). The overall reachable workspace of all mechanisms are presented. The kinematic measure, dexterity measure and workspace analysis for all the mechanism have been evaluated and compared.
Rutting performance of cold bituminous emulsion mixtures
NASA Astrophysics Data System (ADS)
Arshad, Ahmad Kamil; Ali, Noor Azilatom; Shaffie, Ekarizan; Hashim, Wardati; Rahman, Zanariah Abd
2017-10-01
Cold Bituminous Emulsion Mixture (CBEM) is an environmentally friendly alternative to hot mix asphalt (HMA) for road surfacing, due to its low energy requirements. However, CBEM has generally been perceived to be less superior in performance, compared to HMA. This paper details a laboratory study on the rutting performance of CBEM. The main objective of this study is to determine the Marshall properties of CBEM and to evaluate the rutting performance. The effect of cement in CBEM was also evaluated in this study. The specimens were prepared using Marshall Mix Design Method and rutting performance was evaluated using the Asphalt Pavement Analyzer (APA). Marshall Properties were analysed to confirm compliance with the PWD Malaysia's specification requirements. The rutting performance for specimens with cement was also found to perform better than specimens without cement. It can be concluded that Cold Bituminous Emulsion Mixtures (CBEM) with cement is a viable alternative to Hot Mix Asphalt (HMA) as their Marshall Properties and performance obtained from this study meets the requirements of the specifications. It is recommended that further study be conducted on CBEM for other performance criteria such as moisture susceptibility and fatigue.
The Effect of Training Data Set Composition on the Performance of a Neural Image Caption Generator
2017-09-01
objects was compared using the Metric for Evaluation of Translation with Explicit Ordering (METEOR) and Consensus-Based Image Description Evaluation...using automated scoring systems. Many such systems exist, including Bilingual Evaluation Understudy (BLEU), Consensus-Based Image Description Evaluation...shown to be essential to automated scoring, which correlates highly with human precision.5 CIDEr uses a system of consensus among the captions and
Applicability evaluation of Deep Eutectic Solvents-Cellulase system for lignocellulose hydrolysis.
Gunny, Ahmad Anas Nagoor; Arbain, Dachyar; Nashef, Enas Muen; Jamal, Parveen
2015-04-01
Deep Eutectic Solvents (DESs) have recently emerged as a new generation of ionic liquids for lignocellulose pretreatment. However, DESs contain salt components which tend to inactivate cellulase in the subsequent saccharification process. To alleviate this problem, it is necessary to evaluate the applicability of the DESs-Cellulase system. This was accomplished in the present study by first studying the stability of cellulase in the presence of selected DESs followed by applicability evaluation based on glucose production, energy consumption and kinetic performance. Results showed that the cellulase was able to retain more than 90% of its original activity in the presence of 10% (v/v) for glycerol based DES (GLY) and ethylene glycol based DES (EG). Furthermore, both DESs system exhibited higher glucose percentage enhancement and lower energy consumption as compared to diluted alkali system. Among the two DESs studied, EG showed comparatively better kinetic performance. Copyright © 2015 Elsevier Ltd. All rights reserved.
Assessing implicit gender bias in Medical Student Performance Evaluations.
Axelson, Rick D; Solow, Catherine M; Ferguson, Kristi J; Cohen, Michael B
2010-09-01
For medical schools, the increasing presence of women makes it especially important that potential sources of gender bias be identified and removed from student evaluation methods. Our study looked for patterns of gender bias in adjective data used to inform our Medical Student Performance Evaluations (MSPEs). Multigroup Confirmatory Factor Analysis (CFA) was used to model the latent structure of the adjectives attributed to students (n = 657) and to test for systematic scoring errors by gender. Gender bias was evident in two areas: (a) women were more likely than comparable men to be described as ''compassionate,'' ''sensitive,'' and ''enthusiastic'' and (b) men were more likely than comparable women to be seen as ''quick learners.'' The gender gap in ''quick learner'' attribution grows with increasing student proficiency; men's rate of increase is over twice that of women's. Technical and nontechnical approaches for ameliorating the impact of gender bias on student recommendations are suggested.
Characteristic Evaluation on Cooling Performance of Thermoelectric Modules.
Seo, Sae Rom; Han, Seungwoo
2015-10-01
The aim of this work is to develop a performance evaluation system for thermoelectric cooling modules. We describe the design of such a system, composed of a vacuum chamber with a heat sink along with a metal block to measure the absorbed heat Qc. The system has a simpler structure than existing water-cooled or air-cooled systems. The temperature difference between the cold and hot sides of the thermoelectric module ΔT can be accurately measured without any effects due to convection, and the temperature equilibrium time is minimized compared to a water-cooled system. The evaluation system described here can be used to measure characteristic curves of Qc as a function of ΔT, as well as the current-voltage relations. High-performance thermoelectric systems can therefore be developed using optimal modules evaluated with this system.
Teodoro, George; Kurc, Tahsin; Andrade, Guilherme; Kong, Jun; Ferreira, Renato; Saltz, Joel
2015-01-01
We carry out a comparative performance study of multi-core CPUs, GPUs and Intel Xeon Phi (Many Integrated Core-MIC) with a microscopy image analysis application. We experimentally evaluate the performance of computing devices on core operations of the application. We correlate the observed performance with the characteristics of computing devices and data access patterns, computation complexities, and parallelization forms of the operations. The results show a significant variability in the performance of operations with respect to the device used. The performances of operations with regular data access are comparable or sometimes better on a MIC than that on a GPU. GPUs are more efficient than MICs for operations that access data irregularly, because of the lower bandwidth of the MIC for random data accesses. We propose new performance-aware scheduling strategies that consider variabilities in operation speedups. Our scheduling strategies significantly improve application performance compared to classic strategies in hybrid configurations. PMID:28239253
CARDIOVASCULAR SCREENING OF YOUNG ATHLETES: A REVIEW OF ECONOMIC EVALUATIONS.
Gerkens, Sophie; Van Brabandt, Hans; Desomer, Anja; Leonard, Christian; Neyt, Mattias
2017-01-01
Some experts have promoted preparticipative cardiovascular screening programs for young athletes and have claimed that such programs were cost-effective without performing a critical analysis of studies supporting this statement. In this systematic review, a critical assessment of economic evaluations on these programs is performed to determine if they really provide value for money. A systematic review of economic evaluations was performed on December 24, 2014. Web sites of health technology assessment agencies, the Cochrane database of systematic review, the National Health Service Economic Evaluation Database of the Cochrane Library, EMBASE, Medline, Psychinfo, and EconLit were searched to retrieve (reviews of) economic evaluations. No language or time restrictions were imposed and predefined selection criteria were used. Selected studies were critically assessed applying a structured data extraction sheet. Five relevant economic evaluations were critically assessed. Results of these studies were mixed. However, those in favor of screening made (methodological) incorrect choices, of which the most important one was not taking into account a no-screening alternative as comparator. Compared with no screening, other strategies (history and physical examination or history and physical examination plus electrocardiogram) were not considered cost-effective. Results of primary economic evaluations should not be blindly copied without critical assessment. Economic evaluations in this field lack the support of robust evidence. Negative consequences of screening (false positive findings, overtreatment) should also be taken into account and may cause more harm than good. A mass screening of young athletes for cardiovascular diseases does not provide value for money and should be discouraged.
Feng, Chuan; Rozenblit, Jerzy W; Hamilton, Allan J
2010-11-01
Surgeons performing laparoscopic surgery have strong biases regarding the quality and nature of the laparoscopic video monitor display. In a comparative study, we used a unique computerized sensing and analysis system to evaluate the various types of monitors employed in laparoscopic surgery. We compared the impact of different types of monitor displays on an individual's performance of a laparoscopic training task which required the subject to move the instrument to a set of targets. Participants (varying from no laparoscopic experience to board-certified surgeons) were asked to perform the assigned task while using all three display systems, which were randomly assigned: a conventional laparoscopic monitor system (2D), a high-definition monitor system (HD), and a stereoscopic display (3D). The effects of monitor system on various performance parameters (total time consumed to finish the task, average speed, and movement economy) were analyzed by computer. Each of the subjects filled out a subjective questionnaire at the end of their training session. A total of 27 participants completed our study. Performance with the HD monitor was significantly slower than with either the 3D or 2D monitor (p < 0.0001). Movement economy with the HD monitor was significantly reduced compared with the 3D (p < 0.0004) or 2D (p < 0.0001) monitor. In terms of average time required to complete the task, performance with the 3D monitor was significantly faster than with the HD (p < 0.0001) or 2D (p < 0.0086) monitor. However, the HD system was the overwhelming favorite according to subjective evaluation. Computerized sensing and analysis is capable of quantitatively assessing the seemingly minor effect of monitor display on surgical training performance. The study demonstrates that, while users expressed a decided preference for HD systems, actual quantitative analysis indicates that HD monitors offer no statistically significant advantage and may even worsen performance compared with standard 2D or 3D laparoscopic monitors.
Maurer, Max; Lienert, Judit
2017-01-01
We compare the use of multi-criteria decision analysis (MCDA)–or more precisely, models used in multi-attribute value theory (MAVT)–to integrated assessment (IA) models for supporting long-term water supply planning in a small town case study in Switzerland. They are used to evaluate thirteen system scale water supply alternatives in four future scenarios regarding forty-four objectives, covering technical, social, environmental, and economic aspects. The alternatives encompass both conventional and unconventional solutions and differ regarding technical, spatial and organizational characteristics. This paper focuses on the impact assessment and final evaluation step of the structured MCDA decision support process. We analyze the performance of the alternatives for ten stakeholders. We demonstrate the implications of model assumptions by comparing two IA and three MAVT evaluation model layouts of different complexity. For this comparison, we focus on the validity (ranking stability), desirability (value), and distinguishability (value range) of the alternatives given the five model layouts. These layouts exclude or include stakeholder preferences and uncertainties. Even though all five led us to identify the same best alternatives, they did not produce identical rankings. We found that the MAVT-type models provide higher distinguishability and a more robust basis for discussion than the IA-type models. The needed complexity of the model, however, should be determined based on the intended use of the model within the decision support process. The best-performing alternatives had consistently strong performance for all stakeholders and future scenarios, whereas the current water supply system was outperformed in all evaluation layouts. The best-performing alternatives comprise proactive pipe rehabilitation, adapted firefighting provisions, and decentralized water storage and/or treatment. We present recommendations for possible ways of improving water supply planning in the case study and beyond. PMID:28481881
Improving Vector Evaluated Particle Swarm Optimisation by Incorporating Nondominated Solutions
Lim, Kian Sheng; Ibrahim, Zuwairie; Buyamin, Salinda; Ahmad, Anita; Naim, Faradila; Ghazali, Kamarul Hawari; Mokhtar, Norrima
2013-01-01
The Vector Evaluated Particle Swarm Optimisation algorithm is widely used to solve multiobjective optimisation problems. This algorithm optimises one objective using a swarm of particles where their movements are guided by the best solution found by another swarm. However, the best solution of a swarm is only updated when a newly generated solution has better fitness than the best solution at the objective function optimised by that swarm, yielding poor solutions for the multiobjective optimisation problems. Thus, an improved Vector Evaluated Particle Swarm Optimisation algorithm is introduced by incorporating the nondominated solutions as the guidance for a swarm rather than using the best solution from another swarm. In this paper, the performance of improved Vector Evaluated Particle Swarm Optimisation algorithm is investigated using performance measures such as the number of nondominated solutions found, the generational distance, the spread, and the hypervolume. The results suggest that the improved Vector Evaluated Particle Swarm Optimisation algorithm has impressive performance compared with the conventional Vector Evaluated Particle Swarm Optimisation algorithm. PMID:23737718
Improving Vector Evaluated Particle Swarm Optimisation by incorporating nondominated solutions.
Lim, Kian Sheng; Ibrahim, Zuwairie; Buyamin, Salinda; Ahmad, Anita; Naim, Faradila; Ghazali, Kamarul Hawari; Mokhtar, Norrima
2013-01-01
The Vector Evaluated Particle Swarm Optimisation algorithm is widely used to solve multiobjective optimisation problems. This algorithm optimises one objective using a swarm of particles where their movements are guided by the best solution found by another swarm. However, the best solution of a swarm is only updated when a newly generated solution has better fitness than the best solution at the objective function optimised by that swarm, yielding poor solutions for the multiobjective optimisation problems. Thus, an improved Vector Evaluated Particle Swarm Optimisation algorithm is introduced by incorporating the nondominated solutions as the guidance for a swarm rather than using the best solution from another swarm. In this paper, the performance of improved Vector Evaluated Particle Swarm Optimisation algorithm is investigated using performance measures such as the number of nondominated solutions found, the generational distance, the spread, and the hypervolume. The results suggest that the improved Vector Evaluated Particle Swarm Optimisation algorithm has impressive performance compared with the conventional Vector Evaluated Particle Swarm Optimisation algorithm.
Comparative performance evaluation of advanced AC and DC EV propulsion systems
NASA Astrophysics Data System (ADS)
MacDowall, R. D.; Crumley, R. L.
Idaho National Engineering Laboratory (INEL) evaluates EV propulsion systems and components for the U.S. Department of Energy (DOE) Electric and Hybrid Vehicle (EHV) Program. In this study, experimental data were used to evaluate the relative performances of the benchmark Chrysler/GE ETV-1 DC and the Ford/GE First Generation Single-Shaft AC (ETX-I) propulsion systems. Tests were conducted on the INEL's chassis dynamometer using identical aerodynamic and rolling resistance road-load coefficients and vehicle test weights. The results allowed a direct comparison of selected efficiency and performance characteristics for the two propulsion system technologies. The ETX-I AC system exhibited slightly lower system efficiency during constant speed testing than the ETV-1 DC propulsion system.
Smith, Christopher J; Morad, Abdulrahman; Balwanz, Christopher; Lyden, Elizabeth; Matthias, Tabatha
2018-04-24
Point-of-care (POCUS) education is rapidly expanding within medical schools and internal medicine residency programs, but lack of trained faculty is a major barrier. While POCUS training can improve short-term outcomes, knowledge and skills rapidly decay without deliberate practice and feedback. The purpose of this study was to evaluate the performance of focused cardiac ultrasound (FCU) by volunteer general internal medicine (GIM) faculty participating in a longitudinal POCUS curriculum. Participants: Nine GIM clinician-educators participated in a 6-month POCUS curriculum. Faculty performance was compared to three cardiology fellows. Three diagnostic cardiac sonographers (DCS) were also evaluated and served as the gold standard. the primary outcome was a FCU efficiency score, calculated by dividing image quality score by exam duration. FCU exams were conducted on three standardized patients after completion of an introductory workshop, at 3 months, and at 6 months. Two blinded cardiologists scored the exams. mean efficiency scores were compared using a linear mixed effects model, followed by pairwise comparisons using Tukey's test. GIM faculty's FCU efficiency scores were maintained over the 6-month period (2.2, SE 1.0 vs. 3.8, SE 1.0, p = 0.076). Their scores at each session were similar to cardiology fellows (p > 0.69), but inferior to DCSs (p < 0.0001). GIM faculty participating in a POCUS curriculum maintained their FCU performance over 6 months with efficiency scores comparable to experienced cardiology fellows.
Performance study of SKIROC2/A ASIC for ILD Si-W ECAL
NASA Astrophysics Data System (ADS)
Suehara, T.; Sekiya, I.; Callier, S.; Balagura, V.; Boudry, V.; Brient, J.-C.; de la Taille, C.; Kawagoe, K.; Irles, A.; Magniette, F.; Nanni, J.; Pöschl, R.; Yoshioka, T.
2018-03-01
The ILD Si-W ECAL is a sampling calorimeter with tungsten absorber and highly segmented silicon layers for the International Large Detector (ILD), one of the two detector concepts for the International Linear Collider. SKIROC2 is an ASIC for the ILD Si-W ECAL. To investigate the issues found in prototype detectors, we prepared dedicated ASIC evaluation boards with either BGA sockets or directly soldered SKIROC2. We report a performance study with the evaluation boards, including signal-to-noise ratio and TDC performance with comparing SKIROC2 and an updated version, SKIROC2A.
Kim, Youngseop; Choi, Eun Seo; Kwak, Wooseop; Shin, Yongjin; Jung, Woonggyu; Ahn, Yeh-Chan; Chen, Zhongping
2008-06-01
We demonstrate the use of optical coherence tomography (OCT) as a non-destructive diagnostic tool for evaluating laser-processing performance by imaging the features of a pit and a rim. A pit formed on a material at different laser-processing conditions is imaged using both a conventional scanning electron microscope (SEM) and OCT. Then using corresponding images, the geometrical characteristics of the pit are analyzed and compared. From the results, we could verify the feasibility and the potential of the application of OCT to the monitoring of the laser-processing performance.
Performance Evaluation of Supercomputers using HPCC and IMB Benchmarks
NASA Technical Reports Server (NTRS)
Saini, Subhash; Ciotti, Robert; Gunney, Brian T. N.; Spelce, Thomas E.; Koniges, Alice; Dossa, Don; Adamidis, Panagiotis; Rabenseifner, Rolf; Tiyyagura, Sunil R.; Mueller, Matthias;
2006-01-01
The HPC Challenge (HPCC) benchmark suite and the Intel MPI Benchmark (IMB) are used to compare and evaluate the combined performance of processor, memory subsystem and interconnect fabric of five leading supercomputers - SGI Altix BX2, Cray XI, Cray Opteron Cluster, Dell Xeon cluster, and NEC SX-8. These five systems use five different networks (SGI NUMALINK4, Cray network, Myrinet, InfiniBand, and NEC IXS). The complete set of HPCC benchmarks are run on each of these systems. Additionally, we present Intel MPI Benchmarks (IMB) results to study the performance of 11 MPI communication functions on these systems.
Kim, Youngseop; Choi, Eun Seo; Kwak, Wooseop; Shin, Yongjin; Jung, Woonggyu; Ahn, Yeh-Chan; Chen, Zhongping
2014-01-01
We demonstrate the use of optical coherence tomography (OCT) as a non-destructive diagnostic tool for evaluating laser-processing performance by imaging the features of a pit and a rim. A pit formed on a material at different laser-processing conditions is imaged using both a conventional scanning electron microscope (SEM) and OCT. Then using corresponding images, the geometrical characteristics of the pit are analyzed and compared. From the results, we could verify the feasibility and the potential of the application of OCT to the monitoring of the laser-processing performance. PMID:24932051
Comparing Methods for UAV-Based Autonomous Surveillance
NASA Technical Reports Server (NTRS)
Freed, Michael; Harris, Robert; Shafto, Michael
2004-01-01
We describe an approach to evaluating algorithmic and human performance in directing UAV-based surveillance. Its key elements are a decision-theoretic framework for measuring the utility of a surveillance schedule and an evaluation testbed consisting of 243 scenarios covering a well-defined space of possible missions. We apply this approach to two example UAV-based surveillance methods, a TSP-based algorithm and a human-directed approach, then compare them to identify general strengths, and weaknesses of each method.
Evaluation of a Mobile Phone Image-Based Dietary Assessment Method in Adults with Type 2 Diabetes.
Rollo, Megan E; Ash, Susan; Lyons-Wall, Philippa; Russell, Anthony W
2015-06-17
Image-based dietary records have limited evidence evaluating their performance and use among adults with a chronic disease. This study evaluated the performance of a 3-day mobile phone image-based dietary record, the Nutricam Dietary Assessment Method (NuDAM), in adults with type 2 diabetes mellitus (T2DM). Criterion validity was determined by comparing energy intake (EI) with total energy expenditure (TEE) measured by the doubly-labelled water technique. Relative validity was established by comparison to a weighed food record (WFR). Inter-rater reliability was assessed by comparing estimates of intake from three dietitians. Ten adults (6 males, age: 61.2 ± 6.9 years old, BMI: 31.0 ± 4.5 kg/m(2)) participated. Compared to TEE, mean EI (MJ/day) was significantly under-reported using both methods, with a mean ratio of EI:TEE 0.76 ± 0.20 for the NuDAM and 0.76 ± 0.17 for the WFR. Correlations between the NuDAM and WFR were mostly moderate for energy (r = 0.57), carbohydrate (g/day) (r = 0.63, p < 0.05), protein (g/day) (r = 0.78, p < 0.01) and alcohol (g/day) (rs = 0.85, p < 0.01), with a weaker relationship for fat (g/day) (r = 0.24). Agreement between dietitians for nutrient intake for the 3-day NuDAM (Intra-class Correlation Coefficient (ICC) = 0.77-0.99) was lower when compared with the 3-day WFR (ICC = 0.82-0.99). These findings demonstrate the performance and feasibility of the NuDAM to assess energy and macronutrient intake in a small sample. Some modifications to the NuDAM could improve efficiency and an evaluation in a larger group of adults with T2DM is required.
An evaluation of the use of oral contrast media in abdominopelvic CT.
Buttigieg, Erica Lauren; Grima, Karen Borg; Cortis, Kelvin; Soler, Sandro Galea; Zarb, Francis
2014-11-01
To evaluate the diagnostic efficacy of different oral contrast media (OCM) for abdominopelvic CT examinations performed for follow-up general oncological indications. The objectives were to establish anatomical image quality criteria for abdominopelvic CT; use these criteria to evaluate and compare image quality using positive OCM, neutral OCM and no OCM; and evaluate possible benefits for the medical imaging department. Forty-six adult patients attending a follow-up abdominopelvic CT for general oncological indications and who had a previous abdominopelvic CT with positive OCM (n = 46) were recruited and prospectively placed into either the water (n = 25) or no OCM (n = 21) group. Three radiologists performed absolute visual grading analysis (VGA) to assess image quality by grading the fulfilment of 24 anatomical image quality criteria. Visual grading characteristics (VGC) analysis of the data showed comparable image quality with regards to reproduction of abdominal structures, bowel discrimination, presence of artefacts, and visualization of the amount of intra-abdominal fat for the three OCM protocols. All three OCM protocols provided similar image quality for follow-up abdominopelvic CT for general oncological indications. • Positive oral contrast media are routinely used for abdominopelvic multidetector computed tomography • Experimental study comparing image quality using three different oral contrast materials • Three different oral contrast materials result in comparable CT image quality • Benefits for patients and medical imaging department.
The Case for a Joint Evaluation
2017-01-01
intelligence, and engineering. Finally, the comparative time ex - pended by the combatant commanders (CCDRs) on fulfilling four different evaluation...template for the joint-centric construct would align with the four de facto sections noted earlier: an identifica- tion section, a performance metric...intangible or have not been properly researched. For example, under one evaluation system, a Servicemember’s separation or retire- ment into a post
Evaluating Algorithm Performance Metrics Tailored for Prognostics
NASA Technical Reports Server (NTRS)
Saxena, Abhinav; Celaya, Jose; Saha, Bhaskar; Saha, Sankalita; Goebel, Kai
2009-01-01
Prognostics has taken a center stage in Condition Based Maintenance (CBM) where it is desired to estimate Remaining Useful Life (RUL) of the system so that remedial measures may be taken in advance to avoid catastrophic events or unwanted downtimes. Validation of such predictions is an important but difficult proposition and a lack of appropriate evaluation methods renders prognostics meaningless. Evaluation methods currently used in the research community are not standardized and in many cases do not sufficiently assess key performance aspects expected out of a prognostics algorithm. In this paper we introduce several new evaluation metrics tailored for prognostics and show that they can effectively evaluate various algorithms as compared to other conventional metrics. Specifically four algorithms namely; Relevance Vector Machine (RVM), Gaussian Process Regression (GPR), Artificial Neural Network (ANN), and Polynomial Regression (PR) are compared. These algorithms vary in complexity and their ability to manage uncertainty around predicted estimates. Results show that the new metrics rank these algorithms in different manner and depending on the requirements and constraints suitable metrics may be chosen. Beyond these results, these metrics offer ideas about how metrics suitable to prognostics may be designed so that the evaluation procedure can be standardized. 1
Comparing the performance of biomedical clustering methods.
Wiwie, Christian; Baumbach, Jan; Röttger, Richard
2015-11-01
Identifying groups of similar objects is a popular first step in biomedical data analysis, but it is error-prone and impossible to perform manually. Many computational methods have been developed to tackle this problem. Here we assessed 13 well-known methods using 24 data sets ranging from gene expression to protein domains. Performance was judged on the basis of 13 common cluster validity indices. We developed a clustering analysis platform, ClustEval (http://clusteval.mpi-inf.mpg.de), to promote streamlined evaluation, comparison and reproducibility of clustering results in the future. This allowed us to objectively evaluate the performance of all tools on all data sets with up to 1,000 different parameter sets each, resulting in a total of more than 4 million calculated cluster validity indices. We observed that there was no universal best performer, but on the basis of this wide-ranging comparison we were able to develop a short guideline for biomedical clustering tasks. ClustEval allows biomedical researchers to pick the appropriate tool for their data type and allows method developers to compare their tool to the state of the art.
WE-E-217A-02: Methodologies for Evaluation of Standalone CAD System Performance.
Sahiner, B
2012-06-01
Standalone performance evaluation of a CAD system provides information about the abnormality detection or classification performance of the computerized system alone. Although the performance of the reader with CAD is the final step in CAD system assessment, standalone performance evaluation is an important component for several reasons: First, standalone evaluation informs the reader about the performance level of the CAD system and may have an impact on how the reader uses the system. Second, it provides essential information to the system designer for algorithm optimization during system development. Third, standalone evaluation can provide a detailed description of algorithm performance (e.g., on subgroups of the population) because a larger data set with more samples from different subgroups can be included in standalone studies compared to reader studies. Proper standalone evaluation of a CAD system involves a number of key components, some of which are shared with the assessment of reader performance with CAD. These include (1) selection of a test data set that allows performance assessment with little or no bias and acceptable uncertainty; (2) a reference standard that indicates disease status as well as the location and extent of disease; (3) a clearly defined method for labeling each CAD mark as a true-positive or false-positive; and (4) a properly selected set of metrics to summarize the accuracy of the computer marks and their corresponding scores. In this lecture, we will discuss various approaches for the key components of standalone CAD performance evaluation listed above, and present some of the recommendations and opinions from the AAPM CAD subcommittee on these issues. Learning Objectives 1. Identify basic components and metrics in the assessment of standalone CAD systems 2. Understand how each component may affect the assessed performance 3. Learn about AAPM CAD subcommittee's opinions and recommendations on factors and metrics related to the evaluation of standalone CAD system performance. © 2012 American Association of Physicists in Medicine.
Hoekman, Berend; Breitling, Rainer; Suits, Frank; Bischoff, Rainer; Horvatovich, Peter
2012-01-01
Data processing forms an integral part of biomarker discovery and contributes significantly to the ultimate result. To compare and evaluate various publicly available open source label-free data processing workflows, we developed msCompare, a modular framework that allows the arbitrary combination of different feature detection/quantification and alignment/matching algorithms in conjunction with a novel scoring method to evaluate their overall performance. We used msCompare to assess the performance of workflows built from modules of publicly available data processing packages such as SuperHirn, OpenMS, and MZmine and our in-house developed modules on peptide-spiked urine and trypsin-digested cerebrospinal fluid (CSF) samples. We found that the quality of results varied greatly among workflows, and interestingly, heterogeneous combinations of algorithms often performed better than the homogenous workflows. Our scoring method showed that the union of feature matrices of different workflows outperformed the original homogenous workflows in some cases. msCompare is open source software (https://trac.nbic.nl/mscompare), and we provide a web-based data processing service for our framework by integration into the Galaxy server of the Netherlands Bioinformatics Center (http://galaxy.nbic.nl/galaxy) to allow scientists to determine which combination of modules provides the most accurate processing for their particular LC-MS data sets. PMID:22318370
IBM PC/IX operating system evaluation plan
NASA Technical Reports Server (NTRS)
Dominick, Wayne D. (Editor); Granier, Martin; Hall, Philip P.; Triantafyllopoulos, Spiros
1984-01-01
An evaluation plan for the IBM PC/IX Operating System designed for IBM PC/XT computers is discussed. The evaluation plan covers the areas of performance measurement and evaluation, software facilities available, man-machine interface considerations, networking, and the suitability of PC/IX as a development environment within the University of Southwestern Louisiana NASA PC Research and Development project. In order to compare and evaluate the PC/IX system, comparisons with other available UNIX-based systems are also included.
Novel index for micromixing characterization and comparative analysis
Jain, Mranal; Nandakumar, K.
2010-01-01
The most basic micromixer is a T- or Y-mixer, where two confluent streams mix due to transverse diffusion. To enhance micromixing, various modifications of T-mixers are reported such as heterogeneously charged walls, grooves on the channel base, geometric variations by introducing physical constrictions, etc. The performance of these reported designs is evaluated against the T-mixer in terms of the deviation from perfectly mixed state and mixing length (device length required to achieve perfect mixing). Although many studies have noticed the reduced flow rates for improved mixer designs, the residence time is not taken into consideration for micromixing performance evaluation. In this work, we propose a novel index, based on residence time, for micromixing characterization and comparative analysis. For any given mixer, the proposed index identifies the nondiffusive mixing enhancement with respect to the T-mixer. Various micromixers are evaluated using the proposed index to demonstrate the usefulness of the index. It is also shown that physical constriction mixer types are equivalent to T-mixers. The proposed index is found to be insightful and could be used as a benchmark for comparing different mixing strategies. PMID:20689773
Gowd, Snigdha; Shankar, T; Dash, Samarendra; Sahoo, Nivedita; Chatterjee, Suravi; Mohanty, Pritam
2017-01-01
Aims and Objective: The aim of the study was to evaluate the reliability of cone beam computed tomography (CBCT) obtained image over plaster model for the assessment of mixed dentition analysis. Materials and Methods: Thirty CBCT-derived images and thirty plaster models were derived from the dental archives, and Moyer's and Tanaka-Johnston analyses were performed. The data obtained were interpreted and analyzed statistically using SPSS 10.0/PC (SPSS Inc., Chicago, IL, USA). Descriptive and analytical analysis along with Student's t-test was performed to qualitatively evaluate the data and P < 0.05 was considered statistically significant. Results: Statistically, significant results were obtained on data comparison between CBCT-derived images and plaster model; the mean for Moyer's analysis in the left and right lower arch for CBCT and plaster model was 21.2 mm, 21.1 mm and 22.5 mm, 22.5 mm, respectively. Conclusion: CBCT-derived images were less reliable as compared to data obtained directly from plaster model for mixed dentition analysis. PMID:28852639
Quantitative evaluation of performance of three-dimensional printed lenses
NASA Astrophysics Data System (ADS)
Gawedzinski, John; Pawlowski, Michal E.; Tkaczyk, Tomasz S.
2017-08-01
We present an analysis of the shape, surface quality, and imaging capabilities of custom three-dimensional (3-D) printed lenses. 3-D printing technology enables lens prototypes to be fabricated without restrictions on surface geometry. Thus, spherical, aspherical, and rotationally nonsymmetric lenses can be manufactured in an integrated production process. This technique serves as a noteworthy alternative to multistage, labor-intensive, abrasive processes, such as grinding, polishing, and diamond turning. Here, we evaluate the quality of lenses fabricated by Luxexcel using patented Printoptical©; technology that is based on an inkjet printing technique by comparing them to lenses made with traditional glass processing technologies (grinding, polishing, etc.). The surface geometry and roughness of the lenses were evaluated using white-light and Fizeau interferometers. We have compared peak-to-valley wavefront deviation, root mean square (RMS) wavefront error, radii of curvature, and the arithmetic roughness average (Ra) profile of plastic and glass lenses. In addition, the imaging performance of selected pairs of lenses was tested using 1951 USAF resolution target. The results indicate performance of 3-D printed optics that could be manufactured with surface roughness comparable to that of injection molded lenses (Ra<20 nm). The RMS wavefront error of 3-D printed prototypes was at a minimum 18.8 times larger than equivalent glass prototypes for a lens with a 12.7 mm clear aperture, but, when measured within 63% of its clear aperture, the 3-D printed components' RMS wavefront error was comparable to glass lenses.
Quantitative evaluation of performance of 3D printed lenses
Gawedzinski, John; Pawlowski, Michal E.; Tkaczyk, Tomasz S.
2017-01-01
We present an analysis of the shape, surface quality, and imaging capabilities of custom 3D printed lenses. 3D printing technology enables lens prototypes to be fabricated without restrictions on surface geometry. Thus, spherical, aspherical and rotationally non-symmetric lenses can be manufactured in an integrated production process. This technique serves as a noteworthy alternative to multistage, labor-intensive, abrasive processes such as grinding, polishing and diamond turning. Here, we evaluate the quality of lenses fabricated by Luxexcel using patented Printoptical© technology that is based on an inkjet printing technique by comparing them to lenses made with traditional glass processing technologies (grinding, polishing etc.). The surface geometry and roughness of the lenses were evaluated using white-light and Fizeau interferometers. We have compared peak-to-valley wavefront deviation, root-mean-squared wavefront error, radii of curvature and the arithmetic average of the roughness profile (Ra) of plastic and glass lenses. Additionally, the imaging performance of selected pairs of lenses was tested using 1951 USAF resolution target. The results indicate performance of 3D printed optics that could be manufactured with surface roughness comparable to that of injection molded lenses (Ra < 20 nm). The RMS wavefront error of 3D printed prototypes was at a minimum 18.8 times larger than equivalent glass prototypes for a lens with a 12.7 mm clear aperture, but when measured within 63% of its clear aperture, 3D printed components’ RMS wavefront error was comparable to glass lenses. PMID:29238114
Review article: scoring systems for assessing prognosis in critically ill adult cirrhotics.
Cholongitas, E; Senzolo, M; Patch, D; Shaw, S; Hui, C; Burroughs, A K
2006-08-01
Cirrhotic patients admitted to intensive care units (ICU) still have poor outcomes. Some current ICU prognostic models [Acute Physiology and Chronic Health Evaluation (APACHE), Organ System Failure (OSF) and Sequential Organ Failure Assessment (SOFA)] were used to stratify cirrhotics into risk categories, but few cirrhotics were included in the original model development. Liver-specific scores [Child-Turcotte-Pugh (CTP) and model for end-stage liver disease (MELD)] could be useful in this setting. To evaluate whether ICU prognostic models perform better compared with liver-disease specific ones in cirrhotics admitted to ICU. We performed a structured literature review identifying clinical studies focusing on prognosis and risk factors for mortality in adult cirrhotics admitted to ICU. We found 21 studies (five solely dealing with gastrointestinal bleeding) published during the last 20 years (54-420 patients in each). APACHE II and III, SOFA and OSF had better discrimination for correctly predicting death compared with the CTP score. The MELD score was evaluated only in one study and had good predictive accuracy [receiver operator characteristic (ROC) curve: 0.81). Organ dysfunction models (OSF, SOFA) were superior compared with APACHE II and III (ROC curve: range 0.83-0.94 vs. 0.66-0.88 respectively). Cardiovascular, liver and renal system dysfunction were more frequently independently associated with mortality. General-ICU models had better performance in cirrhotic populations compared with CTP score; OSF and SOFA had the best predictive ability. Further prospective and validation studies are needed.
Comparative evaluation of ultrasound scanner accuracy in distance measurement
NASA Astrophysics Data System (ADS)
Branca, F. P.; Sciuto, S. A.; Scorza, A.
2012-10-01
The aim of the present study is to develop and compare two different automatic methods for accuracy evaluation in ultrasound phantom measurements on B-mode images: both of them give as a result the relative error e between measured distances, performed by 14 brand new ultrasound medical scanners, and nominal distances, among nylon wires embedded in a reference test object. The first method is based on a least squares estimation, while the second one applies the mean value of the same distance evaluated at different locations in ultrasound image (same distance method). Results for both of them are proposed and explained.
Evaluation of the KLA-Tencor 2138 for line monitoring applications
NASA Astrophysics Data System (ADS)
Metteer, Brian; Garvin, James F., Jr.; Cataldi, Frank; Ng, Albert; Button, Jon; Newell, Robyn; Rodriguez, Mike D.; Miller, Arlisa
1998-06-01
This report summarizes the results of an evaluation of the KLA-Tencor (KT) 2138 Ultra-Broadband (UBB) optical inspection system performed in the DP1 development facility at Texas Instruments from July 1997 to November 1997. The purpose of this project was to evaluate the effectiveness of the new 2138 UBB system compared to a KT AIT, non-SAT tests on a KT 2135, and SAT recipes on the KT 2132. The 2138 system was designed to provide improved sensitivity and defect detection over the 2135 and other tools. In particular, the UBB illumination source utilized by the 2138 system was expected to provide a significant sensitivity improvement over the 2135 on wafers with color variation as a source of noise. The speeds of the individual pixel tests on the 2138 are the same as those on the 2135. However, it was found that the 2138 0.62 micrometer pixel tests actually found more defects than did the 0.39 micrometer pixel tests on the 2132 on the process levels where this comparison was studied. This type of comparison was not performed between the 2138 and the 2135 since SAT capability was not available on the DP1 2135 during the evaluation. Initially, the primary objective of this project was to measure the UBB system's performance as compared to the 2135 on two Memory levels and three Logic levels. However, since the DP1 2135 system did not possess segmented autothreshold (SAT) capability during this evaluation and the DP1 2132 system did possess SAT capability, the DP1 2132 was added to the evaluation for a 2138 versus 213X SAT direct comparison. Also, the AIT was added to the evaluation plan for a brightfield versus darkfield technology comparison. Finally, three additional Logic levels were added to the evaluation plan, including one Post-CMP level. During this evaluation, the 2138 was proven to be significantly more sensitive than was the 2135, 2132, and the AIT on all process levels compared. Also, very few hardware or software problems were noted during the evaluation.
Analysis of 238Pu and 56Fe Evaluated Data for Use in MYRRHA
NASA Astrophysics Data System (ADS)
Díez, C. J.; Cabellos, O.; Martínez, J. S.; Stankovskiy, A.; Van den Eynde, G.; Schillebeeckx, P.; Heyse, J.
2014-04-01
A sensitivity analysis on the multiplication factor, keff, to the cross section data has been carried out for the MYRRHA critical configuration in order to show the most relevant reactions. With these results, a further analysis on the 238Pu and 56Fe cross sections has been performed, comparing the evaluations provided in the JEFF-3.1.2 and ENDF/B-VII.1 libraries for these nuclides. Then, the effect in MYRRHA of the differences between evaluations are analysed, presenting the source of the differences. With these results, recommendations for the 56Fe and 238Pu evaluations are suggested. These calculations have been performed with SCALE6.1 and MCNPX-2.7e.
Evaluation of a neuropsychological screen in an incarcerated population.
Ball, Tabitha D; Pastore, Richard E; Sollman, Miriam J; Burright, Richard G; Donovick, Peter J
2009-08-01
The Brief Neuropsychological Cognitive Examination (BNCE) is a screening device designed to rapidly assess neuropsychological functioning. The availability of an effective and efficient screen for neuropsychological and/or cognitive disorders is an important concern within various settings such as correctional facilities, where there are likely to be large numbers of individuals in need of evaluation. In the current study the utility of the BNCE in detecting cognitive impairments among a clinical sample of incarcerated individuals was evaluated by comparing performance on this instrument to performance on measures of general cognitive functioning. Results indicate that the BNCE demonstrates some utility in its ability to determine those in need of further evaluation of cognitive functioning.
Evaluation of CNN as anthropomorphic model observer
NASA Astrophysics Data System (ADS)
Massanes, Francesc; Brankov, Jovan G.
2017-03-01
Model observers (MO) are widely used in medical imaging to act as surrogates of human observers in task-based image quality evaluation, frequently towards optimization of reconstruction algorithms. In this paper, we explore the use of convolutional neural networks (CNN) to be used as MO. We will compare CNN MO to alternative MO currently being proposed and used such as the relevance vector machine based MO and channelized Hotelling observer (CHO). As the success of the CNN, and other deep learning approaches, is rooted in large data sets availability, which is rarely the case in medical imaging systems task-performance evaluation, we will evaluate CNN performance on both large and small training data sets.
Assessment of the cPAS-based BGISEQ-500 platform for metagenomic sequencing.
Fang, Chao; Zhong, Huanzi; Lin, Yuxiang; Chen, Bing; Han, Mo; Ren, Huahui; Lu, Haorong; Luber, Jacob M; Xia, Min; Li, Wangsheng; Stein, Shayna; Xu, Xun; Zhang, Wenwei; Drmanac, Radoje; Wang, Jian; Yang, Huanming; Hammarström, Lennart; Kostic, Aleksandar D; Kristiansen, Karsten; Li, Junhua
2018-03-01
More extensive use of metagenomic shotgun sequencing in microbiome research relies on the development of high-throughput, cost-effective sequencing. Here we present a comprehensive evaluation of the performance of the new high-throughput sequencing platform BGISEQ-500 for metagenomic shotgun sequencing and compare its performance with that of 2 Illumina platforms. Using fecal samples from 20 healthy individuals, we evaluated the intra-platform reproducibility for metagenomic sequencing on the BGISEQ-500 platform in a setup comprising 8 library replicates and 8 sequencing replicates. Cross-platform consistency was evaluated by comparing 20 pairwise replicates on the BGISEQ-500 platform vs the Illumina HiSeq 2000 platform and the Illumina HiSeq 4000 platform. In addition, we compared the performance of the 2 Illumina platforms against each other. By a newly developed overall accuracy quality control method, an average of 82.45 million high-quality reads (96.06% of raw reads) per sample, with 90.56% of bases scoring Q30 and above, was obtained using the BGISEQ-500 platform. Quantitative analyses revealed extremely high reproducibility between BGISEQ-500 intra-platform replicates. Cross-platform replicates differed slightly more than intra-platform replicates, yet a high consistency was observed. Only a low percentage (2.02%-3.25%) of genes exhibited significant differences in relative abundance comparing the BGISEQ-500 and HiSeq platforms, with a bias toward genes with higher GC content being enriched on the HiSeq platforms. Our study provides the first set of performance metrics for human gut metagenomic sequencing data using BGISEQ-500. The high accuracy and technical reproducibility confirm the applicability of the new platform for metagenomic studies, though caution is still warranted when combining metagenomic data from different platforms.
Haji Ali Afzali, Hossein; Gray, Jodi; Karnon, Jonathan
2013-04-01
Decision analytic models play an increasingly important role in the economic evaluation of health technologies. Given uncertainties around the assumptions used to develop such models, several guidelines have been published to identify and assess 'best practice' in the model development process, including general modelling approach (e.g., time horizon), model structure, input data and model performance evaluation. This paper focuses on model performance evaluation. In the absence of a sufficient level of detail around model performance evaluation, concerns regarding the accuracy of model outputs, and hence the credibility of such models, are frequently raised. Following presentation of its components, a review of the application and reporting of model performance evaluation is presented. Taking cardiovascular disease as an illustrative example, the review investigates the use of face validity, internal validity, external validity, and cross model validity. As a part of the performance evaluation process, model calibration is also discussed and its use in applied studies investigated. The review found that the application and reporting of model performance evaluation across 81 studies of treatment for cardiovascular disease was variable. Cross-model validation was reported in 55 % of the reviewed studies, though the level of detail provided varied considerably. We found that very few studies documented other types of validity, and only 6 % of the reviewed articles reported a calibration process. Considering the above findings, we propose a comprehensive model performance evaluation framework (checklist), informed by a review of best-practice guidelines. This framework provides a basis for more accurate and consistent documentation of model performance evaluation. This will improve the peer review process and the comparability of modelling studies. Recognising the fundamental role of decision analytic models in informing public funding decisions, the proposed framework should usefully inform guidelines for preparing submissions to reimbursement bodies.
Consonni, Stefano; Viganò, Federico
2012-04-01
A number of waste gasification technologies are currently proposed as an alternative to conventional Waste-to-Energy (WtE) plants. Assessing their potential is made difficult by the scarce operating experience and the fragmentary data available. After defining a conceptual framework to classify and assess waste gasification technologies, this paper compares two of the proposed technologies with conventional WtE plants. Performances are evaluated by proprietary software developed at Politecnico di Milano and compared on the basis of a coherent set of assumptions. Since the two gasification technologies are configured as "two-step oxidation" processes, their energy performances are very similar to those of conventional plants. The potential benefits that may justify their adoption relate to material recovery and operation/emission control: recovery of metals in non-oxidized form; collection of ashes in inert, vitrified form; combustion control; lower generation of some pollutants. Copyright © 2012 Elsevier Ltd. All rights reserved.
Cognitive-evaluative features of childhood social anxiety in a performance task.
Tuschen-Caffier, Brunna; Kühl, Sigrid; Bender, Caroline
2011-06-01
Using an experimental design, we analysed differences in the occurrence of cognitive-evaluative distortions and performance deficits across children with social anxiety disorder, with subclinical anxiety and without any anxiety symptoms. Twenty-one children with full syndrome social phobia, 18 children with partial syndrome social phobia and 20 children without any symptoms of social phobia were compared with respect to their degree of anxiety, negative thinking and task performance during two social-evaluative tasks. In addition, self-ratings of task performance, performance estimations for other children and objective behavioural ratings by two independent observers were obtained. Children with social anxiety disorder and subclinical social anxiety showed higher degrees of experienced anxiety and negative thinking than healthy control children. There was no group difference in respect to actual task performance. Findings are discussed with regard to the continuum assumption of childhood social anxiety disorder and the need of well-adapted early interventions. Copyright © 2010. Published by Elsevier Ltd.
[Teaching performance assessment in Public Health employing three different strategies].
Martínez-González, Adrián; Moreno-Altamirano, Laura; Ponce-Rosas, Efrén Raúl; Martínez-Franco, Adrián Israel; Urrutia-Aguilar, María Esther
2011-01-01
The educational system depends upon the quality and performance of their faculty and should therefore be process of continuous improvement. To assess the teaching performance of the Public Health professors, at the Faculty of Medicine, UNAM through three strategies. Justification study. The evaluation was conducted under a mediational model through three strategies: students' opinion assessment, self-assessment and students' academic achievement. We applied descriptive statistics, Student t test, ANOVA and Pearson correlation. Twenty professors were evaluated from the Public Health department, representing 57% of all them who teach the subject. The professor's performance was highly valued self-assessment compared with assessment of student opinion, was confirmed by statistical analysis the difference was significant. The difference amongst the three evaluation strategies became more evident between self-assessment and the scores obtained by students in their academic achievement. The integration of these three strategies offers a more complete view of the teacher's performance quality. Academic achievement appears to be a more objective strategy for teaching performance assessment than students' opinion and self-assessment.
Reyes, Jeanette M; Xu, Yadong; Vizuete, William; Serre, Marc L
2017-01-01
The regulatory Community Multiscale Air Quality (CMAQ) model is a means to understanding the sources, concentrations and regulatory attainment of air pollutants within a model's domain. Substantial resources are allocated to the evaluation of model performance. The Regionalized Air quality Model Performance (RAMP) method introduced here explores novel ways of visualizing and evaluating CMAQ model performance and errors for daily Particulate Matter ≤ 2.5 micrometers (PM2.5) concentrations across the continental United States. The RAMP method performs a non-homogenous, non-linear, non-homoscedastic model performance evaluation at each CMAQ grid. This work demonstrates that CMAQ model performance, for a well-documented 2001 regulatory episode, is non-homogeneous across space/time. The RAMP correction of systematic errors outperforms other model evaluation methods as demonstrated by a 22.1% reduction in Mean Square Error compared to a constant domain wide correction. The RAMP method is able to accurately reproduce simulated performance with a correlation of r = 76.1%. Most of the error coming from CMAQ is random error with only a minority of error being systematic. Areas of high systematic error are collocated with areas of high random error, implying both error types originate from similar sources. Therefore, addressing underlying causes of systematic error will have the added benefit of also addressing underlying causes of random error.
DOT National Transportation Integrated Search
2008-01-01
The Midwest States Accelerated Pavement Testing Pooled Fund Program, financed by the highway departments : of Missouri, Iowa, Kansas and Nebraska, has supported an accelerated pavement testing (APT) project to compare : the performance of stabilized ...
40 CFR 59.208 - Charcoal lighter material testing protocol.
Code of Federal Regulations, 2012 CFR
2012-07-01
... the performance, design, and operation specifications of the prescribed equipment. A demonstration... Administrator prior to compliance testing, based on an evaluation of comparative performance specifications and... stack barbecue charcoal that is designed to be lit without the packaging, the same as in paragraph (h)(1...
40 CFR 59.208 - Charcoal lighter material testing protocol.
Code of Federal Regulations, 2013 CFR
2013-07-01
... the performance, design, and operation specifications of the prescribed equipment. A demonstration... Administrator prior to compliance testing, based on an evaluation of comparative performance specifications and... stack barbecue charcoal that is designed to be lit without the packaging, the same as in paragraph (h)(1...
40 CFR 59.208 - Charcoal lighter material testing protocol.
Code of Federal Regulations, 2014 CFR
2014-07-01
... the performance, design, and operation specifications of the prescribed equipment. A demonstration... Administrator prior to compliance testing, based on an evaluation of comparative performance specifications and... stack barbecue charcoal that is designed to be lit without the packaging, the same as in paragraph (h)(1...
Vehicle test report: Electric Vehicle Associates electric conversion of an AMC Pacer
NASA Technical Reports Server (NTRS)
Price, T. W.; Wirth, V. A., Jr.; Pampa, M. F.
1981-01-01
The change of pace, an electric vehicle was tested. These tests were performed to characterize certain parameters of the electric vehicle pacer and to provide baseline data that can be used for the comparison of improved batteries that may be incorporated into the vehicle at a later time. The vehicle tests were concentrated on the electrical drive subsystem, the batteries, controller and motor. Coastdowns to characterize the road load, and range evaluations for both cyclic and constant speed conditions were performed. The vehicle's performance was evaluated by comparing its constant speed range performance with described vehicles. It is found that the pacer performance is approximately equal to the majority of the vehicles tested in the 1977 assessment.
48 CFR 15.305 - Proposal evaluation.
Code of Federal Regulations, 2011 CFR
2011-10-01
... comparative assessment of past performance information is separate from the responsibility determination... requirements; and (ii) A summary, matrix, or quantitative ranking, along with appropriate supporting narrative...
48 CFR 15.305 - Proposal evaluation.
Code of Federal Regulations, 2010 CFR
2010-10-01
... comparative assessment of past performance information is separate from the responsibility determination... requirements; and (ii) A summary, matrix, or quantitative ranking, along with appropriate supporting narrative...
48 CFR 15.305 - Proposal evaluation.
Code of Federal Regulations, 2014 CFR
2014-10-01
... comparative assessment of past performance information is separate from the responsibility determination... requirements; and (ii) A summary, matrix, or quantitative ranking, along with appropriate supporting narrative...
48 CFR 15.305 - Proposal evaluation.
Code of Federal Regulations, 2012 CFR
2012-10-01
... comparative assessment of past performance information is separate from the responsibility determination... requirements; and (ii) A summary, matrix, or quantitative ranking, along with appropriate supporting narrative...
48 CFR 15.305 - Proposal evaluation.
Code of Federal Regulations, 2013 CFR
2013-10-01
... comparative assessment of past performance information is separate from the responsibility determination... requirements; and (ii) A summary, matrix, or quantitative ranking, along with appropriate supporting narrative...
Evaluation of a Low-Noise Formate Spiral-Bevel Gear Set
NASA Technical Reports Server (NTRS)
Lewicki, David g.; Woods, Ron L.; Litvin, Faydor L.; Fuentes, Alfonso
2007-01-01
Studies to evaluate low-noise Formate spiral-bevel gears were performed. Experimental tests were performed on the OH-58D helicopter main-rotor transmission in the NASA Glenn 500-hp Helicopter Transmission Test Stand. Low-noise Formate spiral-bevel gears were compared to the baseline OH-58D spiral-bevel gear design, a high-strength design, and previously tested low-noise designs (including an original low-noise design and an improved-bearing-contact low-noise design). Noise, vibration, and tooth strain tests were performed. The Formate design showed a decrease in noise and vibration compared to the baseline OH-58D design, and was similar to that of the previously tested improved-bearing contact low-noise design. The pinion tooth stresses for the Formate design significantly decreased in comparison to the baseline OH-58D design. Also similar to that of the improved bearing-contact low-noise design, the maximum stresses of the Formate design shifted toward the heel, compared to the center of the face width for the baseline, high-strength, and previously tested low-noise designs.
Iwashita, Hayato; Tsukiyama, Yoshihiro; Kori, Hidehiro; Kuwatsuru, Rika; Yamasaki, Yo; Koyano, Kiyoshi
2014-10-01
Missing posterior teeth can decrease masticatory function and cause horizontal mastication deviation, i.e., mastication predominance. Mastication predominance may lead to abnormal tooth attrition and temporomandibular disorders. This study evaluated masticatory performance and mastication predominance in patients with missing posterior teeth to investigate effects of missing posterior teeth on masticatory performance and mastication predominance. Thirty volunteers with normal dentition (control group), 30 patients with unilateral missing posterior teeth (unilateral group), and 23 patients with bilateral missing posterior teeth (bilateral group) participated. Gummy jellies were used to evaluate participants' masticatory performance, and electromyography was used to assess the degree of mastication predominance. Chewing gums, gummy jellies, and peanuts were used as foods of various hardnesses for evaluating mastication predominance. Compared with the control group, masticatory performance did not differ in the unilateral group but was significantly decreased in the bilateral group. With chewing gum and gummy jellies, the degree of mastication predominance was significantly increased in both unilateral and bilateral groups than the control group. With peanuts, the degree of mastication predominance was significantly increased in the unilateral group than the control group. Although masticatory performance was not decreased in the unilateral group, the degree of mastication predominance was increased. Decreased masticatory performance was observed in the bilateral group, and for foods with normal hardness and soft foods, the degree of mastication predominance was increased. These results suggested that mastication predominance should be considered in the recovery of masticatory performance in patients with missing posterior teeth. Copyright © 2014 Japan Prosthodontic Society. Published by Elsevier Ltd. All rights reserved.
Onate, James A; Starkel, Cambrie; Clifton, Daniel R; Best, Thomas M; Borchers, James; Chaudhari, Ajit; Comstock, R Dawn; Cortes, Nelson; Grooms, Dustin R; Hertel, Jay; Hewett, Timothy E; Miller, Meghan Maume; Pan, Xueliang; Schussler, Eric; Van Lunen, Bonnie L
2018-01-01
The fourth edition of the Preparticipation Physical Evaluation recommends functional testing for the musculoskeletal portion of the examination; however, normative data across sex and grade level are limited. Establishing normative data can provide clinicians reference points with which to compare their patients, potentially aiding in the development of future injury-risk assessments and injury-mitigation programs. To establish normative functional performance and limb-symmetry data for high school-aged male and female athletes in the United States. Cross-sectional study. Athletic training facilities and gymnasiums across the United States. A total of 3951 male and female athletes who participated on high school-sponsored basketball, football, lacrosse, or soccer teams enrolled in this nationwide study. Functional performance testing consisted of 3 evaluations. Ankle-joint range of motion, balance, and lower extremity muscular power and landing control were assessed via the weight-bearing ankle-dorsiflexion-lunge, single-legged anterior-reach, and anterior single-legged hop-for-distance (SLHOP) tests, respectively. We used 2-way analyses of variance and χ 2 analyses to examine the effects of sex and grade level on ankle-dorsiflexion-lunge, single-legged anterior-reach, and SLHOP test performance and symmetry. The SLHOP performance differed between sexes (males = 187.8% ± 33.1% of limb length, females = 157.5% ± 27.8% of limb length; t = 30.3, P < .001). A Cohen d value of 0.97 indicated a large effect of sex on SLHOP performance. We observed differences for SLHOP and ankle-dorsiflexion-lunge performance among grade levels, but these differences were not clinically meaningful. We demonstrated differences in normative data for lower extremity functional performance during preparticipation physical evaluations across sex and grade levels. The results of this study will allow clinicians to compare sex- and grade-specific functional performances and implement approaches for preventing musculoskeletal injuries in high school-aged athletes.
Thermal Performance Testing of EMU and CSAFE Liquid Cooling Gannents
NASA Technical Reports Server (NTRS)
Rhodes, Richard; Bue, Grant; Meginnis, Ian; Hakam, Mary; Radford, Tamara
2013-01-01
Future exploration missions require the development of a new liquid cooling garment (LCG) to support the next generation extravehicular activity (EVA) suit system. The new LCG must offer greater system reliability, optimal thermal performance as required by mission directive, and meet other design requirements including improved tactile comfort. To advance the development of a future LCG, a thermal performance test was conducted to evaluate: (1) the comparable thermal performance of the EMU LCG and the CSAFE developed engineering evaluation unit (EEU) LCG, (2) the effect of the thermal comfort undergarment (TCU) on the EMU LCG tactile and thermal comfort, and (3) the performance of a torso or upper body only LCG shirt to evaluate a proposed auxiliary loop. To evaluate the thermal performance of each configuration, a metabolic test was conducted using the Demonstrator Spacesuit to create a relevant test environment. Three (3) male test subjects of similar height and weight walked on a treadmill at various speeds to produce three different metabolic loads - resting (300-600 BTU/hr), walking at a slow pace (1200 BTU/hr), and walking at a brisk pace (2200 BTU/hr). Each subject participated in five tests - two wearing the CSAFE full LCG, one wearing the EMU LCG without TCUs, one wearing the EMU LCG with TCUs, and one with the CSAFE shirt-only. During the test, performance data for the breathing air and cooling water systems and subject specific data was collected to define the thermal performance of the configurations. The test results show that the CSAFE EEU LCG and EMU LCG with TCU had comparable performance. The testing also showed that an auxiliary loop LCG, sized similarly to the shirt-only configuration, should provide adequate cooling for contingency scenarios. Finally, the testing showed that the TCU did not significantly hinder LCG heat transfer, and may prove to be acceptable for future suit use with additional analysis and testing.
Comparative evaluation of distributed-collector solar thermal electric power plants
NASA Technical Reports Server (NTRS)
Fujita, T.; El Gabalawi, N.; Herrera, G. G.; Caputo, R. S.
1978-01-01
Distributed-collector solar thermal-electric power plants are compared by projecting power plant economics of selected systems to the 1990-2000 timeframe. The approach taken is to evaluate the performance of the selected systems under the same weather conditions. Capital and operational costs are estimated for each system. Energy costs are calculated for different plant sizes based on the plant performance and the corresponding capital and maintenance costs. Optimum systems are then determined as the systems with the minimum energy costs for a given load factor. The optimum system is comprised of the best combination of subsystems which give the minimum energy cost for every plant size. Sensitivity analysis is done around the optimum point for various plant parameters.
DOT National Transportation Integrated Search
2012-03-31
This report evaluates the performance of Continuous Risk Profile (CRP) compared with the : Sliding Window Method (SWM) and Peak Searching (PS) methods. These three network : screening methods all require the same inputs: traffic collision data and Sa...
DOT National Transportation Integrated Search
2012-03-01
This report evaluates the performance of Continuous Risk Profile (CRP) compared with the : Sliding Window Method (SWM) and Peak Searching (PS) methods. These three network : screening methods all require the same inputs: traffic collision data and Sa...
Decision support systems for ecosystem management: An evaluation of existing systems
H. Todd Mowrer; Klaus Barber; Joe Campbell; Nick Crookston; Cathy Dahms; John Day; Jim Laacke; Jim Merzenich; Steve Mighton; Mike Rauscher; Rick Sojda; Joyce Thompson; Peter Trenchi; Mark Twery
1997-01-01
This report evaluated 24 computer-aided decision support systems (DSS) that can support management decision-making in forest ecosystems. It compares the scope of each system, spatial capabilities, computational methods, development status, input and output requirements, user support availability, and system performance. Questionnaire responses from the DSS developers (...
The report gives results of field tests conducted to determine the emission characteristics of a Babcock and Wilcox Circular burner and Dual Register burner (DRB). The field tests were performed at two utility boilers, generally comparable in design and size except for the burner...
Evaluation of the Teaching Performance of University Lecturers: Comparison between Mexico and Spain
ERIC Educational Resources Information Center
Diaz, Matilde; Borges, Africa; Valadez, Dolores; Zambrano, Rogelio
2015-01-01
Comparative educational studies allow the study of the differences and similarities between different educational systems. This research, which consists on an educational evaluation, has studied the teaching behavior of ten university lecturers from a Spanish university--the University of La Laguna--, and seven from a Mexican…
EVALUATION OF THE AGDISP AERIAL SPRAY ALGORITHMS IN THE AGDRIFT MODEL
A systematic evaluation of the AgDISP algorithms, which simulate off-site drift and deposition of aerially applied pesticides, contained in the AgDRIFT model was performed by comparing model simulations to field-trial data collected by the Spray Drift Task Force. Field-trial data...
COST EVALUATION OF AUTOMATED AND MANUAL POST- CONSUMER PLASTIC BOTTLE SORTING SYSTEMS
This project evaluates, on the basis of performance and cost, two Automated BottleSort® sorting systems for post-consumer commingled plastic containers developed by Magnetic Separation Systems. This study compares the costs to sort mixed bales of post-consumer plastic at these t...
Summary of ORSphere Critical and Reactor Physics Measurements
DOE Office of Scientific and Technical Information (OSTI.GOV)
Marshall, Margaret A.; Bess, John D.
In the early 1970s Dr. John T. Mihalczo (team leader), J. J. Lynn, and J. R. Taylor performed experiments at the Oak Ridge Critical Experiments Facility (ORCEF) with highly enriched uranium (HEU) metal (called Oak Ridge Alloy or ORALLOY) to recreate GODIVA I results with greater accuracy than those performed at Los Alamos National Laboratory in the 1950s. The purpose of the Oak Ridge ORALLOY Sphere (ORSphere) experiments was to estimate the unreflected and unmoderated critical mass of an idealized sphere of uranium metal corrected to a density, purity, and enrichment such that it could be compared with the GODIVAmore » I experiments. This critical configuration has been evaluated. Preliminary results were presented at ND2013. Since then, the evaluation was finalized and judged to be an acceptable benchmark experiment for the International Criticality Safety Benchmark Experiment Project (ICSBEP). Additionally, reactor physics measurements were performed to determine surface button worths, central void worth, delayed neutron fraction, prompt neutron decay constant, fission density and neutron importance. These measurements have been evaluated and found to be acceptable experiments and are discussed in full detail in the International Handbook of Evaluated Reactor Physics Benchmark Experiments. The purpose of this paper is summary summarize all the critical and reactor physics measurements evaluations and, when possible, to compare them to GODIVA experiment results.« less
Screening programme to select a resin for Gravity Probe-B composites
NASA Technical Reports Server (NTRS)
Will, E. T.
1992-01-01
The Gravity Probe-B (GP-B) program undertook a screening program to select a possible replacement resin for the E-787 resin currently used in composite neck tubes and support struts. The goal was to find a resin with good cryogenic and structural properties, low-helium permeation and an easily repeatable fabrication process. Cycom 92, SCI REZ 081 and RS-3 were selected for comparison with E-787. Identical composite tubes made from each resin and gamma-alumina fiber (85 percent Al2O3, 15 percent SiO2) were evaluated for cryogenic and structural performance and for processability. Cryogenic performance was evaluated by measuring low-temperature permeation and leaks to determine cryogenic strain behavior. Structural performance was evaluated by comparing the resin-dominated shear strength of the composites. Processability was evaluated from fabrication comments and GP-B's own experience. SCI REZ 081 was selected as the best overall resin with superior strength and cryogenic performance and consistent processability.
Bron, Esther E; Smits, Marion; van der Flier, Wiesje M; Vrenken, Hugo; Barkhof, Frederik; Scheltens, Philip; Papma, Janne M; Steketee, Rebecca M E; Méndez Orellana, Carolina; Meijboom, Rozanna; Pinto, Madalena; Meireles, Joana R; Garrett, Carolina; Bastos-Leite, António J; Abdulkadir, Ahmed; Ronneberger, Olaf; Amoroso, Nicola; Bellotti, Roberto; Cárdenas-Peña, David; Álvarez-Meza, Andrés M; Dolph, Chester V; Iftekharuddin, Khan M; Eskildsen, Simon F; Coupé, Pierrick; Fonov, Vladimir S; Franke, Katja; Gaser, Christian; Ledig, Christian; Guerrero, Ricardo; Tong, Tong; Gray, Katherine R; Moradi, Elaheh; Tohka, Jussi; Routier, Alexandre; Durrleman, Stanley; Sarica, Alessia; Di Fatta, Giuseppe; Sensi, Francesco; Chincarini, Andrea; Smith, Garry M; Stoyanov, Zhivko V; Sørensen, Lauge; Nielsen, Mads; Tangaro, Sabina; Inglese, Paolo; Wachinger, Christian; Reuter, Martin; van Swieten, John C; Niessen, Wiro J; Klein, Stefan
2015-05-01
Algorithms for computer-aided diagnosis of dementia based on structural MRI have demonstrated high performance in the literature, but are difficult to compare as different data sets and methodology were used for evaluation. In addition, it is unclear how the algorithms would perform on previously unseen data, and thus, how they would perform in clinical practice when there is no real opportunity to adapt the algorithm to the data at hand. To address these comparability, generalizability and clinical applicability issues, we organized a grand challenge that aimed to objectively compare algorithms based on a clinically representative multi-center data set. Using clinical practice as the starting point, the goal was to reproduce the clinical diagnosis. Therefore, we evaluated algorithms for multi-class classification of three diagnostic groups: patients with probable Alzheimer's disease, patients with mild cognitive impairment and healthy controls. The diagnosis based on clinical criteria was used as reference standard, as it was the best available reference despite its known limitations. For evaluation, a previously unseen test set was used consisting of 354 T1-weighted MRI scans with the diagnoses blinded. Fifteen research teams participated with a total of 29 algorithms. The algorithms were trained on a small training set (n=30) and optionally on data from other sources (e.g., the Alzheimer's Disease Neuroimaging Initiative, the Australian Imaging Biomarkers and Lifestyle flagship study of aging). The best performing algorithm yielded an accuracy of 63.0% and an area under the receiver-operating-characteristic curve (AUC) of 78.8%. In general, the best performances were achieved using feature extraction based on voxel-based morphometry or a combination of features that included volume, cortical thickness, shape and intensity. The challenge is open for new submissions via the web-based framework: http://caddementia.grand-challenge.org. Copyright © 2015 Elsevier Inc. All rights reserved.
The Effects of Target Orientation on the Dynamic Contrast Sensitivity Function
1994-01-01
tests: A comparative evaluation. Journal of Amlied ychology, 50, 460-466. Burg, A. (1971). Vision and driving: A report on research. Human Factors...evaluation. Journal of Applied Ps ogy, 50, 460-466. Burg, A. (1971). Vision and driving: A report on research. Human Factors, 13, 79-87. Campbell, F. W...FUNDING NUMBERS 6. AUTHOR(S)G~ioD A- 0ecoKT, ) 7. PERFORMING ORGANIZATION NAME(S) AND ADDRESS(ES) 8. PERFORMING ORGANIZATION REPORT NUMBER AFIT Student
Design and performance evaluation of a cryogenic condenser for an in-pile experiment
NASA Technical Reports Server (NTRS)
Graham, R. W.; Crum, R. J.; Hsu, Y.
1972-01-01
An apparatus was designed to enable in-pile irradiation of materials in liquid hydrogen at cryogenic temperatures. One of the principal components of this apparatus was a horizontal tube condenser. The performance of the condenser was evaluated by running a liquid-nitrogen prototype of the apparatus at heat loads comparable to or greater than those expected during the irradiation. The test showed that the condenser was capable of handling the design heat load and that the design procedure was sound.
NASA Astrophysics Data System (ADS)
Strader, Anne; Schorlemmer, Danijel; Beutin, Thomas
2017-04-01
The Global Earthquake Activity Rate Model (GEAR1) is a hybrid seismicity model, constructed from a loglinear combination of smoothed seismicity from the Global Centroid Moment Tensor (CMT) earthquake catalog and geodetic strain rates (Global Strain Rate Map, version 2.1). For the 2005-2012 retrospective evaluation period, GEAR1 outperformed both parent strain rate and smoothed seismicity forecasts. Since 1. October 2015, GEAR1 has been prospectively evaluated by the Collaboratory for the Study of Earthquake Predictability (CSEP) testing center. Here, we present initial one-year test results of the GEAR1, GSRM and GSRM2.1, as well as localized evaluation of GEAR1 performance. The models were evaluated on the consistency in number (N-test), spatial (S-test) and magnitude (M-test) distribution of forecasted and observed earthquakes, as well as overall data consistency (CL-, L-tests). Performance at target earthquake locations was compared between models using the classical paired T-test and its non-parametric equivalent, the W-test, to determine if one model could be rejected in favor of another at the 0.05 significance level. For the evaluation period from 1. October 2015 to 1. October 2016, the GEAR1, GSRM and GSRM2.1 forecasts pass all CSEP likelihood tests. Comparative test results show statistically significant improvement of GEAR1 performance over both strain rate-based forecasts, both of which can be rejected in favor of GEAR1. Using point process residual analysis, we investigate the spatial distribution of differences in GEAR1, GSRM and GSRM2 model performance, to identify regions where the GEAR1 model should be adjusted, that could not be inferred from CSEP test results. Furthermore, we investigate whether the optimal combination of smoothed seismicity and strain rates remains stable over space and time.
NASA Technical Reports Server (NTRS)
Tirres, Lizet
1991-01-01
An evaluation of the aerodynamic performance of the solid version of an Allison-designed cooled radial turbine was conducted at NASA Lewis' Warm Turbine Test Facility. The resulting pressure and temperature measurements are used to calculate vane, rotor, and overall stage performance. These performance results are then compared to the analytical results obtained by using NASA's MTSB (MERIDL-TSONIC-BLAYER) code.
Implementation and performance evaluation of acoustic denoising algorithms for UAV
NASA Astrophysics Data System (ADS)
Chowdhury, Ahmed Sony Kamal
Unmanned Aerial Vehicles (UAVs) have become popular alternative for wildlife monitoring and border surveillance applications. Elimination of the UAV's background noise and classifying the target audio signal effectively are still a major challenge. The main goal of this thesis is to remove UAV's background noise by means of acoustic denoising techniques. Existing denoising algorithms, such as Adaptive Least Mean Square (LMS), Wavelet Denoising, Time-Frequency Block Thresholding, and Wiener Filter, were implemented and their performance evaluated. The denoising algorithms were evaluated for average Signal to Noise Ratio (SNR), Segmental SNR (SSNR), Log Likelihood Ratio (LLR), and Log Spectral Distance (LSD) metrics. To evaluate the effectiveness of the denoising algorithms on classification of target audio, we implemented Support Vector Machine (SVM) and Naive Bayes classification algorithms. Simulation results demonstrate that LMS and Discrete Wavelet Transform (DWT) denoising algorithm offered superior performance than other algorithms. Finally, we implemented the LMS and DWT algorithms on a DSP board for hardware evaluation. Experimental results showed that LMS algorithm's performance is robust compared to DWT for various noise types to classify target audio signals.
Thrust stand evaluation of engine performance improvement algorithms in an F-15 airplane
NASA Technical Reports Server (NTRS)
Conners, Timothy R.
1992-01-01
Results are presented from the evaluation of the performance seeking control (PSC) optimization algorithm developed by Smith et al. (1990) for F-15 aircraft, which optimizes the quasi-steady-state performance of an F100 derivative turbofan engine for several modes of operation. The PSC algorithm uses onboard software engine model that calculates thrust, stall margin, and other unmeasured variables for use in the optimization. Comparisons are presented between the load cell measurements, PSC onboard model thrust calculations, and posttest state variable model computations. Actual performance improvements using the PSC algorithm are presented for its various modes. The results of using PSC algorithm are compared with similar test case results using the HIDEC algorithm.
Vehicle test report: Electric Vehicle Associates electric conversion of an AMC Pacer
NASA Technical Reports Server (NTRS)
Price, T. W.; Wirth, V. A., Jr.; Pompa, M. F.
1981-01-01
Tests were performed to characterize certain parameters of the EVA Pacer and to provide baseline data that can be used for the comparison of improved batteries that may be incorporated into the vehicle at a later time. The vehicle tests were concentrated on the electrical drive subsystem; i.e., the batteries, controller and motor. The tests included coastdowns to characterize the road load, and range evaluations for both cyclic and constant speed conditions. A qualitative evaluation of the vehicle's performance was made by comparing its constant speed range performance with other electric and hybrid vehicles. The Pacer performance was approximately equal to the majority of those vehicles assessed in 1977.
A Comparison of Four Simulation and Instructional Methods for Endodontic Review.
ERIC Educational Resources Information Center
Sandoval, Victor A.; And Others
1987-01-01
The effects of four different endodontic self-instructional review formats (slide-tape, latent-image simulation, computer text simulation, and computer-assisted video interactive simulation) on senior clinical endodontic performance are compared. Student evaluations, as well as comparative developmental expenditures, are discussed. (Author/MLW)
Sustainability performance evaluation: Literature review and future directions.
Büyüközkan, Gülçin; Karabulut, Yağmur
2018-07-01
Current global economic activities are increasingly being perceived as unsustainable. Despite the high number of publications, sustainability science remains highly dispersed over diverse approaches and topics. This article aims to provide a structured overview of sustainability performance evaluation related publications and to document the current state of literature, categorize publications, analyze and link trends, as well as highlight gaps and provide research recommendations. 128 articles between 2007 and 2018 are identified. The results suggest that sustainability performance evaluation models shall be more balanced, suitable criteria and their interrelations shall be well defined and subjectivity of qualitative criteria inherent to sustainability indicators shall be considered. To address this subjectivity, group decision-making techniques and other analytical methods that can deal with uncertainty, conflicting indicators, and linguistic evaluations can be used in future works. By presenting research gaps, this review stimulates researchers to establish practically applicable sustainability performance evaluation frameworks to help assess and compare the degree of sustainability, leading to more sustainable business practices. The review is unique in defining corporate sustainability performance evaluation for the first time, exploring the gap between sustainability accounting and sustainability assessment, and coming up with a structured overview of innovative research recommendations about integrating analytical assessment methods into conceptual sustainability frameworks. Copyright © 2018 Elsevier Ltd. All rights reserved.
ERIC Educational Resources Information Center
Easter, John; And Others
Comprehensive Achievement Monitoring (CAM) is a system designed to provide a curriculum defined in terms of performance objectives, test items to measure student performance on each objective, a set of comparable test forms to evaluate performance, testing throughout the period of the course, computerized analysis and reporting of results after…
ERIC Educational Resources Information Center
Easter, John; And Others
Comprehensive Achievement Monitoring (CAM) is a system designed to provide a curriculum defined in terms of performance objectives, test items to measure student performance on each objective, a set of comparable test forms to evaluate performance, testing throughout the period of course, computerized analysis and reporting of results after test…
Samuel A. Cushman; Jesse S. Lewis; Erin L. Landguth
2014-01-01
There have been few assessments of the performance of alternative resistance surfaces, and little is known about how connectivity modeling approaches differ in their ability to predict organism movements. In this paper, we evaluate the performance of four connectivity modeling approaches applied to two resistance surfaces in predicting the locations of highway...
ERIC Educational Resources Information Center
Darabi, A. Aubteen
2005-01-01
This article reports a case study describing how the principles of a cognitive apprenticeship (CA) model developed by Collins, Brown, and Holum (1991) were applied to a graduate course on performance systems analysis (PSA), and the differences this application made in student performance and evaluation of the course compared to the previous…
ERIC Educational Resources Information Center
Goomas, David T.
2012-01-01
The effects of wireless ring scanners, which provided immediate auditory and visual feedback, were evaluated to increase the performance and accuracy of order selectors at a meat distribution center. The scanners not only increased performance and accuracy compared to paper pick sheets, but were also instrumental in immediate and accurate data…
A dual cone-beam CT system for image guided radiotherapy: initial performance characterization.
Li, Hao; Giles, William; Bowsher, James; Yin, Fang-Fang
2013-02-01
The purpose of this study is to evaluate the performance of a recently developed benchtop dual cone-beam computed tomography (CBCT) system with two orthogonally placed tube∕detector sets. The benchtop dual CBCT system consists of two orthogonally placed 40 × 30 cm flat-panel detectors and two conventional x-ray tubes with two individual high-voltage generators sharing the same rotational axis. The x-ray source to detector distance is 150 cm and x-ray source to rotational axis distance is 100 cm for both subsystems. The objects are scanned through 200° of rotation. The dual CBCT system utilized 110° of projection data from one detector and 90° from the other while the two individual single CBCTs utilized 200° data from each detector. The system performance was characterized in terms of uniformity, contrast, spatial resolution, noise power spectrum, and CT number linearity. The uniformities, within the axial slice and along the longitudinal direction, and noise power spectrum were assessed by scanning a water bucket; the contrast and CT number linearity were measured using the Catphan phantom; and the spatial resolution was evaluated using a tungsten wire phantom. A skull phantom and a ham were also scanned to provide qualitative evaluation of high- and low-contrast resolution. Each measurement was compared between dual and single CBCT systems. Compared to single CBCT, the dual CBCT presented: (1) a decrease in uniformity by 1.9% in axial view and 1.1% in the longitudinal view, as averaged for four energies (80, 100, 125, and 150 kVp); (2) comparable or slightly better contrast (0∼25 HU) for low-contrast objects and comparable contrast for high-contrast objects; (3) comparable spatial resolution; (4) comparable CT number linearity with R(2) ≥ 0.99 for all four tested energies; (5) lower noise power spectrum in magnitude. Dual CBCT images of the skull phantom and the ham demonstrated both high-contrast resolution and good soft-tissue contrast. The performance of a benchtop dual CBCT imaging system has been characterized and is comparable to that of a single CBCT.
Performance in physiology evaluation: possible improvement by active learning strategies.
Montrezor, Luís H
2016-12-01
The evaluation process is complex and extremely important in the teaching/learning process. Evaluations are constantly employed in the classroom to assist students in the learning process and to help teachers improve the teaching process. The use of active methodologies encourages students to participate in the learning process, encourages interaction with their peers, and stimulates thinking about physiological mechanisms. This study examined the performance of medical students on physiology over four semesters with and without active engagement methodologies. Four activities were used: a puzzle, a board game, a debate, and a video. The results show that engaging in activities with active methodologies before a physiology cognitive monitoring test significantly improved student performance compared with not performing the activities. We integrate the use of these methodologies with classic lectures, and this integration appears to improve the teaching/learning process in the discipline of physiology and improves the integration of physiology with cardiology and neurology. In addition, students enjoy the activities and perform better on their evaluations when they use them. Copyright © 2016 The American Physiological Society.
Performance evaluation of haptic hand-controllers in a robot-assisted surgical system.
Zareinia, Kourosh; Maddahi, Yaser; Ng, Canaan; Sepehri, Nariman; Sutherland, Garnette R
2015-12-01
This paper presents the experimental evaluation of three commercially available haptic hand-controllers to evaluate which was more suitable to the participants. Two surgeons and seven engineers performed two peg-in-hole tasks with different levels of difficulty. Each operator guided the end-effector of a Kuka manipulator that held surgical forceps and was equipped with a surgical microscope. Sigma 7, HD(2) and PHANToM Premium 3.0 hand-controllers were compared. Ten measures were adopted to evaluate operators' performances with respect to effort, speed and accuracy in completing a task, operator improvement during the tests, and the force applied by each haptic device. The best performance was observed with the Premium 3.0; the hand-piece was able to be held in a similar way to that used by surgeons to hold conventional tools. Hand-controllers with a linkage structure similar to the human upper extremity take advantage of the inherent human brain connectome, resulting in improved surgeon performance during robotic-assisted surgery. Copyright © 2015 John Wiley & Sons, Ltd.
External validation of Global Evaluative Assessment of Robotic Skills (GEARS).
Aghazadeh, Monty A; Jayaratna, Isuru S; Hung, Andrew J; Pan, Michael M; Desai, Mihir M; Gill, Inderbir S; Goh, Alvin C
2015-11-01
We demonstrate the construct validity, reliability, and utility of Global Evaluative Assessment of Robotic Skills (GEARS), a clinical assessment tool designed to measure robotic technical skills, in an independent cohort using an in vivo animal training model. Using a cross-sectional observational study design, 47 voluntary participants were categorized as experts (>30 robotic cases completed as primary surgeon) or trainees. The trainee group was further divided into intermediates (≥5 but ≤30 cases) or novices (<5 cases). All participants completed a standardized in vivo robotic task in a porcine model. Task performance was evaluated by two expert robotic surgeons and self-assessed by the participants using the GEARS assessment tool. Kruskal-Wallis test was used to compare the GEARS performance scores to determine construct validity; Spearman's rank correlation measured interobserver reliability; and Cronbach's alpha was used to assess internal consistency. Performance evaluations were completed on nine experts and 38 trainees (14 intermediate, 24 novice). Experts demonstrated superior performance compared to intermediates and novices overall and in all individual domains (p < 0.0001). In comparing intermediates and novices, the overall performance difference trended toward significance (p = 0.0505), while the individual domains of efficiency and autonomy were significantly different between groups (p = 0.0280 and 0.0425, respectively). Interobserver reliability between expert ratings was confirmed with a strong correlation observed (r = 0.857, 95 % CI [0.691, 0.941]). Experts and participant scoring showed less agreement (r = 0.435, 95 % CI [0.121, 0.689] and r = 0.422, 95 % CI [0.081, 0.0672]). Internal consistency was excellent for experts and participants (α = 0.96, 0.98, 0.93). In an independent cohort, GEARS was able to differentiate between different robotic skill levels, demonstrating excellent construct validity. As a standardized assessment tool, GEARS maintained consistency and reliability for an in vivo robotic surgical task and may be applied for skills evaluation in a broad range of robotic procedures.
Kwon, S R; Meharry, M; Oyoyo, U; Li, Y
2015-01-01
To evaluate the efficacy of do-it-yourself (DIY) whitening as compared to conventional tooth whitening modalities using different shade assessment tools. Extracted human molars (120) were randomly distributed to six groups (n=20). Whitening was performed according to manufacturer's directions for over-the-counter, dentist-dispensed for home use, and in-office whitening. DIY whitening consisted of a strawberry and baking soda mix. Additionally, negative and positive controls were used. Two evaluators used the Vita Classical (VC) and Vita Bleachedguide 3D-Master with interpolated numbers (BGi) for visual assessment at baseline and one-week, one-month, and three-month postwhitening. Instrumental measurements were performed with a spectrophotometer. Kruskal-Wallis procedure was used to assess color changes among groups and intraclass correlation (ICC) to evaluate agreement between evaluators. DIY exhibited lower color change (ΔSGUVC, ΔSGUBGi, ΔE*, where SGU = shade guide unit and E = overall color change) compared to other whitening groups at all time points (p<0.05). ICC demonstrated very good agreement between evaluators with VC and BGi at each time point. Both shade guides were related with each other and strongly related to instrumental measurements (p<0.05). DIY whitening was the least effective whitening modality. Both VC and BGi are related with each other and have good correlation with instrumental measurements.
Rueda, Sylvia; Fathima, Sana; Knight, Caroline L; Yaqub, Mohammad; Papageorghiou, Aris T; Rahmatullah, Bahbibi; Foi, Alessandro; Maggioni, Matteo; Pepe, Antonietta; Tohka, Jussi; Stebbing, Richard V; McManigle, John E; Ciurte, Anca; Bresson, Xavier; Cuadra, Meritxell Bach; Sun, Changming; Ponomarev, Gennady V; Gelfand, Mikhail S; Kazanov, Marat D; Wang, Ching-Wei; Chen, Hsiang-Chou; Peng, Chun-Wei; Hung, Chu-Mei; Noble, J Alison
2014-04-01
This paper presents the evaluation results of the methods submitted to Challenge US: Biometric Measurements from Fetal Ultrasound Images, a segmentation challenge held at the IEEE International Symposium on Biomedical Imaging 2012. The challenge was set to compare and evaluate current fetal ultrasound image segmentation methods. It consisted of automatically segmenting fetal anatomical structures to measure standard obstetric biometric parameters, from 2D fetal ultrasound images taken on fetuses at different gestational ages (21 weeks, 28 weeks, and 33 weeks) and with varying image quality to reflect data encountered in real clinical environments. Four independent sub-challenges were proposed, according to the objects of interest measured in clinical practice: abdomen, head, femur, and whole fetus. Five teams participated in the head sub-challenge and two teams in the femur sub-challenge, including one team who tackled both. Nobody attempted the abdomen and whole fetus sub-challenges. The challenge goals were two-fold and the participants were asked to submit the segmentation results as well as the measurements derived from the segmented objects. Extensive quantitative (region-based, distance-based, and Bland-Altman measurements) and qualitative evaluation was performed to compare the results from a representative selection of current methods submitted to the challenge. Several experts (three for the head sub-challenge and two for the femur sub-challenge), with different degrees of expertise, manually delineated the objects of interest to define the ground truth used within the evaluation framework. For the head sub-challenge, several groups produced results that could be potentially used in clinical settings, with comparable performance to manual delineations. The femur sub-challenge had inferior performance to the head sub-challenge due to the fact that it is a harder segmentation problem and that the techniques presented relied more on the femur's appearance.
NASA Astrophysics Data System (ADS)
Christensen, Hannah; Moroz, Irene; Palmer, Tim
2015-04-01
Forecast verification is important across scientific disciplines as it provides a framework for evaluating the performance of a forecasting system. In the atmospheric sciences, probabilistic skill scores are often used for verification as they provide a way of unambiguously ranking the performance of different probabilistic forecasts. In order to be useful, a skill score must be proper -- it must encourage honesty in the forecaster, and reward forecasts which are reliable and which have good resolution. A new score, the Error-spread Score (ES), is proposed which is particularly suitable for evaluation of ensemble forecasts. It is formulated with respect to the moments of the forecast. The ES is confirmed to be a proper score, and is therefore sensitive to both resolution and reliability. The ES is tested on forecasts made using the Lorenz '96 system, and found to be useful for summarising the skill of the forecasts. The European Centre for Medium-Range Weather Forecasts (ECMWF) ensemble prediction system (EPS) is evaluated using the ES. Its performance is compared to a perfect statistical probabilistic forecast -- the ECMWF high resolution deterministic forecast dressed with the observed error distribution. This generates a forecast that is perfectly reliable if considered over all time, but which does not vary from day to day with the predictability of the atmospheric flow. The ES distinguishes between the dynamically reliable EPS forecasts and the statically reliable dressed deterministic forecasts. Other skill scores are tested and found to be comparatively insensitive to this desirable forecast quality. The ES is used to evaluate seasonal range ensemble forecasts made with the ECMWF System 4. The ensemble forecasts are found to be skilful when compared with climatological or persistence forecasts, though this skill is dependent on region and time of year.
Chung, Cheng-Shiu; Wang, Hongwu; Cooper, Rory A
2013-07-01
The user interface development of assistive robotic manipulators can be traced back to the 1960s. Studies include kinematic designs, cost-efficiency, user experience involvements, and performance evaluation. This paper is to review studies conducted with clinical trials using activities of daily living (ADLs) tasks to evaluate performance categorized using the International Classification of Functioning, Disability, and Health (ICF) frameworks, in order to give the scope of current research and provide suggestions for future studies. We conducted a literature search of assistive robotic manipulators from 1970 to 2012 in PubMed, Google Scholar, and University of Pittsburgh Library System - PITTCat. Twenty relevant studies were identified. Studies were separated into two broad categories: user task preferences and user-interface performance measurements of commercialized and developing assistive robotic manipulators. The outcome measures and ICF codes associated with the performance evaluations are reported. Suggestions for the future studies include (1) standardized ADL tasks for the quantitative and qualitative evaluation of task efficiency and performance to build comparable measures between research groups, (2) studies relevant to the tasks from user priority lists and ICF codes, and (3) appropriate clinical functional assessment tests with consideration of constraints in assistive robotic manipulator user interfaces. In addition, these outcome measures will help physicians and therapists build standardized tools while prescribing and assessing assistive robotic manipulators.
Leue, Anja; Cano Rodilla, Carmen; Beauducel, André
2015-01-01
Individuals typically evaluate whether their performance and the obtained feedback match. Previous research has shown that feedback negativity (FN) depends on outcome probability and feedback valence. It is, however, less clear to what extent previous effects of outcome probability on FN depend on self-evaluations of response correctness. Therefore, we investigated the effects of outcome probability on FN amplitude in a simple go/no-go task that allowed for the self-evaluation of response correctness. We also investigated effects of performance incompatibility and feedback valence. In a sample of N = 22 participants, outcome probability was manipulated by means of precues, feedback valence by means of monetary feedback, and performance incompatibility by means of feedback that induced a match versus mismatch with individuals' performance. We found that the 100% outcome probability condition induced a more negative FN following no-loss than the 50% outcome probability condition. The FN following loss was more negative in the 50% compared to the 100% outcome probability condition. Performance-incompatible loss resulted in a more negative FN than performance-compatible loss. Our results indicate that the self-evaluation of the correctness of responses should be taken into account when the effects of outcome probability and expectation mismatch on FN are investigated. PMID:26783525
Leue, Anja; Cano Rodilla, Carmen; Beauducel, André
2015-01-01
Individuals typically evaluate whether their performance and the obtained feedback match. Previous research has shown that feedback negativity (FN) depends on outcome probability and feedback valence. It is, however, less clear to what extent previous effects of outcome probability on FN depend on self-evaluations of response correctness. Therefore, we investigated the effects of outcome probability on FN amplitude in a simple go/no-go task that allowed for the self-evaluation of response correctness. We also investigated effects of performance incompatibility and feedback valence. In a sample of N = 22 participants, outcome probability was manipulated by means of precues, feedback valence by means of monetary feedback, and performance incompatibility by means of feedback that induced a match versus mismatch with individuals' performance. We found that the 100% outcome probability condition induced a more negative FN following no-loss than the 50% outcome probability condition. The FN following loss was more negative in the 50% compared to the 100% outcome probability condition. Performance-incompatible loss resulted in a more negative FN than performance-compatible loss. Our results indicate that the self-evaluation of the correctness of responses should be taken into account when the effects of outcome probability and expectation mismatch on FN are investigated.
Park, Jiyoon; Woo, Ok Hee; Shin, Hye Seon; Cho, Kyu Ran; Seo, Bo Kyoung; Kang, Eun Young
2015-10-01
The purpose of this study is to evaluate the diagnostic performance of SWE in palpable breast mass and to compare with color overlay pattern in SWE with conventional US and quantitative SWE for assessing palpable breast mass. SWE and conventional breast US were performed in 133 women with 156 palpable breast lesions (81 benign, 75 malignant) between August 2013 to June 2014. Either pathology or periodic imaging surveillance more than 2 years was a reference standard. Existence of previous image was blinded to performing radiologists. US BI-RADS final assessment, qualitative and quantitative SWE measurements were evaluated. Diagnostic performances of grayscale US, SWE and US combined to SWE were calculated and compared. Correlation between pattern classification and quantitative SWE was evaluated. Both color overlay pattern and quantitative SWE improved the specificity of conventional US, from 81.48% to 96.30% (p=0.0005), without improvement in sensitivity. Color overlay pattern was significantly related to all quantitative SWE parameters and malignancy rate (p<0.0001.). The optimal cutoff of color overlay pattern was between 2 and 3. Emax with optimal cutoff at 45.1 kPa showed the highest Az value, sensitivity, specificity and accuracy among other quantitative SWE parameters (p<0.0001). Echogenic halo on grayscale US showed significant correlation with color overlay pattern and pathology (p<0.0001). In evaluation of palpable breast mass, conventional US combine to SWE improves specificity and reduces the number of biopsies that ultimately yield a benign result. Color overlay pattern classification is more quick and easy and may represent quantitative SWE measurements with similar diagnostic performances. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
NASA Technical Reports Server (NTRS)
Shinoda, Patrick M.
1996-01-01
A full-scale helicopter rotor test was conducted in the NASA Ames 80- by 120-Foot Wind Tunnel with a four-bladed S-76 rotor system. Rotor performance and loads data were obtained over a wide range of rotor shaft angles-of-attack and thrust conditions at tunnel speeds ranging from 0 to 100 kt. The primary objectives of this test were (1) to acquire forward flight rotor performance and loads data for comparison with analytical results; (2) to acquire S-76 forward flight rotor performance data in the 80- by 120-Foot Wind Tunnel to compare with existing full-scale 40- by 80-Foot Wind Tunnel test data that were acquired in 1977; (3) to evaluate the acoustic capability of the 80- by 120- Foot Wind Tunnel for acquiring blade vortex interaction (BVI) noise in the low speed range and compare BVI noise with in-flight test data; and (4) to evaluate the capability of the 80- by 120-Foot Wind Tunnel test section as a hover facility. The secondary objectives were (1) to evaluate rotor inflow and wake effects (variations in tunnel speed, shaft angle, and thrust condition) on wind tunnel test section wall and floor pressures; (2) to establish the criteria for the definition of flow breakdown (condition where wall corrections are no longer valid) for this size rotor and wind tunnel cross-sectional area; and (3) to evaluate the wide-field shadowgraph technique for visualizing full-scale rotor wakes. This data base of rotor performance and loads can be used for analytical and experimental comparison studies for full-scale, four-bladed, fully articulated rotor systems. Rotor performance and structural loads data are presented in this report.
A simple video-based timing system for on-ice team testing in ice hockey: a technical report.
Larson, David P; Noonan, Benjamin C
2014-09-01
The purpose of this study was to describe and evaluate a newly developed on-ice timing system for team evaluation in the sport of ice hockey. We hypothesized that this new, simple, inexpensive, timing system would prove to be highly accurate and reliable. Six adult subjects (age 30.4 ± 6.2 years) performed on ice tests of acceleration and conditioning. The performance times of the subjects were recorded using a handheld stopwatch, photocell, and high-speed (240 frames per second) video. These results were then compared to allow for accuracy calculations of the stopwatch and video as compared with filtered photocell timing that was used as the "gold standard." Accuracy was evaluated using maximal differences, typical error/coefficient of variation (CV), and intraclass correlation coefficients (ICCs) between the timing methods. The reliability of the video method was evaluated using the same variables in a test-retest analysis both within and between evaluators. The video timing method proved to be both highly accurate (ICC: 0.96-0.99 and CV: 0.1-0.6% as compared with the photocell method) and reliable (ICC and CV within and between evaluators: 0.99 and 0.08%, respectively). This video-based timing method provides a very rapid means of collecting a high volume of very accurate and reliable on-ice measures of skating speed and conditioning, and can easily be adapted to other testing surfaces and parameters.
Performance Evaluation of New-Generation Pulse Oximeters in the NICU: Observational Study.
Nizami, Shermeen; Greenwood, Kim; Barrowman, Nick; Harrold, JoAnn
2015-09-01
This crossover observational study compares the data characteristics and performance of new-generation Nellcor OXIMAX and Masimo SET SmartPod pulse oximeter technologies. The study was conducted independent of either original equipment manufacturer (OEM) across eleven preterm infants in a Neonatal Intensive Care Unit (NICU). The SmartPods were integrated with Dräger Infinity Delta monitors. The Delta monitor measured the heart rate (HR) using an independent electrocardiogram sensor, and the two SmartPods collected arterial oxygen saturation (SpO2) and pulse rate (PR). All patient data were non-Gaussian. Nellcor PR showed a higher correlation with the HR as compared to Masimo PR. The statistically significant difference found in their median values (1% for SpO2, 1 bpm for PR) was deemed clinically insignificant. SpO2 alarms generated by both SmartPods were observed and categorized for performance evaluation. Results for sensitivity, positive predictive value, accuracy and false alarm rates were Nellcor (80.3, 50, 44.5, 50%) and Masimo (72.2, 48.2, 40.6, 51.8%) respectively. These metrics were not statistically significantly different between the two pulse oximeters. Despite claims by OEMs, both pulse oximeters exhibited high false alarm rates, with no statistically or clinically significant difference in performance. These findings have a direct impact on alarm fatigue in the NICU. Performance evaluation studies can also impact medical device purchase decisions made by hospital administrators.
Electric-hybrid-vehicle simulation
NASA Astrophysics Data System (ADS)
Pasma, D. C.
The simulation of electric hybrid vehicles is to be performed using experimental data to model propulsion system components. The performance of an existing ac propulsion system will be used as the baseline for comparative purposes. Hybrid components to be evaluated include electrically and mechanically driven flywheels, and an elastomeric regenerative braking system.
Abstract: Two physically based and deterministic models, CASC2-D and KINEROS are evaluated and compared for their performances on modeling sediment movement on a small agricultural watershed over several events. Each model has different conceptualization of a watershed. CASC...
Evaluation of Course-Specific Self-Efficacy Assessment Methods.
ERIC Educational Resources Information Center
Bong, Mimi
A study was conducted to compare three methods of assessing course-level self-efficacy beliefs within a multitrait multimethod (MTMM) framework. The methods involved: (1) successfully performing a number of domain-related tasks; (2) obtaining specific letter grades in the course; and (3) successfully performing generic academic tasks in the…
DOT National Transportation Integrated Search
2009-06-01
Seasonal variation of measured pavement responses with temperature and its relationship to pavement performance has not been : thoroughly evaluated for ALF Experiments II and III. Such information may be used to improve instrumentation strategies in ...
Two physically based watershed models, GSSHA and KINEROS-2 are evaluated and compared for their performances on modeling flow and sediment movement. Each model has a different watershed conceptualization. GSSHA divides the watershed into cells, and flow and sediments are routed t...
Kim, Se-Young; Kim, Kyoung Won; Choi, Sang Hyun; Kwon, Jae Hyun; Song, Gi-Won; Kwon, Heon-Ju; Yun, Young Ju; Lee, Jeongjin; Lee, Sung-Gyu
2017-11-01
To determine the feasibility of using UltraFast Doppler in post-operative evaluation of the hepatic artery (HA) after liver transplantation (LT), we evaluated 283 simultaneous conventional and UltraFast Doppler sessions in 126 recipients over a 2-mo period after LT, using an Aixplorer scanner The Doppler indexes of the HA (peak systolic velocity [PSV], end-diastolic velocity [EDV], resistive index [RI] and systolic acceleration time [SAT]) by retrospective analysis of retrieved waves from UltraFast Doppler clips were compared with those obtained by conventional spectral Doppler. Correlation, performance in diagnosing the pathologic wave, examination time and reproducibility were evaluated. The PSV, EDV, RI and SAT of spectral and UltraFast Doppler measurements exhibited excellent correlation with favorable diagnostic performance. During the bedside examination, the mean time spent for UltraFast clip storing was significantly shorter than that for conventional Doppler US measurements. Both conventional and UltraFast Doppler exhibited good to excellent inter-analysis consistency. In conclusion, compared with conventional spectral Doppler, UltraFast Doppler values correlated excellently and yielded acceptable pathologic wave diagnostic performance with reduced examination time at the bedside and excellent reproducibility. Copyright © 2017 World Federation for Ultrasound in Medicine & Biology. Published by Elsevier Inc. All rights reserved.
Comparison of methods for quantitative evaluation of endoscopic distortion
NASA Astrophysics Data System (ADS)
Wang, Quanzeng; Castro, Kurt; Desai, Viraj N.; Cheng, Wei-Chung; Pfefer, Joshua
2015-03-01
Endoscopy is a well-established paradigm in medical imaging, and emerging endoscopic technologies such as high resolution, capsule and disposable endoscopes promise significant improvements in effectiveness, as well as patient safety and acceptance of endoscopy. However, the field lacks practical standardized test methods to evaluate key optical performance characteristics (OPCs), in particular the geometric distortion caused by fisheye lens effects in clinical endoscopic systems. As a result, it has been difficult to evaluate an endoscope's image quality or assess its changes over time. The goal of this work was to identify optimal techniques for objective, quantitative characterization of distortion that are effective and not burdensome. Specifically, distortion measurements from a commercially available distortion evaluation/correction software package were compared with a custom algorithm based on a local magnification (ML) approach. Measurements were performed using a clinical gastroscope to image square grid targets. Recorded images were analyzed with the ML approach and the commercial software where the results were used to obtain corrected images. Corrected images based on the ML approach and the software were compared. The study showed that the ML method could assess distortion patterns more accurately than the commercial software. Overall, the development of standardized test methods for characterizing distortion and other OPCs will facilitate development, clinical translation, manufacturing quality and assurance of performance during clinical use of endoscopic technologies.
de Waaij, Dewi J; Ouburg, Sander; Dubbink, Jan Henk; Peters, Remco P H; Morré, Servaas A
2016-08-01
This is an evaluation study of the Presto(plus) Assay for T. vaginalis by comparing to the TIB MOLBIOL LightMix Kit Trichomonas vaginalis Assay using 615 dry collected vaginal and rectal swabs. Discordant samples were analyzed by the Qiagen® Microbial DNA qPCR for TV Assay. Both assays showed comparable performances (McNemar p>0.05). Copyright © 2016 Elsevier B.V. All rights reserved.
Fuermaier, Anselm B M; Tucha, Oliver; Koerts, Janneke; Lange, Klaus W; Weisbrod, Matthias; Aschenbrenner, Steffen; Tucha, Lara
2017-12-01
The assessment of performance validity is an essential part of the neuropsychological evaluation of adults with attention-deficit/hyperactivity disorder (ADHD). Most available tools, however, are inaccurate regarding the identification of noncredible performance. This study describes the development of a visuospatial working memory test, including a validity indicator for noncredible cognitive performance of adults with ADHD. Visuospatial working memory of adults with ADHD (n = 48) was first compared to the test performance of healthy individuals (n = 48). Furthermore, a simulation design was performed including 252 individuals who were randomly assigned to either a control group (n = 48) or to 1 of 3 simulation groups who were requested to feign ADHD (n = 204). Additional samples of 27 adults with ADHD and 69 instructed simulators were included to cross-validate findings from the first samples. Adults with ADHD showed impaired visuospatial working memory performance of medium size as compared to healthy individuals. Simulation groups committed significantly more errors and had shorter response times as compared to patients with ADHD. Moreover, binary logistic regression analysis was carried out to derive a validity index that optimally differentiates between true and feigned ADHD. ROC analysis demonstrated high classification rates of the validity index, as shown in excellent specificity (95.8%) and adequate sensitivity (60.3%). The visuospatial working memory test as presented in this study therefore appears sensitive in indicating cognitive impairment of adults with ADHD. Furthermore, the embedded validity index revealed promising results concerning the detection of noncredible cognitive performance of adults with ADHD. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
NASA Technical Reports Server (NTRS)
Wood, Jessica J.; Foster, Lee W.
2013-01-01
A Multilayer Insulation (MLI) and Broad Area Cooling (BAC) shield thermal control system shows promise for long-duration storage of cryogenic propellant. The NASA Cryogenic Propellant Storage and Transfer (CPST) project is investigating the thermal and structural performance of this tank-applied integrated system. The MLI/BAC Shield Acoustic and Thermal Test was performed to evaluate the MLI/BAC shield's structural performance by subjecting it to worst-case launch acoustic loads. Identical thermal tests using Liquid Nitrogen (LN2) were performed before and after the acoustic test. The data from these tests was compared to determine if any degradation occurred in the thermal performance of the system as a result of exposure to the acoustic loads. The thermal test series consisted of two primary components: a passive boil-off test to evaluate the MLI performance and an active cooling test to evaluate the integrated MLI/BAC shield system with chilled vapor circulating through the BAC shield tubes. The acoustic test used loads closely matching the worst-case envelope of all launch vehicles currently under consideration for CPST. Acoustic test results yielded reasonable responses for the given load. The thermal test matrix was completed prior to the acoustic test and successfully repeated after the acoustic test. Data was compared and yielded near identical results, indicating that the MLI/BAC shield configuration tested in this series is an option for structurally implementing this thermal control system concept.
Information theoretic analysis of edge detection in visual communication
NASA Astrophysics Data System (ADS)
Jiang, Bo; Rahman, Zia-ur
2010-08-01
Generally, the designs of digital image processing algorithms and image gathering devices remain separate. Consequently, the performance of digital image processing algorithms is evaluated without taking into account the artifacts introduced into the process by the image gathering process. However, experiments show that the image gathering process profoundly impacts the performance of digital image processing and the quality of the resulting images. Huck et al. proposed one definitive theoretic analysis of visual communication channels, where the different parts, such as image gathering, processing, and display, are assessed in an integrated manner using Shannon's information theory. In this paper, we perform an end-to-end information theory based system analysis to assess edge detection methods. We evaluate the performance of the different algorithms as a function of the characteristics of the scene, and the parameters, such as sampling, additive noise etc., that define the image gathering system. The edge detection algorithm is regarded to have high performance only if the information rate from the scene to the edge approaches the maximum possible. This goal can be achieved only by jointly optimizing all processes. People generally use subjective judgment to compare different edge detection methods. There is not a common tool that can be used to evaluate the performance of the different algorithms, and to give people a guide for selecting the best algorithm for a given system or scene. Our information-theoretic assessment becomes this new tool to which allows us to compare the different edge detection operators in a common environment.
Using the weighted area under the net benefit curve for decision curve analysis.
Talluri, Rajesh; Shete, Sanjay
2016-07-18
Risk prediction models have been proposed for various diseases and are being improved as new predictors are identified. A major challenge is to determine whether the newly discovered predictors improve risk prediction. Decision curve analysis has been proposed as an alternative to the area under the curve and net reclassification index to evaluate the performance of prediction models in clinical scenarios. The decision curve computed using the net benefit can evaluate the predictive performance of risk models at a given or range of threshold probabilities. However, when the decision curves for 2 competing models cross in the range of interest, it is difficult to identify the best model as there is no readily available summary measure for evaluating the predictive performance. The key deterrent for using simple measures such as the area under the net benefit curve is the assumption that the threshold probabilities are uniformly distributed among patients. We propose a novel measure for performing decision curve analysis. The approach estimates the distribution of threshold probabilities without the need of additional data. Using the estimated distribution of threshold probabilities, the weighted area under the net benefit curve serves as the summary measure to compare risk prediction models in a range of interest. We compared 3 different approaches, the standard method, the area under the net benefit curve, and the weighted area under the net benefit curve. Type 1 error and power comparisons demonstrate that the weighted area under the net benefit curve has higher power compared to the other methods. Several simulation studies are presented to demonstrate the improvement in model comparison using the weighted area under the net benefit curve compared to the standard method. The proposed measure improves decision curve analysis by using the weighted area under the curve and thereby improves the power of the decision curve analysis to compare risk prediction models in a clinical scenario.
Santarelli, Rosamaria; Magnavita, Vincenzo; De Filippi, Roberta; Ventura, Laura; Genovese, Elisabetta; Arslan, Edoardo
2009-04-01
To compare speech perception performance in children fitted with previous generation Nucleus sound processor, Sprint or Esprit 3G, and the Freedom, the most recently released system from the Cochlear Corporation that features a larger input dynamic range. Prospective intrasubject comparative study. University Medical Center. Seventeen prelingually deafened children who had received the Nucleus 24 cochlear implant and used the Sprint or Esprit 3G sound processor. Cochlear implantation with Cochlear device. Speech perception was evaluated at baseline (Sprint, n = 11; Esprit 3G, n = 6) and after 1 month's experience with the Freedom sound processor. Identification and recognition of disyllabic words and identification of vowels were performed via recorded voice in quiet (70 dB [A]), in the presence of background noise at various levels of signal-to-noise ratio (+10, +5, 0, -5) and at a soft presentation level (60 dB [A]). Consonant identification and recognition of disyllabic words, trisyllabic words, and sentences were evaluated in live voice. Frequency discrimination was measured in a subset of subjects (n = 5) by using an adaptive, 3-interval, 3-alternative, forced-choice procedure. Identification of disyllabic words administered at a soft presentation level showed a significant increase when switching to the Freedom compared with the previously worn processor in children using the Sprint or Esprit 3G. Identification and recognition of disyllabic words in the presence of background noise as well as consonant identification and sentence recognition increased significantly for the Freedom compared with the previously worn device only in children fitted with the Sprint. Frequency discrimination was significantly better when switching to the Freedom compared with the previously worn processor. Serial comparisons revealed that that speech perception performance evaluated in children aged 5 to 15 years was superior with the Freedom than previous generations of Nucleus sound processors. These differences are deemed to ensue from an increased input dynamic range, a feature that offers potentially enhanced phonemic discrimination.
Calibration of the Spanish PROMIS Smoking Item Banks.
Huang, Wenjing; Stucky, Brian D; Edelen, Maria O; Tucker, Joan S; Shadel, William G; Hansen, Mark; Cai, Li
2016-07-01
The Patient-Reported Outcomes Measurement Information System (PROMIS) Smoking Initiative has developed item banks for assessing six smoking behaviors and biopsychosocial correlates of smoking among adult cigarette smokers. The goal of this study is to evaluate the performance of the Spanish version of the PROMIS smoking item banks as compared to the original banks developed in English. The six PROMIS banks for daily smokers were translated into Spanish and administered to a sample of Spanish-speaking adult daily smokers in the United States (N = 302). We first evaluated the unidimensionality of each bank using confirmatory factor analysis. We then conducted a two-group item response theory calibration, including an item response theory-based Differential Item Functioning (DIF) analysis by language of administration (Spanish vs. English). Finally, we generated full bank and short form scores for the translated banks and evaluated their psychometric performance. Unidimensionality of the Spanish smoking item banks was supported by confirmatory factor analysis results. Out of a total of 109 items that were evaluated for language DIF, seven items in three of the six banks were identified as having levels of DIF that exceeded an established criterion. The psychometric performance of the Spanish daily smoker banks is largely comparable to that of the English versions. The Spanish PROMIS smoking item banks are highly similar, but not entirely equivalent, to the original English versions. The parameters from these two-group calibrations can be used to generate comparable bank scores across the two language versions. In this study, we developed a Spanish version of the PROMIS smoking toolkit, which was originally designed and developed for English speakers. With the growing Spanish-speaking population, it is important to make the toolkit more accessible by translating the items and calibrating the Spanish version to be comparable with English-language scores. This study provided the translated item banks and short forms, comparable unbiased scores for Spanish speakers and evaluations of the psychometric properties of the new Spanish toolkit. © The Author 2016. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Ceramic automotive Stirling engine program
NASA Technical Reports Server (NTRS)
1986-01-01
The Ceramic Automotive Stirling Engine Program evaluated the application of advanced ceramic materials to an automotive Stirling engine. The objective of the program was to evaluate the technical feasibility of utilizing advanced ceramics to increase peak engine operating temperature, and to evaluate the performance benefits of such an increase. Manufacturing cost estimates were also developed for various ceramic engine components and compared with conventional metallic engine component costs.
Motivation in vigilance - Effects of self-evaluation and experimenter-controlled feedback.
NASA Technical Reports Server (NTRS)
Warm, J. S.; Kanfer, F. H.; Kuwada, S.; Clark, J. L.
1972-01-01
Vigilance experiments have been performed to study the relative efficiency of feedback operations in enhancing vigilance performance. Two feedback operations were compared - i.e., experimenter-controlled feedback in the form of knowledge of results (KR) regarding response times to signal detections, and subject-controlled feedback in the form of self-evaluation (SE) of response times to signal detections. The subjects responded to the aperiodic offset of a visual signal during a 1-hr vigil. Both feedback operations were found to enhance performance efficiency: subjects in the KR and SE conditions had faster response times than controls receiving no evaluative feedback. Moreover, the data of the KR and SE groups did not differ significantly from each other. The results are discussed in terms of the hypothesis that self-evaluation is a critical factor underlying the incentive value of KR in vigilance tasks.
[Pre-operation evaluation and intra-operation management of cochlear implantation].
Zhang, Dao-xing; Hu, Bao-hua; Xiao, Yu-li; Shi, Bo-ning
2004-10-01
To summarize pre-operation evaluation experiences in cochlear implantation. Performing auditory evaluation and image analysis seriously in 158 severe hearing loss or total deaf cases before cochlear implantation, comparing their performance with the findings during and post operation. Among the total 158 cases, 116 cases with normal structure, 42 cases with the abnormal findings of the inner or middle ear. Stapedial gusher happened in 6 cases, 1 case was not predicted before operation. Except 1 case with serious malformation, the findings of other 157 cases in operation were consistent with the pre-operation evaluation. We helped all patients reconstruct auditory conduction with cochlear implantation, and the average hearing level up to 37.6 dB SPL. Performing image analysis seriously before operation and planning for operation according to HRCT can do great help to cochlear implantation. The operation under the HRCT instruction has less complications.
Dingare, Shipra; Nissim, Malvina; Finkel, Jenny; Grover, Claire
2005-01-01
We present a maximum entropy-based system for identifying named entities (NEs) in biomedical abstracts and present its performance in the only two biomedical named entity recognition (NER) comparative evaluations that have been held to date, namely BioCreative and Coling BioNLP. Our system obtained an exact match F-score of 83.2% in the BioCreative evaluation and 70.1% in the BioNLP evaluation. We discuss our system in detail, including its rich use of local features, attention to correct boundary identification, innovative use of external knowledge resources, including parsing and web searches, and rapid adaptation to new NE sets. We also discuss in depth problems with data annotation in the evaluations which caused the final performance to be lower than optimal. PMID:18629295
Performance Evaluation of the microPET®—FOCUS-F120
NASA Astrophysics Data System (ADS)
Laforest, Richard; Longford, Desmond; Siegel, Stefan; Newport, Danny F.; Yap, Jeffrey
2007-02-01
microPETreg-Focus-F120 is the latest model of dedicated small animal PET scanners from CTI-Concorde Microsystems LLC, (Knoxville, TN). This scanner, based on the geometry of the microPET-R4, takes advantage of several detector modifications to the coincidence processing electronics that improve the image resolution, sensitivity, and counting rate performance as compared to the predecessor models. This work evaluates the performance of the Focus-F120 system and shows its improvement over the earlier models. In particular, the spatial resolution is shown to improve from 2.32 to 1.69 mm at 5 mm radial distance and the peak absolute sensitivity increases from 4.1% to 7.1% compared to the microPET-R4. The counting rate capability, expressed in noise equivalent counting rate (NEC-1R), was shown to peak at over 800 kcps at 88 MBq for both systems using a mouse phantom. For this small phantom, the NECR counting rate is limited by the data transmission bandwidth between the scanner and the acquisition console. The rat-like phantom showed peak NEC-1R value at 300 kcps at 140 MBq. Evaluation of image quality and quantitation accuracy was also performed using specially designed phantoms and animal experiments
Towards systematic evaluation of crop model outputs for global land-use models
NASA Astrophysics Data System (ADS)
Leclere, David; Azevedo, Ligia B.; Skalský, Rastislav; Balkovič, Juraj; Havlík, Petr
2016-04-01
Land provides vital socioeconomic resources to the society, however at the cost of large environmental degradations. Global integrated models combining high resolution global gridded crop models (GGCMs) and global economic models (GEMs) are increasingly being used to inform sustainable solution for agricultural land-use. However, little effort has yet been done to evaluate and compare the accuracy of GGCM outputs. In addition, GGCM datasets require a large amount of parameters whose values and their variability across space are weakly constrained: increasing the accuracy of such dataset has a very high computing cost. Innovative evaluation methods are required both to ground credibility to the global integrated models, and to allow efficient parameter specification of GGCMs. We propose an evaluation strategy for GGCM datasets in the perspective of use in GEMs, illustrated with preliminary results from a novel dataset (the Hypercube) generated by the EPIC GGCM and used in the GLOBIOM land use GEM to inform on present-day crop yield, water and nutrient input needs for 16 crops x 15 management intensities, at a spatial resolution of 5 arc-minutes. We adopt the following principle: evaluation should provide a transparent diagnosis of model adequacy for its intended use. We briefly describe how the Hypercube data is generated and how it articulates with GLOBIOM in order to transparently identify the performances to be evaluated, as well as the main assumptions and data processing involved. Expected performances include adequately representing the sub-national heterogeneity in crop yield and input needs: i) in space, ii) across crop species, and iii) across management intensities. We will present and discuss measures of these expected performances and weight the relative contribution of crop model, input data and data processing steps in performances. We will also compare obtained yield gaps and main yield-limiting factors against the M3 dataset. Next steps include iterative improvement of parameter assumptions and evaluation of implications of GGCM performances for intended use in the IIASA EPIC-GLOBIOM model cluster. Our approach helps targeting future efforts at improving GGCM accuracy and would achieve highest efficiency if combined with traditional field-scale evaluation and sensitivity analysis.
External quality assessment for CD4 + T-lymphocyte count test
Gaspar, Pâmela Cristina; Wohlke, Bruna Lovizutto Protti; Brunialti, Milena Karina Coló; Pires, Ana Flávia; Kohiyama, Igor Massaki; Salomão, Reinaldo; Alonso Neto, José Boullosa; Júnior, Orlando da Costa Ferreira; Franchini, Miriam; Bazzo, Maria Luiza; Benzaken, Adele Schwartz
2018-01-01
Abstract The National Network for CD4+ T-lymphocyte counting of Brazil comprises 93 laboratories. This study reports the laboratory performances achieved in external quality assessment (EQA) rounds provides by Ministry of Health to evaluate the quality of the kits used and the performance of test by the technicians. Ten EQA rounds were analyzed according the EQA criteria aimed to evaluate individual laboratory performance on the basis of the accuracy of their results compared to the general mean obtained by all participating laboratories and the reproducibility of the results obtained between 2 samples from the same donor. The percentage of approved and failed laboratories in the EQAs tends to follow a uniform pattern. Since 2011, approval has remained above 80% and the failure rate has never exceeded 15%. EQA is very important to evaluate the performance of the laboratories, to identify monitor, and to resolve errors as quickly as possible. PMID:29794603
Performance assessment of a closed-loop system for diabetes management.
Martinez-Millana, A; Fico, G; Fernández-Llatas, C; Traver, V
2015-12-01
Telemedicine systems can play an important role in the management of diabetes, a chronic condition that is increasing worldwide. Evaluations on the consistency of information across these systems and on their performance in a real situation are still missing. This paper presents a remote monitoring system for diabetes management based on physiological sensors, mobile technologies and patient/doctor applications over a service-oriented architecture that has been evaluated in an international trial (83,905 operation records). The proposed system integrates three types of running environments and data engines in a single service-oriented architecture. This feature is used to assess key performance indicators comparing them with other type of architectures. Data sustainability across the applications has been evaluated showing better outcomes for full integrated sensors. At the same time, runtime performance of clients has been assessed spotting no differences regarding the operative environment.
[IMSS in numbers. Evaluation of the performance of health institutions in Mexico, 2004].
2006-01-01
The evaluation of health institutions performance in Mexico during 2004 was done using 29 indicators that describe intra-hospital mortality rates, productivity of health services, availability of health resources, quality of care, security, investment and costs of health care and the satisfaction level by users of health services. This exercise describes the efficiency and organization of health services provided by the different health institutions and allows comparing and balancing the performance of each institution. Results indicate the differences in availability of resources, inequity in the financing health care services, and inefficiency in the use of resources but also describe the level of efficacy of certain institutions and the satisfaction level that different users have of health services. The evaluation of the performance of the entire health institutions should provide the means to improve all the process of health care and to increase the quality of care in all health institutions in the country.
Montreal Communication Evaluation Battery--Portuguese version: age and education effects.
Kerr, Mônica de Souza; Pagliarin, Karina Carlesso; Mineiro, Ana; Ferré, Perrine; Joanette, Yves; Fonseca, Rochele Paz
2015-01-01
To verify age and education effects on communication performance of healthy adults in the Montreal Communication Evaluation Battery, Portuguese version (MAC-PT). The sample comprised 90 healthy adults from Portugal, European Portuguese speakers, divided into nine groups according to educational level (4-9, 10-13, and > 13 years of formal schooling) and age (19-40, 41-64, and 65-80 years). The influence of age and education was assessed by comparing mean scores between groups, using a two-way analysis of variance followed by Bonferroni post hoc tests (p ≤ 0.05). The results showed that participants' performance was influenced by age in pragmatic-inferential, discursive, and prosodic tasks. Education had the greatest influence on the performance in all processes evaluated by the MAC-PT. Age and education seem to influence the communicative performance and should be considered in the assessment of neurological patients.
Gryth, Dan; Rådestad, Monica; Nilsson, Heléne; Nerf, Ola; Svensson, Leif; Castrén, Maaret; Rüter, Anders
2010-01-01
Large, functional, disaster exercises are expensive to plan and execute, and often are difficult to evaluate objectively. Command and control in disaster medicine organizations can benefit from objective results from disaster exercises to identify areas that must be improved. The objective of this pilot study was to examine if it is possible to use performance indicators for documentation and evaluation of medical command and control in a full-scale major incident exercise at two levels: (1) local level (scene of the incident and hospital); and (2) strategic level of command and control. Staff procedure skills also were evaluated. Trained observers were placed in each of the three command and control locations. These observers recorded and scored the performance of command and control using templates of performance indicators. The observers scored the level of performance by awarding 2, 1, or 0 points according to the template and evaluated content and timing of decisions. Results from 11 performance indicators were recorded at each template and scores greater than 11 were considered as acceptable. Prehospital command and control had the lowest score. This also was expressed by problems at the scene of the incident. The scores in management and staff skills were at the strategic level 15 and 17, respectively; and at the hospital level, 17 and 21, respectively. It is possible to use performance indicators in a full-scale, major incident exercise for evaluation of medical command and control. The results could be used to compare similar exercises and evaluate real incidents in the future.
Evaluating Equating Accuracy and Assumptions for Groups that Differ in Performance
ERIC Educational Resources Information Center
Powers, Sonya; Kolen, Michael J.
2014-01-01
Accurate equating results are essential when comparing examinee scores across exam forms. Previous research indicates that equating results may not be accurate when group differences are large. This study compared the equating results of frequency estimation, chained equipercentile, item response theory (IRT) true-score, and IRT observed-score…
Comparative Robustness of Recent Methods for Analyzing Multivariate Repeated Measures Designs
ERIC Educational Resources Information Center
Seco, Guillermo Vallejo; Gras, Jaime Arnau; Garcia, Manuel Ato
2007-01-01
This study evaluated the robustness of two recent methods for analyzing multivariate repeated measures when the assumptions of covariance homogeneity and multivariate normality are violated. Specifically, the authors' work compares the performance of the modified Brown-Forsythe (MBF) procedure and the mixed-model procedure adjusted by the…
Evaluating health worker performance in Benin using the simulated client method with real children.
Rowe, Alexander K; Onikpo, Faustin; Lama, Marcel; Deming, Michael S
2012-10-08
The simulated client (SC) method for evaluating health worker performance utilizes surveyors who pose as patients to make surreptitious observations during consultations. Compared to conspicuous observation (CO) by surveyors, which is commonly done in developing countries, SC data better reflect usual health worker practices. This information is important because CO can cause performance to be better than usual. Despite this advantage of SCs, the method's full potential has not been realized for evaluating performance for pediatric illnesses because real children have not been utilized as SCs. Previous SC studies used scenarios of ill children that were not actually brought to health workers. During a trial that evaluated a quality improvement intervention in Benin (the Integrated Management of Childhood Illness [IMCI] strategy), we conducted an SC survey with adult caretakers as surveyors and real children to evaluate the feasibility of this approach and used the results to assess the validity of CO. We conducted an SC survey and a CO survey (one right after the other) of health workers in the same 55 health facilities. A detailed description of the SC survey process was produced. Results of the two surveys were compared for 27 performance indicators using logistic regression modeling. SC and CO surveyors observed 54 and 185 consultations, respectively. No serious problems occurred during the SC survey. Performance levels measured by CO were moderately higher than those measured by SCs (median CO - SC difference = 16.4 percentage-points). Survey differences were sometimes much greater for IMCI-trained health workers (median difference = 29.7 percentage-points) than for workers without IMCI training (median difference = 3.1 percentage-points). SC surveys can be done safely with real children if appropriate precautions are taken. CO can introduce moderately large positive biases, and these biases might be greater for health workers exposed to quality improvement interventions. http://clinicaltrials.gov Identifier NCT00510679.
Validating Visual Cues In Flight Simulator Visual Displays
NASA Astrophysics Data System (ADS)
Aronson, Moses
1987-09-01
Currently evaluation of visual simulators are performed by either pilot opinion questionnaires or comparison of aircraft terminal performance. The approach here is to compare pilot performance in the flight simulator with a visual display to his performance doing the same visual task in the aircraft as an indication that the visual cues are identical. The A-7 Night Carrier Landing task was selected. Performance measures which had high pilot performance prediction were used to compare two samples of existing pilot performance data to prove that the visual cues evoked the same performance. The performance of four pilots making 491 night landing approaches in an A-7 prototype part task trainer were compared with the performance of 3 pilots performing 27 A-7E carrier landing qualification approaches on the CV-60 aircraft carrier. The results show that the pilots' performances were similar, therefore concluding that the visual cues provided in the simulator were identical to those provided in the real world situation. Differences between the flight simulator's flight characteristics and the aircraft have less of an effect than the pilots individual performances. The measurement parameters used in the comparison can be used for validating the visual display for adequacy for training.
Influence of signal processing strategy in auditory abilities.
Melo, Tatiana Mendes de; Bevilacqua, Maria Cecília; Costa, Orozimbo Alves; Moret, Adriane Lima Mortari
2013-01-01
The signal processing strategy is a parameter that may influence the auditory performance of cochlear implant and is important to optimize this parameter to provide better speech perception, especially in difficult listening situations. To evaluate the individual's auditory performance using two different signal processing strategy. Prospective study with 11 prelingually deafened children with open-set speech recognition. A within-subjects design was used to compare performance with standard HiRes and HiRes 120 in three different moments. During test sessions, subject's performance was evaluated by warble-tone sound-field thresholds, speech perception evaluation, in quiet and in noise. In the silence, children S1, S4, S5, S7 showed better performance with the HiRes 120 strategy and children S2, S9, S11 showed better performance with the HiRes strategy. In the noise was also observed that some children performed better using the HiRes 120 strategy and other with HiRes. Not all children presented the same pattern of response to the different strategies used in this study, which reinforces the need to look at optimizing cochlear implant clinical programming.
Kocher, Benjamin K; Chalupa, Robyn L; Lopez, Donna M; Kirk, Kevin L
2016-11-01
Functional limitations after lower extremity surgery often require the use of an assistive device for ambulation during rehabilitation and recovery. There are no known objective data evaluating the wheeled knee walker as an assistive device for protected ambulation. The purpose of this study was to compare assisted ambulation and perceived exertion with the wheeled knee walker and the axillary crutches in healthy participants. A prospective, randomized crossover study was performed using 24 healthy volunteers. Each participant performed a 6-minute walk test (6MWT) using each assistive device in a crossover manner. Preactivity and postactivity heart rates were recorded. The self-selected walking velocity (SSWV) was calculated and the participant's rating of perceived exertion was recorded using the OMNI Rating of Perceived Exertion (OMNI-RPE). Participant's preference for assistive device was identified. The 6MWT, SSWV, and the Omni-RPE were evaluated using paired t tests and determined to be statistically significant for the wheeled knee walker compared with axillary crutches. Evaluation of the preactivity and postactivity heart rates demonstrated a statistically significant difference for the wheeled knee walker compared with axillary crutches. The wheeled knee walker was preferred by 88% of participants. The wheeled knee walker provided increased assisted ambulation and had a lower rating of perceived exertion than axillary crutches on level surfaces in healthy participants. Level III, comparative study. © The Author(s) 2016.
Performance evaluation of the new hematology analyzer Sysmex XN-series.
Seo, J Y; Lee, S-T; Kim, S-H
2015-04-01
The Sysmex XN-series is a new automated hematology analyzer designed to improve the accuracy of cell counts and the specificity of the flagging events. The basic characteristics and the performance of new measurement channels of the XN were evaluated and compared with the Sysmex XE-2100 and the manual method. Fluorescent platelet count (PLT-F) was compared with the flow cytometric method. The low WBC mode and body fluid mode were also evaluated. For workflow analysis, 1005 samples were analyzed on both the XN and the XE-2100, and manual review rates were compared. All parameters measured by the XN correlated well with the XE-2100. PLT-F showed better correlation with the flow cytometric method (r(2) = 0.80) compared with optical platelet count (r(2) = 0.73) for platelet counts <70 × 10(9) /L. The low WBC mode reported accurate leukocyte differentials for samples with a WBC count <0.5 × 10(9) /L. Relatively good correlation was found for WBC counts between the manual method and the body fluid mode (r = 0.88). The XN made less flags than the XE-2100, while the sensitivities of both instruments were comparable. The XN provided reliable results on low cell counts, as well as reduced manual blood film reviews, while maintaining a proper level of diagnostic sensitivity. © 2014 John Wiley & Sons Ltd.
NASA Astrophysics Data System (ADS)
He, Zhibin; Wen, Xiaohu; Liu, Hu; Du, Jun
2014-02-01
Data driven models are very useful for river flow forecasting when the underlying physical relationships are not fully understand, but it is not clear whether these data driven models still have a good performance in the small river basin of semiarid mountain regions where have complicated topography. In this study, the potential of three different data driven methods, artificial neural network (ANN), adaptive neuro fuzzy inference system (ANFIS) and support vector machine (SVM) were used for forecasting river flow in the semiarid mountain region, northwestern China. The models analyzed different combinations of antecedent river flow values and the appropriate input vector has been selected based on the analysis of residuals. The performance of the ANN, ANFIS and SVM models in training and validation sets are compared with the observed data. The model which consists of three antecedent values of flow has been selected as the best fit model for river flow forecasting. To get more accurate evaluation of the results of ANN, ANFIS and SVM models, the four quantitative standard statistical performance evaluation measures, the coefficient of correlation (R), root mean squared error (RMSE), Nash-Sutcliffe efficiency coefficient (NS) and mean absolute relative error (MARE), were employed to evaluate the performances of various models developed. The results indicate that the performance obtained by ANN, ANFIS and SVM in terms of different evaluation criteria during the training and validation period does not vary substantially; the performance of the ANN, ANFIS and SVM models in river flow forecasting was satisfactory. A detailed comparison of the overall performance indicated that the SVM model performed better than ANN and ANFIS in river flow forecasting for the validation data sets. The results also suggest that ANN, ANFIS and SVM method can be successfully applied to establish river flow with complicated topography forecasting models in the semiarid mountain regions.
NASA Astrophysics Data System (ADS)
Adnan, F. A.; Romlay, F. R. M.; Shafiq, M.
2018-04-01
Owing to the advent of the industrial revolution 4.0, the need for further evaluating processes applied in the additive manufacturing application particularly the computational process for slicing is non-trivial. This paper evaluates a real-time slicing algorithm for slicing an STL formatted computer-aided design (CAD). A line-plane intersection equation was applied to perform the slicing procedure at any given height. The application of this algorithm has found to provide a better computational time regardless the number of facet in the STL model. The performance of this algorithm is evaluated by comparing the results of the computational time for different geometry.
Performance of biometric quality measures.
Grother, Patrick; Tabassi, Elham
2007-04-01
We document methods for the quantitative evaluation of systems that produce a scalar summary of a biometric sample's quality. We are motivated by a need to test claims that quality measures are predictive of matching performance. We regard a quality measurement algorithm as a black box that converts an input sample to an output scalar. We evaluate it by quantifying the association between those values and observed matching results. We advance detection error trade-off and error versus reject characteristics as metrics for the comparative evaluation of sample quality measurement algorithms. We proceed this with a definition of sample quality, a description of the operational use of quality measures. We emphasize the performance goal by including a procedure for annotating the samples of a reference corpus with quality values derived from empirical recognition scores.
Schoenthaler, Martin; Avcil, Tuba; Sevcenco, Sabina; Nagele, Udo; Hermann, Thomas E W; Kuehhas, Franklin E; Shariat, Shahrokh F; Frankenschmidt, Alexander; Wetterauer, Ulrich; Miernik, Arkadiusz
2015-01-01
To evaluate the Single-Incision Transumbilical Surgery (SITUS) technique as compared to an established laparoendoscopic single-site surgery (LESS) technique (Single-Port Laparoscopic Surgery, SPLS) and conventional laparoscopy (CLS) in a surgical simulator model. Sixty-three medical students without previous laparoscopic experience were randomly assigned to one of the three groups (SITUS, SPLS and CLS). Subjects were asked to perform five standardized tasks of increasing difficulty adopted from the Fundamentals of Laparoscopic Surgery curriculum. Statistical evaluation included task completion times and accuracy. Overall performances of all tasks (except precision cutting) were significantly faster and of higher accuracy in the CLS and SITUS groups than in the SPLS group (p = 0.004 to p < 0.001). CLS and SITUS groups alone showed no significant difference in performance times and accuracy measurements for all tasks (p = 0.048 to p = 0.989). SITUS proved to be a simple, but highly effective technique to overcome restrictions of SPLS. In a surgical simulator model, novices were able to achieve task performances comparable to CLS and did significantly better than using a port-assisted LESS technique such as SPLS. The demonstrated advantages of SITUS may be attributed to a preservation of the basic principles of conventional laparoscopy, such as the use of straight instruments and an adequate degree of triangulation.
Evaluation of control strategies using an oxidation ditch benchmark.
Abusam, A; Keesman, K J; Spanjers, H; van, Straten G; Meinema, K
2002-01-01
This paper presents validation and implementation results of a benchmark developed for a specific full-scale oxidation ditch wastewater treatment plant. A benchmark is a standard simulation procedure that can be used as a tool in evaluating various control strategies proposed for wastewater treatment plants. It is based on model and performance criteria development. Testing of this benchmark, by comparing benchmark predictions to real measurements of the electrical energy consumptions and amounts of disposed sludge for a specific oxidation ditch WWTP, has shown that it can (reasonably) be used for evaluating the performance of this WWTP. Subsequently, the validated benchmark was then used in evaluating some basic and advanced control strategies. Some of the interesting results obtained are the following: (i) influent flow splitting ratio, between the first and the fourth aerated compartments of the ditch, has no significant effect on the TN concentrations in the effluent, and (ii) for evaluation of long-term control strategies, future benchmarks need to be able to assess settlers' performance.
Goal-oriented evaluation of binarization algorithms for historical document images
NASA Astrophysics Data System (ADS)
Obafemi-Ajayi, Tayo; Agam, Gady
2013-01-01
Binarization is of significant importance in document analysis systems. It is an essential first step, prior to further stages such as Optical Character Recognition (OCR), document segmentation, or enhancement of readability of the document after some restoration stages. Hence, proper evaluation of binarization methods to verify their effectiveness is of great value to the document analysis community. In this work, we perform a detailed goal-oriented evaluation of image quality assessment of the 18 binarization methods that participated in the DIBCO 2011 competition using the 16 historical document test images used in the contest. We are interested in the image quality assessment of the outputs generated by the different binarization algorithms as well as the OCR performance, where possible. We compare our evaluation of the algorithms based on human perception of quality to the DIBCO evaluation metrics. The results obtained provide an insight into the effectiveness of these methods with respect to human perception of image quality as well as OCR performance.
Safety performance evaluation of cable median barriers on freeways in Florida.
Alluri, Priyanka; Haleem, Kirolos; Gan, Albert; Mauthner, John
2016-07-03
This article aims to evaluate the safety performance of cable median barriers on freeways in Florida. The safety performance evaluation was based on the percentages of barrier and median crossovers by vehicle type, crash severity, and cable median barrier type (Trinity Cable Safety System [CASS] and Gibraltar system). Twenty-three locations with cable median barriers totaling about 101 miles were identified. Police reports of 6,524 crashes from years 2005-2010 at these locations were reviewed to verify and obtain detailed crash information. A total of 549 crashes were determined to be barrier related (i.e., crashes involving vehicles hitting the cable median barrier) and were reviewed in further detail to identify crossover crashes and the manner in which the vehicles crossed the barriers; that is, by either overriding, underriding, or penetrating the barriers. Overall, 2.6% of vehicles that hit the cable median barrier crossed the median and traversed into the opposite travel lane. Overall, 98.1% of cars and 95.5% of light trucks that hit the barrier were prevented from crossing the median. In other words, 1.9% of cars and 4.5% of light trucks that hit the barrier had crossed the median and encroached on the opposite travel lanes. There is no significant difference in the performance of cable median barrier for cars versus light trucks in terms of crossover crashes. In terms of severity, overrides were more severe compared to underrides and penetrations. The statistics showed that the CASS and Gibraltar systems performed similarly in terms of crossover crashes. However, the Gibraltar system experienced a higher proportion of penetrations compared to the CASS system. The CASS system resulted in a slightly higher percentage of moderate and minor injury crashes compared to the Gibraltar system. Cable median barriers are successful in preventing median crossover crashes; 97.4% of the cable median barrier crashes were prevented from crossing over the median. Of all of the vehicles that hit the barrier, 83.6% were either redirected or contained by the cable barrier system. Barrier crossover crashes were found to be more severe compared to barrier noncrossover crashes. In addition, overrides were found to be more severe compared to underrides and penetrations.
Simon, Alice; Amaro, Maria Inês; Healy, Anne Marie; Cabral, Lucio Mendes; de Sousa, Valeria Pereira
2016-10-15
In the present study, in vitro permeation experiments in a Franz diffusion cell were performed using different synthetic polymeric membranes and pig ear skin to evaluate a rivastigmine (RV) transdermal drug delivery system. In vitro-in vivo correlations (IVIVC) were examined to determine the best model membrane. In vitro permeation studies across different synthetic membranes and skin were performed for the Exelon(®) Patch (which contains RV), and the results were compared. Deconvolution of bioavailability data using the Wagner-Nelson method enabled the fraction of RV absorbed to be determined and a point-to-point IVIVC to be established. The synthetic membrane, Strat-M™, showed a RV permeation profile similar to that obtained with pig ear skin (R(2)=0.920). Studies with Strat-M™ resulted in a good and linear IVIVC (R(2)=0.991) when compared with other synthetic membranes that showed R(2) values less than 0.90. The R(2) for pig ear skin was 0.982. Strat-M™ membrane was the only synthetic membrane that adequately simulated skin barrier performance and therefore it can be considered to be a suitable alternative to human or animal skin in evaluating transdermal drug transport, potentially reducing the number of studies requiring human or animal samples. Copyright © 2016 Elsevier B.V. All rights reserved.
Semiautomated external defibrillators for in-hospital early defibrillation: a comparative study.
Nocchi, Federico; Derrico, Pietro; Masucci, Gerardina; Capussotto, Carlo; Cecchetti, Corrado; Ritrovato, Matteo
2014-01-01
Semiautomated external defibrillators (AEDs) should be considered as a means to facilitate in-hospital early defibrillation (IHED) in areas where advanced life support rescuers are not readily available. In this study, we aimed to develop a checklist and a measurement protocol to evaluate and compare AEDs by assessing factors that may affect IHED. A clinical and technical comparison of six AEDs was performed. Technical specifications were analyzed, while an emergency team evaluated ergonomics and appropriateness for IHED at Bambino Gesù Children's Hospital. A measurement protocol was implemented, which aimed to assess the ability of defibrillators to recognize shockable and nonshockable rhythms, accuracy of delivered energy, and charging time. Designs of AEDs differed in several features which influence their appropriateness for IHED. Some units showed poor ergonomics and instructions/feedback for cardiopulmonary resuscitation. Differences between defibrillators in recognizing shockable and nonshockable rhythms emerged for polymorphic ventricular tachycardia waveforms and when the frequency and amplitude of input signals varied. Tests for accuracy revealed poor performances at low and high impedance levels for most AEDs. Notably, differences greater than 20 seconds were found in the time from power-on to "ready for discharge." The approach we used to assess AEDs allowed us to evaluate their appropriateness with respect to the organizational context, to measure their parameters, and to compare models. Results showed that ergonomics and/or performances (timing and accuracy) could be improved in each device.
NASA Technical Reports Server (NTRS)
Irvine, R.; Van Alstine, R.
1979-01-01
The paper compares and describes the advantages of dry tuned gyros over floated gyros for space applications. Attention is given to describing the Teledyne SDG-5 gyro and the second-generation NASA Standard Dry Rotor Inertial Reference Unit (DRIRU II). Certain tests which were conducted to evaluate the SDG-5 and DRIRU II for specific mission requirements are outlined, and their results are compared with published test results on other gyro types. Performance advantages are highlighted.
NASA Technical Reports Server (NTRS)
1975-01-01
Gas turbine engines were assessed for application to hear duty transportation. A summary of the assumptions, applications, and methods of analysis is included along with a discussion of the approach taken, the technical program flow chart, and weighting criteria used for performance evaluation. The various engines are compared on the bases of weight, performance, emissions and noise, technology status, and growth potential. The results of the engine screening phase and the conceptual design phase are presented.
Khajouei, Reza; Hajesmaeel Gohari, Sadrieh; Mirzaee, Moghaddameh
2018-04-01
In addition to following the usual Heuristic Evaluation (HE) method, the usability of health information systems can also be evaluated using a checklist. The objective of this study is to compare the performance of these two methods in identifying usability problems of health information systems. Eight evaluators independently evaluated different parts of a Medical Records Information System using two methods of HE (usual and with a checklist). The two methods were compared in terms of the number of problems identified, problem type, and the severity of identified problems. In all, 192 usability problems were identified by two methods in the Medical Records Information System. This was significantly higher than the number of usability problems identified by the checklist and usual method (148 and 92, respectively) (p < 0.0001). After removing the duplicates, the difference between the number of unique usability problems identified by the checklist method (n = 100) and usual method (n = 44) was significant (p < 0.0001). Differences between the mean severity of the real usability problems (1.83) and those identified by only one of the methods (usual = 2.05, checklist = 1.74) were significant (p = 0.001). This study revealed the potential of the two HE methods for identifying usability problems of health information systems. The results demonstrated that the checklist method had significantly better performance in terms of the number of identified usability problems; however, the performance of the usual method for identifying problems of higher severity was significantly better. Although the checklist method can be more efficient for less experienced evaluators, wherever usability is critical, the checklist should be used with caution in usability evaluations. Copyright © 2018 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Mudigonda, Naga R.; Kacelenga, Ray; Edwards, Mark
2004-09-01
This paper evaluates the performance of a holographic neural network in comparison with a conventional feedforward backpropagation neural network for the classification of landmine targets in ground penetrating radar images. The data used in the study was acquired from four different test sites using the landmine detection system developed by General Dynamics Canada Ltd., in collaboration with the Defense Research and Development Canada, Suffield. A set of seven features extracted for each detected alarm is used as stimulus inputs for the networks. The recall responses of the networks are then evaluated against the ground truth to declare true or false detections. The area computed under the receiver operating characteristic curve is used for comparative purposes. With a large dataset comprising of data from multiple sites, both the holographic and conventional networks showed comparable trends in recall accuracies with area values of 0.88 and 0.87, respectively. By using independent validation datasets, the holographic network"s generalization performance was observed to be better (mean area = 0.86) as compared to the conventional network (mean area = 0.82). Despite the widely publicized theoretical advantages of the holographic technology, use of more than the required number of cortical memory elements resulted in an over-fitting phenomenon of the holographic network.
Schold, Jesse D; Miller, Charles M; Henry, Mitchell L; Buccini, Laura D; Flechner, Stuart M; Goldfarb, David A; Poggio, Emilio D; Andreoni, Kenneth A
2017-06-01
Scientific Registry of Transplant Recipients report cards of US organ transplant center performance are publicly available and used for quality oversight. Low center performance (LP) evaluations are associated with changes in practice including reduced transplant rates and increased waitlist removals. In 2014, Scientific Registry of Transplant Recipients implemented new Bayesian methodology to evaluate performance which was not adopted by Center for Medicare and Medicaid Services (CMS). In May 2016, CMS altered their performance criteria, reducing the likelihood of LP evaluations. Our aims were to evaluate incidence, survival rates, and volume of LP centers with Bayesian, historical (old-CMS) and new-CMS criteria using 6 consecutive program-specific reports (PSR), January 2013 to July 2015 among adult kidney transplant centers. Bayesian, old-CMS and new-CMS criteria identified 13.4%, 8.3%, and 6.1% LP PSRs, respectively. Over the 3-year period, 31.9% (Bayesian), 23.4% (old-CMS), and 19.8% (new-CMS) of centers had 1 or more LP evaluation. For small centers (<83 transplants/PSR), there were 4-fold additional LP evaluations (52 vs 13 PSRs) for 1-year mortality with Bayesian versus new-CMS criteria. For large centers (>183 transplants/PSR), there were 3-fold additional LP evaluations for 1-year mortality with Bayesian versus new-CMS criteria with median differences in observed and expected patient survival of -1.6% and -2.2%, respectively. A significant proportion of kidney transplant centers are identified as low performing with relatively small survival differences compared with expected. Bayesian criteria have significantly higher flagging rates and new-CMS criteria modestly reduce flagging. Critical appraisal of performance criteria is needed to assess whether quality oversight is meeting intended goals and whether further modifications could reduce risk aversion, more efficiently allocate resources, and increase transplant opportunities.
Training gastroenterology fellows to perform gastric polypectomy using a novel ex vivo model
Chen, Ming-Jen; Lin, Ching-Chung; Liu, Chia-Yuan; Chen, Chih-Jen; Chang, Chen-Wang; Chang, Ching-Wei; Lee, Chien-Wei; Shih, Shou-Chuan; Wang, Horng-Yuan
2011-01-01
AIM: To evaluate the effect of hands-on training of gastroenterology fellows in gastric polypectomy using an ex vivo simulator. METHODS: Eight gastroenterology fellows at Mackay Memorial Hospital, Taipei were evaluated in gastric polypectomy techniques using a pig stomach with artificial polyps created by a rubber band ligation device. The performance of four second year (year-2) fellows who had undergone one year of clinical training was compared with that of four first year (year-1) fellows both before and after a 4-h workshop using the ex vivo simulator. The workshop allowed for hands-on training in the removal of multiple artificial polyps and the placement of hemoclips at the excision site. Evaluation included observation of technical skills, procedure time, and the fellows’ confidence scale. RESULTS: One week after the workshop, the year-1 fellows were re-evaluated and had significantly improved mean performance scores (from 17.9 ± 1.8 to 22.5 ± 0.7), confidence scale (from 4.5 ± 1.0 to 7.8 ± 0.5) and procedure time (from 615.0 ± 57.4 s to 357.5 ± 85.0 s) compared with their baseline performance. After 4 h of training using the ex vivo simulator, the skills of the year-1 fellows were statistically similar to those of the year-2 fellows. CONCLUSION: Use of this ex vivo simulator significantly improved the endoscopic gastric polypectomy skills of gastroenterology fellows who had not had previous clinical training in gastric polypectomy. PMID:22147969
NASA Astrophysics Data System (ADS)
Molea, A.; Visuian, P.; Barabás, I.; Suciu, R. C.; Burnete, N. V.
2017-10-01
In this paper there were presented researches related to preparation and characterization of physicochemical properties of diesel-ethanol blends stabilized with tetrahydrofuran as surfactant, in order to be used as fuels in compression ignition engines. The main spray characteristics and engine performances of these blends were evaluated by using AVL Fire software. In the first stage of the studies, commercial diesel was mixed with ethanol, in different concentrations (between 2% and 15% v/v), followed by the addition of tetrahydrofuran (THF) until the blends were miscible, i.e. the blends were stabilized. The experiments were done at room temperature (22 °C). The obtained blends were characterized in order to determine the chemical composition and physicochemical properties, i.e. density, kinematic viscosity, surface tension. UV-Vis spectroscopy was utilized in order to determine a semi-quantitative evaluation regarding the chemical composition of the prepared blends and chemical interaction between diesel, ethanol and THF. Based on the determined properties, the fuel spray characteristics, engine performances and emission characteristics were evaluated by simulation using the AVL Fire software. The obtained results regarding physicochemical properties of blends were compared with diesel. Some improvements were observed when operating with the prepared blends compared to diesel with respect to engine performances and emission characteristics. Based on physicochemical evaluation and computer simulation, it was demonstrated that diesel-ethanol-tetrahydrofuran blends can be used as alternative fuel in compression ignition engines.
Habibov, Nazim N; Fan, Lida
2010-11-01
In the mid-1990s, the responsibilities to design, implement, and evaluate social welfare programs were transferred from federal to local jurisdictions in many countries of North America and Europe through devolution processes. Devolution has caused the need for a technique to measure and compare the performances of social welfare programs across multiple jurisdictions. This paper utilizes Data Envelopment Analysis (DEA) for a comparison of poverty reduction performances of jurisdictional social welfare programs across Canadian provinces. From the theoretical perspective, findings of this paper demonstrates that DEA is a promising method to evaluate, compare, and benchmark poverty reduction performance across multiple jurisdictions using multiple inputs and outputs. This paper demonstrates that DEA generates easy to comprehend composite rankings of provincial performances, identifies appropriate benchmarks for each inefficient province, and estimates sources and amounts of improvement needed to make the provinces efficient. From a practical perspective the empirical results presented in this paper indicate that Newfoundland, Prince Edwards Island, and Alberta achieve better efficiency in poverty reduction than other provinces. Policy makers and social administrators of the ineffective provinces across Canada may find benefit in selecting one of the effective provinces as a benchmark for improving their own performance based on similar size and structure of population, size of the budget for social programs, and traditions with administering particular types of social programs. Copyright (c) 2009 Elsevier Ltd. All rights reserved.
Ren, Wei-Wei; Li, Xiao-Long; Wang, Dan; Liu, Bo-Ji; Zhao, Chong-Ke; Xu, Hui-Xiong
2018-04-13
To evaluate a special kind of ultrasound (US) shear wave elastography for differential diagnosis of breast lesions, using a new qualitative analysis (i.e. the elasticity score in the travel time map) compared with conventional quantitative analysis. From June 2014 to July 2015, 266 pathologically proven breast lesions were enrolled in this study. The maximum, mean, median, minimum, and standard deviation of shear wave speed (SWS) values (m/s) were assessed. The elasticity score, a new qualitative feature, was evaluated in the travel time map. The area under the receiver operating characteristic (AUROC) curves were plotted to evaluate the diagnostic performance of both qualitative and quantitative analyses for differentiation of breast lesions. Among all quantitative parameters, SWS-max showed the highest AUROC (0.805; 95% CI: 0.752, 0.851) compared with SWS-mean (0.786; 95% CI:0.732, 0.834; P = 0.094), SWS-median (0.775; 95% CI:0.720, 0.824; P = 0.046), SWS-min (0.675; 95% CI:0.615, 0.731; P = 0.000), and SWS-SD (0.768; 95% CI:0.712, 0.817; P = 0.074). The AUROC of qualitative analysis in this study obtained the best diagnostic performance (0.871; 95% CI: 0.825, 0.909, compared with the best parameter of SWS-max in quantitative analysis, P = 0.011). The new qualitative analysis of shear wave travel time showed the superior diagnostic performance in the differentiation of breast lesions in comparison with conventional quantitative analysis.
Nuclear thermal rocket nozzle testing and evaluation program
NASA Technical Reports Server (NTRS)
Davidian, Kenneth O.; Kacynski, Kenneth J.
1993-01-01
Performance characteristics of the Nuclear Thermal Rocket can be enhanced through the use of unconventional nozzles as part of the propulsion system. The Nuclear Thermal Rocket nozzle testing and evaluation program being conducted at the NASA Lewis is outlined and the advantages of a plug nozzle are described. A facility description, experimental designs and schematics are given. Results of pretest performance analyses show that high nozzle performance can be attained despite substantial nozzle length reduction through the use of plug nozzles as compared to a convergent-divergent nozzle. Pretest measurement uncertainty analyses indicate that specific impulse values are expected to be within + or - 1.17 pct.
A number of PCR-based methods for detecting human fecal material in environmental waters have been developed over the past decade, but these methods have rarely received independent comparative testing. Here, we evaluated ten of these methods (BacH, BacHum-UCD, B. thetaiotaomic...
Bipolar-power-transistor-based limiter for high frequency ultrasound imaging systems.
Choi, Hojong; Yang, Hao-Chung; Shung, K Kirk
2014-03-01
High performance limiters are described in this paper for applications in high frequency ultrasound imaging systems. Limiters protect the ultrasound receiver from the high voltage (HV) spikes produced by the transmitter. We present a new bipolar power transistor (BPT) configuration and compare its design and performance to a diode limiter used in traditional ultrasound research and one commercially available limiter. Limiter performance depends greatly on the insertion loss (IL), total harmonic distortion (THD) and response time (RT), each of which will be evaluated in all the limiters. The results indicated that, compared with commercial limiter, BPT-based limiter had less IL (-7.7 dB), THD (-74.6 dB) and lower RT (43 ns) at 100 MHz. To evaluate the capability of these limiters, they were connected to a 100 MHz single element transducer and a two-way pulse-echo test was performed. It was found that the -6 dB bandwidth and sensitivity of the transducer using BPT-based limiter were better than those of the commercial limiter by 22% and 140%, respectively. Compared to the commercial limiter, BPT-based limiter is shown to be capable of minimizing signal attenuation, RT and THD at high frequencies and is thus suited for high frequency ultrasound applications. Copyright © 2013 Elsevier B.V. All rights reserved.
von Sperling, M; Oliveira, S C
2009-01-01
This article evaluates and compares the actual behavior of 166 full-scale anaerobic and aerobic wastewater treatment plants in operation in Brazil, providing information on the performance of the processes in terms of the quality of the generated effluent and the removal efficiency achieved. The observed results of effluent concentrations and removal efficiencies of the constituents BOD, COD, TSS (total suspended solids), TN (total nitrogen), TP (total phosphorus) and FC (faecal or thermotolerant coliforms) have been compared with the typical expected performance reported in the literature. The treatment technologies selected for study were: (a) predominantly anaerobic: (i) septic tank + anaerobic filter (ST + AF), (ii) UASB reactor without post-treatment (UASB) and (iii) UASB reactor followed by several post-treatment processes (UASB + POST); (b) predominantly aerobic: (iv) facultative pond (FP), (v) anaerobic pond followed by facultative pond (AP + FP) and (vi) activated sludge (AS). The results, confirmed by statistical tests, showed that, in general, the best performance was achieved by AS, but closely followed by UASB reactor, when operating with any kind of post-treatment. The effluent quality of the anaerobic processes ST + AF and UASB reactor without post-treatment was very similar to the one presented by facultative pond, a simpler aerobic process, regarding organic matter.
Assessment of Existing Data and Reports for System Evaluation
NASA Technical Reports Server (NTRS)
Matolak, David W.; Skidmore, Trent A.
2000-01-01
This report describes work done as part of the Weather Datalink Research project grant. We describe the work done under Task 1 of this project: the assessment of the suitability of available reports and data for use in evaluation of candidate weather datalink systems, and the development of a performance parameter set for comparative system evaluation. It was found that existing data and reports are inadequate for a complete physical layer characterization, but that these reports provide a good foundation for system comparison. In addition, these reports also contain some information useful for evaluation at higher layers. The performance parameter list compiled can be viewed as near complete-additional investigations, both analytical/simulation and experimental, will likely result in additions and improvements to this list.
Evaluating Hydrogen Evolution and Oxidation in Alkaline Media to Establish Baselines
Alia, Shaun M.; Pivovar, Bryan S.
2018-04-28
This paper fills a significant gap in the literature for alkaline hydrogen evolution (HER) and oxidation (HOR) baseline performance, while reviewing the different variables that influence observed properties. Although high-performing HER-HOR catalysts in acidic electrolytes are too active to measure kinetic in rotating disk electrode (RDE) half-cells, under alkaline conditions RDE kinetics evaluations are relevant and half-cell performances are comparable to hydrogen pump data. This paper focuses on best practices to ensure that half-cell tests don't unnecessarily lower platinum group metal (PGM) performance or improve non-PGM performance. Specific aspects examined include experiments on PGMs minimizing the impact of impurities (electrolyte,more » cell material) and experiments on non-PGMs minimizing the impact from test protocols (counter electrode).« less
Evaluating Hydrogen Evolution and Oxidation in Alkaline Media to Establish Baselines
DOE Office of Scientific and Technical Information (OSTI.GOV)
Alia, Shaun M.; Pivovar, Bryan S.
This paper fills a significant gap in the literature for alkaline hydrogen evolution (HER) and oxidation (HOR) baseline performance, while reviewing the different variables that influence observed properties. Although high-performing HER-HOR catalysts in acidic electrolytes are too active to measure kinetic in rotating disk electrode (RDE) half-cells, under alkaline conditions RDE kinetics evaluations are relevant and half-cell performances are comparable to hydrogen pump data. This paper focuses on best practices to ensure that half-cell tests don't unnecessarily lower platinum group metal (PGM) performance or improve non-PGM performance. Specific aspects examined include experiments on PGMs minimizing the impact of impurities (electrolyte,more » cell material) and experiments on non-PGMs minimizing the impact from test protocols (counter electrode).« less
Hripcsak, George; Wilcox, Adam
2002-01-01
Medical informatics systems are often designed to perform at the level of human experts. Evaluation of the performance of these systems is often constrained by lack of reference standards, either because the appropriate response is not known or because no simple appropriate response exists. Even when performance can be assessed, it is not always clear whether the performance is sufficient or reasonable. These challenges can be addressed if an evaluator enlists the help of clinical domain experts. 1) The experts can carry out the same tasks as the system, and then their responses can be combined to generate a reference standard. 2)The experts can judge the appropriateness of system output directly. 3) The experts can serve as comparison subjects with which the system can be compared. These are separate roles that have different implications for study design, metrics, and issues of reliability and validity. Diagrams help delineate the roles of experts in complex study designs.
Evaluation of FSO System Availability in Haze Condition
NASA Astrophysics Data System (ADS)
Anis, A. A.; Rashidi, C. B. M.; Aljunid, S. A.; Rahman, A. K.
2018-03-01
In this paper, we proposed the evaluation of FSO system availability in haze condition. The atmospheric attenuation by weather conditions in the atmosphere as the most challenging problem of FSO system as the system performance is severely degraded and causing the signal optic to be transmitted poorly. The effects of haze condition on the performance of FSO system is stressed out and focused in this paper. From the evaluation of the analysis, designs of FSO system are proposed to obtain a system with improved link performance in haze conditions. The scattering coefficient and the atmospheric attenuation are determined using Beer’s Lambert equation. From the research, the link performance of the system is greatly improved using Design 2 with minimum BER of 10-127127 and maximu m Q Factor of 23.98. The FSO system using Design 2 has better performance compared to Design 1 in haze condition as the optical signals could penetrate the dense haze better without losing much optical power during the transmission to the scattering.
Economic evaluations in pain management: principles and methods.
Asche, Carl V; Seal, Brian; Jackson, Kenneth C; Oderda, Gary M
2006-01-01
This paper describes how investigators may design, conduct, and report economic evaluations of pharmacotherapy for pain and symptom management. Because economic evaluation of therapeutic interventions is becoming increasingly important, there is a need for guidance on how economic evaluations can be optimally conducted. The steps required to conduct an economic evaluation are described to provide this guidance. Economic evaluations require two or more therapeutic interventions to be compared in relation to costs and effects. There are five types of economic evaluations, based on analysis of: (1) cost-effectiveness, (2) cost-utility, (3) cost-minimization, (4) cost-consequence, and (5) cost-benefit analyses. The six required steps are: identify the perspective of the study; identify the alternatives that will be compared; identify the relevant costs and effects; determine how to collect the cost and effect data; determine how to perform calculation for cost and effects data; and determine the manner in which to depict the results and draw comparisons.
Damman, Olga C; De Jong, Anco; Hibbard, Judith H; Timmermans, Danielle R M
2016-01-01
Study objectives We aimed to investigate how different presentation formats influence comprehension and use of comparative performance information (CPI) among consumers. Methods An experimental between-subjects and within-subjects design with manipulations of CPI presentation formats. We enrolled both consumers with lower socioeconomic status (SES)/cognitive skills and consumers with higher SES/cognitive skills, recruited through an online access panel. Respondents received fictitious CPI and completed questions about interpretation and information use. Between subjects, we tested (1) displaying an overall performance score (yes/no); (2) displaying a small number of quality indicators (5 vs 9); and (3) displaying different types of evaluative symbols (star ratings, coloured dots and word icons vs numbers and bar graphs). Within subjects, we tested the effect of a reduced number of healthcare providers (5 vs 20). Data were analysed using descriptive analysis, analyses of variance and paired-sampled t tests. Results A total of 902 (43%) respondents participated. Displaying an overall performance score and the use of coloured dots and word icons particularly enhanced consumer understanding. Importantly, respondents provided with coloured dots most often correctly selected the top three healthcare providers (84.3%), compared with word icons (76.6% correct), star ratings (70.6% correct), numbers (62.0%) and bars (54.2%) when viewing performance scores of 20 providers. Furthermore, a reduced number of healthcare providers appeared to support consumers, for example, when provided with 20 providers, 69.5% correctly selected the top three, compared with 80.2% with five providers. Discussion Particular presentation formats enhanced consumer understanding of CPI, most importantly the use of overall performance scores, word icons and coloured dots, and a reduced number of providers displayed. Public report efforts should use these formats to maximise impact on consumers. PMID:26543066
DOT National Transportation Integrated Search
2003-11-01
The objectives of this research are to determine the effect of unbound drainable base types on the performance of PCCP and the efficiency of fiber-reinforced polymer (FRP) dowels, compared to epoxy coated steel dowels, when retrofitted to re-establis...
Uniform task level definitions for robotic system performance comparisons
NASA Technical Reports Server (NTRS)
Price, Charles; Tesar, Delbert
1989-01-01
A series of ten task levels of increasing difficulty was compiled for use in comparative performance evaluations of available and future robotics technology. Each level has a breakdown of ten additional levels of difficulty to provide a layering of 100 levels. It is assumed that each level of task performance must be achieved by the system before it can be appropriately considered for the next level.
Performance of Color Camera Machine Vision in Automated Furniture Rough Mill Systems
D. Earl Kline; Agus Widoyoko; Janice K. Wiedenbeck; Philip A. Araman
1998-01-01
The objective of this study was to evaluate the performance of color camera machine vision for lumber processing in a furniture rough mill. The study used 134 red oak boards to compare the performance of automated gang-rip-first rough mill yield based on a prototype color camera lumber inspection system developed at Virginia Tech with both estimated optimum rough mill...
Zijlstra, Eric; Heinemann, Lutz; Fischer, Annelie; Kapitza, Christoph
2016-01-01
Background: The objective was to evaluate the performance (in terms of accuracy, precision, and trueness) of 5 CE-certified and commercially available blood glucose (BG) systems (meters plus test strips) using an innovative clinical-experimental study design with a 3-step glucose clamp approach and frequent capillary BG measurements. Methods: Sixteen subjects with type 1 diabetes participated in this open label, single center trial. BG was clamped at 3 levels for 60 minutes each: 60-100-200 mg/dL. Medical staff performed regular finger pricks (up to 10 per BG level) to obtain capillary blood samples for paired BG measurements with the 5 BG systems and a laboratory method as comparison. Results: Three BG systems displayed significantly lower mean absolute relative deviations (MARD) (ACCU-Chek® Aviva Nano [5.4%], BGStar® [5.1%], iBGStar® [5.3%]) than 2 others (FreeStyle InsuLinx® [7.7%], OneTouch Verio®IQ [10.3%]). The measurement precision of all BG systems was comparable, but relative bias was also lower for the 3 systems with lower MARD (ACCU-Chek [1.3%], BGStar [–0.9%], iBGStar [1.0%]) compared with the 2 others (FreeStyle [–7.2%], OneTouch [8.9%]). Conclusions: This 3 range glucose clamp approach enables a systematic performance evaluation of BG systems under controlled and reproducible conditions. The random error of the tested BG systems was comparable, but some showed a lower systematic error than others. These BG systems allow an accurate glucose measurement at low, normal and high BG levels. PMID:27605592
Zijlstra, Eric; Heinemann, Lutz; Fischer, Annelie; Kapitza, Christoph
2016-11-01
The objective was to evaluate the performance (in terms of accuracy, precision, and trueness) of 5 CE-certified and commercially available blood glucose (BG) systems (meters plus test strips) using an innovative clinical-experimental study design with a 3-step glucose clamp approach and frequent capillary BG measurements. Sixteen subjects with type 1 diabetes participated in this open label, single center trial. BG was clamped at 3 levels for 60 minutes each: 60-100-200 mg/dL. Medical staff performed regular finger pricks (up to 10 per BG level) to obtain capillary blood samples for paired BG measurements with the 5 BG systems and a laboratory method as comparison. Three BG systems displayed significantly lower mean absolute relative deviations (MARD) (ACCU-Chek® Aviva Nano [5.4%], BGStar® [5.1%], iBGStar® [5.3%]) than 2 others (FreeStyle InsuLinx® [7.7%], OneTouch Verio®IQ [10.3%]). The measurement precision of all BG systems was comparable, but relative bias was also lower for the 3 systems with lower MARD (ACCU-Chek [1.3%], BGStar [-0.9%], iBGStar [1.0%]) compared with the 2 others (FreeStyle [-7.2%], OneTouch [8.9%]). This 3 range glucose clamp approach enables a systematic performance evaluation of BG systems under controlled and reproducible conditions. The random error of the tested BG systems was comparable, but some showed a lower systematic error than others. These BG systems allow an accurate glucose measurement at low, normal and high BG levels. © 2016 Diabetes Technology Society.
Noël, P; Fagot, H; Fabre, J M; Mann, C; Quenet, F; Guillon, F; Baumel, H; Domergue, J
1994-01-01
Laparoscopic intestinal anastomosis is not very reliable and needs to be evaluate in an experimental model in animals before being performed in man. The purpose of this study was to evaluate the feasibility, efficacy and safety of manual anastomosis comparatively to the standard stapling suture. Twenty female pigs weighing 20 +/- 5 kg. were used for this study. A 5 cm ileal segment resection was performed under laparoscopy. The animals were assigned to 2 groups. Group I: 10 animals underwent end-to-end hand-swen anastomosis with Polyglactin 910, dec 1.5. Group II: 10 animals underwent side-to-side anastomosis using the Endo stapler. Operating time and anastomosis time were compared using the Mann-Whitney test for statistical analysis. On the 15th postoperative day, the animals were sacrificed and the anastomoses were evaluated. There was no operating death in the 2 groups. The operative time was significantly longer in group I than in group II (p < 0.01), with 180 +/- 40 min vs 49 +/- 25 min respectively. This difference was due to the anastomosis time of 130 +/- 40 min vs 16 +/- 6 min respectively (p < 0.01). There was 1 postoperative death related to fistula and peritonitis in group I and none in group II. The post-operative follow-ing showed 5 anastomotic leakages (4 in group I and 1 in group II) and 2 relative stenoses in group I. This study shows the simplicity and rapidity of performing stapling intestinal anastomosis under laparoscopy. Hand-sewn anastomosis is technically more difficult to perform under laparoscopy and requires a greater experience.
Toward imaging the body at 10.5 tesla.
Ertürk, M Arcan; Wu, Xiaoping; Eryaman, Yiğitcan; Van de Moortele, Pierre-François; Auerbach, Edward J; Lagore, Russell L; DelaBarre, Lance; Vaughan, J Thomas; Uğurbil, Kâmil; Adriany, Gregor; Metzger, Gregory J
2017-01-01
To explore the potential of performing body imaging at 10.5 Tesla (T) compared with 7.0T through evaluating the transmit/receive performance of similarly configured dipole antenna arrays. Fractionated dipole antenna elements for 10.5T body imaging were designed and evaluated using numerical simulations. Transmit performance of antenna arrays inside the prostate, kidneys and heart were investigated and compared with those at 7.0T using both phase-only radiofrequency (RF) shimming and multi-spoke pulses. Signal-to-noise ratio (SNR) comparisons were also performed. A 10-channel antenna array was constructed to image the abdomen of a swine at 10.5T. Numerical methods were validated with phantom studies at both field strengths. Similar power efficiencies were observed inside target organs with phase-only shimming, but RF nonuniformity was significantly higher at 10.5T. Spokes RF pulses allowed similar transmit performance with accompanying local specific absorption rate increases of 25-90% compared with 7.0T. Relative SNR gains inside the target anatomies were calculated to be >two-fold higher at 10.5T, and 2.2-fold SNR gain was measured in a phantom. Gradient echo and fast spin echo imaging demonstrated the feasibility of body imaging at 10.5T with the designed array. While comparable power efficiencies can be achieved using dipole antenna arrays with static shimming at 10.5T; increasing RF nonuniformities underscore the need for efficient, robust, and safe parallel transmission methods. Magn Reson Med 77:434-443, 2017. © 2016 International Society for Magnetic Resonance in Medicine. © 2016 International Society for Magnetic Resonance in Medicine.
MRI compatibility of robot actuation techniques--a comparative study.
Fischer, Gregory S; Krieger, Axel; Iordachita, Iulian; Csoma, Csaba; Whitcomb, Louis L; Gabor, Fichtinger
2008-01-01
This paper reports an experimental evaluation of the following three different MRI-compatible actuators: a Shinsei ultrasonic motor a Nanomotion ultrasonic motor and a pneumatic cylinder actuator. We report the results of a study comparing the effect of these actuators on the signal to noise ratio (SNR) of MRJ images under a variety of experimental conditions. Evaluation was performed with the controller inside and outside the scanner room and with both 1.5T and 3T MRI scanners. Pneumatic cylinders function with no loss of SNR with controller both inside and outside of the scanner room. The Nanomotion motor performs with moderate loss of SNR when moving during imaging. The Shinsei is unsuitable for motion during imaging. All may be used when motion is appropriately interleaved with imaging cycles.
Analytical methods development for supramolecular design in solar hydrogen production
NASA Astrophysics Data System (ADS)
Brown, J. R.; Elvington, M.; Mongelli, M. T.; Zigler, D. F.; Brewer, K. J.
2006-08-01
In the investigation of alternative energy sources, specifically, solar hydrogen production from water, the ability to perform experiments with a consistent and reproducible light source is key to meaningful photochemistry. The design, construction, and evaluation of a series of LED array photolysis systems for high throughput photochemistry have been performed. Three array systems of increasing sophistication are evaluated using calorimetric measurements and potassium tris(oxalato)ferrate(II) chemical actinometry and compared with a traditional 1000 W Xe arc lamp source. The results are analyzed using descriptive statistics and analysis of variance (ANOVA). The third generation array is modular, and controllable in design. Furthermore, the third generation array system is shown to be comparable in both precision and photonic output to a 1000 W Xe arc lamp.
Model Performance Evaluation and Scenario Analysis ...
This tool consists of two parts: model performance evaluation and scenario analysis (MPESA). The model performance evaluation consists of two components: model performance evaluation metrics and model diagnostics. These metrics provides modelers with statistical goodness-of-fit measures that capture magnitude only, sequence only, and combined magnitude and sequence errors. The performance measures include error analysis, coefficient of determination, Nash-Sutcliffe efficiency, and a new weighted rank method. These performance metrics only provide useful information about the overall model performance. Note that MPESA is based on the separation of observed and simulated time series into magnitude and sequence components. The separation of time series into magnitude and sequence components and the reconstruction back to time series provides diagnostic insights to modelers. For example, traditional approaches lack the capability to identify if the source of uncertainty in the simulated data is due to the quality of the input data or the way the analyst adjusted the model parameters. This report presents a suite of model diagnostics that identify if mismatches between observed and simulated data result from magnitude or sequence related errors. MPESA offers graphical and statistical options that allow HSPF users to compare observed and simulated time series and identify the parameter values to adjust or the input data to modify. The scenario analysis part of the too
Hoffmann, Clement; Falzone, Elisabeth; Verret, Catherine; Pasquier, Pierre; Leclerc, Thomas; Donat, Nicolas; Jost, Daniel; Mérat, Stephane; Maurice, Guillaume de Saint; Lenoir, Bernard; Auroy, Yves; Tourtier, Jean-Pierre
2013-09-01
We compared the subjective quality of pulmonary auscultation between 2 acoustic stethoscopes (Holtex Ideal® and Littmann Cardiology III®) and an electronic stethoscope (Littmann 3200®) in the operating room. A prospective double-blind randomized study with an evaluation during mechanical ventilation was performed in 100 patients. After each examination, the listeners using a numeric scale (0-10) rated the quality of auscultation. Auscultation quality was compared in patients among stethoscopes with a multilevel mixed-effects linear regression with random intercept (operator effect), adjusted on significant factors in univariate analysis. A significant difference was defined as P < 0.05. One hundred comparative evaluations of pulmonary auscultation were performed. The quality of auscultation was rated 8.2 ± 1.6 for the electronic stethoscope, 7.4 ± 1.8 for the Littmann Cardiology III, and 4.6 ± 1.8 for the Holtex Ideal. Compared with Holtex Ideal, auscultation quality was significantly higher with other stethoscopes (P < 0.0001). Compared with Littmann Cardiology III, auscultation quality was significantly higher with Littmann 3200 electronic stethoscope (β = 0.9 [95% confidence interval, 0.5-1.3]). An electronic stethoscope can provide a better quality of pulmonary auscultation than acoustic stethoscopes in the operating room, yet with a magnitude of improvement marginally higher than that provided with a high performance acoustic stethoscope. Whether this can translate into a clinically relevant benefit requires further studies.
2010 Perinatal GBS Prevention Guideline and Resource Utilization
Mukhopadhyay, Sagori; Dukhovny, Dmitry; Mao, Wenyang; Eichenwald, Eric C.
2014-01-01
OBJECTIVES: To quantify differences in early-onset sepsis (EOS) evaluations, evaluation-associated resource utilization, and EOS cases detected, when comparing time periods before and after the implementation of an EOS algorithm based on the Centers for Disease Control and Prevention (CDC) 2010 guidelines for prevention of perinatal Group B Streptococcus (GBS) disease. METHODS: Retrospective cohort study of infants born at ≥36 weeks’ gestation from 2009 to 2012 in a single tertiary care center. One 12-month period during which EOS evaluations were based on the CDC 2002 guideline was compared with a second 12-month period during which EOS evaluations were based on the CDC 2010 guideline. A cost minimization analysis was performed to determine the EOS evaluation-associated costs and resources during each time period. RESULTS: During the study periods, among well-appearing infants ≥36 weeks’ gestation, EOS evaluations for inadequate GBS prophylaxis decreased from 32/1000 to <1/1000 live births; EOS evaluation-associated costs decreased by $6994 per 1000 live births; and EOS evaluation-associated work hours decreased by 29 per 1000 live births. We found no increase in EOS evaluations for other indications, total NICU admissions, frequency of infants evaluated for symptoms before hospital discharge, or incidence of EOS during the 2 study periods. CONCLUSIONS: Implementation of an EOS algorithm based on CDC 2010 GBS guidelines resulted in a 25% decrease in EOS evaluations performed among well-appearing infants ≥36 weeks’ gestation, attributable to decreased evaluation of infants born in the setting of inadequate indicated GBS prophylaxis. This resulted in significant changes in EOS evaluation-associated resource expenditures. PMID:24446442
Belgian guidelines for economic evaluations: second edition.
Thiry, Nancy; Neyt, Mattias; Van De Sande, Stefaan; Cleemput, Irina
2014-12-01
The aim of this study was to present the updated methodological guidelines for economic evaluations of healthcare interventions (drugs, medical devices, and other interventions) in Belgium. The update of the guidelines was performed by three Belgian health economists following feedback from users of the former guidelines and personal experience. The updated guidelines were discussed with a multidisciplinary team consisting of other health economists, assessors of reimbursement request files, representatives of Belgian databases and representatives of the drugs and medical devices industry. The final document was validated by three external validators that were not involved in the previous discussions. The guidelines give methodological guidance for the following components of an economic evaluation: literature review, perspective of the evaluation, definition of the target population, choice of the comparator, analytic technique and study design, calculation of costs, valuation of outcomes, definition of the time horizon, modeling, handling uncertainty and discounting. We present a reference case that can be considered as the minimal requirement for Belgian economic evaluations of health interventions. These guidelines will improve the methodological quality, transparency and uniformity of the economic evaluations performed in Belgium. The guidelines will also provide support to the researchers and assessors performing or evaluating economic evaluations.
Caradeux, J; Eixarch, E; Mazarico, E; Basuki, T R; Gratacós, E; Figueras, F
2018-02-01
Detection of fetal growth restriction (FGR) remains poor and most screening strategies rely on cross-sectional evaluation of fetal size during the third trimester. A longitudinal and individualized approach has been proposed as an alternative method of evaluation. The aim of this study was to compare second- to third-trimester longitudinal growth assessment to cross-sectional evaluation in the third trimester for the prediction of small-for-gestational age (SGA) and late FGR in low-risk singleton pregnancy. This was a prospective cohort study of 2696 unselected consecutive low-risk singleton pregnancies scanned at 21 ± 2 and 32 ± 2 weeks. For cross-sectional growth assessment, abdominal circumference (AC) measurements were transformed to z-values according the 21st-INTERGROWTH standards. Longitudinal growth assessment was performed by calculating the AC z-velocity and the second- to third-trimester AC conditional growth centile. Longitudinal assessment was compared with cross-sectional assessment at 32 weeks. Association of cross-sectional and longitudinal evaluations with SGA and late FGR was assessed by logistic regression analysis. Predictive performance was determined by receiver-operating characteristics curve analysis. In total, 210 (7.8%) newborns were classified as SGA and 103 (3.8%) as late FGR. Neither longitudinal measurement improved the association with SGA or late FGR provided by cross-sectional evaluation of AC z-score at 32 weeks. Areas under the curves of AC z-velocity and conditional AC growth were significantly smaller than those of cross-sectional AC z-scores (P < 0.001), although AC z-velocity performed significantly better than did conditional AC growth (P < 0.001). Longitudinal assessment of fetal growth from the second to third trimester has a low predictive capacity for SGA and late FGR in low-risk singleton pregnancy compared with cross-sectional growth evaluation. Copyright © 2017 ISUOG. Published by John Wiley & Sons Ltd. Copyright © 2017 ISUOG. Published by John Wiley & Sons Ltd.
Application of Support Vector Machine to Forex Monitoring
NASA Astrophysics Data System (ADS)
Kamruzzaman, Joarder; Sarker, Ruhul A.
Previous studies have demonstrated superior performance of artificial neural network (ANN) based forex forecasting models over traditional regression models. This paper applies support vector machines to build a forecasting model from the historical data using six simple technical indicators and presents a comparison with an ANN based model trained by scaled conjugate gradient (SCG) learning algorithm. The models are evaluated and compared on the basis of five commonly used performance metrics that measure closeness of prediction as well as correctness in directional change. Forecasting results of six different currencies against Australian dollar reveal superior performance of SVM model using simple linear kernel over ANN-SCG model in terms of all the evaluation metrics. The effect of SVM parameter selection on prediction performance is also investigated and analyzed.
Metrology Standards for Quantitative Imaging Biomarkers
Obuchowski, Nancy A.; Kessler, Larry G.; Raunig, David L.; Gatsonis, Constantine; Huang, Erich P.; Kondratovich, Marina; McShane, Lisa M.; Reeves, Anthony P.; Barboriak, Daniel P.; Guimaraes, Alexander R.; Wahl, Richard L.
2015-01-01
Although investigators in the imaging community have been active in developing and evaluating quantitative imaging biomarkers (QIBs), the development and implementation of QIBs have been hampered by the inconsistent or incorrect use of terminology or methods for technical performance and statistical concepts. Technical performance is an assessment of how a test performs in reference objects or subjects under controlled conditions. In this article, some of the relevant statistical concepts are reviewed, methods that can be used for evaluating and comparing QIBs are described, and some of the technical performance issues related to imaging biomarkers are discussed. More consistent and correct use of terminology and study design principles will improve clinical research, advance regulatory science, and foster better care for patients who undergo imaging studies. © RSNA, 2015 PMID:26267831
High-definition television evaluation for remote handling task performance
NASA Astrophysics Data System (ADS)
Fujita, Y.; Omori, E.; Hayashi, S.; Draper, J. V.; Herndon, J. N.
Described are experiments designed to evaluate the impact of HDTV (High-Definition Television) on the performance of typical remote tasks. The experiments described in this paper compared the performance of four operators using HDTV with their performance while using other television systems. The experiments included four television systems: (1) high-definition color television, (2) high-definition monochromatic television, (3) standard-resolution monochromatic television, and (4) standard-resolution stereoscopic monochromatic television. The stereo system accomplished stereoscopy by displaying two cross-polarized images, one reflected by a half-silvered mirror and one seen through the mirror. Observers wore spectacles with cross-polarized lenses so that the left eye received only the view from the left camera and the right eye received only the view from the right camera.
Wentzensen, Nicolas; Fetterman, Barbara; Tokugawa, Diane; Schiffman, Mark; Castle, Philip E; Wood, Shannon N; Stiemerling, Eric; Poitras, Nancy; Lorey, Thomas; Kinney, Walter
2014-12-01
Dual-stain cytology for p16 and Ki-67 has been proposed as a biomarker in cervical cancer screening. The authors evaluated the reproducibility and accuracy of dual-stain cytology among 10 newly trained evaluators. In total, 480 p16/Ki-67-stained slides from human papillomavirus-positive women were evaluated in masked fashion by 10 evaluators. None of the evaluators had previous experience with p16 or p16/Ki-67 cytology. All participants underwent p16/Ki-67 training and subsequent proficiency testing. Reproducibility of dual-stain cytology was measured using the percentage agreement, individual and aggregate κ values, as well as McNemar statistics. Clinical performance for the detection of cervical intraepithelial neoplasia grade 2 or greater (CIN2+) was evaluated for each individual evaluator and for all evaluators combined compared with the reference evaluation by a cytotechnologist who had extensive experience with dual-stain cytology. The percentage agreement of individual evaluators with the reference evaluation ranged from 83% to 91%, and the κ values ranged from 0.65 to 0.81. The combined κ value was 0.71 for all evaluators and 0.73 for cytotechnologists. The average sensitivity and specificity for the detection of CIN2+ among novice evaluators was 82% and 64%, respectively; whereas the reference evaluation had 84% sensitivity and 63% specificity, respectively. Agreement on dual-stain positivity increased with greater numbers of p16/Ki-67-positive cells on the slides. Good to excellent reproducibility of p16/Ki-67 dual-stain cytology was observed with almost identical clinical performance of novice evaluators compared with reference evaluations. The current findings suggest that p16/Ki-67 dual-stain evaluation can be implemented in routine cytology practice with limited training. © 2014 American Cancer Society.
Schulz, Katja; Peyre, Marisa; Staubach, Christoph; Schauer, Birgit; Schulz, Jana; Calba, Clémentine; Häsler, Barbara; Conraths, Franz J.
2017-01-01
Surveillance of Classical Swine Fever (CSF) should not only focus on livestock, but must also include wild boar. To prevent disease transmission into commercial pig herds, it is therefore vital to have knowledge about the disease status in wild boar. In the present study, we performed a comprehensive evaluation of alternative surveillance strategies for Classical Swine Fever (CSF) in wild boar and compared them with the currently implemented conventional approach. The evaluation protocol was designed using the EVA tool, a decision support tool to help in the development of an economic and epidemiological evaluation protocol for surveillance. To evaluate the effectiveness of the surveillance strategies, we investigated their sensitivity and timeliness. Acceptability was analysed and finally, the cost-effectiveness of the surveillance strategies was determined. We developed 69 surveillance strategies for comparative evaluation between the existing approach and the novel proposed strategies. Sampling only within sub-adults resulted in a better acceptability and timeliness than the currently implemented strategy. Strategies that were completely based on passive surveillance performance did not achieve the desired detection probability of 95%. In conclusion, the results of the study suggest that risk-based approaches can be an option to design more effective CSF surveillance strategies in wild boar. PMID:28266576
Schulz, Katja; Peyre, Marisa; Staubach, Christoph; Schauer, Birgit; Schulz, Jana; Calba, Clémentine; Häsler, Barbara; Conraths, Franz J
2017-03-07
Surveillance of Classical Swine Fever (CSF) should not only focus on livestock, but must also include wild boar. To prevent disease transmission into commercial pig herds, it is therefore vital to have knowledge about the disease status in wild boar. In the present study, we performed a comprehensive evaluation of alternative surveillance strategies for Classical Swine Fever (CSF) in wild boar and compared them with the currently implemented conventional approach. The evaluation protocol was designed using the EVA tool, a decision support tool to help in the development of an economic and epidemiological evaluation protocol for surveillance. To evaluate the effectiveness of the surveillance strategies, we investigated their sensitivity and timeliness. Acceptability was analysed and finally, the cost-effectiveness of the surveillance strategies was determined. We developed 69 surveillance strategies for comparative evaluation between the existing approach and the novel proposed strategies. Sampling only within sub-adults resulted in a better acceptability and timeliness than the currently implemented strategy. Strategies that were completely based on passive surveillance performance did not achieve the desired detection probability of 95%. In conclusion, the results of the study suggest that risk-based approaches can be an option to design more effective CSF surveillance strategies in wild boar.
Financial performance of loblolly and longleaf pine plantations
Steven D. Mills; Charles T. Stiff
2013-01-01
The financial performance of selected management regimes for loblolly (Pinus taeda L.) and longleaf pine (P. palustris Mill.) plantations were compared for four cases, each with low- and high-site productivity levels and each evaluated using 5 and 7 percent real discount rates. In all cases, longleaf pine was considered both with...
A Comparison of Two Scoring Methods for an Automated Speech Scoring System
ERIC Educational Resources Information Center
Xi, Xiaoming; Higgins, Derrick; Zechner, Klaus; Williamson, David
2012-01-01
This paper compares two alternative scoring methods--multiple regression and classification trees--for an automated speech scoring system used in a practice environment. The two methods were evaluated on two criteria: construct representation and empirical performance in predicting human scores. The empirical performance of the two scoring models…
Safeguards Technology Development Program 1st Quarter FY 2018 Report
DOE Office of Scientific and Technical Information (OSTI.GOV)
Prasad, Manoj K.
LLNL will evaluate the performance of a stilbene-based scintillation detector array for IAEA neutron multiplicity counting (NMC) applications. This effort will combine newly developed modeling methodologies and recently acquired high-efficiency stilbene detector units to quantitatively compare the prototype system performance with the conventional He-3 counters and liquid scintillator alternatives.
Preliminary Evaluation of the "Playing the Game" Sexual Assault Prevention Theatre Program
ERIC Educational Resources Information Center
Thatcher, W. Gregory
2011-01-01
Purpose: To compare the effectiveness of a one-time sexual assault prevention theatre performance against a similar content video performance and a non-intervention control group. Methods: Using the College Date Rape Attitudes and Behaviors Survey, four-hundred ninety-seven students provided matched pairs data for analysis. Results: At a…
Do University Students Know How They Perform?
ERIC Educational Resources Information Center
Battistelli, Piergiorgio; Cadamuro, Alessia; Farneti, Alessandra; Versari, Annalisa
2009-01-01
The aim of the research is to study the capacity for self-evaluation of University students undergoing tests involving mathematics, linguistic and formal reasoning. Subjects were asked to estimate the number of correct answers and subsequently to compare their performance with that of their peers. We divided the subjects into three groups on the…
DOT National Transportation Integrated Search
2012-10-01
The main objective of this study was to determine the effect on bond performance : of high-volume fly ash (HVFA) concrete. The HVFA concrete test program consisted of : comparing the bond performance of two concrete mix designs with 70% cement : repl...
A Methodology for Making Early Comparative Architecture Performance Evaluations
ERIC Educational Resources Information Center
Doyle, Gerald S.
2010-01-01
Complex and expensive systems' development suffers from a lack of method for making good system-architecture-selection decisions early in the development process. Failure to make a good system-architecture-selection decision increases the risk that a development effort will not meet cost, performance and schedule goals. This research provides a…
Linking TIMSS and NAEP Assessments to Evaluate International Trends in Achievement
ERIC Educational Resources Information Center
Lim, Hwanggyu; Sireci, Stephen G.
2017-01-01
The Trends in International Mathematics and Science Study (TIMSS) makes it possible to compare the performance of students in the US in Mathematics and Science to the performance of students in other countries. TIMSS uses four international benchmarks for describing student achievement: Low, Intermediate, High, and Advanced. In this study, we…
Green, M; Dashefsky, B; Wald, E R; Laifer, S; Harger, J; Guthrie, R
1993-01-01
As part of a clinical investigation evaluating the efficacy of intrapartum antigen detection for screening for heavy vaginal colonization with group B streptococci (GBS), we compared the performance of modified Bactigen and Directigen GBS latex particle agglutination (LPA) kits. Paired vaginal swabs obtained from women in labor were rapidly transported to the laboratory and used for culturing (both swabs) and LPA testing (one swab by each method). GBS growth was estimated semiquantitatively and further designated as light or heavy growth. Performance specifications for each method were determined by comparing LPA and culture results from the same swab. A total of 4,251 paired swabs were evaluated during the study period. The performance specifications for detecting GBS growth of any degree for Bactigen and Directigen, respectively, were as follows: sensitivity, 20 and 24%; specificity, 99 and 99%. The performance specifications for heavy colonization for Bactigen and Directigen, respectively, were as follows: sensitivity, 57 and 62%; specificity, 99 and 99%. Neither LPA kit was a sensitive indicator of vaginal colonization with GBS or neonatal infection.
Yonker, V A; Young, K P; Beecham, S K; Horwitz, S; Cousin, K
1990-01-01
This study was designed to make a comparative evaluation of the performance of MEDLINE in covering serial literature. Forensic medicine was chosen because it is an interdisciplinary subject area that would test MEDLARS at the periphery of the system. The evaluation of database coverage was based upon articles included in the bibliographies of scholars in the field of forensic medicine. This method was considered appropriate for characterizing work used by researchers in this field. The results of comparing MEDLINE to other databases evoked some concerns about the selective indexing policy of MEDLINE in serving the interests of those working in forensic medicine. PMID:2403829
10 CFR 960.5 - Preclosure guidelines.
Code of Federal Regulations, 2010 CFR
2010-01-01
... REPOSITORY Preclosure Guidelines § 960.5 Preclosure guidelines. The guidelines in this subpart specify the factors to be considered in evaluating and comparing sites on the basis of expected repository performance...
Comparative ergonomic assessment of manual wheelchairs by paraplegic users.
Gil-Agudo, Angel; Solís-Mozos, Marta; del-Ama, Antonio J; Crespo-Ruiz, Beatriz; de la Peña-González, Ana Isabel; Pérez-Nombela, Soraya
2013-07-01
The aim of the present study was to describe and test the reliability of a comprehensive product-centered approach to assessing functional performance and wheelchair user perceptions on device ergonomics and satisfaction of performance. A pilot study was implemented using this approach to evaluate differences among four manual wheelchairs. Six wheelchair users with complete spinal cord injury (SCI) at the thoracic level and with no previous upper limbs impairment were recruited for this study. After finishing circuit tasks, subjects were asked to complete a questionnaire about ergonomic wheelchair characteristics (manoeuvrability, stability, comfort and ease of propulsion) and satisfaction about task performance. On the other hand, objective data were recorded during user performance as the time required to complete each test, kinetic wheelchair propulsion data obtained with two SMARTWheels® and physiological parameters (heart rate and physiological index). Kuschall Champion® and Otto Bock Voyage® wheelchairs were ranked best for most ergonomic aspects specially in manoeuvrability (p < 0.05). Less time was required to execute most of the circuit tasks in both wheelchair models (p < 0.05). This approach proposed highlight the importance of looking both kinds of information, user perception and user functional performance when evaluating a wheelchair or comparing across devices.
A protocol for evaluating video trackers under real-world conditions.
Nawaz, Tahir; Cavallaro, Andrea
2013-04-01
The absence of a commonly adopted performance evaluation framework is hampering advances in the design of effective video trackers. In this paper, we present a single-score evaluation measure and a protocol to objectively compare trackers. The proposed measure evaluates tracking accuracy and failure, and combines them for both summative and formative performance assessment. The proposed protocol is composed of a set of trials that evaluate the robustness of trackers on a range of test scenarios representing several real-world conditions. The protocol is validated on a set of sequences with a diversity of targets (head, vehicle and person) and challenges (occlusions, background clutter, pose changes and scale changes) using six state-of-the-art trackers, highlighting their strengths and weaknesses on more than 187000 frames. The software implementing the protocol and the evaluation results are made available online and new results can be included, thus facilitating the comparison of trackers.
Towards a Framework for Evaluating and Comparing Diagnosis Algorithms
NASA Technical Reports Server (NTRS)
Kurtoglu, Tolga; Narasimhan, Sriram; Poll, Scott; Garcia,David; Kuhn, Lukas; deKleer, Johan; vanGemund, Arjan; Feldman, Alexander
2009-01-01
Diagnostic inference involves the detection of anomalous system behavior and the identification of its cause, possibly down to a failed unit or to a parameter of a failed unit. Traditional approaches to solving this problem include expert/rule-based, model-based, and data-driven methods. Each approach (and various techniques within each approach) use different representations of the knowledge required to perform the diagnosis. The sensor data is expected to be combined with these internal representations to produce the diagnosis result. In spite of the availability of various diagnosis technologies, there have been only minimal efforts to develop a standardized software framework to run, evaluate, and compare different diagnosis technologies on the same system. This paper presents a framework that defines a standardized representation of the system knowledge, the sensor data, and the form of the diagnosis results and provides a run-time architecture that can execute diagnosis algorithms, send sensor data to the algorithms at appropriate time steps from a variety of sources (including the actual physical system), and collect resulting diagnoses. We also define a set of metrics that can be used to evaluate and compare the performance of the algorithms, and provide software to calculate the metrics.
Asymmetric balance control between legs for quiet but not for perturbed stance.
Vieira, Osvaldo; Coelho, Daniel Boari; Teixeira, Luis Augusto
2014-10-01
Interlateral performance asymmetry in upright balance control was evaluated in this investigation by comparing unipedal stance on the right versus the left leg. Participants were healthy young adults, hand-foot congruent preference for the right body side. Balance performance was evaluated in unperturbed quiet stance and in the recovery of balance stability following a mechanical perturbation induced by unexpected load release. Evaluation was made under availability of full sensory information, and under deprivation of vision combined with distortion of sensory inputs from the feet soles. Results from perturbed posture revealed that muscular response latency and postural sway were symmetric between the legs. Unipedal stance was more stable when the body was supported on the right as compared with the left leg. No interaction was found between leg and sensory condition. Our findings are interpreted as resulting from specialization of the sensorimotor system controlling the right leg for continuous low-magnitude postural adjustments, while corrections to large-scale stance sway are symmetrically controlled between body sides.
Evaluation of Basalt Fibre Composites for Marine Applications
NASA Astrophysics Data System (ADS)
Davies, P.; Verbouwe, W.
2018-04-01
Basalt fibres offer potential for use in marine structures, but few data exist to evaluate the influence of seawater immersion on their mechanical behaviour. This paper provides the results from a study in which basalt fibre reinforced epoxy composites were aged in natural seawater at different temperatures. Tests were performed under quasi-static and cyclic loading, first in the as-received state then after saturation in natural seawater. Results are compared to those for an E-glass reinforced composite with the same epoxy matrix. They indicate similar mechanical performance for both materials after seawater saturation.
NASA Technical Reports Server (NTRS)
Wolf, M.
1981-01-01
The effect of solar cell metallization pattern design on solar cell performance and the costs and performance effects of different metallization processes are discussed. Definitive design rules for the front metallization pattern for large area solar cells are presented. Chemical and physical deposition processes for metallization are described and compared. An economic evaluation of the 6 principal metallization options is presented. Instructions for preparing Format A cost data for solar cell manufacturing processes from UPPC forms for input into the SAMIC computer program are presented.
Vehicle test report: Battronic pickup truck
NASA Technical Reports Server (NTRS)
Price, T. W.; Shain, T. W.; Freeman, R. J.; Pompa, M. F.
1982-01-01
An electric pickup truck was tested to characterize certain parameters and to provide baseline data that can be used for the comparison of improved batteries that may be incorporated into the vehicle at a later time. The vehicle tests were concentrated on the electrical drive subsystem; i.e., the batteries, controller, and motor. The tests included coastdowns to characterize the road load and range evaluations for both cyclic and constant speed conditions. A qualitative evaluation of the vehicle's performance was made by comparing its constant speed range performance with other vehicles.
Zhang, Hua; Chen, Qing-song; Li, Nan; Hua, Yan; Zeng, Lin; Xu, Guo-yang; Tao, Li-yuan; Zhao, Yi-ming
2013-05-01
To compare the results of noise hazard evaluations based on area sampling and personal sampling in a new thermal power plant and to analyze the similarities and differences between the two measurement methods. According to Measurement of Physical agents in Workplace Part 8: Noise(GBZff 189.8-2007), area sampling was performed at various operating points for noise measurement, and meanwhile the workers under different types of work wore noise dosimeters for personal noise exposure measurement. The two measurement methods were used to evaluate the level of noise hazards in the enterprise according to the corresponding occupational health standards, and the evaluation results were compared. Area sampling was performed at 99 operating points, the mean noise level was 88.9 ± 11.1 dB (A)(range, 51.3-107.0 dB (A)), with an over-standard rate of 75.8%. Personal sampling was performed (73 person times),and the mean noise level was 79.3 ± 6.3 dB (A), with an over-standard rate of 6.6% ( 16/241 ). There was a statistically significant difference in the over-standard rate between the evaluation results of the two measurement methods ( x2=53.869, ?<0.001 ). Because of the characteristics of the work in new thermal power plants, the noise hazard evaluation based on area sampling cannot be used instead of personal noise exposure measurement among workers. Personal sampling should be used in the noise measurement in new thermal power plant.
Comparing Dual-Language Versions of an International Computerized-Adaptive Certification Exam.
ERIC Educational Resources Information Center
Sireci, Stephen G.; Foster, David F.; Robin, Frederic; Olsen, James
Evaluating the comparability of a test administered in different languages is a difficult, if not impossible, task. Comparisons are problematic because observed differences in test performance between groups who take different language versions of a test could be due to a difference in difficulty between the tests, to cultural differences in test…
The Assessment of Cognitive Development in Blind Infants and Preschoolers.
ERIC Educational Resources Information Center
Brambring, M.; Troster, H.
1994-01-01
This study evaluated the Bielefeld Developmental Test for Blind Infants and Preschoolers by comparing cognitive performance of blind and sighted children (ages three and four). Results indicated that even this test (with "blind-neutral" items) did not permit a fair comparative assessment, though it did prove suitable for within-group…
Functional Evaluation of Hidden Figures Object Analysis in Children with Autistic Disorder
ERIC Educational Resources Information Center
Malisza, Krisztina L.; Clancy, Christine; Shiloff, Deborah; Foreman, Derek; Holden, Jeanette; Jones, Cheryl; Paulson, K.; Summers, Randy; Yu, C. T.; Chudley, Albert E.
2011-01-01
Functional magnetic resonance imaging (fMRI) during performance of a hidden figures task (HFT) was used to compare differences in brain function in children diagnosed with autism disorder (AD) compared to children with attention-deficit/hyperactivity disorder (ADHD) and typical controls (TC). Overall greater functional MRI activity was observed in…
Evaluation of a Novel Single-Tube Method for Extended Genotyping of Human Papillomavirus
Serrano, I.; Wennington, H.; Graham, C.; Cubie, H.; Boland, E.; Fu, G.; Cuschieri, K.
2017-01-01
ABSTRACT The use of high-risk human papillomavirus (HPV) testing for surveillance and clinical applications is increasing globally, and it is important that tests are evaluated to ensure they are fit for this purpose. In this study, the performance of a new HPV genotyping test, the Papilloplex high-risk HPV (HR-HPV) test, was compared to two well-established genotyping tests. Preliminary clinical performance was also ascertained for the detection of CIN2+ in a disease-enriched retrospective cohort. A panel of 500 cervical liquid-based cytology samples with known clinical outcomes were tested by the Papilloplex HR-HPV test. Analytical concordance was compared to two assays: a Linear Array (LA) HPV genotyping test and an Optiplex HPV genotyping test. The initial clinical performance for the detection for CIN2+ samples was performed and compared to that of two clinically validated HPV tests: a RealTime High-Risk HPV test (RealTime) and a Hybrid Capture 2 HPV test (HC2). High agreement for HR-HPV was observed between the Papilloplex and LA and Optiplex HPV tests (97 and 95%, respectively), with kappa values for HPV16 and HPV18 being 0.90 and 0.81 compared to the LA and 0.70 and 0.82 compared to the Optiplex test. The sensitivity, specificity, positive predictive value, and negative predictive value of the Papilloplex test for the detection of CIN2+ were 92, 54, 33, and 96%, respectively, and very similar to the values observed with RealTime and HC2. The Papilloplex HR-HPV test demonstrated a analytical performance similar to those of the two HPV genotyping tests at the HR-HPV level and the type-specific level. The preliminary data on clinical performance look encouraging, although further longitudinal studies within screening populations are required to confirm these findings. PMID:29237790
On-road Driving Performance of Patients with Bilateral Moderate and Advanced Glaucoma
Bhorade, Anjali M.; Yom, Victoria H.; Barco, Peggy; Wilson, Bradley; Gordon, Mae; Carr, David
2017-01-01
Purpose To compare on-road driving performance of patients with moderate or advanced glaucoma to controls and evaluate factors associated with unsafe driving. Design Case-control pilot study. Methods A consecutive sample of 21 patients with bilateral moderate or advanced glaucoma from Washington University, St. Louis, MO and 38 community-dwelling controls were enrolled. Participants, ages 55–90 years, underwent a comprehensive clinical evaluation by a trained occupational therapist and an on-road driving evaluation by a masked driver rehabilitation specialist. Overall driving performance of pass vs. marginal/fail and number of wheel and/or brake interventions were recorded. Results Fifty-two percent of glaucoma participants scored a marginal/fail compared to 21% of controls (odds ratio [OR], 4.1; 95% CI, 1.30–13.14;p=.02). Glaucoma participants had a higher risk of wheel interventions than controls (OR, 4.67; 95% CI, 1.03–21.17;p=.046). There were no differences detected between glaucoma participants who scored a pass vs. marginal/fail for visual field mean deviation of the better (p=.62) or worse (p=.88) eye, binocular distance (p=.15) or near (p=.23) visual acuity, contrast sensitivity (p=.28) or glare (p=.88). However, glaucoma participants with a marginal/fail score performed worse on Trail Making Tests A (p=.03) and B (p=.05), right-sided Jamar grip strength (p=.02), Rapid Pace Walk (p=.03), Braking Response Time (p=.03), and identifying traffic signs (p=.05). Conclusions and Relevance Patients with bilateral moderate or advanced glaucoma are at risk for unsafe driving – particularly those with impairments on psychometric and mobility tests. A comprehensive clinical assessment and on-road driving evaluation is recommended to effectively evaluate driving safety of these patients. PMID:26949136
Leech, Kristan A.; Kinnaird, Catherine R.; Holleran, Carey L.; Kahn, Jennifer
2016-01-01
Background High-intensity stepping practice may be a critical component to improve gait following motor incomplete spinal cord injury (iSCI). However, such practice is discouraged by traditional theories of rehabilitation that suggest high-intensity locomotor exercise degrades gait performance. Accordingly, such training is thought to reinforce abnormal movement patterns, although evidence to support this notion is limited. Objective The purposes of this study were: (1) to evaluate the effects of short-term manipulations in locomotor intensity on gait performance in people with iSCI and (2) to evaluate potential detrimental effects of high-intensity locomotor training on walking performance. Design A single-day, repeated-measures, pretraining-posttraining study design was used. Methods Nineteen individuals with chronic iSCI performed a graded-intensity locomotor exercise task with simultaneous collection of lower extremity kinematic and electromyographic data. Measures of interest were compared across intensity levels of 33%, 67%, and 100% of peak gait speed. A subset of 9 individuals participated in 12 weeks of high-intensity locomotor training. Similar measurements were collected and compared between pretraining and posttraining evaluations. Results The results indicate that short-term increases in intensity led to significant improvements in muscle activity, spatiotemporal metrics, and joint excursions, with selected improvements in measures of locomotor coordination. High-intensity locomotor training led to significant increases in peak gait speed (0.64–0.80 m/s), and spatiotemporal and kinematic metrics indicate a trend for improved coordination. Limitations Measures of gait performance were assessed during treadmill ambulation and not compared with a control group. Generalizability of these results to overground ambulation is unknown. Conclusions High-intensity locomotor exercise and training does not degrade, but rather improves, locomotor function and quality in individuals with iSCI, which contrasts with traditional theories of motor dysfunction following neurologic injury. PMID:27313241
FitzGerald, Paul F.; Butts, Matthew D.; Roberts, Jeannette C.; Colborn, Robert E.; Torres, Andrew S.; Lee, Brian D.; Yeh, Benjamin M.; Bonitatibus, Peter J.
2016-01-01
Objectives To produce and evaluate a proposed computed tomography (CT) contrast agent based on carboxybetaine zwitterionic (CZ) coated soluble tantalum oxide nanoparticles (CZ-TaO NPs). We chose tantalum to provide superior imaging performance compared to current iodine-based clinical CT contrast agents. We developed the CZ coating to provide biological and physical performance similar to that of current iodinated contrast agents. The aim of this study was to evaluate the imaging, biological, and physicochemical performance of this proposed contrast agent compared to clinically-used iodinated agents. Materials and Methods We evaluated CT imaging performance of our CZ-TaO NPs compared to an iodinated agent in live rats, imaged centrally-located within a tissue-equivalent plastic phantom that simulated a large patient. To evaluate vascular contrast enhancement, we scanned the rats’ great vessels at high temporal resolution during and following contrast agent injection. We performed several in vivo CZ-TaO NP studies in healthy rats to evaluate tolerability. These studies included injecting the agent at the anticipated clinical dose (ACD) and at 3 times and 6 times the ACD, followed by longitudinal hematology to assess impact to blood cells and organ function (from 4 hours to 1 week). Kidney histological analysis was performed 48 hours after injection at 3 times the ACD. We measured the elimination half-life of CZ-TaO NPs from blood, and we monitored acute kidney injury biomarkers with a kidney injury assay using urine collected from 4 hours to 1 week. We measured tantalum retention in individual organs and in the whole carcass 48 hours after injection at ACD. CZ-TaO NPs were synthesized and analyzed in detail. We used multi-dimensional nuclear magnetic resonance (NMR) to determine surface functionality of the nanoparticles. We measured nanoparticle size and solution properties (osmolality and viscosity) of the agent over a range of tantalum concentrations, including the high concentrations required for standard clinical CT imaging. Results CT imaging studies demonstrated image contrast improvement of approximately 40–50% using CZ-TaO NPs compared with an iodinated agent injected at the same mass concentration. Blood and organ analyses showed no adverse effects following injection in healthy naïve rats at 3 times the ACD. Retention of tantalum at 48 hours after injection was less than 2% of the injected dose in the whole carcass, which very closely matched the reported retention of existing commercial iodine-based contrast agents. Urine analysis of sensitive markers for acute kidney injury showed no responses at 1 week following injection at 3 times the ACD; however, a moderate response in the neutrophil gelatinase-associated lipocalin (NGAL) biomarker was measured at 24 and 48 hours. Compared to other tantalum oxide nanoparticles reported in the literature, CZ-TaO NPs had relatively low osmolality and viscosity at concentrations >200 mg Ta/mL, and were similar in these physical properties to dimeric iodine-based contrast agents. Conclusions We found that a CZ-TaO NP-based contrast agent is potentially viable for general-purpose clinical CT imaging. Our results suggest that such an agent can be formulated with clinically-viable physicochemical properties, can be biologically safe and cleared rapidly in urine, and can provide substantially improved image contrast at CT compared to current iodinated agents. PMID:27115702
FitzGerald, Paul F; Butts, Matthew D; Roberts, Jeannette C; Colborn, Robert E; Torres, Andrew S; Lee, Brian D; Yeh, Benjamin M; Bonitatibus, Peter J
2016-12-01
The aim of this study was to produce and evaluate a proposed computed tomography (CT) contrast agent based on carboxybetaine zwitterionic (CZ)-coated soluble tantalum oxide (TaO) nanoparticles (NPs). We chose tantalum to provide superior imaging performance compared with current iodine-based clinical CT contrast agents. We developed the CZ coating to provide biological and physical performance similar to that of current iodinated contrast agents. In addition, the aim of this study was to evaluate the imaging, biological, and physicochemical performance of this proposed contrast agent compared with clinically used iodinated agents. We evaluated CT imaging performance of our CZ-TaO NPs compared with that of an iodinated agent in live rats, imaged centrally located within a tissue-equivalent plastic phantom that simulated a large patient. To evaluate vascular contrast enhancement, we scanned the rats' great vessels at high temporal resolution during and after contrast agent injection. We performed several in vivo CZ-TaO NP studies in healthy rats to evaluate tolerability. These studies included injecting the agent at the anticipated clinical dose (ACD) and at 3 times and 6 times the ACD, followed by longitudinal hematology to assess impact to blood cells and organ function (from 4 hours to 1 week). Kidney histological analysis was performed 48 hours after injection at 3 times the ACD. We measured the elimination half-life of CZ-TaO NPs from blood, and we monitored acute kidney injury biomarkers with a kidney injury assay using urine collected from 4 hours to 1 week. We measured tantalum retention in individual organs and in the whole carcass 48 hours after injection at ACD. Carboxybetaine zwitterionic TaO NPs were synthesized and analyzed in detail. We used multidimensional nuclear magnetic resonance to determine surface functionality of the NPs. We measured NP size and solution properties (osmolality and viscosity) of the agent over a range of tantalum concentrations, including the high concentrations required for standard clinical CT imaging. Computed tomography imaging studies demonstrated image contrast improvement of approximately 40% to 50% using CZ-TaO NPs compared with an iodinated agent injected at the same mass concentration. Blood and organ analyses showed no adverse effects after injection in healthy naive rats at 3 times the ACD. Retention of tantalum at 48 hours after injection was less than 2% of the injected dose in the whole carcass, which very closely matched the reported retention of existing commercial iodine-based contrast agents. Urine analysis of sensitive markers for acute kidney injury showed no responses at 1 week after injection at 3 times the ACD; however, a moderate response in the neutrophil gelatinase-associated lipocalin biomarker was measured at 24 and 48 hours. Compared with other TaO NPs reported in the literature, CZ-TaO NPs had relatively low osmolality and viscosity at concentrations greater than 200 mg Ta/mL and were similar in these physical properties to dimeric iodine-based contrast agents. We found that a CZ-TaO NP-based contrast agent is potentially viable for general-purpose clinical CT imaging. Our results suggest that such an agent can be formulated with clinically viable physicochemical properties, can be biologically safe and cleared rapidly in urine, and can provide substantially improved image contrast at CT compared with current iodinated agents.
Image Retrieval using Integrated Features of Binary Wavelet Transform
NASA Astrophysics Data System (ADS)
Agarwal, Megha; Maheshwari, R. P.
2011-12-01
In this paper a new approach for image retrieval is proposed with the application of binary wavelet transform. This new approach facilitates the feature calculation with the integration of histogram and correlogram features extracted from binary wavelet subbands. Experiments are performed to evaluate and compare the performance of proposed method with the published literature. It is verified that average precision and average recall of proposed method (69.19%, 41.78%) is significantly improved compared to optimal quantized wavelet correlogram (OQWC) [6] (64.3%, 38.00%) and Gabor wavelet correlogram (GWC) [10] (64.1%, 40.6%). All the experiments are performed on Corel 1000 natural image database [20].
Chung, Cheng-Shiu; Wang, Hongwu; Cooper, Rory A.
2013-01-01
Context The user interface development of assistive robotic manipulators can be traced back to the 1960s. Studies include kinematic designs, cost-efficiency, user experience involvements, and performance evaluation. This paper is to review studies conducted with clinical trials using activities of daily living (ADLs) tasks to evaluate performance categorized using the International Classification of Functioning, Disability, and Health (ICF) frameworks, in order to give the scope of current research and provide suggestions for future studies. Methods We conducted a literature search of assistive robotic manipulators from 1970 to 2012 in PubMed, Google Scholar, and University of Pittsburgh Library System – PITTCat. Results Twenty relevant studies were identified. Conclusion Studies were separated into two broad categories: user task preferences and user-interface performance measurements of commercialized and developing assistive robotic manipulators. The outcome measures and ICF codes associated with the performance evaluations are reported. Suggestions for the future studies include (1) standardized ADL tasks for the quantitative and qualitative evaluation of task efficiency and performance to build comparable measures between research groups, (2) studies relevant to the tasks from user priority lists and ICF codes, and (3) appropriate clinical functional assessment tests with consideration of constraints in assistive robotic manipulator user interfaces. In addition, these outcome measures will help physicians and therapists build standardized tools while prescribing and assessing assistive robotic manipulators. PMID:23820143
LOX/hydrocarbon auxiliary propulsion system study
NASA Technical Reports Server (NTRS)
Orton, G. F.; Mark, T. D.; Weber, D. D.
1982-01-01
Liquid oxygen/hydrocarbon propulsion systems applicable to a second generation orbiter OMS/RCS were compared, and major system/component options were evaluated. A large number of propellant combinations and system concepts were evaluated. The ground rules were defined in terms of candidate propellants, system/component design options, and design requirements. System and engine component math models were incorporated into existing computer codes for system evaluations. The detailed system evaluations and comparisons were performed to identify the recommended propellant combination and system approach.
Evaluation of three coding schemes designed for improved data communication
NASA Technical Reports Server (NTRS)
Snelsire, R. W.
1974-01-01
Three coding schemes designed for improved data communication are evaluated. Four block codes are evaluated relative to a quality function, which is a function of both the amount of data rejected and the error rate. The Viterbi maximum likelihood decoding algorithm as a decoding procedure is reviewed. This evaluation is obtained by simulating the system on a digital computer. Short constraint length rate 1/2 quick-look codes are studied, and their performance is compared to general nonsystematic codes.
Mansoorian, Mohammad Reza; Hosseiny, Marzeih Sadat; Khosravan, Shahla; Alami, Ali; Alaviani, Mehri
2015-06-01
Despite the benefits of the objective structured assessment of technical skills (OSATS) and it appropriateness for evaluating clinical abilities of nursing students , few studies are available on the application of this method in nursing education. The purpose of this study was to compare the effect of using OSATS and traditional methods on the students' learning. We also aimed to signify students' views about these two methods and their views about the scores they received in these methods in a medical emergency course. A quasi-experimental study was performed on 45 first semester students in nursing and medical emergencies passing a course on fundamentals of practice. The students were selected by a census method and evaluated by both the OSATS and traditional methods. Data collection was performed using checklists prepared based on the 'text book of nursing procedures checklists' published by Iranian nursing organization and a questionnaire containing learning rate and students' estimation of their received scores. Descriptive statistics as well as paired t-test and independent samples t-test were used in data analysis. The mean of students' score in OSATS was significantly higher than their mean score in traditional method (P = 0.01). Moreover, the mean of self-evaluation score after the traditional method was relatively the same as the score the students received in the exam. However, the mean of self-evaluation score after the OSATS was relatively lower than the scores the students received in the OSATS exam. Most students believed that OSATS can evaluate a wide range of students' knowledge and skills compared to traditional method. Results of this study indicated the better effect of OSATS on learning and its relative superiority in precise assessment of clinical skills compared with the traditional evaluation method. Therefore, we recommend using this method in evaluation of students in practical courses.
Evaluation of antistripping additives.
DOT National Transportation Integrated Search
1989-01-01
Several chemical antistripping additives were used in field installations and compared to a similar installation using hydrated lime. The performance of the installations was monitored periodically, and material that was sampled during construction w...
Racial Earnings Differentials and Performance Pay
ERIC Educational Resources Information Center
Heywood, John S.; O'Halloran, Patrick L.
2005-01-01
A comparative analysis between output-based payment and time rates payment is presented. It is observed that racial or gender earnings discrimination is more likely in time rates payment and supervisory evaluations.
Lawson, Nathaniel C.; Robles, Augusto; Fu, Chin-Chuan; Lin, Chee Paul; Sawlani, Kanchan; Burgess, John O.
2016-01-01
Objectives To compare the clinical performance of Scotchbond™ Universal Adhesive used in self- and total-etch modes and two-bottle Scotchbond™ Multi-purpose Adhesive in total-etch mode for Class 5 non-carious cervical lesions (NCCLs). Methods 37 adults were recruited with 3 or 6 NCCLs (>1.5 mm deep). Teeth were isolated, and a short cervical bevel was prepared. Teeth were restored randomly with Scotchbond Universal total-etch, Scotchbond Universal self-etch or Scotchbond Multi-purpose followed with a composite resin. Restorations were evaluated at baseline, 6, 12 and 24 months for marginal adaptation, marginal discoloration, secondary caries, and sensitivity to cold using modified USPHS Criteria. Patients and evaluators were blinded. Logistic and linear regression models using a generalized estimating equation were applied to evaluate the effects of time and adhesive material on clinical assessment outcomes over the 24 month follow-up period. Kaplan–Meier method was used to compare the retention between adhesive materials. Results Clinical performance of all adhesive materials deteriorated over time for marginal adaptation, and discoloration (p <0.0001). Both Scotchbond Universal self-etch and Scotchbond Multi-purpose materials were more than three times as likely to contribute to less satisfying performance in marginal discoloration over time than Scotchbond Universal total-etch. The retention rates up to 24 months were 87.6%, 94.9% and 100% for Scotchbond Multi-purpose and Scotchbond Universal self-etch and total-etch, respectively. Conclusions Scotchbond Universal in self- and total- etch modes performed similar to or better than Scotchbond Multipurpose, respectively. Clinical significance 24 month evaluation of a universal adhesive indicates acceptable clinical performance, particularly in a total-etch mode. PMID:26231300
Schønning, Kristian; Pedersen, Martin Schou; Johansen, Kim; Landt, Bodil; Nielsen, Lone Gilmor; Weis, Nina; Westh, Henrik
2017-10-01
Chronic hepatitis C virus (HCV) infection can be effectively treated with directly acting antiviral (DAA) therapy. Measurement of HCV RNA is used to evaluate patient compliance and virological response during and after treatment. To compare the analytical performance of the Aptima HCV Quant Dx Assay (Aptima) and the COBAS Ampliprep/COBAS TaqMan HCV Test v2.0 (CAPCTMv2) for the quantification of HCV RNA in plasma samples, and compare the clinical utility of the two tests in patients undergoing treatment with DAA therapy. Analytical performance was evaluated on two sets of plasma samples: 125 genotyped samples and 172 samples referred for quantification of HCV RNA. Furthermore, performance was evaluated using dilutions series of four samples containing HCV genotype 1a, 2b, 3a, and 4a, respectively. Clinical utility was evaluated on 118 plasma samples obtained from 13 patients undergoing treatment with DAAs. Deming regression of results from 187 plasma samples with HCV RNA >2 Log IU/mL indicated that the Aptima assay quantified higher than the CAPCTMv2 test for HCV RNA >4.9 Log IU/mL. The linearity of the Aptima assay was excellent across dilution series of four HCV genotypes (slope of the regression line: 1.00-1.02). The Aptima assay detected significantly more replicates below targeted 2 Log IU/mL than the CAPCTMv2 test, and yielded clearly interpretable results when used to analyze samples from patients treated with DAAs. The analytical performance of the Aptima assay makes it well suited for monitoring patients with chronic HCV infection undergoing antiviral treatment. Copyright © 2017 Elsevier B.V. All rights reserved.
Simulating the Use of Alternative Fuels in a Turbofan Engine
NASA Technical Reports Server (NTRS)
Litt, Jonathan S.; Chin, Jeffrey Chevoor; Liu, Yuan
2013-01-01
The interest in alternative fuels for aviation has created a need to evaluate their effect on engine performance. The use of dynamic turbofan engine simulations enables the comparative modeling of the performance of these fuels on a realistic test bed in terms of dynamic response and control compared to traditional fuels. The analysis of overall engine performance and response characteristics can lead to a determination of the practicality of using specific alternative fuels in commercial aircraft. This paper describes a procedure to model the use of alternative fuels in a large commercial turbofan engine, and quantifies their effects on engine and vehicle performance. In addition, the modeling effort notionally demonstrates that engine performance may be maintained by modifying engine control system software parameters to account for the alternative fuel.
Blind Source Parameters for Performance Evaluation of Despeckling Filters.
Biradar, Nagashettappa; Dewal, M L; Rohit, ManojKumar; Gowre, Sanjaykumar; Gundge, Yogesh
2016-01-01
The speckle noise is inherent to transthoracic echocardiographic images. A standard noise-free reference echocardiographic image does not exist. The evaluation of filters based on the traditional parameters such as peak signal-to-noise ratio, mean square error, and structural similarity index may not reflect the true filter performance on echocardiographic images. Therefore, the performance of despeckling can be evaluated using blind assessment metrics like the speckle suppression index, speckle suppression and mean preservation index (SMPI), and beta metric. The need for noise-free reference image is overcome using these three parameters. This paper presents a comprehensive analysis and evaluation of eleven types of despeckling filters for echocardiographic images in terms of blind and traditional performance parameters along with clinical validation. The noise is effectively suppressed using the logarithmic neighborhood shrinkage (NeighShrink) embedded with Stein's unbiased risk estimation (SURE). The SMPI is three times more effective compared to the wavelet based generalized likelihood estimation approach. The quantitative evaluation and clinical validation reveal that the filters such as the nonlocal mean, posterior sampling based Bayesian estimation, hybrid median, and probabilistic patch based filters are acceptable whereas median, anisotropic diffusion, fuzzy, and Ripplet nonlinear approximation filters have limited applications for echocardiographic images.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rembold, Randy Kai; Hart, Darren M.
Sandia National Laboratories has tested and evaluated Geotech SMART24BH borehole data acquisition system with active Fortezza crypto card data signing and authentication. The test results included in this report were in response to static and tonal-dynamic input signals. Most test methodologies used were based on IEEE Standards 1057 for Digitizing Waveform Recorders and 1241 for Analog to Digital Converters; others were designed by Sandia specifically for infrasound application evaluation and for supplementary criteria not addressed in the IEEE standards. The objective of this work was to evaluate the overall technical performance of two Geotech SMART24BH digitizers with a Fortezza PCMCIAmore » crypto card actively implementing the signing of data packets. The results of this evaluation were compared to relevant specifications provided within manufacturer's documentation notes. The tests performed were chosen to demonstrate different performance aspects of the digitizer under test. The performance aspects tested include determining noise floor, least significant bit (LSB), dynamic range, cross-talk, relative channel-to-channel timing, time-tag accuracy/statistics/drift, analog bandwidth.« less
Blind Source Parameters for Performance Evaluation of Despeckling Filters
Biradar, Nagashettappa; Dewal, M. L.; Rohit, ManojKumar; Gowre, Sanjaykumar; Gundge, Yogesh
2016-01-01
The speckle noise is inherent to transthoracic echocardiographic images. A standard noise-free reference echocardiographic image does not exist. The evaluation of filters based on the traditional parameters such as peak signal-to-noise ratio, mean square error, and structural similarity index may not reflect the true filter performance on echocardiographic images. Therefore, the performance of despeckling can be evaluated using blind assessment metrics like the speckle suppression index, speckle suppression and mean preservation index (SMPI), and beta metric. The need for noise-free reference image is overcome using these three parameters. This paper presents a comprehensive analysis and evaluation of eleven types of despeckling filters for echocardiographic images in terms of blind and traditional performance parameters along with clinical validation. The noise is effectively suppressed using the logarithmic neighborhood shrinkage (NeighShrink) embedded with Stein's unbiased risk estimation (SURE). The SMPI is three times more effective compared to the wavelet based generalized likelihood estimation approach. The quantitative evaluation and clinical validation reveal that the filters such as the nonlocal mean, posterior sampling based Bayesian estimation, hybrid median, and probabilistic patch based filters are acceptable whereas median, anisotropic diffusion, fuzzy, and Ripplet nonlinear approximation filters have limited applications for echocardiographic images. PMID:27298618
Pianists exhibit enhanced memory for vocal melodies but not piano melodies.
Weiss, Michael W; Vanzella, Patrícia; Schellenberg, E Glenn; Trehub, Sandra E
2015-01-01
Nonmusicians remember vocal melodies (i.e., sung to la la) better than instrumental melodies. If greater exposure to the voice contributes to those effects, then long-term experience with instrumental timbres should elicit instrument-specific advantages. Here we evaluate this hypothesis by comparing pianists with other musicians and nonmusicians. We also evaluate the possibility that absolute pitch (AP), which involves exceptional memory for isolated pitches, influences melodic memory. Participants heard 24 melodies played in four timbres (voice, piano, banjo, marimba) and were subsequently required to distinguish the melodies heard previously from 24 novel melodies presented in the same timbres. Musicians performed better than nonmusicians, but both groups showed a comparable memory advantage for vocal melodies. Moreover, pianists performed no better on melodies played on piano than on other instruments, and AP musicians performed no differently than non-AP musicians. The findings confirm the robust nature of the voice advantage and rule out explanations based on familiarity, practice, and motor representations.
Prostate biopsies assisted by comanipulated probe-holder: first in man.
Vitrani, Marie-Aude; Baumann, Michael; Reversat, David; Morel, Guillaume; Moreau-Gaudry, Alexandre; Mozer, Pierre
2016-06-01
A comanipulator for assisting endorectal prostate biopsies is evaluated through a first-in-man clinical trial. This lightweight system, based on conventional robotic components, possesses six degrees of freedom. It uses three electric motors and three brakes. It features a free mode, where its low friction and inertia allow for natural manipulation of the probe and a locked mode, exhibiting both a very low stiffness and a high steady-state precision. Clinical trials focusing on the free mode and the locked mode of the robot are presented. The objective was to evaluate the practical usability and performance of the robot during clinical procedures. A research protocol for a prospective randomized clinical trial has been designed. Its specific goal was to compare the accuracy of biopsies performed with and without the assistance of the comanipulator. The accuracy is compared between biopsies performed with and without the assistance of the comanipulator, across the 10 first patients included in the trial. Results show a statistically significant increase in the precision.
Warot, D; Lacomblez, L; Danjou, P; Weiller, E; Payan, C; Puech, A J
1991-01-01
The effect on psychomotor and mnesic performances of acute oral dose (600 mg) of 2 Ginkgo biloba extracts were evaluated in twelve healthy female in a dummy placebo-controlled double blind study. Tests were performed comprising: objective measures of vigilance [critical flicker frequency (CFF), choice reaction time (CRT)], memory tasks (pictures and Sternberg scanning tests) and self-rating evaluation (visual analogue scales). Tests session took place before and 1 hour post-dosing. No statistically significant changes from placebo were observed on CFF, CRT or subjective rating of drug effects. No differences between treatment were evidenced on Sternberg scanning test and pictures recognition. Comparing to baseline, free recall score, while decreasing under placebo and Ginkgo, remained the same under Tanakan. As the differences between treatment are localized on one test, it appears important to examine the reproductility in healthy subjects. In order to verify the clinical relevance of these results, they need to be replicated in older healthy volunteers with age-associated memory impairment.
Performance of ZnO based piezo-generators under controlled compression
NASA Astrophysics Data System (ADS)
Tao, Ran; Parmar, Mitesh; Ardila, Gustavo; Oliveira, Paulo; Marques, Daniel; Montès, Laurent; Mouis, Mireille
2017-06-01
This paper reports on the fabrication and characterization of ZnO based vertically integrated nanogenerator (VING) devices under controlled compression. The basic NG structure is a composite material integrating hydrothermally grown vertical piezoelectric zinc oxide (ZnO) nanowires (NWs) into a dielectric matrix (PMMA). A specific characterization set-up has been developed to control the applied compression and the perpendicularity of the applied force on the devices. The role of different fabrication parameters has been evaluated experimentally and compared with previously reported theoretical models, including the thickness of the top PMMA layer and the density of the NWs array in the matrix. Finally, the performance of the VING structure has been evaluated experimentally for different resistive loads obtaining a power density of 85 μW cm-3 considering only the active layer of the device. This has been compared to the performance of a commercial bulk layer of PZT (25 μW cm-3) under the same applied force of 5 N.
Performance of RVGui sensor and Kodak Ektaspeed Plus film for proximal caries detection.
Abreu, M; Mol, A; Ludlow, J B
2001-03-01
A high-resolution charge-coupled device was used to compare the diagnostic performances obtained with Trophy's new RVGui sensor and Kodak Ektaspeed Plus film with respect to caries detection. Three acquisition modes of the Trophy RVGui sensor were compared with Kodak Ektaspeed Plus film. Images of the proximal surfaces of 40 extracted posterior teeth were evaluated by 6 observers. The presence or absence of caries was scored by means of a 5-point confidence scale. The actual caries status of each surface was determined through ground-section histology. Responses were evaluated by means of receiver operating characteristic analysis. Areas under receiver operating characteristic curves (A(Z)) were assessed through analysis of variance. The mean A(Z) scores were 0.85 for film, 0.84 for the high-resolution caries mode, and 0.82 for both the low resolution caries mode and the high-resolution periodontal mode. These differences were not statistically significant (P =.70). The differences among observers also were not statistically significant (P =.23). The performance of the RVGui sensor in high- and low-resolution modes for proximal caries detection is comparable to that of Ektaspeed Plus film.
Comparative study of shear wave-based elastography techniques in optical coherence tomography
NASA Astrophysics Data System (ADS)
Zvietcovich, Fernando; Rolland, Jannick P.; Yao, Jianing; Meemon, Panomsak; Parker, Kevin J.
2017-03-01
We compare five optical coherence elastography techniques able to estimate the shear speed of waves generated by one and two sources of excitation. The first two techniques make use of one piezoelectric actuator in order to produce a continuous shear wave propagation or a tone-burst propagation (TBP) of 400 Hz over a gelatin tissue-mimicking phantom. The remaining techniques utilize a second actuator located on the opposite side of the region of interest in order to create three types of interference patterns: crawling waves, swept crawling waves, and standing waves, depending on the selection of the frequency difference between the two actuators. We evaluated accuracy, contrast to noise ratio, resolution, and acquisition time for each technique during experiments. Numerical simulations were also performed in order to support the experimental findings. Results suggest that in the presence of strong internal reflections, single source methods are more accurate and less variable when compared to the two-actuator methods. In particular, TBP reports the best performance with an accuracy error <4.1%. Finally, the TBP was tested in a fresh chicken tibialis anterior muscle with a localized thermally ablated lesion in order to evaluate its performance in biological tissue.