observed performance evaluation: Topics by Science.gov

Sample records for observed performance evaluation

Evaluating performance of risk identification methods through a large-scale simulation of observational data.

PubMed

Ryan, Patrick B; Schuemie, Martijn J

2013-10-01

There has been only limited evaluation of statistical methods for identifying safety risks of drug exposure in observational healthcare data. Simulations can support empirical evaluation, but have not been shown to adequately model the real-world phenomena that challenge observational analyses. To design and evaluate a probabilistic framework (OSIM2) for generating simulated observational healthcare data, and to use this data for evaluating the performance of methods in identifying associations between drug exposure and health outcomes of interest. Seven observational designs, including case-control, cohort, self-controlled case series, and self-controlled cohort design were applied to 399 drug-outcome scenarios in 6 simulated datasets with no effect and injected relative risks of 1.25, 1.5, 2, 4, and 10, respectively. Longitudinal data for 10 million simulated patients were generated using a model derived from an administrative claims database, with associated demographics, periods of drug exposure derived from pharmacy dispensings, and medical conditions derived from diagnoses on medical claims. Simulation validation was performed through descriptive comparison with real source data. Method performance was evaluated using Area Under ROC Curve (AUC), bias, and mean squared error. OSIM2 replicates prevalence and types of confounding observed in real claims data. When simulated data are injected with relative risks (RR) ≥ 2, all designs have good predictive accuracy (AUC > 0.90), but when RR < 2, no methods achieve 100 % predictions. Each method exhibits a different bias profile, which changes with the effect size. OSIM2 can support methodological research. Results from simulation suggest method operating characteristics are far from nominal properties.
Evaluation of camouflage pattern performance of textiles by human observers and CAMAELEON

NASA Astrophysics Data System (ADS)

Heinrich, Daniela H.; Selj, Gorm K.

2017-10-01

Military textiles with camouflage pattern are an important part of the protection measures for soldiers. Military operational environments differ a lot depending on climate and vegetation. This requires very different camouflage pattern to achieve good protection. To find the best performing pattern for given environments we have in earlier evaluations mainly applied observer trials as evaluation method. In these camouflage evaluation test human observers were asked to search for targets (in natural settings) presented on a high resolution PC screen, and the corresponding detection times were recorded. Another possibility is to base the evaluation on simulations. CAMAELEON is a licensed tool that ranks camouflaged targets by their similarity with local backgrounds. The similarity is estimated through the parameters local contrast, orientation of structures in the pattern and spatial frequency, by mimicking the response and signal processing in the visual cortex of the human eye. Simulations have a number of advantages over observer trials, for example, that they are more flexible, cheaper, and faster. Applying these two methods to the same images of camouflaged targets we found that CAMAELEON simulation results didn't match observer trial results for targets with disruptive patterns. This finding now calls for follow up studies in order to learn more about the advantages and pitfalls of CAMAELEON. During recent observer trials we studied new camouflage patterns and the effect of additional equipment, such as combat vests. In this paper we will present the results from a study comparing evaluation results of human based observer trials and CAMAELEON.
Clinical Observed Performance Evaluation: A Prospective Study in Final Year Students of Surgery

ERIC Educational Resources Information Center

Markey, G. C.; Browne, K.; Hunter, K.; Hill, A. D.

2011-01-01

We report a prospective study of clinical observed performance evaluation (COPE) for 197 medical students in the pre-qualification year of clinical education. Psychometric quality was the main endpoint. Students were assessed in groups of 5 in 40-min patient encounters, with each student the focus of evaluation for 8 min. Each student had a series…
Nonparametric EROC analysis for observer performance evaluation on joint detection and estimation tasks

NASA Astrophysics Data System (ADS)

Wunderlich, Adam; Goossens, Bart

2014-03-01

The majority of the literature on task-based image quality assessment has focused on lesion detection tasks, using the receiver operating characteristic (ROC) curve, or related variants, to measure performance. However, since many clinical image evaluation tasks involve both detection and estimation (e.g., estimation of kidney stone composition, estimation of tumor size), there is a growing interest in performance evaluation for joint detection and estimation tasks. To evaluate observer performance on such tasks, Clarkson introduced the estimation ROC (EROC) curve, and the area under the EROC curve as a summary figure of merit. In the present work, we propose nonparametric estimators for practical EROC analysis from experimental data, including estimators for the area under the EROC curve and its variance. The estimators are illustrated with a practical example comparing MRI images reconstructed from different k-space sampling trajectories.
Performance Evaluation of New-Generation Pulse Oximeters in the NICU: Observational Study.

PubMed

Nizami, Shermeen; Greenwood, Kim; Barrowman, Nick; Harrold, JoAnn

2015-09-01

This crossover observational study compares the data characteristics and performance of new-generation Nellcor OXIMAX and Masimo SET SmartPod pulse oximeter technologies. The study was conducted independent of either original equipment manufacturer (OEM) across eleven preterm infants in a Neonatal Intensive Care Unit (NICU). The SmartPods were integrated with Dräger Infinity Delta monitors. The Delta monitor measured the heart rate (HR) using an independent electrocardiogram sensor, and the two SmartPods collected arterial oxygen saturation (SpO2) and pulse rate (PR). All patient data were non-Gaussian. Nellcor PR showed a higher correlation with the HR as compared to Masimo PR. The statistically significant difference found in their median values (1% for SpO2, 1 bpm for PR) was deemed clinically insignificant. SpO2 alarms generated by both SmartPods were observed and categorized for performance evaluation. Results for sensitivity, positive predictive value, accuracy and false alarm rates were Nellcor (80.3, 50, 44.5, 50%) and Masimo (72.2, 48.2, 40.6, 51.8%) respectively. These metrics were not statistically significantly different between the two pulse oximeters. Despite claims by OEMs, both pulse oximeters exhibited high false alarm rates, with no statistically or clinically significant difference in performance. These findings have a direct impact on alarm fatigue in the NICU. Performance evaluation studies can also impact medical device purchase decisions made by hospital administrators.
Correlation between human observer performance and model observer performance in differential phase contrast CT

DOE Office of Scientific and Technical Information (OSTI.GOV)

Li, Ke; Garrett, John; Chen, Guang-Hong

2013-11-15

Purpose: With the recently expanding interest and developments in x-ray differential phase contrast CT (DPC-CT), the evaluation of its task-specific detection performance and comparison with the corresponding absorption CT under a given radiation dose constraint become increasingly important. Mathematical model observers are often used to quantify the performance of imaging systems, but their correlations with actual human observers need to be confirmed for each new imaging method. This work is an investigation of the effects of stochastic DPC-CT noise on the correlation of detection performance between model and human observers with signal-known-exactly (SKE) detection tasks.Methods: The detectabilities of different objectsmore » (five disks with different diameters and two breast lesion masses) embedded in an experimental DPC-CT noise background were assessed using both model and human observers. The detectability of the disk and lesion signals was then measured using five types of model observers including the prewhitening ideal observer, the nonprewhitening (NPW) observer, the nonprewhitening observer with eye filter and internal noise (NPWEi), the prewhitening observer with eye filter and internal noise (PWEi), and the channelized Hotelling observer (CHO). The same objects were also evaluated by four human observers using the two-alternative forced choice method. The results from the model observer experiment were quantitatively compared to the human observer results to assess the correlation between the two techniques.Results: The contrast-to-detail (CD) curve generated by the human observers for the disk-detection experiments shows that the required contrast to detect a disk is inversely proportional to the square root of the disk size. Based on the CD curves, the ideal and NPW observers tend to systematically overestimate the performance of the human observers. The NPWEi and PWEi observers did not predict human performance well either, as the slopes of
A four-alternative forced choice (4AFC) software for observer performance evaluation in radiology

NASA Astrophysics Data System (ADS)

Zhang, Guozhi; Cockmartin, Lesley; Bosmans, Hilde

2016-03-01

Four-alternative forced choice (4AFC) test is a psychophysical method that can be adopted for observer performance evaluation in radiological studies. While the concept of this method is well established, difficulties to handle large image data, perform unbiased sampling, and keep track of the choice made by the observer have restricted its application in practice. In this work, we propose an easy-to-use software that can help perform 4AFC tests with DICOM images. The software suits for any experimental design that follows the 4AFC approach. It has a powerful image viewing system that favorably simulates the clinical reading environment. The graphical interface allows the observer to adjust various viewing parameters and perform the selection with very simple operations. The sampling process involved in 4AFC as well as the speed and accuracy of the choice made by the observer is precisely monitored in the background and can be easily exported for test analysis. The software has also a defensive mechanism for data management and operation control that minimizes the possibility of mistakes from user during the test. This software can largely facilitate the use of 4AFC approach in radiological observer studies and is expected to have widespread applicability.
Kalman-Filter-Based Orientation Determination Using Inertial/Magnetic Sensors: Observability Analysis and Performance Evaluation

PubMed Central

Sabatini, Angelo Maria

2011-01-01

In this paper we present a quaternion-based Extended Kalman Filter (EKF) for estimating the three-dimensional orientation of a rigid body. The EKF exploits the measurements from an Inertial Measurement Unit (IMU) that is integrated with a tri-axial magnetic sensor. Magnetic disturbances and gyro bias errors are modeled and compensated by including them in the filter state vector. We employ the observability rank criterion based on Lie derivatives to verify the conditions under which the nonlinear system that describes the process of motion tracking by the IMU is observable, namely it may provide sufficient information for performing the estimation task with bounded estimation errors. The observability conditions are that the magnetic field, perturbed by first-order Gauss-Markov magnetic variations, and the gravity vector are not collinear and that the IMU is subject to some angular motions. Computer simulations and experimental testing are presented to evaluate the algorithm performance, including when the observability conditions are critical. PMID:22163689
Using satellite observations in performance evaluation for regulatory air quality modeling: Comparison with ground-level measurements

NASA Astrophysics Data System (ADS)

Odman, M. T.; Hu, Y.; Russell, A.; Chai, T.; Lee, P.; Shankar, U.; Boylan, J.

2012-12-01

Regulatory air quality modeling, such as State Implementation Plan (SIP) modeling, requires that model performance meets recommended criteria in the base-year simulations using period-specific, estimated emissions. The goal of the performance evaluation is to assure that the base-year modeling accurately captures the observed chemical reality of the lower troposphere. Any significant deficiencies found in the performance evaluation must be corrected before any base-case (with typical emissions) and future-year modeling is conducted. Corrections are usually made to model inputs such as emission-rate estimates or meteorology and/or to the air quality model itself, in modules that describe specific processes. Use of ground-level measurements that follow approved protocols is recommended for evaluating model performance. However, ground-level monitoring networks are spatially sparse, especially for particulate matter. Satellite retrievals of atmospheric chemical properties such as aerosol optical depth (AOD) provide spatial coverage that can compensate for the sparseness of ground-level measurements. Satellite retrievals can also help diagnose potential model or data problems in the upper troposphere. It is possible to achieve good model performance near the ground, but have, for example, erroneous sources or sinks in the upper troposphere that may result in misleading and unrealistic responses to emission reductions. Despite these advantages, satellite retrievals are rarely used in model performance evaluation, especially for regulatory modeling purposes, due to the high uncertainty in retrievals associated with various contaminations, for example by clouds. In this study, 2007 was selected as the base year for SIP modeling in the southeastern U.S. Performance of the Community Multiscale Air Quality (CMAQ) model, at a 12-km horizontal resolution, for this annual simulation is evaluated using both recommended ground-level measurements and non-traditional satellite
Evaluation of CNN as anthropomorphic model observer

NASA Astrophysics Data System (ADS)

Massanes, Francesc; Brankov, Jovan G.

2017-03-01

Model observers (MO) are widely used in medical imaging to act as surrogates of human observers in task-based image quality evaluation, frequently towards optimization of reconstruction algorithms. In this paper, we explore the use of convolutional neural networks (CNN) to be used as MO. We will compare CNN MO to alternative MO currently being proposed and used such as the relevance vector machine based MO and channelized Hotelling observer (CHO). As the success of the CNN, and other deep learning approaches, is rooted in large data sets availability, which is rarely the case in medical imaging systems task-performance evaluation, we will evaluate CNN performance on both large and small training data sets.
Empirical Performance of Covariates in Education Observational Studies

ERIC Educational Resources Information Center

Wong, Vivian C.; Valentine, Jeffrey C.; Miller-Bains, Kate

2017-01-01

This article summarizes results from 12 empirical evaluations of observational methods in education contexts. We look at the performance of three common covariate-types in observational studies where the outcome is a standardized reading or math test. They are: pretest measures, local geographic matching, and rich covariate sets with a strong…
Distributed Space Mission Design for Earth Observation Using Model-Based Performance Evaluation

NASA Technical Reports Server (NTRS)

Nag, Sreeja; LeMoigne-Stewart, Jacqueline; Cervantes, Ben; DeWeck, Oliver

2015-01-01

Distributed Space Missions (DSMs) are gaining momentum in their application to earth observation missions owing to their unique ability to increase observation sampling in multiple dimensions. DSM design is a complex problem with many design variables, multiple objectives determining performance and cost and emergent, often unexpected, behaviors. There are very few open-access tools available to explore the tradespace of variables, minimize cost and maximize performance for pre-defined science goals, and therefore select the most optimal design. This paper presents a software tool that can multiple DSM architectures based on pre-defined design variable ranges and size those architectures in terms of predefined science and cost metrics. The tool will help a user select Pareto optimal DSM designs based on design of experiments techniques. The tool will be applied to some earth observation examples to demonstrate its applicability in making some key decisions between different performance metrics and cost metrics early in the design lifecycle.
Guidelines for reporting evaluations based on observational methodology.

PubMed

Portell, Mariona; Anguera, M Teresa; Chacón-Moscoso, Salvador; Sanduvete-Chaves, Susana

2015-01-01

Observational methodology is one of the most suitable research designs for evaluating fidelity of implementation, especially in complex interventions. However, the conduct and reporting of observational studies is hampered by the absence of specific guidelines, such as those that exist for other evaluation designs. This lack of specific guidance poses a threat to the quality and transparency of these studies and also constitutes a considerable publication hurdle. The aim of this study thus was to draw up a set of proposed guidelines for reporting evaluations based on observational methodology. The guidelines were developed by triangulating three sources of information: observational studies performed in different fields by experts in observational methodology, reporting guidelines for general studies and studies with similar designs to observational studies, and proposals from experts in observational methodology at scientific meetings. We produced a list of guidelines grouped into three domains: intervention and expected outcomes, methods, and results. The result is a useful, carefully crafted set of simple guidelines for conducting and reporting observational studies in the field of program evaluation.
Model Performance Evaluation and Scenario Analysis ...

EPA Pesticide Factsheets

This tool consists of two parts: model performance evaluation and scenario analysis (MPESA). The model performance evaluation consists of two components: model performance evaluation metrics and model diagnostics. These metrics provides modelers with statistical goodness-of-fit measures that capture magnitude only, sequence only, and combined magnitude and sequence errors. The performance measures include error analysis, coefficient of determination, Nash-Sutcliffe efficiency, and a new weighted rank method. These performance metrics only provide useful information about the overall model performance. Note that MPESA is based on the separation of observed and simulated time series into magnitude and sequence components. The separation of time series into magnitude and sequence components and the reconstruction back to time series provides diagnostic insights to modelers. For example, traditional approaches lack the capability to identify if the source of uncertainty in the simulated data is due to the quality of the input data or the way the analyst adjusted the model parameters. This report presents a suite of model diagnostics that identify if mismatches between observed and simulated data result from magnitude or sequence related errors. MPESA offers graphical and statistical options that allow HSPF users to compare observed and simulated time series and identify the parameter values to adjust or the input data to modify. The scenario analysis part of the too
Milestone-specific, Observed data points for evaluating levels of performance (MODEL) assessment strategy for anesthesiology residency programs.

PubMed

Nagy, Christopher J; Fitzgerald, Brian M; Kraus, Gregory P

2014-01-01

Anesthesiology residency programs will be expected to have Milestones-based evaluation systems in place by July 2014 as part of the Next Accreditation System. The San Antonio Uniformed Services Health Education Consortium (SAUSHEC) anesthesiology residency program developed and implemented a Milestones-based feedback and evaluation system a year ahead of schedule. It has been named the Milestone-specific, Observed Data points for Evaluating Levels of performance (MODEL) assessment strategy. The "MODEL Menu" and the "MODEL Blueprint" are tools that other anesthesiology residency programs can use in developing their own Milestones-based feedback and evaluation systems prior to ACGME-required implementation. Data from our early experience with the streamlined MODEL blueprint assessment strategy showed substantially improved faculty compliance with reporting requirements. The MODEL assessment strategy provides programs with a workable assessment method for residents, and important Milestones data points to programs for ACGME reporting.
Evaluation of Multiclass Model Observers in PET LROC Studies

NASA Astrophysics Data System (ADS)

Gifford, H. C.; Kinahan, P. E.; Lartizien, C.; King, M. A.

2007-02-01

A localization ROC (LROC) study was conducted to evaluate nonprewhitening matched-filter (NPW) and channelized NPW (CNPW) versions of a multiclass model observer as predictors of human tumor-detection performance with PET images. Target localization is explicitly performed by these model observers. Tumors were placed in the liver, lungs, and background soft tissue of a mathematical phantom, and the data simulation modeled a full-3D acquisition mode. Reconstructions were performed with the FORE+AWOSEM algorithm. The LROC study measured observer performance with 2D images consisting of either coronal, sagittal, or transverse views of the same set of cases. Versions of the CNPW observer based on two previously published difference-of-Gaussian channel models demonstrated good quantitative agreement with human observers. One interpretation of these results treats the CNPW observer as a channelized Hotelling observer with implicit internal noise
Multi-phenomenology Observation Network Evaluation Tool'' (MONET)

NASA Astrophysics Data System (ADS)

Oltrogge, D.; North, P.; Vallado, D.

2014-09-01

Evaluating overall performance of an SSA "system-of-systems" observational network collecting against thousands of Resident Space Objects (RSO) is very difficult for typical tasking or scheduling-based analysis tools. This is further complicated by networks that have a wide variety of sensor types and phenomena, to include optical, radar and passive RF types, each having unique resource, ops tempo, competing customer and detectability constraints. We present details of the Multi-phenomenology Observation Network Evaluation Tool (MONET), which circumvents these difficulties by assessing the ideal performance of such a network via a digitized supply-vs-demand approach. Cells of each sensors supply time are distributed among RSO targets of interest to determine the average performance of the network against that set of RSO targets. Orbit Determination heuristics are invoked to represent observation quantity and geometry notionally required to obtain the desired orbit estimation quality. To feed this approach, we derive the detectability and collection rate performance of optical, radar and passive RF sensor physical and performance characteristics. We then prioritize the selected RSO targets according to object size, active/inactive status, orbit regime, and/or other considerations. Finally, the OD-derived tracking demands of each RSO of interest are levied against remaining sensor supply until either (a) all sensor time is exhausted; or (b) the list of RSO targets is exhausted. The outputs from MONET include overall network performance metrics delineated by sensor type, objects and orbits tracked, along with likely orbit accuracies which might result from the conglomerate network tracking.
When the third party observer of a neuropsychological evaluation is an audio-recorder.

PubMed

Constantinou, Marios; Ashendorf, Lee; McCaffrey, Robert J

2002-08-01

The presence of third parties during neuropsychological evaluations is an issue of concern for contemporary neuropsychologists. Previous studies have reported that the presence of an observer during neuropsychological testing alters the performance of individuals under evaluation. The present study sought to investigate whether audio-recording affects the neuropsychological test performance of individuals in the same way that third party observation does. In the presence of an audio-recorder the performance of the participants on memory tests declined. Performance on motor tests, on the other hand, was not affected by the presence of an audio-recorder. The implications of these findings in forensic neuropsychological evaluations are discussed.
Observational uncertainty and regional climate model evaluation: A pan-European perspective

NASA Astrophysics Data System (ADS)

Kotlarski, Sven; Szabó, Péter; Herrera, Sixto; Räty, Olle; Keuler, Klaus; Soares, Pedro M.; Cardoso, Rita M.; Bosshard, Thomas; Pagé, Christian; Boberg, Fredrik; Gutiérrez, José M.; Jaczewski, Adam; Kreienkamp, Frank; Liniger, Mark. A.; Lussana, Cristian; Szepszo, Gabriella

2017-04-01

Local and regional climate change assessments based on downscaling methods crucially depend on the existence of accurate and reliable observational reference data. In dynamical downscaling via regional climate models (RCMs) observational data can influence model development itself and, later on, model evaluation, parameter calibration and added value assessment. In empirical-statistical downscaling, observations serve as predictand data and directly influence model calibration with corresponding effects on downscaled climate change projections. Focusing on the evaluation of RCMs, we here analyze the influence of uncertainties in observational reference data on evaluation results in a well-defined performance assessment framework and on a European scale. For this purpose we employ three different gridded observational reference grids, namely (1) the well-established EOBS dataset (2) the recently developed EURO4M-MESAN regional re-analysis, and (3) several national high-resolution and quality-controlled gridded datasets that recently became available. In terms of climate models five reanalysis-driven experiments carried out by five different RCMs within the EURO-CORDEX framework are used. Two variables (temperature and precipitation) and a range of evaluation metrics that reflect different aspects of RCM performance are considered. We furthermore include an illustrative model ranking exercise and relate observational spread to RCM spread. The results obtained indicate a varying influence of observational uncertainty on model evaluation depending on the variable, the season, the region and the specific performance metric considered. Over most parts of the continent, the influence of the choice of the reference dataset for temperature is rather small for seasonal mean values and inter-annual variability. Here, model uncertainty (as measured by the spread between the five RCM simulations considered) is typically much larger than reference data uncertainty. For
Clinical Performance Evaluations of Third-Year Medical Students and Association With Student and Evaluator Gender.

PubMed

Riese, Alison; Rappaport, Leah; Alverson, Brian; Park, Sangshin; Rockney, Randal M

2017-06-01

Clinical performance evaluations are major components of medical school clerkship grades. But are they sufficiently objective? This study aimed to determine whether student and evaluator gender is associated with assessment of overall clinical performance. This was a retrospective analysis of 4,272 core clerkship clinical performance evaluations by 829 evaluators of 155 third-year students, within the Alpert Medical School grading database for the 2013-2014 academic year. Overall clinical performance, assessed on a three-point scale (meets expectations, above expectations, exceptional), was extracted from each evaluation, as well as evaluator gender, age, training level, department, student gender and age, and length of observation time. Hierarchical ordinal regression modeling was conducted to account for clustering of evaluations. Female students were more likely to receive a better grade than males (adjusted odds ratio [AOR] 1.30, 95% confidence interval [CI] 1.13-1.50), and female evaluators awarded lower grades than males (AOR 0.72, 95% CI 0.55-0.93), adjusting for department, observation time, and student and evaluator age. The interaction between student and evaluator gender was significant (P = .03), with female evaluators assigning higher grades to female students, while male evaluators' grading did not differ by student gender. Students who spent a short time with evaluators were also more likely to get a lower grade. A one-year examination of all third-year clerkship clinical performance evaluations at a single institution revealed that male and female evaluators rated male and female students differently, even when accounting for other measured variables.

Balancing the Role of Priors in Multi-Observer Segmentation Evaluation

PubMed Central

Huang, Xiaolei; Wang, Wei; Lopresti, Daniel; Long, Rodney; Antani, Sameer; Xue, Zhiyun; Thoma, George

2009-01-01

Comparison of a group of multiple observer segmentations is known to be a challenging problem. A good segmentation evaluation method would allow different segmentations not only to be compared, but to be combined to generate a “true” segmentation with higher consensus. Numerous multi-observer segmentation evaluation approaches have been proposed in the literature, and STAPLE in particular probabilistically estimates the true segmentation by optimal combination of observed segmentations and a prior model of the truth. An Expectation–Maximization (EM) algorithm, STAPLE’S convergence to the desired local minima depends on good initializations for the truth prior and the observer-performance prior. However, accurate modeling of the initial truth prior is nontrivial. Moreover, among the two priors, the truth prior always dominates so that in certain scenarios when meaningful observer-performance priors are available, STAPLE can not take advantage of that information. In this paper, we propose a Bayesian decision formulation of the problem that permits the two types of prior knowledge to be integrated in a complementary manner in four cases with differing application purposes: (1) with known truth prior; (2) with observer prior; (3) with neither truth prior nor observer prior; and (4) with both truth prior and observer prior. The third and fourth cases are not discussed (or effectively ignored) by STAPLE, and in our research we propose a new method to combine multiple-observer segmentations based on the maximum a posterior (MAP) principle, which respects the observer prior regardless of the availability of the truth prior. Based on the four scenarios, we have developed a web-based software application that implements the flexible segmentation evaluation framework for digitized uterine cervix images. Experiment results show that our framework has flexibility in effectively integrating different priors for multi-observer segmentation evaluation and it also
Evaluation of nursing faculty through observation.

PubMed

Crawford, L H

1998-10-01

The purpose of this study was to assess current use and faculty perceptions of classroom observation as a method of faculty evaluation in schools of nursing. Baccalaureate schools of nursing were surveyed to determine current use of classroom observation and its worth from the perception of administrators and faculty. Although most schools used classroom observation as a method of faculty evaluation, further clarification and research is needed in the following areas: purpose of classroom observation; number of observations necessary; weight given to classroom observation in relation to other evaluation methods; and tools used.
[Inferential evaluation of intimacy based on observation of interpersonal communication].

PubMed

Kimura, Masanori

2015-06-01

How do people inferentially evaluate others' levels of intimacy with friends? We examined the inferential evaluation of intimacy based on the observation of interpersonal communication. In Experiment 1, participants (N = 41) responded to questions after observing conversations between friends. Results indicated that participants inferentially evaluated not only goodness of communication, but also intimacy between friends, using an expressivity heuristic approach. In Experiment 2, we investigated how inferential evaluation of intimacy was affected by prior information about relationships and by individual differences in face-to-face interactional ability. Participants (N = 64) were divided into prior- and no-prior-information groups and all performed the same task as in Experiment 1. Additionally, their interactional ability was assessed. In the prior-information group, individual differences had no effect on inferential evaluation of intimacy. On the other hand, in the no-prior-information group, face-to-face interactional ability partially influenced evaluations of intimacy. Finally, we discuss the fact that to understand one's social environment, it is important to observe others' interpersonal communications.
Apprentice Performance Evaluation.

ERIC Educational Resources Information Center

Gast, Clyde W.

The Granite City (Illinois) Steel apprentices are under a performance evaluation from entry to graduation. Federally approved, the program is guided by joint apprenticeship committees whose monthly meetings include performance evaluation from three information sources: journeymen, supervisors, and instructors. Journeymen's evaluations are made…
Evaluating supplier quality performance using analytical hierarchy process

NASA Astrophysics Data System (ADS)

Kalimuthu Rajoo, Shanmugam Sundram; Kasim, Maznah Mat; Ahmad, Nazihah

2013-09-01

This paper elaborates the importance of evaluating supplier quality performance to an organization. Supplier quality performance evaluation reflects the actual performance of the supplier exhibited at customer's end. It is critical in enabling the organization to determine the area of improvement and thereafter works with supplier to close the gaps. Success of the customer partly depends on supplier's quality performance. Key criteria as quality, cost, delivery, technology support and customer service are categorized as main factors in contributing to supplier's quality performance. 18 suppliers' who were manufacturing automotive application parts evaluated in year 2010 using weight point system. There were few suppliers with common rating which led to common ranking observed by few suppliers'. Analytical Hierarchy Process (AHP), a user friendly decision making tool for complex and multi criteria problems was used to evaluate the supplier's quality performance challenging the weight point system that was used for 18 suppliers'. The consistency ratio was checked for criteria and sub-criteria. Final results of AHP obtained with no overlap ratings, therefore yielded a better decision making methodology as compared to weight point rating system.
Performance and Evaluation of the Global Modeling and Assimilation Office Observing System Simulation Experiment

NASA Technical Reports Server (NTRS)

Prive, Nikki; Errico, R. M.; Carvalho, D.

2018-01-01

The National Aeronautics and Space Administration Global Modeling and Assimilation Office (NASA/GMAO) has spent more than a decade developing and implementing a global Observing System Simulation Experiment framework for use in evaluting both new observation types as well as the behavior of data assimilation systems. The NASA/GMAO OSSE has constantly evolved to relect changes in the Gridpoint Statistical Interpolation data assimiation system, the Global Earth Observing System model, version 5 (GEOS-5), and the real world observational network. Software and observational datasets for the GMAO OSSE are publicly available, along with a technical report. Substantial modifications have recently been made to the NASA/GMAO OSSE framework, including the character of synthetic observation errors, new instrument types, and more sophisticated atmospheric wind vectors. These improvements will be described, along with the overall performance of the current OSSE. Lessons learned from investigations into correlated errors and model error will be discussed.
Group 3: Performance evaluation and assessment

NASA Technical Reports Server (NTRS)

Frink, A.

1981-01-01

Line-oriented flight training provides a unique learning experience and an opportunity to look at aspects of performance other types of training did not provide. Areas such as crew coordination, resource management, leadership, and so forth, can be readily evaluated in such a format. While individual performance is of the utmost importance, crew performance deserves equal emphasis, therefore, these areas should be carefully observed by the instructors as an rea for discussion in the same way that individual performane is observed. To be effective, it must be accepted by the crew members, and administered by the instructors as pure training-learning through experience. To keep open minds, to benefit most from the experience, both in the doing and in the follow-on discussion, it is essential that it be entered into with a feeling of freedom, openness, and enthusiasm. Reserve or defensiveness because of concern for failure must be inhibit participation.
Self-Handicapping and Interpersonal Trade-Offs: The Effects of Claimed Self-Handicaps on Observers' Performance Evaluations and Feedback.

ERIC Educational Resources Information Center

Rhodewalt, Frederick; And Others

1995-01-01

Male subjects (n=130) evaluated performance of targets who, prior to and during the performance, offered no excuse, claimed intended low effort, claimed anxiety, or claimed drug impairment. Subjects evaluated objectively equivalent performances more negatively if they came from an excuse-making target than a no-excuse target. (JBJ)
A Framework for Orbital Performance Evaluation in Distributed Space Missions for Earth Observation

NASA Technical Reports Server (NTRS)

Nag, Sreeja; LeMoigne-Stewart, Jacqueline; Miller, David W.; de Weck, Olivier

2015-01-01

Distributed Space Missions (DSMs) are gaining momentum in their application to earth science missions owing to their unique ability to increase observation sampling in spatial, spectral and temporal dimensions simultaneously. DSM architectures have a large number of design variables and since they are expected to increase mission flexibility, scalability, evolvability and robustness, their design is a complex problem with many variables and objectives affecting performance. There are very few open-access tools available to explore the tradespace of variables which allow performance assessment and are easy to plug into science goals, and therefore select the most optimal design. This paper presents a software tool developed on the MATLAB engine interfacing with STK, for DSM orbit design and selection. It is capable of generating thousands of homogeneous constellation or formation flight architectures based on pre-defined design variable ranges and sizing those architectures in terms of predefined performance metrics. The metrics can be input into observing system simulation experiments, as available from the science teams, allowing dynamic coupling of science and engineering designs. Design variables include but are not restricted to constellation type, formation flight type, FOV of instrument, altitude and inclination of chief orbits, differential orbital elements, leader satellites, latitudes or regions of interest, planes and satellite numbers. Intermediate performance metrics include angular coverage, number of accesses, revisit coverage, access deterioration over time at every point of the Earth's grid. The orbit design process can be streamlined and variables more bounded along the way, owing to the availability of low fidelity and low complexity models such as corrected HCW equations up to high precision STK models with J2 and drag. The tool can thus help any scientist or program manager select pre-Phase A, Pareto optimal DSM designs for a variety of science
Teacher Evaluations: Do Classroom Observations and Evaluator Training Really Matter?

ERIC Educational Resources Information Center

Pies, Sarah J.

2017-01-01

The purpose of this study was to determine if the minimum number of observations stated in a district's teacher evaluation plan, observation characteristics described in a district's evaluation plan, and the characteristic of those evaluating teachers had an impact on whether a school would receive a bonus or penalty point for Indiana's A-F…
Obs4MIPS: Satellite Observations for Model Evaluation

NASA Astrophysics Data System (ADS)

Ferraro, R.; Waliser, D. E.; Gleckler, P. J.

2017-12-01

This poster will review the current status of the obs4MIPs project, whose purpose is to provide a limited collection of well-established and documented datasets for comparison with Earth system models (https://www.earthsystemcog.org/projects/obs4mips/). These datasets have been reformatted to correspond with the CMIP5 model output requirements, and include technical documentation specifically targeted for their use in model output evaluation. The project holdings now exceed 120 datasets with observations that directly correspond to CMIP5 model output variables, with new additions in response to the CMIP6 experiments. With the growth in climate model output data volume, it is increasing more difficult to bring the model output and the observations together to do evaluations. The positioning of the obs4MIPs datasets within the Earth System Grid Federation (ESGF) allows for the use of currently available and planned online tools within the ESGF to perform analysis using model output and observational datasets without necessarily downloading everything to a local workstation. This past year, obs4MIPs has updated its submission guidelines to closely align with changes in the CMIP6 experiments, and is implementing additional indicators and ancillary data to allow users to more easily determine the efficacy of an obs4MIPs dataset for specific evaluation purposes. This poster will present the new guidelines and indicators, and update the list of current obs4MIPs holdings and their connection to the ESGF evaluation and analysis tools currently available, and being developed for the CMIP6 experiments.
Evaluation of Eco-Efficiency and Performance of Retrofit Materials

NASA Astrophysics Data System (ADS)

Gopinath, Smitha; Rama Chandra Murthy, A.; Iyer, Nagesh R.; Kokila, S.

2015-12-01

In this work three materials namely Fiber Reinforced Polymer (FRP), ferrocement and Textile Reinforced Concrete (TRC) have been evaluated towards their performance efficiency and eco-effectiveness for sustainable retrofitting applications. Investigations have been carried out for flexural strengthening of RC beams with FRP, ferrocement and TRC. It is observed that in the case of FRP, it is not possible to tailor the material according to design requirements and most of the time strengthened structure becomes over stiff. Eco-effectiveness of these retrofitting materials has been evaluated by computing the embodied energy. It is observed that the amount of CO2 emitted by TRC is less compared to other retrofit materials. Further, the performance point of retrofitted RC frames has been evaluated and damage index has been calculated to find out the effective retrofit material. It is concluded that, if RC frame is retrofitted with FRP and TRC, it undergoes less damage compared to ferrocement.
Exercise Performance and Corticospinal Excitability during Action Observation

PubMed Central

Wrightson, James G.; Twomey, Rosie; Smeeton, Nicholas J.

2016-01-01

Purpose: Observation of a model performing fast exercise improves simultaneous exercise performance; however, the precise mechanism underpinning this effect is unknown. The aim of the present study was to investigate whether the speed of the observed exercise influenced both upper body exercise performance and the activation of a cortical action observation network (AON). Method: In Experiment 1, 10 participants completed a 5 km time trial on an arm-crank ergometer whilst observing a blank screen (no-video) and a model performing exercise at both a typical (i.e., individual mean cadence during baseline time trial) and 15% faster than typical speed. In Experiment 2, 11 participants performed arm crank exercise whilst observing exercise at typical speed, 15% slower and 15% faster than typical speed. In Experiment 3, 11 participants observed the typical, slow and fast exercise, and a no-video, whilst corticospinal excitability was assessed using transcranial magnetic stimulation. Results: In Experiment 1, performance time decreased and mean power increased, during observation of the fast exercise compared to the no-video condition. In Experiment 2, cadence and power increased during observation of the fast exercise compared to the typical speed exercise but there was no effect of observation of slow exercise on exercise behavior. In Experiment 3, observation of exercise increased corticospinal excitability; however, there was no difference between the exercise speeds. Conclusion: Observation of fast exercise improves simultaneous upper-body exercise performance. However, because there was no effect of exercise speed on corticospinal excitability, these results suggest that these improvements are not solely due to changes in the activity of the AON. PMID:27014037
Performance evaluation of automated segmentation software on optical coherence tomography volume data

PubMed Central

Tian, Jing; Varga, Boglarka; Tatrai, Erika; Fanni, Palya; Somfai, Gabor Mark; Smiddy, William E.

2016-01-01

Over the past two decades a significant number of OCT segmentation approaches have been proposed in the literature. Each methodology has been conceived for and/or evaluated using specific datasets that do not reflect the complexities of the majority of widely available retinal features observed in clinical settings. In addition, there does not exist an appropriate OCT dataset with ground truth that reflects the realities of everyday retinal features observed in clinical settings. While the need for unbiased performance evaluation of automated segmentation algorithms is obvious, the validation process of segmentation algorithms have been usually performed by comparing with manual labelings from each study and there has been a lack of common ground truth. Therefore, a performance comparison of different algorithms using the same ground truth has never been performed. This paper reviews research-oriented tools for automated segmentation of the retinal tissue on OCT images. It also evaluates and compares the performance of these software tools with a common ground truth. PMID:27159849
Administrators' Perceptions Regarding the Effectiveness of the Teacher Observation Evaluation System

ERIC Educational Resources Information Center

Williams, Kathleen Riley

2015-01-01

This phenomenological narrative study was designed to explore public school administrators' perceptions regarding Louisiana's Compass teacher observation evaluation system as a method for assessing teacher performance. Participants were administrators with at least two years of experience as a public school administrator at the secondary level,…
"Evaluations" of Observables Versus Measurements in Quantum Theory

NASA Astrophysics Data System (ADS)

Nisticò, Giuseppe; Sestito, Angela

2016-03-01

In Quantum Physics there are circumstances where the direct measurement of a given observable encounters difficulties; in some of these cases, however, its value can be "evaluated", i.e. it can be inferred by measuring another observable characterized by perfect correlation with the observable of interest. Though an evaluation is often interpreted as a measurement of the evaluated observable, we prove that the two concepts cannot be identified in Quantum Physics, because the identification yields contradictions. Then, we establish the conceptual status of evaluations in Quantum Theory and how they are related to measurements.
Carbon Monoxide Data Assimilation for Atmospheric Composition and Climate Science: Evaluating Performance with Current and Future Observations

NASA Astrophysics Data System (ADS)

Barre, J.; Edwards, D. P.; Gaubert, B.; Worden, H. M.; Arellano, A. F.; Anderson, J. L.

2015-12-01

Current satellite observations of tropospheric composition made from low Earth orbit provide at best one or two measurements each day at any given location. Comparisons of Terra/MOPITT carbon monoxide (CO) and IASI/Metop CO observation assimilations will be presented. We use the DART Ensemble Adjustment Kalman Filter to assimilate observations in the CAM-Chem global chemistry-climate model. Data assimilation impacts due to both different instrument capabilities (i.e. vertical sensitivity and global coverage) will be discussed. Coverage is global but sparse, often with large uncertainties in individual measurements that limit examination of local and regional atmospheric composition over short time periods. This has hindered the operational uptake of these data for monitoring air quality and population exposure, and for initializing and evaluating chemical weather forecasts. By the end of the current decade there are planned geostationary Earth orbit (GEO) satellite missions for atmospheric composition over North America, East Asia and Europe with additional missions proposed. Together, these present the possibility of a constellation of geostationary platforms to achieve continuous time-resolved high-density observations of continental domains for mapping pollutant sources and variability on diurnal and local scales. We describe Observing System Simulation Experiments (OSSEs) to evaluate the contributions of these GEO missions to improve knowledge of near-surface air pollution due to intercontinental long-range transport and quantify chemical precursor emissions. Our approach uses an efficient computational method to sample a high-resolution global GEOS-5 chemistry Nature Run over each geographical region of the GEO constellation. The demonstration carbon monoxide (CO) observation simulator, which will be expanded to other chemical pollutants, currently produces multispectral retrievals (MOPITT-like) and captures realistic scene-dependent variation in measurement
Lessons from cross-fleet/cross-airline observations - Evaluating the impact of CRM/LOFT training

NASA Technical Reports Server (NTRS)

Butler, Roy E.

1991-01-01

A review is presented of the crew resource management/line oriented flight training (CRM/LOFT) program to help determine the level of standardization across fleets and airlines in the critical area of evaluating crew behavior and performance. One of the goals of the project is to verify that check airmen and LOFT instructors within organizations are evaluating CRM issues consistently and that differences observed between fleets are not a function of idiosyncracies on the part of observers. Attention is given to the research tools for crew evaluation.
Foveated model observers to predict human performance in 3D images

NASA Astrophysics Data System (ADS)

Lago, Miguel A.; Abbey, Craig K.; Eckstein, Miguel P.

2017-03-01

We evaluate 3D search requires model observers that take into account the peripheral human visual processing (foveated models) to predict human observer performance. We show that two different 3D tasks, free search and location-known detection, influence the relative human visual detectability of two signals of different sizes in synthetic backgrounds mimicking the noise found in 3D digital breast tomosynthesis. One of the signals resembled a microcalcification (a small and bright sphere), while the other one was designed to look like a mass (a larger Gaussian blob). We evaluated current standard models observers (Hotelling; Channelized Hotelling; non-prewhitening matched filter with eye filter, NPWE; and non-prewhitening matched filter model, NPW) and showed that they incorrectly predict the relative detectability of the two signals in 3D search. We propose a new model observer (3D Foveated Channelized Hotelling Observer) that incorporates the properties of the visual system over a large visual field (fovea and periphery). We show that the foveated model observer can accurately predict the rank order of detectability of the signals in 3D images for each task. Together, these results motivate the use of a new generation of foveated model observers for predicting image quality for search tasks in 3D imaging modalities such as digital breast tomosynthesis or computed tomography.
48 CFR 436.604 - Performance evaluation.

Code of Federal Regulations, 2012 CFR

2012-10-01

... 48 Federal Acquisition Regulations System 4 2012-10-01 2012-10-01 false Performance evaluation... Performance evaluation. Preparation of performance evaluation reports. (a) In addition to the requirements of FAR 36.604, performance evaluation reports shall be prepared for indefinite-delivery type contracts...

48 CFR 436.604 - Performance evaluation.

Code of Federal Regulations, 2014 CFR

2014-10-01

... 48 Federal Acquisition Regulations System 4 2014-10-01 2014-10-01 false Performance evaluation... Performance evaluation. Preparation of performance evaluation reports. (a) In addition to the requirements of FAR 36.604, performance evaluation reports shall be prepared for indefinite-delivery type contracts...
48 CFR 436.604 - Performance evaluation.

Code of Federal Regulations, 2011 CFR

2011-10-01

... 48 Federal Acquisition Regulations System 4 2011-10-01 2011-10-01 false Performance evaluation... Performance evaluation. Preparation of performance evaluation reports. (a) In addition to the requirements of FAR 36.604, performance evaluation reports shall be prepared for indefinite-delivery type contracts...
48 CFR 436.604 - Performance evaluation.

Code of Federal Regulations, 2013 CFR

2013-10-01

... 48 Federal Acquisition Regulations System 4 2013-10-01 2013-10-01 false Performance evaluation... Performance evaluation. Preparation of performance evaluation reports. (a) In addition to the requirements of FAR 36.604, performance evaluation reports shall be prepared for indefinite-delivery type contracts...
48 CFR 436.604 - Performance evaluation.

Code of Federal Regulations, 2010 CFR

2010-10-01

... 48 Federal Acquisition Regulations System 4 2010-10-01 2010-10-01 false Performance evaluation... Performance evaluation. Preparation of performance evaluation reports. (a) In addition to the requirements of FAR 36.604, performance evaluation reports shall be prepared for indefinite-delivery type contracts...
Exercising upper respiratory videoendoscopic evaluation of 100 nonracing performance horses with abnormal respiratory noise and/or poor performance.

PubMed

Davidson, E J; Martin, B B; Boston, R C; Parente, E J

2011-01-01

Although well documented in racehorses, there is paucity in the literature regarding the prevalence of dynamic upper airway abnormalities in nonracing performance horses. To describe upper airway function of nonracing performance horses with abnormal respiratory noise and/or poor performance via exercising upper airway videoendoscopy. Medical records of nonracing performance horses admitted for exercising evaluation with a chief complaint of abnormal respiratory noise and/or poor performance were reviewed. All horses had video recordings of resting and exercising upper airway endoscopy. Relationships between horse demographics, resting endoscopic findings, treadmill intensity and implementation of head and neck flexion during exercise with exercising endoscopic findings were examined. Dynamic upper airway obstructions were observed in 72% of examinations. Head and neck flexion was necessary to obtain a diagnosis in 21 horses. Pharyngeal wall collapse was the most prevalent upper airway abnormality, observed in 31% of the examinations. Complex abnormalities were noted in 27% of the examinations. Resting laryngeal dysfunction was significantly associated with dynamic arytenoid collapse and the odds of detecting intermittent dorsal displacement of the soft palate (DDSP) during exercise in horses with resting DDSP was only 7.7%. Exercising endoscopic observations were different from the resting observations in 54% of examinations. Dynamic upper airway obstructions were common in nonracing performance horses with respiratory noise and/or poor performance. Resting endoscopy was only helpful in determining exercising abnormalities with recurrent laryngeal neuropathy. This study emphasises the importance of exercising endoscopic evaluation in nonracing performance horses with abnormal respiratory noise and/or poor performance for accurate assessment of dynamic upper airway function. © 2010 EVJ Ltd.
A Gold Standards Approach to Training Instructors to Evaluate Crew Performance

NASA Technical Reports Server (NTRS)

Baker, David P.; Dismukes, R. Key

2003-01-01

The Advanced Qualification Program requires that airlines evaluate crew performance in Line Oriented Simulation. For this evaluation to be meaningful, instructors must observe relevant crew behaviors and evaluate those behaviors consistently and accurately against standards established by the airline. The airline industry has largely settled on an approach in which instructors evaluate crew performance on a series of event sets, using standardized grade sheets on which behaviors specific to event set are listed. Typically, new instructors are given a class in which they learn to use the grade sheets and practice evaluating crew performance observed on videotapes. These classes emphasize reliability, providing detailed instruction and practice in scoring so that all instructors within a given class will give similar scores to similar performance. This approach has value but also has important limitations; (1) ratings within one class of new instructors may differ from those of other classes; (2) ratings may not be driven primarily by the specific behaviors on which the company wanted the crews to be scored; and (3) ratings may not be calibrated to company standards for level of performance skill required. In this paper we provide a method to extend the existing method of training instructors to address these three limitations. We call this method the "gold standards" approach because it uses ratings from the company's most experienced instructors as the basis for training rater accuracy. This approach ties the training to the specific behaviors on which the experienced instructors based their ratings.
Performance Evaluation of the United Nations Environment Programme Air Quality Monitoring Unit

EPA Pesticide Factsheets

This report defines the specifics of the environmental test conditions used in the evaluation (systems and conditions), data observations, summarization of key performance evaluation findings, and ease of use features concerning the UNEP pod.
Compact Solar Spectrometer Column CO2, and CH4 Observations: Performance Evaluation at Multiple North American TCCON Sites

NASA Astrophysics Data System (ADS)

Parker, H. A.; Hedelius, J.; Viatte, C.; Wunch, D.; Wennberg, P. O.; Chen, J.; Wofsy, S.; Jones, T.; Franklin, J.; Dubey, M. K.; Roehl, C. M.; Podolske, J. R.; Hillyard, P. W.; Iraci, L. T.

2015-12-01

Measurement, reporting and verification (MRV) of anthropogenic emissions and natural sources and sinks of carbon dioxide (CO2) and methane (CH4) are crucial to predict climate change and develop transparent accounting policies to contain climate forcing. Remote sensing technologies are monitoring column averaged dry air mole fractions of CO2 and CH4 (XCO2 & XCH4) from ground and space (OCO-2 and GOSAT) with solar spectroscopy enabling direct MRV. However, current ground based coverage is sparse due to the need for large and expensive high-resolution spectrometers that are part of the Total Column Carbon Observing Network (TCCON, Bruker 125HR). This limits our MRV and satellite validation abilities, both regionally and globally. There are striking monitoring gaps in Asia, South America and Africa where the CO2 emissions are growing and there is a large uncertainty in fluxes from land use change, biomass burning and rainforest vulnerability. To fill this gap we evaluate the precision, accuracy and stability of compact, affordable and easy to use low-resolution spectrometers (Bruker EM27/SUN) by comparing with XCO2 and XCH4 retrieved from much larger high-resolution TCCON instruments. As these instruments will be used in a variety of locations, we evaluate their performance by comparing with 2 previous and 4 current United States TCCON sites in different regions up to 2700 km apart. These sites range from polluted to unpolluted, latitudes of 32 to 46°N, and altitudes of 230 to 2241 masl. Comparisons with some of these sites cover multiple years allowing assessment of the EM27/SUN performance not only in various regions, but also over an extended period of time and with different seasonal influences. Results show that our 2 EM27/SUN instruments capture the diurnal variability of the aforementioned constituents very well, but with offsets from TCCON and long-term variability which may be due in part to the extensive movement these spectrometers were subjected to. These
Memory for performed and observed activities following traumatic brain injury

PubMed Central

Wright, Matthew J.; Wong, Andrew L.; Obermeit, Lisa C.; Woo, Ellen; Schmitter-Edgecombe, Maureen; Fuster, Joaquín M.

2014-01-01

Traumatic brain injury (TBI) is associated with deficits in memory for the content of completed activities. However, TBI groups have shown variable memory for the temporal order of activities. We sought to clarify the conditions under which temporal order memory for activities is intact following TBI. Additionally, we evaluated activity source memory and the relationship between activity memory and functional outcome in TBI participants. Thus, we completed a study of activity memory with 18 severe TBI survivors and 18 healthy age- and education-matched comparison participants. Both groups performed eight activities and observed eight activities that were fashioned after routine daily tasks. Incidental encoding conditions for activities were utilized. The activities were drawn from two counterbalanced lists, and both performance and observation were randomly determined and interspersed. After all of the activities were completed, content memory (recall and recognition), source memory (conditional source identification), and temporal order memory (correlation between order reconstruction and actual order) for the activities were assessed. Functional ability was assessed via the Community Integration Questionnaire (CIQ). In terms of content memory, TBI participants recalled and recognized fewer activities than comparison participants. Recognition of performed and observed activities was strongly associated with social integration on the CIQ. There were no between- or within-group differences in temporal order or source memory, although source memory performances were near ceiling. The findings were interpreted as suggesting that temporal order memory following TBI is intact under conditions of both purposeful activity completion and incidental encoding, and that activity memory is related to functional outcomes following TBI. PMID:24524393
Performance evaluation of contrast-detail in full field digital mammography systems using ideal (Hotelling) observer vs. conventional automated analysis of CDMAM images for quality control of contrast-detail characteristics.

PubMed

Delakis, Ioannis; Wise, Robert; Morris, Lauren; Kulama, Eugenia

2015-11-01

The purpose of this work was to evaluate the contrast-detail performance of full field digital mammography (FFDM) systems using ideal (Hotelling) observer Signal-to-Noise Ratio (SNR) methodology and ascertain whether it can be considered an alternative to the conventional, automated analysis of CDMAM phantom images. Five FFDM units currently used in the national breast screening programme were evaluated, which differed with respect to age, detector, Automatic Exposure Control (AEC) and target/filter combination. Contrast-detail performance was analysed using CDMAM and ideal observer SNR methodology. The ideal observer SNR was calculated for input signal originating from gold discs of varying thicknesses and diameters, and then used to estimate the threshold gold thickness for each diameter as per CDMAM analysis. The variability of both methods and the dependence of CDMAM analysis on phantom manufacturing discrepancies also investigated. Results from both CDMAM and ideal observer methodologies were informative differentiators of FFDM systems' contrast-detail performance, displaying comparable patterns with respect to the FFDM systems' type and age. CDMAM results suggested higher threshold gold thickness values compared with the ideal observer methodology, especially for small-diameter details, which can be attributed to the behaviour of the CDMAM phantom used in this study. In addition, ideal observer methodology results showed lower variability than CDMAM results. The Ideal observer SNR methodology can provide a useful metric of the FFDM systems' contrast detail characteristics and could be considered a surrogate for conventional, automated analysis of CDMAM images. Copyright © 2015 Associazione Italiana di Fisica Medica. Published by Elsevier Ltd. All rights reserved.
Intra-observer reproducibility and diagnostic performance of breast shear-wave elastography in Asian women.

PubMed

Park, Hye Young; Han, Kyung Hwa; Yoon, Jung Hyun; Moon, Hee Jung; Kim, Min Jung; Kim, Eun-Kyung

2014-06-01

Our aim was to evaluate intra-observer reproducibility of shear-wave elastography (SWE) in Asian women. Sixty-four breast masses (24 malignant, 40 benign) were examined with SWE in 53 consecutive Asian women (mean age, 44.9 y old). Two SWE images were obtained for each of the lesions. The intra-observer reproducibility was assessed by intra-class correlation coefficients (ICC). We also evaluated various clinicoradiologic factors that can influence reproducibility in SWE. The ICC of intra-observer reproducibility was 0.789. In clinicoradiologic factor evaluation, masses surrounded by mixed fatty and glandular tissue (ICC: 0.619) showed lower intra-observer reproducibility compared with lesions that were surrounded by glandular tissue alone (ICC: 0.937; p < 0.05). Overall, the intra-observer reproducibility of breast SWE was excellent in Asian women. However, it may decrease when breast tissue is in a heterogeneous background. Therefore, SWE should be performed carefully in these cases. Copyright © 2014 World Federation for Ultrasound in Medicine & Biology. Published by Elsevier Inc. All rights reserved.
48 CFR 236.604 - Performance evaluation.

Code of Federal Regulations, 2014 CFR

2014-10-01

... 48 Federal Acquisition Regulations System 3 2014-10-01 2014-10-01 false Performance evaluation... Architect-Engineer Services 236.604 Performance evaluation. Prepare a separate performance evaluation after... familiar with the architect-engineer contractor's performance. [76 FR 58155, Sept. 20, 2011] ...
48 CFR 236.604 - Performance evaluation.

Code of Federal Regulations, 2013 CFR

2013-10-01

... 48 Federal Acquisition Regulations System 3 2013-10-01 2013-10-01 false Performance evaluation... Architect-Engineer Services 236.604 Performance evaluation. Prepare a separate performance evaluation after... familiar with the architect-engineer contractor's performance. [76 FR 58155, Sept. 20, 2011] ...
48 CFR 236.604 - Performance evaluation.

Code of Federal Regulations, 2012 CFR

2012-10-01

... 48 Federal Acquisition Regulations System 3 2012-10-01 2012-10-01 false Performance evaluation... Architect-Engineer Services 236.604 Performance evaluation. Prepare a separate performance evaluation after... familiar with the architect-engineer contractor's performance. [76 FR 58155, Sept. 20, 2011] ...
48 CFR 236.604 - Performance evaluation.

Code of Federal Regulations, 2011 CFR

2011-10-01

... 48 Federal Acquisition Regulations System 3 2011-10-01 2011-10-01 false Performance evaluation... Architect-Engineer Services 236.604 Performance evaluation. Prepare a separate performance evaluation after... familiar with the architect-engineer contractor's performance. [76 FR 58155, Sept. 20, 2011] ...
48 CFR 236.604 - Performance evaluation.

Code of Federal Regulations, 2010 CFR

2010-10-01

... 48 Federal Acquisition Regulations System 3 2010-10-01 2010-10-01 false Performance evaluation... Architect-Engineer Services 236.604 Performance evaluation. (a) Preparation of performance reports. Use DD Form 2631, Performance Evaluation (Architect-Engineer), instead of SF 1421. (2) Prepare a separate...
Performance evaluation of Bragg coherent diffraction imaging

NASA Astrophysics Data System (ADS)

Öztürk, H.; Huang, X.; Yan, H.; Robinson, I. K.; Noyan, I. C.; Chu, Y. S.

2017-10-01

In this study, we present a numerical framework for modeling three-dimensional (3D) diffraction data in Bragg coherent diffraction imaging (Bragg CDI) experiments and evaluating the quality of obtained 3D complex-valued real-space images recovered by reconstruction algorithms under controlled conditions. The approach is used to systematically explore the performance and the detection limit of this phase-retrieval-based microscopy tool. The numerical investigation suggests that the superb performance of Bragg CDI is achieved with an oversampling ratio above 30 and a detection dynamic range above 6 orders. The observed performance degradation subject to the data binning processes is also studied. This numerical tool can be used to optimize experimental parameters and has the potential to significantly improve the throughput of Bragg CDI method.
48 CFR 2936.604 - Performance evaluation.

Code of Federal Regulations, 2010 CFR

2010-10-01

... 48 Federal Acquisition Regulations System 7 2010-10-01 2010-10-01 false Performance evaluation... Performance evaluation. (a) The HCA must establish procedures to evaluate architect-engineer contractor performance as required in FAR 36.604. Normally, the performance report must be prepared by the contracting...
48 CFR 2936.604 - Performance evaluation.

Code of Federal Regulations, 2011 CFR

2011-10-01

... 48 Federal Acquisition Regulations System 7 2011-10-01 2011-10-01 false Performance evaluation... Performance evaluation. (a) The HCA must establish procedures to evaluate architect-engineer contractor performance as required in FAR 36.604. Normally, the performance report must be prepared by the contracting...
48 CFR 2936.604 - Performance evaluation.

Code of Federal Regulations, 2014 CFR

2014-10-01

... 48 Federal Acquisition Regulations System 7 2014-10-01 2014-10-01 false Performance evaluation... Performance evaluation. (a) The HCA must establish procedures to evaluate architect-engineer contractor performance as required in FAR 36.604. Normally, the performance report must be prepared by the contracting...

48 CFR 2936.604 - Performance evaluation.

Code of Federal Regulations, 2012 CFR

2012-10-01

... 48 Federal Acquisition Regulations System 7 2012-10-01 2012-10-01 false Performance evaluation... Performance evaluation. (a) The HCA must establish procedures to evaluate architect-engineer contractor performance as required in FAR 36.604. Normally, the performance report must be prepared by the contracting...
48 CFR 2936.604 - Performance evaluation.

Code of Federal Regulations, 2013 CFR

2013-10-01

... 48 Federal Acquisition Regulations System 7 2013-10-01 2012-10-01 true Performance evaluation... Performance evaluation. (a) The HCA must establish procedures to evaluate architect-engineer contractor performance as required in FAR 36.604. Normally, the performance report must be prepared by the contracting...
Classroom Composition and Measured Teacher Performance: What Do Teacher Observation Scores Really Measure?

ERIC Educational Resources Information Center

Steinberg, Matthew P.; Garrett, Rachel

2016-01-01

As states and districts implement more rigorous teacher evaluation systems, measures of teacher performance are increasingly being used to support instruction and inform retention decisions. Classroom observations take a central role in these systems, accounting for the majority of teacher ratings upon which accountability decisions are based.…
Tuberculosis control program in the municipal context: performance evaluation

PubMed Central

Arakawa, Tiemi; Magnabosco, Gabriela Tavares; Andrade, Rubia Laine de Paula; Brunello, Maria Eugenia Firmino; Monroe, Aline Aparecida; Ruffino-Netto, Antonio; Scatena, Lucia Marina; Villa, Tereza Cristina Scatena

2017-01-01

ABSTRACT OBJECTIVE The objective of this study is to evaluate the performance of the Tuberculosis Control Program in municipalities of the State of São Paulo. METHODS This is a program evaluation research, with ecological design, which uses three non-hierarchical groups of the municipalities of the State of São Paulo according to their performance in relation to operational indicators. We have selected 195 municipalities with at least five new cases of tuberculosis notified in the Notification System of the State of São Paulo and with 20,000 inhabitants or more in 2010. The multiple correspondence analysis was used to identify the association between the groups of different performances, the epidemiological and demographic characteristics, and the characteristics of the health systems of the municipalities. RESULTS The group with the worst performance showed the highest rates of abandonment (average [avg] = 10.4, standard deviation [sd] = 9.4) and the lowest rates of supervision of Directly Observed Treatment (avg = 6.1, sd = 12.9), and it was associated with low incidence of tuberculosis, high tuberculosis and HIV, small population, high coverage of the Family Health Strategy/Program of Community Health Agents, and being located on the countryside. The group with the best performance presented the highest cure rate (avg = 83.7, sd = 10.5) and the highest rate of cases in Directly Observed Treatment (avg = 83.0, sd = 12.7); the group of regular performance showed regular results for outcome (avg cure = 79.8, sd = 13.2; abandonment avg = 9.5, sd = 8.3) and supervision of the Directly Observed Treatment (avg = 42.8, sd = 18.8). Large population, low coverage of the Family Health Strategy/Program of Community Health Agents, high incidence of tuberculosis and AIDS, and being located on the coast and in metropolitan areas were associated with these groups. CONCLUSIONS The findings highlight the importance of the Directly Observed Treatment in relation to the outcome
X-Ray Phantom Development For Observer Performance Studies

NASA Astrophysics Data System (ADS)

Kelsey, C. A.; Moseley, R. D.; Mettler, F. A.; Parker, T. W.

1981-07-01

The requirements for radiographic imaging phantoms for observer performance testing include realistic tasks which mimic at least some portion of the diagnostic examination presented in a setting which approximates clinically derived images. This study describes efforts to simulate chest and vascular diseases for evaluation of conventional and digital radiographic systems. Images of lung nodules, pulmonary infiltrates, as well as hilar and mediastinal masses are generated with a conventional chest phantom to make up chest disease test series. Vascular images are simulated by hollow tubes embedded in tissue density plastic with widening and narrowing added to mimic aneurysms and stenoses. Both sets of phantoms produce images which allow simultaneous determination of true positive and false positive rates as well as complete ROC curves.
Low-cost high performance distributed data storage for multi-channel observations

NASA Astrophysics Data System (ADS)

Liu, Ying-bo; Wang, Feng; Deng, Hui; Ji, Kai-fan; Dai, Wei; Wei, Shou-lin; Liang, Bo; Zhang, Xiao-li

2015-10-01

The New Vacuum Solar Telescope (NVST) is a 1-m solar telescope that aims to observe the fine structures in both the photosphere and the chromosphere of the Sun. The observational data acquired simultaneously from one channel for the chromosphere and two channels for the photosphere bring great challenges to the data storage of NVST. The multi-channel instruments of NVST, including scientific cameras and multi-band spectrometers, generate at least 3 terabytes data per day and require high access performance while storing massive short-exposure images. It is worth studying and implementing a storage system for NVST which would balance the data availability, access performance and the cost of development. In this paper, we build a distributed data storage system (DDSS) for NVST and then deeply evaluate the availability of real-time data storage on a distributed computing environment. The experimental results show that two factors, i.e., the number of concurrent read/write and the file size, are critically important for improving the performance of data access on a distributed environment. Referring to these two factors, three strategies for storing FITS files are presented and implemented to ensure the access performance of the DDSS under conditions of multi-host write and read simultaneously. The real applications of the DDSS proves that the system is capable of meeting the requirements of NVST real-time high performance observational data storage. Our study on the DDSS is the first attempt for modern astronomical telescope systems to store real-time observational data on a low-cost distributed system. The research results and corresponding techniques of the DDSS provide a new option for designing real-time massive astronomical data storage system and will be a reference for future astronomical data storage.
Evaluation of Surface Flux Parameterizations with Long-Term ARM Observations

DOE PAGES

Liu, Gang; Liu, Yangang; Endo, Satoshi

2013-02-01

Surface momentum, sensible heat, and latent heat fluxes are critical for atmospheric processes such as clouds and precipitation, and are parameterized in a variety of models ranging from cloud-resolving models to large-scale weather and climate models. However, direct evaluation of the parameterization schemes for these surface fluxes is rare due to limited observations. This study takes advantage of the long-term observations of surface fluxes collected at the Southern Great Plains site by the Department of Energy Atmospheric Radiation Measurement program to evaluate the six surface flux parameterization schemes commonly used in the Weather Research and Forecasting (WRF) model and threemore » U.S. general circulation models (GCMs). The unprecedented 7-yr-long measurements by the eddy correlation (EC) and energy balance Bowen ratio (EBBR) methods permit statistical evaluation of all six parameterizations under a variety of stability conditions, diurnal cycles, and seasonal variations. The statistical analyses show that the momentum flux parameterization agrees best with the EC observations, followed by latent heat flux, sensible heat flux, and evaporation ratio/Bowen ratio. The overall performance of the parameterizations depends on atmospheric stability, being best under neutral stratification and deteriorating toward both more stable and more unstable conditions. Further diagnostic analysis reveals that in addition to the parameterization schemes themselves, the discrepancies between observed and parameterized sensible and latent heat fluxes may stem from inadequate use of input variables such as surface temperature, moisture availability, and roughness length. The results demonstrate the need for improving the land surface models and measurements of surface properties, which would permit the evaluation of full land surface models.« less
Feedback-giving behaviour in performance evaluations during clinical clerkships.

PubMed

Bok, Harold G J; Jaarsma, Debbie A D C; Spruijt, Annemarie; Van Beukelen, Peter; Van Der Vleuten, Cees P M; Teunissen, Pim W

2016-01-01

Narrative feedback documented in performance evaluations by the teacher, i.e. the clinical supervisor, is generally accepted to be essential for workplace learning. Many studies have examined factors of influence on the usage of mini-clinical evaluation exercise (mini-CEX) instruments and provision of feedback, but little is known about how these factors influence teachers' feedback-giving behaviour. In this study, we investigated teachers' use of mini-CEX in performance evaluations to provide narrative feedback in undergraduate clinical training. We designed an exploratory qualitative study using an interpretive approach. Focusing on the usage of mini-CEX instruments in clinical training, we conducted semi-structured interviews to explore teachers' perceptions. Between February and June 2013, we conducted interviews with 14 clinicians participated as teachers during undergraduate clinical clerkships. Informed by concepts from the literature, we coded interview transcripts and iteratively reduced and displayed data using template analysis. We identified three main themes of interrelated factors that influenced teachers' practice with regard to mini-CEX instruments: teacher-related factors; teacher-student interaction-related factors, and teacher-context interaction-related factors. Four issues (direct observation, relationship between teacher and student, verbal versus written feedback, formative versus summative purposes) that are pertinent to workplace-based performance evaluations were presented to clarify how different factors interact with each other and influence teachers' feedback-giving behaviour. Embedding performance observation in clinical practice and establishing trustworthy teacher-student relationships in more longitudinal clinical clerkships were considered important in creating a learning environment that supports and facilitates the feedback exchange. Teachers' feedback-giving behaviour within the clinical context results from the interaction
48 CFR 36.604 - Performance evaluation.

Code of Federal Regulations, 2011 CFR

2011-10-01

... 48 Federal Acquisition Regulations System 1 2011-10-01 2011-10-01 false Performance evaluation. 36.604 Section 36.604 Federal Acquisition Regulations System FEDERAL ACQUISITION REGULATION SPECIAL... Performance evaluation. See 42.1502(f) for the requirements for preparing past performance evaluations for...
48 CFR 36.604 - Performance evaluation.

Code of Federal Regulations, 2012 CFR

2012-10-01

... 48 Federal Acquisition Regulations System 1 2012-10-01 2012-10-01 false Performance evaluation. 36.604 Section 36.604 Federal Acquisition Regulations System FEDERAL ACQUISITION REGULATION SPECIAL... Performance evaluation. See 42.1502(f) for the requirements for preparing past performance evaluations for...
48 CFR 36.604 - Performance evaluation.

Code of Federal Regulations, 2010 CFR

2010-10-01

... 48 Federal Acquisition Regulations System 1 2010-10-01 2010-10-01 false Performance evaluation. 36.604 Section 36.604 Federal Acquisition Regulations System FEDERAL ACQUISITION REGULATION SPECIAL... Performance evaluation. See 42.1502(f) for the requirements for preparing past performance evaluations for...
48 CFR 36.604 - Performance evaluation.

Code of Federal Regulations, 2014 CFR

2014-10-01

... 48 Federal Acquisition Regulations System 1 2014-10-01 2014-10-01 false Performance evaluation. 36.604 Section 36.604 Federal Acquisition Regulations System FEDERAL ACQUISITION REGULATION SPECIAL... Performance evaluation. See 42.1502(f) for the requirements for preparing past performance evaluations for...
Performance evaluation of Bragg coherent diffraction imaging

DOE PAGES

Ozturk, Hande; Huang, X.; Yan, H.; ...

2017-10-03

In this study, we present a numerical framework for modeling three-dimensional (3D) diffraction data in Bragg coherent diffraction imaging (Bragg CDI) experiments and evaluating the quality of obtained 3D complex-valued real-space images recovered by reconstruction algorithms under controlled conditions. The approach is used to systematically explore the performance and the detection limit of this phase-retrieval-based microscopy tool. The numerical investigation suggests that the superb performance of Bragg CDI is achieved with an oversampling ratio above 30 and a detection dynamic range above 6 orders. The observed performance degradation subject to the data binning processes is also studied. Furthermore, this numericalmore » tool can be used to optimize experimental parameters and has the potential to significantly improve the throughput of Bragg CDI method.« less
How accurately do drivers evaluate their own driving behavior? An on-road observational study.

PubMed

Amado, Sonia; Arıkan, Elvan; Kaça, Gülin; Koyuncu, Mehmet; Turkan, B Nilay

2014-02-01

Self-assessment of driving skills became a noteworthy research subject in traffic psychology, since by knowing one's strenghts and weaknesses, drivers can take an efficient compensatory action to moderate risk and to ensure safety in hazardous environments. The current study aims to investigate drivers' self-conception of their own driving skills and behavior in relation to expert evaluations of their actual driving, by using naturalistic and systematic observation method during actual on-road driving session and to assess the different aspects of driving via comprehensive scales sensitive to different specific aspects of driving. 19-63 years old male participants (N=158) attended an on-road driving session lasting approximately 80min (45km). During the driving session, drivers' errors and violations were recorded by an expert observer. At the end of the driving session, observers completed the driver evaluation questionnaire, while drivers completed the driving self-evaluation questionnaire and Driver Behavior Questionnaire (DBQ). Low to moderate correlations between driver and observer evaluations of driving skills and behavior, mainly on errors and violations of speed and traffic lights was found. Furthermore, the robust finding that drivers evaluate their driving performance as better than the expert was replicated. Over-positive appraisal was higher among drivers with higher error/violation score and with the ones that were evaluated by the expert as "unsafe". We suggest that the traffic environment might be regulated by increasing feedback indicators of errors and violations, which in turn might increase the insight into driving performance. Improving self-awareness by training and feedback sessions might play a key role for reducing the probability of risk in their driving activity. Copyright © 2013 Elsevier Ltd. All rights reserved.
Dribble Files: Methodologies to Evaluate Learning and Performance in Complex Environments

ERIC Educational Resources Information Center

Schrader, P. G.; Lawless, Kimberly A.

2007-01-01

Research in the area of technology learning environments is tremendously complex. Tasks performed in these contexts are highly cognitive and mostly invisible to the observer. The nature of performance in these contexts is explained not only by the outcome but also by the process. However, evaluating the learning process with respect to tasks…
A Regional Climate Model Evaluation System based on Satellite and other Observations

NASA Astrophysics Data System (ADS)

Lean, P.; Kim, J.; Waliser, D. E.; Hall, A. D.; Mattmann, C. A.; Granger, S. L.; Case, K.; Goodale, C.; Hart, A.; Zimdars, P.; Guan, B.; Molotch, N. P.; Kaki, S.

2010-12-01

Regional climate models are a fundamental tool needed for downscaling global climate simulations and projections, such as those contributing to the Coupled Model Intercomparison Projects (CMIPs) that form the basis of the IPCC Assessment Reports. The regional modeling process provides the means to accommodate higher resolution and a greater complexity of Earth System processes. Evaluation of both the global and regional climate models against observations is essential to identify model weaknesses and to direct future model development efforts focused on reducing the uncertainty associated with climate projections. However, the lack of reliable observational data and the lack of formal tools are among the serious limitations to addressing these objectives. Recent satellite observations are particularly useful as they provide a wealth of information on many different aspects of the climate system, but due to their large volume and the difficulties associated with accessing and using the data, these datasets have been generally underutilized in model evaluation studies. Recognizing this problem, NASA JPL / UCLA is developing a model evaluation system to help make satellite observations, in conjunction with in-situ, assimilated, and reanalysis datasets, more readily accessible to the modeling community. The system includes a central database to store multiple datasets in a common format and codes for calculating predefined statistical metrics to assess model performance. This allows the time taken to compare model simulations with satellite observations to be reduced from weeks to days. Early results from the use this new model evaluation system for evaluating regional climate simulations over California/western US regions will be presented.
13 CFR 304.4 - Performance evaluations.

Code of Federal Regulations, 2011 CFR

2011-01-01

... 13 Business Credit and Assistance 1 2011-01-01 2011-01-01 false Performance evaluations. 304.4... ECONOMIC DEVELOPMENT DISTRICTS § 304.4 Performance evaluations. (a) EDA shall evaluate the management standards, financial accountability and program performance of each District Organization within three (3...
13 CFR 304.4 - Performance evaluations.

Code of Federal Regulations, 2014 CFR

2014-01-01

... 13 Business Credit and Assistance 1 2014-01-01 2014-01-01 false Performance evaluations. 304.4... ECONOMIC DEVELOPMENT DISTRICTS § 304.4 Performance evaluations. (a) EDA shall evaluate the management standards, financial accountability and program performance of each District Organization within three (3...
13 CFR 304.4 - Performance evaluations.

Code of Federal Regulations, 2013 CFR

2013-01-01

... 13 Business Credit and Assistance 1 2013-01-01 2013-01-01 false Performance evaluations. 304.4... ECONOMIC DEVELOPMENT DISTRICTS § 304.4 Performance evaluations. (a) EDA shall evaluate the management standards, financial accountability and program performance of each District Organization within three (3...
Clinical performance of a glass ionomer restorative system: a 6-year evaluation.

PubMed

Gurgan, Sevil; Kutuk, Zeynep Bilge; Ergin, Esra; Oztas, Sema Seval; Cakir, Filiz Yalcin

2017-09-01

The aim of this study is to evaluate the long-term clinical performance of a glass ionomer (GI) restorative system in the restoration of posterior teeth compared with a micro-filled hybrid posterior composite. A total of 140 (80 Cl1 and 60 Cl2) lesions in 59 patients were restored with a GI system (Equia) or a micro hybrid composite (Gradia Direct). Restorations were evaluated at baseline and yearly during 6 years according to the modified-USPHS criteria. Negative replicas at each recall were observed under SEM to evaluate surface characteristics. Data were analyzed with Cohcran's Q and McNemar's tests (p < 0.05). One hundred fifteen (70 Cl1 and 45 Cl2) restorations were evaluated in 47 patients with a recall rate of 79.6% at 6 years. Significant differences were found in marginal adaptation and marginal discoloration for both restorative materials for Cl1 and Cl2 restorations (p < 0.05). However, none of the materials were superior to the other (p > 0.05). A significant decrease in color match was observed in Equia restorations (p < 0.05). Only one Cl2 Equia restoration was missing at 3 years and another one at 4 years. No failures were observed at 5 and 6 years. Both materials exhibited clinically successful performance after 6 years. SEM evaluations were in accordance with the clinical findings. Both materials showed a good clinical performance for the restoration of posterior teeth during the 6-year evaluation. The clinical effectiveness of Equia and Gradia Direct Posterior was acceptable in Cl1 and Cl2 cavities subsequent to 6-year evaluation.

The effects of stereotypes and observer pressure on athletic performance.

PubMed

Krendl, Anne; Gainsburg, Izzy; Ambady, Nalini

2012-02-01

Although the effects of negative stereotypes and observer pressure on athletic performance have been well researched, the effects of positive stereotypes on performance, particularly in the presence of observers, is not known. In the current study, White males watched a video either depicting Whites basketball players as the best free throwers in the NBA (positive stereotype), Black basketball players as the best free throwers in the NBA (negative stereotype), or a neutral sports video (control). Participants then shot a set of free throws, during which half the participants were also videotaped (observer condition), whereas the other half were not (no observer condition). Results demonstrated that positive stereotypes improved free throw performance, but only in the no observer condition. Interestingly, observer pressure interacted with the positive stereotype to lead to performance decrements. In the negative stereotype condition, performance decrements were observed both in the observer and no observer conditions.
A Focused Observation Tool Using Dreyfus Stages of Skill Acquisition as an Evaluative Scale.

PubMed

Driver, Richard; Grose, Brian; Serafini, Mario; Cottrell, Scott; Sizemore, Daniel; Vallejo, Manuel

2017-01-01

Focused Observartion (FO) is associated with assessing complex skills and differs from generalized observations and evaluations. We've developed a FO assessing clinical procedural skills using Hubert Dreyfus Stages of Skill Acquisition as descriptive anchors. This study sought to analyze the effectiveness of this measure of skill progression. During week 1 and week 4 of training, FO was performed repetitively on 6 residents during endotracheal intubation. Skill stage ratings were converted to numerical scores. A dependent, paired samples t-test was calculated using total mean score (dependent variable) and an effect size. (Cohen's d) was performed to ascertain the standardized mean difference between observations. A significant improvement in mean scores occurred between Week 1 (AVG 1.2, STDV ± 0.1) and Week 4 (AVG 2.0, STDV ± 0.1) (t= -3.9, p<.05) Calculated Chohen's d indicates that this difference was meaningful. This study demonstrates success in adapting a Focused Observation technique and an innovative evaluative scale based upon Dreyfus stages of skill acquisition.
Evacuation performance evaluation tool.

PubMed

Farra, Sharon; Miller, Elaine T; Gneuhs, Matthew; Timm, Nathan; Li, Gengxin; Simon, Ashley; Brady, Whittney

2016-01-01

Hospitals conduct evacuation exercises to improve performance during emergency events. An essential aspect in this process is the creation of reliable and valid evaluation tools. The objective of this article is to describe the development and implications of a disaster evacuation performance tool that measures one portion of the very complex process of evacuation. Through the application of the Delphi technique and DeVellis's framework, disaster and neonatal experts provided input in developing this performance evaluation tool. Following development, content validity and reliability of this tool were assessed. Large pediatric hospital and medical center in the Midwest. The tool was pilot tested with an administrative, medical, and nursing leadership group and then implemented with a group of 68 healthcare workers during a disaster exercise of a neonatal intensive care unit (NICU). The tool has demonstrated high content validity with a scale validity index of 0.979 and inter-rater reliability G coefficient (0.984, 95% CI: 0.948-0.9952). The Delphi process based on the conceptual framework of DeVellis yielded a psychometrically sound evacuation performance evaluation tool for a NICU.
Evaluating Music Teachers: A Comparison of Evaluations by Observers with Varied Levels of Musical and Observational Background

ERIC Educational Resources Information Center

Hirokawa, Joy Ondra

2013-01-01

The purpose of this research was to examine the differences in the evaluations of music teachers conducted by individuals with varying backgrounds in music and observation techniques. Part I compared evaluations completed by school administrators and music department leadership. Part II utilized the findings of Part I to create focused and…
Evaluation of internal noise methods for Hotelling observers

NASA Astrophysics Data System (ADS)

Zhang, Yani; Pham, Binh T.; Eckstein, Miguel P.

2005-04-01

Including internal noise in computer model observers to degrade model observer performance to human levels is a common method to allow for quantitatively comparisons of human and model performance. In this paper, we studied two different types of methods for injecting internal noise to Hotelling model observers. The first method adds internal noise to the output of the individual channels: a) Independent non-uniform channel noise, b) Independent uniform channel noise. The second method adds internal noise to the decision variable arising from the combination of channel responses: a) internal noise standard deviation proportional to decision variable's standard deviation due to the external noise, b) internal noise standard deviation proportional to decision variable's variance caused by the external noise. We tested the square window Hotelling observer (HO), channelized Hotelling observer (CHO), and Laguerre-Gauss Hotelling observer (LGHO). The studied task was detection of a filling defect of varying size/shape in one of four simulated arterial segment locations with real x-ray angiography backgrounds. Results show that the internal noise method that leads to the best prediction of human performance differs across the studied models observers. The CHO model best predicts human observer performance with the channel internal noise. The HO and LGHO best predict human observer performance with the decision variable internal noise. These results might help explain why previous studies have found different results on the ability of each Hotelling model to predict human performance. Finally, the present results might guide researchers with the choice of method to include internal noise into their Hotelling models.
Development of a measure of student self-evaluation of physics exam performance

NASA Astrophysics Data System (ADS)

Hagedorn, Eric Anthony

The central purpose of this study was to provide preliminary evidence of the reliability and validity of the SEVSI - P (Self- evaluation scaled instrument - physics). This instrument, designed to measure student self-evaluation of physics exam performance, was developed in congruence with social cognitive theory. Self-evaluation in this study is defined to consist of two of the three subprocesses of self-regulation: self-observation and judgmental process. As such, the SEVSI - P consists of two subscales, one measuring the frequency and types of self-observations made during a physics exam and one measuring the frequency and types of judgmental comparisons made after an exam. Data from 621 completed surveys, voluntarily taken by first semester algebra/trigonometry based physics students at six Midwestern universities and one Southern university, were analyzed for reliability and factorial validity. Cronbach alphas of .71 and .83 for the self-observation and judgment subscales, respectively, indicate acceptable reliability for the instrument. Confirmatory factor analysis indicates the acceptability of the hypothesis that the data analyzed could have indeed been obtained from the proposed two factor model (self-observation and judgment). The results of this confirmatory factor analysis provide preliminary construct validity for this instrument. A number of theoretically related items were included on the SEVSI - P form to elicity information about the use of goals and pre-planned strategies, actions taken in response to previous poor performances, and emotional responses to performance. A correlational analysis of these items along with the self-observation and judgment subscale scores provided a limited degree of convergent validity for the two subscales. Analyses of variance were done to determine the presence of differences in scoring patterns based on gender or reported ethnic origin. These results indicate slightly higher judgment subscale scores for women and
48 CFR 8.606 - Evaluating FPI performance.

Code of Federal Regulations, 2010 CFR

2010-10-01

.... 8.606 Evaluating FPI performance. Agencies shall evaluate FPI contract performance in accordance with subpart 42.15. Performance evaluations do not negate the requirements of 8.602 and 8.604, but they... 48 Federal Acquisition Regulations System 1 2010-10-01 2010-10-01 false Evaluating FPI performance...
Performance Evaluation: A Deadly Disease?

ERIC Educational Resources Information Center

Aluri, Rao; Reichel, Mary

1994-01-01

W. Edwards Deming condemned performance evaluations as a deadly disease afflicting American management. He argued that performance evaluations nourish fear, encourage short-term thinking, stifle teamwork, and are no better than lotteries. This article examines library literature from Deming's perspective. Although that literature accepts…
The supervisor's performance appraisal: evaluating the evaluator.

PubMed

McConnell, C R

1993-04-01

The focus of much performance appraisal in the coming decade or so will likely be on the level of customer satisfaction achieved through performance. Ultimately, evaluating the evaluator--that is, appraising the supervisor--will likely become a matter of assessing how well the supervisor's department meets the needs of its customers. Since meeting the needs of one's customers can well become the strongest determinant of organizational success or failure, it follows that relative success in ensuring these needs are met can become the primary indicator of one's relative success as a supervisor. This has the effect of placing the emphasis on supervisory performance exactly at the point it belongs, right on the bottom-line results of the supervisor's efforts.
40 CFR Table 6 to Subpart Wwww of... - Basic Requirements for Performance Tests, Performance Evaluations, and Design Evaluations for New...

Code of Federal Regulations, 2013 CFR

2013-07-01

... Tests, Performance Evaluations, and Design Evaluations for New and Existing Sources Using Add-On Control... Performance Tests, Performance Evaluations, and Design Evaluations for New and Existing Sources Using Add-On Control Devices As required in § 63.5850 you must conduct performance tests, performance evaluations, and...
40 CFR Table 6 to Subpart Wwww of... - Basic Requirements for Performance Tests, Performance Evaluations, and Design Evaluations for New...

Code of Federal Regulations, 2014 CFR

2014-07-01

... Tests, Performance Evaluations, and Design Evaluations for New and Existing Sources Using Add-On Control... Performance Tests, Performance Evaluations, and Design Evaluations for New and Existing Sources Using Add-On Control Devices As required in § 63.5850 you must conduct performance tests, performance evaluations, and...
40 CFR Table 6 to Subpart Wwww of... - Basic Requirements for Performance Tests, Performance Evaluations, and Design Evaluations for New...

Code of Federal Regulations, 2012 CFR

2012-07-01

... Tests, Performance Evaluations, and Design Evaluations for New and Existing Sources Using Add-On Control... Performance Tests, Performance Evaluations, and Design Evaluations for New and Existing Sources Using Add-On Control Devices As required in § 63.5850 you must conduct performance tests, performance evaluations, and...
Evaluating Behavioural Observation Audiometry with Handicapped Children.

ERIC Educational Resources Information Center

Flexer, Carol; Gans, Donald P.

1982-01-01

Three observers evaluated the responses to sound with 21 mild to severely handicapped children (7 months to 10 years old) on Behavioural Observation Audiometry, an alternative to conditioning paradigms in audiometric assessment. Results showed that inter-observer agreement was high and that responsitivity was not affected by stimulus presentation…
Application of Lidar Data to the Performance Evaluations of ...

EPA Pesticide Factsheets

The Tropospheric Ozone (O3) Lidar Network (TOLNet) provides time/height O3 measurements from near the surface to the top of the troposphere to describe in high-fidelity spatial-temporal distributions, which is uniquely useful to evaluate the temporal evolution of O3 profiles in air quality models. This presentation describes the application of the Lidar data to the performance evaluation of CMAQ simulated O3 vertical profiles during the summer, 2014. Two-way coupled WRF-CMAQ simulations with 12km and 4km domains centered over Boulder, Colorado were performed during this time period. The analysis on the time series of observed and modeled O3 mixing ratios at different vertical layers indicates that the model frequently underestimated the observed values, and the underestimation was amplified in the middle model layers (~1km above the ground). When the lightning strikes detected by the National Lightning Detection Network (NLDN) were analyzed along with the observed O3 time series, it was found that the daily maximum O3 mixing ratios correlated well with the lightning strikes in the vicinity of the Lidar station. The analysis on temporal vertical profiles of both observed and modeled O3 mixing ratios on episodic days suggests that the model resolutions (12km and 4km) do not make any significant difference for this analysis (at this specific location and simulation period), but high O3 levels in the middle layers were linked to lightning activity that occurred in t
Tip of the Tongue States Increase Under Evaluative Observation.

PubMed

James, Lori E; Schmank, Christopher J; Castro, Nichol; Buchanan, Tony W

2018-02-01

We tested the frequent assumption that the difficulty of word retrieval increases when a speaker is being observed and evaluated. We modified the Trier Social Stress Test (TSST) so that participants believed that its evaluative observation components continued throughout the duration of a subsequent word retrieval task, and measured participants' reported tip of the tongue (TOT) states. Participants in this TSST condition experienced more TOTs than participants in a comparable, placebo TSST condition in which there was no suggestion of evaluative observation. This experiment provides initial evidence confirming the assumption that evaluative observation by a third party can be disruptive to word retrieval. We interpret our findings by proposing an extension to a well-supported theoretical model of TOTs.
Market behavior and performance of different strategy evaluation schemes

NASA Astrophysics Data System (ADS)

Baek, Yongjoo; Lee, Sang Hoon; Jeong, Hawoong

2010-08-01

Strategy evaluation schemes are a crucial factor in any agent-based market model, as they determine the agents’ strategy preferences and consequently their behavioral pattern. This study investigates how the strategy evaluation schemes adopted by agents affect their performance in conjunction with the market circumstances. We observe the performance of three strategy evaluation schemes, the history-dependent wealth game, the trend-opposing minority game, and the trend-following majority game, in a stock market where the price is exogenously determined. The price is either directly adopted from the real stock market indices or generated with a Markov chain of order ≤2 . Each scheme’s success is quantified by average wealth accumulated by the traders equipped with the scheme. The wealth game, as it learns from the history, shows relatively good performance unless the market is highly unpredictable. The majority game is successful in a trendy market dominated by long periods of sustained price increase or decrease. On the other hand, the minority game is suitable for a market with persistent zigzag price patterns. We also discuss the consequence of implementing finite memory in the scoring processes of strategies. Our findings suggest under which market circumstances each evaluation scheme is appropriate for modeling the behavior of real market traders.
Market behavior and performance of different strategy evaluation schemes.

PubMed

Baek, Yongjoo; Lee, Sang Hoon; Jeong, Hawoong

2010-08-01

Strategy evaluation schemes are a crucial factor in any agent-based market model, as they determine the agents' strategy preferences and consequently their behavioral pattern. This study investigates how the strategy evaluation schemes adopted by agents affect their performance in conjunction with the market circumstances. We observe the performance of three strategy evaluation schemes, the history-dependent wealth game, the trend-opposing minority game, and the trend-following majority game, in a stock market where the price is exogenously determined. The price is either directly adopted from the real stock market indices or generated with a Markov chain of order ≤2 . Each scheme's success is quantified by average wealth accumulated by the traders equipped with the scheme. The wealth game, as it learns from the history, shows relatively good performance unless the market is highly unpredictable. The majority game is successful in a trendy market dominated by long periods of sustained price increase or decrease. On the other hand, the minority game is suitable for a market with persistent zigzag price patterns. We also discuss the consequence of implementing finite memory in the scoring processes of strategies. Our findings suggest under which market circumstances each evaluation scheme is appropriate for modeling the behavior of real market traders.
A Self-Evaluation Instrument for Work Performance and Support Needs

ERIC Educational Resources Information Center

Brady, Michael P.; Rosenberg, Howard; Frain, Michael P.

2008-01-01

Involvement of students and adult employees into the decisions that affect their education and employment can improve their transition into supported employment. One means for increasing involvement into these decisions is to gain their input into performance evaluations and support needs. The "Job Observation and Behavior Scale: Opportunity for…
Evaluation of medical management during a mass casualty incident exercise: an objective assessment tool to enhance direct observation.

PubMed

Ingrassia, Pier Luigi; Prato, Federico; Geddo, Alessandro; Colombo, Davide; Tengattini, Marco; Calligaro, Sara; La Mura, Fabrizio; Franc, Jeffrey Michael; Della Corte, Francesco

2010-11-01

Functional exercises represent an important link between disaster planning and disaster response. Although these exercises are widely performed, no standardized method exists for their evaluation. To describe a simple and objective method to assess medical performance during functional exercise events. An evaluation tool comprising three data fields (triage, clinical maneuvers, and radio usage), accompanied by direct anecdotal observational methods, was used to evaluate a large functional mass casualty incident exercise. Seventeen medical responders managed 112 victims of a simulated building explosion. Although 81% of the patients were assigned the appropriate triage codes, evacuation from the site did not follow in priority. Required maneuvers were performed correctly in 85.2% of airway maneuvers and 78.7% of breathing maneuvers, however, significant under-treatment occurred, possibly due to equipment shortages. Extensive use of radio communication was documented. In evaluating this tool, the structured markers were informative, but further information provided by direct observation was invaluable. A three-part tool (triage, medical maneuvers, and radio usage) can provide a method to evaluate functional mass casualty incident exercises, and is easily implemented. For the best results, it should be used in conjunction with direct observation. The evaluation tool has great potential as a reproducible and internationally recognized tool for evaluating disaster management exercises. Copyright © 2010 Elsevier Inc. All rights reserved.
40 CFR Table 6 to Subpart Wwww of... - Basic Requirements for Performance Tests, Performance Evaluations, and Design Evaluations for New...

Code of Federal Regulations, 2010 CFR

2010-07-01

... Tests, Performance Evaluations, and Design Evaluations for New and Existing Sources Using Add-On Control... Tests, Performance Evaluations, and Design Evaluations for New and Existing Sources Using Add-On Control Devices As required in § 63.5850 you must conduct performance tests, performance evaluations, and design...

40 CFR Table 6 to Subpart Wwww of... - Basic Requirements for Performance Tests, Performance Evaluations, and Design Evaluations for New...

Code of Federal Regulations, 2011 CFR

2011-07-01

... Tests, Performance Evaluations, and Design Evaluations for New and Existing Sources Using Add-On Control... Tests, Performance Evaluations, and Design Evaluations for New and Existing Sources Using Add-On Control Devices As required in § 63.5850 you must conduct performance tests, performance evaluations, and design...
Performance evaluation of an agent-based occupancy simulation model

DOE PAGES

Luo, Xuan; Lam, Khee Poh; Chen, Yixing; ...

2017-01-17

Occupancy is an important factor driving building performance. Static and homogeneous occupant schedules, commonly used in building performance simulation, contribute to issues such as performance gaps between simulated and measured energy use in buildings. Stochastic occupancy models have been recently developed and applied to better represent spatial and temporal diversity of occupants in buildings. However, there is very limited evaluation of the usability and accuracy of these models. This study used measured occupancy data from a real office building to evaluate the performance of an agent-based occupancy simulation model: the Occupancy Simulator. The occupancy patterns of various occupant types weremore » first derived from the measured occupant schedule data using statistical analysis. Then the performance of the simulation model was evaluated and verified based on (1) whether the distribution of observed occupancy behavior patterns follows the theoretical ones included in the Occupancy Simulator, and (2) whether the simulator can reproduce a variety of occupancy patterns accurately. Results demonstrated the feasibility of applying the Occupancy Simulator to simulate a range of occupancy presence and movement behaviors for regular types of occupants in office buildings, and to generate stochastic occupant schedules at the room and individual occupant levels for building performance simulation. For future work, model validation is recommended, which includes collecting and using detailed interval occupancy data of all spaces in an office building to validate the simulated occupant schedules from the Occupancy Simulator.« less
Performance evaluation of an agent-based occupancy simulation model

DOE Office of Scientific and Technical Information (OSTI.GOV)

Luo, Xuan; Lam, Khee Poh; Chen, Yixing

Occupancy is an important factor driving building performance. Static and homogeneous occupant schedules, commonly used in building performance simulation, contribute to issues such as performance gaps between simulated and measured energy use in buildings. Stochastic occupancy models have been recently developed and applied to better represent spatial and temporal diversity of occupants in buildings. However, there is very limited evaluation of the usability and accuracy of these models. This study used measured occupancy data from a real office building to evaluate the performance of an agent-based occupancy simulation model: the Occupancy Simulator. The occupancy patterns of various occupant types weremore » first derived from the measured occupant schedule data using statistical analysis. Then the performance of the simulation model was evaluated and verified based on (1) whether the distribution of observed occupancy behavior patterns follows the theoretical ones included in the Occupancy Simulator, and (2) whether the simulator can reproduce a variety of occupancy patterns accurately. Results demonstrated the feasibility of applying the Occupancy Simulator to simulate a range of occupancy presence and movement behaviors for regular types of occupants in office buildings, and to generate stochastic occupant schedules at the room and individual occupant levels for building performance simulation. For future work, model validation is recommended, which includes collecting and using detailed interval occupancy data of all spaces in an office building to validate the simulated occupant schedules from the Occupancy Simulator.« less
13 CFR 304.4 - Performance evaluations.

Code of Federal Regulations, 2010 CFR

2010-01-01

... 13 Business Credit and Assistance 1 2010-01-01 2010-01-01 false Performance evaluations. 304.4 Section 304.4 Business Credit and Assistance ECONOMIC DEVELOPMENT ADMINISTRATION, DEPARTMENT OF COMMERCE ECONOMIC DEVELOPMENT DISTRICTS § 304.4 Performance evaluations. (a) EDA shall evaluate the management...
Correlation between model observer and human observer performance in CT imaging when lesion location is uncertain

DOE Office of Scientific and Technical Information (OSTI.GOV)

Leng, Shuai; Yu, Lifeng; Zhang, Yi

2013-08-15

Purpose: The purpose of this study was to investigate the correlation between model observer and human observer performance in CT imaging for the task of lesion detection and localization when the lesion location is uncertain.Methods: Two cylindrical rods (3-mm and 5-mm diameters) were placed in a 35 × 26 cm torso-shaped water phantom to simulate lesions with −15 HU contrast at 120 kV. The phantom was scanned 100 times on a 128-slice CT scanner at each of four dose levels (CTDIvol = 5.7, 11.4, 17.1, and 22.8 mGy). Regions of interest (ROIs) around each lesion were extracted to generate imagesmore » with signal-present, with each ROI containing 128 × 128 pixels. Corresponding ROIs of signal-absent images were generated from images without lesion mimicking rods. The location of the lesion (rod) in each ROI was randomly distributed by moving the ROIs around each lesion. Human observer studies were performed by having three trained observers identify the presence or absence of lesions, indicating the lesion location in each image and scoring confidence for the detection task on a 6-point scale. The same image data were analyzed using a channelized Hotelling model observer (CHO) with Gabor channels. Internal noise was added to the decision variables for the model observer study. Area under the curve (AUC) of ROC and localization ROC (LROC) curves were calculated using a nonparametric approach. The Spearman's rank order correlation between the average performance of the human observers and the model observer performance was calculated for the AUC of both ROC and LROC curves for both the 3- and 5-mm diameter lesions.Results: In both ROC and LROC analyses, AUC values for the model observer agreed well with the average values across the three human observers. The Spearman's rank order correlation values for both ROC and LROC analyses for both the 3- and 5-mm diameter lesions were all 1.0, indicating perfect rank ordering agreement of the figures of merit (AUC) between
Evaluation of modern cotton harvest systems on irrigated cotton: harvester performance

USDA-ARS?s Scientific Manuscript database

Picker and stripper harvest systems were evaluated on production-scale irrigated cotton on the High Plains of Texas over three harvest seasons. Observations on harvester performance, including time-in-motion, harvest loss, seed cotton composition, and turnout, were conducted at seven locations with...
Performance Evaluation of Nano-JASMINE

NASA Astrophysics Data System (ADS)

Hatsutori, Y.; Kobayashi, Y.; Gouda, N.; Yano, T.; Murooka, J.; Niwa, Y.; Yamada, Y.

We report the results of performance evaluation of the first Japanese astrometry satellite, Nano-JASMINE. It is a very small satellite and weighs only 35 kg. It aims to carry out astrometry measurement of nearby bright stars (z ≤ 7.5 mag) with an accuracy of 3 milli-arcseconds. Nano-JASMINE will be launched by Cyclone-4 rocket in August 2011 from Brazil. The current status is in the process of evaluating the performances. A series of performance tests and numerical analysis were conducted. As a result, the engineering model (EM) of the telescope was measured to be achieving a diffraction-limited performance and confirmed that it has enough performance for scientific astrometry.
Administrators' Views on Teacher Evaluation: Examining Ontario's Teacher Performance Appraisal

ERIC Educational Resources Information Center

Maharaj, Sachin

2014-01-01

This study examines the views of administrators (i.e., principals and vice-principals) in Ontario, Canada, with regard to the province's Teacher Performance Appraisal process. A total of 178 responses were collected from a survey that examined five areas: 1) preparation and training; 2) classroom observations; 3) preparing the formal evaluation;…
Robotic and clinical evaluation of upper limb motor performance in patients with Friedreich's Ataxia: an observational study.

PubMed

Germanotta, Marco; Vasco, Gessica; Petrarca, Maurizio; Rossi, Stefano; Carniel, Sacha; Bertini, Enrico; Cappa, Paolo; Castelli, Enrico

2015-04-23

Friedreich's ataxia (FRDA) is the most common hereditary autosomal recessive form of ataxia. In this disease there is early manifestation of gait ataxia, and dysmetria of the arms and legs which causes impairment in daily activities that require fine manual dexterity. To date there is no cure for this disease. Some novel therapeutic approaches are ongoing in different steps of clinical trial. Development of sensitive outcome measures is crucial to prove therapeutic effectiveness. The aim of the study was to assess the reliability and sensitivity of quantitative and objective assessment of upper limb performance computed by means of the robotic device and to evaluate the correlation with clinical and functional markers of the disease severity. Here we assess upper limb performances by means of the InMotion Arm Robot, a robot designed for clinical neurological applications, in a cohort of 14 children and young adults affected by FRDA, matched for age and gender with 18 healthy subjects. We focused on the analysis of kinematics, accuracy, smoothness, and submovements of the upper limb while reaching movements were performed. The robotic evaluation of upper limb performance consisted of planar reaching movements performed with the robotic system. The motors of the robot were turned off, so that the device worked as a measurement tool. The status of the disease was scored using the Scale for the Assessment and Rating of Ataxia (SARA). Relationships between robotic indices and a range of clinical and disease characteristics were examined. All our robotic indices were significantly different between the two cohorts except for two, and were highly and reliably discriminative between healthy and subjects with FRDA. In particular, subjects with FRDA exhibited slower movements as well as loss of accuracy and smoothness, which are typical of the disease. Duration of Movement, Normalized Jerk, and Number of Submovements were the best discriminative indices, as they were
Comparing masked target transform volume (MTTV) clutter metric to human observer evaluation of visual clutter

NASA Astrophysics Data System (ADS)

Camp, H. A.; Moyer, Steven; Moore, Richard K.

2010-04-01

The Night Vision and Electronic Sensors Directorate's current time-limited search (TLS) model, which makes use of the targeting task performance (TTP) metric to describe image quality, does not explicitly account for the effects of visual clutter on observer performance. The TLS model is currently based on empirical fits to describe human performance for a time of day, spectrum and environment. Incorporating a clutter metric into the TLS model may reduce the number of these empirical fits needed. The masked target transform volume (MTTV) clutter metric has been previously presented and compared to other clutter metrics. Using real infrared imagery of rural images with varying levels of clutter, NVESD is currently evaluating the appropriateness of the MTTV metric. NVESD had twenty subject matter experts (SME) rank the amount of clutter in each scene in a series of pair-wise comparisons. MTTV metric values were calculated and then compared to the SME observers rankings. The MTTV metric ranked the clutter in a similar manner to the SME evaluation, suggesting that the MTTV metric may emulate SME response. This paper is a first step in quantifying clutter and measuring the agreement to subjective human evaluation.
Evaluation of EIT system performance.

PubMed

Yasin, Mamatjan; Böhm, Stephan; Gaggero, Pascal O; Adler, Andy

2011-07-01

An electrical impedance tomography (EIT) system images internal conductivity from surface electrical stimulation and measurement. Such systems necessarily comprise multiple design choices from cables and hardware design to calibration and image reconstruction. In order to compare EIT systems and study the consequences of changes in system performance, this paper describes a systematic approach to evaluate the performance of the EIT systems. The system to be tested is connected to a saline phantom in which calibrated contrasting test objects are systematically positioned using a position controller. A set of evaluation parameters are proposed which characterize (i) data and image noise, (ii) data accuracy, (iii) detectability of single contrasts and distinguishability of multiple contrasts, and (iv) accuracy of reconstructed image (amplitude, resolution, position and ringing). Using this approach, we evaluate three different EIT systems and illustrate the use of these tools to evaluate and compare performance. In order to facilitate the use of this approach, all details of the phantom, test objects and position controller design are made publicly available including the source code of the evaluation and reporting software.
A new global and comprehensive model for ICU ventilator performances evaluation.

PubMed

Marjanovic, Nicolas S; De Simone, Agathe; Jegou, Guillaume; L'Her, Erwan

2017-12-01

This study aimed to provide a new global and comprehensive evaluation of recent ICU ventilators taking into account both technical performances and ergonomics. Six recent ICU ventilators were evaluated. Technical performances were assessed under two FIO 2 levels (100%, 50%), three respiratory mechanics combinations (Normal: compliance [C] = 70 mL cmH 2 O -1 /resistance [R] = 5 cmH 2 O L -1 s -1 ; Restrictive: C = 30/R = 10; Obstructive: C = 120/R = 20), four exponential levels of leaks (from 0 to 12.5 L min -1 ) and three levels of inspiratory effort (P0.1 = 2, 4 and 8 cmH 2 O), using an automated test lung. Ergonomics were evaluated by 20 ICU physicians using a global and comprehensive model involving physiological response to stress measurements (heart rate, respiratory rate, tidal volume variability and eye tracking), psycho-cognitive scales (SUS and NASA-TLX) and objective tasks completion. Few differences in terms of technical performance were observed between devices. Non-invasive ventilation modes had a huge influence on asynchrony occurrence. Using our global model, either objective tasks completion, psycho-cognitive scales and/or physiological measurements were able to depict significant differences in terms of devices' usability. The level of failure that was observed with some devices depicted the lack of adaptation of device's development to end users' requests. Despite similar technical performance, some ICU ventilators exhibit low ergonomics performance and a high risk of misusage.
Dentin bonding performance and interface observation of an MMA-based restorative material.

PubMed

Shinagawa, Junichi; Inoue, Go; Nikaido, Toru; Ikeda, Masaomi; Sadr, Alireza; Tagami, Junji

2016-07-30

The purpose of this study was to evaluate bonding performance and dentin interface acid resistance using a 4-META/MMA-TBB based restorative material (BF) compared to a conventional 4-META/MMA-TBB resin cement (SB), and the effect of sodium fluoride (NaF) addition to the materials. Dentin surfaces were treated with 10% citric acid-3% ferric chloride (10-3) or 4-META containing self-etching primer (TP), followed by application of BF or SB polymer powders with or without NaF, to evaluate microtensile bond strength (µTBS) in six experimental groups; 10-3/SB, 10-3/BF, TP/SB, TP/BF, TP/SB/NaF and TP/BF/NaF. SEM observation of the resin-dentin interface was performed after acid-base challenge to evaluate interfacial dentin resistance to acid attack. TP/BF showed highest µTBS, while NaF polymers decreased µTBS. TP/BF showed funnel-shaped erosion at the interface, however, NaF polymers improved acid resistance of interface. In conclusion, BF demonstrated high µTBSs and low acid-resistance at the interface. NaF addition enhanced acid resistance but decreased µTBS.
Evaluating climate models: Should we use weather or climate observations?

DOE Office of Scientific and Technical Information (OSTI.GOV)

Oglesby, Robert J; Erickson III, David J

2009-12-01

Calling the numerical models that we use for simulations of climate change 'climate models' is a bit of a misnomer. These 'general circulation models' (GCMs, AKA global climate models) and their cousins the 'regional climate models' (RCMs) are actually physically-based weather simulators. That is, these models simulate, either globally or locally, daily weather patterns in response to some change in forcing or boundary condition. These simulated weather patterns are then aggregated into climate statistics, very much as we aggregate observations into 'real climate statistics'. Traditionally, the output of GCMs has been evaluated using climate statistics, as opposed to their abilitymore » to simulate realistic daily weather observations. At the coarse global scale this may be a reasonable approach, however, as RCM's downscale to increasingly higher resolutions, the conjunction between weather and climate becomes more problematic. We present results from a series of present-day climate simulations using the WRF ARW for domains that cover North America, much of Latin America, and South Asia. The basic domains are at a 12 km resolution, but several inner domains at 4 km have also been simulated. These include regions of complex topography in Mexico, Colombia, Peru, and Sri Lanka, as well as a region of low topography and fairly homogeneous land surface type (the U.S. Great Plains). Model evaluations are performed using standard climate analyses (e.g., reanalyses; NCDC data) but also using time series of daily station observations. Preliminary results suggest little difference in the assessment of long-term mean quantities, but the variability on seasonal and interannual timescales is better described. Furthermore, the value-added by using daily weather observations as an evaluation tool increases with the model resolution.« less
Predicting detection performance with model observers: Fourier domain or spatial domain?

PubMed

Chen, Baiyu; Yu, Lifeng; Leng, Shuai; Kofler, James; Favazza, Christopher; Vrieze, Thomas; McCollough, Cynthia

2016-02-27

The use of Fourier domain model observer is challenged by iterative reconstruction (IR), because IR algorithms are nonlinear and IR images have noise texture different from that of FBP. A modified Fourier domain model observer, which incorporates nonlinear noise and resolution properties, has been proposed for IR and needs to be validated with human detection performance. On the other hand, the spatial domain model observer is theoretically applicable to IR, but more computationally intensive than the Fourier domain method. The purpose of this study is to compare the modified Fourier domain model observer to the spatial domain model observer with both FBP and IR images, using human detection performance as the gold standard. A phantom with inserts of various low contrast levels and sizes was repeatedly scanned 100 times on a third-generation, dual-source CT scanner at 5 dose levels and reconstructed using FBP and IR algorithms. The human detection performance of the inserts was measured via a 2-alternative-forced-choice (2AFC) test. In addition, two model observer performances were calculated, including a Fourier domain non-prewhitening model observer and a spatial domain channelized Hotelling observer. The performance of these two mode observers was compared in terms of how well they correlated with human observer performance. Our results demonstrated that the spatial domain model observer correlated well with human observers across various dose levels, object contrast levels, and object sizes. The Fourier domain observer correlated well with human observers using FBP images, but overestimated the detection performance using IR images.
Predicting detection performance with model observers: Fourier domain or spatial domain?

PubMed Central

Chen, Baiyu; Yu, Lifeng; Leng, Shuai; Kofler, James; Favazza, Christopher; Vrieze, Thomas; McCollough, Cynthia

2016-01-01

The use of Fourier domain model observer is challenged by iterative reconstruction (IR), because IR algorithms are nonlinear and IR images have noise texture different from that of FBP. A modified Fourier domain model observer, which incorporates nonlinear noise and resolution properties, has been proposed for IR and needs to be validated with human detection performance. On the other hand, the spatial domain model observer is theoretically applicable to IR, but more computationally intensive than the Fourier domain method. The purpose of this study is to compare the modified Fourier domain model observer to the spatial domain model observer with both FBP and IR images, using human detection performance as the gold standard. A phantom with inserts of various low contrast levels and sizes was repeatedly scanned 100 times on a third-generation, dual-source CT scanner at 5 dose levels and reconstructed using FBP and IR algorithms. The human detection performance of the inserts was measured via a 2-alternative-forced-choice (2AFC) test. In addition, two model observer performances were calculated, including a Fourier domain non-prewhitening model observer and a spatial domain channelized Hotelling observer. The performance of these two mode observers was compared in terms of how well they correlated with human observer performance. Our results demonstrated that the spatial domain model observer correlated well with human observers across various dose levels, object contrast levels, and object sizes. The Fourier domain observer correlated well with human observers using FBP images, but overestimated the detection performance using IR images. PMID:27239086
Virginia Transit Performance Evaluation Package (VATPEP).

DOT National Transportation Integrated Search

1987-01-01

The Virginia Transit Performance Evaluation Package (VATPEP), a computer software package, is documented. This is the computerized version of the methodology used by the Virginia Department of Transportation to evaluate the performance of public tran...
48 CFR 436.201 - Evaluation of contractor performance.

Code of Federal Regulations, 2010 CFR

2010-10-01

... Construction 436.201 Evaluation of contractor performance. Preparation of performance evaluation reports. In addition to the requirements of FAR 36.201, performance evaluation reports shall be prepared for indefinite... of services to be ordered exceeds $500,000.00. For these contracts, performance evaluation reports...
Evaluating Teacher Preparation Using Graduates' Observational Ratings

ERIC Educational Resources Information Center

Ronfeldt, Matthew; Campbell, Shanyce L.

2016-01-01

Despite growing calls for more accountability of teacher education programs (TEPs), there is little consensus about how to evaluate them. This study investigates the potential for using observational ratings of program completers to evaluate TEPs. Drawing on statewide data on almost 9,500 program completers, representing 44 providers (183…
48 CFR 1252.216-72 - Performance evaluation plan.

Code of Federal Regulations, 2010 CFR

2010-10-01

... 48 Federal Acquisition Regulations System 5 2010-10-01 2010-10-01 false Performance evaluation....216-72 Performance evaluation plan. As prescribed in (TAR) 48 CFR 1216.406(b), insert the following clause: Performance Evaluation Plan (OCT 1994) (a) A Performance Evaluation Plan shall be unilaterally...

Cognitive-evaluative features of childhood social anxiety in a performance task.

PubMed

Tuschen-Caffier, Brunna; Kühl, Sigrid; Bender, Caroline

2011-06-01

Using an experimental design, we analysed differences in the occurrence of cognitive-evaluative distortions and performance deficits across children with social anxiety disorder, with subclinical anxiety and without any anxiety symptoms. Twenty-one children with full syndrome social phobia, 18 children with partial syndrome social phobia and 20 children without any symptoms of social phobia were compared with respect to their degree of anxiety, negative thinking and task performance during two social-evaluative tasks. In addition, self-ratings of task performance, performance estimations for other children and objective behavioural ratings by two independent observers were obtained. Children with social anxiety disorder and subclinical social anxiety showed higher degrees of experienced anxiety and negative thinking than healthy control children. There was no group difference in respect to actual task performance. Findings are discussed with regard to the continuum assumption of childhood social anxiety disorder and the need of well-adapted early interventions. Copyright © 2010. Published by Elsevier Ltd.
Evidence for -Gz Adaptation Observed with Wearable Biosensors During High Performance Jet Flight.

PubMed

Rice, G Merrill; Snider, Dallas; Moore, Jeffrey L; Lavan, J Timothy; Folga, Rich; VanBrunt, Thomas B

2016-12-01

Few studies have evaluated physiological responses to high acceleration forces during actual flight and to our knowledge no normative data has been acquired by technologies such as wearable biosensors during high performance jet aircraft operations. In-flight physiological data from an FDA cleared portable triaxial accelerometer and bio-sensor were observed from five active duty F-18 pilots of the Naval Flight Demonstration Squadron (Blue Angels). Of the five pilots, three were formation pilots who flew lower G profiles and two were solo pilots who flew higher G profiles. Physiological parameters monitored were heart rate, respiratory rate, temperature, caloric expenditure, and duration of exposure to levels of acceleration. Evaluated were 25 practice demonstration flights; 9 flights were excluded secondary to incomplete or inaccurate physiological data. We observed no significant bradycardia during a total of 189 maneuvers which met inclusion criteria for push-pull events (PPE) or isolated -Gz exposures. Further analysis of 73 PPE revealed an overall significant rise in HR following the PPE, where mean heart rate was 106 (95% CI, 100:112) at the beginning of the push and 129 (95% CI, 123:135) following the pull. A majority of the flights monitored provided reliable physiological data. Initial data suggests, contrary to currently held aeromedical doctrine, maneuvers such as the "push-pull" do not evoke vasovagal based bradycardic responses in aerobatic pilots. Possible explanations for these findings are sympathetic nervous system activation through adaptation and/or sustained isometric resistance from control inputs, both of which are areas of future research for our team.Rice GM, Snider D, Moore JL, Lavan JT, Folga R, VanBrunt TB. Evidence for -Gz adaptation observed with wearable biosensors during high performance jet flight. Aerosp Med Hum Perform. 2016; 87(12):996-1003.
Evaluation of atmospheric dust prediction models using ground-based observations

NASA Astrophysics Data System (ADS)

Terradellas, Enric; María Baldasano, José; Cuevas, Emilio; Basart, Sara; Huneeus, Nicolás; Camino, Carlos; Dundar, Cinhan; Benincasa, Francesco

2013-04-01

An important step in numerical prediction of mineral dust is the model evaluation aimed to assess its performance to forecast the atmospheric dust content and to lead to new directions in model development and improvement. The first problem to address the evaluation is the scarcity of ground-based routine observations intended for dust monitoring. An alternative option would be the use of satellite products. They have the advantage of a large spatial coverage and a regular availability. However, they do have numerous drawbacks that make the quantitative retrievals of aerosol-related variables difficult and imprecise. This work presents the use of different ground-based observing systems for the evaluation of dust models in the Regional Center for Northern Africa, Middle East and Europe of the World Meteorological Organization (WMO) Sand and Dust Storm Warning Advisory and Assessment System (SDS-WAS). The dust optical depth at 550 nm forecast by different models is regularly compared with the AERONET measurements of Aerosol Optical Depth (AOD) for 40 selected stations. Photometric measurements are a powerful tool for remote sensing of the atmosphere allowing retrieval of aerosol properties, such as AOD. This variable integrates the contribution of different aerosol types, but may be complemented with spectral information that enables hypotheses about the nature of the particles. Comparison is restricted to cases with low Ångström exponent values in order to ensure that coarse mineral dust is the dominant aerosol type. Additionally to column dust load, it is important to evaluate dust surface concentration and dust vertical profiles. Air quality monitoring stations are the main source of data for the evaluation of surface concentration. However they are concentrated in populated and industrialized areas around the Mediterranean. In the present contribution, results of different models are compared with observations of PM10 from the Turkish air quality network for
Evaluation of Calibration Laboratories Performance

NASA Astrophysics Data System (ADS)

Filipe, Eduarda

2011-12-01

One of the main goals of interlaboratory comparisons (ILCs) is the evaluation of the laboratories performance for the routine calibrations they perform for the clients. In the frame of Accreditation of Laboratories, the national accreditation boards (NABs) in collaboration with the national metrology institutes (NMIs) organize the ILCs needed to comply with the requirements of the international accreditation organizations. In order that an ILC is a reliable tool for a laboratory to validate its best measurement capability (BMC), it is needed that the NMI (reference laboratory) provides a better traveling standard—in terms of accuracy class or uncertainty—than the laboratories BMCs. Although this is the general situation, there are cases where the NABs ask the NMIs to evaluate the performance of the accredited laboratories when calibrating industrial measuring instruments. The aim of this article is to discuss the existing approaches for the evaluation of ILCs and propose a basis for the validation of the laboratories measurement capabilities. An example is drafted with the evaluation of the results of mercury-in-glass thermometers ILC with 12 participant laboratories.
Human interaction with robotic systems: performance and workload evaluations.

PubMed

Reinerman-Jones, L; Barber, D J; Szalma, J L; Hancock, P A

2017-10-01

We first tested the effect of differing tactile informational forms (i.e. directional cues vs. static cues vs. dynamic cues) on objective performance and perceived workload in a collaborative human-robot task. A second experiment evaluated the influence of task load and informational message type (i.e. single words vs. grouped phrases) on that same collaborative task. In both experiments, the relationship of personal characteristics (attentional control and spatial ability) to performance and workload was also measured. In addition to objective performance and self-report of cognitive load, we evaluated different physiological responses in each experiment. Results showed a performance-workload association for directional cues, message type and task load. EEG measures however, proved generally insensitive to such task load manipulations. Where significant EEG effects were observed, right hemisphere amplitude differences predominated, although unexpectedly these latter relationships were negative. Although EEG measures were partially associated with performance, they appear to possess limited utility as measures of workload in association with tactile displays. Practitioner Summary: As practitioners look to take advantage of innovative tactile displays in complex operational realms like human-robotic interaction, associated performance effects are mediated by cognitive workload. Despite some patterns of association, reliable reflections of operator state can be difficult to discern and employ as the number, complexity and sophistication of these respective measures themselves increase.
High-definition television evaluation for remote handling task performance

NASA Astrophysics Data System (ADS)

Fujita, Y.; Omori, E.; Hayashi, S.; Draper, J. V.; Herndon, J. N.

Described are experiments designed to evaluate the impact of HDTV (High-Definition Television) on the performance of typical remote tasks. The experiments described in this paper compared the performance of four operators using HDTV with their performance while using other television systems. The experiments included four television systems: (1) high-definition color television, (2) high-definition monochromatic television, (3) standard-resolution monochromatic television, and (4) standard-resolution stereoscopic monochromatic television. The stereo system accomplished stereoscopy by displaying two cross-polarized images, one reflected by a half-silvered mirror and one seen through the mirror. Observers wore spectacles with cross-polarized lenses so that the left eye received only the view from the left camera and the right eye received only the view from the right camera.
Evaluation of medical command and control using performance indicators in a full-scale, major aircraft accident exercise.

PubMed

Gryth, Dan; Rådestad, Monica; Nilsson, Heléne; Nerf, Ola; Svensson, Leif; Castrén, Maaret; Rüter, Anders

2010-01-01

Large, functional, disaster exercises are expensive to plan and execute, and often are difficult to evaluate objectively. Command and control in disaster medicine organizations can benefit from objective results from disaster exercises to identify areas that must be improved. The objective of this pilot study was to examine if it is possible to use performance indicators for documentation and evaluation of medical command and control in a full-scale major incident exercise at two levels: (1) local level (scene of the incident and hospital); and (2) strategic level of command and control. Staff procedure skills also were evaluated. Trained observers were placed in each of the three command and control locations. These observers recorded and scored the performance of command and control using templates of performance indicators. The observers scored the level of performance by awarding 2, 1, or 0 points according to the template and evaluated content and timing of decisions. Results from 11 performance indicators were recorded at each template and scores greater than 11 were considered as acceptable. Prehospital command and control had the lowest score. This also was expressed by problems at the scene of the incident. The scores in management and staff skills were at the strategic level 15 and 17, respectively; and at the hospital level, 17 and 21, respectively. It is possible to use performance indicators in a full-scale, major incident exercise for evaluation of medical command and control. The results could be used to compare similar exercises and evaluate real incidents in the future.
Performance Evaluations of Ceramic Wafer Seals

NASA Technical Reports Server (NTRS)

Dunlap, Patrick H., Jr.; DeMange, Jeffrey J.; Steinetz, Bruce M.

2006-01-01

Future hypersonic vehicles will require high temperature, dynamic seals in advanced ramjet/scramjet engines and on the vehicle airframe to seal the perimeters of movable panels, flaps, and doors. Seal temperatures in these locations can exceed 2000 F, especially when the seals are in contact with hot ceramic matrix composite sealing surfaces. NASA Glenn Research Center is developing advanced ceramic wafer seals to meet the needs of these applications. High temperature scrub tests performed between silicon nitride wafers and carbon-silicon carbide rub surfaces revealed high friction forces and evidence of material transfer from the rub surfaces to the wafer seals. Stickage between adjacent wafers was also observed after testing. Several design changes to the wafer seals were evaluated as possible solutions to these concerns. Wafers with recessed sides were evaluated as a potential means of reducing friction between adjacent wafers. Alternative wafer materials are also being considered as a means of reducing friction between the seals and their sealing surfaces and because the baseline silicon nitride wafer material (AS800) is no longer commercially available.
Using Summative and Formative Assessments to Evaluate EFL Teachers' Teaching Performance

ERIC Educational Resources Information Center

Wei, Wei

2015-01-01

Using classroom observations (formative) and student course experience survey results (summative) to evaluate English lecturers' teaching performances is not new in practice, but surprisingly only a few studies have investigated this issue in a higher education context. This study was conducted in an English department of a large university in…
Performance Evaluation of Particle Sampling Probes for Emission Measurements of Aircraft Jet Engines

NASA Technical Reports Server (NTRS)

Lee, Poshin; Chen, Da-Ren; Sanders, Terry (Technical Monitor)

2001-01-01

Considerable attention has been recently received on the impact of aircraft-produced aerosols upon the global climate. Sampling particles directly from jet engines has been performed by different research groups in the U.S. and Europe. However, a large variation has been observed among published data on the conversion efficiency and emission indexes of jet engines. The variation results surely from the differences in test engine types, engine operation conditions, and environmental conditions. The other factor that could result in the observed variation is the performance of sampling probes used. Unfortunately, it is often neglected in the jet engine community. Particle losses during the sampling, transport, and dilution processes are often not discussed/considered in literatures. To address this issue, we evaluated the performance of one sampling probe by challenging it with monodisperse particles. A significant performance difference was observed on the sampling probe evaluated under different temperature conditions. Thermophoretic effect, nonisokinetic sampling and turbulence loss contribute to the loss of particles in sampling probes. The results of this study show that particle loss can be dramatic if the sampling probe is not well designed. Further, the result allows ones to recover the actual size distributions emitted from jet engines.
Tip of the Tongue States Increase under Evaluative Observation

ERIC Educational Resources Information Center

James, Lori E.; Schmank, Christopher J.; Castro, Nichol; Buchanan, Tony W.

2018-01-01

We tested the frequent assumption that the difficulty of word retrieval increases when a speaker is being observed and evaluated. We modified the Trier Social Stress Test (TSST) so that participants believed that its evaluative observation components continued throughout the duration of a subsequent word retrieval task, and measured participants'…
Values of a Patient and Observer Scar Assessment Scale to Evaluate the Facial Skin Graft Scar.

PubMed

Chae, Jin Kyung; Kim, Jeong Hee; Kim, Eun Jung; Park, Kun

2016-10-01

The patient and observer scar assessment scale (POSAS) recently emerged as a promising method, reflecting both observer's and patient's opinions in evaluating scar. This tool was shown to be consistent and reliable in burn scar assessment, but it has not been tested in the setting of skin graft scar in skin cancer patients. To evaluate facial skin graft scar applied to POSAS and to compare with objective scar assessment tools. Twenty three patients, who diagnosed with facial cutaneous malignancy and transplanted skin after Mohs micrographic surgery, were recruited. Observer assessment was performed by three independent rates using the observer component of the POSAS and Vancouver scar scale (VSS). Patient self-assessment was performed using the patient component of the POSAS. To quantify scar color and scar thickness more objectively, spectrophotometer and ultrasonography was applied. Inter-observer reliability was substantial with both VSS and the observer component of the POSAS (average measure intraclass coefficient correlation, 0.76 and 0.80, respectively). The observer component consistently showed significant correlations with patients' ratings for the parameters of the POSAS (all p -values<0.05). The correlation between subjective assessment using POSAS and objective assessment using spectrophotometer and ultrasonography showed low relationship. In facial skin graft scar assessment in skin cancer patients, the POSAS showed acceptable inter-observer reliability. This tool was more comprehensive and had higher correlation with patient's opinion.
48 CFR 236.201 - Evaluation of contractor performance.

Code of Federal Regulations, 2010 CFR

2010-10-01

... CONTRACTS Special Aspects of Contracting for Construction 236.201 Evaluation of contractor performance. (a) Preparation of performance evaluation reports. Use DD Form 2626, Performance Evaluation (Construction... 48 Federal Acquisition Regulations System 3 2010-10-01 2010-10-01 false Evaluation of contractor...
Deployment and Evaluation of an Observations Data Model

NASA Astrophysics Data System (ADS)

Horsburgh, J. S.; Tarboton, D. G.; Zaslavsky, I.; Maidment, D. R.; Valentine, D.

2007-12-01

Environmental observations are fundamental to hydrology and water resources, and the way these data are organized and manipulated either enables or inhibits the analyses that can be performed. The CUAHSI Hydrologic Information System project is developing information technology infrastructure to support hydrologic science. This includes an Observations Data Model (ODM) that provides a new and consistent format for the storage and retrieval of environmental observations in a relational database designed to facilitate integrated analysis of large datasets collected by multiple investigators. Within this data model, observations are stored with sufficient ancillary information (metadata) about the observations to allow them to be unambiguously interpreted and used, and to provide traceable heritage from raw measurements to useable information. The design is based upon a relational database model that exposes each single observation as a record, taking advantage of the capability in relational database systems for querying based upon data values and enabling cross dimension data retrieval and analysis. This data model has been deployed, as part of the HIS Server, at the WATERS Network test bed observatories across the U.S where it serves as a repository for real time data in the observatory information system. The ODM holds the data that is then made available to investigators and the public through web services and the Data Access System for Hydrology (DASH) map based interface. In the WATERS Network test bed settings the ODM has been used to ingest, analyze and publish data from a variety of sources and disciplines. This paper will present an evaluation of the effectiveness of this initial deployment and the revisions that are being instituted to address shortcomings. The ODM represents a new, systematic way for hydrologists, scientists, and engineers to organize and share their data and thereby facilitate a fuller integrated understanding of water resources based on
Conductor gestures influence evaluations of ensemble performance

PubMed Central

Morrison, Steven J.; Price, Harry E.; Smedley, Eric M.; Meals, Cory D.

2014-01-01

Previous research has found that listener evaluations of ensemble performances vary depending on the expressivity of the conductor’s gestures, even when performances are otherwise identical. It was the purpose of the present study to test whether this effect of visual information was evident in the evaluation of specific aspects of ensemble performance: articulation and dynamics. We constructed a set of 32 music performances that combined auditory and visual information and were designed to feature a high degree of contrast along one of two target characteristics: articulation and dynamics. We paired each of four music excerpts recorded by a chamber ensemble in both a high- and low-contrast condition with video of four conductors demonstrating high- and low-contrast gesture specifically appropriate to either articulation or dynamics. Using one of two equivalent test forms, college music majors and non-majors (N = 285) viewed sixteen 30 s performances and evaluated the quality of the ensemble’s articulation, dynamics, technique, and tempo along with overall expressivity. Results showed significantly higher evaluations for performances featuring high rather than low conducting expressivity regardless of the ensemble’s performance quality. Evaluations for both articulation and dynamics were strongly and positively correlated with evaluations of overall ensemble expressivity. PMID:25104944
48 CFR 3052.216-72 - Performance evaluation plan.

Code of Federal Regulations, 2010 CFR

2010-10-01

... 48 Federal Acquisition Regulations System 7 2010-10-01 2010-10-01 false Performance evaluation... CONTRACT CLAUSES Text of Provisions and Clauses 3052.216-72 Performance evaluation plan. As prescribed in... Evaluation Plan (DEC 2003) (a) A Performance Evaluation Plan shall be unilaterally established by the...
Performance Evaluation Process.

ERIC Educational Resources Information Center

1998

This document contains four papers from a symposium on the performance evaluation process and human resource development (HRD). "Assessing the Effectiveness of OJT (On the Job Training): A Case Study Approach" (Julie Furst-Bowe, Debra Gates) is a case study of the effectiveness of OJT in one of a high-tech manufacturing company's product…
The Context and Process for Performance Evaluations: Necessary Preconditions for the Use of Performance Evaluations as a Measure of Performance--A Critique of Perry

ERIC Educational Resources Information Center

McCarthy, Mary L.

2006-01-01

This article challenges Perry's research using performance evaluations to determine whether the educational background of child welfare workers is predictive of performance. Institutional theory, an understanding of street-level bureaucracies, and evaluations of field education performance measures are offered as necessary frameworks for Perry's…
Solar power plant performance evaluation: simulation and experimental validation

NASA Astrophysics Data System (ADS)

Natsheh, E. M.; Albarbar, A.

2012-05-01

In this work the performance of solar power plant is evaluated based on a developed model comprise photovoltaic array, battery storage, controller and converters. The model is implemented using MATLAB/SIMULINK software package. Perturb and observe (P&O) algorithm is used for maximizing the generated power based on maximum power point tracker (MPPT) implementation. The outcome of the developed model are validated and supported by a case study carried out using operational 28.8kW grid-connected solar power plant located in central Manchester. Measurements were taken over 21 month's period; using hourly average irradiance and cell temperature. It was found that system degradation could be clearly monitored by determining the residual (the difference) between the output power predicted by the model and the actual measured power parameters. It was found that the residual exceeded the healthy threshold, 1.7kW, due to heavy snow in Manchester last winter. More important, the developed performance evaluation technique could be adopted to detect any other reasons that may degrade the performance of the P V panels such as shading and dirt. Repeatability and reliability of the developed system performance were validated during this period. Good agreement was achieved between the theoretical simulation and the real time measurement taken the online grid connected solar power plant.
Evaluating health worker performance in Benin using the simulated client method with real children.

PubMed

Rowe, Alexander K; Onikpo, Faustin; Lama, Marcel; Deming, Michael S

2012-10-08

The simulated client (SC) method for evaluating health worker performance utilizes surveyors who pose as patients to make surreptitious observations during consultations. Compared to conspicuous observation (CO) by surveyors, which is commonly done in developing countries, SC data better reflect usual health worker practices. This information is important because CO can cause performance to be better than usual. Despite this advantage of SCs, the method's full potential has not been realized for evaluating performance for pediatric illnesses because real children have not been utilized as SCs. Previous SC studies used scenarios of ill children that were not actually brought to health workers. During a trial that evaluated a quality improvement intervention in Benin (the Integrated Management of Childhood Illness [IMCI] strategy), we conducted an SC survey with adult caretakers as surveyors and real children to evaluate the feasibility of this approach and used the results to assess the validity of CO. We conducted an SC survey and a CO survey (one right after the other) of health workers in the same 55 health facilities. A detailed description of the SC survey process was produced. Results of the two surveys were compared for 27 performance indicators using logistic regression modeling. SC and CO surveyors observed 54 and 185 consultations, respectively. No serious problems occurred during the SC survey. Performance levels measured by CO were moderately higher than those measured by SCs (median CO - SC difference = 16.4 percentage-points). Survey differences were sometimes much greater for IMCI-trained health workers (median difference = 29.7 percentage-points) than for workers without IMCI training (median difference = 3.1 percentage-points). SC surveys can be done safely with real children if appropriate precautions are taken. CO can introduce moderately large positive biases, and these biases might be greater for health workers exposed to quality improvement

Handbook for Improving Superintendent Performance Evaluation.

ERIC Educational Resources Information Center

Candoli, Carl; And Others

This handbook for superintendent performance evaluation contains information for boards of education as they institute or improve their evaluation system. The handbook answers questions involved in operationalizing, implementing, and evaluating a superintendent-evaluation system. The information was developed from research on superintendent…
Effect of Using 2 mm Voxels on Observer Performance for PET Lesion Detection

NASA Astrophysics Data System (ADS)

Morey, A. M.; Noo, Frédéric; Kadrmas, Dan J.

2016-06-01

Positron emission tomography (PET) images are typically reconstructed with an in-plane pixel size of approximately 4 mm for cancer imaging. The objective of this work was to evaluate the effect of using smaller pixels on general oncologic lesion-detection. A series of observer studies was performed using experimental phantom data from the Utah PET Lesion Detection Database, which modeled whole-body FDG PET cancer imaging of a 92 kg patient. The data comprised 24 scans over 4 days on a Biograph mCT time-of-flight (TOF) PET/CT scanner, with up to 23 lesions (diam. 6-16 mm) distributed throughout the phantom each day. Images were reconstructed with 2.036 mm and 4.073 mm pixels using ordered-subsets expectation-maximization (OSEM) both with and without point spread function (PSF) modeling and TOF. Detection performance was assessed using the channelized non-prewhitened numerical observer with localization receiver operating characteristic (LROC) analysis. Tumor localization performance and the area under the LROC curve were then analyzed as functions of the pixel size. In all cases, the images with 2 mm pixels provided higher detection performance than those with 4 mm pixels. The degree of improvement from the smaller pixels was larger than that offered by PSF modeling for these data, and provided roughly half the benefit of using TOF. Key results were confirmed by two human observers, who read subsets of the test data. This study suggests that a significant improvement in tumor detection performance for PET can be attained by using smaller voxel sizes than commonly used at many centers. The primary drawback is a 4-fold increase in reconstruction time and data storage requirements.
Model Performance Evaluation and Scenario Analysis (MPESA) Tutorial

EPA Science Inventory

This tool consists of two parts: model performance evaluation and scenario analysis (MPESA). The model performance evaluation consists of two components: model performance evaluation metrics and model diagnostics. These metrics provides modelers with statistical goodness-of-fit m...
The effects of non-evaluative feedback on drivers' self-evaluation and performance.

PubMed

Dogan, Ebru; Steg, Linda; Delhomme, Patricia; Rothengatter, Talib

2012-03-01

Drivers' tend to overestimate their competences, which may result in risk taking behavior. Providing drivers with feedback has been suggested as one of the solutions to overcome drivers' inaccurate self-evaluations. In practice, many tests and driving simulators provide drivers with non-evaluative feedback, which conveys information on the level of performance but not on what caused the performance. Is this type of feedback indeed effective in reducing self-enhancement biases? The current study aimed to investigate the effect of non-evaluative performance feedback on drivers' self-evaluations using a computerized hazard perception test. A between-subjects design was used with one group receiving feedback on performance in the hazard perception test while the other group not receiving any feedback. The results indicated that drivers had a robust self-enhancement bias in their self-evaluations regardless of the presence of performance feedback and that they systematically estimated their performance to be higher than they actually achieved in the test. Furthermore, they devalued the credibility of the test instead of adjusting their self-evaluations in order to cope with the negative feelings following the failure feedback. We discuss the theoretical and practical implications of these counterproductive effects of non-evaluative feedback. Copyright © 2011 Elsevier Ltd. All rights reserved.
Performance evaluation of haptic hand-controllers in a robot-assisted surgical system.

PubMed

Zareinia, Kourosh; Maddahi, Yaser; Ng, Canaan; Sepehri, Nariman; Sutherland, Garnette R

2015-12-01

This paper presents the experimental evaluation of three commercially available haptic hand-controllers to evaluate which was more suitable to the participants. Two surgeons and seven engineers performed two peg-in-hole tasks with different levels of difficulty. Each operator guided the end-effector of a Kuka manipulator that held surgical forceps and was equipped with a surgical microscope. Sigma 7, HD(2) and PHANToM Premium 3.0 hand-controllers were compared. Ten measures were adopted to evaluate operators' performances with respect to effort, speed and accuracy in completing a task, operator improvement during the tests, and the force applied by each haptic device. The best performance was observed with the Premium 3.0; the hand-piece was able to be held in a similar way to that used by surgeons to hold conventional tools. Hand-controllers with a linkage structure similar to the human upper extremity take advantage of the inherent human brain connectome, resulting in improved surgeon performance during robotic-assisted surgery. Copyright © 2015 John Wiley & Sons, Ltd.
Assisting allied health in performance evaluation: a systematic review.

PubMed

Lizarondo, Lucylynn; Grimmer, Karen; Kumar, Saravana

2014-11-14

Performance evaluation raises several challenges to allied health practitioners and there is no agreed approach to measuring or monitoring allied health service performance. The aim of this review was to examine the literature on performance evaluation in healthcare to assist in the establishment of a framework that can guide the measurement and evaluation of allied health clinical service performance. This review determined the core elements of a performance evaluation system, tools for evaluating performance, and barriers to the implementation of performance evaluation. A systematic review of the literature was undertaken. Five electronic databases were used to search for relevant articles: MEDLINE, Embase, CINAHL, PsychInfo, and Academic Search Premier. Articles which focussed on any allied health performance evaluation or those which examined performance in health care in general were considered in the review. Content analysis was used to synthesise the findings from individual articles. A total of 37 articles were included in the review. The literature suggests there are core elements involved in performance evaluation which include prioritising clinical areas for measurement, setting goals, selecting performance measures, identifying sources of feedback, undertaking performance measurement, and reporting the results to relevant stakeholders. The literature describes performance evaluation as multi-dimensional, requiring information or data from more than one perspective to provide a rich assessment of performance. A range of tools or instruments are available to capture various perspectives and gather a comprehensive picture of health care quality. Every allied health care delivery system has different performance needs and will therefore require different approaches. However, there are core processes that can be used as a framework to evaluate allied health performance. A careful examination of barriers to performance evaluation and subsequent tailoring of
48 CFR 2936.201 - Evaluation of contractor performance.

Code of Federal Regulations, 2010 CFR

2010-10-01

... 48 Federal Acquisition Regulations System 7 2010-10-01 2010-10-01 false Evaluation of contractor... Construction 2936.201 Evaluation of contractor performance. The HCA must establish procedures to evaluate construction contractor performance and prepare performance reports as required by FAR 36.201. ...
Values of a Patient and Observer Scar Assessment Scale to Evaluate the Facial Skin Graft Scar

PubMed Central

Chae, Jin Kyung; Kim, Eun Jung; Park, Kun

2016-01-01

Background The patient and observer scar assessment scale (POSAS) recently emerged as a promising method, reflecting both observer's and patient's opinions in evaluating scar. This tool was shown to be consistent and reliable in burn scar assessment, but it has not been tested in the setting of skin graft scar in skin cancer patients. Objective To evaluate facial skin graft scar applied to POSAS and to compare with objective scar assessment tools. Methods Twenty three patients, who diagnosed with facial cutaneous malignancy and transplanted skin after Mohs micrographic surgery, were recruited. Observer assessment was performed by three independent rates using the observer component of the POSAS and Vancouver scar scale (VSS). Patient self-assessment was performed using the patient component of the POSAS. To quantify scar color and scar thickness more objectively, spectrophotometer and ultrasonography was applied. Results Inter-observer reliability was substantial with both VSS and the observer component of the POSAS (average measure intraclass coefficient correlation, 0.76 and 0.80, respectively). The observer component consistently showed significant correlations with patients' ratings for the parameters of the POSAS (all p-values<0.05). The correlation between subjective assessment using POSAS and objective assessment using spectrophotometer and ultrasonography showed low relationship. Conclusion In facial skin graft scar assessment in skin cancer patients, the POSAS showed acceptable inter-observer reliability. This tool was more comprehensive and had higher correlation with patient's opinion. PMID:27746642
Do health economic evaluations using observational data provide reliable assessment of treatment effects?

PubMed Central

2013-01-01

Economic evaluation in modern health care systems is seen as a transparent scientific framework that can be used to advance progress towards improvements in population health at the best possible value. Despite the perceived superiority that trial-based studies have in terms of internal validity, economic evaluations often employ observational data. In this review, the interface between econometrics and economic evaluation is explored, with emphasis placed on highlighting methodological issues relating to the evaluation of cost-effectiveness within a bivariate framework. Studies that satisfied the eligibility criteria exemplified the use of matching, regression analysis, propensity scores, instrumental variables, as well as difference-in-differences approaches. All studies were reviewed and critically appraised using a structured template. The findings suggest that although state-of-the-art econometric methods have the potential to provide evidence on the causal effects of clinical and policy interventions, their application in economic evaluation is subject to a number of limitations. These range from no credible assessment of key assumptions and scarce evidence regarding the relative performance of different methods, to lack of reporting of important study elements, such as a summary outcome measure and its associated sampling uncertainty. Further research is required to better understand the ways in which observational data should be analysed in the context of the economic evaluation framework. PMID:24229445
Extensions to the visual predictive check to facilitate model performance evaluation.

PubMed

Post, Teun M; Freijer, Jan I; Ploeger, Bart A; Danhof, Meindert

2008-04-01

The Visual Predictive Check (VPC) is a valuable and supportive instrument for evaluating model performance. However in its most commonly applied form, the method largely depends on a subjective comparison of the distribution of the simulated data with the observed data, without explicitly quantifying and relating the information in both. In recent adaptations to the VPC this drawback is taken into consideration by presenting the observed and predicted data as percentiles. In addition, in some of these adaptations the uncertainty in the predictions is represented visually. However, it is not assessed whether the expected random distribution of the observations around the predicted median trend is realised in relation to the number of observations. Moreover the influence of and the information residing in missing data at each time point is not taken into consideration. Therefore, in this investigation the VPC is extended with two methods to support a less subjective and thereby more adequate evaluation of model performance: (i) the Quantified Visual Predictive Check (QVPC) and (ii) the Bootstrap Visual Predictive Check (BVPC). The QVPC presents the distribution of the observations as a percentage, thus regardless the density of the data, above and below the predicted median at each time point, while also visualising the percentage of unavailable data. The BVPC weighs the predicted median against the 5th, 50th and 95th percentiles resulting from a bootstrap of the observed data median at each time point, while accounting for the number and the theoretical position of unavailable data. The proposed extensions to the VPC are illustrated by a pharmacokinetic simulation example and applied to a pharmacodynamic disease progression example.
48 CFR 2452.216-73 - Performance evaluation plan.

Code of Federal Regulations, 2010 CFR

2010-10-01

... 48 Federal Acquisition Regulations System 6 2010-10-01 2010-10-01 true Performance evaluation plan... 2452.216-73 Performance evaluation plan. As prescribed in 2416.406(e)(3), insert the following clause in all award fee contracts: Performance Evaluation Plan (AUG 1987) (a) The Government shall...
Evaluation of Oral Performance in Outsourced Call Centres: An Exploratory Case Study

ERIC Educational Resources Information Center

Friginal, Eric

2013-01-01

This case study discusses the development and use of an oral performance assessment instrument intended to evaluate Filipino agents' customer service transactions with callers from the United States (US). The design and applications of the instrument were based on a longitudinal, qualitative observation of language training and customer service…
Experimental evaluation of nonclassical correlations between measurement outcomes and target observable in a quantum measurement

NASA Astrophysics Data System (ADS)

Iinuma, Masataka; Suzuki, Yutaro; Nii, Taiki; Kinoshita, Ryuji; Hofmann, Holger F.

2016-03-01

In general, it is difficult to evaluate measurement errors when the initial and final conditions of the measurement make it impossible to identify the correct value of the target observable. Ozawa proposed a solution based on the operator algebra of observables which has recently been used in experiments investigating the error-disturbance trade-off of quantum measurements. Importantly, this solution makes surprisingly detailed statements about the relations between measurement outcomes and the unknown target observable. In the present paper, we investigate this relation by performing a sequence of two measurements on the polarization of a photon, so that the first measurement commutes with the target observable and the second measurement is sensitive to a complementary observable. While the initial measurement can be evaluated using classical statistics, the second measurement introduces the effects of quantum correlations between the noncommuting physical properties. By varying the resolution of the initial measurement, we can change the relative contribution of the nonclassical correlations and identify their role in the evaluation of the quantum measurement. It is shown that the most striking deviation from classical expectations is obtained at the transition between weak and strong measurements, where the competition between different statistical effects results in measurement values well outside the range of possible eigenvalues.
Skylab program earth resouces experiment package. Volume 4: Sensor performance evaluation (S193 R/S). [radiometer/scatterometer

NASA Technical Reports Server (NTRS)

Kenney, G. P.

1975-01-01

The results of the sensor performance evaluation of the 13.9 GHz radiometer/scatterometer, which was part of the earth resources experiment package on Skylab. Findings are presented in the areas of housekeeping parameters, antenna gain and scanning performance, dynamic range, linearity, precision, resolution, stability, integration time, and transmitter output. Supplementary analyses covering performance anomalies, data stream peculiarities, aircraft sensor data comparisons, scatterometer saturation characteristics, and RF heating effects are reported. Results of the evaluation show that instrument performance was generally as expected, but capability degradations were observed to result from three major anomalies. Conclusions are drawn from the evaluation results, and recommendations for improving the effectiveness of a future program are offered. An addendum describes the special evaluation techniques developed and applied in the sensor performance evaluation tasks.
Performance Evaluation Model for Application Layer Firewalls.

PubMed

Xuan, Shichang; Yang, Wu; Dong, Hui; Zhang, Jiangchuan

2016-01-01

Application layer firewalls protect the trusted area network against information security risks. However, firewall performance may affect user experience. Therefore, performance analysis plays a significant role in the evaluation of application layer firewalls. This paper presents an analytic model of the application layer firewall, based on a system analysis to evaluate the capability of the firewall. In order to enable users to improve the performance of the application layer firewall with limited resources, resource allocation was evaluated to obtain the optimal resource allocation scheme in terms of throughput, delay, and packet loss rate. The proposed model employs the Erlangian queuing model to analyze the performance parameters of the system with regard to the three layers (network, transport, and application layers). Then, the analysis results of all the layers are combined to obtain the overall system performance indicators. A discrete event simulation method was used to evaluate the proposed model. Finally, limited service desk resources were allocated to obtain the values of the performance indicators under different resource allocation scenarios in order to determine the optimal allocation scheme. Under limited resource allocation, this scheme enables users to maximize the performance of the application layer firewall.
24 CFR 570.491 - Performance and evaluation report.

Code of Federal Regulations, 2010 CFR

2010-04-01

... 24 Housing and Urban Development 3 2010-04-01 2010-04-01 false Performance and evaluation report... Development Block Grant Program § 570.491 Performance and evaluation report. The annual performance and evaluation report shall be submitted in accordance with 24 CFR part 91. (Approved by the Office of Management...
Performability evaluation of the SIFT computer

NASA Technical Reports Server (NTRS)

Meyer, J. F.; Furchtgott, D. G.; Wu, L. T.

1979-01-01

Performability modeling and evaluation techniques are applied to the SIFT computer as it might operate in the computational evironment of an air transport mission. User-visible performance of the total system (SIFT plus its environment) is modeled as a random variable taking values in a set of levels of accomplishment. These levels are defined in terms of four attributes of total system behavior: safety, no change in mission profile, no operational penalties, and no economic process whose states describe the internal structure of SIFT as well as relavant conditions of the environment. Base model state trajectories are related to accomplishment levels via a capability function which is formulated in terms of a 3-level model hierarchy. Performability evaluation algorithms are then applied to determine the performability of the total system for various choices of computer and environment parameter values. Numerical results of those evaluations are presented and, in conclusion, some implications of this effort are discussed.
Differences between Employees' and Supervisors' Evaluations of Work Performance and Support Needs

ERIC Educational Resources Information Center

Bennett, Kyle; Frain, Michael; Brady, Michael P.; Rosenberg, Howard; Surinak, Tricia

2009-01-01

Assessment systems are needed that are sensitive to employees' work performance as well as their need for support, while incorporating the input from both employees and their supervisors. This study examined the correspondence of one such evaluation system, the Job Observation and Behavior Scale (JOBS) and the JOBS: Opportunity for…
Evaluation of a numerical model's ability to predict bed load transport observed in braided river experiments

NASA Astrophysics Data System (ADS)

Javernick, Luke; Redolfi, Marco; Bertoldi, Walter

2018-05-01

New data collection techniques offer numerical modelers the ability to gather and utilize high quality data sets with high spatial and temporal resolution. Such data sets are currently needed for calibration, verification, and to fuel future model development, particularly morphological simulations. This study explores the use of high quality spatial and temporal data sets of observed bed load transport in braided river flume experiments to evaluate the ability of a two-dimensional model, Delft3D, to predict bed load transport. This study uses a fixed bed model configuration and examines the model's shear stress calculations, which are the foundation to predict the sediment fluxes necessary for morphological simulations. The evaluation is conducted for three flow rates, and model setup used highly accurate Structure-from-Motion (SfM) topography and discharge boundary conditions. The model was hydraulically calibrated using bed roughness, and performance was evaluated based on depth and inundation agreement. Model bed load performance was evaluated in terms of critical shear stress exceedance area compared to maps of observed bed mobility in a flume. Following the standard hydraulic calibration, bed load performance was tested for sensitivity to horizontal eddy viscosity parameterization and bed morphology updating. Simulations produced depth errors equal to the SfM inherent errors, inundation agreement of 77-85%, and critical shear stress exceedance in agreement with 49-68% of the observed active area. This study provides insight into the ability of physically based, two-dimensional simulations to accurately predict bed load as well as the effects of horizontal eddy viscosity and bed updating. Further, this study highlights how using high spatial and temporal data to capture the physical processes at work during flume experiments can help to improve morphological modeling.
Intra- and inter-observer reliability of ten major histological scoring systems used for the evaluation of in vivo cartilage repair.

PubMed

Bonasia, Davide Edoardo; Marmotti, Antongiulio; Massa, Alessandro Domenico Felice; Ferro, Andrea; Blonna, Davide; Castoldi, Filippo; Rossi, Roberto

2015-09-01

In the last two decades, many surgical techniques have been described for articular cartilage repair. Reliable histological scoring systems are fundamental tools to evaluate new procedures. Several histological scoring systems have been described, and these can be divided in elementary and comprehensive scores, according to the number of sub-items. The aim of this study was to test the inter- and intra-observer reliability of ten main scores used for the histological evaluation of in vivo cartilage repair. The authors tested the starting hypothesis that elementary scores would show superior intra- and inter-observer reliability compared with comprehensive scores. Fifty histological sections obtained from the trochlea of New Zealand Rabbit and stained with Safranin-O fast green were used. The histological sections were analysed by 4 observers: 2 experienced in cartilage histology and 2 inexperienced. Histological evaluations were performed at time 1 and time 2, separated by a 30-day interval. The following scores were used: Mankin, O'Driscoll, Pineda, Wakitani, Fortier, Selleres, ICRS, ICRSII, Oswestry (OsScore) and modified O'Driscoll. Intra- and inter-observer reliability were evaluated for each score. In addition, the pavement-ceiling effect and the Bland-Altman Coefficient of Repeatability were then evaluated for each sub-item of every score. Intra-observer reliability was high for all observers in every score, even though the reliability was significantly lower for non-expert observers compared with expert counterparts. In terms of Coefficient of Repeatability, some scores performed better (O'Driscoll, Modified O'Driscoll and ICRSII) than others (Fortier, Seller). Inter-observer reliability was high for all observers in every score, but significantly lower for non-expert compared with expert observers. In expert hands, all the scores showed high intra- and inter-observer reliability, independently of the complexity. Although every score has advantages and

Video observation of procedural skills for assessment of trabeculectomy performed by residents.

PubMed

Hassanpour, Narges; Chen, Rebecca; Baikpour, Masoud; Moghimi, Sasan

2016-06-01

The efficacy and sufficiency of a healthcare system is directly related to the knowledge and skills of graduates working in the system. In this regard, many different assessment methods have been proposed to evaluate various skills of the learners. Video Observation of Procedural Skills (VOPS) is one newly-proposed method. In this study we aimed to compare the results of the VOPS method with the more commonly used Direct Observation of Procedural Skills (DOPS). In this prospective study conducted in 2012, all 10 ophthalmology residents of post graduate year 4 were selected for participation. Three months into training in the glaucoma ward, these residents performed trabeculectomy surgery on patients, and their procedural skills were assessed in real time by an expert via the DOPS method. All surgeries were also recorded and later evaluated via the VOPS method by an expert. Bland-Altman plot also was used to compare the two methods and calculating the mean and 95% limit of agreement. Residents have been done a mean of 14.9 ± 3.5 (range 10-20) independent trabeculectomy before the assessments. DOPS grade was positively associated with number of independent trabeculectomy during glaucoma rotation (β=0.227, p = 0.004). The intra-observer reproducibility of VOPS measurements was 0.847 (95% CI: 0.634, 0.961). The mean VOPS grade was significantly lower than the mean DOPS grade (8.4 vs. 8.9, p = 0.02). However, a good correlation was observed between the grades of VOPS and DOPS (r = 0.89, p = 0.001). Bland-Altman analysis demonstrated that all data points fell within the 95% limits of agreement (-1.46, 0.46). The present study showed that VOPS might be considered a feasible, valid, and reliable assessment method for procedural skills of medical students and residents that can be used as an alternative to the DOPS method. However, VOPS might underestimate DOPS in evaluating surgical skills of residents.
Formative and Summative Evaluation: Related Issues in Performance Measurement.

ERIC Educational Resources Information Center

Wholey, Joseph S.

1996-01-01

Performance measurement can serve both formative and summative evaluation functions. Formative evaluation is typically more useful for government purposes whereas performance measurement is more useful than one-shot evaluations of either formative or summative nature. Evaluators should study performance measurement through case studies and…
Planning for an Evaluation of Teaching Performance. Volume IV. Summaries of Instruments for Use in Evaluating Teacher Performance.

ERIC Educational Resources Information Center

Yuzdepski, I., Comp.; Elliott, L., Comp.

This document presents information, in the form of summary sheets, on 54 teacher evaluation instruments. Each summary contains pertinent information about the instrument regarding publishing company, author, criteria evaluated, subject of observation, category dimension, and coding units. The 19 criteria used in the evaluation tests, which were…
Evaluation of Immediate Actions Taken to Deal with Cracking Problems Observed in Wheels of Rail Commuter Cars

DOT National Transportation Integrated Search

1993-07-01

The report is the first in a series of engineering studies on railroad vehicle wheel performance. Preliminary studies are summarized, involving evaluation of actions taken to respond to high rates of crack occurrence observed in the wheels of certain...
Performance evaluation of Louisiana superpave mixtures.

DOT National Transportation Integrated Search

2008-12-01

This report documents the performance of Louisiana Superpave mixtures through laboratory mechanistic tests, mixture : volumetric properties, gradation analysis, and early field performance. Thirty Superpave mixtures were evaluated in this : study. Fo...
Evaluation on surface current observing network of high frequency ground wave radars in the Gulf of Thailand

NASA Astrophysics Data System (ADS)

Yin, Xunqiang; Shi, Junqiang; Qiao, Fangli

2018-05-01

Due to the high cost of ocean observation system, the scientific design of observation network becomes much important. The current network of the high frequency radar system in the Gulf of Thailand has been studied using a three-dimensional coastal ocean model. At first, the observations from current radars have been assimilated into this coastal model and the forecast results have improved due to the data assimilation. But the results also show that further optimization of the observing network is necessary. And then, a series of experiments were carried out to assess the performance of the existing high frequency ground wave radar surface current observation system. The simulated surface current data in three regions were assimilated sequentially using an efficient ensemble Kalman filter data assimilation scheme. The experimental results showed that the coastal surface current observation system plays a positive role in improving the numerical simulation of the currents. Compared with the control experiment without assimilation, the simulation precision of surface and subsurface current had been improved after assimilated the surface currents observed at current networks. However, the improvement for three observing regions was quite different and current observing network in the Gulf of Thailand is not effective and a further optimization is required. Based on these evaluations, a manual scheme has been designed by discarding the redundant and inefficient locations and adding new stations where the performance after data assimilation is still low. For comparison, an objective scheme based on the idea of data assimilation has been obtained. Results show that all the two schemes of observing network perform better than the original network and optimal scheme-based data assimilation is much superior to the manual scheme that based on the evaluation of original observing network in the Gulf of Thailand. The distributions of the optimal network of radars could be a
OSSE Evaluation of Aircraft Reconnaissance Observations and their Impact on Hurricane Analyses and Forecasts

NASA Astrophysics Data System (ADS)

Ryan, K. E.; Bucci, L. R.; Delgado, J.; Atlas, R. M.; Murillo, S.; Dodge, P.

2016-12-01

NOAA/AOML's Hurricane Research Division (HRD) annually conducts its Hurricane Field Program during which observations are collected via NOAA aircraft to improve the understanding and prediction of hurricanes. Mission experiments suggest a variety of flight patterns and sampling strategies aimed towards their respective goals described by the Intensity Forecasting Experiment (IFEX; Rogers et al., BAMS, 2006, 2013), a collaborative effort among HRD, NHC, and EMC. Evaluating the potential impact of various trade-offs in track design is valuable for determining the optimal air reconnaissance flight pattern for a prospective mission. AOML's HRD has developed a system for performing regional Observing System Simulation Experiments (OSSEs) to assess the potential impact of proposed observing systems on hurricane track and intensity forecasts and analyses. This study focuses on investigating the potential impact of proposed aircraft reconnaissance observing system designs. Aircraft instrument and flight level retrievals were simulated from a regional WRF ARW Nature Run (Nolan et al., 2013) spanning 13 days, covering the life cycle of a rapidly intensifying Atlantic tropical cyclone. The aircraft trajectories of NOAA aircraft are simulated in a variety of ways and are evaluated to examine the potential impact of aircraft reconnaissance observations on hurricane track and intensity forecasts.
Performance and scalability evaluation of "Big Memory" on Blue Gene Linux.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yoshii, K.; Iskra, K.; Naik, H.

2011-05-01

We address memory performance issues observed in Blue Gene Linux and discuss the design and implementation of 'Big Memory' - an alternative, transparent memory space introduced to eliminate the memory performance issues. We evaluate the performance of Big Memory using custom memory benchmarks, NAS Parallel Benchmarks, and the Parallel Ocean Program, at a scale of up to 4,096 nodes. We find that Big Memory successfully resolves the performance issues normally encountered in Blue Gene Linux. For the ocean simulation program, we even find that Linux with Big Memory provides better scalability than does the lightweight compute node kernel designed solelymore » for high-performance applications. Originally intended exclusively for compute node tasks, our new memory subsystem dramatically improves the performance of certain I/O node applications as well. We demonstrate this performance using the central processor of the LOw Frequency ARray radio telescope as an example.« less
Evaluation and Optimization of China's Anthropogenic CO2 Emissions using Observations from Northern China (2005-2009).

NASA Astrophysics Data System (ADS)

Dayalu, A.; Munger, J. W.; Wang, Y.; Wofsy, S.; Zhao, Y.; Nielsen, C. P.; Nehrkorn, T.; McElroy, M. B.; Chang, R.

2017-12-01

China has pledged to peak carbon emissions by 2030, but there continues to be significant uncertainty in estimates of its anthropogenic carbon dioxide (CO2) emissions. In this study, we evaluate the performance of three anthropogenic CO2 inventories, two global and one regional, using five years of continuous hourly observations from a site in Northern China. We model five years of continuous hourly observations (2005 to 2009) using the Stochastic Time-Inverted Lagrangian Transport Model (STILT) run in backward time mode driven by high resolution meteorology from the Weather Research and Forecasting Model version 3.6.1 (WRF) with vegetation fluxes prescribed by a simple biosphere model. We calculate regional enhancements to advected background CO2 derived from NOAA CarbonTracker on seasonal and annual bases and use observations to optimize emissions inventories within the site's influence region at these timescales. Finally, we use annual enhancements to examine carbon intensity of provinces in and adjacent to Northern China as CO2 per unit of the region's GDP to evaluate the effects of local and global economic events on CO2 emissions. With the exception of peak growing season where discrepancies are confounded by errors in the vegetation model, we find the regional inventory agrees significantly better with observations than the global inventories at all timescales. Here we use a single measurement site; significant improvements in inventory optimizations can be achieved with a network of measurements stations. This study highlights the importance of China-specific data over global averages in emissions evaluation and demonstrates the value of top-down studies in independently evaluating inventory performance. We demonstrate the framework's ability to resolve differences of at least 20% among inventories, establishing a benchmark for ongoing efforts to decrease uncertainty in China's reported CO2 emissions estimates.
Evaluation of Student Performance through a Multidimensional Finite Mixture IRT Model.

PubMed

Bacci, Silvia; Bartolucci, Francesco; Grilli, Leonardo; Rampichini, Carla

2017-01-01

In the Italian academic system, a student can enroll for an exam immediately after the end of the teaching period or can postpone it; in this second case the exam result is missing. We propose an approach for the evaluation of a student performance throughout the course of study, accounting also for nonattempted exams. The approach is based on an item response theory model that includes two discrete latent variables representing student performance and priority in selecting the exams to take. We explicitly account for nonignorable missing observations as the indicators of attempted exams also contribute to measure the performance (within-item multidimensionality). The model also allows for individual covariates in its structural part.
The Impact of Self-Evaluation Instruction on Student Self-Evaluation, Music Performance, and Self-Evaluation Accuracy

ERIC Educational Resources Information Center

Hewitt, Michael P.

2011-01-01

The author sought to determine whether self-evaluation instruction had an impact on student self-evaluation, music performance, and self-evaluation accuracy of music performance among middle school instrumentalists. Participants (N = 211) were students at a private middle school located in a metropolitan area of a mid-Atlantic state. Students in…
48 CFR 1552.209-76 - Contractor performance evaluations.

Code of Federal Regulations, 2010 CFR

2010-10-01

... 48 Federal Acquisition Regulations System 6 2010-10-01 2010-10-01 true Contractor performance... 1552.209-76 Contractor performance evaluations. As prescribed in section 1509.170-1, insert the following clause in all applicable solicitations and contracts. Contractor Performance Evaluations (OCT 2002...
48 CFR 8.406-7 - Contractor Performance Evaluation.

Code of Federal Regulations, 2010 CFR

2010-10-01

... 48 Federal Acquisition Regulations System 1 2010-10-01 2010-10-01 false Contractor Performance... ACQUISITION PLANNING REQUIRED SOURCES OF SUPPLIES AND SERVICES Federal Supply Schedules 8.406-7 Contractor Performance Evaluation. Ordering activities must prepare an evaluation of contractor performance for each...
48 CFR 1536.201 - Evaluation of contracting performance.

Code of Federal Regulations, 2010 CFR

2010-10-01

... performance. 1536.201 Section 1536.201 Federal Acquisition Regulations System ENVIRONMENTAL PROTECTION AGENCY... Contracting for Construction 1536.201 Evaluation of contracting performance. (a) The Contracting Officer will... will file the form in the contractor performance evaluation files which it maintains. (e) The Quality...
Performance evaluation of ionospheric time delay forecasting models using GPS observations at a low-latitude station

NASA Astrophysics Data System (ADS)

Sivavaraprasad, G.; Venkata Ratnam, D.

2017-07-01

Ionospheric delay is one of the major atmospheric effects on the performance of satellite-based radio navigation systems. It limits the accuracy and availability of Global Positioning System (GPS) measurements, related to critical societal and safety applications. The temporal and spatial gradients of ionospheric total electron content (TEC) are driven by several unknown priori geophysical conditions and solar-terrestrial phenomena. Thereby, the prediction of ionospheric delay is challenging especially over Indian sub-continent. Therefore, an appropriate short/long-term ionospheric delay forecasting model is necessary. Hence, the intent of this paper is to forecast ionospheric delays by considering day to day, monthly and seasonal ionospheric TEC variations. GPS-TEC data (January 2013-December 2013) is extracted from a multi frequency GPS receiver established at K L University, Vaddeswaram, Guntur station (geographic: 16.37°N, 80.37°E; geomagnetic: 7.44°N, 153.75°E), India. An evaluation, in terms of forecasting capabilities, of three ionospheric time delay models - an Auto Regressive Moving Average (ARMA) model, Auto Regressive Integrated Moving Average (ARIMA) model, and a Holt-Winter's model is presented. The performances of these models are evaluated through error measurement analysis during both geomagnetic quiet and disturbed days. It is found that, ARMA model is effectively forecasting the ionospheric delay with an accuracy of 82-94%, which is 10% more superior to ARIMA and Holt-Winter's models. Moreover, the modeled VTEC derived from International Reference Ionosphere, IRI (IRI-2012) model and new global TEC model, Neustrelitz TEC Model (NTCM-GL) have compared with forecasted VTEC values of ARMA, ARIMA and Holt-Winter's models during geomagnetic quiet days. The forecast results are indicating that ARMA model would be useful to set up an early warning system for ionospheric disturbances at low latitude regions.
Assessing Multi-year Changes in Modeled and Observed Urban NOx Concentrations from a Dynamic Model Evaluation Perspective

EPA Science Inventory

An investigation of the concentrations of nitrogen oxides (NOx) from an air quality model and observations at monitoring sites was performed to assess the changes in NOx levels attributable to changes in mobile emissions. This evaluation effort focused on weekday morning rush hou...
Sensor Technology Performance Characteristics- Field and Laboratory Observations

EPA Science Inventory

Observed Intangible Performance Characteristics RH and temperature impacts may be significant for some devices Internal battery lifetimes range from 4 to 24 hoursSensor packaging can interfere with accurate measurements (reactivity)Wireless communication protocols are not foolpr...
Evaluation of Classifier Performance for Multiclass Phenotype Discrimination in Untargeted Metabolomics.

PubMed

Trainor, Patrick J; DeFilippis, Andrew P; Rai, Shesh N

2017-06-21

Statistical classification is a critical component of utilizing metabolomics data for examining the molecular determinants of phenotypes. Despite this, a comprehensive and rigorous evaluation of the accuracy of classification techniques for phenotype discrimination given metabolomics data has not been conducted. We conducted such an evaluation using both simulated and real metabolomics datasets, comparing Partial Least Squares-Discriminant Analysis (PLS-DA), Sparse PLS-DA, Random Forests, Support Vector Machines (SVM), Artificial Neural Network, k -Nearest Neighbors ( k -NN), and Naïve Bayes classification techniques for discrimination. We evaluated the techniques on simulated data generated to mimic global untargeted metabolomics data by incorporating realistic block-wise correlation and partial correlation structures for mimicking the correlations and metabolite clustering generated by biological processes. Over the simulation studies, covariance structures, means, and effect sizes were stochastically varied to provide consistent estimates of classifier performance over a wide range of possible scenarios. The effects of the presence of non-normal error distributions, the introduction of biological and technical outliers, unbalanced phenotype allocation, missing values due to abundances below a limit of detection, and the effect of prior-significance filtering (dimension reduction) were evaluated via simulation. In each simulation, classifier parameters, such as the number of hidden nodes in a Neural Network, were optimized by cross-validation to minimize the probability of detecting spurious results due to poorly tuned classifiers. Classifier performance was then evaluated using real metabolomics datasets of varying sample medium, sample size, and experimental design. We report that in the most realistic simulation studies that incorporated non-normal error distributions, unbalanced phenotype allocation, outliers, missing values, and dimension reduction
Effects of Performers' External Characteristics on Performance Evaluations.

ERIC Educational Resources Information Center

Bermingham, Gudrun A.

2000-01-01

States that fairness has been a major concern in the field of music adjudication. Reviews the research literature to reveal information about three external characteristics (race, gender, and physical attractiveness) that may affect judges' performance evaluations and influence fairness of music adjudication. Includes references. (CMK)
Alternative performance measures for evaluating congestion.

DOT National Transportation Integrated Search

2004-04-01

This report summarizes the results of the work performed under the project Alternative Performance Measures for Evaluating : Congestion. The study first outlines existing approaches to looking at congestion. It then builds on the previous work in the...

Building Leadership Talent through Performance Evaluation

ERIC Educational Resources Information Center

Clifford, Matthew

2015-01-01

Most states and districts scramble to provide professional development to support principals, but "principal evaluation" is often lost amid competing priorities. Evaluation is an important method for supporting principal growth, communicating performance expectations to principals, and improving leadership practice. It provides leaders…
Performance Evaluation and Benchmarking of Intelligent Systems

DOE Office of Scientific and Technical Information (OSTI.GOV)

Madhavan, Raj; Messina, Elena; Tunstel, Edward

To design and develop capable, dependable, and affordable intelligent systems, their performance must be measurable. Scientific methodologies for standardization and benchmarking are crucial for quantitatively evaluating the performance of emerging robotic and intelligent systems technologies. There is currently no accepted standard for quantitatively measuring the performance of these systems against user-defined requirements; and furthermore, there is no consensus on what objective evaluation procedures need to be followed to understand the performance of these systems. The lack of reproducible and repeatable test methods has precluded researchers working towards a common goal from exchanging and communicating results, inter-comparing system performance, and leveragingmore » previous work that could otherwise avoid duplication and expedite technology transfer. Currently, this lack of cohesion in the community hinders progress in many domains, such as manufacturing, service, healthcare, and security. By providing the research community with access to standardized tools, reference data sets, and open source libraries of solutions, researchers and consumers will be able to evaluate the cost and benefits associated with intelligent systems and associated technologies. In this vein, the edited book volume addresses performance evaluation and metrics for intelligent systems, in general, while emphasizing the need and solutions for standardized methods. To the knowledge of the editors, there is not a single book on the market that is solely dedicated to the subject of performance evaluation and benchmarking of intelligent systems. Even books that address this topic do so only marginally or are out of date. The research work presented in this volume fills this void by drawing from the experiences and insights of experts gained both through theoretical development and practical implementation of intelligent systems in a variety of diverse application domains. The book
Interrater Reliability and Diagnostic Performance of Subjective Evaluation of Sublingual Microcirculation Images by Physicians and Nurses: A Multicenter Observational Study.

PubMed

Lima, Alexandre; López, Alejandra; van Genderen, Michel E; Hurtado, Francisco Javier; Angulo, Martin; Grignola, Juan C; Shono, Atsuko; van Bommel, Jasper

2015-09-01

This was a cross-sectional multicenter study to investigate the ability of physicians and nurses from three different countries to subjectively evaluate sublingual microcirculation images and thereby discriminate normal from abnormal sublingual microcirculation based on flow and density abnormalities. Forty-five physicians and 61 nurses (mean age, 36 ± 10 years; 44 males) from three different centers in The Netherlands (n = 61), Uruguay (n = 12), and Japan (n = 33) were asked to subjectively evaluate a sample of 15 microcirculation videos randomly selected from an experimental model of endotoxic shock in pigs. All videos were first analyzed offline using the A.V.A. software by an independent, experienced investigator and were categorized as good, bad, or very bad microcirculation based on the microvascular flow index, perfused capillary density, and proportion of perfused capillaries. Then, the videos were randomly assigned to the examiners, who were instructed to subjectively categorize each image as good, bad, or very bad. An interrater analysis was performed, and sensitivity and specificity tests were calculated to evaluate the proportion of A.V.A. score abnormalities that the examiners correctly identified. The κ statistics indicated moderate agreement in the evaluation of microcirculation abnormalities using three categories, i.e., good, bad, or very bad (κ = 0.48), and substantial agreement using two categories, i.e., normal (good) and abnormal (bad or very bad) (κ = 0.66). There was no significant difference between the κ three and κ two statistics. We found that the examiner's subjective evaluations had good diagnostic performance and were highly sensitive (84%; 95% confidence interval, 81%-86%) and specific (87%; 95% confidence interval, 84%-90%) for sublingual microcirculatory abnormalities as assessed using the A.V.A. software. The subjective evaluations of sublingual microcirculation by physicians and nurses agreed well with a conventional offline
Evaluation of Long-Range Lightning Detection Networks Using TRMM/LIS Observations

NASA Technical Reports Server (NTRS)

Rudlosky, Scott D.; Holzworth, Robert H.; Carey, Lawrence D.; Schultz, Chris J.; Bateman, Monte; Cecil, Daniel J.; Cummins, Kenneth L.; Petersen, Walter A.; Blakeslee, Richard J.; Goodman, Steven J.

2011-01-01

Recent advances in long-range lightning detection technologies have improved our understanding of thunderstorm evolution in the data sparse oceanic regions. Although the expansion and improvement of long-range lightning datasets have increased their applicability, these applications (e.g., data assimilation, atmospheric chemistry, and aviation weather hazards) require knowledge of the network detection capabilities. Toward this end, the present study evaluates data from the World Wide Lightning Location Network (WWLLN) using observations from the Lightning Imaging Sensor (LIS) aboard the Tropical Rainfall Measurement Mission (TRMM) satellite. The study documents the WWLLN detection efficiency and location accuracy relative to LIS observations, describes the spatial variability in these performance metrics, and documents the characteristics of LIS flashes that are detected by WWLLN. Improved knowledge of the WWLLN detection capabilities will allow researchers, algorithm developers, and operational users to better prepare for the spatial and temporal coverage of the upcoming GOES-R Geostationary Lightning Mapper (GLM).
30 CFR 14.3 - Observers at tests and evaluations.

Code of Federal Regulations, 2010 CFR

2010-07-01

... 30 Mineral Resources 1 2010-07-01 2010-07-01 false Observers at tests and evaluations. 14.3 Section 14.3 Mineral Resources MINE SAFETY AND HEALTH ADMINISTRATION, DEPARTMENT OF LABOR TESTING, EVALUATION, AND APPROVAL OF MINING PRODUCTS REQUIREMENTS FOR THE APPROVAL OF FLAME-RESISTANT CONVEYOR BELTS...
Advanced Video Analysis Needs for Human Performance Evaluation

NASA Technical Reports Server (NTRS)

Campbell, Paul D.

1994-01-01

Evaluators of human task performance in space missions make use of video as a primary source of data. Extraction of relevant human performance information from video is often a labor-intensive process requiring a large amount of time on the part of the evaluator. Based on the experiences of several human performance evaluators, needs were defined for advanced tools which could aid in the analysis of video data from space missions. Such tools should increase the efficiency with which useful information is retrieved from large quantities of raw video. They should also provide the evaluator with new analytical functions which are not present in currently used methods. Video analysis tools based on the needs defined by this study would also have uses in U.S. industry and education. Evaluation of human performance from video data can be a valuable technique in many industrial and institutional settings where humans are involved in operational systems and processes.
A New Method for the Evaluation and Prediction of Base Stealing Performance.

PubMed

Bricker, Joshua C; Bailey, Christopher A; Driggers, Austin R; McInnis, Timothy C; Alami, Arya

2016-11-01

Bricker, JC, Bailey, CA, Driggers, AR, McInnis, TC, and Alami, A. A new method for the evaluation and prediction of base stealing performance. J Strength Cond Res 30(11): 3044-3050, 2016-The purposes of this study were to evaluate a new method using electronic timing gates to monitor base stealing performance in terms of reliability, differences between it and traditional stopwatch-collected times, and its ability to predict base stealing performance. Twenty-five healthy collegiate baseball players performed maximal effort base stealing trials with a right and left-handed pitcher. An infrared electronic timing system was used to calculate the reaction time (RT) and total time (TT), whereas coaches' times (CT) were recorded with digital stopwatches. Reliability of the TGM was evaluated with intraclass correlation coefficients (ICCs) and coefficient of variation (CV). Differences between the TGM and traditional CT were calculated with paired samples t tests Cohen's d effect size estimates. Base stealing performance predictability of the TGM was evaluated with Pearson's bivariate correlations. Acceptable relative reliability was observed (ICCs 0.74-0.84). Absolute reliability measures were acceptable for TT (CVs = 4.4-4.8%), but measures were elevated for RT (CVs = 32.3-35.5%). Statistical and practical differences were found between TT and CT (right p = 0.00, d = 1.28 and left p = 0.00, d = 1.49). The TGM TT seems to be a decent predictor of base stealing performance (r = -0.49 to -0.61). The authors recommend using the TGM used in this investigation for athlete monitoring because it was found to be reliable, seems to be more precise than traditional CT measured with a stopwatch, provides an additional variable of value (RT), and may predict future performance.
Signal detection theory and methods for evaluating human performance in decision tasks

NASA Technical Reports Server (NTRS)

Obrien, Kevin; Feldman, Evan M.

1993-01-01

Signal Detection Theory (SDT) can be used to assess decision making performance in tasks that are not commonly thought of as perceptual. SDT takes into account both the sensitivity and biases in responding when explaining the detection of external events. In the standard SDT tasks, stimuli are selected in order to reveal the sensory capabilities of the observer. SDT can also be used to describe performance when decisions must be made as to the classification of easily and reliably sensed stimuli. Numbers are stimuli that are minimally affected by sensory processing and can belong to meaningful categories that overlap. Multiple studies have shown that the task of categorizing numbers from overlapping normal distributions produces performance predictable by SDT. These findings are particularly interesting in view of the similarity between the task of the categorizing numbers and that of determining the status of a mechanical system based on numerical values that represent sensor readings. Examples of the use of SDT to evaluate performance in decision tasks are reviewed. The methods and assumptions of SDT are shown to be effective in the measurement, evaluation, and prediction of human performance in such tasks.
Evaluation of pavement marking performance.

DOT National Transportation Integrated Search

2008-06-01

The objective of the investigation was to evaluate the useful life of pavement markings. The Manual on Uniform Traffic Control Devices (MUTCD) provides general guidelines for the application and installation of pavement markings. However, performance...
NACP Synthesis: Evaluating modeled carbon state and flux variables against multiple observational constraints (Invited)

NASA Astrophysics Data System (ADS)

Thornton, P. E.; Nacp Site Synthesis Participants

2010-12-01

The North American Carbon Program (NACP) synthesis effort includes an extensive intercomparison of modeled and observed ecosystem states and fluxes preformed with multiple models across multiple sites. The participating models span a range of complexity and intended application, while the participating sites cover a broad range of natural and managed ecosystems in North America, from the subtropics to arctic tundra, and coastal to interior climates. A unique characteristic of this collaborative effort is that multiple independent observations are available at all sites: fluxes are measured with the eddy covariance technique, and standard biometric and field sampling methods provide estimates of standing stock and annual production in multiple categories. In addition, multiple modeling approaches are employed to make predictions at each site, varying, for example, in the use of diagnostic vs. prognostic leaf area index. Given multiple independent observational constraints and multiple classes of model, we evaluate the internal consistency of observations at each site, and use this information to extend previously derived estimates of uncertainty in the flux observations. Model results are then compared with all available observations and models are ranked according to their consistency with each type of observation (high frequency flux measurement, carbon stock, annual production). We demonstrate a range of internal consistency across the sites, and show that some models which perform well against one observational metric perform poorly against others. We use this analysis to construct a hypothesis for combining eddy covariance, biometrics, and other standard physiological and ecological measurements which, as data collection proceeded over several years, would present an increasingly challenging target for next generation models.
An hierarchical approach to performance evaluation of expert systems

NASA Technical Reports Server (NTRS)

Dominick, Wayne D. (Editor); Kavi, Srinu

1985-01-01

The number and size of expert systems is growing rapidly. Formal evaluation of these systems - which is not performed for many systems - increases the acceptability by the user community and hence their success. Hierarchical evaluation that had been conducted for computer systems is applied for expert system performance evaluation. Expert systems are also evaluated by treating them as software systems (or programs). This paper reports many of the basic concepts and ideas in the Performance Evaluation of Expert Systems Study being conducted at the University of Southwestern Louisiana.
Model Evaluation with Multi-wavelength Satellite Observations Using a Neural Network

NASA Astrophysics Data System (ADS)

Kolassa, Jana; Jimenez, Carlos; Aires, Filipe

2013-04-01

A methodology has been developed to evaluate fields of modelled parameters against a set of satellite observations. The method employs a Neural Network (NN) to construct a statistical model capturing the relationship between the satellite observations and the parameter from a land surface model, in this case the Soil Moisture (SM). This statistical model is then used to estimate the parameter of interest from the set of satellite observations. These estimates are compared to the modelled parameter in order to detect local deviations indicating a possible problem in the model or in the satellite observations. Several synthetic tests, during which an artificial error was added to the"true" soil moisture fields, showed that the methodology is able to correct the errors (Jimenez et al., submitted, 2012). This evaluation technique is very general and can be applied to any modelled parameter for which sensitive satellite observations are available. The use of NNs simplifies the evaluation of the model against satellite observations and is particularly well-suited to utilize the synergy from the observations at different wavelengths (Aires et al., 2005, 2012). In this study the proposed methodology has been applied to evaluate SM fields from a number of land surface models against a synergy of satellite observations from passive and active microwave, infrared and visible sensors. In an inter-comparison of the performance of several land surface models (ORCHIDEE (de Rosnay et al., 2002), HTESSEL (Balsamo et al., 2009), JULES (Best et al., 2011) ) it was found that the soil moisture fields from JULES, HTESSEL and ORCHIDEE are very consistent with the observations, but ORCHIDEE soil moisture shows larger local deviations close to some river basins (Kolassa et al., in press, 2012; Jimenez et al., submitted, 2012). Differences between all models and the observations could also be observed in the Eastern US and over mountainous regions, however, the errors here are more likely
Operator performance evaluation using multi criteria decision making methods

NASA Astrophysics Data System (ADS)

Rani, Ruzanita Mat; Ismail, Wan Rosmanira; Razali, Siti Fatihah

2014-06-01

Operator performance evaluation is a very important operation in labor-intensive manufacturing industry because the company's productivity depends on the performance of its operators. The aims of operator performance evaluation are to give feedback to operators on their performance, to increase company's productivity and to identify strengths and weaknesses of each operator. In this paper, six multi criteria decision making methods; Analytical Hierarchy Process (AHP), fuzzy AHP (FAHP), ELECTRE, PROMETHEE II, Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS) and VlseKriterijumska Optimizacija I Kompromisno Resenje (VIKOR) are used to evaluate the operators' performance and to rank the operators. The performance evaluation is based on six main criteria; competency, experience and skill, teamwork and time punctuality, personal characteristics, capability and outcome. The study was conducted at one of the SME food manufacturing companies in Selangor. From the study, it is found that AHP and FAHP yielded the "outcome" criteria as the most important criteria. The results of operator performance evaluation showed that the same operator is ranked the first using all six methods.
Analysis of navigation performance for the Earth Observing System (EOS) using the TDRSS Onboard Navigation System (TONS)

NASA Technical Reports Server (NTRS)

Elrod, B.; Kapoor, A.; Folta, David C.; Liu, K.

1991-01-01

Use of the Tracking and Data Relay Satellite System (TDRSS) Onboard Navigation System (TONS) was proposed as an alternative to the Global Positioning System (GPS) for supporting the Earth Observing System (EOS) mission. The results are presented of EOS navigation performance evaluation with respect to TONS based orbit, time, and frequency determination (OD/TD/FD). Two TONS modes are considered: one uses scheduled TDRSS forward link service to derive one way Doppler tracking data for OD/FD support (TONS-I); the other uses an unscheduled navigation beacon service (proposed for Advanced TDRSS) to obtain pseudorange and Doppler data for OD/TD/FD support (TONS-II). Key objectives of the analysis were to evaluate nominal performance and potential sensitivities, such as suboptimal tracking geometry, tracking contact scheduling, and modeling parameter selection. OD/TD/FD performance predictions are presented based on covariance and simulation analyses. EOS navigation scenarios and the contributions of principal error sources impacting performance are also described. The results indicate that a TONS mode can be configured to meet current and proposed EOS position accuracy requirements of 100 and 50 m, respectively.
Evaluation of Model Performance over the Maritime Continent

NASA Astrophysics Data System (ADS)

Reynolds, C. A.; Barton, N. P.; Chen, S.; Flatau, M. K.; Ridout, J. A.; Janiga, M.; Jensen, T.; Richman, J. G.; Metzger, E. J.; Baranowski, D.

2017-12-01

The introduction of high-resolution global coupled models holds promise for extended-range (subseasonal to seasonal) prediction of high-impact weather. While forecast models have shown considerable improvement in the prediction of tropical phenomena on these timescales, specifically in the simulation and prediction of the Madden-Julian Oscillation (MJO), obstacles remain. In particular, many models still have difficulty accurately simulating the propagation of the MJO over the maritime continent. This has been hypothesized, at least in part, to be related to deficiencies in simulating the diurnal cycle over this region, which in turn is dependent on accurate representation of fine-scale atmosphere-ocean-land interactions, orography, and atmospheric convection. These issues have motivated the international Year of Maritime Continent (YMC) effort and the Office of Naval Research Propagation of Intra-Seasonal Tropical Oscillations (PISTON) initiative. In preparation for YMC and PISTON, we closely evaluate the performance of the Navy Earth System Model (NESM), a coupled global forecast model, in representing the diurnal cycle and other prominent phenomena in the maritime continent region. NESM performance is compared with stand-alone atmospheric simulations with prescribed fixed and analyzed sea surface temperatures (SSTs). Initial results from the Dynamics of the Madden-Julian Oscillation field phase (Fall 2011) period indicate that NESM is able to capture the precipitation day-time maximum over land and night-time maximum over ocean, but day-time precipitation over Borneo, Sumatra and the Malay Peninsula is too strong as compared to TRMM observations. The simulation of low-level winds qualitatively captures sea and land breeze patterns as compared with ERA-Interim analysis, with quantitative biases varying by island. The fully-coupled system and the stand-alone atmospheric model simulations are more similar to each other than to the observations, indicating that
Legal Aspects of Evaluating Teacher Performance.

ERIC Educational Resources Information Center

Beckham, Joseph C.

Chapter 14 in a book on school law concerns the legal aspects of evaluating teacher performance. Careful analysis of recent decisions makes it clear the courts will compel uniform standards and unprecedented rigor in teacher evaluation practices. Particularly in the consideration of equitable standards, state and federal courts are relying on…
40 CFR 35.115 - Evaluation of performance.

Code of Federal Regulations, 2010 CFR

2010-07-01

... requirements for progress reporting under 40 CFR 31.40(b). (b) Elements of the evaluation process. The.... The recipient may request review of the Regional Administrator's decision under the dispute processes... Evaluation of performance. (a) Joint evaluation process. The applicant and the Regional Administrator will...
Evaluating rainfall kinetic energy - intensity relationships with observed disdrometric data

NASA Astrophysics Data System (ADS)

Angulo-Martinez, Marta; Begueria, Santiago; Latorre, Borja

2016-04-01

Rainfall kinetic energy is required for determining erosivity, the ability of rainfall to detach soil particles and initiate erosion. Its determination relay on the use of disdrometers, i.e. devices capable of measuring the drop size distribution and velocity of falling raindrops. In the absence of such devices, rainfall kinetic energy is usually estimated with empirical expressions relating rainfall energy and intensity. We evaluated the performance of 14 rainfall energy equations in estimating one-minute rainfall energy and event total energy, in comparison with observed data from 821 rainfall episodes (more than 100 thousand one-minute observations) by means of an optical disdrometer. In addition, two sources of bias when using such relationships were evaluated: i) the influence of using theoretical terminal raindrop fall velocities instead of measured values; and ii) the influence of time aggregation (rainfall intensity data every 5-, 10-, 15-, 30-, and 60-minutes). Empirical relationships did a relatively good job when complete events were considered (R2 > 0.82), but offered poorer results for within-event (one-minute resolution) variation. Also, systematic biases where large for many equations. When raindrop size distribution was known, estimating the terminal fall velocities by empirical laws produced good results even at fine time resolution. The influence of time aggregation was very high in the estimated kinetic energy, although linear scaling may allow empirical correction. This results stress the importance of considering all these effects when rainfall energy needs to be estimated from more standard precipitation records. , and recommends the use of disdrometer data to locally determine rainfall kinetic energy.
An urban energy performance evaluation system and its computer implementation.

PubMed

Wang, Lei; Yuan, Guan; Long, Ruyin; Chen, Hong

2017-12-15

To improve the urban environment and effectively reflect and promote urban energy performance, an urban energy performance evaluation system was constructed, thereby strengthening urban environmental management capabilities. From the perspectives of internalization and externalization, a framework of evaluation indicators and key factors that determine urban energy performance and explore the reasons for differences in performance was proposed according to established theory and previous studies. Using the improved stochastic frontier analysis method, an urban energy performance evaluation and factor analysis model was built that brings performance evaluation and factor analysis into the same stage for study. According to data obtained for the Chinese provincial capitals from 2004 to 2013, the coefficients of the evaluation indicators and key factors were calculated by the urban energy performance evaluation and factor analysis model. These coefficients were then used to compile the program file. The urban energy performance evaluation system developed in this study was designed in three parts: a database, a distributed component server, and a human-machine interface. Its functions were designed as login, addition, edit, input, calculation, analysis, comparison, inquiry, and export. On the basis of these contents, an urban energy performance evaluation system was developed using Microsoft Visual Studio .NET 2015. The system can effectively reflect the status of and any changes in urban energy performance. Beijing was considered as an example to conduct an empirical study, which further verified the applicability and convenience of this evaluation system. Copyright © 2017 Elsevier Ltd. All rights reserved.
Performance-Based Evaluation and School Librarians

ERIC Educational Resources Information Center

Church, Audrey P.

2015-01-01

Evaluation of instructional personnel is standard procedure in our Pre-K-12 public schools, and its purpose is to document educator effectiveness. With Race to the Top and No Child Left Behind waivers, states are required to implement performance-based evaluations that demonstrate student academic progress. This three-year study describes the…

Theory and Practice on Teacher Performance Evaluation

ERIC Educational Resources Information Center

Yonghong, Cai; Chongde, Lin

2006-01-01

Teacher performance evaluation plays a key role in educational personnel reform, so it has been an important yet difficult issue in educational reform. Previous evaluations on teachers failed to make strict distinction among the three dominant types of evaluation, namely, capability, achievement, and effectiveness. Moreover, teacher performance…
Relationships of multitasking, physicians' strain, and performance: an observational study in ward physicians.

PubMed

Weigl, Matthias; Müller, Andreas; Sevdalis, Nick; Angerer, Peter

2013-03-01

Simultaneous task performance ("multitasking") is common in hospital physicians' work and is implicated as a major determinant for enhanced strain and detrimental performance. The aim was to determine the impact of multitasking by hospital physicians on their self reported strain and performance. A prospective observational time-and-motion study in a Community Hospital was conducted. Twenty-seven hospital physicians (surgical and internal specialties) were observed in 40 full-shift observations. Observed physicians reported twice on their self-monitored strain and performance during the observation time. Associations of observed multitasking events and subsequent strain and performance appraisals were calculated. About 21% of the working time physicians were engaged in simultaneous activities. The average time spent in multitasking activities correlated significantly with subsequently reported strain (r = 0.27, P = 0.018). The number of instances of multitasking activities correlated with self-monitored performance to a marginally significant level (r = 0.19, P = 0.098). Physicians who engage in multitasking activities tend to self-report better performance but at the cost of enhanced psychophysical strain. Hence, physicians do not perceive their own multitasking activities as a source for deficient performance, for example, medical errors. Readjustment of workload, improved organization of work for hospital physicians, and training programs to improve physicians' skills in dealing with multiple clinical demands, prioritization, and efficient task allocation may be useful avenues to explore to reduce the potentially negative impact of simultaneous task performance in clinical settings.
Influence of socioeconomic status on trauma center performance evaluations in a Canadian trauma system.

PubMed

Moore, Lynne; Turgeon, Alexis F; Sirois, Marie-Josée; Murat, Valérie; Lavoie, André

2011-09-01

Trauma center performance evaluations generally include adjustment for injury severity, age, and comorbidity. However, disparities across trauma centers may be due to other differences in source populations that are not accounted for, such as socioeconomic status (SES). We aimed to evaluate whether SES influences trauma center performance evaluations in an inclusive trauma system with universal access to health care. The study was based on data collected between 1999 and 2006 in a Canadian trauma system. Patient SES was quantified using an ecologic index of social and material deprivation. Performance evaluations were based on mortality adjusted using the Trauma Risk Adjustment Model. Agreement between performance results with and without additional adjustment for SES was evaluated with correlation coefficients. The study sample comprised a total of 71,784 patients from 48 trauma centers, including 3,828 deaths within 30 days (4.5%) and 5,549 deaths within 6 months (7.7%). The proportion of patients in the highest quintile of social and material deprivation varied from 3% to 43% and from 11% to 90% across hospitals, respectively. The correlation between performance results with or without adjustment for SES was almost perfect (r = 0.997; 95% CI 0.995-0.998) and the same hospital outliers were identified. We observed an important variation in SES across trauma centers but no change in risk-adjusted mortality estimates when SES was added to adjustment models. Results suggest that after adjustment for injury severity, age, comorbidity, and transfer status, disparities in SES across trauma center source populations do not influence trauma center performance evaluations in a system offering universal health coverage. Copyright © 2011 American College of Surgeons. Published by Elsevier Inc. All rights reserved.
48 CFR 36.201 - Evaluation of contractor performance.

Code of Federal Regulations, 2010 CFR

2010-10-01

... 48 Federal Acquisition Regulations System 1 2010-10-01 2010-10-01 false Evaluation of contractor performance. 36.201 Section 36.201 Federal Acquisition Regulations System FEDERAL ACQUISITION REGULATION... Contracting for Construction 36.201 Evaluation of contractor performance. See 42.1502(e) for the requirements...
Evaluation of Integrated Multi-satellitE Retrievals for GPM with All Weather Gauge Observations over CONUS

NASA Astrophysics Data System (ADS)

Chen, S.; Qi, Y.; Hu, B.; Hu, J.; Hong, Y.

2015-12-01

The Global Precipitation Measurement (GPM) mission is composed of an international network of satellites that provide the next-generation global observations of rain and snow. Integrated Multi-satellitE Retrievals for GPM (IMERG) is the state-of-art precipitation products with high spatio-temporal resolution of 0.1°/30min. IMERG unifies precipitation measurements from a constellation of research and operational satellites with the core sensors dual-frequency precipitation radar (DPR) and microwave imager (GMI) on board a "Core" satellite. Additionally, IMERG blends the advantages of currently most popular satellite-based quantitative precipitation estimates (QPE) algorithms, i.e. TRMM Multi-satellite Precipitation Analysis (TMPA), Climate Prediction Center morphing technique (CMORPH), Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks-Cloud Classification System (PERSIANN-CCS). The real-time and post real-time IMERG products are now available online at https://stormpps.gsfc.nasa.gov/storm. In this study, the final run post real-time IMERG is evaluated with all-weather manual gauge observations over CONUS from June 2014 through May 2015. Relative Bias (RB), Root-Mean-Squared Error (RMSE), Correlation Coefficient (CC), Probability Of Detection (POD), False Alarm Ratio (FAR), and Critical Success Index (CSI) are used to quantify the performance of IMERG. The performance of IMERG in estimating snowfall precipitation is highlighted in the study. This timely evaluation with all-weather gauge observations is expected to offer insights into performance of IMERG and thus provide useful feedback to the algorithm developers as well as the GPM data users.
Staging a performance: learners' perceptions about direct observation during residency.

PubMed

LaDonna, Kori A; Hatala, Rose; Lingard, Lorelei; Voyer, Stephane; Watling, Christopher

2017-05-01

Evidence strongly supports that direct observation is a valid and reliable assessment tool; support for its impact on learning is less compelling, and we know that some learners are ambivalent about being observed. However, learners' perceptions about the impact of direct observation on their learning and professional development remain underexplored. To promote learning, we need to understand what makes direct observation valuable for learners. Informed by constructivist grounded theory, we interviewed 22 learners about their observation experiences. Data collection and analysis occurred iteratively; themes were identified using constant comparative analysis. Direct observation was widely endorsed as an important educational strategy, albeit one that created significant anxiety. Opaque expectations exacerbated participants' discomfort, and participants described that being observed felt like being assessed. Consequently, participants exchanged their 'usual' practice for a 'textbook' approach; alterations to performance generated uncertainty about their role, and raised questions about whether observers saw an authentic portrayal of their knowledge and skill. An 'observer effect' may partly explain learners' ambivalence about direct observation; being observed seemed to magnify learners' role ambiguity, intensify their tensions around professional development and raise questions about the credibility of feedback. In turn, an observer effect may impact learners' receptivity to feedback and may explain, in part, learners' perceptions that useful feedback is scant. For direct observation to be valuable, educators must be explicit about expectations, and they must be aware that how learners perform in the presence of an observer may not reflect what they do as independent practitioners. To nurture learners' professional development, educators must create a culture of observation-based coaching that is divorced from assessment and is tailored to developing learners
40 CFR 35.515 - Evaluation of performance.

Code of Federal Regulations, 2010 CFR

2010-07-01

....515 Evaluation of performance. (a) Joint evaluation process. The applicant and the Regional... work plan (see section 35.507(b)(2)(iv)). A description of the evaluation process and reporting... annually and must satisfy the requirements for progress reporting under 40 CFR 31.40(b). (b) Elements of...
Error Reduction Program. [combustor performance evaluation codes

NASA Technical Reports Server (NTRS)

Syed, S. A.; Chiappetta, L. M.; Gosman, A. D.

1985-01-01

The details of a study to select, incorporate and evaluate the best available finite difference scheme to reduce numerical error in combustor performance evaluation codes are described. The combustor performance computer programs chosen were the two dimensional and three dimensional versions of Pratt & Whitney's TEACH code. The criteria used to select schemes required that the difference equations mirror the properties of the governing differential equation, be more accurate than the current hybrid difference scheme, be stable and economical, be compatible with TEACH codes, use only modest amounts of additional storage, and be relatively simple. The methods of assessment used in the selection process consisted of examination of the difference equation, evaluation of the properties of the coefficient matrix, Taylor series analysis, and performance on model problems. Five schemes from the literature and three schemes developed during the course of the study were evaluated. This effort resulted in the incorporation of a scheme in 3D-TEACH which is usuallly more accurate than the hybrid differencing method and never less accurate.
Experimental Evaluation of High Performance Integrated Heat Pump

DOE Office of Scientific and Technical Information (OSTI.GOV)

Miller, William A; Berry, Robert; Durfee, Neal

2016-01-01

Integrated heat pump (IHP) technology provides significant potential for energy savings and comfort improvement for residential buildings. In this study, we evaluate the performance of a high performance IHP that provides space heating, cooling, and water heating services. Experiments were conducted according to the ASHRAE Standard 206-2013 where 24 test conditions were identified in order to evaluate the IHP performance indices based on the airside performance. Empirical curve fits of the unit s compressor maps are used in conjunction with saturated condensing and evaporating refrigerant conditions to deduce the refrigerant mass flowrate, which, in turn was used to evaluate themore » refrigerant side performance as a check on the airside performance. Heat pump (compressor, fans, and controls) and water pump power were measured separately per requirements of Standard 206. The system was charged per the system manufacturer s specifications. System test results are presented for each operating mode. The overall IHP performance metrics are determined from the test results per the Standard 206 calculation procedures.« less
Building China's municipal healthcare performance evaluation system: a Tuscan perspective.

PubMed

Li, Hao; Barsanti, Sara; Bonini, Anna

2012-08-01

Regional healthcare performance evaluation systems can help optimize healthcare resources on regional basis and improve the performance of healthcare services provided. The Tuscany region in Italy is a good example of an institution which meets these requirements. China has yet to build such a system based on international experience. In this paper, based on comparative studies between Tuscany and China, we propose that the managing institutions in China's experimental cities can select and commission a third-party agency to, respectively, evaluate the performance of their affiliated hospitals and community health service centers. Following some features of the Tuscan experience, the Chinese municipal healthcare performance evaluation system can be built by focusing on the selection of an appropriate performance evaluation agency, the design of an adequate performance evaluation mechanism and the formulation of a complete set of laws, rules and regulations. When a performance evaluation system at city level is formed, the provincial government can extend the successful experience to other cities.
Observation and Teacher Quality: Critical Analysis of Observational Instruments in Preservice Teacher Performance Assessment

ERIC Educational Resources Information Center

Caughlan, Samantha; Jiang, Heng

2014-01-01

Teacher preparation programs commonly use observational instruments to assess the progress and the exit performances of teacher candidates. However, while these instruments have been described and several have been studied for effectiveness, the field lacks a close examination of how they position participants: teacher candidates, K-12 pupils, and…
Model Performance Evaluation and Scenario Analysis (MPESA)

EPA Pesticide Factsheets

Model Performance Evaluation and Scenario Analysis (MPESA) assesses the performance with which models predict time series data. The tool was developed Hydrological Simulation Program-Fortran (HSPF) and the Stormwater Management Model (SWMM)
The Linear Programming to evaluate the performance of Oral Health in Primary Care.

PubMed

Colussi, Claudia Flemming; Calvo, Maria Cristina Marino; Freitas, Sergio Fernando Torres de

2013-01-01

To show the use of Linear Programming to evaluate the performance of Oral Health in Primary Care. This study used data from 19 municipalities of Santa Catarina city that participated of the state evaluation in 2009 and have more than 50,000 habitants. A total of 40 indicators were evaluated, calculated using the Microsoft Excel 2007, and converted to the interval [0, 1] in ascending order (one indicating the best situation and zero indicating the worst situation). Applying the Linear Programming technique municipalities were assessed and compared among them according to performance curve named "quality estimated frontier". Municipalities included in the frontier were classified as excellent. Indicators were gathered, and became synthetic indicators. The majority of municipalities not included in the quality frontier (values different of 1.0) had lower values than 0.5, indicating poor performance. The model applied to the municipalities of Santa Catarina city assessed municipal management and local priorities rather than the goals imposed by pre-defined parameters. In the final analysis three municipalities were included in the "perceived quality frontier". The Linear Programming technique allowed to identify gaps that must be addressed by city managers to enhance actions taken. It also enabled to observe each municipal performance and compare results among similar municipalities.
An experimental study on CHVE's performance evaluation.

PubMed

Paiva, Paulo V F; Machado, Liliane S; Oliveira, Jauvane C

2012-01-01

Virtual reality-based training simulators, with collaborative capabilities, are known to improve the way users interact with one another while learning or improving skills on a given medical procedure. Performance evaluation of Collaborative Haptic Virtual Environments (CHVE) allows us to understand how such systems can work in the Internet, as well as the requirements for multisensorial and real-time data. This work discloses new performance evaluation results for the collaborative module of the CyberMed VR framework.
Evaluating performance of limestone prone to polishing.

DOT National Transportation Integrated Search

2009-12-01

This research project evaluated the effect of blending Vanport limestone and other aggregates on the frictional surface characteristic properties of constructed trial road surfaces. The study undertook the evaluation of the performance of different m...
Performance evaluation methodology for historical document image binarization.

PubMed

Ntirogiannis, Konstantinos; Gatos, Basilis; Pratikakis, Ioannis

2013-02-01

Document image binarization is of great importance in the document image analysis and recognition pipeline since it affects further stages of the recognition process. The evaluation of a binarization method aids in studying its algorithmic behavior, as well as verifying its effectiveness, by providing qualitative and quantitative indication of its performance. This paper addresses a pixel-based binarization evaluation methodology for historical handwritten/machine-printed document images. In the proposed evaluation scheme, the recall and precision evaluation measures are properly modified using a weighting scheme that diminishes any potential evaluation bias. Additional performance metrics of the proposed evaluation scheme consist of the percentage rates of broken and missed text, false alarms, background noise, character enlargement, and merging. Several experiments conducted in comparison with other pixel-based evaluation measures demonstrate the validity of the proposed evaluation scheme.
Who Should Evaluate Teachers' Performance at Schools?

ERIC Educational Resources Information Center

Tabancali, Erkan

2017-01-01

Correct determination of whether or not the objectives are achieved or of the level of achievement of the objectives is vital in educational organizations. In this context, one of the indicators of how teachers serve the organizational objectives is performance evaluation. In Turkish Educational System, the evaluation of the performance of…
Evaluating Models of Human Performance: Safety-Critical Systems Applications

NASA Technical Reports Server (NTRS)

Feary, Michael S.

2012-01-01

This presentation is part of panel discussion on Evaluating Models of Human Performance. The purpose of this panel is to discuss the increasing use of models in the world today and specifically focus on how to describe and evaluate models of human performance. My presentation will focus on discussions of generating distributions of performance, and the evaluation of different strategies for humans performing tasks with mixed initiative (Human-Automation) systems. I will also discuss issues with how to provide Human Performance modeling data to support decisions on acceptability and tradeoffs in the design of safety critical systems. I will conclude with challenges for the future.
Performance measurement for supply chain management and evaluation criteria determination for reverse supply chain management

NASA Astrophysics Data System (ADS)

Kongar, N. Elif

2004-12-01

Today, since customers are able to obtain similar-quality products for similar prices, the lead time has become the only preference criterion for most of the consumers. Therefore, it is crucial that the lead time, i.e., the time spent from the raw material phase till the manufactured good reaches the customer, is minimized. This issue can be investigated under the title of Supply Chain Management (SCM). An efficiently managed supply chain can lead to reduced response time for customers. To achieve this, continuous observation of supply chain efficiency, i.e., a constant performance evaluation of the current SCM is required. Widely used conventional performance measurement methods lack the ability to evaluate a SCM since the supply chain is a dynamic system that requires a more thorough and flexible performance measurement technique. Balanced Scorecard (BS) is an efficient tool for measuring the performance of dynamic systems and has a proven capability of providing the decision makers with the appropriate feedback data. In addition to SCM, a relatively new management field, namely reverse supply chain management (RSCM), also necessitates an appropriate evaluation approach. RSCM differs from SCM in many aspects, i.e., the criteria used for evaluation, the high level of uncertainty involved etc., not allowing the usage of identical evaluation techniques used for SCM. This study proposes a generic Balanced Scorecard to measure the performance of supply chain management while defining the appropriate performance measures for SCM. A scorecard prototype, ESCAPE, is presented to demonstrate the evaluation process.
Statistical properties of a utility measure of observer performance compared to area under the ROC curve

NASA Astrophysics Data System (ADS)

Abbey, Craig K.; Samuelson, Frank W.; Gallas, Brandon D.; Boone, John M.; Niklason, Loren T.

2013-03-01

The receiver operating characteristic (ROC) curve has become a common tool for evaluating diagnostic imaging technologies, and the primary endpoint of such evaluations is the area under the curve (AUC), which integrates sensitivity over the entire false positive range. An alternative figure of merit for ROC studies is expected utility (EU), which focuses on the relevant region of the ROC curve as defined by disease prevalence and the relative utility of the task. However if this measure is to be used, it must also have desirable statistical properties keep the burden of observer performance studies as low as possible. Here, we evaluate effect size and variability for EU and AUC. We use two observer performance studies recently submitted to the FDA to compare the EU and AUC endpoints. The studies were conducted using the multi-reader multi-case methodology in which all readers score all cases in all modalities. ROC curves from the study were used to generate both the AUC and EU values for each reader and modality. The EU measure was computed assuming an iso-utility slope of 1.03. We find mean effect sizes, the reader averaged difference between modalities, to be roughly 2.0 times as big for EU as AUC. The standard deviation across readers is roughly 1.4 times as large, suggesting better statistical properties for the EU endpoint. In a simple power analysis of paired comparison across readers, the utility measure required 36% fewer readers on average to achieve 80% statistical power compared to AUC.

How the Brain Converts Negative Evaluation into Performance Facilitation.

PubMed

Prévost, Charlotte; Lau, Hakwan; Mobbs, Dean

2018-02-01

Surpassing negative evaluation is a recurrent theme of success stories. Yet, there is little evidence supporting the counterintuitive idea that negative evaluation might not only motivate people, but also enhance performance. To address this question, we designed a task that required participants to decide whether taking up a risky challenge after receiving positive or negative evaluations from independent judges. Participants believed that these evaluations were based on their prior performance on a related task. Results showed that negative evaluation caused a facilitation in performance. Concurrent functional magnetic resonance imaging revealed that the motivating effect of negative evaluation was represented in the insula and striatum, while the performance boost was associated with functional positive connectivity between the insula and a set of brain regions involved in goal-directed behavior and the orienting of attention. These findings provide new insight into the neural representation of negative evaluation-induced facilitation. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Intra- and Inter-observer Variability of Measurements of the Laxity Index on Stress Radiographs Performed with the Vezzoni-Modified Badertscher Hip Distension Device.

PubMed

Bertal, Mileva; Vezzoni, Aldo; Houdellier, Blandine; Bogaerts, Evelien; Stock, Emmelie; Polis, Ingeborgh; Deforce, Dieter; Saunders, Jimmy H; Broeckx, Bart J G

2018-06-02

To describe and evaluate the accuracy, intra- and inter-observer variability of the laxity index (LI), used to quantify hip laxity on stress radiographs obtained with the Vezzoni-modified Badertscher distension device (VMBDD). Stress radiographs of 10 dogs obtained with the VMBDD were measured three times by an experienced observer. Six participants with different backgrounds (two ECVDI residents, two PhD students, two veterinary assistants) followed a short presentation and performed subsequently the measurements four times in two separate sessions. The effect of self-learning, feedback and specialization on the accuracy of the measurements was assessed. While the intra- and inter-observer variability were in agreement with other studies, the results of the experienced observer indicated that the variability can be very low. Neither feedback nor self-learning improved the results. A high degree of experience in radiographic assessment was not necessary to perform the measurements correctly. As the LI measurements were acceptable after a short presentation, they support the use of VMBDD for a complete and correct in-house evaluation of the hip joint by trained clinicians. However, we propose that, in the context of screening, measurements should be performed by a limited number of experienced examiners, to limit the impact of the inter-observer variability. Schattauer GmbH Stuttgart.
Estimating thermal performance curves from repeated field observations

USGS Publications Warehouse

Childress, Evan; Letcher, Benjamin H.

2017-01-01

Estimating thermal performance of organisms is critical for understanding population distributions and dynamics and predicting responses to climate change. Typically, performance curves are estimated using laboratory studies to isolate temperature effects, but other abiotic and biotic factors influence temperature-performance relationships in nature reducing these models' predictive ability. We present a model for estimating thermal performance curves from repeated field observations that includes environmental and individual variation. We fit the model in a Bayesian framework using MCMC sampling, which allowed for estimation of unobserved latent growth while propagating uncertainty. Fitting the model to simulated data varying in sampling design and parameter values demonstrated that the parameter estimates were accurate, precise, and unbiased. Fitting the model to individual growth data from wild trout revealed high out-of-sample predictive ability relative to laboratory-derived models, which produced more biased predictions for field performance. The field-based estimates of thermal maxima were lower than those based on laboratory studies. Under warming temperature scenarios, field-derived performance models predicted stronger declines in body size than laboratory-derived models, suggesting that laboratory-based models may underestimate climate change effects. The presented model estimates true, realized field performance, avoiding assumptions required for applying laboratory-based models to field performance, which should improve estimates of performance under climate change and advance thermal ecology.
NREL Evaluates Advanced Solar Inverter Performance for Hawaiian Electric

Science.gov Websites

Companies | Energy Systems Integration Facility | NREL NREL Evaluates Advanced Solar Inverter Performance for Hawaiian Electric Companies NREL Evaluates Advanced Solar Inverter Performance for Hawaiian performance and impacts of today's advanced solar inverters, as well as proprietary feedback to the inverter
Objective Situation Awareness Measurement Based on Performance Self-Evaluation

NASA Technical Reports Server (NTRS)

DeMaio, Joe

1998-01-01

The research was conducted in support of the NASA Safe All-Weather Flight Operations for Rotorcraft (SAFOR) program. The purpose of the work was to investigate the utility of two measurement tools developed by the British Defense Evaluation Research Agency. These tools were a subjective workload assessment scale, the DRA Workload Scale and a situation awareness measurement tool. The situation awareness tool uses a comparison of the crew's self-evaluation of performance against actual performance in order to determine what information the crew attended to during the performance. These two measurement tools were evaluated in the context of a test of innovative approach to alerting the crew by way of a helmet mounted display. The situation assessment data are reported here. The performance self-evaluation metric of situation awareness was found to be highly effective. It was used to evaluate situation awareness on a tank reconnaissance task, a tactical navigation task, and a stylized task used to evaluated handling qualities. Using the self-evaluation metric, it was possible to evaluate situation awareness, without exact knowledge the relevant information in some cases and to identify information to which the crew attended or failed to attend in others.
Detecting ecosystem performance anomalies for land management in the upper colorado river basin using satellite observations, climate data, and ecosystem models

USGS Publications Warehouse

Gu, Yingxin; Wylie, B.K.

2010-01-01

This study identifies areas with ecosystem performance anomalies (EPA) within the Upper Colorado River Basin (UCRB) during 2005-2007 using satellite observations, climate data, and ecosystem models. The final EPA maps with 250-m spatial resolution were categorized as normal performance, underperformance, and overperformance (observed performance relative to weather-based predictions) at the 90% level of confidence. The EPA maps were validated using "percentage of bare soil" ground observations. The validation results at locations with comparable site potential showed that regions identified as persistently underperforming (overperforming) tended to have a higher (lower) percentage of bare soil, suggesting that our preliminary EPA maps are reliable and agree with ground-based observations. The 3-year (2005-2007) persistent EPA map from this study provides the first quantitative evaluation of ecosystem performance anomalies within the UCRB and will help the Bureau of Land Management (BLM) identify potentially degraded lands. Results from this study can be used as a prototype by BLM and other land managers for making optimal land management decisions. ?? 2010 by the authors.
Detecting Ecosystem Performance Anomalies for Land Management in the Upper Colorado River Basin Using Satellite Observations, Climate Data, and Ecosystem Models

USGS Publications Warehouse

Gu, Yingxin; Wylie, Bruce K.

2010-01-01

This study identifies areas with ecosystem performance anomalies (EPA) within the Upper Colorado River Basin (UCRB) during 2005–2007 using satellite observations, climate data, and ecosystem models. The final EPA maps with 250-m spatial resolution were categorized as normal performance, underperformance, and overperformance (observed performance relative to weather-based predictions) at the 90% level of confidence. The EPA maps were validated using “percentage of bare soil” ground observations. The validation results at locations with comparable site potential showed that regions identified as persistently underperforming (overperforming) tended to have a higher (lower) percentage of bare soil, suggesting that our preliminary EPA maps are reliable and agree with ground-based observations. The 3-year (2005–2007) persistent EPA map from this study provides the first quantitative evaluation of ecosystem performance anomalies within the UCRB and will help the Bureau of Land Management (BLM) identify potentially degraded lands. Results from this study can be used as a prototype by BLM and other land managers for making optimal land management decisions.
The Relationship between Assessor/Assessee Gender and Performance Observation Ratings.

ERIC Educational Resources Information Center

Schuyten, Shana; Tashakkori, Abbas

The effects of the genders of the assessor and the assessee on performance observation ratings of beginning teachers were studied in public schools in Louisiana. Data was collected in the pilot phase of the Louisiana Teacher Assessment Program for Interns, which included both teacher observation and structured interview. Of the assessees who…
Conversations, Not Evaluations: An Alternative Model of Performance Management

ERIC Educational Resources Information Center

Lee, Christopher D.

2003-01-01

Traditional appraisal and evaluation systems focus almost exclusively on an employee's past performance. The desired result in each of these systems is better work performance. The very nature of most appraisals or evaluations, however, may inhibit performance unintentionally by focusing energy, attention and effort on past shortcomings rather…
Imaging acquisition display performance: an evaluation and discussion of performance metrics and procedures.

PubMed

Silosky, Michael S; Marsh, Rebecca M; Scherzinger, Ann L

2016-07-08

When The Joint Commission updated its Requirements for Diagnostic Imaging Services for hospitals and ambulatory care facilities on July 1, 2015, among the new requirements was an annual performance evaluation for acquisition workstation displays. The purpose of this work was to evaluate a large cohort of acquisition displays used in a clinical environment and compare the results with existing performance standards provided by the American College of Radiology (ACR) and the American Association of Physicists in Medicine (AAPM). Measurements of the minimum luminance, maximum luminance, and luminance uniformity, were performed on 42 acquisition displays across multiple imaging modalities. The mean values, standard deviations, and ranges were calculated for these metrics. Additionally, visual evaluations of contrast, spatial resolution, and distortion were performed using either the Society of Motion Pictures and Television Engineers test pattern or the TG-18-QC test pattern. Finally, an evaluation of local nonuniformities was performed using either a uniform white display or the TG-18-UN80 test pattern. Displays tested were flat panel, liquid crystal displays that ranged from less than 1 to up to 10 years of use and had been built by a wide variety of manufacturers. The mean values for Lmin and Lmax for the displays tested were 0.28 ± 0.13 cd/m2 and 135.07 ± 33.35 cd/m2, respectively. The mean maximum luminance deviation for both ultrasound and non-ultrasound displays was 12.61% ± 4.85% and 14.47% ± 5.36%, respectively. Visual evaluation of display performance varied depending on several factors including brightness and contrast settings and the test pattern used for image quality assessment. This work provides a snapshot of the performance of 42 acquisition displays across several imaging modalities in clinical use at a large medical center. Comparison with existing performance standards reveals that changes in display technology and the move from cathode ray
Behavioral patterns of environmental performance evaluation programs.

PubMed

Li, Wanxin; Mauerhofer, Volker

2016-11-01

During the past decades numerous environmental performance evaluation programs have been developed and implemented on different geographic scales. This paper develops a taxonomy of environmental management behavioral patterns in order to provide a practical comparison tool for environmental performance evaluation programs. Ten such programs purposively selected are mapped against the identified four behavioral patterns in the form of diagnosis, negotiation, learning, and socialization and learning. Overall, we found that schemes which serve to diagnose environmental abnormalities are mainly externally imposed and have been developed as a result of technical debates concerning data sources, methodology and ranking criteria. Learning oriented scheme is featured by processes through which free exchange of ideas, mutual and adaptive learning can occur. Scheme developed by higher authority for influencing behaviors of lower levels of government has been adopted by the evaluated to signal their excellent environmental performance. The socializing and learning classified evaluation schemes have incorporated dialogue, participation, and capacity building in program design. In conclusion we consider the 'fitness for purpose' of the various schemes, the merits of our analytical model and the future possibilities of fostering capacity building in the realm of wicked environmental challenges. Copyright © 2016 Elsevier Ltd. All rights reserved.
24 CFR 968.330 - PHA performance and evaluation report.

Code of Federal Regulations, 2010 CFR

2010-04-01

... 24 Housing and Urban Development 4 2010-04-01 2010-04-01 false PHA performance and evaluation... 250 or More Public Housing Units) § 968.330 PHA performance and evaluation report. For any FFY in which a PHA has received assistance under this subpart, the PHA shall submit a Performance and...
Global-scale evaluation of 22 precipitation datasets using gauge observations and hydrological modeling

NASA Astrophysics Data System (ADS)

Beck, Hylke E.; Vergopolan, Noemi; Pan, Ming; Levizzani, Vincenzo; van Dijk, Albert I. J. M.; Weedon, Graham P.; Brocca, Luca; Pappenberger, Florian; Huffman, George J.; Wood, Eric F.

2017-12-01

We undertook a comprehensive evaluation of 22 gridded (quasi-)global (sub-)daily precipitation (P) datasets for the period 2000-2016. Thirteen non-gauge-corrected P datasets were evaluated using daily P gauge observations from 76 086 gauges worldwide. Another nine gauge-corrected datasets were evaluated using hydrological modeling, by calibrating the HBV conceptual model against streamflow records for each of 9053 small to medium-sized ( < 50 000 km2) catchments worldwide, and comparing the resulting performance. Marked differences in spatio-temporal patterns and accuracy were found among the datasets. Among the uncorrected P datasets, the satellite- and reanalysis-based MSWEP-ng V1.2 and V2.0 datasets generally showed the best temporal correlations with the gauge observations, followed by the reanalyses (ERA-Interim, JRA-55, and NCEP-CFSR) and the satellite- and reanalysis-based CHIRP V2.0 dataset, the estimates based primarily on passive microwave remote sensing of rainfall (CMORPH V1.0, GSMaP V5/6, and TMPA 3B42RT V7) or near-surface soil moisture (SM2RAIN-ASCAT), and finally, estimates based primarily on thermal infrared imagery (GridSat V1.0, PERSIANN, and PERSIANN-CCS). Two of the three reanalyses (ERA-Interim and JRA-55) unexpectedly obtained lower trend errors than the satellite datasets. Among the corrected P datasets, the ones directly incorporating daily gauge data (CPC Unified, and MSWEP V1.2 and V2.0) generally provided the best calibration scores, although the good performance of the fully gauge-based CPC Unified is unlikely to translate to sparsely or ungauged regions. Next best results were obtained with P estimates directly incorporating temporally coarser gauge data (CHIRPS V2.0, GPCP-1DD V1.2, TMPA 3B42 V7, and WFDEI-CRU), which in turn outperformed the one indirectly incorporating gauge data through another multi-source dataset (PERSIANN-CDR V1R1). Our results highlight large differences in estimation accuracy, and hence the importance of P
Global-scale evaluation of 22 precipitation datasets using gauge observations and hydrological modeling

NASA Astrophysics Data System (ADS)

Beck, H.; Vergopolan, N.; Pan, M.; Levizzani, V.; van Dijk, A.; Weedon, G. P.; Brocca, L.; Huffman, G. J.; Wood, E. F.; William, L.

2017-12-01

We undertook a comprehensive evaluation of 22 gridded (quasi-)global (sub-)daily precipitation (P) datasets for the period 2000-2016. Twelve non-gauge-corrected P datasets were evaluated using daily P gauge observations from 76,086 gauges worldwide. Another ten gauge-corrected ones were evaluated using hydrological modeling, by calibrating the conceptual model HBV against streamflow records for each of 9053 small to medium-sized (<50,000 km2) catchments worldwide, and comparing the resulting performance. Marked differences in spatio-temporal patterns and accuracy were found among the datasets. Among the uncorrected P datasets, the satellite- and reanalysis-based MSWEP-ng V1.2 and V2.0 datasets generally showed the best temporal correlations with the gauge observations, followed by the reanalyses (ERA-Interim, JRA-55, and NCEP-CFSR), the estimates based primarily on passive microwave remote sensing of rainfall (CMORPH V1.0, GSMaP V5/6, and TMPA 3B42RT V7) or near-surface soil moisture (SM2RAIN-ASCAT), and finally, estimates based primarily on thermal infrared imagery (GridSat V1.0, PERSIANN, and PERSIANN-CCS). Two of the three reanalyses (ERA-Interim and JRA-55) unexpectedly obtained lower trend errors than the satellite datasets. Among the corrected P datasets, the ones directly incorporating daily gauge data (CPC Unified and MSWEP V1.2 and V2.0) generally provided the best calibration scores, although the good performance of the fully gauge-based CPC Unified is unlikely to translate to sparsely or ungauged regions. Next best results were obtained with P estimates directly incorporating temporally coarser gauge data (CHIRPS V2.0, GPCP-1DD V1.2, TMPA 3B42 V7, and WFDEI-CRU), which in turn outperformed those indirectly incorporating gauge data through other multi-source datasets (PERSIANN-CDR V1R1 and PGF). Our results highlight large differences in estimation accuracy, and hence, the importance of P dataset selection in both research and operational applications
Performance evaluation of the Engineering Analysis and Data Systems (EADS) 2

NASA Technical Reports Server (NTRS)

Debrunner, Linda S.

1994-01-01

The Engineering Analysis and Data System (EADS)II (1) was installed in March 1993 to provide high performance computing for science and engineering at Marshall Space Flight Center (MSFC). EADS II increased the computing capabilities over the existing EADS facility in the areas of throughput and mass storage. EADS II includes a Vector Processor Compute System (VPCS), a Virtual Memory Compute System (CFS), a Common Output System (COS), as well as Image Processing Station, Mini Super Computers, and Intelligent Workstations. These facilities are interconnected by a sophisticated network system. This work considers only the performance of the VPCS and the CFS. The VPCS is a Cray YMP. The CFS is implemented on an RS 6000 using the UniTree Mass Storage System. To better meet the science and engineering computing requirements, EADS II must be monitored, its performance analyzed, and appropriate modifications for performance improvement made. Implementing this approach requires tool(s) to assist in performance monitoring and analysis. In Spring 1994, PerfStat 2.0 was purchased to meet these needs for the VPCS and the CFS. PerfStat(2) is a set of tools that can be used to analyze both historical and real-time performance data. Its flexible design allows significant user customization. The user identifies what data is collected, how it is classified, and how it is displayed for evaluation. Both graphical and tabular displays are supported. The capability of the PerfStat tool was evaluated, appropriate modifications to EADS II to optimize throughput and enhance productivity were suggested and implemented, and the effects of these modifications on the systems performance were observed. In this paper, the PerfStat tool is described, then its use with EADS II is outlined briefly. Next, the evaluation of the VPCS, as well as the modifications made to the system are described. Finally, conclusions are drawn and recommendations for future worked are outlined.
Cognitive Correlates of Functional Abilities in Individuals with Mild Cognitive Impairment: Comparison of Questionnaire, Direct Observation and Performance-based Measures

PubMed Central

Schmitter-Edgecombe, Maureen; Parsey, Carolyn M.

2014-01-01

The relationship between and the cognitive correlates of several proxy measures of functional status were studied in a population with mild cognitive impairment (MCI). Participants were 51 individuals diagnosed with MCI and 51 cognitively healthy older adults (OA). Participants completed performance-based functional status tests, standardized neuropsychological tests, and performed eight activities of daily living (e.g., watered plants, filled medication dispenser) while under direct observation in a campus apartment. An informant interview about everyday functioning was also conducted. Compared to the OA control group, the MCI group performed more poorly on all proxy measures of everyday functioning. The informant-report of instrumental activities of daily living (IADL) did not correlate with the two performance-based measures; however, both the informant-report IADL and the performance-based everyday problem-solving test correlated with the direct observation measure. After controlling for age and education, cognitive predictors did not explain a significant amount of variance in the performance-based measures; however, performance on a delayed memory task was a unique predictor for the informant-report IADL, and processing speed predicted unique variance for the direct observation score. These findings indicate that differing methods for evaluating functional status are not assessing completely overlapping aspects of everyday functioning in the MCI population. PMID:24766574
Cognitive correlates of functional abilities in individuals with mild cognitive impairment: comparison of questionnaire, direct observation, and performance-based measures.

PubMed

Schmitter-Edgecombe, Maureen; Parsey, Carolyn M

2014-01-01

The relationship between, and the cognitive correlates of, several proxy measures of functional status were studied in a population with mild cognitive impairment (MCI). Participants were 51 individuals diagnosed with MCI and 51 cognitively healthy older adults (OA). Participants completed performance-based functional status tests and standardized neuropsychological tests, and performed eight activities of daily living (e.g., watered plants, filled medication dispenser) while under direct observation in a campus apartment. An informant interview about everyday functioning was also conducted. Compared to the OA control group, the MCI group performed more poorly on all proxy measures of everyday functioning. The informant report of instrumental activities of daily living (IADL) did not correlate with the two performance-based measures; however, both the informant-report IADL and the performance-based everyday problem-solving test correlated with the direct observation measure. After controlling for age and education, cognitive predictors did not explain a significant amount of variance in the performance-based measures; however, performance on a delayed memory task was a unique predictor for the informant-report IADL, and processing speed predicted unique variance for the direct observation score. These findings indicate that differing methods for evaluating functional status are not assessing completely overlapping aspects of everyday functioning in the MCI population.
Sustainability performance evaluation: Literature review and future directions.

PubMed

Büyüközkan, Gülçin; Karabulut, Yağmur

2018-07-01

Current global economic activities are increasingly being perceived as unsustainable. Despite the high number of publications, sustainability science remains highly dispersed over diverse approaches and topics. This article aims to provide a structured overview of sustainability performance evaluation related publications and to document the current state of literature, categorize publications, analyze and link trends, as well as highlight gaps and provide research recommendations. 128 articles between 2007 and 2018 are identified. The results suggest that sustainability performance evaluation models shall be more balanced, suitable criteria and their interrelations shall be well defined and subjectivity of qualitative criteria inherent to sustainability indicators shall be considered. To address this subjectivity, group decision-making techniques and other analytical methods that can deal with uncertainty, conflicting indicators, and linguistic evaluations can be used in future works. By presenting research gaps, this review stimulates researchers to establish practically applicable sustainability performance evaluation frameworks to help assess and compare the degree of sustainability, leading to more sustainable business practices. The review is unique in defining corporate sustainability performance evaluation for the first time, exploring the gap between sustainability accounting and sustainability assessment, and coming up with a structured overview of innovative research recommendations about integrating analytical assessment methods into conceptual sustainability frameworks. Copyright © 2018 Elsevier Ltd. All rights reserved.
Performance evaluation of Teledyne Geotech bivane

DOE Office of Scientific and Technical Information (OSTI.GOV)

Addis, R.P.

1986-05-13

The new production prototype bivane manufactured by Teledyne Geotech underwent tests to evaluate its performance and determine its suitability as a replacement for obsolete instrumentation presently on the SRP meteorological towers. The bivane performs well for routine observations for emergency response, as well as for most routine plume dispersion research to be conducted at SRL for the foreseeable future. It should also be suitable for providing an accurate and reliable meteorological data base for engineering and meteorological applications for the next ten years. The bivane was tested in a wind tunnel where its Damping Ratio was found to be 0.30more » (azimuth) and 0.29 (elevation), which contrasts with 0.4 claimed by the manufacturer's preliminary specifications. Although the measured damping is less than the optimum value (0.43), it is estimated that the bivane will be able to measure the turbulent parameters (standard deviation of azimuth and elevation) used in the SRP emergency response codes, within 8%. The bivane's suitability as a research tool for measuring turbulent fluxes was determined by comparison with results from a sonic anemometer. The mean bivane momentum flux measurements were within 5% of those of the sonic, averaged over all measured flux intensities, and within 10% of the sonic for fluxes less than or equal to -0.05 m/sup 2//s/sup 2/. During periods of low fluxes, such as may occur under stable nocturnal conditions, a higher damping ratio (approx. 0.4) and a smaller natural wavelength would improve the bivane response to high frequency turbulence. The cup anemometer paired with the bivane, also performed well in the tests. An intercomparison of wind speeds with those measured by the sonic anemometer showed a mean difference of only 1 cm/s (0.02 mph).« less
Evaluation of confocal microscopy system performance.

PubMed

Zucker, R M; Price, O

2001-08-01

The confocal laser scanning microscope (CLSM) has been used by scientists to visualize three-dimensional (3D) biological samples. Although this system involves lasers, electronics, optics, and microscopes, there are few published tests that can be used to assess the performance of this equipment. Usually the CLSM is assessed by subjectively evaluating a biological/histological test slide for image quality. Although there is a use for the test slide, there are many other components in the CLSM that need to be assessed. It would be useful if tests existed that produced reference values for machine performance. The aim of this research was to develop quality assurance tests to ensure that the CLSM was stable while delivering reproducible intensity measurements with excellent image quality. Our ultimate research objective was to quantify fluorescence using a CLSM. To achieve this goal, it is essential that the CLSM be stable while delivering known parameters of performance. Using Leica TCS-SP1 and TCS-4D systems, a number of tests have been devised to evaluate equipment performance. Tests measuring dichroic reflectivity, field illumination, lens performance, laser power output, spectral registration, axial resolution, laser stability, photomultiplier tube (PMT) reliability, and system noise were either incorporated from the literature or derived in our laboratory to measure performance. These tests are also applicable to other manufacturer's systems with minor modifications. A preliminary report from our laboratory has addressed a number of the QA issues necessary to achieve CLSM performance. This report extends our initial work on the evaluation of CLSM system performance. Tests that were described previously have been modified and new tests involved in laser stability and sensitivity are described. The QA tests on the CLSM measured laser power, PMT function, dichroic reflection, spectral registration, axial registration, system noise and sensitivity, lens performance

The Domain Five Observation Instrument: A Competency-Based Coach Evaluation Tool

ERIC Educational Resources Information Center

Shangraw, Rebecca

2017-01-01

The Domain Five Observation Instrument (DFOI) is a competency-based observation instrument recommended for sport leaders or researchers who wish to evaluate coaches' instructional behaviors. The DFOI includes 10 behavior categories and four timed categories that encompass 34 observable instructional benchmarks outlined in domain five of the…
A Study of the Associations between Conditions of Performance and Characteristics of Performers and New York State Solo Performance Ratings

ERIC Educational Resources Information Center

vonWurmb, Elizabeth C.

2013-01-01

This dissertation undertakes an analysis of 1,044 performance evaluations from New York State School Music Association (NYSSMA) Spring Festival solo adjudication ratings of student performers from a large suburban school district. It relies on results of evaluations of observed performances, and takes these evaluations as assessments of what the…
Ensemble of trees approaches to risk adjustment for evaluating a hospital's performance.

PubMed

Liu, Yang; Traskin, Mikhail; Lorch, Scott A; George, Edward I; Small, Dylan

2015-03-01

A commonly used method for evaluating a hospital's performance on an outcome is to compare the hospital's observed outcome rate to the hospital's expected outcome rate given its patient (case) mix and service. The process of calculating the hospital's expected outcome rate given its patient mix and service is called risk adjustment (Iezzoni 1997). Risk adjustment is critical for accurately evaluating and comparing hospitals' performances since we would not want to unfairly penalize a hospital just because it treats sicker patients. The key to risk adjustment is accurately estimating the probability of an Outcome given patient characteristics. For cases with binary outcomes, the method that is commonly used in risk adjustment is logistic regression. In this paper, we consider ensemble of trees methods as alternatives for risk adjustment, including random forests and Bayesian additive regression trees (BART). Both random forests and BART are modern machine learning methods that have been shown recently to have excellent performance for prediction of outcomes in many settings. We apply these methods to carry out risk adjustment for the performance of neonatal intensive care units (NICU). We show that these ensemble of trees methods outperform logistic regression in predicting mortality among babies treated in NICU, and provide a superior method of risk adjustment compared to logistic regression.
Models for evaluating the performability of degradable computing systems

NASA Technical Reports Server (NTRS)

Wu, L. T.

1982-01-01

Recent advances in multiprocessor technology established the need for unified methods to evaluate computing systems performance and reliability. In response to this modeling need, a general modeling framework that permits the modeling, analysis and evaluation of degradable computing systems is considered. Within this framework, several user oriented performance variables are identified and shown to be proper generalizations of the traditional notions of system performance and reliability. Furthermore, a time varying version of the model is developed to generalize the traditional fault tree reliability evaluation methods of phased missions.
Human performance evaluation in dual-axis critical task tracking

NASA Technical Reports Server (NTRS)

Ritchie, M. L.; Nataraj, N. S.

1975-01-01

A dual axis tracking using a multiloop critical task was set up to evaluate human performance. The effects of control stick variation and display formats are evaluated. A secondary loading was used to measure the degradation in tracking performance.
Improved human observer performance in digital reconstructed radiograph verification in head and neck cancer radiotherapy.

PubMed

Sturgeon, Jared D; Cox, John A; Mayo, Lauren L; Gunn, G Brandon; Zhang, Lifei; Balter, Peter A; Dong, Lei; Awan, Musaddiq; Kocak-Uzel, Esengul; Mohamed, Abdallah Sherif Radwan; Rosenthal, David I; Fuller, Clifton David

2015-10-01

Digitally reconstructed radiographs (DRRs) are routinely used as an a priori reference for setup correction in radiotherapy. The spatial resolution of DRRs may be improved to reduce setup error in fractionated radiotherapy treatment protocols. The influence of finer CT slice thickness reconstruction (STR) and resultant increased resolution DRRs on physician setup accuracy was prospectively evaluated. Four head and neck patient CT-simulation images were acquired and used to create DRR cohorts by varying STRs at 0.5, 1, 2, 2.5, and 3 mm. DRRs were displaced relative to a fixed isocenter using 0-5 mm random shifts in the three cardinal axes. Physician observers reviewed DRRs of varying STRs and displacements and then aligned reference and test DRRs replicating daily KV imaging workflow. A total of 1,064 images were reviewed by four blinded physicians. Observer errors were analyzed using nonparametric statistics (Friedman's test) to determine whether STR cohorts had detectably different displacement profiles. Post hoc bootstrap resampling was applied to evaluate potential generalizability. The observer-based trial revealed a statistically significant difference between cohort means for observer displacement vector error ([Formula: see text]) and for [Formula: see text]-axis [Formula: see text]. Bootstrap analysis suggests a 15% gain in isocenter translational setup error with reduction of STR from 3 mm to [Formula: see text]2 mm, though interobserver variance was a larger feature than STR-associated measurement variance. Higher resolution DRRs generated using finer CT scan STR resulted in improved observer performance at shift detection and could decrease operator-dependent geometric error. Ideally, CT STRs [Formula: see text]2 mm should be utilized for DRR generation in the head and neck.
A comparison of resampling schemes for estimating model observer performance with small ensembles

NASA Astrophysics Data System (ADS)

Elshahaby, Fatma E. A.; Jha, Abhinav K.; Ghaly, Michael; Frey, Eric C.

2017-09-01

In objective assessment of image quality, an ensemble of images is used to compute the 1st and 2nd order statistics of the data. Often, only a finite number of images is available, leading to the issue of statistical variability in numerical observer performance. Resampling-based strategies can help overcome this issue. In this paper, we compared different combinations of resampling schemes (the leave-one-out (LOO) and the half-train/half-test (HT/HT)) and model observers (the conventional channelized Hotelling observer (CHO), channelized linear discriminant (CLD) and channelized quadratic discriminant). Observer performance was quantified by the area under the ROC curve (AUC). For a binary classification task and for each observer, the AUC value for an ensemble size of 2000 samples per class served as a gold standard for that observer. Results indicated that each observer yielded a different performance depending on the ensemble size and the resampling scheme. For a small ensemble size, the combination [CHO, HT/HT] had more accurate rankings than the combination [CHO, LOO]. Using the LOO scheme, the CLD and CHO had similar performance for large ensembles. However, the CLD outperformed the CHO and gave more accurate rankings for smaller ensembles. As the ensemble size decreased, the performance of the [CHO, LOO] combination seriously deteriorated as opposed to the [CLD, LOO] combination. Thus, it might be desirable to use the CLD with the LOO scheme when smaller ensemble size is available.
Intraoperative performance and postoperative outcomes of microcoaxial phacoemulsification. Observational study.

PubMed

Vasavada, Viraj; Vasavada, Vaishali; Raj, Shetal M; Vasavada, Abhay R

2007-06-01

To evaluate the intraoperative performance and postoperative outcomes after microcoaxial phacoemulsification. Iladevi Cataract & IOL Research Centre, Ahmedabad, India. A prospective observational case series comprised 84 eyes with age-related uncomplicated cataract having microcoaxial phacoemulsification through a 2.2 mm clear corneal incision by a standard surgical technique. Phacoemulsification parameters (Infiniti Vision System, Alcon) were microburst width, 30 ms; preset power, 50%; vacuum, 650 mm Hg; aspiration flow rate, 25 cc/minute. A single-piece Alcon AcrySof intraocular lens was implanted with the C cartridge (Alcon) cartridge. The incision was measured at the end of surgery. Observations included surgical time (from commencement of sculpting to end of epinucleus removal), cumulative dissipated energy (CDE), wound burns, intraoperative complications, postoperative increase in mean central corneal thickness (CCT) at 1 day and 1 month, mean % decrease in endothelial cell density (ECD), absolute mean change in coefficient of variation (cv) 3 months, and uncorrected visual acuity (UCVA) at 1 day. Data were analyzed using a 1-sample t test with 95% confidence intervals (CIs). The mean follow up was 3 months +/- 0.3 (SD). The mean incision size at the end of surgery was 2.3 +/- .09 mm; mean surgical time, 4.5 +/- 1.5 minutes; and mean CDE, 2.3 +/- 2.2 seconds. No wound burns or other intraoperative complications occurred. The postoperative CCT increased by a mean of 16 microm at 1 day (95% CI, 8-25; P = .66;) and by a mean of 3.14 microm at 1 month (95% CI, 2.26-4.05; P = .92). The ECD decreased by a mean of 5.8% (95% CI, 6.8-3.5; P = .82) and the mean coefficient of variation, by 3.3 (95% CI, 4.5-2.0; P = .65). At 1 day, the UCVA was 20/20 in 29% of cases, 20/20 to 20/40 in 58%, and 20/40 to 20/50 in 12%. Microcoaxial phacoemulsification was safely and effectively performed, achieving consistent and satisfactory postoperative outcomes.
User-friendly tools on handheld devices for observer performance study

NASA Astrophysics Data System (ADS)

Matsumoto, Takuya; Hara, Takeshi; Shiraishi, Junji; Fukuoka, Daisuke; Abe, Hiroyuki; Matsusako, Masaki; Yamada, Akira; Zhou, Xiangrong; Fujita, Hiroshi

2012-02-01

ROC studies require complex procedures to select cases from many data samples, and to set confidence levels in each selected case to generate ROC curves. In some observer performance studies, researchers have to develop software with specific graphical user interface (GUI) to obtain confidence levels from readers. Because ROC studies could be designed for various clinical situations, it is difficult task for preparing software corresponding to every ROC studies. In this work, we have developed software for recording confidence levels during observer studies on tiny personal handheld devices such as iPhone, iPod touch, and iPad. To confirm the functions of our software, three radiologists performed observer studies to detect lung nodules by using public database of chest radiograms published by Japan Society of Radiological Technology. The output in text format conformed to the format for the famous ROC kit from the University of Chicago. Times required for the reading each case was recorded very precisely.
ATAMM enhancement and multiprocessor performance evaluation

NASA Technical Reports Server (NTRS)

Stoughton, John W.; Mielke, Roland R.; Som, Sukhamoy; Obando, Rodrigo; Malekpour, Mahyar R.; Jones, Robert L., III; Mandala, Brij Mohan V.

1991-01-01

ATAMM (Algorithm To Architecture Mapping Model) enhancement and multiprocessor performance evaluation is discussed. The following topics are included: the ATAMM model; ATAMM enhancement; ADM (Advanced Development Model) implementation of ATAMM; and ATAMM support tools.
Effect of prior performance on subsequent performance evaluation by field independent-dependent raters.

PubMed

Sisco, Howard; Leventhal, Gloria

2007-12-01

The importance of accurate performance appraisals is central to many aspects of personnel activities in organizations. This study examined threats due to past performance to accuracy of evaluation of subsequent performance by raters differing in scores on field dependence. 162 college students were classified as Field-dependent (n = 81) or Field-independent (n = 81), using a median split on the Group Embedded Figures Test. Past performance (a lecture) was good or poor, presented directly via a videotape or indirectly via a written evaluation to the Field-independent or Field-dependent groups. Analysis indicated the hypothesized contrast effect (ratings in the opposite direction from that of prior ratings) in the Direct condition and an unexpected, albeit smaller, contrast effect in the Indirect condition. There were also differential effects of performance, presentation, and field dependency on rating of lecturer's style and ability.
Performance Evaluation Methods for Assistive Robotic Technology

NASA Astrophysics Data System (ADS)

Tsui, Katherine M.; Feil-Seifer, David J.; Matarić, Maja J.; Yanco, Holly A.

Robots have been developed for several assistive technology domains, including intervention for Autism Spectrum Disorders, eldercare, and post-stroke rehabilitation. Assistive robots have also been used to promote independent living through the use of devices such as intelligent wheelchairs, assistive robotic arms, and external limb prostheses. Work in the broad field of assistive robotic technology can be divided into two major research phases: technology development, in which new devices, software, and interfaces are created; and clinical, in which assistive technology is applied to a given end-user population. Moving from technology development towards clinical applications is a significant challenge. Developing performance metrics for assistive robots poses a related set of challenges. In this paper, we survey several areas of assistive robotic technology in order to derive and demonstrate domain-specific means for evaluating the performance of such systems. We also present two case studies of applied performance measures and a discussion regarding the ubiquity of functional performance measures across the sampled domains. Finally, we present guidelines for incorporating human performance metrics into end-user evaluations of assistive robotic technologies.
Measuring Medical Housestaff Teamwork Performance Using Multiple Direct Observation Instruments: Comparing Apples and Apples.

PubMed

Weingart, Saul N; Yaghi, Omar; Wetherell, Matthew; Sweeney, Megan

2018-04-10

To examine the composition and concordance of existing instruments used to assess medical teams' performance. A trained observer joined 20 internal medicine housestaff teams for morning work rounds at Tufts Medical Center, a 415-bed Boston teaching hospital, from October through December 2015. The observer rated each team's performance using 9 teamwork observation instruments that examined domains including team structure, leadership, situation monitoring, mutual support, and communication. Observations recorded on paper forms were stored electronically. Scores were normalized from 1 (low) to 5 (high) to account for different rating scales. Overall mean scores were calculated and graphed; weighted scores adjusted for the number of items in each teamwork domain. Teamwork scores were analyzed using t-tests, pair-wise correlations, and the Kruskal-Wallis statistic, and team performance was compared across instruments by domain. The 9 tools incorporated 5 major domains, with 5-35 items per instrument for a total of 161 items per observation session. In weighted and unweighted analyses, the overall teamwork performance score for a given team on a given day varied by instrument. While all of the tools identified the same low outlier, high performers on some instruments were low performers on others. Inconsistent scores for a given team across instruments persisted in domain-level analyses. There was substantial variation in the rating of individual teams assessed concurrently by a single observer using multiple instruments. Since existing teamwork observation tools do not yield concordant assessments, researchers should create better tools for measuring teamwork performance.
Evaluating Gridded Spring Indices Using the USA National Phenology Network's Observational Phenology Data

NASA Astrophysics Data System (ADS)

Crimmins, T. M.; Gerst, K.

2017-12-01

The USA National Phenology Network (USA-NPN; www.usanpn.org) produces and freely delivers daily and short-term forecast maps of spring onset dates at fine spatial scale for the conterminous United States and Alaska using the Spring Indices. These models, which represent the start of biological activity in the spring season, were developed using a long-term observational record of four species of lilacs and honeysuckles contributed by volunteer observers. Three of the four species continue to be tracked through the USA-NPN's phenology observation program, Nature's Notebook. The gridded Spring Index maps have utility for a wide range of natural resource planning and management applications, including scheduling invasive species and pest detection and control activities, anticipating allergy outbreaks and planning agricultural harvest dates. However, to date, there has not been a comprehensive assessment of how well the gridded Spring Index maps accurately reflect phenological activity in lilacs and honeysuckles or other species of plants. In this study, we used observational plant phenology data maintained by the USA-NPN to evaluate how well the gridded Spring Index maps match leaf and flowering onset dates in a) the lilac and honeysuckle species used to construct the models and b) in several species of deciduous trees. The Spring Index performed strongly at predicting the timing of leaf-out and flowering in lilacs and honeysuckles. The average error between predicted and observed date of onset ranged from 5.9 to 11.4 days. Flowering models performed slightly better than leaf-out models. The degree to which the Spring Indices predicted native deciduous tree leaf and flower phenology varied by year, species, and region. Generally, the models were better predictors of leaf and flowering onset dates in the Northeastern and Midwestern US. These results reveal when and where the Spring Indices are a meaningful proxy of phenological activity across the United States.
The dependability of medical students' performance ratings as documented on in-training evaluations.

PubMed

van Barneveld, Christina

2005-03-01

To demonstrate an approach to obtain an unbiased estimate of the dependability of students' performance ratings during training, when the data-collection design includes nesting of student in rater, unbalanced nest sizes, and dependent observations. In 2003, two variance components analyses of in-training evaluation (ITE) report data were conducted using urGENOVA software. In the first analysis, the dependability for the nested and unbalanced data-collection design was calculated. In the second analysis, an approach using multiple generalizability studies was used to obtain an unbiased estimate of the student variance component, resulting in an unbiased estimate of dependability. Results suggested that there is bias in estimates of the dependability of students' performance on ITEs that are attributable to the data-collection design. When the bias was corrected, the results indicated that the dependability of ratings of student performance was almost zero. The combination of the multiple generalizability studies method and the use of specialized software provides an unbiased estimate of the dependability of ratings of student performance on ITE scores for data-collection designs that include nesting of student in rater, unbalanced nest sizes, and dependent observations.
Methodologies for evaluating performance and assessing uncertainty of atmospheric dispersion models

NASA Astrophysics Data System (ADS)

Chang, Joseph C.

This thesis describes methodologies to evaluate the performance and to assess the uncertainty of atmospheric dispersion models, tools that predict the fate of gases and aerosols upon their release into the atmosphere. Because of the large economic and public-health impacts often associated with the use of the dispersion model results, these models should be properly evaluated, and their uncertainty should be properly accounted for and understood. The CALPUFF, HPAC, and VLSTRACK dispersion modeling systems were applied to the Dipole Pride (DP26) field data (˜20 km in scale), in order to demonstrate the evaluation and uncertainty assessment methodologies. Dispersion model performance was found to be strongly dependent on the wind models used to generate gridded wind fields from observed station data. This is because, despite the fact that the test site was a flat area, the observed surface wind fields still showed considerable spatial variability, partly because of the surrounding mountains. It was found that the two components were comparable for the DP26 field data, with variability more important than uncertainty closer to the source, and less important farther away from the source. Therefore, reducing data errors for input meteorology may not necessarily increase model accuracy due to random turbulence. DP26 was a research-grade field experiment, where the source, meteorological, and concentration data were all well-measured. Another typical application of dispersion modeling is a forensic study where the data are usually quite scarce. An example would be the modeling of the alleged releases of chemical warfare agents during the 1991 Persian Gulf War, where the source data had to rely on intelligence reports, and where Iraq had stopped reporting weather data to the World Meteorological Organization since the 1981 Iran-Iraq-war. Therefore the meteorological fields inside Iraq must be estimated by models such as prognostic mesoscale meteorological models, based on
Building a More Complete Understanding of Teacher Evaluation Using Classroom Observations

ERIC Educational Resources Information Center

Cohen, Julie; Goldhaber, Dan

2016-01-01

Improving teacher evaluation is one of the most pressing but also contested areas of educational policy. Value-added measures have received much of the attention in new evaluation systems, but they can only be used to evaluate a fraction of teachers. Classroom observations are almost universally used to assess teachers, yet their statistical…
Performance evaluation and clinical applications of 3D plenoptic cameras

NASA Astrophysics Data System (ADS)

Decker, Ryan; Shademan, Azad; Opfermann, Justin; Leonard, Simon; Kim, Peter C. W.; Krieger, Axel

2015-06-01

The observation and 3D quantification of arbitrary scenes using optical imaging systems is challenging, but increasingly necessary in many fields. This paper provides a technical basis for the application of plenoptic cameras in medical and medical robotics applications, and rigorously evaluates camera integration and performance in the clinical setting. It discusses plenoptic camera calibration and setup, assesses plenoptic imaging in a clinically relevant context, and in the context of other quantitative imaging technologies. We report the methods used for camera calibration, precision and accuracy results in an ideal and simulated surgical setting. Afterwards, we report performance during a surgical task. Test results showed the average precision of the plenoptic camera to be 0.90mm, increasing to 1.37mm for tissue across the calibrated FOV. The ideal accuracy was 1.14mm. The camera showed submillimeter error during a simulated surgical task.
An integrated evaluation for the performance of clinical engineering department.

PubMed

Yousry, Ahmed M; Ouda, Bassem K; Eldeib, Ayman M

2014-01-01

Performance benchmarking have become a very important component in all successful organizations nowadays that must be used by Clinical Engineering Department (CED) in hospitals. Many researchers identified essential mainstream performance indicators needed to improve the CED's performance. These studies revealed mainstream performance indicators that use the database of a CED to evaluate its performance. In this work, we believe that those indicators are insufficient for hospitals. Additional important indicators should be included to improve the evaluation accuracy. Therefore, we added new indicators: technical/maintenance indicators, economic indicators, intrinsic criticality indicators, basic hospital indicators, equipment acquisition, and safety indicators. Data is collected from 10 hospitals that cover different types of healthcare organizations. We developed a software tool that analyses collected data to provide a score for each CED under evaluation. Our results indicate that there is an average gap of 67% between the CEDs' performance and the ideal target. The reasons for the noncompliance are discussed in order to improve performance of CEDs under evaluation.
Evaluating Performance of Highway Safety Projects

DOT National Transportation Integrated Search

2016-12-01

The purpose of this project was to investigate and document methods that the Idaho Transportation Department (ITD) and Local Highway Technical Assistance Council (LHTAC) can use to evaluate the performance of safety projects that have been implemente...

NREL Evaluates Performance of Fast-Charge Electric Buses

DOE Office of Scientific and Technical Information (OSTI.GOV)

2016-09-16

This real-world performance evaluation is designed to enhance understanding of the overall usage and effectiveness of electric buses in transit operation and to provide unbiased technical information to other agencies interested in adding such vehicles to their fleets. Initial results indicate that the electric buses under study offer significant fuel and emissions savings. The final results will help Foothill Transit optimize the energy-saving potential of its transit fleet. NREL's performance evaluations help vehicle manufacturers fine-tune their designs and help fleet managers select fuel-efficient, low-emission vehicles that meet their bottom line and operational goals. help Foothill Transit optimize the energy-saving potentialmore » of its transit fleet. NREL's performance evaluations help vehicle manufacturers fine-tune their designs and help fleet managers select fuel-efficient, low-emission vehicles that meet their bottom line and operational goals.« less
A Descriptive-Comparative Study of Teacher Performance Evaluation on Student Achievement in a Public School District

ERIC Educational Resources Information Center

Christensen, William Howard

2013-01-01

In 2010, the federal government increased accountability expectations by placing more emphasis on monitoring teacher performance. Using a model that focuses on the New York State teacher evaluation system, that is comprised of a rubric for observation, local student assessment scores, and student state assessment scores, this…
Nurses' evaluation of physicians' non-clinical performance in emergency departments: advantages, disadvantages and lessons learned.

PubMed

Alameddine, Mohamad; Mufarrij, Afif; Saliba, Miriam; Mourad, Yara; Jabbour, Rima; Hitti, Eveline

2015-02-27

Peer evaluation is increasingly used as a method to assess physicians' interpersonal and communication skills. We report on experience with soliciting registered nurses' feedback on physicians' non-clinical performance in the ED of a large academic medical center in Lebanon. We utilized a secondary analysis of a de-identified database of ED nurses' assessment of physicians' non-clinical performance coupled with an evaluation of interventions carried out as a result of this evaluation. The database was compiled as part of quality/performance improvement initiatives using a cross-sectional design to survey registered nurses working at the ED. The survey instrument included open ended and closed ended questions assessing physicians' communication, professionalism and leadership skills. Three episodes of evaluation were carried out over an 18 month period. Physicians were provided with a communication training carried out after the first cycle of evaluation and a detailed feedback on their assessment by nurses after each evaluation cycle. A paired t-test was carried out to compare mean evaluation scores between the three cycles of evaluation. Thematic analysis of nurses' qualitative comments was carried out. A statistically significant increase in the averages of skills was observed between the first and second evaluations, followed by a significant decrease in the averages of the three skills between the second and third evaluations. Personalized feedback to ED physicians and communication training initially contributed to a significant positive impact on improving ED physicians' non-clinical skills as perceived by the ED nurses. Yet, gains achieved were lost upon reaching the third cycle of evaluation. However, the thematic analysis of the nurses' qualitative responses portrays a decrease in concerns across the various dimensions of non-clinical performance. Nurses' evaluation of the non-clinical performance of physicians has the potential of improving communication
48 CFR 3036.201 - Evaluation of contractor performance.

Code of Federal Regulations, 2010 CFR

2010-10-01

... 48 Federal Acquisition Regulations System 7 2010-10-01 2010-10-01 false Evaluation of contractor performance. 3036.201 Section 3036.201 Federal Acquisition Regulations System DEPARTMENT OF HOMELAND SECURITY... contractor performance. (a)(2) Performance reports shall be prepared and entered into the Contractor...
High-Precision Image Aided Inertial Navigation with Known Features: Observability Analysis and Performance Evaluation

PubMed Central

Jiang, Weiping; Wang, Li; Niu, Xiaoji; Zhang, Quan; Zhang, Hui; Tang, Min; Hu, Xiangyun

2014-01-01

A high-precision image-aided inertial navigation system (INS) is proposed as an alternative to the carrier-phase-based differential Global Navigation Satellite Systems (CDGNSSs) when satellite-based navigation systems are unavailable. In this paper, the image/INS integrated algorithm is modeled by a tightly-coupled iterative extended Kalman filter (IEKF). Tightly-coupled integration ensures that the integrated system is reliable, even if few known feature points (i.e., less than three) are observed in the images. A new global observability analysis of this tightly-coupled integration is presented to guarantee that the system is observable under the necessary conditions. The analysis conclusions were verified by simulations and field tests. The field tests also indicate that high-precision position (centimeter-level) and attitude (half-degree-level)-integrated solutions can be achieved in a global reference. PMID:25330046
AERMOD performance evaluation for three coal-fired electrical generating units in Southwest Indiana.

PubMed

Frost, Kali D

2014-03-01

An evaluation of the steady-state dispersion model AERMOD was conducted to determine its accuracy at predicting hourly ground-level concentrations of sulfur dioxide (SO2) by comparing model-predicted concentrations to a full year of monitored SO2 data. The two study sites are comprised of three coal-fired electrical generating units (EGUs) located in southwest Indiana. The sites are characterized by tall, buoyant stacks,flat terrain, multiple SO2 monitors, and relatively isolated locations. AERMOD v12060 and AERMOD v12345 with BETA options were evaluated at each study site. For the six monitor-receptor pairs evaluated, AERMOD showed generally good agreement with monitor values for the hourly 99th percentile SO2 design value, with design value ratios that ranged from 0.92 to 1.99. AERMOD was within acceptable performance limits for the Robust Highest Concentration (RHC) statistic (RHC ratios ranged from 0.54 to 1.71) at all six monitors. Analysis of the top 5% of hourly concentrations at the six monitor-receptor sites, paired in time and space, indicated poor model performance in the upper concentration range. The amount of hourly model predicted data that was within a factor of 2 of observations at these higher concentrations ranged from 14 to 43% over the six sites. Analysis of subsets of data showed consistent overprediction during low wind speed and unstable meteorological conditions, and underprediction during stable, low wind conditions. Hourly paired comparisons represent a stringent measure of model performance; however given the potential for application of hourly model predictions to the SO2 NAAQS design value, this may be appropriate. At these two sites, AERMOD v12345 BETA options do not improve model performance. A regulatory evaluation of AERMOD utilizing quantile-quantile (Q-Q) plots, the RHC statistic, and 99th percentile design value concentrations indicates that model performance is acceptable according to widely accepted regulatory performance
Evaluation of Long-Term Cloud-Resolving Model Simulations Using Satellite Radiance Observations and Multi-Frequency Satellite Simulators

NASA Technical Reports Server (NTRS)

Matsui, Toshihisa; Zeng, Xiping; Tao, Wei-Kuo; Masunaga, Hirohiko; Olson, William S.; Lang, Stephen

2008-01-01

This paper proposes a methodology known as the Tropical Rainfall Measuring Mission (TRMM) Triple-Sensor Three-step Evaluation Framework (T3EF) for the systematic evaluation of precipitating cloud types and microphysics in a cloud-resolving model (CRM). T3EF utilizes multi-frequency satellite simulators and novel statistics of multi-frequency radiance and backscattering signals observed from the TRMM satellite. Specifically, T3EF compares CRM and satellite observations in the form of combined probability distributions of precipitation radar (PR) reflectivity, polarization-corrected microwave brightness temperature (Tb), and infrared Tb to evaluate the candidate CRM. T3EF is used to evaluate the Goddard Cumulus Ensemble (GCE) model for cases involving the South China Sea Monsoon Experiment (SCSMEX) and Kwajalein Experiment (KWAJEX). This evaluation reveals that the GCE properly captures the satellite-measured frequencies of different precipitating cloud types in the SCSMEX case but underestimates the frequencies of deep convective and deep stratiform types in the KWAJEX case. Moreover, the GCE tends to simulate excessively large and abundant frozen condensates in deep convective clouds as inferred from the overestimated GCE-simulated radar reflectivities and microwave Tb depressions. Unveiling the detailed errors in the GCE s performance provides the best direction for model improvements.
Congenital heart surgery: expected versus observed surgical performance according to the Aristotle complexity score.

PubMed

Photiadis, J; Sinzobahamvya, N; Arenz, C; Sata, S; Haun, C; Schindler, E; Asfour, B; Hraska, V

2011-08-01

The Aristotle score quantifies the complexity involved in congenital heart surgery. It defines surgical performance as complexity score times hospital survival. We studied how expected and observed surgical performance evolved over time. 2312 main procedures carried out between 2006 and 2010 were analyzed. The Aristotle basic score, corresponding hospital survival and related observed surgical performance were estimated. Expected survival was based on the mortality risks published by O'Brien and coauthors. Observed performance divided by expected performance was called the standardized ratio of performance. This should trend towards a figure above 100%. Survival rates and performance are given with 95% confidence intervals. The mean Aristotle basic score was 7.88 ± 2.68. 51 patients died: observed hospital survival was 97.8 % (97.1 %-98.3%). 115 deaths were anticipated: expected survival was 95.2% (93.5%-96.3%). Observed and expected surgical performance reached 7.71 (7.65-7.75) and 7.49 (7.37-7.59), respectively. Therefore the overall standardized ratio of performance was 102.94%. The ratio increased from 2006 (ratio = 101.60%) to 2009 (103.92%) and was 103.42% in 2010. Performance was high for the repair of congenital corrected transposition of the great arteries and ventricular septal defect (VSD) by atrial switch and Rastelli procedure, the Norwood procedure, repair of truncus arteriosus, aortic arch repair and VSD closure, and the Ross-Konno procedure, with corresponding standardized ratios of 123.30%, 116.83%, 112.99%, 110.86% and 110.38%, respectively. With a ratio of 82.87%, performance was low for repair of Ebstein's anomaly. The standardized ratio of surgical performance integrates three factors into a single value: procedure complexity, postoperative observed survival, and comparison with expected survival. It constitutes an excellent instrument for quality monitoring of congenital heart surgery programs over time. It allows an accurate comparison of
Efficient estimation of ideal-observer performance in classification tasks involving high-dimensional complex backgrounds

PubMed Central

Park, Subok; Clarkson, Eric

2010-01-01

The Bayesian ideal observer is optimal among all observers and sets an absolute upper bound for the performance of any observer in classification tasks [Van Trees, Detection, Estimation, and Modulation Theory, Part I (Academic, 1968).]. Therefore, the ideal observer should be used for objective image quality assessment whenever possible. However, computation of ideal-observer performance is difficult in practice because this observer requires the full description of unknown, statistical properties of high-dimensional, complex data arising in real life problems. Previously, Markov-chain Monte Carlo (MCMC) methods were developed by Kupinski et al. [J. Opt. Soc. Am. A 20, 430(2003) ] and by Park et al. [J. Opt. Soc. Am. A 24, B136 (2007) and IEEE Trans. Med. Imaging 28, 657 (2009) ] to estimate the performance of the ideal observer and the channelized ideal observer (CIO), respectively, in classification tasks involving non-Gaussian random backgrounds. However, both algorithms had the disadvantage of long computation times. We propose a fast MCMC for real-time estimation of the likelihood ratio for the CIO. Our simulation results show that our method has the potential to speed up ideal-observer performance in tasks involving complex data when efficient channels are used for the CIO. PMID:19884916
Effectiveness of the Marine Corps’ Junior Enlisted Performance Evaluation System: An Evaluation of Proficiency and Conduct Marks

DTIC Science & Technology

2017-03-01

THE MARINE CORPS’ JUNIOR ENLISTED PERFORMANCE EVALUATION SYSTEM: AN EVALUATION OF PROFICIENCY AND CONDUCT MARKS by Richard B. Larger Jr...CORPS’ JUNIOR ENLISTED PERFORMANCE EVALUATION SYSTEM: AN EVALUATION OF PROFICIENCY AND CONDUCT MARKS 5. FUNDING NUMBERS 6. AUTHOR(S) Richard B...in order to improve interpretability and minimize redundancies. 14. SUBJECT TERMS performance evaluation , proficiency marks, conduct marks
GENERAL METHODS FOR REMEDIAL PERFORMANCE EVALUATIONS

EPA Science Inventory

This document was developed by an EPA-funded project to explain technical considerations and principles necessary to evaluated the performance of ground-water contamination remediations at hazardous waste sites. This is neither a "cookbook", nor an encyclopedia of recommended fi...
EVALUATION OF CONFOCAL MICROSCOPY SYSTEM PERFORMANCE

EPA Science Inventory

BACKGROUND. The confocal laser scanning microscope (CLSM) has enormous potential in many biological fields. Currently there is a subjective nature in the assessment of a confocal microscope's performance by primarily evaluating the system with a specific test slide provided by ea...
From feedback- to response-based performance monitoring in active and observational learning.

PubMed

Bellebaum, Christian; Colosio, Marco

2014-09-01

Humans can adapt their behavior by learning from the consequences of their own actions or by observing others. Gradual active learning of action-outcome contingencies is accompanied by a shift from feedback- to response-based performance monitoring. This shift is reflected by complementary learning-related changes of two ACC-driven ERP components, the feedback-related negativity (FRN) and the error-related negativity (ERN), which have both been suggested to signal events "worse than expected," that is, a negative prediction error. Although recent research has identified comparable components for observed behavior and outcomes (observational ERN and FRN), it is as yet unknown, whether these components are similarly modulated by prediction errors and thus also reflect behavioral adaptation. In this study, two groups of 15 participants learned action-outcome contingencies either actively or by observation. In active learners, FRN amplitude for negative feedback decreased and ERN amplitude in response to erroneous actions increased with learning, whereas observational ERN and FRN in observational learners did not exhibit learning-related changes. Learning performance, assessed in test trials without feedback, was comparable between groups, as was the ERN following actively performed errors during test trials. In summary, the results show that action-outcome associations can be learned similarly well actively and by observation. The mechanisms involved appear to differ, with the FRN in active learning reflecting the integration of information about own actions and the accompanying outcomes.
A new method to evaluate image quality of CBCT images quantitatively without observers

PubMed Central

Shimizu, Mayumi; Okamura, Kazutoshi; Yoshida, Shoko; Weerawanich, Warangkana; Tokumori, Kenji; Jasa, Gainer R; Yoshiura, Kazunori

2017-01-01

Objectives: To develop an observer-free method for quantitatively evaluating the image quality of CBCT images by applying just-noticeable difference (JND). Methods: We used two test objects: (1) a Teflon (polytetrafluoroethylene) plate phantom attached to a dry human mandible; and (2) a block phantom consisting of a Teflon step phantom and an aluminium step phantom. These phantoms had holes with different depths. They were immersed in water and scanned with a CB MercuRay (Hitachi Medical Corporation, Tokyo, Japan) at tube voltages of 120 kV, 100 kV, 80 kV and 60 kV. Superimposed images of the phantoms with holes were used for evaluation. The number of detectable holes was used as an index of image quality. In detecting holes quantitatively, the threshold grey value (ΔG), which differentiated holes from the background, was calculated using a specific threshold (the JND), and we extracted the holes with grey values above ΔG. The indices obtained by this quantitative method (the extracted hole values) were compared with the observer evaluations (the observed hole values). In addition, the contrast-to-noise ratio (CNR) of the shallowest detectable holes and the deepest undetectable holes were measured to evaluate the contribution of CNR to detectability. Results: The results of this evaluation method corresponded almost exactly with the evaluations made by observers. The extracted hole values reflected the influence of different tube voltages. All extracted holes had an area with a CNR of ≥1.5. Conclusions: This quantitative method of evaluating CBCT image quality may be more useful and less time-consuming than evaluation by observation. PMID:28045343
At-Risk Youth Appearance and Job Performance Evaluation

ERIC Educational Resources Information Center

Freeburg, Beth Winfrey; Workman, Jane E.

2008-01-01

The goal of this study was to identify the relationship of at-risk youth workplace appearance to other job performance criteria. Employers (n = 30; each employing from 1 to 17 youths) evaluated 178 at-risk high school youths who completed a paid summer employment experience. Appearance evaluations were significantly correlated with evaluations of…
Prospective safety performance evaluation on construction sites.

PubMed

Wu, Xianguo; Liu, Qian; Zhang, Limao; Skibniewski, Miroslaw J; Wang, Yanhong

2015-05-01

This paper presents a systematic Structural Equation Modeling (SEM) based approach for Prospective Safety Performance Evaluation (PSPE) on construction sites, with causal relationships and interactions between enablers and the goals of PSPE taken into account. According to a sample of 450 valid questionnaire surveys from 30 Chinese construction enterprises, a SEM model with 26 items included for PSPE in the context of Chinese construction industry is established and then verified through the goodness-of-fit test. Three typical types of construction enterprises, namely the state-owned enterprise, private enterprise and Sino-foreign joint venture, are selected as samples to measure the level of safety performance given the enterprise scale, ownership and business strategy are different. Results provide a full understanding of safety performance practice in the construction industry, and indicate that the level of overall safety performance situation on working sites is rated at least a level of III (Fair) or above. This phenomenon can be explained that the construction industry has gradually matured with the norms, and construction enterprises should improve the level of safety performance as not to be eliminated from the government-led construction industry. The differences existing in the safety performance practice regarding different construction enterprise categories are compared and analyzed according to evaluation results. This research provides insights into cause-effect relationships among safety performance factors and goals, which, in turn, can facilitate the improvement of high safety performance in the construction industry. Copyright © 2015 Elsevier Ltd. All rights reserved.
Evaluation of Flagging Criteria of United States Kidney Transplant Center Performance: How to Best Define Outliers?

PubMed

Schold, Jesse D; Miller, Charles M; Henry, Mitchell L; Buccini, Laura D; Flechner, Stuart M; Goldfarb, David A; Poggio, Emilio D; Andreoni, Kenneth A

2017-06-01

Scientific Registry of Transplant Recipients report cards of US organ transplant center performance are publicly available and used for quality oversight. Low center performance (LP) evaluations are associated with changes in practice including reduced transplant rates and increased waitlist removals. In 2014, Scientific Registry of Transplant Recipients implemented new Bayesian methodology to evaluate performance which was not adopted by Center for Medicare and Medicaid Services (CMS). In May 2016, CMS altered their performance criteria, reducing the likelihood of LP evaluations. Our aims were to evaluate incidence, survival rates, and volume of LP centers with Bayesian, historical (old-CMS) and new-CMS criteria using 6 consecutive program-specific reports (PSR), January 2013 to July 2015 among adult kidney transplant centers. Bayesian, old-CMS and new-CMS criteria identified 13.4%, 8.3%, and 6.1% LP PSRs, respectively. Over the 3-year period, 31.9% (Bayesian), 23.4% (old-CMS), and 19.8% (new-CMS) of centers had 1 or more LP evaluation. For small centers (<83 transplants/PSR), there were 4-fold additional LP evaluations (52 vs 13 PSRs) for 1-year mortality with Bayesian versus new-CMS criteria. For large centers (>183 transplants/PSR), there were 3-fold additional LP evaluations for 1-year mortality with Bayesian versus new-CMS criteria with median differences in observed and expected patient survival of -1.6% and -2.2%, respectively. A significant proportion of kidney transplant centers are identified as low performing with relatively small survival differences compared with expected. Bayesian criteria have significantly higher flagging rates and new-CMS criteria modestly reduce flagging. Critical appraisal of performance criteria is needed to assess whether quality oversight is meeting intended goals and whether further modifications could reduce risk aversion, more efficiently allocate resources, and increase transplant opportunities.
Estimating functional cognition in older adults using observational assessments of task performance in complex everyday activities: A systematic review and evaluation of measurement properties.

PubMed

Wesson, Jacqueline; Clemson, Lindy; Brodaty, Henry; Reppermund, Simone

2016-09-01

Functional cognition is a relatively new concept in assessment of older adults with mild cognitive impairment or dementia. Instruments need to be reliable and valid, hence we conducted a systematic review of observational assessments of task performance used to estimate functional cognition in this population. Two separate database searches were conducted: firstly to identify instruments; and secondly to identify studies reporting on the psychometric properties of the instruments. Studies were analysed using a published checklist and their quality reviewed according to specific published criteria. Clinical utility was reviewed and the information formulated into a best evidence synthesis. We found 21 instruments and included 58 studies reporting on measurement properties. The majority of studies were rated as being of fair methodological quality and the range of properties investigated was restricted. Most instruments had studies reporting on construct validity (hypothesis testing), none on content validity and there were few studies reporting on reliability. Overall the evidence on psychometric properties is lacking and there is an urgent need for further evaluation of instruments. Copyright © 2016 Elsevier Ltd. All rights reserved.
Novel surgical performance evaluation approximates Standardized Incidence Ratio with high accuracy at simple means.

PubMed

Gabbay, Itay E; Gabbay, Uri

2013-01-01

Excess adverse events may be attributable to poor surgical performance but also to case-mix, which is controlled through the Standardized Incidence Ratio (SIR). SIR calculations can be complicated, resource consuming, and unfeasible in some settings. This article suggests a novel method for SIR approximation. In order to evaluate a potential SIR surrogate measure we predefined acceptance criteria. We developed a new measure - Approximate Risk Index (ARI). "Number Needed for Event" (NNE) is the theoretical number of patients needed "to produce" one adverse event. ARI is defined as the quotient of the group of patients needed for no observed events Ge by total patients treated Ga. Our evaluation compared 2500 surgical units and over 3 million heterogeneous risk surgical patients that were induced through a computerized simulation. Surgical unit's data were computed for SIR and ARI to evaluate compliance with the predefined criteria. Approximation was evaluated by correlation analysis and performance prediction capability by Receiver Operating Characteristics (ROC) analysis. ARI strongly correlates with SIR (r(2) = 0.87, p < 0.05). ARI prediction of excessive risk revealed excellent ROC (Area Under the Curve > 0.9) 87% sensitivity and 91% specificity. ARI provides good approximation of SIR and excellent prediction capability. ARI is simple and cost-effective as it requires thorough risk evaluation of only the adverse events patients. ARI can provide a crucial screening and performance evaluation quality control tool. The ARI method may suit other clinical and epidemiological settings where relatively small fraction of the entire population is affected. Copyright © 2013 Surgical Associates Ltd. Published by Elsevier Ltd. All rights reserved.
Metrics for Performance Evaluation of Patient Exercises during Physical Therapy.

PubMed

Vakanski, Aleksandar; Ferguson, Jake M; Lee, Stephen

2017-06-01

The article proposes a set of metrics for evaluation of patient performance in physical therapy exercises. Taxonomy is employed that classifies the metrics into quantitative and qualitative categories, based on the level of abstraction of the captured motion sequences. Further, the quantitative metrics are classified into model-less and model-based metrics, in reference to whether the evaluation employs the raw measurements of patient performed motions, or whether the evaluation is based on a mathematical model of the motions. The reviewed metrics include root-mean square distance, Kullback Leibler divergence, log-likelihood, heuristic consistency, Fugl-Meyer Assessment, and similar. The metrics are evaluated for a set of five human motions captured with a Kinect sensor. The metrics can potentially be integrated into a system that employs machine learning for modelling and assessment of the consistency of patient performance in home-based therapy setting. Automated performance evaluation can overcome the inherent subjectivity in human performed therapy assessment, and it can increase the adherence to prescribed therapy plans, and reduce healthcare costs.

Tactile orientation perception: an ideal observer analysis of human psychophysical performance in relation to macaque area 3b receptive fields

PubMed Central

Peters, Ryan M.; Staibano, Phillip

2015-01-01

The ability to resolve the orientation of edges is crucial to daily tactile and sensorimotor function, yet the means by which edge perception occurs is not well understood. Primate cortical area 3b neurons have diverse receptive field (RF) spatial structures that may participate in edge orientation perception. We evaluated five candidate RF models for macaque area 3b neurons, previously recorded while an oriented bar contacted the monkey's fingertip. We used a Bayesian classifier to assign each neuron a best-fit RF structure. We generated predictions for human performance by implementing an ideal observer that optimally decoded stimulus-evoked spike counts in the model neurons. The ideal observer predicted a saturating reduction in bar orientation discrimination threshold with increasing bar length. We tested 24 humans on an automated, precision-controlled bar orientation discrimination task and observed performance consistent with that predicted. We next queried the ideal observer to discover the RF structure and number of cortical neurons that best matched each participant's performance. Human perception was matched with a median of 24 model neurons firing throughout a 1-s period. The 10 lowest-performing participants were fit with RFs lacking inhibitory sidebands, whereas 12 of the 14 higher-performing participants were fit with RFs containing inhibitory sidebands. Participants whose discrimination improved as bar length increased to 10 mm were fit with longer RFs; those who performed well on the 2-mm bar, with narrower RFs. These results suggest plausible RF features and computational strategies underlying tactile spatial perception and may have implications for perceptual learning. PMID:26354318
A Regional Climate Model Evaluation System based on contemporary Satellite and other Observations for Assessing Regional Climate Model Fidelity

NASA Astrophysics Data System (ADS)

Waliser, D. E.; Kim, J.; Mattman, C.; Goodale, C.; Hart, A.; Zimdars, P.; Lean, P.

2011-12-01

Evaluation of climate models against observations is an essential part of assessing the impact of climate variations and change on regionally important sectors and improving climate models. Regional climate models (RCMs) are of a particular concern. RCMs provide fine-scale climate needed by the assessment community via downscaling global climate model projections such as those contributing to the Coupled Model Intercomparison Project (CMIP) that form one aspect of the quantitative basis of the IPCC Assessment Reports. The lack of reliable fine-resolution observational data and formal tools and metrics has represented a challenge in evaluating RCMs. Recent satellite observations are particularly useful as they provide a wealth of information and constraints on many different processes within the climate system. Due to their large volume and the difficulties associated with accessing and using contemporary observations, however, these datasets have been generally underutilized in model evaluation studies. Recognizing this problem, NASA JPL and UCLA have developed the Regional Climate Model Evaluation System (RCMES) to help make satellite observations, in conjunction with in-situ and reanalysis datasets, more readily accessible to the regional modeling community. The system includes a central database (Regional Climate Model Evaluation Database: RCMED) to store multiple datasets in a common format and codes for calculating and plotting statistical metrics to assess model performance (Regional Climate Model Evaluation Tool: RCMET). This allows the time taken to compare model data with satellite observations to be reduced from weeks to days. RCMES is a component of the recent ExArch project, an international effort for facilitating the archive and access of massive amounts data for users using cloud-based infrastructure, in this case as applied to the study of climate and climate change. This presentation will describe RCMES and demonstrate its utility using examples
[Municipalities Stratification for Health Performance Evaluation].

PubMed

Calvo, Maria Cristina Marino; Lacerda, Josimari Telino de; Colussi, Claudia Flemming; Schneider, Ione Jayce Ceola; Rocha, Thiago Augusto Hernandes

2016-01-01

to propose and present a stratification of Brazilian municipalities into homogeneous groups for evaluation studies of health management performance. this was a methodological study, with selected indicators which classify municipalities according to conditions that influence the health management and population size; data for the year 2010 were collected from demographic and health databases; correlation tests and factor analysis were used. seven strata were identified - Large-sized; Medium-sized with favorable, regular or unfavorable influences; and Small-sized with favorable, regular or unfavorable influences -; there was a concentration of municipalities with favorable influences in strata with better purchasing power and funding, as well as a concentration of municipalities with unfavorable influences in the North and Northeast regions. the proposed classification grouped similar municipalities regarding influential factors in health management, which allowed the identification of comparable groups of municipalities, setting up a consistent alternative to performance evaluation studies.
The ambiguities of performance-based governance reforms in Italy: Reviving the fortunes of evaluation and performance measurement.

PubMed

Marra, Mita

2018-08-01

Over the past two decades, Italy's administrative reforms have institutionalized evaluation to improve program effectiveness, staff productivity, and results-driven accountability against waste and corruption. Across ministries, regional governments, universities, schools and environmental protection agencies, seemingly unexpected consequences have emerged out of the implementation of performance measurement and evaluation regimes within public organizations. Formal compliance to legally binding evaluation procedures, judicially-sanctioned managerial accountability and lack of cross-agency coordination coupled with long-standing cultural separations among evaluators are some of the ambiguities associated with a performance-based governance system within Italian public administration. Building upon the 'new governane theory,' and qualitative fieldwork, I explore the political consequences of evaluation and performance measurement for possible improvements. From a normative perspective, greater integration between program evaluation and performance measurement can support organizational learning and democratic accountability both at the central and local level. Copyright © 2017 Elsevier Ltd. All rights reserved.
Performance evaluation of DAAF as a booster material using the onionskin test

DOE Office of Scientific and Technical Information (OSTI.GOV)

Morris, John S; Francois, Elizabeth G; Hooks, Daniel E

Initiation of insensitive high explosive (IHE) formulations requires the use of a booster explosive in the initiation train. Booster material selection is crucial, as the initiation must reliably function across some spectrum of physical parameters. The interest in Diaminoazoxyfurazan (DAAF) for this application stems from the fact that it possesses many traits of an IHE but is shock sensitive enough to serve as an explosive booster. A hemispherical wave breakout test, termed the onionskin test, is one of the methods used to evaluate the performance of a booster material. The wave breakout time-position history at the surface of a hemisphericalmore » IHE charge is recorded and the relative uniformity of the breakout can be quantitatively compared between booster materials. A series of onionskin tests were performed to investigate breakout and propagation diaminoazoxyfurazan (DAAF) at low temperatures to evaluate ignition and detonation spreading in comparison to other explosives commonly used in booster applications. Some wave perturbation was observed with the DAAF booster in the onionskin tests presented. The results of these tests will be presented and discussed.« less
Strapdown system performance optimization test evaluations (SPOT), volume 1

NASA Technical Reports Server (NTRS)

Blaha, R. J.; Gilmore, J. P.

1973-01-01

A three axis inertial system was packaged in an Apollo gimbal fixture for fine grain evaluation of strapdown system performance in dynamic environments. These evaluations have provided information to assess the effectiveness of real-time compensation techniques and to study system performance tradeoffs to factors such as quantization and iteration rate. The strapdown performance and tradeoff studies conducted include: (1) Compensation models and techniques for the inertial instrument first-order error terms were developed and compensation effectivity was demonstrated in four basic environments; single and multi-axis slew, and single and multi-axis oscillatory. (2) The theoretical coning bandwidth for the first-order quaternion algorithm expansion was verified. (3) Gyro loop quantization was identified to affect proportionally the system attitude uncertainty. (4) Land navigation evaluations identified the requirement for accurate initialization alignment in order to pursue fine grain navigation evaluations.
Functional Performance Evaluation

NASA Technical Reports Server (NTRS)

Greenisen, Michael C.; Hayes, Judith C.; Siconolfi, Steven F.; Moore, Alan D.

1999-01-01

The Extended Duration Orbiter Medical Project (EDOMP) was established to address specific issues associated with optimizing the ability of crews to complete mission tasks deemed essential to entry, landing, and egress for spaceflights lasting up to 16 days. The main objectives of this functional performance evaluation were to investigate the physiological effects of long-duration spaceflight on skeletal muscle strength and endurance, as well as aerobic capacity and orthostatic function. Long-duration exposure to a microgravity environment may produce physiological alterations that affect crew ability to complete critical tasks such as extravehicular activity (EVA), intravehicular activity (IVA), and nominal or emergency egress. Ultimately, this information will be used to develop and verify countermeasures. The answers to three specific functional performance questions were sought: (1) What are the performance decrements resulting from missions of varying durations? (2) What are the physical requirements for successful entry, landing, and emergency egress from the Shuttle? and (3) What combination of preflight fitness training and in-flight countermeasures will minimize in-flight muscle performance decrements? To answer these questions, the Exercise Countermeasures Project looked at physiological changes associated with muscle degradation as well as orthostatic intolerance. A means of ensuring motor coordination was necessary to maintain proficiency in piloting skills, EVA, and IVA tasks. In addition, it was necessary to maintain musculoskeletal strength and function to meet the rigors associated with moderate altitude bailout and with nominal or emergency egress from the landed Orbiter. Eight investigations, referred to as Detailed Supplementary Objectives (DSOs) 475, 476, 477, 606, 608, 617, 618, and 624, were conducted to study muscle degradation and the effects of exercise on exercise capacity and orthostatic function (Table 3-1). This chapter is divided into
Evaluating building performance in healthcare facilities: an organizational perspective.

PubMed

Steinke, Claudia; Webster, Lynn; Fontaine, Marie

2010-01-01

Using the environment as a strategic tool is one of the most cost-effective and enduring approaches for improving public health; however, it is one that requires multiple perspectives. The purpose of this article is to highlight an innovative methodology that has been developed for conducting comprehensive performance evaluations in public sector health facilities in Canada. The building performance evaluation methodology described in this paper is a government initiative. The project team developed a comprehensive building evaluation process for all new capital health projects that would respond to the aforementioned need for stakeholders to be more accountable and to better integrate the larger organizational strategy of facilities. The Balanced Scorecard, which is a multiparadigmatic, performance-based business framework, serves as the underlying theoretical framework for this initiative. It was applied in the development of the conceptual model entitled the Building Performance Evaluation Scorecard, which provides the following benefits: (1) It illustrates a process to link facilities more effectively to the overall mission and goals of an organization; (2) It is both a measurement and a management system that has the ability to link regional facilities to measures of success and larger business goals; (3) It provides a standardized methodology that ensures consistency in assessing building performance; and (4) It is more comprehensive than traditional building evaluations. The methodology presented in this paper is both a measurement and management system that integrates the principles of evidence-based design with the practices of pre- and post-occupancy evaluation. It promotes accountability and continues throughout the life cycle of a project. The advantage of applying this framework is that it engages health organizations in clarifying a vision and strategy for their facilities and helps translate those strategies into action and measurable performance
Sliding Mode Control of Real-Time PNU Vehicle Driving Simulator and Its Performance Evaluation

NASA Astrophysics Data System (ADS)

Lee, Min Cheol; Park, Min Kyu; Yoo, Wan Suk; Son, Kwon; Han, Myung Chul

This paper introduces an economical and effective full-scale driving simulator for study of human sensibility and development of new vehicle parts and its control. Real-time robust control to accurately reappear a various vehicle motion may be a difficult task because the motion platform is the nonlinear complex system. This study proposes the sliding mode controller with a perturbation compensator using observer-based fuzzy adaptive network (FAN). This control algorithm is designed to solve the chattering problem of a sliding mode control and to select the adequate fuzzy parameters of the perturbation compensator. For evaluating the trajectory control performance of the proposed approach, a tracking control of the developed simulator named PNUVDS is experimentally carried out. And then, the driving performance of the simulator is evaluated by using human perception and sensibility of some drivers in various driving conditions.
Performance Evaluation of the United Nations Environment ...

EPA Pesticide Factsheets

A request for technical collaboration between the UNEP and the US EPA resulted in the establishment of a MCRADA. The purpose of this agreement was to evaluate an air quality monitoring system (referred to as the UNEP pod) developed by the UNEP for use in environmental situations where more sophisticated monitoring instrumentation was not available. The US EPA has conducted numerous evaluations of other similar sensor pods at its Research Triangle Park, NC research campus and has trained staff as well as established research designs for such efforts. Under the terms of the MCRADA, the US EPA would operate the pod using UNEP provided operating procedures in a manner consistent with its planned intent of deployment. The US EPA would collect air quality monitoring data from the pod involving select environmental measures over a period of approximately one month. Reference monitoring data collected from collocated federal regulatory monitors would be used to establish a comparison between the two systems and thus establishment of performance characteristics. In addition, the US EPA would provide feedback information to the UNEP as to observed ease of use features of the pod that would be beneficial in its future evolution and deployment. The UNEP recently developed a multipollutant sensor pod called the UNEP Air Quality Monitoring Unit, herein simply defined as the UNEP pod (http://aqicn.org/faq/2015-10-28/unep-air-quality-monitoring-station/). First introduced in 20
Performance of the high-resolution atmospheric model HRRR-AK for correcting geodetic observations from spaceborne radars

PubMed Central

Gong, W; Meyer, F J; Webley, P; Morton, D

2013-01-01

[1] Atmospheric phase delays are considered to be one of the main performance limitations for high-quality satellite radar techniques, especially when applied to ground deformation monitoring. Numerical weather prediction (NWP) models are widely seen as a promising tool for the mitigation of atmospheric delays as they can provide knowledge of the atmospheric conditions at the time of Synthetic Aperture Radar data acquisition. However, a thorough statistical analysis of the performance of using NWP production in radar signal correction is missing to date. This study provides a quantitative analysis of the accuracy in using operational NWP products for signal delay correction in satellite radar geodetic remote sensing. The study focuses on the temperate, subarctic, and Arctic climate regions due to a prevalence of relevant geophysical signals in these areas. In this study, the operational High Resolution Rapid Refresh over the Alaska region (HRRR-AK) model is used and evaluated. Five test sites were selected over Alaska (AK), USA, covering a wide range of climatic regimes that are commonly encountered in high-latitude regions. The performance of the HRRR-AK NWP model for correcting absolute atmospheric range delays of radar signals is assessed by comparing to radiosonde observations. The average estimation accuracy for the one-way zenith total atmospheric delay from 24 h simulations was calculated to be better than ∼14 mm. This suggests that the HRRR-AK operational products are a good data source for spaceborne geodetic radar observations atmospheric delay correction, if the geophysical signal to be observed is larger than 20 mm. PMID:25973360
A method to evaluate process performance by integrating time and resources

NASA Astrophysics Data System (ADS)

Wang, Yu; Wei, Qingjie; Jin, Shuang

2017-06-01

The purpose of process mining is to improve the existing process of the enterprise, so how to measure the performance of the process is particularly important. However, the current research on the performance evaluation method is still insufficient. The main methods of evaluation are mainly using time or resource. These basic statistics cannot evaluate process performance very well. In this paper, a method of evaluating the performance of the process based on time dimension and resource dimension is proposed. This method can be used to measure the utilization and redundancy of resources in the process. This paper will introduce the design principle and formula of the evaluation algorithm. Then, the design and the implementation of the evaluation method will be introduced. Finally, we will use the evaluating method to analyse the event log from a telephone maintenance process and propose an optimization plan.
Tropical convection regimes in climate models: evaluation with satellite observations

NASA Astrophysics Data System (ADS)

Steiner, Andrea K.; Lackner, Bettina C.; Ringer, Mark A.

2018-04-01

High-quality observations are powerful tools for the evaluation of climate models towards improvement and reduction of uncertainty. Particularly at low latitudes, the most uncertain aspect lies in the representation of moist convection and interaction with dynamics, where rising motion is tied to deep convection and sinking motion to dry regimes. Since humidity is closely coupled with temperature feedbacks in the tropical troposphere, a proper representation of this region is essential. Here we demonstrate the evaluation of atmospheric climate models with satellite-based observations from Global Positioning System (GPS) radio occultation (RO), which feature high vertical resolution and accuracy in the troposphere to lower stratosphere. We focus on the representation of the vertical atmospheric structure in tropical convection regimes, defined by high updraft velocity over warm surfaces, and investigate atmospheric temperature and humidity profiles. Results reveal that some models do not fully capture convection regions, particularly over land, and only partly represent strong vertical wind classes. Models show large biases in tropical mean temperature of more than 4 K in the tropopause region and the lower stratosphere. Reasonable agreement with observations is given in mean specific humidity in the lower to mid-troposphere. In moist convection regions, models tend to underestimate moisture by 10 to 40 % over oceans, whereas in dry downdraft regions they overestimate moisture by 100 %. Our findings provide evidence that RO observations are a unique source of information, with a range of further atmospheric variables to be exploited, for the evaluation and advancement of next-generation climate models.
Performance evaluation of seal coat materials and designs.

DOT National Transportation Integrated Search

2011-01-01

"This project presents an evaluation of seal coat materials and design method. The primary objectives of this research are 1) to evaluate seal coat performance : from various combinations of aggregates and emulsions in terms of aggregate loss; 2) to ...
Performance evaluation of infrared imaging system in field test

NASA Astrophysics Data System (ADS)

Wang, Chensheng; Guo, Xiaodong; Ren, Tingting; Zhang, Zhi-jie

2014-11-01

Infrared imaging system has been applied widely in both military and civilian fields. Since the infrared imager has various types and different parameters, for system manufacturers and customers, there is great demand for evaluating the performance of IR imaging systems with a standard tool or platform. Since the first generation IR imager was developed, the standard method to assess the performance has been the MRTD or related improved methods which are not perfect adaptable for current linear scanning imager or 2D staring imager based on FPA detector. For this problem, this paper describes an evaluation method based on the triangular orientation discrimination metric which is considered as the effective and emerging method to evaluate the synthesis performance of EO system. To realize the evaluation in field test, an experiment instrument is developed. And considering the importance of operational environment, the field test is carried in practical atmospheric environment. The test imagers include panoramic imaging system and staring imaging systems with different optics and detectors parameters (both cooled and uncooled). After showing the instrument and experiment setup, the experiment results are shown. The target range performance is analyzed and discussed. In data analysis part, the article gives the range prediction values obtained from TOD method, MRTD method and practical experiment, and shows the analysis and results discussion. The experimental results prove the effectiveness of this evaluation tool, and it can be taken as a platform to give the uniform performance prediction reference.
Echocardiographic evaluation of myocardial changes observed after closure of patent ductus arteriosus in dogs.

PubMed

Hamabe, L; Kim, S; Yoshiyuki, R; Fukayama, T; Nakata, T M; Fukushima, R; Tanaka, R

2015-01-01

Closure of PDA can be associated with echocardiographic changes including deterioration of LV systolic function. Although PDA is commonly encountered in dogs, few comprehensive reports of echocardiographic changes in dogs with PDA closure are available. To evaluate the short-term echocardiographic changes observed after PDA closure in dogs using strain analysis. Seventeen client-owned dogs with left-to-right PDA. Echocardiographic evaluations, including standard echocardiography and two-dimensional tissue tracking (2DTT), were performed before and within 3 days of PDA closure. Preclosure examination showed LV and left atrial dilatation indicating volume overload as a result of PDA. Closure of PDA resulted in significant reduction of LVIDd (<.0001) and LA/Ao (0.01) without change in LVIDs, suggestive of decreased preload. Postclosure LV systolic dysfunction was observed with significant decreased in FS (<.0001) and strain values (P = .0039 for radial strains, P = .0005 for circumferential strains). Additionally, significant LV dyssynchrony (P = .0162) was observed after closure of PDA. Closure of PDA resulted in decreased preload as a result of alleviation of LV volume overload, which in turn caused transient deterioration of LV systolic function. Additionally, this study demonstrated that strain analysis is load dependent. Therefore, care should be taken when interpreting strain measurements as an indicator of LV systolic function. Copyright © 2015 The Authors. Journal of Veterinary Internal Medicine published by Wiley Periodicals, Inc. on behalf of American College of Veterinary Internal Medicine.
Performance Evaluation in Network-Based Parallel Computing

NASA Technical Reports Server (NTRS)

Dezhgosha, Kamyar

1996-01-01

Network-based parallel computing is emerging as a cost-effective alternative for solving many problems which require use of supercomputers or massively parallel computers. The primary objective of this project has been to conduct experimental research on performance evaluation for clustered parallel computing. First, a testbed was established by augmenting our existing SUNSPARCs' network with PVM (Parallel Virtual Machine) which is a software system for linking clusters of machines. Second, a set of three basic applications were selected. The applications consist of a parallel search, a parallel sort, a parallel matrix multiplication. These application programs were implemented in C programming language under PVM. Third, we conducted performance evaluation under various configurations and problem sizes. Alternative parallel computing models and workload allocations for application programs were explored. The performance metric was limited to elapsed time or response time which in the context of parallel computing can be expressed in terms of speedup. The results reveal that the overhead of communication latency between processes in many cases is the restricting factor to performance. That is, coarse-grain parallelism which requires less frequent communication between processes will result in higher performance in network-based computing. Finally, we are in the final stages of installing an Asynchronous Transfer Mode (ATM) switch and four ATM interfaces (each 155 Mbps) which will allow us to extend our study to newer applications, performance metrics, and configurations.
Evaluation of Process Performance for Sustainable Hard Machining

NASA Astrophysics Data System (ADS)

Rotella, Giovanna; Umbrello, Domenico; , Oscar W. Dillon, Jr.; Jawahir, I. S.

This paper aims to evaluate the sustainability performance of machining operation of through-hardening steel, AISI 52100, taking into account the impact of the material removal process in its various aspects. Experiments were performed for dry and cryogenic cutting conditions using chamfered cubic boron nitride (CBN) tool inserts at varying cutting conditions (cutting speed and feed rate). Cutting forces, mechanical power, tool wear, white layer thickness, surface roughness and residual stresses were investigated in order to evaluate the effects of extreme in-process cooling on the machined surface. The results indicate that cryogenic cooling has the potential to be used for surface integrity enhancement for improved product life and more sustainable functional performance.
Proposed evaluation framework for assessing operator performance with multisensor displays

NASA Technical Reports Server (NTRS)

Foyle, David C.

1992-01-01

Despite aggressive work on the development of sensor fusion algorithms and techniques, no formal evaluation procedures have been proposed. Based on existing integration models in the literature, an evaluation framework is developed to assess an operator's ability to use multisensor, or sensor fusion, displays. The proposed evaluation framework for evaluating the operator's ability to use such systems is a normative approach: The operator's performance with the sensor fusion display can be compared to the models' predictions based on the operator's performance when viewing the original sensor displays prior to fusion. This allows for the determination as to when a sensor fusion system leads to: 1) poorer performance than one of the original sensor displays (clearly an undesirable system in which the fused sensor system causes some distortion or interference); 2) better performance than with either single sensor system alone, but at a sub-optimal (compared to the model predictions) level; 3) optimal performance (compared to model predictions); or, 4) super-optimal performance, which may occur if the operator were able to use some highly diagnostic 'emergent features' in the sensor fusion display, which were unavailable in the original sensor displays. An experiment demonstrating the usefulness of the proposed evaluation framework is discussed.
Performance Evaluation of Steam Traps and Orifice Plates.

DTIC Science & Technology

1980-10-01

ADlAO9dl 229 JOHNS - MANVILLE SALES CORP DENVER CO RESEARCH AND DEV-’ETC F/S 13/1 PERFOR1ANCE EVALUATION OF STEAM TRAPS AND ORIFICE PLATES.(U)/ OCT 80...AGENCY t REPORT FESA-TS-2085 41! PERFORMANCE EVALUATION OF STEAM TRAPS AND ORIFICE PLATES P. B. SHEPHERD JOHNS - MANVILLE SALES CORPORATION w RESEARCH...PERFORMING ORGANIZATION NAME ANED ADDPESS!_ i lFioC’iA.TCr ’.ETPlJ A~ Johns - Manville Sales Corporation &00* 0 - Research & Development Center qOll Ken

Non-technical skills evaluation in the critical care air ambulance environment: introduction of an adapted rating instrument--an observational study.

PubMed

Myers, Julia A; Powell, David M C; Psirides, Alex; Hathaway, Karyn; Aldington, Sarah; Haney, Michael F

2016-03-08

In the isolated and dynamic health-care setting of critical care air ambulance transport, the quality of clinical care is strongly influenced by non-technical skills such as anticipating, recognising and understanding, decision making, and teamwork. However there are no published reports identifying or applying a non-technical skills framework specific to an intensive care air ambulance setting. The objective of this study was to adapt and evaluate a non-technical skills rating framework for the air ambulance clinical environment. In the first phase of the project the anaesthetists' non-technical skills (ANTS) framework was adapted to the air ambulance setting, using data collected directly from clinician groups, published literature, and field observation. In the second phase experienced and inexperienced inter-hospital transport clinicians completed a simulated critical care air transport scenario, and their non-technical skills performance was independently rated by two blinded assessors. Observed and self-rated general clinical performance ratings were also collected. Rank-based statistical tests were used to examine differences in the performance of experienced and inexperienced clinicians, and relationships between different assessment approaches and assessors. The framework developed during phase one was referred to as an aeromedical non-technical skills framework, or AeroNOTS. During phase two 16 physicians from speciality training programmes in intensive care, emergency medicine and anaesthesia took part in the clinical simulation study. Clinicians with inter-hospital transport experience performed more highly than those without experience, according to both AeroNOTS non-technical skills ratings (p = 0.001) and general performance ratings (p = 0.003). Self-ratings did not distinguish experienced from inexperienced transport clinicians (p = 0.32) and were not strongly associated with either observed general performance (r(s) = 0.4, p = 0
SWIR, VIS and LWIR observer performance against handheld objects: a comparison

NASA Astrophysics Data System (ADS)

Adomeit, Uwe

2016-10-01

The short wave infrared spectral range caused interest to be used in day and night time military and security applications in the last years. This necessitates performance assessment of SWIR imaging equipment in comparison to the one operating in the visual (VIS) and thermal infrared (LWIR) spectral range. In the military context (nominal) range is the main performance criteria. Discriminating friend from foe is one of the main tasks in today's asymmetric scenarios and so personnel, human activities and handheld objects are used as targets to estimate ranges. The later was also used for an experiment at Fraunhofer IOSB to get a first impression how the SWIR performs compared to VIS and LWIR. A human consecutively carrying one of nine different civil or military objects was recorded from five different ranges in the three spectral ranges. For the visual spectral range a 3-chip color-camera was used, the SWIR range was covered by an InGaAs-camera and the LWIR by an uncooled bolometer. It was ascertained that the nominal spatial resolution of the three cameras was in the same magnitude in order to enable an unbiased assessment. Daytime conditions were selected for data acquisition to separate the observer performance from illumination conditions and to some extend also camera performance. From the recorded data, a perception experiment was prepared. It was conducted as a nine-alternative forced choice, unlimited observation time test with 15 observers participating. Before the experiment, the observers were trained on close range target data. Outcome of the experiment was the average probability of identification versus range between camera and target. The comparison of the range performance achieved in the three spectral bands gave a mixed result. On one hand a ranking VIS / SWIR / LWIR in decreasing order can be seen in the data, but on the other hand only the difference between VIS and the other bands is statistically significant. Additionally it was not possible
A framework for evaluating statistical downscaling performance under changing climatic conditions (Invited)

NASA Astrophysics Data System (ADS)

Dixon, K. W.; Balaji, V.; Lanzante, J.; Radhakrishnan, A.; Hayhoe, K.; Stoner, A. K.; Gaitan, C. F.

2013-12-01

Statistical downscaling (SD) methods may be viewed as generating a value-added product - a refinement of global climate model (GCM) output designed to add finer scale detail and to address GCM shortcomings via a process that gleans information from a combination of observations and GCM-simulated climate change responses. Making use of observational data sets and GCM simulations representing the same historical period, cross-validation techniques allow one to assess how well an SD method meets this goal. However, lacking observations of future, the extent to which a particular SD method's skill might degrade when applied to future climate projections cannot be assessed in the same manner. Here we illustrate and describe extensions to a 'perfect model' experimental design that seeks to quantify aspects of SD method performance both for a historical period (1979-2008) and for late 21st century climate projections. Examples highlighting cases in which downscaling performance deteriorates in future climate projections will be discussed. Also, results will be presented showing how synthetic datasets having known statistical properties may be used to further isolate factors responsible for degradations in SD method skill under changing climatic conditions. We will describe a set of input files used to conduct these analyses that are being made available to researchers who wish to utilize this experimental framework to evaluate SD methods they have developed. The gridded data sets cover a region centered on the contiguous 48 United States with a grid spacing of approximately 25km, have daily time resolution (e.g., maximum and minimum near-surface temperature and precipitation), and represent a total of 120 years of model simulations. This effort is consistent with the 2013 National Climate Predictions and Projections Platform Quantitative Evaluation of Downscaling Workshop goal of supporting a community approach to promote the informed use of downscaled climate projections.
Leveraging Teacher Talent: Peer Observation in Educator Evaluation. Ask the Team

ERIC Educational Resources Information Center

Jacques, Catherine

2013-01-01

Many teachers are already keen observers and skilled in supporting and collaborating with their colleagues. Leveraging this rich talent among staff can be an efficient way to address capacity challenges and enrich teachers' evaluations with more targeted feedback. Teachers, however, require training to become systematic, reliable observers who can…
Performance evaluation of the insurance companies based on AHP

NASA Astrophysics Data System (ADS)

Lu, Manhong; Zhu, Kunping

2018-04-01

With the entry of foreign capital, China's insurance industry is under increasing pressure of competition. The performance of a company is the external manifestation of its comprehensive strength. Therefore, the establishment of a scientific evaluation system is of practical significance for the insurance companies. In this paper, based on the financial and non-financial indicators of the companies, the performance evaluation system is constructed by means of the analytic hierarchy process (AHP). In the system, the weights of the indicators which represent the impact on the performance of the companies will be calculated by the process. The evaluation system is beneficial for the companies to realize their own strengths and weaknesses, so as to take steps to enhance the core competitiveness of the companies.
Evaluating supplier quality performance using fuzzy analytical hierarchy process

NASA Astrophysics Data System (ADS)

Ahmad, Nazihah; Kasim, Maznah Mat; Rajoo, Shanmugam Sundram Kalimuthu

2014-12-01

Evaluating supplier quality performance is vital in ensuring continuous supply chain improvement, reducing the operational costs and risks towards meeting customer's expectation. This paper aims to illustrate an application of Fuzzy Analytical Hierarchy Process to prioritize the evaluation criteria in a context of automotive manufacturing in Malaysia. Five main criteria were identified which were quality, cost, delivery, customer serviceand technology support. These criteria had been arranged into hierarchical structure and evaluated by an expert. The relative importance of each criteria was determined by using linguistic variables which were represented as triangular fuzzy numbers. The Center of Gravity defuzzification method was used to convert the fuzzy evaluations into their corresponding crisps values. Such fuzzy evaluation can be used as a systematic tool to overcome the uncertainty evaluation of suppliers' performance which usually associated with human being subjective judgments.
VIEWDEX: an efficient and easy-to-use software for observer performance studies.

PubMed

Håkansson, Markus; Svensson, Sune; Zachrisson, Sara; Svalkvist, Angelica; Båth, Magnus; Månsson, Lars Gunnar

2010-01-01

The development of investigation techniques, image processing, workstation monitors, analysing tools etc. within the field of radiology is vast, and the need for efficient tools in the evaluation and optimisation process of image and investigation quality is important. ViewDEX (Viewer for Digital Evaluation of X-ray images) is an image viewer and task manager suitable for research and optimisation tasks in medical imaging. ViewDEX is DICOM compatible and the features of the interface (tasks, image handling and functionality) are general and flexible. The configuration of a study and output (for example, answers given) can be edited in any text editor. ViewDEX is developed in Java and can run from any disc area connected to a computer. It is free to use for non-commercial purposes and can be downloaded from http://www.vgregion.se/sas/viewdex. In the present work, an evaluation of the efficiency of ViewDEX for receiver operating characteristic (ROC) studies, free-response ROC (FROC) studies and visual grading (VG) studies was conducted. For VG studies, the total scoring rate was dependent on the number of criteria per case. A scoring rate of approximately 150 cases h(-1) can be expected for a typical VG study using single images and five anatomical criteria. For ROC and FROC studies using clinical images, the scoring rate was approximately 100 cases h(-1) using single images and approximately 25 cases h(-1) using image stacks ( approximately 50 images case(-1)). In conclusion, ViewDEX is an efficient and easy-to-use software for observer performance studies.
Personality traits affect teaching performance of attending physicians: results of a multi-center observational study.

PubMed

Scheepers, Renée A; Lombarts, Kiki M J M H; van Aken, Marcel A G; Heineman, Maas Jan; Arah, Onyebuchi A

2014-01-01

Worldwide, attending physicians train residents to become competent providers of patient care. To assess adequate training, attending physicians are increasingly evaluated on their teaching performance. Research suggests that personality traits affect teaching performance, consistent with studied effects of personality traits on job performance and academic performance in medicine. However, up till date, research in clinical teaching practice did not use quantitative methods and did not account for specialty differences. We empirically studied the relationship of attending physicians' personality traits with their teaching performance across surgical and non-surgical specialties. We conducted a survey across surgical and non-surgical specialties in eighteen medical centers in the Netherlands. Residents evaluated attending physicians' overall teaching performance, as well as the specific domains learning climate, professional attitude, communication, evaluation, and feedback, using the validated 21-item System for Evaluation of Teaching Qualities (SETQ). Attending physicians self-evaluated their personality traits on a 5-point scale using the validated 10-item Big Five Inventory (BFI), yielding the Five Factor model: extraversion, conscientiousness, neuroticism, agreeableness and openness. Overall, 622 (77%) attending physicians and 549 (68%) residents participated. Extraversion positively related to overall teaching performance (regression coefficient, B: 0.05, 95% CI: 0.01 to 0.10, P = 0.02). Openness was negatively associated with scores on feedback for surgical specialties only (B: -0.10, 95% CI: -0.15 to -0.05, P<0.001) and conscientiousness was positively related to evaluation of residents for non-surgical specialties only (B: 0.13, 95% CI: 0.03 to 0.22, p = 0.01). Extraverted attending physicians were consistently evaluated as better supervisors. Surgical attending physicians who display high levels of openness were evaluated as less adequate feedback
PERFORMANCE EVALUATION OF TYPE I MARINE SANITATION DEVICES

EPA Science Inventory

This performance test was designed to evaluate the effectiveness of two Type I Marine Sanitation Devices (MSDs): the Electro Scan Model EST 12, manufactured by Raritan Engineering Company, Inc., and the Thermopure-2, manufactured by Gross Mechanical Laboratories, Inc. Performance...
[Analysis of microalbuminuria with immunonephelometry and high performance liquid chromatography. Evaluation of new criteria].

PubMed

Markó, Lajos; Molnár, Gergo Attila; Wagner, Zoltán; Koszegi, Tamás; Matus, Zoltán; Mohás, Márton; Kuzma, Mónika; Szijártó, István András; Wittmann, István

2008-01-13

Evaluation study, the rate of microalbuminuria positivity among the immunonephelometrically negative patients decreased to 14.5% by high performance liquid chromatography and the decrease in the number of microalbuminuria positive cases by high performance liquid chromatography could be observed mainly in the diabetic and hypertensive group (49% vs. 7.5%), while slighter decrease could be observed in the non-diabetic hypertensive group (37% vs. 26.5%). Applying the traditional criteria, the strongest predictor was the male gender by the logistic regression analysis. In 28% of microalbuminuria negative patients by immunonephelometry the diagnosis of microalbuminuria can be established using high performance liquid chromatography. Almost in one-third of microalbuminuria negative patients by immunonephelometry the diagnosis of microalbuminuria can be established by high performance liquid chromatography for which diagnosis three constitutive urine examinations are still needed. New criteria determined by the Heart Outcomes Prevention Evaluation study can be used neither in case of diabetic and hypertensive patients, nor in the case of non-diabetic hypertensive patients. The gender as the most important predictor of microalbuminuria cannot be ignored.
A comparative and experimental evaluation of performance of stocked diploid and triploid brook trout

USGS Publications Warehouse

Budy, Phaedra E.; Thiede, G.P.; Dean, A.; Olsen, D.; Rowley, G.

2012-01-01

Despite numerous negative impacts, nonnative trout are still being stocked to provide economically and socially valuable sport fisheries in western mountain lakes. We evaluated relative performance and potential differences in feeding strategy and competitive ability of triploid versus diploid brook trout Salvelinus fontinalis in alpine lakes, as well as behavioral and performance differences of diploid and triploid brook trout in two controlled experimental settings: behavioral experiments in the laboratory and performance evaluations in ponds. Across lakes, catch per unit effort (CPUE) and relative weight (Wr ) were not significantly different between ploidy levels. Mean sizes were also similar between ploidy levels except in two of the larger lakes where diploids attained slightly larger sizes (approximately 20 mm longer). We observed no significant differences between diploids and triploids in diet, diet preference, or trophic structure. Similarly, growth and condition did not differ between ploidy levels in smaller-scale pond experiments, and aggressive behavior did not differ between ploidy levels (fed or unfed fish trials) in the laboratory. Independent of ploidy level, the relative performance of brook trout varied widely among lakes, a pattern that appeared to be a function of lake size or a factor that covaries with lake size such as temperature regime or carrying capacity. In summary, we observed no significant differences in the relative performance of brook trout from either ploidy level across a number of indices, systems, and environmental conditions, nor any indication that one group is more aggressive or a superior competitor than the other. Collectively, these results suggest that triploid brook trout will offer a more risk-averse and promising management opportunity when they are stocked to these lakes and elsewhere to simultaneously meet the needs for the sport fishery and conservation objectives.
Social status determines how we monitor and evaluate our performance

PubMed Central

Kostermans, Evelien; Milivojevic, Branka; De Cremer, David

2012-01-01

Since people with low status are more likely to experience social evaluative threat and are therefore more inclined to monitor for these threats and inhibit approach behaviour, we expected that low-status subjects would be more engaged in evaluating their own performance, compared with high-status subjects. We created a highly salient social hierarchy based on the performance of a simple time estimation task. Subjects could achieve high, middle or low status while performing this task simultaneously with other two players who were either higher or lower in status. Subjects received feedback on their own performance, as well as on the performance of the other two players simultaneously. Electroencephalography (EEG) was recorded from all three participants. The results showed that medial frontal negativity (an event-related potential reflecting performance evaluation) was significantly enhanced for low-status subjects. Implications for status-related differences in goal-directed behaviour are discussed. PMID:21421733
Manipulator Performance Evaluation Using Fitts' Taping Task

DOE Office of Scientific and Technical Information (OSTI.GOV)

Draper, J.V.; Jared, B.C.; Noakes, M.W.

1999-04-25

Metaphorically, a teleoperator with master controllers projects the user's arms and hands into a re- mote area, Therefore, human users interact with teleoperators at a more fundamental level than they do with most human-machine systems. Instead of inputting decisions about how the system should func- tion, teleoperator users input the movements they might make if they were truly in the remote area and the remote machine must recreate their trajectories and impedance. This intense human-machine inter- action requires displays and controls more carefully attuned to human motor capabilities than is neces- sary with most systems. It is important for teleoperatedmore » manipulators to be able to recreate human trajectories and impedance in real time. One method for assessing manipulator performance is to observe how well a system be- haves while a human user completes human dexterity tasks with it. Fitts' tapping task has been, used many times in the past for this purpose. This report describes such a performance assessment. The International Submarine Engineering (ISE) Autonomous/Teleoperated Operations Manipulator (ATOM) servomanipulator system was evalu- ated using a generic positioning accuracy task. The task is a simple one but has the merits of (1) pro- ducing a performance function estimate rather than a point estimate and (2) being widely used in the past for human and servomanipulator dexterity tests. Results of testing using this task may, therefore, allow comparison with other manipulators, and is generically representative of a broad class of tasks. Results of the testing indicate that the ATOM manipulator is capable of performing the task. Force reflection had a negative impact on task efficiency in these data. This was most likely caused by the high resistance to movement the master controller exhibited with the force reflection engaged. Measurements of exerted forces were not made, so it is not possible to say whether the force reflection helped partici
A Novel Performance Evaluation Methodology for Single-Target Trackers.

PubMed

Kristan, Matej; Matas, Jiri; Leonardis, Ales; Vojir, Tomas; Pflugfelder, Roman; Fernandez, Gustavo; Nebehay, Georg; Porikli, Fatih; Cehovin, Luka

2016-11-01

This paper addresses the problem of single-target tracker performance evaluation. We consider the performance measures, the dataset and the evaluation system to be the most important components of tracker evaluation and propose requirements for each of them. The requirements are the basis of a new evaluation methodology that aims at a simple and easily interpretable tracker comparison. The ranking-based methodology addresses tracker equivalence in terms of statistical significance and practical differences. A fully-annotated dataset with per-frame annotations with several visual attributes is introduced. The diversity of its visual properties is maximized in a novel way by clustering a large number of videos according to their visual attributes. This makes it the most sophistically constructed and annotated dataset to date. A multi-platform evaluation system allowing easy integration of third-party trackers is presented as well. The proposed evaluation methodology was tested on the VOT2014 challenge on the new dataset and 38 trackers, making it the largest benchmark to date. Most of the tested trackers are indeed state-of-the-art since they outperform the standard baselines, resulting in a highly-challenging benchmark. An exhaustive analysis of the dataset from the perspective of tracking difficulty is carried out. To facilitate tracker comparison a new performance visualization technique is proposed.
Performativity and Affectivity: Lesson Observations in England's Further Education Colleges

ERIC Educational Resources Information Center

Edgington, Ursula

2013-01-01

Teaching and learning observations (TLOs) are used in educational environments worldwide to measure and improve quality and support professional development. TLOs can be positive for teachers who enjoy opportunities to "perform" their craft and/or engage in professional dialogue. However, if this crucial, collaborative developmental…
The Effect of Implied Performer Age and Group Membership on Evaluations of Music Performances

ERIC Educational Resources Information Center

Harrington, Ann M.

2018-01-01

This study examined the effects of implied performer age and group membership on listeners' evaluations of music performances. Undergraduate music majors (n = 23), nonmusic majors (n = 17), and members of a New Horizons ensemble (n = 16) were presented with six 30-second excerpts of concert band performances. Excerpts were presented to all…
Observer performance assessment of JPEG-compressed high-resolution chest images

NASA Astrophysics Data System (ADS)

Good, Walter F.; Maitz, Glenn S.; King, Jill L.; Gennari, Rose C.; Gur, David

1999-05-01

The JPEG compression algorithm was tested on a set of 529 chest radiographs that had been digitized at a spatial resolution of 100 micrometer and contrast sensitivity of 12 bits. Images were compressed using five fixed 'psychovisual' quantization tables which produced average compression ratios in the range 15:1 to 61:1, and were then printed onto film. Six experienced radiologists read all cases from the laser printed film, in each of the five compressed modes as well as in the non-compressed mode. For comparison purposes, observers also read the same cases with reduced pixel resolutions of 200 micrometer and 400 micrometer. The specific task involved detecting masses, pneumothoraces, interstitial disease, alveolar infiltrates and rib fractures. Over the range of compression ratios tested, for images digitized at 100 micrometer, we were unable to demonstrate any statistically significant decrease (p greater than 0.05) in observer performance as measured by ROC techniques. However, the observers' subjective assessments of image quality did decrease significantly as image resolution was reduced and suggested a decreasing, but nonsignificant, trend as the compression ratio was increased. The seeming discrepancy between our failure to detect a reduction in observer performance, and other published studies, is likely due to: (1) the higher resolution at which we digitized our images; (2) the higher signal-to-noise ratio of our digitized films versus typical CR images; and (3) our particular choice of an optimized quantization scheme.
Evaluation of the performance of a novel system for continuous glucose monitoring.

PubMed

Zschornack, Eva; Schmid, Christina; Pleus, Stefan; Link, Manuela; Klötzer, Hans-Martin; Obermaier, Karin; Schoemaker, Michael; Strasser, Monika; Frisch, Gerhard; Schmelzeisen-Redeker, Günther; Haug, Cornelia; Freckmann, Guido

2013-07-01

The performance of a continuous glucose monitoring (CGM) system in the early stage of development was assessed in an inpatient setting that simulates daily life conditions of people with diabetes. Performance was evaluated at low glycemic, euglycemic, and high glycemic ranges as well as during phases with rapid glucose excursions. Each of the 30 participants with type 1 diabetes (15 female, age 47 ± 12 years, hemoglobin A1c 7.7% ± 1.3%) wore two sensors of the prototype system in parallel for 7 days. Capillary blood samples were measured at least 16 times per day (at least 15 times per daytime and at least once per night). On two subsequent study days, glucose excursions were induced. For performance evaluation, the mean absolute relative difference (MARD) between CGM readings and paired capillary blood glucose readings and precision absolute relative difference (PARD), i.e., differences between paired CGM readings were calculated. Overall aggregated MARD was 9.2% and overall aggregated PARD was 7.5%. During induced glucose excursions, MARD was 10.9% and PARD was 7.8%. Lowest MARD (8.5%) and lowest PARD (6.4%) were observed in the high glycemic range (euglycemic range, MARD 9.1% and PARD 7.4%; low glycemic range, MARD 12.3% and PARD 12.4%). The performance of this prototype CGM system was, particularly in the hypoglycemic range and during phases with rapid glucose fluctuations, better than performance data reported for other commercially available systems. In addition, performance of this prototype sensor was noticeably constant over the whole study period. This prototype system is not yet approved, and performance of this CGM system needs to be further assessed in clinical studies. © 2013 Diabetes Technology Society.
Network Performance Measurements for NASA's Earth Observation System

NASA Technical Reports Server (NTRS)

Loiacono, Joe; Gormain, Andy; Smith, Jeff

2004-01-01

NASA's Earth Observation System (EOS) Project studies all aspects of planet Earth from space, including climate change, and ocean, ice, land, and vegetation characteristics. It consists of about 20 satellite missions over a period of about a decade. Extensive collaboration is used, both with other US. agencies (e.g., National Oceanic and Atmospheric Administration (NOA), United States Geological Survey (USGS), Department of Defense (DoD), and international agencies (e.g., European Space Agency (ESA), Japan Aerospace Exploration Agency (JAXA)), to improve cost effectiveness and obtain otherwise unavailable data. Scientific researchers are located at research institutions worldwide, primarily government research facilities and research universities. The EOS project makes extensive use of networks to support data acquisition, data production, and data distribution. Many of these functions impose requirements on the networks, including throughput and availability. In order to verify that these requirements are being met, and be pro-active in recognizing problems, NASA conducts on-going performance measurements. The purpose of this paper is to examine techniques used by NASA to measure the performance of the networks used by EOSDIS (EOS Data and Information System) and to indicate how this performance information is used.
Performance evaluation of the national early warning system for shallow landslides in Norway

NASA Astrophysics Data System (ADS)

Dahl, Mads-Peter; Piciullo, Luca; Devoli, Graziella; Colleuille, Hervé; Calvello, Michele

2017-04-01

As a consequence of the increased number of rainfall-and snowmelt-induced landslides (debris flows, debris slides, debris avalanches and slush flows) occurring in Norway, a national landslide early warning system (EWS) has been developed for monitoring and forecasting the hydro-meteorological conditions potentially necessary of triggering slope failures. The system, operational since 2013, is managed by the Norwegian Water Resources and Energy Directorate (NVE) and has been designed in cooperation with the Norwegian Public Road Administration (SVV), the Norwegian National Rail Administration (JBV) and the Norwegian Meteorological Institute (MET). Decision-making in the EWS is based upon hazard threshold levels, hydro-meteorological and real-time landslide observations as well as landslide inventory and susceptibility maps. Hazard threshold levels have been obtained through statistical analyses of historical landslides and modelled hydro-meteorological parameters. Daily hydro-meteorological conditions such as rainfall, snowmelt, runoff, soil saturation, groundwater level and frost depth have been derived from a distributed version of the hydrological HBV-model. Two different landslide susceptibility maps are used as supportive data in deciding daily warning levels. Daily alerts are issued throughout the country considering variable warning zones. Warnings are issued once per day for the following 3 days with an update possibility later during the day according to the information gathered by the monitoring variables. The performance of the EWS has been evaluated applying the EDuMaP method. In particular, the performance of warnings issued in Western Norway, in the period 2013-2014 has been evaluated using two different landslide datasets. The best performance is obtained for the smallest and more accurate dataset. Different performance results may be observed as a function of changing the landslide density criterion, Lden(k), (i.e., thresholds considered to

Quality performance of laboratory testing in pharmacies: a collaborative evaluation.

PubMed

Zaninotto, Martina; Miolo, Giorgia; Guiotto, Adriano; Marton, Silvia; Plebani, Mario

2016-11-01

The quality performance and the comparability between results of pharmacies point-of-care-testing (POCT) and institutional laboratories have been evaluated. Eight pharmacies participated in the project: a capillary specimen collected by the pharmacist and, simultaneously, a lithium-heparin sample drawn by a physician of laboratory medicine for the pharmacy customers (n=106) were analyzed in the pharmacy and in the laboratory, respectively. Glucose, cholesterol, HDL-cholesterol, triglycerides, creatinine, uric acid, aspartate aminotransferase, alanine aminotransferase, were measured using: Reflotron, n=5; Samsung, n=1; Cardiocheck PA, n=1; Cholestech LDX, n=1 and Cobas 8000. The POCT analytical performance only (phase 2) were evaluated testing, in pharmacies and in the laboratory, the lithium heparin samples from a female drawn fasting daily in a week, and a control sample containing high concentrations of glucose, cholesterol and triglycerides. For all parameters, except triglycerides, the slopes showed a satisfactory correlation. For triglycerides, a median value higher in POCT in comparison to the laboratory (1.627 mmol/L vs. 0.950 mmol/L) has been observed. The agreement in the subjects classification, demonstrates that for glucose, 70% of the subjects show concentrations below the POCT recommended level (5.8-6.1 mmol/L), while 56% are according to the laboratory limit (<5.6 mmol/L). Total cholesterol exhibits a similar trend while POCT triglycerides show a greater percentage of increased values (21% vs. 9%). The reduction in triglycerides bias (phase 2) suggests that differences between POCT and central laboratory is attributable to a pre-analytical problem. The results confirm the acceptable analytical performance of POCT pharmacies and specific criticisms in the pre- and post-analytical phases.
S-NPP ATMS Instrument Prelaunch and On-Orbit Performance Evaluation

NASA Technical Reports Server (NTRS)

Kim, Edward; Lyu, Cheng-Hsuan; Anderson, Kent; Leslie, Vincent R.; Blackwell, William J.

2014-01-01

The first of a new generation of microwave sounders was launched aboard the Suomi-National Polar-Orbiting Partnership satellite in October 2011. The Advanced Technology Microwave Sounder (ATMS) combines the capabilities and channel sets of three predecessor sounders into a single package to provide information on the atmospheric vertical temperature and moisture profiles that are the most critical observations needed for numerical weather forecast models. Enhancements include size/mass/power approximately one third of the previous total, three new sounding channels, the first space-based, Nyquist-sampled cross-track microwave temperature soundings for improved fusion with infrared soundings, plus improved temperature control and reliability. This paper describes the ATMS characteristics versus its predecessor, the advanced microwave sounding unit (AMSU), and presents the first comprehensive evaluation of key prelaunch and on-orbit performance parameters. Two-year on-orbit performance shows that the ATMS has maintained very stable radiometric sensitivity, in agreement with prelaunch data, meeting requirements for all channels (with margins of 40% for channels 1-15), and improvements over AMSU-A when processed for equivalent spatial resolution. The radiometric accuracy, determined by analysis from ground test measurements, and using on-orbit instrument temperatures, also shows large margins relative to requirements (specified as <1.0K for channels 1, 2, and 16-22 and <0.75 K for channels 3-15). A thorough evaluation of the performance of ATMS is especially important for this first proto-flight model unit of what will eventually be a series of ATMS sensors providing operational sounding capability for the U.S. and its international partners well into the next decade.
The Performance Evaluation of Corporate Universities

ERIC Educational Resources Information Center

Cappiello, Giuseppe; Pedrini, Giulio

2017-01-01

The aim of this paper is to illustrate the phenomenon of corporate universities from the perspective of the evaluation of their performance. Corporate universities have a hybrid nature that can be referred to both as a business unit and as a higher education institution. Having reviewed the literature on corporate universities and performance…
Performance Evaluation for Non-Teaching Professionals.

ERIC Educational Resources Information Center

Panebianco, Anthony F.

The program Performance Evaluation for Non-Teaching Professionals at the State University of New York Institute of Technology at Utica/Rome provides periodic assessments as required by institutional policy. The system is intended to establish a standard for judging quality of an employee's work and a rational and uniform basis for appraising…
Evaluating Observation Influence on Regional Water Budgets in Reanalyses

NASA Technical Reports Server (NTRS)

Bosilovich, Michael G.; Chern, Jiun-Dar; Mocko, David; Robertson, Franklin R.; daSilva, Arlindo M.

2014-01-01

The assimilation of observations in reanalyses incurs the potential for the physical terms of budgets to be balanced by a term relating the fit of the observations relative to a forecast first guess analysis. This may indicate a limitation in the physical processes of the background model, or perhaps inconsistencies in the observing system and its assimilation. In the MERRA reanalysis, an area of long term moisture flux divergence over land has been identified over the Central United States. Here, we evaluate the water vapor budget in this region, taking advantage of two unique features of the MERRA diagnostic output; 1) a closed water budget that includes the analysis increment and 2) a gridded diagnostic output data set of the assimilated observations and their innovations (e.g. forecast departures). In the Central United States, an anomaly occurs where the analysis adds water to the region, while precipitation decreases and moisture flux divergence increases. This is related more to a change in the observing system than to a deficiency in the model physical processes. MERRAs Gridded Innovations and Observations (GIO) data narrow the observations that influence this feature to the ATOVS and Aqua satellites during the 06Z and 18Z analysis cycles. Observing system experiments further narrow the instruments that affect the anomalous feature to AMSUA (mainly window channels) and AIRS. This effort also shows the complexities of the observing system, and the reactions of the regional water budgets in reanalyses to the assimilated observations.
A new method to evaluate human-robot system performance

NASA Technical Reports Server (NTRS)

Rodriguez, G.; Weisbin, C. R.

2003-01-01

One of the key issues in space exploration is that of deciding what space tasks are best done with humans, with robots, or a suitable combination of each. In general, human and robot skills are complementary. Humans provide as yet unmatched capabilities to perceive, think, and act when faced with anomalies and unforeseen events, but there can be huge potential risks to human safety in getting these benefits. Robots provide complementary skills in being able to work in extremely risky environments, but their ability to perceive, think, and act by themselves is currently not error-free, although these capabilities are continually improving with the emergence of new technologies. Substantial past experience validates these generally qualitative notions. However, there is a need for more rigorously systematic evaluation of human and robot roles, in order to optimize the design and performance of human-robot system architectures using well-defined performance evaluation metrics. This article summarizes a new analytical method to conduct such quantitative evaluations. While the article focuses on evaluating human-robot systems, the method is generally applicable to a much broader class of systems whose performance needs to be evaluated.
Metrics for Offline Evaluation of Prognostic Performance

NASA Technical Reports Server (NTRS)

Saxena, Abhinav; Celaya, Jose; Saha, Bhaskar; Saha, Sankalita; Goebel, Kai

2010-01-01

Prognostic performance evaluation has gained significant attention in the past few years. Currently, prognostics concepts lack standard definitions and suffer from ambiguous and inconsistent interpretations. This lack of standards is in part due to the varied end-user requirements for different applications, time scales, available information, domain dynamics, etc. to name a few. The research community has used a variety of metrics largely based on convenience and their respective requirements. Very little attention has been focused on establishing a standardized approach to compare different efforts. This paper presents several new evaluation metrics tailored for prognostics that were recently introduced and were shown to effectively evaluate various algorithms as compared to other conventional metrics. Specifically, this paper presents a detailed discussion on how these metrics should be interpreted and used. These metrics have the capability of incorporating probabilistic uncertainty estimates from prognostic algorithms. In addition to quantitative assessment they also offer a comprehensive visual perspective that can be used in designing the prognostic system. Several methods are suggested to customize these metrics for different applications. Guidelines are provided to help choose one method over another based on distribution characteristics. Various issues faced by prognostics and its performance evaluation are discussed followed by a formal notational framework to help standardize subsequent developments.
Evaluation of Aerosol-cloud Interaction in the GISS Model E Using ARM Observations

NASA Technical Reports Server (NTRS)

DeBoer, G.; Bauer, S. E.; Toto, T.; Menon, Surabi; Vogelmann, A. M.

2013-01-01

Observations from the US Department of Energy's Atmospheric Radiation Measurement (ARM) program are used to evaluate the ability of the NASA GISS ModelE global climate model in reproducing observed interactions between aerosols and clouds. Included in the evaluation are comparisons of basic meteorology and aerosol properties, droplet activation, effective radius parameterizations, and surface-based evaluations of aerosol-cloud interactions (ACI). Differences between the simulated and observed ACI are generally large, but these differences may result partially from vertical distribution of aerosol in the model, rather than the representation of physical processes governing the interactions between aerosols and clouds. Compared to the current observations, the ModelE often features elevated droplet concentrations for a given aerosol concentration, indicating that the activation parameterizations used may be too aggressive. Additionally, parameterizations for effective radius commonly used in models were tested using ARM observations, and there was no clear superior parameterization for the cases reviewed here. This lack of consensus is demonstrated to result in potentially large, statistically significant differences to surface radiative budgets, should one parameterization be chosen over another.
Formal implementation of a performance evaluation model for the face recognition system.

PubMed

Shin, Yong-Nyuo; Kim, Jason; Lee, Yong-Jun; Shin, Woochang; Choi, Jin-Young

2008-01-01

Due to usability features, practical applications, and its lack of intrusiveness, face recognition technology, based on information, derived from individuals' facial features, has been attracting considerable attention recently. Reported recognition rates of commercialized face recognition systems cannot be admitted as official recognition rates, as they are based on assumptions that are beneficial to the specific system and face database. Therefore, performance evaluation methods and tools are necessary to objectively measure the accuracy and performance of any face recognition system. In this paper, we propose and formalize a performance evaluation model for the biometric recognition system, implementing an evaluation tool for face recognition systems based on the proposed model. Furthermore, we performed evaluations objectively by providing guidelines for the design and implementation of a performance evaluation system, formalizing the performance test process.
Reactive Agility Performance in Handball; Development and Evaluation of a Sport-Specific Measurement Protocol

PubMed Central

Spasic, Miodrag; Krolo, Ante; Zenic, Natasa; Delextrat, Anne; Sekulic, Damir

2015-01-01

There is no current study that examined sport-specific tests of reactive-agility and change-of-direction-speed (CODS) to replicate real-sport environment in handball (team-handball). This investigation evaluated the reliability and validity of two novel tests designed to assess reactive-agility and CODS of handball players. Participants were female (25.14 ± 3.71 years of age; 1.77 ± 0.09 m and 74.1 ± 6.1 kg) and male handball players (26.9 ± 4.1 years of age; 1.90 ± 0.09 m and 93.90±4.6 kg). Variables included body height, body mass, body mass index, broad jump, 5-m sprint, CODS and reactive-agility tests. Results showed satisfactory reliability for reactive-agility-test and CODS-test (ICC of 0.85-0.93, and CV of 2.4-4.8%). The reactive-agility and CODS shared less than 20% of the common variance. The calculated index of perceptual and reactive capacity (P&RC; ratio between reactive-agility- and CODS-performance) is found to be valid measure in defining true-game reactive-agility performance in handball in both genders. Therefore, the handball athletes’ P&RC should be used in the evaluation of real-game reactive-agility performance. Future studies should explore other sport-specific reactive-agility tests and factors associated to such performance in sports involving agile maneuvers. Key points Reactive agility and change-of-direction-speed should be observed as independent qualities, even when tested over the same course and similar movement template The reactive-agility-performance of the handball athletes involved in defensive duties is closer to their non-reactive-agility-score than in their peers who are not involved in defensive duties The handball specific “true-game” reactive-agility-performance should be evaluated as the ratio between reactive-agility and corresponding CODS performance. PMID:26336335
Reactive Agility Performance in Handball; Development and Evaluation of a Sport-Specific Measurement Protocol.

PubMed

Spasic, Miodrag; Krolo, Ante; Zenic, Natasa; Delextrat, Anne; Sekulic, Damir

2015-09-01

There is no current study that examined sport-specific tests of reactive-agility and change-of-direction-speed (CODS) to replicate real-sport environment in handball (team-handball). This investigation evaluated the reliability and validity of two novel tests designed to assess reactive-agility and CODS of handball players. Participants were female (25.14 ± 3.71 years of age; 1.77 ± 0.09 m and 74.1 ± 6.1 kg) and male handball players (26.9 ± 4.1 years of age; 1.90 ± 0.09 m and 93.90±4.6 kg). Variables included body height, body mass, body mass index, broad jump, 5-m sprint, CODS and reactive-agility tests. Results showed satisfactory reliability for reactive-agility-test and CODS-test (ICC of 0.85-0.93, and CV of 2.4-4.8%). The reactive-agility and CODS shared less than 20% of the common variance. The calculated index of perceptual and reactive capacity (P&RC; ratio between reactive-agility- and CODS-performance) is found to be valid measure in defining true-game reactive-agility performance in handball in both genders. Therefore, the handball athletes' P&RC should be used in the evaluation of real-game reactive-agility performance. Future studies should explore other sport-specific reactive-agility tests and factors associated to such performance in sports involving agile maneuvers. Key pointsReactive agility and change-of-direction-speed should be observed as independent qualities, even when tested over the same course and similar movement templateThe reactive-agility-performance of the handball athletes involved in defensive duties is closer to their non-reactive-agility-score than in their peers who are not involved in defensive dutiesThe handball specific "true-game" reactive-agility-performance should be evaluated as the ratio between reactive-agility and corresponding CODS performance.
Evaluating Innovations in Home Care for Performance Accountability.

PubMed

Collister, Barbara; Gutscher, Abram; Ambrogiano, Jana

2016-01-01

Concerns about rising costs and the sustainability of our healthcare system have led to a drive for innovative solutions and accountability for performance. Integrated Home Care, Calgary Zone, Alberta Health Services went beyond traditional accountability measures to use evaluation methodology to measure the progress of complex innovations to its organization structure and service delivery model. This paper focuses on the first two phases of a three-phase evaluation. The results of the first two phases generated learning about innovation adoption and sustainability, and performance accountability at the program-level of a large publicly funded healthcare organization.
Occupational safety of different industrial sectors in Khartoum State, Sudan. Part 1: Safety performance evaluation.

PubMed

Zaki, Gehan R; El-Marakby, Fadia A; H Deign El-Nor, Yasser; Nofal, Faten H; Zakaria, Adel M

2012-12-01

Safety performance evaluation enables decision makers improve safety acts. In Sudan, accident records, statistics, and safety performance were not evaluated before maintenance of accident records became mandatory in 2005. This study aimed at evaluating and comparing safety performance by accident records among different cities and industrial sectors in Khartoum state, Sudan, during the period from 2005 to 2007. This was a retrospective study, the sample in which represented all industrial enterprises in Khartoum state employing 50 workers or more. All industrial accident records of the Ministry of Manpower and Health and those of different enterprises during the period from 2005 to 2007 were reviewed. The safety performance indicators used within this study were the frequency-severity index (FSI) and fatal and disabling accident frequency rates (DAFR). In Khartoum city, the FSI [0.10 (0.17)] was lower than that in Bahari [0.11 (0.21)] and Omdurman [0.84 (0.34)]. It was the maximum in the chemical sector [0.33 (0.64)] and minimum in the metallurgic sector [0.09 (0.19)]. The highest DAFR was observed in Omdurman [5.6 (3.5)] and in the chemical sector [2.5 (4.0)]. The fatal accident frequency rate in the mechanical and electrical engineering industry was the highest [0.0 (0.69)]. Male workers who were older, divorced, and had lower levels of education had the lowest safety performance indicators. The safety performance of the industrial enterprises in Khartoum city was the best. The safety performance in the chemical sector was the worst with regard to FSI and DAFR. The age, sex, and educational level of injured workers greatly affect safety performance.
The association between self-perceived proficiency of personal protective equipment and objective performance: An observational study during a bioterrorism simulation drill.

PubMed

Fogel, Itay; David, Osant; Balik, Chaya H; Eisenkraft, Arik; Poles, Lion; Shental, Omri; Kassirer, Michael; Brosh-Nissimov, Tal

2017-11-01

The recent Ebola virus disease outbreak emphasized the potential misuse of personal protective equipment (PPE) by health care workers (HCWs) during such an event. We aimed to compare self-perceived proficiency of PPE use and objective performance, and identify predictors of low compliance and PPE misuse. An observational study combined with subjective questionnaires were carried out during a bioterror simulation drill. Forty-two observers evaluated performance under PPE. Mistakes were recorded and graded using a structured observational format and were correlated with the subjective questionnaires and with demographic parameters. One hundred seventy-eight HCWs from community clinics and hospitals were included. The mean self-perceived proficiency was high (6.1 out of 7), mean level of comfort was moderate (4.0 out of 7), and mean objective performance was intermediate (9.5 out of 13). There was no correlation between comfort and objective performance scores. Self-perceived proficiency was in correlation with donning and continuous performance with PPE but not with doffing. Clinic personnel performed better than personnel in hospitals (40.3% vs 67.8% with 3 or more mistakes, respectively; P = .001). Demographic characteristics had no correlation with objective or self-perceived performance. Self-perceived proficiency is a poor predictor of appropriate PPE use. The results suggest poor awareness of the possibility of PPE misuse. Copyright © 2017 Association for Professionals in Infection Control and Epidemiology, Inc. Published by Elsevier Inc. All rights reserved.
Performance Evaluation and Parameter Identification on DROID III

NASA Technical Reports Server (NTRS)

Plumb, Julianna J.

2011-01-01

The DROID III project consisted of two main parts. The former, performance evaluation, focused on the performance characteristics of the aircraft such as lift to drag ratio, thrust required for level flight, and rate of climb. The latter, parameter identification, focused on finding the aerodynamic coefficients for the aircraft using a system that creates a mathematical model to match the flight data of doublet maneuvers and the aircraft s response. Both portions of the project called for flight testing and that data is now available on account of this project. The conclusion of the project is that the performance evaluation data is well-within desired standards but could be improved with a thrust model, and that parameter identification is still in need of more data processing but seems to produce reasonable results thus far.
Evaluation of seismic performance of reinforced concrete (RC) buildings under near-field earthquakes

NASA Astrophysics Data System (ADS)

Moniri, Hassan

2017-03-01

Near-field ground motions are significantly severely affected on seismic response of structure compared with far-field ground motions, and the reason is that the near-source forward directivity ground motions contain pulse-long periods. Therefore, the cumulative effects of far-fault records are minor. The damage and collapse of engineering structures observed in the last decades' earthquakes show the potential of damage in existing structures under near-field ground motions. One important subject studied by earthquake engineers as part of a performance-based approach is the determination of demand and collapse capacity under near-field earthquake. Different methods for evaluating seismic structural performance have been suggested along with and as part of the development of performance-based earthquake engineering. This study investigated the results of illustrious characteristics of near-fault ground motions on the seismic response of reinforced concrete (RC) structures, by the use of Incremental Nonlinear Dynamic Analysis (IDA) method. Due to the fact that various ground motions result in different intensity-versus-response plots, this analysis is done again under various ground motions in order to achieve significant statistical averages. The OpenSees software was used to conduct nonlinear structural evaluations. Numerical modelling showed that near-source outcomes cause most of the seismic energy from the rupture to arrive in a single coherent long-period pulse of motion and permanent ground displacements. Finally, a vulnerability of RC building can be evaluated against pulse-like near-fault ground motions effects.
Performance and evaluation of real-time multicomputer control systems

NASA Technical Reports Server (NTRS)

Shin, K. G.

1983-01-01

New performance measures, detailed examples, modeling of error detection process, performance evaluation of rollback recovery methods, experiments on FTMP, and optimal size of an NMR cluster are discussed.
Use of video observation and motor imagery on jumping performance in national rhythmic gymnastics athletes.

PubMed

Battaglia, Claudia; D'Artibale, Emanuele; Fiorilli, Giovanni; Piazza, Marina; Tsopani, Despina; Giombini, Arrigo; Calcagno, Giuseppe; di Cagno, Alessandra

2014-12-01

The aim of this study was to evaluate whether a mental training protocol could improve gymnastic jumping performance. Seventy-two rhythmic gymnasts were randomly divided into an experimental and control group. At baseline, experimental group completed the Movement Imagery Questionnaire Revised (MIQ-R) to assess the gymnast ability to generate movement imagery. A repeated measures design was used to compare two different types of training aimed at improving jumping performance: (a) video observation and PETTLEP mental training associated with physical practice, for the experimental group, and (b) physical practice alone for the control group. Before and after six weeks of training, their jumping performance was measured using the Hopping Test (HT), Drop Jump (DJ), and Counter Movement Jump (CMJ). Results revealed differences between jumping parameters F(1,71)=11.957; p<.01, and between groups F(1,71)=10.620; p<.01. In the experimental group there were significant correlations between imagery ability and the post-training Flight Time of the HT, r(34)=-.295, p<.05 and the DJ, r(34)=-.297, p<.05. The application of the protocol described herein was shown to improve jumping performance, thereby preserving the elite athlete's energy for other tasks. Copyright © 2014 Elsevier B.V. All rights reserved.
Asynchronous threat awareness by observer trials using crowd simulation

NASA Astrophysics Data System (ADS)

Dunau, Patrick; Huber, Samuel; Stein, Karin U.; Wellig, Peter

2016-10-01

The last few years showed that a high risk of asynchronous threats is given in every day life. Especially in large crowds a high probability of asynchronous attacks is evident. High observational abilities to detect threats are desirable. Consequently highly trained security and observation personal is needed. This paper evaluates the effectiveness of a training methodology to enhance performance of observation personnel engaging in a specific target identification task. For this purpose a crowd simulation video is utilized. The study first provides a measurement of the base performance before the training sessions. Furthermore a training procedure will be performed. Base performance will then be compared to the after training performance in order to look for a training effect. A thorough evaluation of both the training sessions as well as the overall performance will be done in this paper. A specific hypotheses based metric is used. Results will be discussed in order to provide guidelines for the design of training for observational tasks.
Personality Traits Affect Teaching Performance of Attending Physicians: Results of a Multi-Center Observational Study

PubMed Central

Scheepers, Renée A.; Lombarts, Kiki M. J. M. H.; van Aken, Marcel A. G.; Heineman, Maas Jan; Arah, Onyebuchi A.

2014-01-01

Background Worldwide, attending physicians train residents to become competent providers of patient care. To assess adequate training, attending physicians are increasingly evaluated on their teaching performance. Research suggests that personality traits affect teaching performance, consistent with studied effects of personality traits on job performance and academic performance in medicine. However, up till date, research in clinical teaching practice did not use quantitative methods and did not account for specialty differences. We empirically studied the relationship of attending physicians' personality traits with their teaching performance across surgical and non-surgical specialties. Method We conducted a survey across surgical and non-surgical specialties in eighteen medical centers in the Netherlands. Residents evaluated attending physicians' overall teaching performance, as well as the specific domains learning climate, professional attitude, communication, evaluation, and feedback, using the validated 21-item System for Evaluation of Teaching Qualities (SETQ). Attending physicians self-evaluated their personality traits on a 5-point scale using the validated 10-item Big Five Inventory (BFI), yielding the Five Factor model: extraversion, conscientiousness, neuroticism, agreeableness and openness. Results Overall, 622 (77%) attending physicians and 549 (68%) residents participated. Extraversion positively related to overall teaching performance (regression coefficient, B: 0.05, 95% CI: 0.01 to 0.10, P = 0.02). Openness was negatively associated with scores on feedback for surgical specialties only (B: −0.10, 95% CI: −0.15 to −0.05, P<0.001) and conscientiousness was positively related to evaluation of residents for non-surgical specialties only (B: 0.13, 95% CI: 0.03 to 0.22, p = 0.01). Conclusions Extraverted attending physicians were consistently evaluated as better supervisors. Surgical attending physicians who display high levels of

Synthesis and Performance Evaluation of Pulse Electrodeposited Ni-AlN Nanocomposite Coatings

PubMed Central

Ali, Kamran; Narayana, Sivaprasad; Okonkwo, Paul C.; Yusuf, Moinuddin M.; Alashraf, Abdullah

2018-01-01

This research work presents the microscopic analysis of pulse electrodeposited Ni-AlN nanocomposite coatings using SEM and AFM techniques and their performance evaluation (mechanical and electrochemical) by employing nanoindentation and electrochemical methods. The Ni-AlN nanocomposite coatings were developed by pulse electrodeposition. The nickel matrix was reinforced with various amounts of AlN nanoparticles (3, 6, and 9 g/L) to develop Ni-AlN nanocomposite coatings. The effect of reinforcement concentration on structure, surface morphology, and mechanical and anticorrosion properties was studied. SEM and AFM analyses indicate that Ni-AlN nanocomposite coatings have dense, homogenous, and well-defined pyramid structure containing uniformly distributed AlN particles. A decent improvement in the corrosion protection performance is also observed by the addition of AlN particles to the nickel matrix. Corrosion current was reduced from 2.15 to 1.29 μA cm−2 by increasing the AlN particles concentration from 3 to 9 g/L. It has been observed that the properties of Ni-AlN nanocomposite coating are sensitive to the concentration of AlN nanoparticles used as reinforcement. PMID:29619143
Defining Administrative Tasks, Evaluating Performance, and Developing Skills.

ERIC Educational Resources Information Center

Herman, Janice L.; Herman, Jerry J.

1995-01-01

To ensure high performance, administrators should develop an articulated structure and process systems approach that identifies the critical success factors (CSFs) of performance for each position; appropriate indicators and scales; and a personal-improvement plan based on last year's evaluation. Once CSFs are identified and written into the…
Assessing hospital disaster preparedness: a comparison of an on-site survey, directly observed drill performance, and video analysis of teamwork.

PubMed

Kaji, Amy H; Langford, Vinette; Lewis, Roger J

2008-09-01

There is currently no validated method for assessing hospital disaster preparedness. We determine the degree of correlation between the results of 3 methods for assessing hospital disaster preparedness: administration of an on-site survey, drill observation using a structured evaluation tool, and video analysis of team performance in the hospital incident command center. This was a prospective, observational study conducted during a regional disaster drill, comparing the results from an on-site survey, a structured disaster drill evaluation tool, and a video analysis of teamwork, performed at 6 911-receiving hospitals in Los Angeles County, CA. The on-site survey was conducted separately from the drill and assessed hospital disaster plan structure, vendor agreements, modes of communication, medical and surgical supplies, involvement of law enforcement, mutual aid agreements with other facilities, drills and training, surge capacity, decontamination capability, and pharmaceutical stockpiles. The drill evaluation tool, developed by Johns Hopkins University under contract from the Agency for Healthcare Research and Quality, was used to assess various aspects of drill performance, such as the availability of the hospital disaster plan, the geographic configuration of the incident command center, whether drill participants were identifiable, whether the noise level interfered with effective communication, and how often key information (eg, number of available staffed floor, intensive care, and isolation beds; number of arriving victims; expected triage level of victims; number of potential discharges) was received by the incident command center. Teamwork behaviors in the incident command center were quantitatively assessed, using the MedTeams analysis of the video recordings obtained during the disaster drill. Spearman rank correlations of the results between pair-wise groupings of the 3 assessment methods were calculated. The 3 evaluation methods demonstrated
Towards Reliable Evaluation of Anomaly-Based Intrusion Detection Performance

NASA Technical Reports Server (NTRS)

Viswanathan, Arun

2012-01-01

This report describes the results of research into the effects of environment-induced noise on the evaluation process for anomaly detectors in the cyber security domain. This research was conducted during a 10-week summer internship program from the 19th of August, 2012 to the 23rd of August, 2012 at the Jet Propulsion Laboratory in Pasadena, California. The research performed lies within the larger context of the Los Angeles Department of Water and Power (LADWP) Smart Grid cyber security project, a Department of Energy (DoE) funded effort involving the Jet Propulsion Laboratory, California Institute of Technology and the University of Southern California/ Information Sciences Institute. The results of the present effort constitute an important contribution towards building more rigorous evaluation paradigms for anomaly-based intrusion detectors in complex cyber physical systems such as the Smart Grid. Anomaly detection is a key strategy for cyber intrusion detection and operates by identifying deviations from profiles of nominal behavior and are thus conceptually appealing for detecting "novel" attacks. Evaluating the performance of such a detector requires assessing: (a) how well it captures the model of nominal behavior, and (b) how well it detects attacks (deviations from normality). Current evaluation methods produce results that give insufficient insight into the operation of a detector, inevitably resulting in a significantly poor characterization of a detectors performance. In this work, we first describe a preliminary taxonomy of key evaluation constructs that are necessary for establishing rigor in the evaluation regime of an anomaly detector. We then focus on clarifying the impact of the operational environment on the manifestation of attacks in monitored data. We show how dynamic and evolving environments can introduce high variability into the data stream perturbing detector performance. Prior research has focused on understanding the impact of this
Performance comparison of attitude determination, attitude estimation, and nonlinear observers algorithms

NASA Astrophysics Data System (ADS)

MOHAMMED, M. A. SI; BOUSSADIA, H.; BELLAR, A.; ADNANE, A.

2017-01-01

This paper presents a brief synthesis and useful performance analysis of different attitude filtering algorithms (attitude determination algorithms, attitude estimation algorithms, and nonlinear observers) applied to Low Earth Orbit Satellite in terms of accuracy, convergence time, amount of memory, and computation time. This latter is calculated in two ways, using a personal computer and also using On-board computer 750 (OBC 750) that is being used in many SSTL Earth observation missions. The use of this comparative study could be an aided design tool to the designer to choose from an attitude determination or attitude estimation or attitude observer algorithms. The simulation results clearly indicate that the nonlinear Observer is the more logical choice.
Reducing the Effects of Gender Stereotypes on Performance Evaluations.

ERIC Educational Resources Information Center

Bauer, Cara C.; Baltes, Boris B.

2002-01-01

Examined whether a structured free recall intervention could decrease the influence of traditional gender stereotypes on the performance evaluations of women. College students provided performance ratings for vignettes describing the performance of male and female professors. Without the intervention, raters who had traditional stereotypes…
Practical performance evaluation of the Wave Glider in geophysical observations

NASA Astrophysics Data System (ADS)

Sugioka, Hiroko; Hamano, Yozo

2016-04-01

The Wave Glider (WG), manufactured by Liquid Robotics Inc. of California, USA, is the first wave and solar powered autonomous sea surface vehicle. It has led the way to make ocean data collection and communications easier and safer, lower risk and cost, and real-time. By analyzing data from a long-term deployment of the WG in the sea to investigate the feasibility, an assessment of operating characteristics informs the potential utility of the WG to identify the parameters for a seafloor experiment designed the WG as a station-keeping gateway. We apply the WG in the following two observation systems that we have been developing. First, after the 2011 Tohoku earthquake tsunami, we have developed a real-time offshore tsunami monitoring system using a new type of seafloor tsunami sensor called Vector TsunaMeter (VTM) able to directly estimate the tsunami propagation vector based on the electromagnetic induction theory to provide early and reliable information at the coastal area. The WG equipped with both an acoustic modem and a satellite communication modem is used in the system as a relay platform for data transfer and communications between the sea bottom observatory and the land station. We had some experiments beginning with newly developing of the VTM in November 2012 to complete as a real-time monitoring system using the WG in March 2014. During the last experiment, we succeeded in detecting the micro-tsunami associated with the 2014 Iquique, Chile earthquake with Mw 8.2 on April 1 to confirm the practical utility of the WG. Second, since the Nishinoshima volcano of the Bonin Islands erupted in November 2013, we have been developing an isolated volcanic activity monitoring system using the unmanned WG vehicle. In this system the WG plays roles not only in a relay station with a satellite communication modem but also in a multi-purpose observatory platform with microphone for detecting acoustic waves in the air due to eruptions, with hydrophones for detecting
Characteristic Evaluation on Cooling Performance of Thermoelectric Modules.

PubMed

Seo, Sae Rom; Han, Seungwoo

2015-10-01

The aim of this work is to develop a performance evaluation system for thermoelectric cooling modules. We describe the design of such a system, composed of a vacuum chamber with a heat sink along with a metal block to measure the absorbed heat Qc. The system has a simpler structure than existing water-cooled or air-cooled systems. The temperature difference between the cold and hot sides of the thermoelectric module ΔT can be accurately measured without any effects due to convection, and the temperature equilibrium time is minimized compared to a water-cooled system. The evaluation system described here can be used to measure characteristic curves of Qc as a function of ΔT, as well as the current-voltage relations. High-performance thermoelectric systems can therefore be developed using optimal modules evaluated with this system.
A Proposed RTN Officer Performance Evaluation System

DTIC Science & Technology

1989-12-01

Taa& No. WokI Unlit Acca ~def 11¶. TITLE (biclde Securiy ClassifiCation) A PROPOSED ROYAL THAI NAVY OFIICER PERFORM NCE EVALUATION SYSTEM 12. PERSONAL...all aspects of performance into account , the commanding officer uses his opinion to decide who is "the best." There are no standard guidelines for...ftequently used in orgunsadozn as a bais for adminiardstive decisions such as employee promotion., tuufer, and allocation of financial reward; employee
Performance of CMIP3 and CMIP5 GCMs to simulate observed rainfall characteristics over the Western Himalayan region

NASA Astrophysics Data System (ADS)

Meher, J. K.; Das, L.

2017-12-01

The Western Himalayan Region (WHR) was subject to a significant negative trend in the annual and monsoon rainfall during 1902-2005. Annual and seasonal rainfall change over WHR of India was estimated using 22 rain gauge station rainfall data from the India Meteorological Department. The performance of 13 global climate models (GCMs) from the coupled model intercomparison project phase 3 (CMIP3) and 42 GCMs from CMIP5 was evaluated through multiple analysis: the evaluation of the mean annual cycle, annual cycles of interannual variability, spatial patterns, trends and signal-to-noise ratio. In general, CMIP5 GCMs were more skillful in terms of simulating the annual cycle of interannual variability compared to CMIP3 GCMs. The CMIP3 GCMs failed to reproduce the observed trend whereas 50% of the CMIP5 GCMs reproduced the statistical distribution of short-term (30-years) trend-estimates than for the longer term (99-years). GCMs from both CMIP3 and CMIP5 were able to simulate the spatial distribution of observed rainfall in pre-monsoon and winter months. Based on performance, each model of CMIP3 and CMIP5 was given an overall rank, which puts the high resolution version of the MIROC3.2 model (MIROC3.2 hires) and MIROC5 at the top in CMIP3 and CMIP5 respectively. Robustness of the ranking was judged through a sensitivity analysis, which indicated that ranks were independent during the process of adding or removing any individual method. It also revealed that trend analysis was not a robust method of judging performances of the model as compared to other methods.
Reliability and performance evaluation of systems containing embedded rule-based expert systems

NASA Technical Reports Server (NTRS)

Beaton, Robert M.; Adams, Milton B.; Harrison, James V. A.

1989-01-01

A method for evaluating the reliability of real-time systems containing embedded rule-based expert systems is proposed and investigated. It is a three stage technique that addresses the impact of knowledge-base uncertainties on the performance of expert systems. In the first stage, a Markov reliability model of the system is developed which identifies the key performance parameters of the expert system. In the second stage, the evaluation method is used to determine the values of the expert system's key performance parameters. The performance parameters can be evaluated directly by using a probabilistic model of uncertainties in the knowledge-base or by using sensitivity analyses. In the third and final state, the performance parameters of the expert system are combined with performance parameters for other system components and subsystems to evaluate the reliability and performance of the complete system. The evaluation method is demonstrated in the context of a simple expert system used to supervise the performances of an FDI algorithm associated with an aircraft longitudinal flight-control system.
Using hybrid method to evaluate the green performance in uncertainty.

PubMed

Tseng, Ming-Lang; Lan, Lawrence W; Wang, Ray; Chiu, Anthony; Cheng, Hui-Ping

2011-04-01

Green performance measure is vital for enterprises in making continuous improvements to maintain sustainable competitive advantages. Evaluation of green performance, however, is a challenging task due to the dependence complexity of the aspects, criteria, and the linguistic vagueness of some qualitative information and quantitative data together. To deal with this issue, this study proposes a novel approach to evaluate the dependence aspects and criteria of firm's green performance. The rationale of the proposed approach, namely green network balanced scorecard, is using balanced scorecard to combine fuzzy set theory with analytical network process (ANP) and importance-performance analysis (IPA) methods, wherein fuzzy set theory accounts for the linguistic vagueness of qualitative criteria and ANP converts the relations among the dependence aspects and criteria into an intelligible structural modeling used IPA. For the empirical case study, four dependence aspects and 34 green performance criteria for PCB firms in Taiwan were evaluated. The managerial implications are discussed.
Performance evaluation of a second-generation elastic loop mobility system

NASA Technical Reports Server (NTRS)

Melzer, K. J.; Swanson, G. D.

1974-01-01

Tests were conducted to evaluate the mobility performance of a second-generation Elastic Loop Mobility System (ELMS II). Performance on level test lanes and slopes of lunar soil simulant (LSS) and obstacle-surmounting and crevasse-crossing capabilities were investigated. In addition, internal losses and contact pressure distributions were evaluated. To evaluate the soft-soil performance, two basic soil conditions were tested: loose (LSS1) and dense (LSS5). These conditions embrace the spectrum of soil strengths tested during recent studies for NASA related to the mobility performance of the LRV. Data indicated that for the tested range of the various performance parameters, performance was independent of unit load (contact pressure) and ELMS II drum angular velocity, but was influenced by soil strength and ELMS pitch mode. Power requirements were smaller at a given system output for dense soil than for loose soil. The total system output in terms of pull developed or slope-climbing capability was larger for the ELMS II operating in restrained-pitch mode than in free-pitch mode.
Validity evidence for the Simulated Colonoscopy Objective Performance Evaluation scoring system.

PubMed

Trinca, Kristen D; Cox, Tiffany C; Pearl, Jonathan P; Ritter, E Matthew

2014-02-01

Low-cost, objective systems to assess and train endoscopy skills are needed. The aim of this study was to evaluate the ability of Simulated Colonoscopy Objective Performance Evaluation to assess the skills required to perform endoscopy. Thirty-eight subjects were included in this study, all of whom performed 4 tasks. The scoring system measured performance by calculating precision and efficiency. Data analysis assessed the relationship between colonoscopy experience and performance on each task and the overall score. Endoscopic trainees' Simulated Colonoscopy Objective Performance Evaluation scores correlated significantly with total colonoscopy experience (r = .61, P = .003) and experience in the past 12 months (r = .63, P = .002). Significant differences were seen among practicing endoscopists, nonendoscopic surgeons, and trainees (P < .0001). When the 4 tasks were analyzed, each showed significant correlation with colonoscopy experience (scope manipulation, r = .44, P = .044; tool targeting, r = .45, P = .04; loop management, r = .47, P = .032; mucosal inspection, r = .65, P = .001) and significant differences in performance between the endoscopist groups, except for mucosal inspection (scope manipulation, P < .0001; tool targeting, P = .002; loop management, P = .0008; mucosal inspection, P = .27). Simulated Colonoscopy Objective Performance Evaluation objectively assesses the technical skills required to perform endoscopy and shows promise as a platform for proficiency-based skills training. Published by Elsevier Inc.
On-line evaluation of multiloop digital controller performance

NASA Technical Reports Server (NTRS)

Wieseman, Carol D.

1993-01-01

The purpose of this presentation is to inform the Guidance and Control community of capabilities which were developed by the Aeroservoelasticity Branch to evaluate the performance of multivariable control laws, on-line, during wind-tunnel testing. The capabilities are generic enough to be useful for all kinds of on-line analyses involving multivariable control in experimental testing. Consequently, it was decided to present this material at this workshop even though it has been presented elsewhere. Topics covered include: essential on-line analysis requirements; on-line analysis capabilities; on-line analysis software; frequency domain procedures; controller performance evaluation frequency-domain flutter suppression; and plant determination.
Quantitative Evaluation of Performance during Robot-assisted Treatment.

PubMed

Peri, E; Biffi, E; Maghini, C; Servodio Iammarrone, F; Gagliardi, C; Germiniasi, C; Pedrocchi, A; Turconi, A C; Reni, G

2016-01-01

This article is part of the Focus Theme of Methods of Information in Medicine on "Methodologies, Models and Algorithms for Patients Rehabilitation". The great potential of robots in extracting quantitative and meaningful data is not always exploited in clinical practice. The aim of the present work is to describe a simple parameter to assess the performance of subjects during upper limb robotic training exploiting data automatically recorded by the robot, with no additional effort for patients and clinicians. Fourteen children affected by cerebral palsy (CP) performed a training with Armeo®Spring. Each session was evaluated with P, a simple parameter that depends on the overall performance recorded, and median and interquartile values were computed to perform a group analysis. Median (interquartile) values of P significantly increased from 0.27 (0.21) at T0 to 0.55 (0.27) at T1 . This improvement was functionally validated by a significant increase of the Melbourne Assessment of Unilateral Upper Limb Function. The parameter described here was able to show variations in performance over time and enabled a quantitative evaluation of motion abilities in a way that is reliable with respect to a well-known clinical scale.
Comparison of the medical students' perceived self-efficacy and the evaluation of the observers and patients.

PubMed

Ammentorp, Jette; Thomsen, Janus Laust; Jarbøl, Dorte Ejg; Holst, René; Øvrehus, Anne Lindebo Holm; Kofoed, Poul-Erik

2013-04-08

The accuracy of self-assessment has been questioned in studies comparing physicians' self-assessments to observed assessments; however, none of these studies used self-efficacy as a method for self-assessment. The aim of the study was to investigate how medical students' perceived self-efficacy of specific communication skills corresponds to the evaluation of simulated patients and observers. All of the medical students who signed up for an Objective Structured Clinical Examination (OSCE) were included. As a part of the OSCE, the student performance in the "parent-physician interaction" was evaluated by a simulated patient and an observer at one of the stations. After the examination the students were asked to assess their self-efficacy according to the same specific communication skills. The Calgary Cambridge Observation Guide formed the basis for the outcome measures used in the questionnaires. A total of 12 items was rated on a Likert scale from 1-5 (strongly disagree to strongly agree). We used extended Rasch models for comparisons between the groups of responses of the questionnaires. Comparisons of groups were conducted on dichotomized responses. Eighty-four students participated in the examination, 87% (73/84) of whom responded to the questionnaire. The response rate for the simulated patients and the observers was 100%. Significantly more items were scored in the highest categories (4 and 5) by the observers and simulated patients compared to the students (observers versus students: -0.23; SE:0.112; p=0.002 and patients versus students:0.177; SE:0.109; p=0.037). When analysing the items individually, a statistically significant difference only existed for two items. This study showed that students scored their communication skills lower compared to observers or simulated patients. The differences were driven by only 2 of 12 items. The results in this study indicate that self-efficacy based on the Calgary Cambridge Observation guide seems to be a reliable
Risk-adjusted performance evaluation in three academic thoracic surgery units using the Eurolung risk models.

PubMed

Pompili, Cecilia; Shargall, Yaron; Decaluwe, Herbert; Moons, Johnny; Chari, Madhu; Brunelli, Alessandro

2018-01-03

The objective of this study was to evaluate the performance of 3 thoracic surgery centres using the Eurolung risk models for morbidity and mortality. This was a retrospective analysis performed on data collected from 3 academic centres (2014-2016). Seven hundred and twenty-one patients in Centre 1, 857 patients in Centre 2 and 433 patients in Centre 3 who underwent anatomical lung resections were analysed. The Eurolung1 and Eurolung2 models were used to predict risk-adjusted cardiopulmonary morbidity and 30-day mortality rates. Observed and risk-adjusted outcomes were compared within each centre. The observed morbidity of Centre 1 was in line with the predicted morbidity (observed 21.1% vs predicted 22.7%, P = 0.31). Centre 2 performed better than expected (observed morbidity 20.2% vs predicted 26.7%, P < 0.001), whereas the observed morbidity of Centre 3 was higher than the predicted morbidity (observed 41.1% vs predicted 24.3%, P < 0.001). Centre 1 had higher observed mortality when compared with the predicted mortality (3.6% vs 2.1%, P = 0.005), whereas Centre 2 had an observed mortality rate significantly lower than the predicted mortality rate (1.2% vs 2.5%, P = 0.013). Centre 3 had an observed mortality rate in line with the predicted mortality rate (observed 1.4% vs predicted 2.4%, P = 0.17). The observed mortality rates in the patients with major complications were 30.8% in Centre 1 (versus predicted mortality rate 3.8%, P < 0.001), 8.2% in Centre 2 (versus predicted mortality rate 4.1%, P = 0.030) and 9.0% in Centre 3 (versus predicted mortality rate 3.5%, P = 0.014). The Eurolung models were successfully used as risk-adjusting instruments to internally audit the outcomes of 3 different centres, showing their applicability for future quality improvement initiatives. © The Author(s) 2018. Published by Oxford University Press on behalf of the European Association for Cardio-Thoracic Surgery. All rights reserved.
An Empirical Study on Low-Carbon: Human Resources Performance Evaluation

PubMed Central

Chen, Quan; Tsai, Sang-Bing; Zhou, Jie; Yu, Jian; Chang, Li-Chung; Li, Guodong; Zheng, Yuxiang; Wang, Jiangtao

2018-01-01

Low-carbon logistics meets the requirements of a low-carbon economy and is the most effective operating model for logistic development to achieve sustainability by coping with severe energy consumption and global warming. Low-carbon logistics aims to reduce carbon intensity rather than simply reduce energy consumption and carbon emissions. Human resources are an important part of the great competition in the logistics market and significantly affect the operations of enterprises. Performance evaluations of human resources are particularly important for low-carbon logistics enterprises with scarce talents. Such evaluations in these enterprises are of great significance for their strategic development. This study constructed a human resource performance evaluation system to assess non-managerial employees’ low-carbon job capacity, job performance, and job attitude in the low-carbon logistics sector. The case study results revealed that the investigated company enjoyed initial success after having promoted low-carbon concepts and values to its non-managerial employees, and the success was demonstrated by excellent performance in its employees’ job attitude and knowledge. This study adopts the AHP method to reasonably determine an indicator system of performance evaluation and its weight to avoid certain human-caused bias. This study not only fills the gap in the related literature, but can also be applied to industrial practice. PMID:29301375
An Empirical Study on Low-Carbon: Human Resources Performance Evaluation.

PubMed

Chen, Quan; Tsai, Sang-Bing; Zhai, Yuming; Zhou, Jie; Yu, Jian; Chang, Li-Chung; Li, Guodong; Zheng, Yuxiang; Wang, Jiangtao

2018-01-03

Low-carbon logistics meets the requirements of a low-carbon economy and is the most effective operating model for logistic development to achieve sustainability by coping with severe energy consumption and global warming. Low-carbon logistics aims to reduce carbon intensity rather than simply reduce energy consumption and carbon emissions. Human resources are an important part of the great competition in the logistics market and significantly affect the operations of enterprises. Performance evaluations of human resources are particularly important for low-carbon logistics enterprises with scarce talents. Such evaluations in these enterprises are of great significance for their strategic development. This study constructed a human resource performance evaluation system to assess non-managerial employees' low-carbon job capacity, job performance, and job attitude in the low-carbon logistics sector. The case study results revealed that the investigated company enjoyed initial success after having promoted low-carbon concepts and values to its non-managerial employees, and the success was demonstrated by excellent performance in its employees' job attitude and knowledge. This study adopts the AHP method to reasonably determine an indicator system of performance evaluation and its weight to avoid certain human-caused bias. This study not only fills the gap in the related literature, but can also be applied to industrial practice.

The Effects of Group Interdependence on Supervisor Performance Evaluations.

ERIC Educational Resources Information Center

Liden, Robert C.; Mitchell, Terence R.

1983-01-01

Tested the effect of group member interdependence on supervisory performance ratings. Students (N=72) played the role of supervisors in charge of evaluating members of a three-person work group. Results showed supervisors rated the poor performer higher and the good performers lower when the group was portrayed as highly interdependent. (JAC)
Strategic performance evaluation in cancer centers.

PubMed

Delgado, Rigoberto I; Langabeer, James R

2009-01-01

Most research in healthcare strategy has focused on formulating or implementing organizational plans and strategies, and little attention has been dedicated to the post-implementation control and evaluation of strategy, which we contend is the most critical aspect of achieving organizational goals. The objective of this study was to identify strategic control approaches used by major cancer centers in the country and to relate these practices to financial performance. Our intent was to expand the theory and practice of healthcare strategy to focused services, such as oncology. We designed a 17-question survey to capture elements of strategy and performance from our study sample, which comprised major cancer hospitals in the United States and shared similar mandates and resource constraints. The results suggest that high-performing cancer centers use more sophisticated analytical approaches, invest greater financial resources in performance analysis, and conduct more frequent performance reviews than do low-performing organizations. Our conclusions point to the need for a more robust approach to strategic assessment. In this article, we offer a number of recommendations for management to achieve strategic plans and goals on the basis of our research. To our knowledge, this study is one of the first to concentrate on the area of strategic control.
ASBESTOS IN DRINKING WATER PERFORMANCE EVALUATION STUDIES

EPA Science Inventory

Performance evaluations of laboratories testing for asbestos in drinking water according to USEPA Test Method 100.1 or 100.2 are complicated by the difficulty of providing stable sample dispersions of asbestos in water. Reference samples of a graduated series of chrysotile asbes...
ASBESTOS IN DRINKING WATER PERFORMANCE EVALUATION STUDIES

EPA Science Inventory

Performance evaluations of laboratories testing for asbestos in drinking water according to USEPA Test Method 100.1 or 100.2 are complicated by the difficulty of providing stable sample dispersions of asbestos in water. Reference samples of a graduated series of chrysotile asbest...
A performance evaluation model for the Stock Point Logistics Integrated Communication Environment (SPLICE)

NASA Astrophysics Data System (ADS)

Schmidt, J. B.

1985-09-01

This thesis investigates ways of improving the real-time performance of the Stockpoint Logistics Integrated Communication Environment (SPLICE). Performance evaluation through continuous monitoring activities and performance studies are the principle vehicles discussed. The method for implementing this performance evaluation process is the measurement of predefined performance indexes. Performance indexes for SPLICE are offered that would measure these areas. Existing SPLICE capability to carry out performance evaluation is explored, and recommendations are made to enhance that capability.
Normative Functional Performance Values in High School Athletes: The Functional Pre-Participation Evaluation Project.

PubMed

Onate, James A; Starkel, Cambrie; Clifton, Daniel R; Best, Thomas M; Borchers, James; Chaudhari, Ajit; Comstock, R Dawn; Cortes, Nelson; Grooms, Dustin R; Hertel, Jay; Hewett, Timothy E; Miller, Meghan Maume; Pan, Xueliang; Schussler, Eric; Van Lunen, Bonnie L

2018-01-01

The fourth edition of the Preparticipation Physical Evaluation recommends functional testing for the musculoskeletal portion of the examination; however, normative data across sex and grade level are limited. Establishing normative data can provide clinicians reference points with which to compare their patients, potentially aiding in the development of future injury-risk assessments and injury-mitigation programs. To establish normative functional performance and limb-symmetry data for high school-aged male and female athletes in the United States. Cross-sectional study. Athletic training facilities and gymnasiums across the United States. A total of 3951 male and female athletes who participated on high school-sponsored basketball, football, lacrosse, or soccer teams enrolled in this nationwide study. Functional performance testing consisted of 3 evaluations. Ankle-joint range of motion, balance, and lower extremity muscular power and landing control were assessed via the weight-bearing ankle-dorsiflexion-lunge, single-legged anterior-reach, and anterior single-legged hop-for-distance (SLHOP) tests, respectively. We used 2-way analyses of variance and χ 2 analyses to examine the effects of sex and grade level on ankle-dorsiflexion-lunge, single-legged anterior-reach, and SLHOP test performance and symmetry. The SLHOP performance differed between sexes (males = 187.8% ± 33.1% of limb length, females = 157.5% ± 27.8% of limb length; t = 30.3, P < .001). A Cohen d value of 0.97 indicated a large effect of sex on SLHOP performance. We observed differences for SLHOP and ankle-dorsiflexion-lunge performance among grade levels, but these differences were not clinically meaningful. We demonstrated differences in normative data for lower extremity functional performance during preparticipation physical evaluations across sex and grade levels. The results of this study will allow clinicians to compare sex- and grade-specific functional
Multivendor Spectral-Domain Optical Coherence Tomography Dataset, Observer Annotation Performance Evaluation, and Standardized Evaluation Framework for Intraretinal Cystoid Fluid Segmentation.

PubMed

Wu, Jing; Philip, Ana-Maria; Podkowinski, Dominika; Gerendas, Bianca S; Langs, Georg; Simader, Christian; Waldstein, Sebastian M; Schmidt-Erfurth, Ursula M

2016-01-01

Development of image analysis and machine learning methods for segmentation of clinically significant pathology in retinal spectral-domain optical coherence tomography (SD-OCT), used in disease detection and prediction, is limited due to the availability of expertly annotated reference data. Retinal segmentation methods use datasets that either are not publicly available, come from only one device, or use different evaluation methodologies making them difficult to compare. Thus we present and evaluate a multiple expert annotated reference dataset for the problem of intraretinal cystoid fluid (IRF) segmentation, a key indicator in exudative macular disease. In addition, a standardized framework for segmentation accuracy evaluation, applicable to other pathological structures, is presented. Integral to this work is the dataset used which must be fit for purpose for IRF segmentation algorithm training and testing. We describe here a multivendor dataset comprised of 30 scans. Each OCT scan for system training has been annotated by multiple graders using a proprietary system. Evaluation of the intergrader annotations shows a good correlation, thus making the reproducibly annotated scans suitable for the training and validation of image processing and machine learning based segmentation methods. The dataset will be made publicly available in the form of a segmentation Grand Challenge.
Multivendor Spectral-Domain Optical Coherence Tomography Dataset, Observer Annotation Performance Evaluation, and Standardized Evaluation Framework for Intraretinal Cystoid Fluid Segmentation

PubMed Central

Wu, Jing; Philip, Ana-Maria; Podkowinski, Dominika; Gerendas, Bianca S.; Langs, Georg; Simader, Christian

2016-01-01

Development of image analysis and machine learning methods for segmentation of clinically significant pathology in retinal spectral-domain optical coherence tomography (SD-OCT), used in disease detection and prediction, is limited due to the availability of expertly annotated reference data. Retinal segmentation methods use datasets that either are not publicly available, come from only one device, or use different evaluation methodologies making them difficult to compare. Thus we present and evaluate a multiple expert annotated reference dataset for the problem of intraretinal cystoid fluid (IRF) segmentation, a key indicator in exudative macular disease. In addition, a standardized framework for segmentation accuracy evaluation, applicable to other pathological structures, is presented. Integral to this work is the dataset used which must be fit for purpose for IRF segmentation algorithm training and testing. We describe here a multivendor dataset comprised of 30 scans. Each OCT scan for system training has been annotated by multiple graders using a proprietary system. Evaluation of the intergrader annotations shows a good correlation, thus making the reproducibly annotated scans suitable for the training and validation of image processing and machine learning based segmentation methods. The dataset will be made publicly available in the form of a segmentation Grand Challenge. PMID:27579177
DECIDE: a software for computer-assisted evaluation of diagnostic test performance.

PubMed

Chiecchio, A; Bo, A; Manzone, P; Giglioli, F

1993-05-01

The evaluation of the performance of clinical tests is a complex problem involving different steps and many statistical tools, not always structured in an organic and rational system. This paper presents a software which provides an organic system of statistical tools helping evaluation of clinical test performance. The program allows (a) the building and the organization of a working database, (b) the selection of the minimal set of tests with the maximum information content, (c) the search of the model best fitting the distribution of the test values, (d) the selection of optimal diagnostic cut-off value of the test for every positive/negative situation, (e) the evaluation of performance of the combinations of correlated and uncorrelated tests. The uncertainty associated with all the variables involved is evaluated. The program works in a MS-DOS environment with EGA or higher performing graphic card.
Performance evaluation soil samples utilizing encapsulation technology

DOEpatents

Dahlgran, J.R.

1999-08-17

Performance evaluation soil samples and method of their preparation uses encapsulation technology to encapsulate analytes which are introduced into a soil matrix for analysis and evaluation by analytical laboratories. Target analytes are mixed in an appropriate solvent at predetermined concentrations. The mixture is emulsified in a solution of polymeric film forming material. The emulsified solution is polymerized to form microcapsules. The microcapsules are recovered, quantitated and introduced into a soil matrix in a predetermined ratio to form soil samples with the desired analyte concentration. 1 fig.
Performance evaluation soil samples utilizing encapsulation technology

DOEpatents

Dahlgran, James R.

1999-01-01

Performance evaluation soil samples and method of their preparation using encapsulation technology to encapsulate analytes which are introduced into a soil matrix for analysis and evaluation by analytical laboratories. Target analytes are mixed in an appropriate solvent at predetermined concentrations. The mixture is emulsified in a solution of polymeric film forming material. The emulsified solution is polymerized to form microcapsules. The microcapsules are recovered, quantitated and introduced into a soil matrix in a predetermined ratio to form soil samples with the desired analyte concentration.
Modification of a compressor performance test bench for liquid slugging observation in refrigeration compressors

NASA Astrophysics Data System (ADS)

Ola, Max; Thomas, Christiane; Hesse, Ullrich

2017-08-01

Compressor performance test procedures are defined by the standard DIN EN 13771, wherein a variety of possible calorimeter and flow rate measurement methods are suggested. One option is the selection of two independent measurement methods. The accuracies of both selected measurement methods are essential. The second option requires only one method. However the measurement accuracy of the used device has to be verified and recalibrated on a regular basis. The compressor performance test facility at the Technische Universitaet Dresden uses a calibrated flow measurement sensor, a hot gas bypass and a mixed flow heat exchanger. The test bench can easily be modified for tests of various compressor types at different operating ranges and with various refrigerants. In addition, the modified test setup enables the investigation of long term liquid slug and its effects on the compressor. The modification comprises observational components, adjustments of the control system, safety measures and a customized oil recirculation system for compressors which do not contain an integrated oil sump or oil level regulation system. This paper describes the setup of the test bench, its functional principle, the key modifications, first test results and an evaluation of the energy balance.
Performance Evaluation Test of the Rapid Area Preparation Tool (RAPTOR)

DTIC Science & Technology

2008-12-01

the standard SETCO tires. A blast test of the new SETCO tire is scheduled for the spring of...Washington, DC 20301-2500 Performance Evaluation Test of the Rapid Area Preparation Tool (RAPTOR) December 2008 Prepared ...2008 to 00-00-2008 4. TITLE AND SUBTITLE Performance Evaluation Test of the Rapid Area Preparation Tool (RAPTOR) 5a. CONTRACT NUMBER 5b.
Real-Time Point Positioning Performance Evaluation of Single-Frequency Receivers Using NASA's Global Differential GPS System

NASA Technical Reports Server (NTRS)

Muellerschoen, Ronald J.; Iijima, Byron; Meyer, Robert; Bar-Sever, Yoaz; Accad, Elie

2004-01-01

This paper evaluates the performance of a single-frequency receiver using the 1-Hz differential corrections as provided by NASA's global differential GPS system. While the dual-frequency user has the ability to eliminate the ionosphere error by taking a linear combination of observables, the single-frequency user must remove or calibrate this error by other means. To remove the ionosphere error we take advantage of the fact that the magnitude of the group delay in range observable and the carrier phase advance have the same magnitude but are opposite in sign. A way to calibrate this error is to use a real-time database of grid points computed by JPL's RTI (Real-Time Ionosphere) software. In both cases we evaluate the positional accuracy of a kinematic carrier phase based point positioning method on a global extent.
Performance Evaluation and Benchmarking of Next Intelligent Systems

DOE Office of Scientific and Technical Information (OSTI.GOV)

del Pobil, Angel; Madhavan, Raj; Bonsignorio, Fabio

Performance Evaluation and Benchmarking of Intelligent Systems presents research dedicated to the subject of performance evaluation and benchmarking of intelligent systems by drawing from the experiences and insights of leading experts gained both through theoretical development and practical implementation of intelligent systems in a variety of diverse application domains. This contributed volume offers a detailed and coherent picture of state-of-the-art, recent developments, and further research areas in intelligent systems. The chapters cover a broad range of applications, such as assistive robotics, planetary surveying, urban search and rescue, and line tracking for automotive assembly. Subsystems or components described in this bookmore » include human-robot interaction, multi-robot coordination, communications, perception, and mapping. Chapters are also devoted to simulation support and open source software for cognitive platforms, providing examples of the type of enabling underlying technologies that can help intelligent systems to propagate and increase in capabilities. Performance Evaluation and Benchmarking of Intelligent Systems serves as a professional reference for researchers and practitioners in the field. This book is also applicable to advanced courses for graduate level students and robotics professionals in a wide range of engineering and related disciplines including computer science, automotive, healthcare, manufacturing, and service robotics.« less
Reliability and Validity of the Professional Counseling Performance Evaluation

ERIC Educational Resources Information Center

Shepherd, J. Brad; Britton, Paula J.; Kress, Victoria E.

2008-01-01

The definition and measurement of counsellor trainee competency is an issue that has received increased attention yet lacks quantitative study. This research evaluates item responses, scale reliability and intercorrelations, interrater agreement, and criterion-related validity of the Professional Performance Fitness Evaluation/Professional…
13 CFR 306.7 - Performance evaluations of University Centers.

Code of Federal Regulations, 2013 CFR

2013-01-01

... University Centers. 306.7 Section 306.7 Business Credit and Assistance ECONOMIC DEVELOPMENT ADMINISTRATION, DEPARTMENT OF COMMERCE TRAINING, RESEARCH AND TECHNICAL ASSISTANCE INVESTMENTS University Center Economic Development Program § 306.7 Performance evaluations of University Centers. (a) EDA will: (1) Evaluate each...
13 CFR 306.7 - Performance evaluations of University Centers.

Code of Federal Regulations, 2011 CFR

2011-01-01

... University Centers. 306.7 Section 306.7 Business Credit and Assistance ECONOMIC DEVELOPMENT ADMINISTRATION, DEPARTMENT OF COMMERCE TRAINING, RESEARCH AND TECHNICAL ASSISTANCE INVESTMENTS University Center Economic Development Program § 306.7 Performance evaluations of University Centers. (a) EDA will: (1) Evaluate each...
13 CFR 306.7 - Performance evaluations of University Centers.

Code of Federal Regulations, 2012 CFR

2012-01-01

... University Centers. 306.7 Section 306.7 Business Credit and Assistance ECONOMIC DEVELOPMENT ADMINISTRATION, DEPARTMENT OF COMMERCE TRAINING, RESEARCH AND TECHNICAL ASSISTANCE INVESTMENTS University Center Economic Development Program § 306.7 Performance evaluations of University Centers. (a) EDA will: (1) Evaluate each...
13 CFR 306.7 - Performance evaluations of University Centers.

Code of Federal Regulations, 2014 CFR

2014-01-01

... University Centers. 306.7 Section 306.7 Business Credit and Assistance ECONOMIC DEVELOPMENT ADMINISTRATION, DEPARTMENT OF COMMERCE TRAINING, RESEARCH AND TECHNICAL ASSISTANCE INVESTMENTS University Center Economic Development Program § 306.7 Performance evaluations of University Centers. (a) EDA will: (1) Evaluate each...

Evaluating Pekin duck walking ability using a treadmill performance test.

PubMed

Byrd, C J; Main, R P; Makagon, M M

2016-10-01

Gait scoring is the most popular method for assessing the walking ability of poultry species. Although inexpensive and easy to implement, gait scoring systems are often criticized for being subjective. Using a treadmill performance test we assessed whether observable differences in Pekin duck walking ability identified using a gait scoring system translated to differences in walking performance. One hundred and eighty ducks were selected using a three-category gait scoring system (GS0 = smooth gait, n = 55; GS0.5 = labored walk without easily identifiable impediment, n = 56; GS1 = obvious impediment, n = 59) and the amount of time each duck was able to sustain walking on a treadmill at a speed of 0.31 m/s was evaluated. The walking test ended when each duck met one of three elimination criteria: (1) The duck walked for a maximum time of ten minutes, (2) the duck required support from the observer's hand for more than three seconds in order to continue walking on the treadmill, or (3) the duck sat down on the treadmill and made no attempt to stand despite receiving assistance from the observer. Data were analyzed in SAS 9.4 using PROC GLM. Tukey's multiple comparison test was used to compare differences in time spent walking between gait scores. Significant differences were found between all gait scores (P < 0.05). Behavioral correlates of walking performance were investigated. Video recorded during the treadmill test was analyzed for counts of sitting, standing, and leaning behaviors. Data were analyzed in SAS 9.4 using a negative binomial model for count data. No differences were found between gait scores for counts of sitting, standing, and leaning behaviors (P > 0.05). In conclusion, the amount of time spent walking on the treadmill corresponded to gait score and was an effective measurement for quantifying Pekin duck walking ability. The test could be a valuable tool for assessing the development of walking issues or the effectiveness of
Peer Observation Reports and Student Evaluations of Teaching: Who Are the Experts?

ERIC Educational Resources Information Center

Ackerman, David; Gross, Barbara L.; Vigneron, Franck

2009-01-01

This study is an exploratory inquiry into the perceptions of university faculty regarding two forms of teaching evaluations, student evaluations of teaching (SET), and peer observation reports (POR). Which, if either, better assesses the quality of instruction? Who are the real experts in judging teaching quality: peers who are experts in their…
Climate Model Diagnostic and Evaluation: With a Focus on Satellite Observations

NASA Technical Reports Server (NTRS)

Waliser, Duane

2011-01-01

Each year, we host a summer school that brings together the next generation of climate scientists - about 30 graduate students and postdocs from around the world - to engage with premier climate scientists from the Jet Propulsion Laboratory and elsewhere. Our yearly summer school focuses on topics on the leading edge of climate science research. Our inaugural summer school, held in 2011, was on the topic of "Using Satellite Observations to Advance Climate Models," and enabled students to explore how satellite observations can be used to evaluate and improve climate models. Speakers included climate experts from both NASA and the National Oceanic and Atmospheric Administration (NOAA), who provided updates on climate model diagnostics and evaluation and remote sensing of the planet. Details of the next summer school will be posted here in due course.
Preliminary Evaluation of MapReduce for High-Performance Climate Data Analysis

NASA Technical Reports Server (NTRS)

Duffy, Daniel Q.; Schnase, John L.; Thompson, John H.; Freeman, Shawn M.; Clune, Thomas L.

2012-01-01

MapReduce is an approach to high-performance analytics that may be useful to data intensive problems in climate research. It offers an analysis paradigm that uses clusters of computers and combines distributed storage of large data sets with parallel computation. We are particularly interested in the potential of MapReduce to speed up basic operations common to a wide range of analyses. In order to evaluate this potential, we are prototyping a series of canonical MapReduce operations over a test suite of observational and climate simulation datasets. Our initial focus has been on averaging operations over arbitrary spatial and temporal extents within Modern Era Retrospective- Analysis for Research and Applications (MERRA) data. Preliminary results suggest this approach can improve efficiencies within data intensive analytic workflows.
Reliable and valid tools for measuring surgeons' teaching performance: residents' vs. self evaluation.

PubMed

Boerebach, Benjamin C M; Arah, Onyebuchi A; Busch, Olivier R C; Lombarts, Kiki M J M H

2012-01-01

In surgical education, there is a need for educational performance evaluation tools that yield reliable and valid data. This paper describes the development and validation of robust evaluation tools that provide surgeons with insight into their clinical teaching performance. We investigated (1) the reliability and validity of 2 tools for evaluating the teaching performance of attending surgeons in residency training programs, and (2) whether surgeons' self evaluation correlated with the residents' evaluation of those surgeons. We surveyed 343 surgeons and 320 residents as part of a multicenter prospective cohort study of faculty teaching performance in residency training programs. The reliability and validity of the SETQ (System for Evaluation Teaching Qualities) tools were studied using standard psychometric techniques. We then estimated the correlations between residents' and surgeons' evaluations. The response rate was 87% among surgeons and 84% among residents, yielding 2625 residents' evaluations and 302 self evaluations. The SETQ tools yielded reliable and valid data on 5 domains of surgical teaching performance, namely, learning climate, professional attitude towards residents, communication of goals, evaluation of residents, and feedback. The correlations between surgeons' self and residents' evaluations were low, with coefficients ranging from 0.03 for evaluation of residents to 0.18 for communication of goals. The SETQ tools for the evaluation of surgeons' teaching performance appear to yield reliable and valid data. The lack of strong correlations between surgeons' self and residents' evaluations suggest the need for using external feedback sources in informed self evaluation of surgeons. Copyright © 2012 Association of Program Directors in Surgery. Published by Elsevier Inc. All rights reserved.
A note on evaluating model tidal currents against observations

NASA Astrophysics Data System (ADS)

Cummins, Patrick F.; Thupaki, Pramod

2018-01-01

The root-mean-square magnitude of the vector difference between modeled and observed tidal ellipses is a comprehensive metric to evaluate the representation of tidal currents in ocean models. A practical expression for this difference is given in terms of the harmonic constants that are routinely used to specify current ellipses for a given tidal constituent. The resulting metric is sensitive to differences in all four current ellipse parameters, including phase.
Computational and human observer image quality evaluation of low dose, knowledge-based CT iterative reconstruction

DOE Office of Scientific and Technical Information (OSTI.GOV)

Eck, Brendan L.; Fahmi, Rachid; Miao, Jun

2015-10-15

Purpose: Aims in this study are to (1) develop a computational model observer which reliably tracks the detectability of human observers in low dose computed tomography (CT) images reconstructed with knowledge-based iterative reconstruction (IMR™, Philips Healthcare) and filtered back projection (FBP) across a range of independent variables, (2) use the model to evaluate detectability trends across reconstructions and make predictions of human observer detectability, and (3) perform human observer studies based on model predictions to demonstrate applications of the model in CT imaging. Methods: Detectability (d′) was evaluated in phantom studies across a range of conditions. Images were generated usingmore » a numerical CT simulator. Trained observers performed 4-alternative forced choice (4-AFC) experiments across dose (1.3, 2.7, 4.0 mGy), pin size (4, 6, 8 mm), contrast (0.3%, 0.5%, 1.0%), and reconstruction (FBP, IMR), at fixed display window. A five-channel Laguerre–Gauss channelized Hotelling observer (CHO) was developed with internal noise added to the decision variable and/or to channel outputs, creating six different internal noise models. Semianalytic internal noise computation was tested against Monte Carlo and used to accelerate internal noise parameter optimization. Model parameters were estimated from all experiments at once using maximum likelihood on the probability correct, P{sub C}. Akaike information criterion (AIC) was used to compare models of different orders. The best model was selected according to AIC and used to predict detectability in blended FBP-IMR images, analyze trends in IMR detectability improvements, and predict dose savings with IMR. Predicted dose savings were compared against 4-AFC study results using physical CT phantom images. Results: Detection in IMR was greater than FBP in all tested conditions. The CHO with internal noise proportional to channel output standard deviations, Model-k4, showed the best trade
Adapting the McMaster-Ottawa scale and developing behavioral anchors for assessing performance in an interprofessional Team Observed Structured Clinical Encounter.

PubMed

Lie, Désirée; May, Win; Richter-Lagha, Regina; Forest, Christopher; Banzali, Yvonne; Lohenry, Kevin

2015-01-01

Current scales for interprofessional team performance do not provide adequate behavioral anchors for performance evaluation. The Team Observed Structured Clinical Encounter (TOSCE) provides an opportunity to adapt and develop an existing scale for this purpose. We aimed to test the feasibility of using a retooled scale to rate performance in a standardized patient encounter and to assess faculty ability to accurately rate both individual students and teams. The 9-point McMaster-Ottawa Scale developed for a TOSCE was converted to a 3-point scale with behavioral anchors. Students from four professions were trained a priori to perform in teams of four at three different levels as individuals and teams. Blinded faculty raters were trained to use the scale to evaluate individual and team performances. G-theory was used to analyze ability of faculty to accurately rate individual students and teams using the retooled scale. Sixteen faculty, in groups of four, rated four student teams, each participating in the same TOSCE station. Faculty expressed comfort rating up to four students in a team within a 35-min timeframe. Accuracy of faculty raters varied (38-81% individuals, 50-100% teams), with errors in the direction of over-rating individual, but not team performance. There was no consistent pattern of error for raters. The TOSCE can be administered as an evaluation method for interprofessional teams. However, faculty demonstrate a 'leniency error' in rating students, even with prior training using behavioral anchors. To improve consistency, we recommend two trained faculty raters per station.
Development of an instrument for the evaluation of advanced life support performance.

PubMed

Peltonen, L-M; Peltonen, V; Salanterä, S; Tommila, M

2017-10-01

Assessing advanced life support (ALS) competence requires validated instruments. Existing instruments include aspects of technical skills (TS), non-technical skills (NTS) or both, but one instrument for detailed assessment that suits all resuscitation situations is lacking. This study aimed to develop an instrument for the evaluation of the overall ALS performance of the whole team. This instrument development study had four phases. First, we reviewed literature and resuscitation guidelines to explore items to include in the instrument. Thereafter, we interviewed resuscitation team professionals (n = 66), using the critical incident technique, to determine possible additional aspects associated with the performance of ALS. Second, we developed an instrument based on the findings. Third, we used an expert panel (n = 20) to assess the validity of the developed instrument. Finally, we revised the instrument based on the experts' comments and tested it with six experts who evaluated 22 video recorded resuscitations. The final version of the developed instrument had 69 items divided into adherence to guidelines (28 items), clinical decision-making (5 items), workload management (12 items), team behaviour (8 items), information management (6 items), patient integrity and consideration of laymen (4 items) and work routines (6 items). The Cronbach's α values were good, and strong correlations between the overall performance and the instrument were observed. The instrument may be useful for detailed assessment of the team's overall performance, but the numerous items make the use demanding. The instrument is still under development, and more research is needed to determine its psychometric properties. © 2017 The Acta Anaesthesiologica Scandinavica Foundation. Published by John Wiley & Sons Ltd.
The importance of metrics for evaluating scientific performance

NASA Astrophysics Data System (ADS)

Miyakawa, Tsuyoshi

Evaluation of scientific performance is a major factor that determines the behavior of both individual researchers and the academic institutes to which they belong. Because the number of researchers heavily outweighs the number of available research posts, and the competitive funding accounts for an ever-increasing proportion of research budget, some objective indicators of research performance have gained recognition for increasing transparency and openness. It is common practice to use metrics and indices to evaluate a researcher's performance or the quality of their grant applications. Such measures include the number of publications, the number of times these papers are cited and, more recently, the h-index, which measures the number of highly-cited papers the researcher has written. However, academic institutions and funding agencies in Japan have been rather slow to adopt such metrics. In this article, I will outline some of the currently available metrics, and discuss why we need to use such objective indicators of research performance more often in Japan. I will also discuss how to promote the use of metrics and what we should keep in mind when using them, as well as their potential impact on the research community in Japan.
Blind Source Parameters for Performance Evaluation of Despeckling Filters.

PubMed

Biradar, Nagashettappa; Dewal, M L; Rohit, ManojKumar; Gowre, Sanjaykumar; Gundge, Yogesh

2016-01-01

The speckle noise is inherent to transthoracic echocardiographic images. A standard noise-free reference echocardiographic image does not exist. The evaluation of filters based on the traditional parameters such as peak signal-to-noise ratio, mean square error, and structural similarity index may not reflect the true filter performance on echocardiographic images. Therefore, the performance of despeckling can be evaluated using blind assessment metrics like the speckle suppression index, speckle suppression and mean preservation index (SMPI), and beta metric. The need for noise-free reference image is overcome using these three parameters. This paper presents a comprehensive analysis and evaluation of eleven types of despeckling filters for echocardiographic images in terms of blind and traditional performance parameters along with clinical validation. The noise is effectively suppressed using the logarithmic neighborhood shrinkage (NeighShrink) embedded with Stein's unbiased risk estimation (SURE). The SMPI is three times more effective compared to the wavelet based generalized likelihood estimation approach. The quantitative evaluation and clinical validation reveal that the filters such as the nonlocal mean, posterior sampling based Bayesian estimation, hybrid median, and probabilistic patch based filters are acceptable whereas median, anisotropic diffusion, fuzzy, and Ripplet nonlinear approximation filters have limited applications for echocardiographic images.
Blind Source Parameters for Performance Evaluation of Despeckling Filters

PubMed Central

Biradar, Nagashettappa; Dewal, M. L.; Rohit, ManojKumar; Gowre, Sanjaykumar; Gundge, Yogesh

2016-01-01

The speckle noise is inherent to transthoracic echocardiographic images. A standard noise-free reference echocardiographic image does not exist. The evaluation of filters based on the traditional parameters such as peak signal-to-noise ratio, mean square error, and structural similarity index may not reflect the true filter performance on echocardiographic images. Therefore, the performance of despeckling can be evaluated using blind assessment metrics like the speckle suppression index, speckle suppression and mean preservation index (SMPI), and beta metric. The need for noise-free reference image is overcome using these three parameters. This paper presents a comprehensive analysis and evaluation of eleven types of despeckling filters for echocardiographic images in terms of blind and traditional performance parameters along with clinical validation. The noise is effectively suppressed using the logarithmic neighborhood shrinkage (NeighShrink) embedded with Stein's unbiased risk estimation (SURE). The SMPI is three times more effective compared to the wavelet based generalized likelihood estimation approach. The quantitative evaluation and clinical validation reveal that the filters such as the nonlocal mean, posterior sampling based Bayesian estimation, hybrid median, and probabilistic patch based filters are acceptable whereas median, anisotropic diffusion, fuzzy, and Ripplet nonlinear approximation filters have limited applications for echocardiographic images. PMID:27298618
Tribo-performance evaluation of ecofriendly brake friction composite materials

NASA Astrophysics Data System (ADS)

Kumar, Naresh; Singh, Tej; Grewal, G. S.

2018-05-01

This paper presents the potential of natural fibre in brake friction materials. Natural fibre filled ecofriendly brake friction materials were developed without Kevlar fibre evaluated for tribo-performance on a chase friction testing machine following SAE J 661a standard. Experimental results indicated that natural fibre enhances the fade performance, but depresses the friction and wear performance, whereas Kevlar fibre improves the friction, wear and recovery performance but depresses the fade performance. Also the results revealed that with the increase in natural fibre content, the friction and fade performances enhanced.
Evaluating large-scale propensity score performance through real-world and synthetic data experiments.

PubMed

Tian, Yuxi; Schuemie, Martijn J; Suchard, Marc A

2018-06-22

Propensity score adjustment is a popular approach for confounding control in observational studies. Reliable frameworks are needed to determine relative propensity score performance in large-scale studies, and to establish optimal propensity score model selection methods. We detail a propensity score evaluation framework that includes synthetic and real-world data experiments. Our synthetic experimental design extends the 'plasmode' framework and simulates survival data under known effect sizes, and our real-world experiments use a set of negative control outcomes with presumed null effect sizes. In reproductions of two published cohort studies, we compare two propensity score estimation methods that contrast in their model selection approach: L1-regularized regression that conducts a penalized likelihood regression, and the 'high-dimensional propensity score' (hdPS) that employs a univariate covariate screen. We evaluate methods on a range of outcome-dependent and outcome-independent metrics. L1-regularization propensity score methods achieve superior model fit, covariate balance and negative control bias reduction compared with the hdPS. Simulation results are mixed and fluctuate with simulation parameters, revealing a limitation of simulation under the proportional hazards framework. Including regularization with the hdPS reduces commonly reported non-convergence issues but has little effect on propensity score performance. L1-regularization incorporates all covariates simultaneously into the propensity score model and offers propensity score performance superior to the hdPS marginal screen.
Mindfulness, burnout, and effects on performance evaluations in internal medicine residents.

PubMed

Braun, Sarah E; Auerbach, Stephen M; Rybarczyk, Bruce; Lee, Bennett; Call, Stephanie

2017-01-01

Burnout has been documented at high levels in medical residents with negative effects on performance. Some dispositional qualities, like mindfulness, may protect against burnout. The purpose of the present study was to assess burnout prevalence among internal medicine residents at a single institution, examine the relationship between mindfulness and burnout, and provide preliminary findings on the relation between burnout and performance evaluations in internal medicine residents. Residents (n = 38) completed validated measures of burnout at three time points separated by 2 months and a validated measure of dispositional mindfulness at baseline. Program director end-of-year performance evaluations were also obtained on 22 milestones used to evaluate internal medicine resident performance; notably, these milestones have not yet been validated for research purposes; therefore, the investigation here is exploratory. Overall, 71.1% (n = 27) of the residents met criteria for burnout during the study. Lower scores on the "acting with awareness" facet of dispositional mindfulness significantly predicted meeting burnout criteria χ 2 (5) = 11.88, p = 0.04. Lastly, meeting burnout criteria significantly predicted performance on three of the performance milestones, with positive effects on milestones from the "system-based practices" and "professionalism" domains and negative effects on a milestone from the "patient care" domain. Burnout rates were high in this sample of internal medicine residents and rates were consistent with other reports of burnout during medical residency. Dispositional mindfulness was supported as a protective factor against burnout. Importantly, results from the exploratory investigation of the relationship between burnout and resident evaluations suggested that burnout may improve performance on some domains of resident evaluations while compromising performance on other domains. Implications and directions for future research are discussed.
Measures of Searcher Performance: A Psychometric Evaluation.

ERIC Educational Resources Information Center

Wildemuth, Barbara M.; And Others

1993-01-01

Describes a study of medical students that was conducted to evaluate measures of performance on factual searches of INQUIRER, a full-text database in microbiology. Measures relating to recall, precision, search term overlap, and efficiency are discussed; reliability and construct validity are considered; and implications for future research are…
Using Conjoint Analysis to Evaluate and Reward Teaching Performance

ERIC Educational Resources Information Center

Bacon, Donald R.; Zheng, Yilong; Stewart, Kim A.; Johnson, Carol J.; Paul, Pallab

2016-01-01

Although widely used, student evaluations of teaching do not address several factors that should be considered in evaluating teaching performance such as new course preparations, teaching larger classes, and inconvenient class times. Consequently, the incentive exists to avoid certain teaching assignments to achieve high SET scores while…
Methodologic Guide for Evaluating Clinical Performance and Effect of Artificial Intelligence Technology for Medical Diagnosis and Prediction.

PubMed

Park, Seong Ho; Han, Kyunghwa

2018-03-01

The use of artificial intelligence in medicine is currently an issue of great interest, especially with regard to the diagnostic or predictive analysis of medical images. Adoption of an artificial intelligence tool in clinical practice requires careful confirmation of its clinical utility. Herein, the authors explain key methodology points involved in a clinical evaluation of artificial intelligence technology for use in medicine, especially high-dimensional or overparameterized diagnostic or predictive models in which artificial deep neural networks are used, mainly from the standpoints of clinical epidemiology and biostatistics. First, statistical methods for assessing the discrimination and calibration performances of a diagnostic or predictive model are summarized. Next, the effects of disease manifestation spectrum and disease prevalence on the performance results are explained, followed by a discussion of the difference between evaluating the performance with use of internal and external datasets, the importance of using an adequate external dataset obtained from a well-defined clinical cohort to avoid overestimating the clinical performance as a result of overfitting in high-dimensional or overparameterized classification model and spectrum bias, and the essentials for achieving a more robust clinical evaluation. Finally, the authors review the role of clinical trials and observational outcome studies for ultimate clinical verification of diagnostic or predictive artificial intelligence tools through patient outcomes, beyond performance metrics, and how to design such studies. © RSNA, 2018.
10 CFR 1045.9 - RD classification performance evaluation.

Code of Federal Regulations, 2010 CFR

2010-01-01

... Program Management of the Restricted Data and Formerly Restricted Data Classification System § 1045.9 RD classification performance evaluation. (a) Heads of agencies shall ensure that RD management officials and those...
Performance evaluation of multi-material electronic cleansing for ultra-low-dose dual-energy CT colonography

NASA Astrophysics Data System (ADS)

Tachibana, Rie; Kohlhase, Naja; Näppi, Janne J.; Hironaka, Toru; Ota, Junko; Ishida, Takayuki; Regge, Daniele; Yoshida, Hiroyuki

2016-03-01

Accurate electronic cleansing (EC) for CT colonography (CTC) enables the visualization of the entire colonic surface without residual materials. In this study, we evaluated the accuracy of a novel multi-material electronic cleansing (MUMA-EC) scheme for non-cathartic ultra-low-dose dual-energy CTC (DE-CTC). The MUMA-EC performs a wateriodine material decomposition of the DE-CTC images and calculates virtual monochromatic images at multiple energies, after which a random forest classifier is used to label the images into the regions of lumen air, soft tissue, fecal tagging, and two types of partial-volume boundaries based on image-based features. After the labeling, materials other than soft tissue are subtracted from the CTC images. For pilot evaluation, 384 volumes of interest (VOIs), which represented sources of subtraction artifacts observed in current EC schemes, were sampled from 32 ultra-low-dose DE-CTC scans. The voxels in the VOIs were labeled manually to serve as a reference standard. The metric for EC accuracy was the mean overlap ratio between the labels of the reference standard and the labels generated by the MUMA-EC, a dualenergy EC (DE-EC), and a single-energy EC (SE-EC) scheme. Statistically significant differences were observed between the performance of the MUMA/DE-EC and the SE-EC methods (p<0.001). Visual assessment confirmed that the MUMA-EC generated less subtraction artifacts than did DE-EC and SE-EC. Our MUMA-EC scheme yielded superior performance over conventional SE-EC scheme in identifying and minimizing subtraction artifacts on noncathartic ultra-low-dose DE-CTC images.

The Role of Performance Quality in Adolescents' Self-Evaluation and Rumination after a Speech: Is it Contingent on Social Anxiety Level?

PubMed

Blöte, Anke W; Miers, Anne C; Van den Bos, Esther; Westenberg, P Michiel

2018-05-17

Cognitive behavioural therapy (CBT) has relatively poor outcomes for youth with social anxiety, possibly because broad-based CBT is not tailored to their specific needs. Treatment of social anxiety in youth may need to pay more attention to negative social cognitions that are considered a key factor in social anxiety development and maintenance. The aim of the present study was to learn more about the role of performance quality in adolescents' cognitions about their social performance and, in particular, the moderating role social anxiety plays in the relationship between performance quality and self-cognitions. A community sample of 229 participants, aged 11 to 18 years, gave a speech and filled in questionnaires addressing social anxiety, depression, expected and self-evaluated performance, and post-event rumination. Independent observers rated the quality of the speech. The data were analysed using moderated mediation analysis. Performance quality mediated the link between expected and self-evaluated performance in adolescents with low and medium levels of social anxiety. For adolescents with high levels of social anxiety, only a direct link between expected and self-evaluated performance was found. Their self-evaluation was not related to the quality of their performance. Performance quality also mediated the link between expected performance and rumination, but social anxiety did not moderate this mediation effect. Results suggest that a good performance does not help socially anxious adolescents to replace their negative self-evaluations with more realistic ones. Specific cognitive intervention strategies should be tailored to the needs of socially anxious adolescents who perform well.
Model Performance Evaluation and Scenario Analysis (MPESA) Tutorial

EPA Pesticide Factsheets

The model performance evaluation consists of metrics and model diagnostics. These metrics provides modelers with statistical goodness-of-fit measures that capture magnitude only, sequence only, and combined magnitude and sequence errors.
The effect of observing novice and expert performance on acquisition of surgical skills on a robotic platform

PubMed Central

Harris, David J.; Vine, Samuel J.; Wilson, Mark R.; McGrath, John S.; LeBel, Marie-Eve

2017-01-01

Background Observational learning plays an important role in surgical skills training, following the traditional model of learning from expertise. Recent findings have, however, highlighted the benefit of observing not only expert performance but also error-strewn performance. The aim of this study was to determine which model (novice vs. expert) would lead to the greatest benefits when learning robotically assisted surgical skills. Methods 120 medical students with no prior experience of robotically-assisted surgery completed a ring-carrying training task on three occasions; baseline, post-intervention and at one-week follow-up. The observation intervention consisted of a video model performing the ring-carrying task, with participants randomly assigned to view an expert model, a novice model, a mixed expert/novice model or no observation (control group). Participants were assessed for task performance and surgical instrument control. Results There were significant group differences post-intervention, with expert and novice observation groups outperforming the control group, but there were no clear group differences at a retention test one week later. There was no difference in performance between the expert-observing and error-observing groups. Conclusions Similar benefits were found when observing the traditional expert model or the error-strewn model, suggesting that viewing poor performance may be as beneficial as viewing expertise in the early acquisition of robotic surgical skills. Further work is required to understand, then inform, the optimal curriculum design when utilising observational learning in surgical training. PMID:29141046
Tuberculosis control program in the municipal context: performance evaluation.

PubMed

Arakawa, Tiemi; Magnabosco, Gabriela Tavares; Andrade, Rubia Laine de Paula; Brunello, Maria Eugenia Firmino; Monroe, Aline Aparecida; Ruffino-Netto, Antonio; Scatena, Lucia Marina; Villa, Tereza Cristina Scatena

2017-03-30

The objective of this study is to evaluate the performance of the Tuberculosis Control Program in municipalities of the State of São Paulo. This is a program evaluation research, with ecological design, which uses three non-hierarchical groups of the municipalities of the State of São Paulo according to their performance in relation to operational indicators. We have selected 195 municipalities with at least five new cases of tuberculosis notified in the Notification System of the State of São Paulo and with 20,000 inhabitants or more in 2010. The multiple correspondence analysis was used to identify the association between the groups of different performances, the epidemiological and demographic characteristics, and the characteristics of the health systems of the municipalities. The group with the worst performance showed the highest rates of abandonment (average [avg] = 10.4, standard deviation [sd] = 9.4) and the lowest rates of supervision of Directly Observed Treatment (avg = 6.1, sd = 12.9), and it was associated with low incidence of tuberculosis, high tuberculosis and HIV, small population, high coverage of the Family Health Strategy/Program of Community Health Agents, and being located on the countryside. The group with the best performance presented the highest cure rate (avg = 83.7, sd = 10.5) and the highest rate of cases in Directly Observed Treatment (avg = 83.0, sd = 12.7); the group of regular performance showed regular results for outcome (avg cure = 79.8, sd = 13.2; abandonment avg = 9.5, sd = 8.3) and supervision of the Directly Observed Treatment (avg = 42.8, sd = 18.8). Large population, low coverage of the Family Health Strategy/Program of Community Health Agents, high incidence of tuberculosis and AIDS, and being located on the coast and in metropolitan areas were associated with these groups. The findings highlight the importance of the Directly Observed Treatment in relation to the outcome for treatment and raise reflections on the
Balanced scorecard-based performance evaluation of Chinese county hospitals in underdeveloped areas.

PubMed

Gao, Hongda; Chen, He; Feng, Jun; Qin, Xianjing; Wang, Xuan; Liang, Shenglin; Zhao, Jinmin; Feng, Qiming

2018-05-01

Objective Since the Guangxi government implemented public county hospital reform in 2009, there have been no studies of county hospitals in this underdeveloped area of China. This study aimed to establish an evaluation indicator system for Guangxi county hospitals and to generate recommendations for hospital development and policymaking. Methods A performance evaluation indicator system was developed based on balanced scorecard theory. Opinions were elicited from 25 experts from administrative units, universities and hospitals and the Delphi method was used to modify the performance indicators. The indicator system and the Topsis method were used to evaluate the performance of five county hospitals randomly selected from the same batch of 2015 Guangxi reform pilots. Results There were 4 first-level indicators, 9 second-level indicators and 36 third-level indicators in the final performance evaluation indicator system that showed good consistency, validity and reliability. The performance rank of the hospitals was B > E > A > C > D. Conclusions The performance evaluation indicator system established using the balanced scorecard is practical and scientific. Analysis of the results based on this indicator system identified several factors affecting hospital performance, such as resource utilisation efficiency, medical service price, personnel structure and doctor-patient relationships.
A Performance-Based Method of Student Evaluation

ERIC Educational Resources Information Center

Nelson, G. E.; And Others

1976-01-01

The Problem Oriented Medical Record (which allows practical definition of the behavioral terms thoroughness, reliability, sound analytical sense, and efficiency as they apply to the identification and management of patient problems) provides a vehicle to use in performance based type evaluation. A test-run use of the record is reported. (JT)
Diverging diamond interchange performance evaluation (I-44 and Route 13)

DOT National Transportation Integrated Search

2011-02-01

Performance evaluation was conducted on the first diverging diamond interchange (DDI) or double : crossover interchange (DCD) constructed in the United States. This evaluation assessed traffic operations, safety and : public perceptions t...
Chinese Middle School Teachers' Preferences Regarding Performance Evaluation Measures

ERIC Educational Resources Information Center

Liu, Shujie; Xu, Xianxuan; Stronge, James H.

2016-01-01

Teacher performance evaluation currently is receiving unprecedented attention from policy makers, scholars, and practitioners worldwide. This study is one of the few studies of teacher perceptions regarding teacher performance measures that focus on China. We employed a quantitative dominant mixed research design to investigate Chinese teachers'…
Performance evaluation of Louisiana superpave mixtures : tech summary.

DOT National Transportation Integrated Search

2008-12-01

The primary objective of this research was to evaluate the fundamental engineering : properties and mixture performance of Superpave hot mix asphalt (HMA) mixtures : in Louisiana through laboratory mechanistic tests, aggregate gradation analysis, and...
Evaluating climate model performance in the tropics with retrievals of water isotopic composition from Aura TES

NASA Astrophysics Data System (ADS)

Field, Robert; Kim, Daehyun; Kelley, Max; LeGrande, Allegra; Worden, John; Schmidt, Gavin

2014-05-01

Observational and theoretical arguments suggest that satellite retrievals of the stable isotope composition of water vapor could be useful for climate model evaluation. The isotopic composition of water vapor is controlled by the same processes that control water vapor amount, but the observed distribution of isotopic composition is distinct from amount itself . This is due to the fractionation that occurs between the abundant H216O isotopes (isotopologues) and the rare and heavy H218O and HDO isotopes during evaporation and condensation. The fractionation physics are much simpler than the underlying moist physics; discrepancies between observed and modeled isotopic fields are more likely due to problems in the latter. Isotopic measurements therefore have the potential for identifying problems that might not be apparent from more conventional measurements. Isotopic tracers have existed in climate models since the 1980s but it is only since the mid 2000s that there have been enough data for meaningful model evaluation in this sense, in the troposphere at least. We have evaluated the NASA GISS ModelE2 general circulation model over the tropics against water isotope (HDO/H2O) retrievals from the Aura Tropospheric Emission Spectrometer (TES), alongside more conventional measurements. A small ensemble of experiments was performed with physics perturbations to the cumulus and planetary boundary layer schemes, done in the context of the normal model development process. We examined the degree to which model-data agreement could be used to constrain a select group of internal processes in the model, namely condensate evaporation, entrainment strength, and moist convective air mass flux. All are difficult to parameterize, but exert strong influence over model performance. We found that the water isotope composition was significantly more sensitive to physics changes than precipitation, temperature or relative humidity through the depth of the tropical troposphere. Among the
23 CFR 636.205 - Can past performance be used as an evaluation criteria?

Code of Federal Regulations, 2010 CFR

2010-04-01

... 23 Highways 1 2010-04-01 2010-04-01 false Can past performance be used as an evaluation criteria... past performance be used as an evaluation criteria? (a) Yes, past performance information is one... used as an evaluation criteria in either phase-one or phase-two solicitations. If you elect to use past...
Beginning and Intermediate Piano Students' Experiences Participating in Evaluative Performances

ERIC Educational Resources Information Center

Mitchell, Nancy

2017-01-01

Evaluative performances, such as conservatory examinations and competitions, frequently play a significant role in piano instruction in many parts of the world. Many students participate in these performances as a result of the perception that a program of instruction that is focused on standardized curriculum and evaluation practices will be of…
20 CFR 411.330 - How will SSA evaluate an EN's performance?

Code of Federal Regulations, 2014 CFR

2014-04-01

... 20 Employees' Benefits 2 2014-04-01 2014-04-01 false How will SSA evaluate an EN's performance? 411.330 Section 411.330 Employees' Benefits SOCIAL SECURITY ADMINISTRATION THE TICKET TO WORK AND SELF-SUFFICIENCY PROGRAM Employment Networks § 411.330 How will SSA evaluate an EN's performance? (a) We will...
20 CFR 411.330 - How will SSA evaluate an EN's performance?

Code of Federal Regulations, 2013 CFR

2013-04-01

... 20 Employees' Benefits 2 2013-04-01 2013-04-01 false How will SSA evaluate an EN's performance? 411.330 Section 411.330 Employees' Benefits SOCIAL SECURITY ADMINISTRATION THE TICKET TO WORK AND SELF-SUFFICIENCY PROGRAM Employment Networks § 411.330 How will SSA evaluate an EN's performance? (a) We will...
20 CFR 411.330 - How will SSA evaluate an EN's performance?

Code of Federal Regulations, 2012 CFR

2012-04-01

... 20 Employees' Benefits 2 2012-04-01 2012-04-01 false How will SSA evaluate an EN's performance? 411.330 Section 411.330 Employees' Benefits SOCIAL SECURITY ADMINISTRATION THE TICKET TO WORK AND SELF-SUFFICIENCY PROGRAM Employment Networks § 411.330 How will SSA evaluate an EN's performance? (a) We will...
20 CFR 411.330 - How will SSA evaluate an EN's performance?

Code of Federal Regulations, 2010 CFR

2010-04-01

... 20 Employees' Benefits 2 2010-04-01 2010-04-01 false How will SSA evaluate an EN's performance? 411.330 Section 411.330 Employees' Benefits SOCIAL SECURITY ADMINISTRATION THE TICKET TO WORK AND SELF-SUFFICIENCY PROGRAM Employment Networks § 411.330 How will SSA evaluate an EN's performance? (a) We will...
20 CFR 411.330 - How will SSA evaluate an EN's performance?

Code of Federal Regulations, 2011 CFR

2011-04-01

... 20 Employees' Benefits 2 2011-04-01 2011-04-01 false How will SSA evaluate an EN's performance? 411.330 Section 411.330 Employees' Benefits SOCIAL SECURITY ADMINISTRATION THE TICKET TO WORK AND SELF-SUFFICIENCY PROGRAM Employment Networks § 411.330 How will SSA evaluate an EN's performance? (a) We will...
Association between liver transplant center performance evaluations and transplant volume.

PubMed

Buccini, L D; Segev, D L; Fung, J; Miller, C; Kelly, D; Quintini, C; Schold, J D

2014-09-01

There has been increased oversight of transplant centers and stagnation in liver transplantation nationally in recent years. We hypothesized that centers that received low performance (LP) evaluations were more likely to alter protocols, resulting in reduced rates of transplants and patients placed on the waiting list. We evaluated the association of LP evaluations and transplant activity among liver transplant centers in the United States using national Scientific Registry of Transplant Recipients data (January 2007 to July 2012). We compared the average change in recipient and candidate volume and donor and patient characteristics based on whether the centers received LP evaluations. Of 92 eligible centers, 27 (29%) received at least one LP evaluation. Centers without an LP evaluation (n = 65) had an average increase of 9.3 transplants and 14.9 candidates while LP centers had an average decrease of 39.9 transplants (p < 0.01) and 67.3 candidates (p < 0.01). LP centers reduced the use of older donors, donations with longer cold ischemia, and donations after cardiac death (p-values < 0.01). There was no association between the change in transplant volume and measured performance (R(2) = 0.002, p = 0.91). Findings indicate a strong association between performance evaluations and changes in candidate listings and transplants among liver transplant centers, with no measurable improvement in outcomes associated with reduction in transplant volume. © Copyright 2014 The American Society of Transplantation and the American Society of Transplant Surgeons.
New Developments in Observer Performance Methodology in Medical Imaging

PubMed Central

Chakraborty, Dev P.

2011-01-01

A common task in medical imaging is assessing whether a new imaging system, or a variant of an existing one, is an improvement over an existing imaging technology. Imaging systems are generally quite complex, consisting of several components – e.g., image acquisition hardware, image processing and display hardware and software, and image interpretation by radiologists– each of which can affect performance. While it may appear odd to include the radiologist as a “component” of the imaging chain, since the radiologist’s decision determines subsequent patient care, the effect of the human interpretation has to be included. Physical measurements like modulation transfer function, signal to noise ratio, etc., are useful for characterizing the non-human parts of the imaging chain under idealized and often unrealistic conditions, such as uniform background phantoms, target objects with sharp edges, etc. Measuring the effect on performance of the entire imaging chain, including the radiologist, and using real clinical images, requires different methods that fall under the rubric of observer performance methods or “ROC analysis”. The purpose of this paper is to review recent developments in this field, particularly with respect to the free-response method. PMID:21978444
Implementation and performance evaluation of acoustic denoising algorithms for UAV

NASA Astrophysics Data System (ADS)

Chowdhury, Ahmed Sony Kamal

Unmanned Aerial Vehicles (UAVs) have become popular alternative for wildlife monitoring and border surveillance applications. Elimination of the UAV's background noise and classifying the target audio signal effectively are still a major challenge. The main goal of this thesis is to remove UAV's background noise by means of acoustic denoising techniques. Existing denoising algorithms, such as Adaptive Least Mean Square (LMS), Wavelet Denoising, Time-Frequency Block Thresholding, and Wiener Filter, were implemented and their performance evaluated. The denoising algorithms were evaluated for average Signal to Noise Ratio (SNR), Segmental SNR (SSNR), Log Likelihood Ratio (LLR), and Log Spectral Distance (LSD) metrics. To evaluate the effectiveness of the denoising algorithms on classification of target audio, we implemented Support Vector Machine (SVM) and Naive Bayes classification algorithms. Simulation results demonstrate that LMS and Discrete Wavelet Transform (DWT) denoising algorithm offered superior performance than other algorithms. Finally, we implemented the LMS and DWT algorithms on a DSP board for hardware evaluation. Experimental results showed that LMS algorithm's performance is robust compared to DWT for various noise types to classify target audio signals.

Evaluating Organizational Performance: Rational, Natural, and Open System Models

ERIC Educational Resources Information Center

Martz, Wes

2013-01-01

As the definition of organization has evolved, so have the approaches used to evaluate organizational performance. During the past 60 years, organizational theorists and management scholars have developed a comprehensive line of thinking with respect to organizational assessment that serves to inform and be informed by the evaluation discipline.…
Mindfulness, burnout, and effects on performance evaluations in internal medicine residents

PubMed Central

Braun, Sarah E; Auerbach, Stephen M; Rybarczyk, Bruce; Lee, Bennett; Call, Stephanie

2017-01-01

Purpose Burnout has been documented at high levels in medical residents with negative effects on performance. Some dispositional qualities, like mindfulness, may protect against burnout. The purpose of the present study was to assess burnout prevalence among internal medicine residents at a single institution, examine the relationship between mindfulness and burnout, and provide preliminary findings on the relation between burnout and performance evaluations in internal medicine residents. Methods Residents (n = 38) completed validated measures of burnout at three time points separated by 2 months and a validated measure of dispositional mindfulness at baseline. Program director end-of-year performance evaluations were also obtained on 22 milestones used to evaluate internal medicine resident performance; notably, these milestones have not yet been validated for research purposes; therefore, the investigation here is exploratory. Results Overall, 71.1% (n = 27) of the residents met criteria for burnout during the study. Lower scores on the “acting with awareness” facet of dispositional mindfulness significantly predicted meeting burnout criteria χ2(5) = 11.88, p = 0.04. Lastly, meeting burnout criteria significantly predicted performance on three of the performance milestones, with positive effects on milestones from the “system-based practices” and “professionalism” domains and negative effects on a milestone from the “patient care” domain. Conclusion Burnout rates were high in this sample of internal medicine residents and rates were consistent with other reports of burnout during medical residency. Dispositional mindfulness was supported as a protective factor against burnout. Importantly, results from the exploratory investigation of the relationship between burnout and resident evaluations suggested that burnout may improve performance on some domains of resident evaluations while compromising performance on other domains. Implications and
Influence of learning styles on the practical performance after the four-step basic life support training approach - An observational cohort study.

PubMed

Schröder, Hanna; Henke, Alexandra; Stieger, Lina; Beckers, Stefan; Biermann, Henning; Rossaint, Rolf; Sopka, Saša

2017-01-01

Learning and training basic life support (BLS)-especially external chest compressions (ECC) within the BLS-algorithm-are essential resuscitation training for laypersons as well as for health care professionals. The objective of this study was to evaluate the influence of learning styles on the performance of BLS and to identify whether all types of learners are sufficiently addressed by Peyton's four-step approach for BLS training. A study group of first-year medical students (n = 334) without previous medical knowledge was categorized according to learning styles using the German Lernstilinventar questionnaire based on Kolb's Learning Styles Inventory. Students' BLS performances were assessed before and after a four-step BLS training approach lasting 4 hours. Standardized BLS training was provided by an educational staff consisting of European Resuscitation Council-certified advanced life support providers and instructors. Pre- and post-intervention BLS performance was evaluated using a single-rescuer-scenario and standardized questionnaires (6-point-Likert-scales: 1 = completely agree, 6 = completely disagree). The recorded points of measurement were the time to start, depth, and frequency of ECC. The study population was categorized according to learning styles: diverging (5%, n = 16), assimilating (36%, n = 121), converging (41%, n = 138), and accommodating (18%, n = 59). Independent of learning styles, both male and female participants showed significant improvement in cardiopulmonary resuscitation (CPR) performance. Based on the Kolb learning styles, no significant differences between the four groups were observed in compression depth, frequency, time to start CPR, or the checklist-based assessment within the baseline assessment. A significant sex effect on the difference between pre- and post-interventional assessment points was observed for mean compression depth and mean compression frequency. The findings of this work show that the four-step-approach for
State observers and Kalman filtering for high performance vibration isolation systems.

PubMed

Beker, M G; Bertolini, A; van den Brand, J F J; Bulten, H J; Hennes, E; Rabeling, D S

2014-03-01

There is a strong scientific case for the study of gravitational waves at or below the lower end of current detection bands. To take advantage of this scientific benefit, future generations of ground based gravitational wave detectors will need to expand the limit of their detection bands towards lower frequencies. Seismic motion presents a major challenge at these frequencies and vibration isolation systems will play a crucial role in achieving the desired low-frequency sensitivity. A compact vibration isolation system designed to isolate in-vacuum optical benches for Advanced Virgo will be introduced and measurements on this system are used to present its performance. All high performance isolation systems employ an active feedback control system to reduce the residual motion of their suspended payloads. The development of novel control schemes is needed to improve the performance beyond what is currently feasible. Here, we present a multi-channel feedback approach that is novel to the field. It utilizes a linear quadratic regulator in combination with a Kalman state observer and is shown to provide effective suppression of residual motion of the suspended payload. The application of state observer based feedback control for vibration isolation will be demonstrated with measurement results from the Advanced Virgo optical bench suspension system.
The Independence and Interdependence of Coacting Observers in Regard to Performance Efficiency, Workload, and Stress in a Vigilance Task.

PubMed

Funke, Gregory J; Warm, Joel S; Baldwin, Carryl L; Garcia, Andre; Funke, Matthew E; Dillard, Michael B; Finomore, Victor S; Matthews, Gerald; Greenlee, Eric T

2016-09-01

We investigated performance, workload, and stress in groups of paired observers who performed a vigilance task in a coactive (independent) manner. Previous studies have demonstrated that groups of coactive observers detect more signals in a vigilance task than observers working alone. Therefore, the use of such groups might be effective in enhancing signal detection in operational situations. However, concern over appearing less competent than one's cohort might induce elevated levels of workload and stress in coactive group members and thereby undermine group performance benefits. Accordingly, we performed the initial experiment comparing workload and stress in observers who performed a vigilance task coactively with those of observers who performed the vigilance task alone. Observers monitored a video display for collision flight paths in a simulated unmanned aerial vehicle control task. Self-reports of workload and stress were secured via the NASA-Task Load Index and the Dundee Stress State Questionnaire, respectively. Groups of coactive observers detected significantly more signals than did single observers. Coacting observers did not differ significantly from those operating by themselves in terms of workload but did in regard to stress; posttask distress was significantly lower for coacting than for single observers. Performing a visual vigilance task in a coactive manner with another observer does not elevate workload above that of observers working alone and serves to attenuate the stress associated with vigilance task performance. The use of coacting observers could be an effective vehicle for enhancing performance efficiency in operational vigilance. © 2016, Human Factors and Ergonomics Society.
Performance in physiology evaluation: possible improvement by active learning strategies.

PubMed

Montrezor, Luís H

2016-12-01

The evaluation process is complex and extremely important in the teaching/learning process. Evaluations are constantly employed in the classroom to assist students in the learning process and to help teachers improve the teaching process. The use of active methodologies encourages students to participate in the learning process, encourages interaction with their peers, and stimulates thinking about physiological mechanisms. This study examined the performance of medical students on physiology over four semesters with and without active engagement methodologies. Four activities were used: a puzzle, a board game, a debate, and a video. The results show that engaging in activities with active methodologies before a physiology cognitive monitoring test significantly improved student performance compared with not performing the activities. We integrate the use of these methodologies with classic lectures, and this integration appears to improve the teaching/learning process in the discipline of physiology and improves the integration of physiology with cardiology and neurology. In addition, students enjoy the activities and perform better on their evaluations when they use them. Copyright © 2016 The American Physiological Society.
Content Analysis of Evaluation Instruments Used for Student Evaluation of Classroom Teaching Performance in Higher Education.

ERIC Educational Resources Information Center

Tagomori, Harry T.; Bishop, Laurence A.

A major argument against evaluation of teacher performance by students pertains to the instruments being used. Colleges conduct instructional evaluation using instruments they devise, borrow, adopt, or adapt from other institutions. Whether these instruments are tested for content validity is unknown. This study determined how evaluation questions…
Performance Evaluation of a Data Validation System

NASA Technical Reports Server (NTRS)

Wong, Edmond (Technical Monitor); Sowers, T. Shane; Santi, L. Michael; Bickford, Randall L.

2005-01-01

Online data validation is a performance-enhancing component of modern control and health management systems. It is essential that performance of the data validation system be verified prior to its use in a control and health management system. A new Data Qualification and Validation (DQV) Test-bed application was developed to provide a systematic test environment for this performance verification. The DQV Test-bed was used to evaluate a model-based data validation package known as the Data Quality Validation Studio (DQVS). DQVS was employed as the primary data validation component of a rocket engine health management (EHM) system developed under NASA's NGLT (Next Generation Launch Technology) program. In this paper, the DQVS and DQV Test-bed software applications are described, and the DQV Test-bed verification procedure for this EHM system application is presented. Test-bed results are summarized and implications for EHM system performance improvements are discussed.
Performance-cost evaluation methodology for ITS equipment deployment

DOT National Transportation Integrated Search

2000-09-01

Although extensive Intelligent Transportation Systems (ITS) technology is being deployed in the field, little analysis is being performed to evaluate the benefits of implementation schemes. Benefit analysis is particularly in need for one popular ITS...
Skylab program earth resources experiment package sensor performance evaluation, volume 1, (S190A)

NASA Technical Reports Server (NTRS)

Kenney, G. P.

1975-01-01

The results of S190A sensor performance evaluation are summarized based on data presented by all contributors to the sensor performance evaluation interim reports. Techniques used in sensor performance evaluation are discussed. Topics discussed include: performance degradation identified during the Skylab missions, S190A and EREP system anomalies that affected S190A performance, and the performance achieved, in terms of pertinent S190A parameters. Additional analyses include final performance analyses completed after submittal of the SL4 interim sensor performance evaluation reports, including completion of detailed analyses of basic performance parameters initiated during the interim report periods and consolidation analyses to reduce independent mission data (SL2, SL3, and SL4) to determine overall performance realized during all three Skylab missions.
A Computational Observer For Performing Contrast-Detail Analysis Of Ultrasound Images

NASA Astrophysics Data System (ADS)

Lopez, H.; Loew, M. H.

1988-06-01

Contrast-Detail (C/D) analysis allows the quantitative determination of an imaging system's ability to display a range of varying-size targets as a function of contrast. Using this technique, a contrast-detail plot is obtained which can, in theory, be used to compare image quality from one imaging system to another. The C/D plot, however, is usually obtained by using data from human observer readings. We have shown earlier(7) that the performance of human observers in the task of threshold detection of simulated lesions embedded in random ultrasound noise is highly inaccurate and non-reproducible for untrained observers. We present an objective, computational method for the determination of the C/D curve for ultrasound images. This method utilizes digital images of the C/D phantom developed at CDRH, and lesion-detection algorithms that simulate the Bayesian approach using the likelihood function for an ideal observer. We present the results of this method, and discuss the relationship to the human observer and to the comparability of image quality between systems.
THE ROLE OF OBSERVATION AND FEEDBACK IN ENHANCING PERFORMANCE WITH MEDICATION ADMINISTRATION.

PubMed

Davies, Karen; Mitchell, Charles; Coombes, Ian

2015-12-01

Legislation in Queensland such as the Health (Drugs and Poisons) Regulation 1996, the national registration competency standards set by the Nursing and Midwifery Board of Australia, and the Continuing Professional Development Registration Standards made pursuant to the Health Practitioner Regulation National Law define expected standards of practice for nurses. The Framework for Assessing Standards for Practice for Registered Nurses, Enrolled Nurses and Midwives, released in July 2015, includes the principles for assessing standards but not the methods. Local policies and procedures offer specific requirements founded on evidence-based practice. Observation of clinical practice with the provision of immediate descriptive feedback to individual practitioners has been associated with improved performance. This column describes the role of regular observation and individual feedback on medication administration as a strategy to enhance performance and patient care.
Student Performance Evaluation. Physical Educators for Equity. Module 7.

ERIC Educational Resources Information Center

Uhlir, Ann

Guidelines are presented to aid secondary school physical education teachers in evaluating student performance in a way that avoids sex-role stereotyping and sex discrimination. Suggestions made for conducting testing in a bias-free setting include: (1) avoid sex-differentiated role tasks; (2) organize motor-performance testing procedures so that…
Evaluation of the channelized Hotelling observer with an internal-noise model in a train-test paradigm for cardiac SPECT defect detection.

PubMed

Brankov, Jovan G

2013-10-21

The channelized Hotelling observer (CHO) has become a widely used approach for evaluating medical image quality, acting as a surrogate for human observers in early-stage research on assessment and optimization of imaging devices and algorithms. The CHO is typically used to measure lesion detectability. Its popularity stems from experiments showing that the CHO's detection performance can correlate well with that of human observers. In some cases, CHO performance overestimates human performance; to counteract this effect, an internal-noise model is introduced, which allows the CHO to be tuned to match human-observer performance. Typically, this tuning is achieved using example data obtained from human observers. We argue that this internal-noise tuning step is essentially a model training exercise; therefore, just as in supervised learning, it is essential to test the CHO with an internal-noise model on a set of data that is distinct from that used to tune (train) the model. Furthermore, we argue that, if the CHO is to provide useful insights about new imaging algorithms or devices, the test data should reflect such potential differences from the training data; it is not sufficient simply to use new noise realizations of the same imaging method. Motivated by these considerations, the novelty of this paper is the use of new model selection criteria to evaluate ten established internal-noise models, utilizing four different channel models, in a train-test approach. Though not the focus of the paper, a new internal-noise model is also proposed that outperformed the ten established models in the cases tested. The results, using cardiac perfusion SPECT data, show that the proposed train-test approach is necessary, as judged by the newly proposed model selection criteria, to avoid spurious conclusions. The results also demonstrate that, in some models, the optimal internal-noise parameter is very sensitive to the choice of training data; therefore, these models are prone
Evaluation of Sub Query Performance in SQL Server

NASA Astrophysics Data System (ADS)

Oktavia, Tanty; Sujarwo, Surya

2014-03-01

The paper explores several sub query methods used in a query and their impact on the query performance. The study uses experimental approach to evaluate the performance of each sub query methods combined with indexing strategy. The sub query methods consist of in, exists, relational operator and relational operator combined with top operator. The experimental shows that using relational operator combined with indexing strategy in sub query has greater performance compared with using same method without indexing strategy and also other methods. In summary, for application that emphasized on the performance of retrieving data from database, it better to use relational operator combined with indexing strategy. This study is done on Microsoft SQL Server 2012.
Performance evaluation of extension education centers in universities based on the balanced scorecard.

PubMed

Wu, Hung-Yi; Lin, Yi-Kuei; Chang, Chi-Hsiang

2011-02-01

This study aims at developing a set of appropriate performance evaluation indices mainly based on balanced scorecard (BSC) for extension education centers in universities by utilizing multiple criteria decision making (MCDM). Through literature reviews and experts who have real practical experiences in extension education, adequate performance evaluation indices have been selected and then utilizing the decision making trial and evaluation laboratory (DEMATEL) and analytic network process (ANP), respectively, further establishes the causality between the four BSC perspectives as well as the relative weights between evaluation indices. According to this previous result, an empirical analysis of the performance evaluation of extension education centers of three universities at Taoyuan County in Taiwan is illustrated by applying VlseKriterijumska Optimizacija I Kompromisno Resenje (VIKOR). From the analysis results, it indicates that "Learning and growth" is the significant influential factor and it would affect the other three perspectives. In addition, it is discovered that "Internal process" perspective as well as "Financial" perspective play important roles in the performance evaluation of extension education centers. The top three key performance indices are "After-sales service", "Turnover volume", and "Net income". The proposed evaluation model could be considered as a reference for extension education centers in universities to prioritize their improvements on the key performance indices after performing VIKOR analyses. 2010 Elsevier Ltd. All rights reserved.
An accurate evaluation of the performance of asynchronous DS-CDMA systems with zero-correlation-zone coding in Rayleigh fading

NASA Astrophysics Data System (ADS)

Walker, Ernest; Chen, Xinjia; Cooper, Reginald L.

2010-04-01

An arbitrarily accurate approach is used to determine the bit-error rate (BER) performance for generalized asynchronous DS-CDMA systems, in Gaussian noise with Raleigh fading. In this paper, and the sequel, new theoretical work has been contributed which substantially enhances existing performance analysis formulations. Major contributions include: substantial computational complexity reduction, including a priori BER accuracy bounding; an analytical approach that facilitates performance evaluation for systems with arbitrary spectral spreading distributions, with non-uniform transmission delay distributions. Using prior results, augmented by these enhancements, a generalized DS-CDMA system model is constructed and used to evaluated the BER performance, in a variety of scenarios. In this paper, the generalized system modeling was used to evaluate the performance of both Walsh- Hadamard (WH) and Walsh-Hadamard-seeded zero-correlation-zone (WH-ZCZ) coding. The selection of these codes was informed by the observation that WH codes contain N spectral spreading values (0 to N - 1), one for each code sequence; while WH-ZCZ codes contain only two spectral spreading values (N/2 - 1,N/2); where N is the sequence length in chips. Since these codes span the spectral spreading range for DS-CDMA coding, by invoking an induction argument, the generalization of the system model is sufficiently supported. The results in this paper, and the sequel, support the claim that an arbitrary accurate performance analysis for DS-CDMA systems can be evaluated over the full range of binary coding, with minimal computational complexity.
Study of Integrated USV/UUV Observation System Performance in Monterey Bay

DTIC Science & Technology

2017-09-01

5 IV. EXPERIMENTAL SETUP... quasi -stationary at depth in low-current environments. This thesis evaluates the performance of deep sensors in determining behavior of a moving source...acoustic sensors that would be quasi -stationary receivers when in drift mode at depth in low current environments. One key advantage to this technique is
Evaluating Preference for Graphic Feedback on Correct versus Incorrect Performance

ERIC Educational Resources Information Center

Sigurdsson, Sigurdur O.; Ring, Brandon M.

2013-01-01

The current study evaluated preferences of undergraduate students for graphic feedback on percentage of incorrect performance versus feedback on percentage of correct performance. A total of 108 participants were enrolled in the study and received graphic feedback on performance on 12 online quizzes. One half of participants received graphic…
Performance Evaluation of the ISS Water Processor Multifiltration Beds

NASA Technical Reports Server (NTRS)

Bowman, Elizabeth M.; Carter, Layne; Wilson, Mark; Cole, Harold; Orozco, Nicole; Snowdon, Doug

2012-01-01

The ISS Water Processor Assembly (WPA) produces potable water from a waste stream containing humidity condensate and urine distillate. The primary treatment process is achieved in the Multifiltration Bed, which includes adsorbent media and ion exchange resin for the removal of dissolved organic and inorganic contaminants. The first Multifiltration Bed was replaced on ISS in July 2010 after initial indication of inorganic breakthrough. This bed was returned to ground in July 2011 for an engineering investigation. The water resident in the bed was analyzed for various parameters to evaluate adsorbent loading, performance of the ion exchange resin, microbial activity, and generation of leachates from the ion exchange resin. Portions of the adsorbent media and ion exchange resin were sampled and subsequently desorbed to identify the primary contaminants removed at various points in the bed. In addition, an unused Multifiltration Bed was evaluated after two years in storage to assess the generation of leachates during storage. This assessment was performed to evaluate the possibility that these leachates are impacting performance of the Catalytic Reactor located downstream of the Multifiltration Bed. The results of these investigations and implications to the operation of the WPA on ISS are documented in this paper.

Evaluating the performance and safety effectiveness of roundabouts.

DOT National Transportation Integrated Search

2011-12-01

This report documents the evaluation of the performance and safety effectiveness of roundabouts within the State of Michigan. The study began with the identification of roundabouts within Michigan. This was followed by collecting data on the geometri...
Evaluation of performance based concrete for bridge decks.

DOT National Transportation Integrated Search

2015-06-01

The Washington State Department of Transportation (WSDOT) revised the concrete : specification for bridge decks in 2011 to be more performance based with the desired effect of : having less early-age shrinkage cracking. This report evaluates a sample...
Evaluation of analytical performance based on partial order methodology.

PubMed

Carlsen, Lars; Bruggemann, Rainer; Kenessova, Olga; Erzhigitov, Erkin

2015-01-01

Classical measurements of performances are typically based on linear scales. However, in analytical chemistry a simple scale may be not sufficient to analyze the analytical performance appropriately. Here partial order methodology can be helpful. Within the context described here, partial order analysis can be seen as an ordinal analysis of data matrices, especially to simplify the relative comparisons of objects due to their data profile (the ordered set of values an object have). Hence, partial order methodology offers a unique possibility to evaluate analytical performance. In the present data as, e.g., provided by the laboratories through interlaboratory comparisons or proficiency testings is used as an illustrative example. However, the presented scheme is likewise applicable for comparison of analytical methods or simply as a tool for optimization of an analytical method. The methodology can be applied without presumptions or pretreatment of the analytical data provided in order to evaluate the analytical performance taking into account all indicators simultaneously and thus elucidating a "distance" from the true value. In the present illustrative example it is assumed that the laboratories analyze a given sample several times and subsequently report the mean value, the standard deviation and the skewness, which simultaneously are used for the evaluation of the analytical performance. The analyses lead to information concerning (1) a partial ordering of the laboratories, subsequently, (2) a "distance" to the Reference laboratory and (3) a classification due to the concept of "peculiar points". Copyright © 2014 Elsevier B.V. All rights reserved.
Performance evaluation of Chrysopogon zizanoides under urban conditions of Kuwait.

PubMed

Suleiman, Majda Khalil; Bhat, Narayana Ramachandra; Jacob, Sheena; Al-Burais, Meali

2018-02-01

Plant physiological and morphological attributes should be critically evaluated for selecting any species for landscaping projects. The selection of a species should be based on the evaluation of its adaptability, noninvasiveness, growth potential, and performance under the prevailing local arid conditions for their aesthetic looks, soil stabilization, and afforestation values. Chrysopogon zizanoides (Vetiver), is suitable for Kuwait because it can withstand fluctuating temperatures ranging from -14 to 55 °C with unique physical and physiological characteristics. Despite the successful growth performance of Vetiver in landscaping projects mostly in several tropical countries, it has not been utilized and evaluated in the Arabian Gulf region. The objective of the current study was to evaluate the performance of selected ten cultivars of Vetiver (ODV-1, 8, 9, 13, 17, 21, 23, Silent Valley, Urlikal, and Pannimedu) in the deficient soil and environmental conditions of Kuwait in urban landscape at minimal maintenance. It is suggested that based on visual greenery effect and overall growth performance cultivars, Pannimedu, Silent Valley, ODV-13, ODV-8 and ODV-9 can be considered for landscaping projects in Kuwait. To obtain the superior crown volume (which considers height and canopy) cultivar Pannimedu is suggested and to get a bushy growth (considering the number of tillers) cultivar ODV-13 and ODV-8 is found to be suitable.
Development of a test protocol for evaluating EVA glove performance

NASA Technical Reports Server (NTRS)

Hinman, Elaine M.

1992-01-01

Testing gloved hand performance involves work from several disciplines. Evaluations performed in the course of reenabling a disabled hand, designing a robotic end effector or master controller, or hard-suit design have all yielded relevant information, and, in most cases, produced performance test methods. Most times, these test methods have been primarily oriented toward their parent discipline. For space operations, a comparative test which would provide a way to quantify pressure glove and end effector performance would be useful in dividing tasks between humans and robots. Such a test would have to rely heavily on sensored measurement, as opposed to questionnaires, to produce relevant data. However, at some point human preference would have to be taken into account. This paper presents a methodology for evaluating gloved hand performance which attempts to respond to these issues. Glove testing of a prototype glove design using this method is described.
Approaches to chronic disease management evaluation in use in Europe: a review of current methods and performance measures.

PubMed

Conklin, Annalijn; Nolte, Ellen; Vrijhoef, Hubertus

2013-01-01

An overview was produced of approaches currently used to evaluate chronic disease management in selected European countries. The study aims to describe the methods and metrics used in Europe as a first to help advance the methodological basis for their assessment. A common template for collection of evaluation methods and performance measures was sent to key informants in twelve European countries; responses were summarized in tables based on template evaluation categories. Extracted data were descriptively analyzed. Approaches to the evaluation of chronic disease management vary widely in objectives, designs, metrics, observation period, and data collection methods. Half of the reported studies used noncontrolled designs. The majority measure clinical process measures, patient behavior and satisfaction, cost and utilization; several also used a range of structural indicators. Effects are usually observed over 1 or 3 years on patient populations with a single, commonly prevalent, chronic disease. There is wide variation within and between European countries on approaches to evaluating chronic disease management in their objectives, designs, indicators, target audiences, and actors involved. This study is the first extensive, international overview of the area reported in the literature.
Reliability and Validity of Observational Risk Screening in Evaluating Dynamic Knee Valgus

PubMed Central

Ekegren, Christina L.; Miller, William C.; Celebrini, Richard G.; Eng, Janice J.; MacIntyre, Donna L.

2012-01-01

Study Design Nonexperimental methodological study. Objectives To determine the interrater and intrarater reliability and validity of using observational risk screening guidelines to evaluate dynamic knee valgus. Background A deficiency in the neuromuscular control of the hip has been identified as a key risk factor for non-contact anterior cruciate ligament (ACL) injury in post pubescent females. This deficiency can manifest itself as a valgus knee alignment during tasks involving hip and knee flexion. There are currently no scientifically tested methods to screen for dynamic knee valgus in the clinic or on the field. Methods Three physiotherapists used observational risk screening guidelines to rate 40 adolescent female soccer players according to their risk of ACL injury. The rating was based on the amount of dynamic knee valgus observed on a drop jump landing. Ratings were evaluated for intrarater and interrater agreement using kappa coefficients. Sensitivity and specificity of ratings were evaluated by comparing observational ratings with measurements obtained using 3-dimensional (3D) motion analysis. Results Kappa coefficients for intrarater and interrater agreement ranged from 0.75 to 0.85, indicating that ratings were reasonably consistent over time and between physiotherapists. Sensitivity values were inadequate, ranging from 67–87%. This indicated that raters failed to detect up to a third of “truly high risk” individuals. Specificity values ranged from 60–72% which was considered adequate for the purposes of the screen. Conclusion Observational risk screening is a practical and cost-effective method of screening for ACL injury risk. Rater agreement and specificity were acceptable for this method but sensitivity was not. To detect a greater proportion of individuals at risk of ACL injury, coaches and clinicians should ensure that they include additional tests for other high risk characteristics in their screening protocols. PMID:19721212
Evaluating Middle School Students' Spatial-scientific Performance in Earth-space Science

NASA Astrophysics Data System (ADS)

Wilhelm, Jennifer; Jackson, C.; Toland, M. D.; Cole, M.; Wilhelm, R. J.

2013-06-01

Many astronomical concepts cannot be understood without a developed understanding of four spatial-mathematics domains defined as follows: a) Geometric Spatial Visualization (GSV) - Visualizing the geometric features of a system as it appears above, below, and within the system’s plane; b) Spatial Projection (SP) - Projecting to a different location and visualizing from that global perspective; c) Cardinal Directions (CD) - Distinguishing directions (N, S, E, W) in order to document an object’s vector position in space; and d) Periodic Patterns - (PP) Recognizing occurrences at regular intervals of time and/or space. For this study, differences were examined between groups of sixth grade students’ spatial-scientific development pre/post implementation of an Earth/Space unit. Treatment teachers employed a NASA-based curriculum (Realistic Explorations in Astronomical Learning), while control teachers implemented their regular Earth/Space units. A 2-level hierarchical linear model was used to evaluate student performance on the Lunar Phases Concept Inventory (LPCI) and four spatial-mathematics domains, while controlling for two variables (gender and ethnicity) at the student level and one variable (teaching experience) at the teacher level. Overall LPCI results show pre-test scores predicted post-test scores, boys performed better than girls, and Whites performed better than non-Whites. We also compared experimental and control groups’ by spatial-mathematics domain outcomes. For GSV, it was found that boys, in general, tended to have higher GSV post-scores. For domains CD and SP, no statistically significant differences were observed. PP results show Whites performed better than non-Whites. Also for PP, a significant cross-level interaction term (gender-treatment) was observed, which means differences in control and experimental groups are dependent on students’ gender. These findings can be interpreted as: (a) the experimental girls scored higher than the
Performance evaluation of wireless communications through capsule endoscope.

PubMed

Takizawa, Kenichi; Aoyagi, Takahiro; Hamaguchi, Kiyoshi; Kohno, Ryuji

2009-01-01

This paper presents a performance evaluation of wireless communications applicable into a capsule endoscope. A numerical model to describe the received signal strength (RSS) radiated from a capsule-sized signal generator is derived through measurements in which a liquid phantom that has equivalent electrical constants is used. By introducing this model and taking into account the characteristics of its direction pattern of the capsule and propagation distance between the implanted capsule and on-body antenna, a cumulative distribution function (CDF) of the received SNR is evaluated. Then, simulation results related to the error ratio in the wireless channel are obtained. These results show that the frequencies of 611 MHz or lesser would be useful for the capsule endoscope applications from the view point of error rate performance. Further, we show that the use of antenna diversity brings additional gain to this application.
Performance and life evaluation of advanced battery technologies for electric vehicle applications

NASA Astrophysics Data System (ADS)

Deluca, W. H.; Gillie, K. R.; Kulaga, J. E.; Smaga, J. A.; Tummillo, A. F.; Webster, C. E.

Advanced battery technology evaluations are performed under simulated electric vehicle (EV) operating conditions at the Argonne Analysis and Diagnostic Laboratory (ADL). The ADL provides a common basis for both performance characterization and life evaluation with unbiased application of tests and analyses. This paper summarizes the performance characterizations and life evaluations conducted in 1990 on nine single cells and fifteen 3- to 360-cell modules that encompass six technologies: (Na/S, Zn/Br, Ni/Fe, Ni/Cd, Ni-metal hydride, and lead-acid). These evaluations were performed for the Department of Energy and Electric Power Research Institute. The results provide battery users, developers, and program managers an interim measure of the progress being made in battery R and D programs, a comparison of battery technologies, and a source of basic data for modelling and continuing R and D.
Note on evaluating safety performance of road infrastructure to motivate safety competition.

PubMed

Han, Sangjin

2016-01-01

Road infrastructures are usually developed and maintained by governments or public sectors. There is no competitor in the market of their jurisdiction. This monopolic feature discourages road authorities from improving the level of safety with proactive motivation. This study suggests how to apply a principle of competition for roads, in particular by means of performance evaluation. It first discusses why road infrastructure has been slow in safety oriented development and management in respect of its business model. Then it suggests some practical ways of how to promote road safety between road authorities, particularly by evaluating safety performance of road infrastructure. These are summarized as decision of safety performance indicators, classification of spatial boundaries, data collection, evaluation, and reporting. Some consideration points are also discussed to make safety performance evaluation on road infrastructure lead to better road safety management.
Functional Capacity Evaluation: Performance of Patients with Chronic Non-specific Low Back Pain Without Waddell Signs.

PubMed

Oesch, Peter; Meyer, Kathrin; Jansen, Beatrice; Kool, Jan

2015-06-01

The primary objective of this study is to evaluate the effect of Waddell signs (WS) on Functional Capacity Evaluation (FCE) in patients with chronic non-specific low back pain (CNSLBP) undergoing fitness for work evaluation. If an effect is observed, the secondary objective is to report performance of patients without WS in a standardized 1 day FCE protocol. Survey of patients with CNSLBP as their primary complaint, referred for fitness for work evaluation, age between 20 and 60 years. Main outcome measures were WS and performance during manual handling assessed with lifting from floor to waist, waist to crown, horizontal and one handed carry; grip strength with Jamar hand held Dynamometer; ambulation with stair climbing and six minute walking test; work postures with elevated work, forward bend standing, kneeling, and sitting. 145 male with a mean age of 44.5 years (±10.1), and 53 females with a mean age of 43.6 years (±11.0) were included. Mean days off work were in male 658 (±1,056) and in female 642 (±886). 33% of all patients presented positive WS. FCE performance in male and female patients with positive and negative WS differed significantly in all comparisons except grip strength of the dominant hand and sitting in female. Performance of patients with negative WS indicated a mean physical capacity corresponding to lightmedium work in females and medium work in males for both age groups. WS should be assessed for interpretation of FCE results. Despite long work absence, patients with CNSLBP with negative WS demonstrated a physical capacity corresponding to substantial physical work demands.
The effect of viewing distance on observer performance in skeletal radiographs

NASA Astrophysics Data System (ADS)

Butler, M. L.; Lowe, J.; Toomey, R. J.; Maher, M.; Evanoff, M. E.; Rainford, L.

2013-03-01

A number of different viewing distances are recommended by international agencies, however none with specific reference to radiologist performance. The purpose of this study was to ascertain the extent to which radiologists performance is affected by viewing distance on softcopy skeletal reporting. Eighty dorsi-palmar (DP) wrist radiographs, of which half feature 1 or more fractures, were viewed by seven observers at 2 viewing distances, 30cm and 70cm. Observers rated the images as normal or not on a scale of 1 to 5 and could mark multiple locations on the images when they visualised a fracture. Viewing distance was measured from the centre of the face plate to the outer canthus of the eye. The DBM MRM analysis showed no statistically significant differences between the area under the curve for the two distances (p = 0.482). The JAFROC analysis, however, demonstrated a statistically significantly higher area under the curve with the 30cm viewing distance than with the 70 cm distance (p = 0.035). This suggests that while observers were able to make decisions about whether an image contained a fracture or not equally well at both viewing distances, they may have been less reliable in terms of fracture localisation or detection of multiple fractures. The impact of viewing distance warrants further attention from both clinical and scientific perspectives.
Evaluating cryostat performance for naval applications

NASA Astrophysics Data System (ADS)

Knoll, David; Willen, Dag; Fesmire, James; Johnson, Wesley; Smith, Jonathan; Meneghelli, Barry; Demko, Jonathan; George, Daniel; Fowler, Brian; Huber, Patti

2012-06-01

The Navy intends to use High Temperature Superconducting Degaussing (HTSDG) coil systems on future Navy platforms. The Navy Metalworking Center (NMC) is leading a team that is addressing cryostat configuration and manufacturing issues associated with fabricating long lengths of flexible, vacuum-jacketed cryostats that meet Navy shipboard performance requirements. The project includes provisions to evaluate the reliability performance, as well as proofing of fabrication techniques. Navy cryostat performance specifications include less than 1 Wm-1 heat loss, 2 MPa working pressure, and a 25-year vacuum life. Cryostat multilayer insulation (MLI) systems developed on the project have been validated using a standardized cryogenic test facility and implemented on 5-meterlong test samples. Performance data from these test samples, which were characterized using both LN2 boiloff and flow-through measurement techniques, will be presented. NMC is working with an Integrated Project Team consisting of Naval Sea Systems Command, Naval Surface Warfare Center-Carderock Division, Southwire Company, nkt cables, Oak Ridge National Laboratory (ORNL), ASRC Aerospace, and NASA Kennedy Space Center (NASA-KSC) to complete these efforts. Approved for public release; distribution is unlimited. This material is submitted with the understanding that right of reproduction for governmental purposes is reserved for the Office of Naval Research, Arlington, Virginia 22203-1995.
The Role of Scheduling in Observing Teacher-Child Interactions

ERIC Educational Resources Information Center

Cash, Anne H.; Pianta, Robert C.

2014-01-01

Observational assessment is being used on a large scale to evaluate the quality of interactions between teachers and children in classroom environments. When one performs observations at scale, features of the protocol such as the scheduling of observations can potentially influence observed scores. In this study interactions were observed for 88…
Evaluation of Precipitation Simulated by Seven SCMs against the ARM Observations at the SGP Site

NASA Technical Reports Server (NTRS)

Song, Hua; Lin, Wuyin; Lin, Yanluan; Wolf, Audrey B.; Neggers, Roel; Donner, Leo J.; Del Genio, Anthony D.; Liu, Yangang

2013-01-01

This study evaluates the performances of seven single-column models (SCMs) by comparing simulated surface precipitation with observations at the Atmospheric Radiation Measurement Program Southern Great Plains (SGP) site from January 1999 to December 2001. Results show that although most SCMs can reproduce the observed precipitation reasonably well, there are significant and interesting differences in their details. In the cold season, the model-observation differences in the frequency and mean intensity of rain events tend to compensate each other for most SCMs. In the warm season, most SCMs produce more rain events in daytime than in nighttime, whereas the observations have more rain events in nighttime. The mean intensities of rain events in these SCMs are much stronger in daytime, but weaker in nighttime, than the observations. The higher frequency of rain events during warm-season daytime in most SCMs is related to the fact that most SCMs produce a spurious precipitation peak around the regime of weak vertical motions but rich in moisture content. The models also show distinct biases between nighttime and daytime in simulating significant rain events. In nighttime, all the SCMs have a lower frequency of moderate-to-strong rain events than the observations for both seasons. In daytime, most SCMs have a higher frequency of moderate-to-strong rain events than the observations, especially in the warm season. Further analysis reveals distinct meteorological backgrounds for large underestimation and overestimation events. The former occur in the strong ascending regimes with negative low-level horizontal heat and moisture advection, whereas the latter occur in the weak or moderate ascending regimes with positive low-level horizontal heat and moisture advection.
Performance measures in the earth observations commercialization applications program

NASA Astrophysics Data System (ADS)

Macauley, Molly K.

1996-03-01

Performance measures in the Earth Observations Commercialization Application Program (EOCAP) are key to its success and include net profitability; enhancements to industry productivity through generic innovations in industry practices, standards, and protocols; and documented contributions to public policy governing the newly developing remote sensing industry. Because EOCAP requires company co-funding, both parties to the agreement (the government and the corporate partner) have incentives to pursue these goals. Further strengthening progress towards these goals are requirements for business plans in the company's EOCAP proposal, detailed scrutiny given these plans during proposal selection, and regularly documented progress reports during project implementation.
Between-individual comparisons in performance evaluation: a perspective from prospect theory.

PubMed

Wong, Kin Fai Ellick; Kwong, Jessica Y Y

2005-03-01

This article examines how between-individual comparisons influence performance evaluations in rating tasks. The authors demonstrated a systematic change in the perceived difference across ratees as a result of changing the way performance information is expressed. Study 1 found that perceived performance difference between 2 individuals was greater when their objective performance levels were presented with small numbers (e.g., absence rates of 2% vs. 5%) than when they were presented with large numbers (e.g., attendance rates of 98% vs. 95%). Extending this finding to situations involving trade-offs between multiple performance attributes across ratees, Study 2 showed that the relative preference for 1 ratee over another actually reversed when the presentation format of the performance information changed. The authors draw upon prospect theory to offer a theoretical framework describing the between-individual comparison aspect of performance evaluation.
Development of a Peer Teaching-Assessment Program and a Peer Observation and Evaluation Tool

PubMed Central

Trujillo, Jennifer M.; Barr, Judith; Gonyeau, Michael; Van Amburgh, Jenny A.; Matthews, S. James; Qualters, Donna

2008-01-01

Objectives To develop a formalized, comprehensive, peer-driven teaching assessment program and a valid and reliable assessment tool. Methods A volunteer taskforce was formed and a peer-assessment program was developed using a multistep, sequential approach and the Peer Observation and Evaluation Tool (POET). A pilot study was conducted to evaluate the efficiency and practicality of the process and to establish interrater reliability of the tool. Intra-class correlation coefficients (ICC) were calculated. Results ICCs for 8 separate lectures evaluated by 2-3 observers ranged from 0.66 to 0.97, indicating good interrater reliability of the tool. Conclusion Our peer assessment program for large classroom teaching, which includes a valid and reliable evaluation tool, is comprehensive, feasible, and can be adopted by other schools of pharmacy. PMID:19325963
Comparison of model and human observer performance in FFDM, DBT, and synthetic mammography

NASA Astrophysics Data System (ADS)

Ikejimba, Lynda; Glick, Stephen J.; Samei, Ehsan; Lo, Joseph Y.

2016-03-01

Reader studies are important in assessing breast imaging systems. The purpose of this work was to assess task-based performance of full field digital mammography (FFDM), digital breast tomosynthesis (DBT), and synthetic mammography (SM) using different phantom types, and to determine an accurate observer model for human readers. Images were acquired on a Hologic Selenia Dimensions system with a uniform and anthropomorphic phantom. A contrast detail insert of small, low-contrast disks was created using an inkjet printer with iodine-doped ink and inserted in the phantoms. The disks varied in diameter from 210 to 630 μm, and in contrast from 1.1% contrast to 2.2% in regular increments. Human and model observers performed a 4-alternative forced choice experiment. The models were a non-prewhitening matched filter with eye model (NPWE) and a channelized Hotelling observer with either Gabor channels (Gabor-CHO) or Laguerre-Gauss channels (LG-CHO). With the given phantoms, reader scores were higher in FFDM and DBT than SM. The structure in the phantom background had a bigger impact on outcome for DBT than for FFDM or SM. All three model observers showed good correlation with humans in the uniform background, with ρ between 0.89 and 0.93. However, in the structured background, only the CHOs had high correlation, with ρ=0.92 for Gabor-CHO, 0.90 for LG-CHO, and 0.77 for NPWE. Because results of any analysis can depend on the phantom structure, conclusions of modality performance may need to be taken in the context of an appropriate model observer and a realistic phantom.

Digital replication of chest radiographs without altering diagnostic observer performance

NASA Astrophysics Data System (ADS)

Flynn, Michael J.; Davies, Eric; Spizarny, David; Beute, Gordon H.; Peterson, Edward; Eyler, William R.; Gross, Barry; Chen, Ji

1991-05-01

A study to test the ability of a high-fidelity system to digitize chest radiographs, store the data in a computer, and reprint the film without altering diagnostic observer performance is reported. Two hundred and fifty-two (252) chest films with subtle image features indicative of interstitial disease, pulmonary nodule, or pneumothorax, along with 36 normal chest films were used in the study. Films were selected from a key word search on a computerized report archive and were graded by two experienced radiologists. Each film was digitized with 86 micron pixels and stored in 4000 X 5000 arrays using a research instrument. Replicates were printed using a commercial laser film printer (Eastman Kodak Company) having 80 micron pixels. Originals and replicates were observed separately by two different experienced radiologists. Each indicated a graded response for the three possible pathologies. The agreement of observers between responses for replicates and originals was described by the kappa statistic and compared to the agreement when rereading the original film. The final result of this study supports a hypothesis that the replicate is indistinguishable from the original.
Evaluation of cloud-resolving modeling of haboobs using in-situ and remotely sensed observations

NASA Astrophysics Data System (ADS)

Anisimov, Anatolii; Axisa, Duncan; Mostamandi, Suleiman; Kucera, Paul A.; Stenchikov, Georgiy

2017-04-01

Arabian Peninsula is one of the major dust generation regions that at present is severely under-sampled. In this study, we combine unique aircraft observations of aerosol and fine-resolution simulations to better quantify dust generation and transport in deep convective storms called haboobs. The aerosol observations were obtained during the "Kingdom of Saudi Arabia Assessment of Rainfall Augmentation" research program that was conducted in the Central and Southwest regions of Saudi Arabia for the years of 2006 through 2009. We ingest the observations from the first phase of the project conducted in the central Arabian Peninsula near Riyadh in April 2007 and focus on the observational cases when the aircraft sampled high concentrations of dust within haboobs. These data are indispensable for assessment of dust properties during periods of extreme aerosol loading. We perform cloud-resolving 2-km simulations using the coupled meteorology-chemistry WRF-Chem model with 8-bin MOSAIC aerosol microphysics scheme that accounts for direct and indirect aerosol effects. The model is validated using observations from surface weather stations, Doppler weather radar network, AERONET stations, MODIS and SEVIRI satellite aerosol sensors. We also compare the model results with recent MERRA-2 reanalysis that assimilates aerosols and chemical components. The model captures the spatiotemporal variability of atmospheric circulation and aerosol properties and calculates contributions of different aerosol species. We specifically compare the simulated aerosols with the aircraft measurements to evaluate the vertical extent and the structure of dust layers in haboobs. The simulated column-averaged dust size distribution compares reasonably well with AERONET and aircraft measurement. Despite total aerosol optical depth in simulations and MERRA2 reanalysis are quite similar, the vertical distribution and regional dust emission fluxes in the model and reanalysis differ significantly. The
A human visual model-based approach of the visual attention and performance evaluation

NASA Astrophysics Data System (ADS)

Le Meur, Olivier; Barba, Dominique; Le Callet, Patrick; Thoreau, Dominique

2005-03-01

In this paper, a coherent computational model of visual selective attention for color pictures is described and its performances are precisely evaluated. The model based on some important behaviours of the human visual system is composed of four parts: visibility, perception, perceptual grouping and saliency map construction. This paper focuses mainly on its performances assessment by achieving extended subjective and objective comparisons with real fixation points captured by an eye-tracking system used by the observers in a task-free viewing mode. From the knowledge of the ground truth, qualitatively and quantitatively comparisons have been made in terms of the measurement of the linear correlation coefficient (CC) and of the Kulback Liebler divergence (KL). On a set of 10 natural color images, the results show that the linear correlation coefficient and the Kullback Leibler divergence are of about 0.71 and 0.46, respectively. CC and Kl measures with this model are respectively improved by about 4% and 7% compared to the best model proposed by L.Itti. Moreover, by comparing the ability of our model to predict eye movements produced by an average observer, we can conclude that our model succeeds quite well in predicting the spatial locations of the most important areas of the image content.
U.S. Geological Survey Standard Reference Sample Project: Performance Evaluation of Analytical Laboratories

USGS Publications Warehouse

Long, H. Keith; Daddow, Richard L.; Farrar, Jerry W.

1998-01-01

Since 1962, the U.S. Geological Survey (USGS) has operated the Standard Reference Sample Project to evaluate the performance of USGS, cooperator, and contractor analytical laboratories that analyze chemical constituents of environmental samples. The laboratories are evaluated by using performance evaluation samples, called Standard Reference Samples (SRSs). SRSs are submitted to laboratories semi-annually for round-robin laboratory performance comparison purposes. Currently, approximately 100 laboratories are evaluated for their analytical performance on six SRSs for inorganic and nutrient constituents. As part of the SRS Project, a surplus of homogeneous, stable SRSs is maintained for purchase by USGS offices and participating laboratories for use in continuing quality-assurance and quality-control activities. Statistical evaluation of the laboratories results provides information to compare the analytical performance of the laboratories and to determine possible analytical deficiences and problems. SRS results also provide information on the bias and variability of different analytical methods used in the SRS analyses.
Performance Evaluation of a Bedside Cardiac SPECT System

NASA Astrophysics Data System (ADS)

Studenski, Matthew T.; Gilland, David R.; Parker, Jason G.; Hammond, B.; Majewski, Stan; Weisenberger, Andrew G.; Popov, Vladimir

2009-06-01

This paper reports on the initial performance evaluation of a bedside cardiac PET/SPECT system. The system was designed to move within a hospital to image critically-ill patients, for example, those in intensive care unit (ICU) or emergency room settings, who cannot easily be transported to a conventional SPECT or PET facility. The system uses two compact (25 cm times 25 cm) detectors with pixilated NaI crystals and position sensitive PMTs. The performance is evaluated for both 140 keV (Tc-99m) and 511 keV (F-18) emitters with the system operating in single photon counting (SPECT) mode. The imaging performance metrics for both 140 keV and 511 keV included intrinsic energy resolution, spatial resolution (intrinsic, system, and reconstructed SPECT), detection sensitivity, count rate capability, and uniformity. Results demonstrated an intrinsic energy resolution of 31% at 140 keV and 23% at 511 keV, a planar intrinsic spatial resolution of 5.6 mm full width half-maximum (FWHM) at 140 keV and 6.3 mm FWHM at 511 keV, and a sensitivity of 4.15 countsmiddotmuCi-1 ldr s-1 at 140 keV and 0.67 counts ldr muCi-1 ldr s-1 at 511 keV. To further the study, a SPECT acquisition using a dynamic cardiac phantom was performed, and the resulting reconstructed images are presented.
Evaluation of results in aesthetic plastic surgery: preliminary observations on mammaplasty.

PubMed

Ferreira, M C

2000-12-01

Aesthetic plastic surgery has received wide public attention in the past few years. Expectations of patients regarding results have been exaggerated; the real place and medical importance of the procedures are still not clear because of a lack of more objective evidence. This study discusses the difficulties encountered related to the scientific evaluation of the aesthetic operations and proposes alternatives for assessment. A frequently performed procedure, reduction mammaplasty, is presented as an example, with its specific evaluation.
Style-independent document labeling: design and performance evaluation

NASA Astrophysics Data System (ADS)

Mao, Song; Kim, Jong Woo; Thoma, George R.

2003-12-01

The Medical Article Records System or MARS has been developed at the U.S. National Library of Medicine (NLM) for automated data entry of bibliographical information from medical journals into MEDLINE, the premier bibliographic citation database at NLM. Currently, a rule-based algorithm (called ZoneCzar) is used for labeling important bibliographical fields (title, author, affiliation, and abstract) on medical journal article page images. While rules have been created for medical journals with regular layout types, new rules have to be manually created for any input journals with arbitrary or new layout types. Therefore, it is of interest to label any journal articles independent of their layout styles. In this paper, we first describe a system (called ZoneMatch) for automated generation of crucial geometric and non-geometric features of important bibliographical fields based on string-matching and clustering techniques. The rule based algorithm is then modified to use these features to perform style-independent labeling. We then describe a performance evaluation method for quantitatively evaluating our algorithm and characterizing its error distributions. Experimental results show that the labeling performance of the rule-based algorithm is significantly improved when the generated features are used.
The Modified, Multi-patient Observed Simulated Handoff Experience (M-OSHE): Assessment and Feedback for Entering Residents on Handoff Performance.

PubMed

Gaffney, Sean; Farnan, Jeanne M; Hirsch, Kristen; McGinty, Michael; Arora, Vineet M

2016-04-01

Despite the identification of transfer of patient responsibility as a Core Entrustable Professional Activity for Entering Residency, rigorous methods to evaluate incoming residents' ability to give a verbal handoff of multiple patients are lacking. Our purpose was to implement a multi-patient, simulation-based curriculum to assess verbal handoff performance. Graduate Medical Education (GME) orientation at an urban, academic medical center. Eighty-four incoming residents from four residency programs participated in the study. The curriculum featured an online training module and a multi-patient observed simulated handoff experience (M-OSHE). Participants verbally "handed off" three mock patients of varying acuity and were evaluated by a trained "receiver" using an expert-informed, five-item checklist. Prior handoff experience in medical school was associated with higher checklist scores (23% none vs. 33% either third OR fourth year vs. 58% third AND fourth year, p = 0.021). Prior training was associated with prioritization of patients based on acuity (12% no training vs. 38% prior training, p = 0.014). All participants agreed that the M-OSHE realistically portrayed a clinical setting. The M-OSHE is a promising strategy for teaching and evaluating entering residents' ability to give verbal handoffs of multiple patients. Prior training and more handoff experience was associated with higher performance, which suggests that additional handoff training in medical school may be of benefit.
Comprehensive Performance Evaluation for Hydrological and Nutrients Simulation Using the Hydrological Simulation Program–Fortran in a Mesoscale Monsoon Watershed, China

PubMed Central

Luo, Chuan; Jiang, Kaixia; Wan, Rongrong; Li, Hengpeng

2017-01-01

The Hydrological Simulation Program–Fortran (HSPF) is a hydrological and water quality computer model that was developed by the United States Environmental Protection Agency. Comprehensive performance evaluations were carried out for hydrological and nutrient simulation using the HSPF model in the Xitiaoxi watershed in China. Streamflow simulation was calibrated from 1 January 2002 to 31 December 2007 and then validated from 1 January 2008 to 31 December 2010 using daily observed data, and nutrient simulation was calibrated and validated using monthly observed data during the period from July 2009 to July 2010. These results of model performance evaluation showed that the streamflows were well simulated over the study period. The determination coefficient (R2) was 0.87, 0.77 and 0.63, and the Nash-Sutcliffe coefficient of efficiency (Ens) was 0.82, 0.76 and 0.65 for the streamflow simulation in annual, monthly and daily time-steps, respectively. Although limited to monthly observed data, satisfactory performance was still achieved during the quantitative evaluation for nutrients. The R2 was 0.73, 0.82 and 0.92, and the Ens was 0.67, 0.74 and 0.86 for nitrate, ammonium and orthophosphate simulation, respectively. Some issues may affect the application of HSPF were also discussed, such as input data quality, parameter values, etc. Overall, the HSPF model can be successfully used to describe streamflow and nutrients transport in the mesoscale watershed located in the East Asian monsoon climate area. This study is expected to serve as a comprehensive and systematic documentation of understanding the HSPF model for wide application and avoiding possible misuses. PMID:29257117
Comprehensive Performance Evaluation for Hydrological and Nutrients Simulation Using the Hydrological Simulation Program-Fortran in a Mesoscale Monsoon Watershed, China.

PubMed

Li, Zhaofu; Luo, Chuan; Jiang, Kaixia; Wan, Rongrong; Li, Hengpeng

2017-12-19

The Hydrological Simulation Program-Fortran (HSPF) is a hydrological and water quality computer model that was developed by the United States Environmental Protection Agency. Comprehensive performance evaluations were carried out for hydrological and nutrient simulation using the HSPF model in the Xitiaoxi watershed in China. Streamflow simulation was calibrated from 1 January 2002 to 31 December 2007 and then validated from 1 January 2008 to 31 December 2010 using daily observed data, and nutrient simulation was calibrated and validated using monthly observed data during the period from July 2009 to July 2010. These results of model performance evaluation showed that the streamflows were well simulated over the study period. The determination coefficient ( R ²) was 0.87, 0.77 and 0.63, and the Nash-Sutcliffe coefficient of efficiency (Ens) was 0.82, 0.76 and 0.65 for the streamflow simulation in annual, monthly and daily time-steps, respectively. Although limited to monthly observed data, satisfactory performance was still achieved during the quantitative evaluation for nutrients. The R ² was 0.73, 0.82 and 0.92, and the Ens was 0.67, 0.74 and 0.86 for nitrate, ammonium and orthophosphate simulation, respectively. Some issues may affect the application of HSPF were also discussed, such as input data quality, parameter values, etc. Overall, the HSPF model can be successfully used to describe streamflow and nutrients transport in the mesoscale watershed located in the East Asian monsoon climate area. This study is expected to serve as a comprehensive and systematic documentation of understanding the HSPF model for wide application and avoiding possible misuses.
Learning from Teacher Observations: Challenges and Opportunities Posed by New Teacher Evaluation Systems

ERIC Educational Resources Information Center

Hill, Heather C.; Grossman, Pam

2013-01-01

In this article, Heather C. Hill and Pam Grossman discuss the current focus on using teacher observation instruments as part of new teacher evaluation systems being considered and implemented by states and districts. They argue that if these teacher observation instruments are to achieve the goal of supporting teachers in improving instructional…
EVALUATIONS ON ASR DAMAGE OF CONCRETE STRUCTURE AND ITS STRUCTURAL PERFORMANCE

NASA Astrophysics Data System (ADS)

Ueda, Naoshi; Nakamura, Hikaru; Kunieda, Minoru; Maeno, Hirofumi; Morishit, Noriaki; Asai, Hiroshi

In this paper, experiments and finite element analyses were conducted in order to evaluate effects of ASR on structural performance of RC and PC structures. From the experimental results, it was confirmed that the ASR expansion was affected by the restraint of reinforcement and the magnitude of prestress. The material properties of concrete damaged by ASR had anisotropic characteristics depending on the degree of ASR expansion. Therefore, when the structural performance of RC and PC structures were evaluated by using the material properties of core concrete, the direction and place where cylinder specimens were cored should be considered. On the other hand, by means of proposed analytical method, ASR expansion behaviors of RC and PC beams and changing of their structural performance were evaluated. As the results, it was confirmed that PC structure had much advantage comparing with RC structure regarding the structural performance under ASR damage because of restraint by prestress against the ASR.
WE-E-217A-02: Methodologies for Evaluation of Standalone CAD System Performance.

PubMed

Sahiner, B

2012-06-01

Standalone performance evaluation of a CAD system provides information about the abnormality detection or classification performance of the computerized system alone. Although the performance of the reader with CAD is the final step in CAD system assessment, standalone performance evaluation is an important component for several reasons: First, standalone evaluation informs the reader about the performance level of the CAD system and may have an impact on how the reader uses the system. Second, it provides essential information to the system designer for algorithm optimization during system development. Third, standalone evaluation can provide a detailed description of algorithm performance (e.g., on subgroups of the population) because a larger data set with more samples from different subgroups can be included in standalone studies compared to reader studies. Proper standalone evaluation of a CAD system involves a number of key components, some of which are shared with the assessment of reader performance with CAD. These include (1) selection of a test data set that allows performance assessment with little or no bias and acceptable uncertainty; (2) a reference standard that indicates disease status as well as the location and extent of disease; (3) a clearly defined method for labeling each CAD mark as a true-positive or false-positive; and (4) a properly selected set of metrics to summarize the accuracy of the computer marks and their corresponding scores. In this lecture, we will discuss various approaches for the key components of standalone CAD performance evaluation listed above, and present some of the recommendations and opinions from the AAPM CAD subcommittee on these issues. Learning Objectives 1. Identify basic components and metrics in the assessment of standalone CAD systems 2. Understand how each component may affect the assessed performance 3. Learn about AAPM CAD subcommittee's opinions and recommendations on factors and metrics related to the
The effect of observational learning on students' performance, processes, and motivation in two creative domains.

PubMed

Groenendijk, Talita; Janssen, Tanja; Rijlaarsdam, Gert; van den Bergh, Huub

2013-03-01

Previous research has shown that observation can be effective for learning in various domains, for example, argumentative writing and mathematics. The question in this paper is whether observational learning can also be beneficial when learning to perform creative tasks in visual and verbal arts. We hypothesized that observation has a positive effect on performance, process, and motivation. We expected similarity in competence between the model and the observer to influence the effectiveness of observation. Sample. A total of 131 Dutch students (10(th) grade, 15 years old) participated. Two experiments were carried out (one for visual and one for verbal arts). Participants were randomly assigned to one of three conditions; two observational learning conditions and a control condition (learning by practising). The observational learning conditions differed in instructional focus (on the weaker or the more competent model of a pair to be observed). We found positive effects of observation on creative products, creative processes, and motivation in the visual domain. In the verbal domain, observation seemed to affect the creative process, but not the other variables. The model similarity hypothesis was not confirmed. Results suggest that observation may foster learning in creative domains, especially in the visual arts. © 2011 The British Psychological Society.
Functional assessment and performance evaluation for assistive robotic manipulators: Literature review.

PubMed

Chung, Cheng-Shiu; Wang, Hongwu; Cooper, Rory A

2013-07-01

The user interface development of assistive robotic manipulators can be traced back to the 1960s. Studies include kinematic designs, cost-efficiency, user experience involvements, and performance evaluation. This paper is to review studies conducted with clinical trials using activities of daily living (ADLs) tasks to evaluate performance categorized using the International Classification of Functioning, Disability, and Health (ICF) frameworks, in order to give the scope of current research and provide suggestions for future studies. We conducted a literature search of assistive robotic manipulators from 1970 to 2012 in PubMed, Google Scholar, and University of Pittsburgh Library System - PITTCat. Twenty relevant studies were identified. Studies were separated into two broad categories: user task preferences and user-interface performance measurements of commercialized and developing assistive robotic manipulators. The outcome measures and ICF codes associated with the performance evaluations are reported. Suggestions for the future studies include (1) standardized ADL tasks for the quantitative and qualitative evaluation of task efficiency and performance to build comparable measures between research groups, (2) studies relevant to the tasks from user priority lists and ICF codes, and (3) appropriate clinical functional assessment tests with consideration of constraints in assistive robotic manipulator user interfaces. In addition, these outcome measures will help physicians and therapists build standardized tools while prescribing and assessing assistive robotic manipulators.
Functional assessment and performance evaluation for assistive robotic manipulators: Literature review

PubMed Central

Chung, Cheng-Shiu; Wang, Hongwu; Cooper, Rory A.

2013-01-01

Context The user interface development of assistive robotic manipulators can be traced back to the 1960s. Studies include kinematic designs, cost-efficiency, user experience involvements, and performance evaluation. This paper is to review studies conducted with clinical trials using activities of daily living (ADLs) tasks to evaluate performance categorized using the International Classification of Functioning, Disability, and Health (ICF) frameworks, in order to give the scope of current research and provide suggestions for future studies. Methods We conducted a literature search of assistive robotic manipulators from 1970 to 2012 in PubMed, Google Scholar, and University of Pittsburgh Library System – PITTCat. Results Twenty relevant studies were identified. Conclusion Studies were separated into two broad categories: user task preferences and user-interface performance measurements of commercialized and developing assistive robotic manipulators. The outcome measures and ICF codes associated with the performance evaluations are reported. Suggestions for the future studies include (1) standardized ADL tasks for the quantitative and qualitative evaluation of task efficiency and performance to build comparable measures between research groups, (2) studies relevant to the tasks from user priority lists and ICF codes, and (3) appropriate clinical functional assessment tests with consideration of constraints in assistive robotic manipulator user interfaces. In addition, these outcome measures will help physicians and therapists build standardized tools while prescribing and assessing assistive robotic manipulators. PMID:23820143
Evaluating Algorithm Performance Metrics Tailored for Prognostics

NASA Technical Reports Server (NTRS)

Saxena, Abhinav; Celaya, Jose; Saha, Bhaskar; Saha, Sankalita; Goebel, Kai

2009-01-01

Prognostics has taken a center stage in Condition Based Maintenance (CBM) where it is desired to estimate Remaining Useful Life (RUL) of the system so that remedial measures may be taken in advance to avoid catastrophic events or unwanted downtimes. Validation of such predictions is an important but difficult proposition and a lack of appropriate evaluation methods renders prognostics meaningless. Evaluation methods currently used in the research community are not standardized and in many cases do not sufficiently assess key performance aspects expected out of a prognostics algorithm. In this paper we introduce several new evaluation metrics tailored for prognostics and show that they can effectively evaluate various algorithms as compared to other conventional metrics. Specifically four algorithms namely; Relevance Vector Machine (RVM), Gaussian Process Regression (GPR), Artificial Neural Network (ANN), and Polynomial Regression (PR) are compared. These algorithms vary in complexity and their ability to manage uncertainty around predicted estimates. Results show that the new metrics rank these algorithms in different manner and depending on the requirements and constraints suitable metrics may be chosen. Beyond these results, these metrics offer ideas about how metrics suitable to prognostics may be designed so that the evaluation procedure can be standardized. 1
A Bayesian Approach to Evaluating Consistency between Climate Model Output and Observations

NASA Astrophysics Data System (ADS)

Braverman, A. J.; Cressie, N.; Teixeira, J.

2010-12-01

Like other scientific and engineering problems that involve physical modeling of complex systems, climate models can be evaluated and diagnosed by comparing their output to observations of similar quantities. Though the global remote sensing data record is relatively short by climate research standards, these data offer opportunities to evaluate model predictions in new ways. For example, remote sensing data are spatially and temporally dense enough to provide distributional information that goes beyond simple moments to allow quantification of temporal and spatial dependence structures. In this talk, we propose a new method for exploiting these rich data sets using a Bayesian paradigm. For a collection of climate models, we calculate posterior probabilities its members best represent the physical system each seeks to reproduce. The posterior probability is based on the likelihood that a chosen summary statistic, computed from observations, would be obtained when the model's output is considered as a realization from a stochastic process. By exploring how posterior probabilities change with different statistics, we may paint a more quantitative and complete picture of the strengths and weaknesses of the models relative to the observations. We demonstrate our method using model output from the CMIP archive, and observations from NASA's Atmospheric Infrared Sounder.
Evaluation of Global Observations-Based Evapotranspiration Datasets and IPCC AR4 Simulations

NASA Technical Reports Server (NTRS)

Mueller, B.; Seneviratne, S. I.; Jimenez, C.; Corti, T.; Hirschi, M.; Balsamo, G.; Ciais, P.; Dirmeyer, P.; Fisher, J. B.; Guo, Z.;

2011-01-01

Quantification of global land evapotranspiration (ET) has long been associated with large uncertainties due to the lack of reference observations. Several recently developed products now provide the capacity to estimate ET at global scales. These products, partly based on observational data, include satellite ]based products, land surface model (LSM) simulations, atmospheric reanalysis output, estimates based on empirical upscaling of eddycovariance flux measurements, and atmospheric water balance datasets. The LandFlux-EVAL project aims to evaluate and compare these newly developed datasets. Additionally, an evaluation of IPCC AR4 global climate model (GCM) simulations is presented, providing an assessment of their capacity to reproduce flux behavior relative to the observations ]based products. Though differently constrained with observations, the analyzed reference datasets display similar large-scale ET patterns. ET from the IPCC AR4 simulations was significantly smaller than that from the other products for India (up to 1 mm/d) and parts of eastern South America, and larger in the western USA, Australia and China. The inter-product variance is lower across the IPCC AR4 simulations than across the reference datasets in several regions, which indicates that uncertainties may be underestimated in the IPCC AR4 models due to shared biases of these simulations.

Fuzzy logic based sensor performance evaluation of vehicle mounted metal detector systems

NASA Astrophysics Data System (ADS)

Abeynayake, Canicious; Tran, Minh D.

2015-05-01

Vehicle Mounted Metal Detector (VMMD) systems are widely used for detection of threat objects in humanitarian demining and military route clearance scenarios. Due to the diverse nature of such operational conditions, operational use of VMMD without a proper understanding of its capability boundaries may lead to heavy causalities. Multi-criteria fitness evaluations are crucial for determining capability boundaries of any sensor-based demining equipment. Evaluation of sensor based military equipment is a multi-disciplinary topic combining the efforts of researchers, operators, managers and commanders having different professional backgrounds and knowledge profiles. Information acquired through field tests usually involves uncertainty, vagueness and imprecision due to variations in test and evaluation conditions during a single test or series of tests. This report presents a fuzzy logic based methodology for experimental data analysis and performance evaluation of VMMD. This data evaluation methodology has been developed to evaluate sensor performance by consolidating expert knowledge with experimental data. A case study is presented by implementing the proposed data analysis framework in a VMMD evaluation scenario. The results of this analysis confirm accuracy, practicability and reliability of the fuzzy logic based sensor performance evaluation framework.

A Modified Importance-Performance Framework for Evaluating Recreation-Based Experiential Learning Programs

ERIC Educational Resources Information Center

Pitas, Nicholas; Murray, Alison; Olsen, Max; Graefe, Alan

2017-01-01

This article describes a modified importance-performance framework for use in evaluation of recreation-based experiential learning programs. Importance-performance analysis (IPA) provides an effective and readily applicable means of evaluating many programs, but the near universal satisfaction associated with recreation inhibits the use of IPA in…
Evaluation of emergency department performance - a systematic review on recommended performance and quality-in-care measures.

PubMed

Sørup, Christian Michel; Jacobsen, Peter; Forberg, Jakob Lundager

2013-08-09

Evaluation of emergency department (ED) performance remains a difficult task due to the lack of consensus on performance measures that reflects high quality, efficiency, and sustainability. To describe, map, and critically evaluate which performance measures that the published literature regard as being most relevant in assessing overall ED performance. Following the PRISMA guidelines, a systematic literature review of review articles reporting accentuated ED performance measures was conducted in the databases of PubMed, Cochrane Library, and Web of Science. Study eligibility criteria includes: 1) the main purpose was to discuss, analyse, or promote performance measures best reflecting ED performance, 2) the article was a review article, and 3) the article reported macro-level performance measures, thus reflecting an overall departmental performance level. A number of articles addresses this study's objective (n = 14 of 46 unique hits). Time intervals and patient-related measures were dominant in the identified performance measures in review articles from US, UK, Sweden and Canada. Length of stay (LOS), time between patient arrival to initial clinical assessment, and time between patient arrivals to admission were highlighted by the majority of articles. Concurrently, "patients left without being seen" (LWBS), unplanned re-attendance within a maximum of 72 hours, mortality/morbidity, and number of unintended incidents were the most highlighted performance measures that related directly to the patient. Performance measures related to employees were only stated in two of the 14 included articles. A total of 55 ED performance measures were identified. ED time intervals were the most recommended performance measures followed by patient centeredness and safety performance measures. ED employee related performance measures were rarely mentioned in the investigated literature. The study's results allow for advancement towards improved performance measurement and
The Opinion of Students and Faculty Members about the Effect of the Faculty Performance Evaluation

PubMed Central

Ghahrani, Nassim; Siamian, Hasan; Balaghafari, Azita; Aligolbandi, Kobra; Vahedi, Mohammad

2015-01-01

domains, using binomial test, it could be concluded that only on the regulation domain with the significance level of 0.000, significant different was observed. So that, 30(23%) and 50(53%) supported of the effect of evaluation on the effect of evaluation of situation. Evaluation to improve the regulatory status of teachers and 70% (53 patients), the effects are positive. Students and faculty evaluations to compare the Mann-Whitney U test was used. The results show, only within the rules, with a significance level of 0.01 considered statistically significant relationship between teachers and students there. Conclusion: considering the viewpoints of students and faculty members about the impact of teacher performance evaluation of the students, most of the students believed that the greatest impact assessment has been on the improve educational performance entitled as responsibility of the faculty member for education, interest in presenting lessons, using audio-visual tools, having lesson plans, faculty members participate interest and enthusiasm in presenting lessons the use of teaching aids, lesson plans, faculty members participation in seminars, creating interest in students to participate in class discussions and expressing the importance of learning lessons perspective of teachers, but the faculty members viewpoints indicate the impact of evaluation on the regular attendance and discipline, the greatest impact assessment in the area of regulatory and compliance with the timely and orderly and thus their activities. PMID:26543421
The Opinion of Students and Faculty Members about the Effect of the Faculty Performance Evaluation.

PubMed

Ghahrani, Nassim; Siamian, Hasan; Balaghafari, Azita; Aligolbandi, Kobra; Vahedi, Mohammad

2015-08-01

, it could be concluded that only on the regulation domain with the significance level of 0.000, significant different was observed. So that, 30(23%) and 50(53%) supported of the effect of evaluation on the effect of evaluation of situation. Evaluation to improve the regulatory status of teachers and 70% (53 patients), the effects are positive. Students and faculty evaluations to compare the Mann-Whitney U test was used. The results show, only within the rules, with a significance level of 0.01 considered statistically significant relationship between teachers and students there. considering the viewpoints of students and faculty members about the impact of teacher performance evaluation of the students, most of the students believed that the greatest impact assessment has been on the improve educational performance entitled as responsibility of the faculty member for education, interest in presenting lessons, using audio-visual tools, having lesson plans, faculty members participate interest and enthusiasm in presenting lessons the use of teaching aids, lesson plans, faculty members participation in seminars, creating interest in students to participate in class discussions and expressing the importance of learning lessons perspective of teachers, but the faculty members viewpoints indicate the impact of evaluation on the regular attendance and discipline, the greatest impact assessment in the area of regulatory and compliance with the timely and orderly and thus their activities.
48 CFR 1553.209-70 - EPA Form 1900-26, Contracting Officer's Evaluation of Contractor Performance.

Code of Federal Regulations, 2010 CFR

2010-10-01

..., Contracting Officer's Evaluation of Contractor Performance. 1553.209-70 Section 1553.209-70 Federal... 1553.209-70 EPA Form 1900-26, Contracting Officer's Evaluation of Contractor Performance. As prescribed... evaluation of Contractor performance. ...
Documenting Teacher Candidates' Professional Growth through Performance Evaluation

ERIC Educational Resources Information Center

Brown, Elizabeth Levine; Suh, Jennifer; Parsons, Seth A.; Parker, Audra K.; Ramirez, Erin M.

2015-01-01

In the United States, colleges of education are responding to demands for increased accountability. The purpose of this article is to describe one teacher education program's implementation of a performance evaluation tool during final internship that measures teacher candidates' development across four domains: Planning and Preparation,…
40 CFR 35.9055 - Evaluation of recipient performance.

Code of Federal Regulations, 2013 CFR

2013-07-01

... 40 Protection of Environment 1 2013-07-01 2013-07-01 false Evaluation of recipient performance. 35.9055 Section 35.9055 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY GRANTS AND OTHER FEDERAL ASSISTANCE STATE AND LOCAL ASSISTANCE Financial Assistance for the National Estuary Program § 35.9055...
40 CFR 35.9055 - Evaluation of recipient performance.

Code of Federal Regulations, 2012 CFR

2012-07-01

... 40 Protection of Environment 1 2012-07-01 2012-07-01 false Evaluation of recipient performance. 35.9055 Section 35.9055 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY GRANTS AND OTHER FEDERAL ASSISTANCE STATE AND LOCAL ASSISTANCE Financial Assistance for the National Estuary Program § 35.9055...
40 CFR 35.9055 - Evaluation of recipient performance.

Code of Federal Regulations, 2010 CFR

2010-07-01

... 40 Protection of Environment 1 2010-07-01 2010-07-01 false Evaluation of recipient performance. 35.9055 Section 35.9055 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY GRANTS AND OTHER FEDERAL ASSISTANCE STATE AND LOCAL ASSISTANCE Financial Assistance for the National Estuary Program § 35.9055...
40 CFR 35.9055 - Evaluation of recipient performance.

Code of Federal Regulations, 2014 CFR

2014-07-01

... 40 Protection of Environment 1 2014-07-01 2014-07-01 false Evaluation of recipient performance. 35.9055 Section 35.9055 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY GRANTS AND OTHER FEDERAL ASSISTANCE STATE AND LOCAL ASSISTANCE Financial Assistance for the National Estuary Program § 35.9055...
40 CFR 35.9055 - Evaluation of recipient performance.

Code of Federal Regulations, 2011 CFR

2011-07-01

... 40 Protection of Environment 1 2011-07-01 2011-07-01 false Evaluation of recipient performance. 35.9055 Section 35.9055 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY GRANTS AND OTHER FEDERAL ASSISTANCE STATE AND LOCAL ASSISTANCE Financial Assistance for the National Estuary Program § 35.9055...
Evaluation of reanalysis datasets against observational soil temperature data over China

NASA Astrophysics Data System (ADS)

Yang, Kai; Zhang, Jingyong

2018-01-01

Soil temperature is a key land surface variable, and is a potential predictor for seasonal climate anomalies and extremes. Using observational soil temperature data in China for 1981-2005, we evaluate four reanalysis datasets, the land surface reanalysis of the European Centre for Medium-Range Weather Forecasts (ERA-Interim/Land), the second modern-era retrospective analysis for research and applications (MERRA-2), the National Center for Environmental Prediction Climate Forecast System Reanalysis (NCEP-CFSR), and version 2 of the Global Land Data Assimilation System (GLDAS-2.0), with a focus on 40 cm soil layer. The results show that reanalysis data can mainly reproduce the spatial distributions of soil temperature in summer and winter, especially over the east of China, but generally underestimate their magnitudes. Owing to the influence of precipitation on soil temperature, the four datasets perform better in winter than in summer. The ERA-Interim/Land and GLDAS-2.0 produce spatial characteristics of the climatological mean that are similar to observations. The interannual variability of soil temperature is well reproduced by the ERA-Interim/Land dataset in summer and by the CFSR dataset in winter. The linear trend of soil temperature in summer is well rebuilt by reanalysis datasets. We demonstrate that soil heat fluxes in April-June and in winter are highly correlated with the soil temperature in summer and winter, respectively. Different estimations of surface energy balance components can contribute to different behaviors in reanalysis products in terms of estimating soil temperature. In addition, reanalysis datasets can mainly rebuild the northwest-southeast gradient of soil temperature memory over China.
Ground truth and benchmarks for performance evaluation

NASA Astrophysics Data System (ADS)

Takeuchi, Ayako; Shneier, Michael; Hong, Tsai Hong; Chang, Tommy; Scrapper, Christopher; Cheok, Geraldine S.

2003-09-01

Progress in algorithm development and transfer of results to practical applications such as military robotics requires the setup of standard tasks, of standard qualitative and quantitative measurements for performance evaluation and validation. Although the evaluation and validation of algorithms have been discussed for over a decade, the research community still faces a lack of well-defined and standardized methodology. The range of fundamental problems include a lack of quantifiable measures of performance, a lack of data from state-of-the-art sensors in calibrated real-world environments, and a lack of facilities for conducting realistic experiments. In this research, we propose three methods for creating ground truth databases and benchmarks using multiple sensors. The databases and benchmarks will provide researchers with high quality data from suites of sensors operating in complex environments representing real problems of great relevance to the development of autonomous driving systems. At NIST, we have prototyped a High Mobility Multi-purpose Wheeled Vehicle (HMMWV) system with a suite of sensors including a Riegl ladar, GDRS ladar, stereo CCD, several color cameras, Global Position System (GPS), Inertial Navigation System (INS), pan/tilt encoders, and odometry . All sensors are calibrated with respect to each other in space and time. This allows a database of features and terrain elevation to be built. Ground truth for each sensor can then be extracted from the database. The main goal of this research is to provide ground truth databases for researchers and engineers to evaluate algorithms for effectiveness, efficiency, reliability, and robustness, thus advancing the development of algorithms.
The Impact of Teacher Observations with Coordinated Professional Development on Student Performance: A 27-State Program Evaluation

ERIC Educational Resources Information Center

Shaha, Steven H.; Glassett, Kelly F.; Copas, Aimee

2015-01-01

The impact of teacher observations in alignment with professional development (PD) on teacher efficacy was quantified for 292 schools in 110 districts within 27 U.S. States. Teacher observations conducted by school leaders or designated internal coaches were coordinated with PD offerings aligned with intended teacher improvements. The PD involved…
Spherical aberration yielding optimum visual performance: Evaluation of intraocular lenses using adaptive optics simulation

PubMed Central

Werner, John S.; Elliott, Sarah L.; Choi, Stacey S.; Doble, Nathan

2009-01-01

PURPOSE To evaluate the influence of spherical aberration on contrast sensitivity using adaptive optics. SETTING Vision Science and Advanced Retinal Imaging Laboratory, Department of Ophthalmology & Vision Science, University of California, Davis Medical Center, Sacramento, California, USA. METHODS Contrast sensitivity at 8 cycles per degree was evaluated using an adaptive optics system that permitted aberrations to be measured with a Shack-Hartman wavefront sensor and controlled by a 109 actuator continuous-surface deformable mirror that was at a plane conjugate to the observer’s pupil. Vertical Gabor patches were viewed through a 6.3 mm diameter pupil conjugate aperture. Contrast sensitivity was measured with the deformable mirror set to produce 1 of 5 spherical aberration profiles (−0.2 to +0.2 μm). Contrast sensitivity over the range of spherical aberration was fitted with a polynomial function. RESULTS Three observers (age 21 to 24 years) participated. The measured total mean spherical aberration resulting from the spherical aberration profiles produced by the deformable mirror was between −0.15 μm and +0.25 μm. The peak contrast sensitivity of this function for the 3 observers combined occurred at +0.06 μm of spherical aberration. The peak contrast sensitivity was also achieved with positive spherical aberration for observer (mean 0.09). CONCLUSION There was intersubject variability in the measurements; however, the average visual performance was best with the introduction of a small positive spherical aberration. PMID:19545813
Application of Wavelet Filters in an Evaluation of Photochemical Model Performance

EPA Science Inventory

Air quality model evaluation can be enhanced with time-scale specific comparisons of outputs and observations. For example, high-frequency (hours to one day) time scale information in observed ozone is not well captured by deterministic models and its incorporation into model pe...
Performance Evaluation of a Bedside Cardiac SPECT System

DOE Office of Scientific and Technical Information (OSTI.GOV)

M.T. Studenski, D.R. Gilland, J.G. Parker, B. Hammond, S. Majewski, A.G. Weisenberger, V. Popov

This paper reports on the initial performance evaluation of a bedside cardiac PET/SPECT system. The system was designed to move within a hospital to image critically-ill patients, for example, those in intensive care unit (ICU) or emergency room settings, who cannot easily be transported to a conventional SPECT or PET facility. The system uses two compact (25 cm times 25 cm) detectors with pixilated NaI crystals and position sensitive PMTs. The performance is evaluated for both 140 keV (Tc-99m) and 511 keV (F-18) emitters with the system operating in single photon counting (SPECT) mode. The imaging performance metrics for bothmore » 140 keV and 511 keV included intrinsic energy resolution, spatial resolution (intrinsic, system, and reconstructed SPECT), detection sensitivity, count rate capability, and uniformity. Results demonstrated an intrinsic energy resolution of 31% at 140 keV and 23% at 511 keV, a planar intrinsic spatial resolution of 5.6 mm full width half-maximum (FWHM) at 140 keV and 6.3 mm FWHM at 511 keV, and a sensitivity of 4.15 countsmiddotmuCi-1 ldr s-1 at 140 keV and 0.67 counts ldr muCi-1 ldr s-1 at 511 keV. To further the study, a SPECT acquisition using a dynamic cardiac phantom was performed, and the resulting reconstructed images are presented.« less
Advancing a Model-Validated Statistical Method for Decomposing the Key Oceanic Drivers of Observed Regional Climate Variability and Evaluating Model Performance: Focus on North African Rainfall in CESM

NASA Astrophysics Data System (ADS)

Wang, F.; Notaro, M.; Yu, Y.; Mao, J.; Shi, X.; Wei, Y.

2016-12-01

North (N.) African rainfall is characterized by dramatic interannual to decadal variability with serious socio-economic ramifications. The Sahel and West African Monsoon (WAM) region experienced a dramatic shift to persistent drought by the late 1960s, while the Horn of Africa (HOA) underwent drying since the 1990s. Large disagreementregarding the dominant oceanic drivers of N. African hydrologic variability exists among modeling studies, leading to notable spread in Sahel summer rainfall projections for this century among Coupled Model Intercomparison Project models. In order to gain a deeper understanding of the oceanic drivers of N. African rainfall and establish a benchmark for model evaluation, a statistical method, the multivariate Generalized Equilibrium Feedback Assessment, is validated and applied to observations and a control run from the Community Earth System Model (CESM). This study represents the first time that the dominant oceanic drivers of N. African rainfall were evaluated and systematically compared between observations and model simulations. CESM and the observations consistently agree that tropical oceanic modes are the dominant controls of N. African rainfall. During the monsoon season, CESM and observations agree that an anomalously warm eastern tropical Pacific shifts the Walker Circulation eastward, with its descending branch supporting Sahel drying. CESM and the observations concur that a warmer tropical eastern Atlantic favors a southward-shifted Intertropical Convergence Zone, which intensifies WAM monsoonal rainfall. An observed reduction in Sahel rainfall accompanies this enhanced WAM rainfall, yet is confined to the Atlantic in CESM. During the short rains, both observations and CESM indicate that a positive phase of tropical Indian Ocean dipole (IOD) mode [anomalously warm (cold) in western (eastern) Indian] enhances HOA rainfall. The observed IOD impacts are limited to the short rains, while the simulated impacts are year-round.
Evaluation Method for Low-Temperature Performance of Lithium Battery

NASA Astrophysics Data System (ADS)

Wang, H. W.; Ma, Q.; Fu, Y. L.; Tao, Z. Q.; Xiao, H. Q.; Bai, H.; Bai, H.

2018-05-01

In this paper, the evaluation method for low temperature performance of lithium battery is established. The low temperature performance level was set up to determine the best operating temperature range of the lithium battery using different cathode materials. Results are shared with the consumers for the proper use of lithium battery to make it have a longer service life and avoid the occurrence of early rejection.
Creation of an ensemble of simulated cardiac cases and a human observer study: tools for the development of numerical observers for SPECT myocardial perfusion imaging

NASA Astrophysics Data System (ADS)

O'Connor, J. Michael; Pretorius, P. Hendrik; Gifford, Howard C.; Licho, Robert; Joffe, Samuel; McGuiness, Matthew; Mehurg, Shannon; Zacharias, Michael; Brankov, Jovan G.

2012-02-01

Our previous Single Photon Emission Computed Tomography (SPECT) myocardial perfusion imaging (MPI) research explored the utility of numerical observers. We recently created two hundred and eighty simulated SPECT cardiac cases using Dynamic MCAT (DMCAT) and SIMIND Monte Carlo tools. All simulated cases were then processed with two reconstruction methods: iterative ordered subset expectation maximization (OSEM) and filtered back-projection (FBP). Observer study sets were assembled for both OSEM and FBP methods. Five physicians performed an observer study on one hundred and seventy-nine images from the simulated cases. The observer task was to indicate detection of any myocardial perfusion defect using the American Society of Nuclear Cardiology (ASNC) 17-segment cardiac model and the ASNC five-scale rating guidelines. Human observer Receiver Operating Characteristic (ROC) studies established the guidelines for the subsequent evaluation of numerical model observer (NO) performance. Several NOs were formulated and their performance was compared with the human observer performance. One type of NO was based on evaluation of a cardiac polar map that had been pre-processed using a gradient-magnitude watershed segmentation algorithm. The second type of NO was also based on analysis of a cardiac polar map but with use of a priori calculated average image derived from an ensemble of normal cases.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.