performance evaluation measures: Topics by Science.gov

Sample records for performance evaluation measures

Formative and Summative Evaluation: Related Issues in Performance Measurement.

ERIC Educational Resources Information Center

Wholey, Joseph S.

1996-01-01

Performance measurement can serve both formative and summative evaluation functions. Formative evaluation is typically more useful for government purposes whereas performance measurement is more useful than one-shot evaluations of either formative or summative nature. Evaluators should study performance measurement through case studies and…
Assisting allied health in performance evaluation: a systematic review.

PubMed

Lizarondo, Lucylynn; Grimmer, Karen; Kumar, Saravana

2014-11-14

Performance evaluation raises several challenges to allied health practitioners and there is no agreed approach to measuring or monitoring allied health service performance. The aim of this review was to examine the literature on performance evaluation in healthcare to assist in the establishment of a framework that can guide the measurement and evaluation of allied health clinical service performance. This review determined the core elements of a performance evaluation system, tools for evaluating performance, and barriers to the implementation of performance evaluation. A systematic review of the literature was undertaken. Five electronic databases were used to search for relevant articles: MEDLINE, Embase, CINAHL, PsychInfo, and Academic Search Premier. Articles which focussed on any allied health performance evaluation or those which examined performance in health care in general were considered in the review. Content analysis was used to synthesise the findings from individual articles. A total of 37 articles were included in the review. The literature suggests there are core elements involved in performance evaluation which include prioritising clinical areas for measurement, setting goals, selecting performance measures, identifying sources of feedback, undertaking performance measurement, and reporting the results to relevant stakeholders. The literature describes performance evaluation as multi-dimensional, requiring information or data from more than one perspective to provide a rich assessment of performance. A range of tools or instruments are available to capture various perspectives and gather a comprehensive picture of health care quality. Every allied health care delivery system has different performance needs and will therefore require different approaches. However, there are core processes that can be used as a framework to evaluate allied health performance. A careful examination of barriers to performance evaluation and subsequent tailoring of strategies to overcome these barriers should be undertaken to achieve the aims of performance evaluation. The findings of this review should inform the development of a standardised framework that can be used to measure and evaluate allied health performance. Future research should explore the utility and overall impact of such framework in allied health service delivery.
The Context and Process for Performance Evaluations: Necessary Preconditions for the Use of Performance Evaluations as a Measure of Performance--A Critique of Perry

ERIC Educational Resources Information Center

McCarthy, Mary L.

2006-01-01

This article challenges Perry's research using performance evaluations to determine whether the educational background of child welfare workers is predictive of performance. Institutional theory, an understanding of street-level bureaucracies, and evaluations of field education performance measures are offered as necessary frameworks for Perry's…
Forging a Strategic and Comprehensive Approach to Evaluation within Public and Nonprofit Organizations: Integrating Measurement and Analytics within Evaluation

ERIC Educational Resources Information Center

Newcomer, Kathryn; Brass, Clinton T.

2016-01-01

The "performance movement" has been a subject of enthusiasm and frustration for evaluators. Performance measurement, data analytics, and program evaluation have been treated as different tasks, and those addressing them speak their own languages in their own circles. We suggest that situating performance measurement and data analytics…
Evaluation methodologies for an advanced information processing system

NASA Technical Reports Server (NTRS)

Schabowsky, R. S., Jr.; Gai, E.; Walker, B. K.; Lala, J. H.; Motyka, P.

1984-01-01

The system concept and requirements for an Advanced Information Processing System (AIPS) are briefly described, but the emphasis of this paper is on the evaluation methodologies being developed and utilized in the AIPS program. The evaluation tasks include hardware reliability, maintainability and availability, software reliability, performance, and performability. Hardware RMA and software reliability are addressed with Markov modeling techniques. The performance analysis for AIPS is based on queueing theory. Performability is a measure of merit which combines system reliability and performance measures. The probability laws of the performance measures are obtained from the Markov reliability models. Scalar functions of this law such as the mean and variance provide measures of merit in the AIPS performability evaluations.
The ambiguities of performance-based governance reforms in Italy: Reviving the fortunes of evaluation and performance measurement.

PubMed

Marra, Mita

2018-08-01

Over the past two decades, Italy's administrative reforms have institutionalized evaluation to improve program effectiveness, staff productivity, and results-driven accountability against waste and corruption. Across ministries, regional governments, universities, schools and environmental protection agencies, seemingly unexpected consequences have emerged out of the implementation of performance measurement and evaluation regimes within public organizations. Formal compliance to legally binding evaluation procedures, judicially-sanctioned managerial accountability and lack of cross-agency coordination coupled with long-standing cultural separations among evaluators are some of the ambiguities associated with a performance-based governance system within Italian public administration. Building upon the 'new governane theory,' and qualitative fieldwork, I explore the political consequences of evaluation and performance measurement for possible improvements. From a normative perspective, greater integration between program evaluation and performance measurement can support organizational learning and democratic accountability both at the central and local level. Copyright © 2017 Elsevier Ltd. All rights reserved.
Multiple performance measures are needed to evaluate triage systems in the emergency department.

PubMed

Zachariasse, Joany M; Nieboer, Daan; Oostenbrink, Rianne; Moll, Henriëtte A; Steyerberg, Ewout W

2018-02-01

Emergency department triage systems can be considered prediction rules with an ordinal outcome, where different directions of misclassification have different clinical consequences. We evaluated strategies to compare the performance of triage systems and aimed to propose a set of performance measures that should be used in future studies. We identified performance measures based on literature review and expert knowledge. Their properties are illustrated in a case study evaluating two triage modifications in a cohort of 14,485 pediatric emergency department visits. Strengths and weaknesses of the performance measures were systematically appraised. Commonly reported performance measures are measures of statistical association (34/60 studies) and diagnostic accuracy (17/60 studies). The case study illustrates that none of the performance measures fulfills all criteria for triage evaluation. Decision curves are the performance measures with the most attractive features but require dichotomization. In addition, paired diagnostic accuracy measures can be recommended for dichotomized analysis, and the triage-weighted kappa and Nagelkerke's R 2 for ordinal analyses. Other performance measures provide limited additional information. When comparing modifications of triage systems, decision curves and diagnostic accuracy measures should be used in a dichotomized analysis, and the triage-weighted kappa and Nagelkerke's R 2 in an ordinal approach. Copyright © 2017 Elsevier Inc. All rights reserved.
A performance evaluation model for the Stock Point Logistics Integrated Communication Environment (SPLICE)

NASA Astrophysics Data System (ADS)

Schmidt, J. B.

1985-09-01

This thesis investigates ways of improving the real-time performance of the Stockpoint Logistics Integrated Communication Environment (SPLICE). Performance evaluation through continuous monitoring activities and performance studies are the principle vehicles discussed. The method for implementing this performance evaluation process is the measurement of predefined performance indexes. Performance indexes for SPLICE are offered that would measure these areas. Existing SPLICE capability to carry out performance evaluation is explored, and recommendations are made to enhance that capability.
40 CFR 63.7191 - What records must I keep?

Code of Federal Regulations, 2010 CFR

2010-07-01

... malfunctions. (3) Records of performance tests and performance evaluations as required in § 63.10(b)(2)(viii... measurements, raw performance evaluation measurements). (3) All required CMS measurements (including monitoring... Section 63.7191 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) AIR PROGRAMS...
45 CFR 2522.540 - Do the costs of performance measurement or evaluation count towards the statutory cap on...

Code of Federal Regulations, 2010 CFR

2010-10-01

... evaluation count towards the statutory cap on administrative costs? 2522.540 Section 2522.540 Public Welfare... measurement or evaluation count towards the statutory cap on administrative costs? No, the costs of performance measurement and evaluation do not count towards the statutory five percent cap on administrative...
An examination of the relationships between physicians' clinical and hospital-utilization performance.

PubMed Central

Saywell, R M; Bean, J A; Ludke, R L; Redman, R W; McHugh, G J

1981-01-01

To examine the relationships between measures of attending physician teams' clinical and utilization performance, inpatient hospital audits were conducted in 22 Maryland and western Pennsylvania nonfederal short-term hospitals. A total of 6,980 medical records were abstracted from eight diagnostic categories using the Payne and JCAH PEP medical audit procedures. The results indicate weak statistical associations between the two medical care evaluation audits; between clinical performance and utilization performance, as measured by appropriateness of admissions and length of stay; and between three utilization measures. Based on these findings, it does not appear valid to use performance in one area to evaluate performance in the other in order to measure or evaluate and ultimately improve physicians; clinical or utilization performance. PMID:6946048
An Application of the Impact Evaluation Process for Designing a Performance Measurement and Evaluation Framework in K-12 Environments

ERIC Educational Resources Information Center

Guerra-Lopez, Ingrid; Toker, Sacip

2012-01-01

This article illustrates the application of the Impact Evaluation Process for the design of a performance measurement and evaluation framework for an urban high school. One of the key aims of this framework is to enhance decision-making by providing timely feedback about the effectiveness of various performance improvement interventions. The…
Evaluation of emergency department performance - a systematic review on recommended performance and quality-in-care measures.

PubMed

Sørup, Christian Michel; Jacobsen, Peter; Forberg, Jakob Lundager

2013-08-09

Evaluation of emergency department (ED) performance remains a difficult task due to the lack of consensus on performance measures that reflects high quality, efficiency, and sustainability. To describe, map, and critically evaluate which performance measures that the published literature regard as being most relevant in assessing overall ED performance. Following the PRISMA guidelines, a systematic literature review of review articles reporting accentuated ED performance measures was conducted in the databases of PubMed, Cochrane Library, and Web of Science. Study eligibility criteria includes: 1) the main purpose was to discuss, analyse, or promote performance measures best reflecting ED performance, 2) the article was a review article, and 3) the article reported macro-level performance measures, thus reflecting an overall departmental performance level. A number of articles addresses this study's objective (n = 14 of 46 unique hits). Time intervals and patient-related measures were dominant in the identified performance measures in review articles from US, UK, Sweden and Canada. Length of stay (LOS), time between patient arrival to initial clinical assessment, and time between patient arrivals to admission were highlighted by the majority of articles. Concurrently, "patients left without being seen" (LWBS), unplanned re-attendance within a maximum of 72 hours, mortality/morbidity, and number of unintended incidents were the most highlighted performance measures that related directly to the patient. Performance measures related to employees were only stated in two of the 14 included articles. A total of 55 ED performance measures were identified. ED time intervals were the most recommended performance measures followed by patient centeredness and safety performance measures. ED employee related performance measures were rarely mentioned in the investigated literature. The study's results allow for advancement towards improved performance measurement and standardised assessment across EDs.
Performance Measurement for Public Services in Academic and Research Libraries. Occasional Paper Number #9.

ERIC Educational Resources Information Center

Cronin, Mary J.

This paper defines performance measurement as the clarification of objectives and standards, identification of key activities, data collection and analysis, and formative evaluation of services. It then examines some of the factors involved in using performance measurement to evaluate public services activities, and analyzes performance…
Alternative performance measures for evaluating congestion.

DOT National Transportation Integrated Search

2004-04-01

This report summarizes the results of the work performed under the project Alternative Performance Measures for Evaluating : Congestion. The study first outlines existing approaches to looking at congestion. It then builds on the previous work in the...
Performance measurement for supply chain management and evaluation criteria determination for reverse supply chain management

NASA Astrophysics Data System (ADS)

Kongar, N. Elif

2004-12-01

Today, since customers are able to obtain similar-quality products for similar prices, the lead time has become the only preference criterion for most of the consumers. Therefore, it is crucial that the lead time, i.e., the time spent from the raw material phase till the manufactured good reaches the customer, is minimized. This issue can be investigated under the title of Supply Chain Management (SCM). An efficiently managed supply chain can lead to reduced response time for customers. To achieve this, continuous observation of supply chain efficiency, i.e., a constant performance evaluation of the current SCM is required. Widely used conventional performance measurement methods lack the ability to evaluate a SCM since the supply chain is a dynamic system that requires a more thorough and flexible performance measurement technique. Balanced Scorecard (BS) is an efficient tool for measuring the performance of dynamic systems and has a proven capability of providing the decision makers with the appropriate feedback data. In addition to SCM, a relatively new management field, namely reverse supply chain management (RSCM), also necessitates an appropriate evaluation approach. RSCM differs from SCM in many aspects, i.e., the criteria used for evaluation, the high level of uncertainty involved etc., not allowing the usage of identical evaluation techniques used for SCM. This study proposes a generic Balanced Scorecard to measure the performance of supply chain management while defining the appropriate performance measures for SCM. A scorecard prototype, ESCAPE, is presented to demonstrate the evaluation process.
Objective Situation Awareness Measurement Based on Performance Self-Evaluation

NASA Technical Reports Server (NTRS)

DeMaio, Joe

1998-01-01

The research was conducted in support of the NASA Safe All-Weather Flight Operations for Rotorcraft (SAFOR) program. The purpose of the work was to investigate the utility of two measurement tools developed by the British Defense Evaluation Research Agency. These tools were a subjective workload assessment scale, the DRA Workload Scale and a situation awareness measurement tool. The situation awareness tool uses a comparison of the crew's self-evaluation of performance against actual performance in order to determine what information the crew attended to during the performance. These two measurement tools were evaluated in the context of a test of innovative approach to alerting the crew by way of a helmet mounted display. The situation assessment data are reported here. The performance self-evaluation metric of situation awareness was found to be highly effective. It was used to evaluate situation awareness on a tank reconnaissance task, a tactical navigation task, and a stylized task used to evaluated handling qualities. Using the self-evaluation metric, it was possible to evaluate situation awareness, without exact knowledge the relevant information in some cases and to identify information to which the crew attended or failed to attend in others.
Promoting Accountability and Continual Improvement: A Review of the Respective Roles of Performance Measurement, Auditing, Evaluation, and Reporting.

ERIC Educational Resources Information Center

Reid, William

1999-01-01

Provides a synthesis of the literature pertaining to the principles and practices of performance measurement, auditing, evaluation, and reporting. Discusses how bringing these elements together in a performance management system can be achieved through refinement of strategic direction, reporting on key measures, and periodic, systematic…
Development of an Online Toolkit for Measuring Performance in Health Emergency Response Exercises.

PubMed

Agboola, Foluso; Bernard, Dorothy; Savoia, Elena; Biddinger, Paul D

2015-10-01

Exercises that simulate emergency scenarios are accepted widely as an essential component of a robust Emergency Preparedness program. Unfortunately, the variability in the quality of the exercises conducted, and the lack of standardized processes to measure performance, has limited the value of exercises in measuring preparedness. In order to help health organizations improve the quality and standardization of the performance data they collect during simulated emergencies, a model online exercise evaluation toolkit was developed using performance measures tested in over 60 Emergency Preparedness exercises. The exercise evaluation toolkit contains three major components: (1) a database of measures that can be used to assess performance during an emergency response exercise; (2) a standardized data collection tool (form); and (3) a program that populates the data collection tool with the measures that have been selected by the user from the database. The evaluation toolkit was pilot tested from January through September 2014 in collaboration with 14 partnering organizations representing 10 public health agencies and four health care agencies from eight states across the US. Exercise planners from the partnering organizations were asked to use the toolkit for their exercise evaluation process and were interviewed to provide feedback on the use of the toolkit, the generated evaluation tool, and the usefulness of the data being gathered for the development of the exercise after-action report. Ninety-three percent (93%) of exercise planners reported that they found the online database of performance measures appropriate for the creation of exercise evaluation forms, and they stated that they would use it again for future exercises. Seventy-two percent (72%) liked the exercise evaluation form that was generated from the toolkit, and 93% reported that the data collected by the use of the evaluation form were useful in gauging their organization's performance during the exercise. Seventy-nine percent (79%) of exercise planners preferred the evaluation form generated by the toolkit to other forms of evaluations. Results of this project show that users found the newly developed toolkit to be user friendly and more relevant to measurement of specific public health and health care capabilities than other tools currently available. The developed toolkit may contribute to the further advancement of developing a valid approach to exercise performance measurement.
On the Performance Evaluation of 3D Reconstruction Techniques from a Sequence of Images

NASA Astrophysics Data System (ADS)

Eid, Ahmed; Farag, Aly

2005-12-01

The performance evaluation of 3D reconstruction techniques is not a simple problem to solve. This is not only due to the increased dimensionality of the problem but also due to the lack of standardized and widely accepted testing methodologies. This paper presents a unified framework for the performance evaluation of different 3D reconstruction techniques. This framework includes a general problem formalization, different measuring criteria, and a classification method as a first step in standardizing the evaluation process. Performance characterization of two standard 3D reconstruction techniques, stereo and space carving, is also presented. The evaluation is performed on the same data set using an image reprojection testing methodology to reduce the dimensionality of the evaluation domain. Also, different measuring strategies are presented and applied to the stereo and space carving techniques. These measuring strategies have shown consistent results in quantifying the performance of these techniques. Additional experiments are performed on the space carving technique to study the effect of the number of input images and the camera pose on its performance.

76 FR 18615 - 30-Day Notice of Proposed Information Collection: Performance Measurement, Evaluation and Public...

Federal Register 2010, 2011, 2012, 2013, 2014

2011-04-04

...: Performance Measurement, Evaluation and Public Diplomacy Program Surveys. OMB Control Number: 1405-0158. Type... and Cultural Affairs, Office of Policy and Evaluation, Evaluation Division (ECA/P/ V). Form Number: SV... State. [FR Doc. 2011-7931 Filed 4-1-11; 8:45 am] BILLING CODE 4710-05-P ...
Evaluation of emergency department performance – a systematic review on recommended performance and quality-in-care measures

PubMed Central

2013-01-01

Background Evaluation of emergency department (ED) performance remains a difficult task due to the lack of consensus on performance measures that reflects high quality, efficiency, and sustainability. Aim To describe, map, and critically evaluate which performance measures that the published literature regard as being most relevant in assessing overall ED performance. Methods Following the PRISMA guidelines, a systematic literature review of review articles reporting accentuated ED performance measures was conducted in the databases of PubMed, Cochrane Library, and Web of Science. Study eligibility criteria includes: 1) the main purpose was to discuss, analyse, or promote performance measures best reflecting ED performance, 2) the article was a review article, and 3) the article reported macro-level performance measures, thus reflecting an overall departmental performance level. Results A number of articles addresses this study’s objective (n = 14 of 46 unique hits). Time intervals and patient-related measures were dominant in the identified performance measures in review articles from US, UK, Sweden and Canada. Length of stay (LOS), time between patient arrival to initial clinical assessment, and time between patient arrivals to admission were highlighted by the majority of articles. Concurrently, “patients left without being seen” (LWBS), unplanned re-attendance within a maximum of 72 hours, mortality/morbidity, and number of unintended incidents were the most highlighted performance measures that related directly to the patient. Performance measures related to employees were only stated in two of the 14 included articles. Conclusions A total of 55 ED performance measures were identified. ED time intervals were the most recommended performance measures followed by patient centeredness and safety performance measures. ED employee related performance measures were rarely mentioned in the investigated literature. The study’s results allow for advancement towards improved performance measurement and standardised assessment across EDs. PMID:23938117
Approaches for Combining Multiple Measures of Teacher Performance: Reliability, Validity, and Implications for Evaluation Policy

ERIC Educational Resources Information Center

Martínez, José Felipe; Schweig, Jonathan; Goldschmidt, Pete

2016-01-01

A key question facing teacher evaluation systems is how to combine multiple measures of complex constructs into composite indicators of performance. We use data from the Measures of Effective Teaching (MET) study to investigate the measurement properties of composite indicators obtained under various conjunctive, disjunctive (or complementary),…
Connected Vehicle Pilot Deployment Program phase 1 : performance measurement and evaluation support plan : New York City : final report.

DOT National Transportation Integrated Search

2016-07-12

This document describes the Performance Measurement and Evaluation Support Plan for the New York City Department of Transportation New York City (NYC) Connected Vehicle Pilot Deployment (CVPD) Project. The report documents the performance metrics tha...
Connected Vehicle Pilot Deployment Program phase 1 : performance measurement and evaluation support plan : Tampa (THEA) : final report.

DOT National Transportation Integrated Search

2016-03-14

The Performance Measurement and Evaluation Support Plan for the Connected Vehicle Pilot Deployment Program Phase 1, Tampa Hillsborough Expressway Authority, outlines the goals and objectives for the Pilot as well as the proposed performance metrics. ...
Evaluation of the Relationship between Literacy and Mathematics Skills as Assessed by Curriculum-Based Measures

ERIC Educational Resources Information Center

Rutherford-Becker, Kristy J.; Vanderwood, Michael L.

2009-01-01

The purpose of this study was to evaluate the extent that reading performance (as measured by curriculum-based measures [CBM] of oral reading fluency [ORF] and Maze reading comprehension), is related to math performance (as measured by CBM math computation and applied math). Additionally, this study examined which of the two reading measures was a…
Rating and Ranking the Role of Bibliometrics and Webometrics in Nursing and Midwifery

PubMed Central

Davidson, Patricia M.; Newton, Phillip J.; Ferguson, Caleb

2014-01-01

Background. Bibliometrics are an essential aspect of measuring academic and organizational performance. Aim. This review seeks to describe methods for measuring bibliometrics, identify the strengths and limitations of methodologies, outline strategies for interpretation, summarise evaluation of nursing and midwifery performance, identify implications for metric of evaluation, and specify the implications for nursing and midwifery and implications of social networking for bibliometrics and measures of individual performance. Method. A review of electronic databases CINAHL, Medline, and Scopus was undertaken using search terms such as bibliometrics, nursing, and midwifery. The reference lists of retrieved articles and Internet sources and social media platforms were also examined. Results. A number of well-established, formal ways of assessment have been identified, including h- and c-indices. Changes in publication practices and the use of the Internet have challenged traditional metrics of influence. Moreover, measuring impact beyond citation metrics is an increasing focus, with social media representing newer ways of establishing performance and impact. Conclusions. Even though a number of measures exist, no single bibliometric measure is perfect. Therefore, multiple approaches to evaluation are recommended. However, bibliometric approaches should not be the only measures upon which academic and scholarly performance are evaluated. PMID:24550691
Measuring human performance on NASA's microgravity aircraft

NASA Technical Reports Server (NTRS)

Morris, Randy B.; Whitmore, Mihriban

1993-01-01

Measuring human performance in a microgravity environment will aid in identifying the design requirements, human capabilities, safety, and productivity of future astronauts. The preliminary understanding of the microgravity effects on human performance can be achieved through evaluations conducted onboard NASA's KC-135 aircraft. These evaluations can be performed in relation to hardware performance, human-hardware interface, and hardware integration. Measuring human performance in the KC-135 simulated environment will contribute to the efforts of optimizing the human-machine interfaces for future and existing space vehicles. However, there are limitations, such as limited number of qualified subjects, unexpected hardware problems, and miscellaneous plane movements which must be taken into consideration. Examples for these evaluations, the results, and their implications are discussed in the paper.
Chapter 15: Commercial New Construction Evaluation Protocol. The Uniform Methods Project: Methods for Determining Energy Efficiency Savings for Specific Measures

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kurnik, Charles W.; Keates, Steven

This protocol is intended to describe the recommended method when evaluating the whole-building performance of new construction projects in the commercial sector. The protocol focuses on energy conservation measures (ECMs) or packages of measures where evaluators can analyze impacts using building simulation. These ECMs typically require the use of calibrated building simulations under Option D of the International Performance Measurement and Verification Protocol (IPMVP).
Chinese Middle School Teachers' Preferences Regarding Performance Evaluation Measures

ERIC Educational Resources Information Center

Liu, Shujie; Xu, Xianxuan; Stronge, James H.

2016-01-01

Teacher performance evaluation currently is receiving unprecedented attention from policy makers, scholars, and practitioners worldwide. This study is one of the few studies of teacher perceptions regarding teacher performance measures that focus on China. We employed a quantitative dominant mixed research design to investigate Chinese teachers'…
Measures of Searcher Performance: A Psychometric Evaluation.

ERIC Educational Resources Information Center

Wildemuth, Barbara M.; And Others

1993-01-01

Describes a study of medical students that was conducted to evaluate measures of performance on factual searches of INQUIRER, a full-text database in microbiology. Measures relating to recall, precision, search term overlap, and efficiency are discussed; reliability and construct validity are considered; and implications for future research are…
Freight performance measures : approach analysis.

DOT National Transportation Integrated Search

2010-05-01

This report reviews the existing state of the art and also the state of the practice of freight performance measurement. Most performance measures at the state level have aimed at evaluating highway or transit infrastructure performance with an empha...
Pulse Transit Time Measurement Using Seismocardiogram, Photoplethysmogram, and Acoustic Recordings: Evaluation and Comparison.

PubMed

Yang, Chenxi; Tavassolian, Negar

2018-05-01

This work proposes a novel method of pulse transit time (PTT) measurement. The proximal arterial location data are collected from seismocardiogram (SCG) recordings by placing a micro-electromechanical accelerometer on the chest wall. The distal arterial location data are recorded using an acoustic sensor placed inside the ear. The performance of distal location recordings is evaluated by comparing SCG-acoustic and SCG-photoplethysmogram (PPG) measurements. PPG and acoustic performances under motion noise are also compared. Experimental results suggest comparable performances for the acoustic-based and PPG-based devices. The feasibility of each PTT measurement method is validated for blood pressure evaluations and its limitations are analyzed.
A method to evaluate process performance by integrating time and resources

NASA Astrophysics Data System (ADS)

Wang, Yu; Wei, Qingjie; Jin, Shuang

2017-06-01

The purpose of process mining is to improve the existing process of the enterprise, so how to measure the performance of the process is particularly important. However, the current research on the performance evaluation method is still insufficient. The main methods of evaluation are mainly using time or resource. These basic statistics cannot evaluate process performance very well. In this paper, a method of evaluating the performance of the process based on time dimension and resource dimension is proposed. This method can be used to measure the utilization and redundancy of resources in the process. This paper will introduce the design principle and formula of the evaluation algorithm. Then, the design and the implementation of the evaluation method will be introduced. Finally, we will use the evaluating method to analyse the event log from a telephone maintenance process and propose an optimization plan.
Predictive validity of driving-simulator assessments following traumatic brain injury: a preliminary study.

PubMed

Lew, Henry L; Poole, John H; Lee, Eun Ha; Jaffe, David L; Huang, Hsiu-Chen; Brodd, Edward

2005-03-01

To evaluate whether driving simulator and road test evaluations can predict long-term driving performance, we conducted a prospective study on 11 patients with moderate to severe traumatic brain injury. Sixteen healthy subjects were also tested to provide normative values on the simulator at baseline. At their initial evaluation (time-1), subjects' driving skills were measured during a 30-minute simulator trial using an automated 12-measure Simulator Performance Index (SPI), while a trained observer also rated their performance using a Driving Performance Inventory (DPI). In addition, patients were evaluated on the road by a certified driving evaluator. Ten months later (time-2), family members observed patients driving for at least 3 hours over 4 weeks and rated their driving performance using the DPI. At time-1, patients were significantly impaired on automated SPI measures of driving skill, including: speed and steering control, accidents, and vigilance to a divided-attention task. These simulator indices significantly predicted the following aspects of observed driving performance at time-2: handling of automobile controls, regulation of vehicle speed and direction, higher-order judgment and self-control, as well as a trend-level association with car accidents. Automated measures of simulator skill (SPI) were more sensitive and accurate than observational measures of simulator skill (DPI) in predicting actual driving performance. To our surprise, the road test results at time-1 showed no significant relation to driving performance at time-2. Simulator-based assessment of patients with brain injuries can provide ecologically valid measures that, in some cases, may be more sensitive than a traditional road test as predictors of long-term driving performance in the community.
A nationwide survey of state-mandated evaluation practices for domestic violence agencies.

PubMed

Riger, Stephanie; Staggs, Susan L

2011-01-01

Many agencies serving survivors of domestic violence are required to evaluate their services. Three possible evaluation strategies include: a) process measurement, which typically involves a frequency count of agency activities, such as the number of counseling hours given; b) outcome evaluation, which measures the impact of agency activities on clients, such as increased understanding of the dynamics of abuse; or c) performance measurement, which assesses the extent to which agencies achieve their stated goals. Findings of a telephone survey of state funders of domestic violence agencies in the United States revealed that most states (67%) require only process measurement, while fewer than 10% require performance measurement. Most (69%) funders reported satisfaction with their evaluation strategy and emphasized the need for involvement of all stakeholders, especially grantees, in developing an evaluation.
Performance measurement: integrating quality management and activity-based cost management.

PubMed

McKeon, T

1996-04-01

The development of an activity-based management system provides a framework for developing performance measures integral to quality and cost management. Performance measures that cross operational boundaries and embrace core processes provide a mechanism to evaluate operational results related to strategic intention and internal and external customers. The author discusses this measurement process that allows managers to evaluate where they are and where they want to be, and to set a course of action that closes the gap between the two.
Combining control input with flight path data to evaluate pilot performance in transport aircraft.

PubMed

Ebbatson, Matt; Harris, Don; Huddlestone, John; Sears, Rodney

2008-11-01

When deriving an objective assessment of piloting performance from flight data records, it is common to employ metrics which purely evaluate errors in flight path parameters. The adequacy of pilot performance is evaluated from the flight path of the aircraft. However, in large jet transport aircraft these measures may be insensitive and require supplementing with frequency-based measures of control input parameters. Flight path and control input data were collected from pilots undertaking a jet transport aircraft conversion course during a series of symmetric and asymmetric approaches in a flight simulator. The flight path data were analyzed for deviations around the optimum flight path while flying an instrument landing approach. Manipulation of the flight controls was subject to analysis using a series of power spectral density measures. The flight path metrics showed no significant differences in performance between the symmetric and asymmetric approaches. However, control input frequency domain measures revealed that the pilots employed highly different control strategies in the pitch and yaw axes. The results demonstrate that to evaluate pilot performance fully in large aircraft, it is necessary to employ performance metrics targeted at both the outer control loop (flight path) and the inner control loop (flight control) parameters in parallel, evaluating both the product and process of a pilot's performance.
Analytically Quantifying Gains in the Test and Evaluation Process through Capabilities-Based Analysis

DTIC Science & Technology

2011-09-01

Evaluation Process through Capabilities-Based Analysis 5. FUNDING NUMBERS 6. AUTHOR(S) Eric J. Lednicky 7. PERFORMING ORGANIZATION NAME(S) AND...ADDRESS(ES) Naval Postgraduate School Monterey, CA 93943-5000 8. PERFORMING ORGANIZATION REPORT NUMBER 9. SPONSORING /MONITORING AGENCY NAME(S...14 C. MEASURES OF EFFECTIVENESS / MEASURES OF PERFORMANCE
Situation Awareness and Workload Measures for SAFOR

NASA Technical Reports Server (NTRS)

DeMaio, Joe; Hart, Sandra G.; Allen, Ed (Technical Monitor)

1999-01-01

The present research was conducted in support of the NASA Safe All-Weather Flight Operations for Rotorcraft (SAFOR) program. The purpose of the work was to investigate the utility of two measurement tools developed by the British Defense Evaluation Research Agency. These tools were a subjective workload assessment scale, the DRA Workload Scale (DRAWS), and a situation awareness measurement tool in which the crews self-evaluation of performance is compared against actual performance. These two measurement tools were evaluated in the context of a test of an innovative approach to alerting the crew by way of a helmet mounted display. The DRAWS was found to be usable, but it offered no advantages over extant scales, and it had only limited resolution. The performance self-evaluation metric of situation awareness was found to be highly effective.

75 FR 42760 - Statement of Organization, Functions, and Delegations of Authority

Federal Register 2010, 2011, 2012, 2013, 2014

2010-07-22

... accounting reports and invoices, and monitoring all spending. The Team develops, defends and executes the... results; performance measurement; research and evaluation methodologies; demonstration testing and model... ACF programs; strategic planning; performance measurement; program and policy evaluation; research and...
Robust Multimodal Cognitive Load Measurement

DTIC Science & Technology

2014-03-26

dimension, Hurst exponent ) of electroencephalogram (EEG) signals to evaluate changes in working memory load during the performance of a cognitive task...dimension, Hurst exponent ) of electroencephalogram (EEG) signals to evaluate changes in working memory load during the performance of a cognitive task with...approximate entropies, wavelet-based complexity measures, correlation dimension, Hurst exponent ) of electroencephalogram (EEG) signals to evaluate changes
Performance measures for multi-vehicle allowance shuttle transit (MAST) system.

DOT National Transportation Integrated Search

2014-09-01

This study investigates the performance measures for multi-vehicle mobility allowance shuttle : transit (MAST) system. Particularly, researchers were primarily concerned with two measures, : waiting time and ride time, to evaluate the performance and...
Performance Evaluation of Nano-JASMINE

NASA Astrophysics Data System (ADS)

Hatsutori, Y.; Kobayashi, Y.; Gouda, N.; Yano, T.; Murooka, J.; Niwa, Y.; Yamada, Y.

We report the results of performance evaluation of the first Japanese astrometry satellite, Nano-JASMINE. It is a very small satellite and weighs only 35 kg. It aims to carry out astrometry measurement of nearby bright stars (z ≤ 7.5 mag) with an accuracy of 3 milli-arcseconds. Nano-JASMINE will be launched by Cyclone-4 rocket in August 2011 from Brazil. The current status is in the process of evaluating the performances. A series of performance tests and numerical analysis were conducted. As a result, the engineering model (EM) of the telescope was measured to be achieving a diffraction-limited performance and confirmed that it has enough performance for scientific astrometry.
Study on verifying the angle measurement performance of the rotary-laser system

NASA Astrophysics Data System (ADS)

Zhao, Jin; Ren, Yongjie; Lin, Jiarui; Yin, Shibin; Zhu, Jigui

2018-04-01

An angle verification method to verify the angle measurement performance of the rotary-laser system was developed. Angle measurement performance has a great impact on measuring accuracy. Although there is some previous research on the verification of angle measuring uncertainty for the rotary-laser system, there are still some limitations. High-precision reference angles are used in the study of the method, and an integrated verification platform is set up to evaluate the performance of the system. This paper also probes the error that has biggest influence on the verification system. Some errors of the verification system are avoided via the experimental method, and some are compensated through the computational formula and curve fitting. Experimental results show that the angle measurement performance meets the requirement for coordinate measurement. The verification platform can evaluate the uncertainty of angle measurement for the rotary-laser system efficiently.
Comparison of patient evaluations of health care quality in relation to WHO measures of achievement in 12 European countries.

PubMed

Kerssens, Jan J; Groenewegen, Peter P; Sixma, Herman J; Boerma, Wienke G W; van der Eijk, Ingrid

2004-02-01

To gain insight into similarities and differences in patient evaluations of quality of primary care across 12 European countries and to correlate patient evaluations with WHO health system performance measures (for example, responsiveness) of these countries. Patient evaluations were derived from a series of Quote (QUality of care Through patients' Eyes) instruments designed to measure the quality of primary care. Various research groups provided a total sample of 5133 patients from 12 countries: Belarus, Denmark, Finland, Greece, Ireland, Israel, Italy, the Netherlands, Norway, Portugal, United Kingdom, and Ukraine. Intraclass correlations of 10 Quote items were calculated to measure differences between countries. The world health report 2000 - Health systems: improving performance performance measures in the same countries were correlated with mean Quote scores. Intra-class correlation coefficients ranged from low to very high, which indicated little variation between countries in some respects (for example, primary care providers have a good understanding of patients' problems in all countries) and large variation in other respects (for example, with respect to prescription of medication and communication between primary care providers). Most correlations between mean Quote scores per country and WHO performance measures were positive. The highest correlation (0.86) was between the primary care provider's understanding of patients' problems and responsiveness according to WHO. Patient evaluations of the quality of primary care showed large differences across countries and related positively to WHO's performance measures of health care systems.
Towards a balanced performance measurement system in a public health care organization.

PubMed

Yuen, Peter P; Ng, Artie W

2012-01-01

This article attempts to devise an integrated performance measurement framework to assess the Hong Kong Hospital Authority (HA) management system by harnessing previous performance measurement systems. An integrated evaluative framework based on the balanced score card (BSC) was developed and applied using the case study method and longitudinal data to evaluate the HA's performance management system. The authors unveil evolving HA performance indicators (P1). Despite the HA staffs explicit quality emphasis, cost control remains the primary focus in their performance measurements. RESEARCH LHNITATIONS/IMPLICATIONS: Data used in this study are from secondary sources, disclosed mostly by HA staff. This study shows public sector staff often attach too much importance to cost control and easily measurable activities at the expense of quality and other less easily measurable attributes'. A balanced performance measurement system, linked to health targets, with a complementary budgeting process that supports pertinent resource allocation is yet to be implemented in Hong Kong's public hospitals.
48 CFR 1516.401-70 - Award term incentives.

Code of Federal Regulations, 2010 CFR

2010-10-01

..., including the evaluation criteria and performance measures, and serves as the basis for award term decisions...) The contractor has failed to achieve the performance measures for the corresponding evaluation period....401-70 Section 1516.401-70 Federal Acquisition Regulations System ENVIRONMENTAL PROTECTION AGENCY...
Information and complexity measures for hydrologic model evaluation

USDA-ARS?s Scientific Manuscript database

Hydrological models are commonly evaluated through the residual-based performance measures such as the root-mean square error or efficiency criteria. Such measures, however, do not evaluate the degree of similarity of patterns in simulated and measured time series. The objective of this study was to...
Functional assessment and performance evaluation for assistive robotic manipulators: Literature review.

PubMed

Chung, Cheng-Shiu; Wang, Hongwu; Cooper, Rory A

2013-07-01

The user interface development of assistive robotic manipulators can be traced back to the 1960s. Studies include kinematic designs, cost-efficiency, user experience involvements, and performance evaluation. This paper is to review studies conducted with clinical trials using activities of daily living (ADLs) tasks to evaluate performance categorized using the International Classification of Functioning, Disability, and Health (ICF) frameworks, in order to give the scope of current research and provide suggestions for future studies. We conducted a literature search of assistive robotic manipulators from 1970 to 2012 in PubMed, Google Scholar, and University of Pittsburgh Library System - PITTCat. Twenty relevant studies were identified. Studies were separated into two broad categories: user task preferences and user-interface performance measurements of commercialized and developing assistive robotic manipulators. The outcome measures and ICF codes associated with the performance evaluations are reported. Suggestions for the future studies include (1) standardized ADL tasks for the quantitative and qualitative evaluation of task efficiency and performance to build comparable measures between research groups, (2) studies relevant to the tasks from user priority lists and ICF codes, and (3) appropriate clinical functional assessment tests with consideration of constraints in assistive robotic manipulator user interfaces. In addition, these outcome measures will help physicians and therapists build standardized tools while prescribing and assessing assistive robotic manipulators.
Human performance evaluation in dual-axis critical task tracking

NASA Technical Reports Server (NTRS)

Ritchie, M. L.; Nataraj, N. S.

1975-01-01

A dual axis tracking using a multiloop critical task was set up to evaluate human performance. The effects of control stick variation and display formats are evaluated. A secondary loading was used to measure the degradation in tracking performance.
An interlaboratory comparison programme on radio frequency electromagnetic field measurements: the second round of the scheme.

PubMed

Nicolopoulou, E P; Ztoupis, I N; Karabetsos, E; Gonos, I F; Stathopulos, I A

2015-04-01

The second round of an interlaboratory comparison scheme on radio frequency electromagnetic field measurements has been conducted in order to evaluate the overall performance of laboratories that perform measurements in the vicinity of mobile phone base stations and broadcast antenna facilities. The participants recorded the electric field strength produced by two high frequency signal generators inside an anechoic chamber in three measurement scenarios with the antennas transmitting each time different signals at the FM, VHF, UHF and GSM frequency bands. In each measurement scenario, the participants also used their measurements in order to calculate the relative exposure ratios. The results were evaluated in each test level calculating performance statistics (z-scores and En numbers). Subsequently, possible sources of errors for each participating laboratory were discussed, and the overall evaluation of their performances was determined by using an aggregated performance statistic. A comparison between the two rounds proves the necessity of the scheme. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Systematic content evaluation and review of measurement properties of questionnaires for measuring self-reported fatigue among older people.

PubMed

Egerton, Thorlene; Riphagen, Ingrid I; Nygård, Arnhild J; Thingstad, Pernille; Helbostad, Jorunn L

2015-09-01

The assessment of fatigue in older people requires simple and user-friendly questionnaires that capture the phenomenon, yet are free from items indistinguishable from other disorders and experiences. This study aimed to evaluate the content, and systematically review and rate the measurement properties of self-report questionnaires for measuring fatigue, in order to identify the most suitable questionnaires for older people. This study firstly involved identification of questionnaires that purport to measure self-reported fatigue, and evaluation of the content using a rating scale developed for the purpose from contemporary understanding of the construct. Secondly, for the questionnaires that had acceptable content, we identified studies reporting measurement properties and rated the methodological quality of those studies according to the COSMIN system. Finally, we extracted and synthesised the results of the studies to give an overall rating for each questionnaire for each measurement property. The protocol was registered with PROSPERO (CRD42013005589). Of the 77 identified questionnaires, twelve were selected for review after content evaluation. Methodological quality varied, and there was a lack of information on measurement error and responsiveness. The PROMIS-Fatigue item bank and short forms perform the best. The FACIT-Fatigue scale, Parkinsons Fatigue Scale, Perform Questionnaire, and Uni-dimensional Fatigue Impact Scale also perform well and can be recommended. Minor modifications to improve performance are suggested. Further evaluation of unresolved measurement properties, particularly with samples including older people, is needed for all the recommended questionnaires.
Performance of biometric quality measures.

PubMed

Grother, Patrick; Tabassi, Elham

2007-04-01

We document methods for the quantitative evaluation of systems that produce a scalar summary of a biometric sample's quality. We are motivated by a need to test claims that quality measures are predictive of matching performance. We regard a quality measurement algorithm as a black box that converts an input sample to an output scalar. We evaluate it by quantifying the association between those values and observed matching results. We advance detection error trade-off and error versus reject characteristics as metrics for the comparative evaluation of sample quality measurement algorithms. We proceed this with a definition of sample quality, a description of the operational use of quality measures. We emphasize the performance goal by including a procedure for annotating the samples of a reference corpus with quality values derived from empirical recognition scores.
Evaluating the use of prior information under different pacing conditions on aircraft inspection performance: The use of virtual reality technology

NASA Astrophysics Data System (ADS)

Bowling, Shannon Raye

The aircraft maintenance industry is a complex system consisting of human and machine components, because of this; much emphasis has been placed on improving aircraft-inspection performance. One proven technique for improving inspection performance is the use of training. There are several strategies that have been implemented for training, one of which is feedforward information. The use of prior information (feedforward) is known to positively affect inspection performance. This information can consist of knowledge about defect characteristics (types, severity/criticality, and location) and the probability of occurrence. Although several studies have been conducted that demonstrate the usefulness of feedforward as a training strategy, there are certain research issues that need to be addressed. This study evaluates the effect of feedforward information in a simulated 3-dimensional environment by the use of virtual reality. A controlled study was conducted to evaluate the effectiveness of feedforward information in a simulated aircraft inspection environment. The study was conducted in two phases. The first phase evaluated the difference between general and detailed inspection at different pacing levels. The second phase evaluated the effect of feedforward information pertaining to severity, probability and location. Analyses of the results showed that subjects performing detailed inspection performed significantly better than while performing general inspection. Pacing also had the effect of reducing performance for both general and detailed inspection. The study also found that as the level of feedforward information increases, performance also increases. In addition to evaluating performance measures, the study also evaluated process and subjective measures. It was found that process measures such as number of fixation points, fixation groups, mean fixation duration, and percent area covered were all affected by the treatment levels. Analyses of the subjective measures also found a correlation between the perceived usefulness of feedforward information and the actual effect on performance. The study also examined the potential of virtual reality as a training tool and analyzed the effect different calculational algorithms have on determining various process measures.
Validity and reliability of a novel measure of activity performance and participation.

PubMed

Murgatroyd, Phil; Karimi, Leila

2016-01-01

To develop and evaluate an innovative clinician-rated measure, which produces global numerical ratings of activity performance and participation. Repeated measures study with 48 community-dwelling participants investigating clinical sensibility, comprehensiveness, practicality, inter-rater reliability, responsiveness, sensitivity and concurrent validity with Barthel Index. Important clinimetric characteristics including comprehensiveness and ease of use were rated >8/10 by clinicians. Inter-rater reliability was excellent on the summary scores (intraclass correlation of 0.95-0.98). There was good evidence that the new outcome measure distinguished between known high and low functional scoring groups, including both responsiveness to change and sensitivity at the same time point in numerous tests. Concurrent validity with the Barthel Index was fair to high (Spearman Rank Order Correlation 0.32-0.85, p > 0.05). The new measure's summary scores were nearly twice as responsive to change compared with the Barthel Index. Other more detailed data could also be generated by the new measure. The Activity Performance Measure is an innovative outcome instrument that showed good clinimetric qualities in this initial study. Some of the results were strong, given the sample size, and further trial and evaluation is appropriate. Implications for Rehabilitation The Activity Performance Measure is an innovative outcome measure covering activity performance and participation. In an initial evaluation, it showed good clinimetric qualities including responsiveness to change, sensitivity, practicality, clinical sensibility, item coverage, inter-rater reliability and concurrent validity with the Barthel Index. Further trial and evaluation is appropriate.
Objective and automated protocols for the evaluation of biomedical search engines using No Title Evaluation protocols.

PubMed

Campagne, Fabien

2008-02-29

The evaluation of information retrieval techniques has traditionally relied on human judges to determine which documents are relevant to a query and which are not. This protocol is used in the Text Retrieval Evaluation Conference (TREC), organized annually for the past 15 years, to support the unbiased evaluation of novel information retrieval approaches. The TREC Genomics Track has recently been introduced to measure the performance of information retrieval for biomedical applications. We describe two protocols for evaluating biomedical information retrieval techniques without human relevance judgments. We call these protocols No Title Evaluation (NT Evaluation). The first protocol measures performance for focused searches, where only one relevant document exists for each query. The second protocol measures performance for queries expected to have potentially many relevant documents per query (high-recall searches). Both protocols take advantage of the clear separation of titles and abstracts found in Medline. We compare the performance obtained with these evaluation protocols to results obtained by reusing the relevance judgments produced in the 2004 and 2005 TREC Genomics Track and observe significant correlations between performance rankings generated by our approach and TREC. Spearman's correlation coefficients in the range of 0.79-0.92 are observed comparing bpref measured with NT Evaluation or with TREC evaluations. For comparison, coefficients in the range 0.86-0.94 can be observed when evaluating the same set of methods with data from two independent TREC Genomics Track evaluations. We discuss the advantages of NT Evaluation over the TRels and the data fusion evaluation protocols introduced recently. Our results suggest that the NT Evaluation protocols described here could be used to optimize some search engine parameters before human evaluation. Further research is needed to determine if NT Evaluation or variants of these protocols can fully substitute for human evaluations.
Child-Report Measures of Occupational Performance: A Systematic Review

PubMed Central

Totino, Rebekah; Doma, Kenji; Leicht, Anthony; Brown, Nicole; Cuomo, Belinda

2016-01-01

Introduction Improving occupational performance is a key service of occupational therapists and client-centred approach to care is central to clinical practice. As such it is important to comprehensively evaluate the quality of psychometric properties reported across measures of occupational performance; in order to guide assessment and treatment planning. Objective To systematically review the literature on the psychometric properties of child-report measures of occupational performance for children ages 2–18 years. Methods A systematic search of the following six electronic databases was conducted: CINAHL; PsycINFO; EMBASE; PubMed; the Health and Psychosocial Instruments (HAPI) database; and Google Scholar. The quality of the studies was evaluated against the COSMIN taxonomy of measurement properties and the overall quality of psychometric properties was evaluated using pre-set psychometric criteria. Results Fifteen articles and one manual were reviewed to assess the psychometric properties of the six measures–the PEGS, MMD, CAPE, PAC, COSA, and OSA- which met the inclusion criteria. Most of the measures had conducted good quality studies to evaluate the psychometric properties of measures (PEGS, CAPE, PAC, OSA); however, the quality of the studies for two of these measures was relatively weak (MMD, COSA). When integrating the quality of the psychometric properties of the measures with the quality of the studies, the PAC stood out as having superior psychometric qualities. Conclusions The overall quality of the psychometric properties of most measures was limited. There is a need for continuing research into the psychometric properties of child-report measures of occupational performance, and to revise and improve the psychometric properties of existing measures. PMID:26808674
Nonstructural urban stormwater quality measures: building a knowledge base to improve their use.

PubMed

Taylor, André C; Fletcher, Tim D

2007-05-01

This article summarizes a research project that investigated the use, performance, cost, and evaluation of nonstructural measures to improve urban stormwater quality. A survey of urban stormwater managers from Australia, New Zealand, and the United States revealed a widespread trend of increasing use of nonstructural measures among leading stormwater management agencies, with at least 76% of 41 types of nonstructural measures being found to be increasing in use. Data gathered from the survey, an international literature review, and a multicriteria analysis highlighted four nonstructural measures of greatest potential value: mandatory town planning controls that promote the adoption of low-impact development principles and techniques; development of strategic urban stormwater management plans for a city, shire, or catchment; stormwater management measures and programs for construction/building sites; and stormwater management activities related to municipal maintenance operations such as maintenance of the stormwater drainage network and manual litter collections. Knowledge gained on the use and performance of nonstructural measures from the survey, literature review, and three trial evaluation projects was used to develop tailored monitoring and evaluation guidelines for these types of measure. These guidelines incorporate a new evaluation framework based on seven alternative styles of evaluation that range from simply monitoring whether a nonstructural measure has been fully implemented to monitoring its impact on waterway health. This research helps to build the stormwater management industry's knowledge base concerning nonstructural measures and provides a practical tool to address common impediments associated with monitoring and evaluating the performance and cost of these measures.
Modular biowaste monitoring system

NASA Technical Reports Server (NTRS)

Fogal, G. L.

1975-01-01

The objective of the Modular Biowaste Monitoring System Program was to generate and evaluate hardware for supporting shuttle life science experimental and diagnostic programs. An initial conceptual design effort established requirements and defined an overall modular system for the collection, measurement, sampling and storage of urine and feces biowastes. This conceptual design effort was followed by the design, fabrication and performance evaluation of a flight prototype model urine collection, volume measurement and sampling capability. No operational or performance deficiencies were uncovered as a result of the performance evaluation tests.

Performance and evaluation of real-time multicomputer control systems

NASA Technical Reports Server (NTRS)

Shin, K. G.

1983-01-01

New performance measures, detailed examples, modeling of error detection process, performance evaluation of rollback recovery methods, experiments on FTMP, and optimal size of an NMR cluster are discussed.
Electrochemical impedance spectroscopy (EIS) as a tool for measuring corrosion of polymer-coated fasteners used in treated wood

Treesearch

Samuel L. Zelinka; Lorraine Ortiz-Candelaria; Donald S. Stone; Douglas R. Rammer

2009-01-01

Currently, many of the polymer-coated fasteners on the market are designed for improved corrosion performance in treated wood; yet, there is no way to evaluate their corrosion performance. In this study, a common technique for measuring the corrosion performance of polymer-coated metals, electrochemical impedance spectroscopy (EIS), was used to evaluate commercial...
Comparative evaluation of performance measures for shading correction in time-lapse fluorescence microscopy.

PubMed

Liu, L; Kan, A; Leckie, C; Hodgkin, P D

2017-04-01

Time-lapse fluorescence microscopy is a valuable technology in cell biology, but it suffers from the inherent problem of intensity inhomogeneity due to uneven illumination or camera nonlinearity, known as shading artefacts. This will lead to inaccurate estimates of single-cell features such as average and total intensity. Numerous shading correction methods have been proposed to remove this effect. In order to compare the performance of different methods, many quantitative performance measures have been developed. However, there is little discussion about which performance measure should be generally applied for evaluation on real data, where the ground truth is absent. In this paper, the state-of-the-art shading correction methods and performance evaluation methods are reviewed. We implement 10 popular shading correction methods on two artificial datasets and four real ones. In order to make an objective comparison between those methods, we employ a number of quantitative performance measures. Extensive validation demonstrates that the coefficient of joint variation (CJV) is the most applicable measure in time-lapse fluorescence images. Based on this measure, we have proposed a novel shading correction method that performs better compared to well-established methods for a range of real data tested. © 2016 The Authors Journal of Microscopy © 2016 Royal Microscopical Society.
Performance and non-destructive evaluation methods of airborne radome and stealth structures

NASA Astrophysics Data System (ADS)

Panwar, Ravi; Ryul Lee, Jung

2018-06-01

In the past few years, great effort has been devoted to the fabrication of highly efficient, broadband radome and stealth (R&S) structures for distinct control, guidance, surveillance and communication applications for airborne platforms. The evaluation of non-planar aircraft R&S structures in terms of their electromagnetic performance and structural damage is still a very challenging task. In this article, distinct measurement techniques are discussed for the electromagnetic performance and non-destructive evaluation (NDE) of R&S structures. This paper deals with an overview of the transmission line method and free space measurement based microwave measurement techniques for the electromagnetic performance evaluation of R&S structures. In addition, various conventional as well as advanced methods, such as millimetre and terahertz wave based imaging techniques with great potential for NDE of load bearing R&S structures, are also discussed in detail. A glimpse of in situ NDE techniques with corresponding experimental setup for R&S structures is also presented. The basic concepts, measurement ranges and their instrumentation, measurement method of different R&S structures and some miscellaneous topics are discussed in detail. Some of the challenges and issues pertaining to the measurement of curved R&S structures are also presented. This study also lists various mathematical models and analytical techniques for the electromagnetic performance evaluation and NDE of R&S structures. The research directions described in this study may be of interest to the scientific community in the aerospace sectors.
40 CFR 60.185 - Monitoring of operations.

Code of Federal Regulations, 2010 CFR

2010-07-01

...) The continuous monitoring system performance evaluation required under § 60.13(c) shall be completed... monitoring system performance evaluation required under § 60.13(c), the reference method referred to under... be Method 6. For the performance evaluation, each concentration measurement shall be of one hour...
Proposed Performance Measures and Strategies for Implementation of the Fatigue Risk Management Guidelines for Emergency Medical Services

DOT National Transportation Integrated Search

2018-01-11

Background: Performance measures are a key component of implementation, dissemination, and evaluation of evidence-based guidelines (EBGs). We developed performance measures for Emergency Medical Services (EMS) stakeholders to enable the implementatio...
Development of performance measures for the assessment of rural planning organizations.

DOT National Transportation Integrated Search

2011-04-27

In order for the Transportation Planning Board to provide oversight and assistance to the 20 RPOs in the state, they : need effective evaluation criteria and performance measures. The existing measures, including the annual : performance report, do n...
Evaluation of Calibration Laboratories Performance

NASA Astrophysics Data System (ADS)

Filipe, Eduarda

2011-12-01

One of the main goals of interlaboratory comparisons (ILCs) is the evaluation of the laboratories performance for the routine calibrations they perform for the clients. In the frame of Accreditation of Laboratories, the national accreditation boards (NABs) in collaboration with the national metrology institutes (NMIs) organize the ILCs needed to comply with the requirements of the international accreditation organizations. In order that an ILC is a reliable tool for a laboratory to validate its best measurement capability (BMC), it is needed that the NMI (reference laboratory) provides a better traveling standard—in terms of accuracy class or uncertainty—than the laboratories BMCs. Although this is the general situation, there are cases where the NABs ask the NMIs to evaluate the performance of the accredited laboratories when calibrating industrial measuring instruments. The aim of this article is to discuss the existing approaches for the evaluation of ILCs and propose a basis for the validation of the laboratories measurement capabilities. An example is drafted with the evaluation of the results of mercury-in-glass thermometers ILC with 12 participant laboratories.
Operational and environmental performance in China's thermal power industry: Taking an effectiveness measure as complement to an efficiency measure.

PubMed

Wang, Ke; Zhang, Jieming; Wei, Yi-Ming

2017-05-01

The trend toward a more fiercely competitive and strictly environmentally regulated electricity market in several countries, including China has led to efforts by both industry and government to develop advanced performance evaluation models that adapt to new evaluation requirements. Traditional operational and environmental efficiency measures do not fully consider the influence of market competition and environmental regulations and, thus, are not sufficient for the thermal power industry to evaluate its operational performance with respect to specific marketing goals (operational effectiveness) and its environmental performance with respect to specific emissions reduction targets (environmental effectiveness). As a complement to an operational efficiency measure, an operational effectiveness measure not only reflects the capacity of an electricity production system to increase its electricity generation through the improvement of operational efficiency, but it also reflects the system's capability to adjust its electricity generation activities to match electricity demand. In addition, as a complement to an environmental efficiency measure, an environmental effectiveness measure not only reflects the capacity of an electricity production system to decrease its pollutant emissions through the improvement of environmental efficiency, but it also reflects the system's capability to adjust its emissions abatement activities to fulfill environmental regulations. Furthermore, an environmental effectiveness measure helps the government regulator to verify the rationality of its emissions reduction targets assigned to the thermal power industry. Several newly developed effectiveness measurements based on data envelopment analysis (DEA) were utilized in this study to evaluate the operational and environmental performance of the thermal power industry in China during 2006-2013. Both efficiency and effectiveness were evaluated from the three perspectives of operational, environmental, and joint adjustments to each electricity production system. The operational and environmental performance changes over time were also captured through an effectiveness measure based on the global Malmquist productivity index. Our empirical results indicated that the performance of China's thermal power industry experienced significant progress during the study period and that policies regarding the development and regulation of the thermal power industry yielded the expected effects. However, the emissions reduction targets assigned to China's thermal power industry are loose and conservative. Copyright © 2017 Elsevier Ltd. All rights reserved.
Computer systems performance measurement techniques.

DOT National Transportation Integrated Search

1971-06-01

Computer system performance measurement techniques, tools, and approaches are presented as a foundation for future recommendations regarding the instrumentation of the ARTS ATC data processing subsystem for purposes of measurement and evaluation.
Evaluating building performance in healthcare facilities: an organizational perspective.

PubMed

Steinke, Claudia; Webster, Lynn; Fontaine, Marie

2010-01-01

Using the environment as a strategic tool is one of the most cost-effective and enduring approaches for improving public health; however, it is one that requires multiple perspectives. The purpose of this article is to highlight an innovative methodology that has been developed for conducting comprehensive performance evaluations in public sector health facilities in Canada. The building performance evaluation methodology described in this paper is a government initiative. The project team developed a comprehensive building evaluation process for all new capital health projects that would respond to the aforementioned need for stakeholders to be more accountable and to better integrate the larger organizational strategy of facilities. The Balanced Scorecard, which is a multiparadigmatic, performance-based business framework, serves as the underlying theoretical framework for this initiative. It was applied in the development of the conceptual model entitled the Building Performance Evaluation Scorecard, which provides the following benefits: (1) It illustrates a process to link facilities more effectively to the overall mission and goals of an organization; (2) It is both a measurement and a management system that has the ability to link regional facilities to measures of success and larger business goals; (3) It provides a standardized methodology that ensures consistency in assessing building performance; and (4) It is more comprehensive than traditional building evaluations. The methodology presented in this paper is both a measurement and management system that integrates the principles of evidence-based design with the practices of pre- and post-occupancy evaluation. It promotes accountability and continues throughout the life cycle of a project. The advantage of applying this framework is that it engages health organizations in clarifying a vision and strategy for their facilities and helps translate those strategies into action and measurable performance outcomes.
45 CFR 2522.700 - How does evaluation differ from performance measurement?

Code of Federal Regulations, 2010 CFR

2010-10-01

... progress, evaluation uses scientifically-based research methods to assess the effectiveness of programs by... the reading ability of students in a program over time to a similar group of students not... example, a performance measure for a literacy program may include the percentage of students receiving...
Study of Adaptive Mathematical Models for Deriving Automated Pilot Performance Measurement Techniques. Volume II. Appendices. Final Report.

ERIC Educational Resources Information Center

Connelly, E. M.; And Others

A new approach to deriving human performance measures and criteria for use in automatically evaluating trainee performance is described. Ultimately, this approach will allow automatic measurement of pilot performance in a flight simulator or from recorded in-flight data. An efficient method of representing performance data within a computer is…
Performance evaluation methodology for historical document image binarization.

PubMed

Ntirogiannis, Konstantinos; Gatos, Basilis; Pratikakis, Ioannis

2013-02-01

Document image binarization is of great importance in the document image analysis and recognition pipeline since it affects further stages of the recognition process. The evaluation of a binarization method aids in studying its algorithmic behavior, as well as verifying its effectiveness, by providing qualitative and quantitative indication of its performance. This paper addresses a pixel-based binarization evaluation methodology for historical handwritten/machine-printed document images. In the proposed evaluation scheme, the recall and precision evaluation measures are properly modified using a weighting scheme that diminishes any potential evaluation bias. Additional performance metrics of the proposed evaluation scheme consist of the percentage rates of broken and missed text, false alarms, background noise, character enlargement, and merging. Several experiments conducted in comparison with other pixel-based evaluation measures demonstrate the validity of the proposed evaluation scheme.
Functional assessment and performance evaluation for assistive robotic manipulators: Literature review

PubMed Central

Chung, Cheng-Shiu; Wang, Hongwu; Cooper, Rory A.

2013-01-01

Context The user interface development of assistive robotic manipulators can be traced back to the 1960s. Studies include kinematic designs, cost-efficiency, user experience involvements, and performance evaluation. This paper is to review studies conducted with clinical trials using activities of daily living (ADLs) tasks to evaluate performance categorized using the International Classification of Functioning, Disability, and Health (ICF) frameworks, in order to give the scope of current research and provide suggestions for future studies. Methods We conducted a literature search of assistive robotic manipulators from 1970 to 2012 in PubMed, Google Scholar, and University of Pittsburgh Library System – PITTCat. Results Twenty relevant studies were identified. Conclusion Studies were separated into two broad categories: user task preferences and user-interface performance measurements of commercialized and developing assistive robotic manipulators. The outcome measures and ICF codes associated with the performance evaluations are reported. Suggestions for the future studies include (1) standardized ADL tasks for the quantitative and qualitative evaluation of task efficiency and performance to build comparable measures between research groups, (2) studies relevant to the tasks from user priority lists and ICF codes, and (3) appropriate clinical functional assessment tests with consideration of constraints in assistive robotic manipulator user interfaces. In addition, these outcome measures will help physicians and therapists build standardized tools while prescribing and assessing assistive robotic manipulators. PMID:23820143
Embedded measures of performance validity using verbal fluency tests in a clinical sample.

PubMed

Sugarman, Michael A; Axelrod, Bradley N

2015-01-01

The objective of this study was to determine to what extent verbal fluency measures can be used as performance validity indicators during neuropsychological evaluation. Participants were clinically referred for neuropsychological evaluation in an urban-based Veteran's Affairs hospital. Participants were placed into 2 groups based on their objectively evaluated effort on performance validity tests (PVTs). Individuals who exhibited credible performance (n = 431) failed 0 PVTs, and those with poor effort (n = 192) failed 2 or more PVTs. All participants completed the Controlled Oral Word Association Test (COWAT) and Animals verbal fluency measures. We evaluated how well verbal fluency scores could discriminate between the 2 groups. Raw scores and T scores for Animals discriminated between the credible performance and poor-effort groups with 90% specificity and greater than 40% sensitivity. COWAT scores had lower sensitivity for detecting poor effort. A combination of FAS and Animals scores into logistic regression models yielded acceptable group classification, with 90% specificity and greater than 44% sensitivity. Verbal fluency measures can yield adequate detection of poor effort during neuropsychological evaluation. We provide suggested cut points and logistic regression models for predicting the probability of poor effort in our clinical setting and offer suggested cutoff scores to optimize sensitivity and specificity.
1999 commuter assistance program evaluation manual

DOT National Transportation Integrated Search

2001-01-01

This manual was developed to assist Florida's Commuter Assistance Programs (CAP) to measure and evaluate their performance. It provides information necessary for a CAP to create and implement its own evaluation program. It discusses performance measu...
Measuring Principal Performance: How Rigorous Are Commonly Used Principal Performance Assessment Instruments? A Quality School Leadership Issue Brief

ERIC Educational Resources Information Center

Condon, Christopher; Clifford, Matthew

2010-01-01

This brief reviews the publicly available principal assessments and points superintendents and policy makers toward strong instruments to measure principal performance. Specifically, the measures included in this review are expressly intended to evaluate principal performance and have varying degrees of publicly available evidence of psychometric…
Performance Evaluation Tests for Environmental Research (PETER): evaluation of 114 measures

NASA Technical Reports Server (NTRS)

Bittner, A. C. Jr; Carter, R. C.; Kennedy, R. S.; Harbeson, M. M.; Krause, M.

1986-01-01

The goal of the Performance Evaluation Tests for Environmental Research (PETER) Program was to identify a set of measures of human capabilities for use in the study of environmental and other time-course effects. 114 measures studied in the PETER Program were evaluated and categorized into four groups based upon task stability and task definition. The Recommended category contained 30 measures that clearly obtained total stabilization and had an acceptable level of reliability efficiency. The Acceptable-But-Redundant category contained 15 measures. The 37 measures in the Marginal category, which included an inordinate number of slope and other derived measures, usually had desirable features which were outweighed by faults. The 32 measures in the Unacceptable category had either differential instability or weak reliability efficiency. It is our opinion that the 30 measures in the Recommended category should be given first consideration for environmental research applications. Further, it is recommended that information pertaining to preexperimental practice requirements and stabilized reliabilities should be utilized in repeated-measures environmental studies.
Team Performance Assessment and Measurement: Theory, Methods, and Applications. Series in Applied Psychology.

ERIC Educational Resources Information Center

Brannick, Michael T., Ed.; Salas, Eduardo, Ed.; Prince, Carolyn, Ed.

This volume presents thoughts on measuring team performance written by experts currently working with teams in fields such as training, evaluation, and process consultation. The chapters are: (1) "An Overview of Team Performance Measurement" (Michael T. Brannick and Carolyn Prince); (2) "A Conceptual Framework for Teamwork Measurement" (Terry L.…

Review and evaluation of performance measures for survival prediction models in external validation settings.

PubMed

Rahman, M Shafiqur; Ambler, Gareth; Choodari-Oskooei, Babak; Omar, Rumana Z

2017-04-18

When developing a prediction model for survival data it is essential to validate its performance in external validation settings using appropriate performance measures. Although a number of such measures have been proposed, there is only limited guidance regarding their use in the context of model validation. This paper reviewed and evaluated a wide range of performance measures to provide some guidelines for their use in practice. An extensive simulation study based on two clinical datasets was conducted to investigate the performance of the measures in external validation settings. Measures were selected from categories that assess the overall performance, discrimination and calibration of a survival prediction model. Some of these have been modified to allow their use with validation data, and a case study is provided to describe how these measures can be estimated in practice. The measures were evaluated with respect to their robustness to censoring and ease of interpretation. All measures are implemented, or are straightforward to implement, in statistical software. Most of the performance measures were reasonably robust to moderate levels of censoring. One exception was Harrell's concordance measure which tended to increase as censoring increased. We recommend that Uno's concordance measure is used to quantify concordance when there are moderate levels of censoring. Alternatively, Gönen and Heller's measure could be considered, especially if censoring is very high, but we suggest that the prediction model is re-calibrated first. We also recommend that Royston's D is routinely reported to assess discrimination since it has an appealing interpretation. The calibration slope is useful for both internal and external validation settings and recommended to report routinely. Our recommendation would be to use any of the predictive accuracy measures and provide the corresponding predictive accuracy curves. In addition, we recommend to investigate the characteristics of the validation data such as the level of censoring and the distribution of the prognostic index derived in the validation setting before choosing the performance measures.
Measuring comparative hospital performance.

PubMed

Griffith, John R; Alexander, Jeffrey A; Jelinek, Richard C

2002-01-01

Leading healthcare provider organizations now use a "balanced scorecard" of performance measures, expanding information reviewed at the governance level to include financial, customer, and internal performance information, as well as providing an opportunity to learn and grow to provide better strategic guidance. The approach, successfully used by other industries, uses competitor data and benchmarks to identify opportunities for improved mission achievement. This article evaluates one set of nine multidimensional hospital performance measures derived from Medicare reports (cash flow, asset turnover, mortality, complications, length of inpatient stay, cost per case, occupancy, change in occupancy, and percent of revenue from outpatient care). The study examines the content validity, reliability and sensitivity, validity of comparison, and independence and concludes that seven of the nine measures (all but the two occupancy measures) represent a potentially useful set for evaluating most U.S. hospitals. This set reflects correctable differences in performance between hospitals serving similar populations, that is, the measures reflect relative performance and identify opportunities to make the organization more successful.
Evaluation of integrated assessment model hindcast experiments: a case study of the GCAM 3.0 land use module

DOE Office of Scientific and Technical Information (OSTI.GOV)

Snyder, Abigail C.; Link, Robert P.; Calvin, Katherine V.

Hindcasting experiments (conducting a model forecast for a time period in which observational data are available) are being undertaken increasingly often by the integrated assessment model (IAM) community, across many scales of models. When they are undertaken, the results are often evaluated using global aggregates or otherwise highly aggregated skill scores that mask deficiencies. We select a set of deviation-based measures that can be applied on different spatial scales (regional versus global) to make evaluating the large number of variable–region combinations in IAMs more tractable. We also identify performance benchmarks for these measures, based on the statistics of the observationalmore » dataset, that allow a model to be evaluated in absolute terms rather than relative to the performance of other models at similar tasks. An ideal evaluation method for hindcast experiments in IAMs would feature both absolute measures for evaluation of a single experiment for a single model and relative measures to compare the results of multiple experiments for a single model or the same experiment repeated across multiple models, such as in community intercomparison studies. The performance benchmarks highlight the use of this scheme for model evaluation in absolute terms, providing information about the reasons a model may perform poorly on a given measure and therefore identifying opportunities for improvement. To demonstrate the use of and types of results possible with the evaluation method, the measures are applied to the results of a past hindcast experiment focusing on land allocation in the Global Change Assessment Model (GCAM) version 3.0. The question of how to more holistically evaluate models as complex as IAMs is an area for future research. We find quantitative evidence that global aggregates alone are not sufficient for evaluating IAMs that require global supply to equal global demand at each time period, such as GCAM. The results of this work indicate it is unlikely that a single evaluation measure for all variables in an IAM exists, and therefore sector-by-sector evaluation may be necessary.« less
Evaluation of integrated assessment model hindcast experiments: a case study of the GCAM 3.0 land use module

DOE PAGES

Snyder, Abigail C.; Link, Robert P.; Calvin, Katherine V.

2017-11-29

Hindcasting experiments (conducting a model forecast for a time period in which observational data are available) are being undertaken increasingly often by the integrated assessment model (IAM) community, across many scales of models. When they are undertaken, the results are often evaluated using global aggregates or otherwise highly aggregated skill scores that mask deficiencies. We select a set of deviation-based measures that can be applied on different spatial scales (regional versus global) to make evaluating the large number of variable–region combinations in IAMs more tractable. We also identify performance benchmarks for these measures, based on the statistics of the observationalmore » dataset, that allow a model to be evaluated in absolute terms rather than relative to the performance of other models at similar tasks. An ideal evaluation method for hindcast experiments in IAMs would feature both absolute measures for evaluation of a single experiment for a single model and relative measures to compare the results of multiple experiments for a single model or the same experiment repeated across multiple models, such as in community intercomparison studies. The performance benchmarks highlight the use of this scheme for model evaluation in absolute terms, providing information about the reasons a model may perform poorly on a given measure and therefore identifying opportunities for improvement. To demonstrate the use of and types of results possible with the evaluation method, the measures are applied to the results of a past hindcast experiment focusing on land allocation in the Global Change Assessment Model (GCAM) version 3.0. The question of how to more holistically evaluate models as complex as IAMs is an area for future research. We find quantitative evidence that global aggregates alone are not sufficient for evaluating IAMs that require global supply to equal global demand at each time period, such as GCAM. The results of this work indicate it is unlikely that a single evaluation measure for all variables in an IAM exists, and therefore sector-by-sector evaluation may be necessary.« less
Evaluation of integrated assessment model hindcast experiments: a case study of the GCAM 3.0 land use module

NASA Astrophysics Data System (ADS)

Snyder, Abigail C.; Link, Robert P.; Calvin, Katherine V.

2017-11-01

Hindcasting experiments (conducting a model forecast for a time period in which observational data are available) are being undertaken increasingly often by the integrated assessment model (IAM) community, across many scales of models. When they are undertaken, the results are often evaluated using global aggregates or otherwise highly aggregated skill scores that mask deficiencies. We select a set of deviation-based measures that can be applied on different spatial scales (regional versus global) to make evaluating the large number of variable-region combinations in IAMs more tractable. We also identify performance benchmarks for these measures, based on the statistics of the observational dataset, that allow a model to be evaluated in absolute terms rather than relative to the performance of other models at similar tasks. An ideal evaluation method for hindcast experiments in IAMs would feature both absolute measures for evaluation of a single experiment for a single model and relative measures to compare the results of multiple experiments for a single model or the same experiment repeated across multiple models, such as in community intercomparison studies. The performance benchmarks highlight the use of this scheme for model evaluation in absolute terms, providing information about the reasons a model may perform poorly on a given measure and therefore identifying opportunities for improvement. To demonstrate the use of and types of results possible with the evaluation method, the measures are applied to the results of a past hindcast experiment focusing on land allocation in the Global Change Assessment Model (GCAM) version 3.0. The question of how to more holistically evaluate models as complex as IAMs is an area for future research. We find quantitative evidence that global aggregates alone are not sufficient for evaluating IAMs that require global supply to equal global demand at each time period, such as GCAM. The results of this work indicate it is unlikely that a single evaluation measure for all variables in an IAM exists, and therefore sector-by-sector evaluation may be necessary.
Evaluation of an institutional project to improve venous thromboembolism prevention.

PubMed

Minami, Christina A; Yang, Anthony D; Ju, Mila; Culver, Eckford; Seifert, Kathryn; Kreutzer, Lindsey; Halverson, Terri; O'Leary, Kevin J; Bilimoria, Karl Y

2016-12-01

Northwestern Memorial Hospital (NMH) was historically a poor performer on the venous thromboembolism (VTE) outcome measure. As this measure has been shown to be flawed by surveillance bias, NMH embraced process-of-care measures to ensure appropriate VTE prophylaxis to assess healthcare-associated VTE prevention efforts. To evaluate the impact of an institution-wide project aimed at improving hospital performance on VTE prophylaxis measures. A retrospective observational study. NMH, an 885-bed academic medical center in Chicago, Illinois PATIENTS: Inpatients admitted to NMH from January 1, 2013 to May 1, 2013 and from October 1, 2014 to April 1, 2015 were eligible for evaluation. Using the define-measure-analyze-improve-control (DMAIC) process-improvement methodology, a multidisciplinary team implemented and iteratively improved 15 data-driven interventions in 4 broad areas: (1) electronic medical record (EMR) alerts, (2) education initiatives, (3) new EMR order sets, and (4) other EMR changes. The Joint Commission's 6 core measures and the Surgical Care Improvement Project (SCIP) SCIP-VTE-2 measure. Based on 3103 observations (1679 from January 1, 2013 to May 1, 2013, and 1424 from October 1, 2014 to April 1, 2015), performance on the core measures improved. Performance on measure 1 (chemoprophylaxis) improved from 82.5% to 90.2% on medicine services, and from 94.4% to 97.6% on surgical services. The largest improvements were seen in measure 4 (platelet monitoring), with a performance increase from 76.7% adherence to 100%, and measure 5 (warfarin discharge instructions), with a performance increase from 27.4% to 88.8%. A systematic hospital-wide DMAIC project improved VTE prophylaxis measure performance. Sustained performance has been observed, and novel control mechanisms for continued performance surveillance have been embedded in the hospital system. Journal of Hospital Medicine 2016;11:S29-S37. © 2016 Society of Hospital Medicine. © 2016 Society of Hospital Medicine.
Human interaction with robotic systems: performance and workload evaluations.

PubMed

Reinerman-Jones, L; Barber, D J; Szalma, J L; Hancock, P A

2017-10-01

We first tested the effect of differing tactile informational forms (i.e. directional cues vs. static cues vs. dynamic cues) on objective performance and perceived workload in a collaborative human-robot task. A second experiment evaluated the influence of task load and informational message type (i.e. single words vs. grouped phrases) on that same collaborative task. In both experiments, the relationship of personal characteristics (attentional control and spatial ability) to performance and workload was also measured. In addition to objective performance and self-report of cognitive load, we evaluated different physiological responses in each experiment. Results showed a performance-workload association for directional cues, message type and task load. EEG measures however, proved generally insensitive to such task load manipulations. Where significant EEG effects were observed, right hemisphere amplitude differences predominated, although unexpectedly these latter relationships were negative. Although EEG measures were partially associated with performance, they appear to possess limited utility as measures of workload in association with tactile displays. Practitioner Summary: As practitioners look to take advantage of innovative tactile displays in complex operational realms like human-robotic interaction, associated performance effects are mediated by cognitive workload. Despite some patterns of association, reliable reflections of operator state can be difficult to discern and employ as the number, complexity and sophistication of these respective measures themselves increase.
[Study on the acquiring data time and intervals for measuring performance of air cleaner on formaldehyde].

PubMed

Tang, Zhigang; Wang, Guifang; Xu, Dongqun; Han, Keqin; Li, Yunpu; Zhang, Aijun; Dong, Xiaoyan

2004-09-01

The measuring time and measuring intervals to evaluate different type of air cleaner performance to remove formaldehyde were provided. The natural decay measurement and formaldehyde removal measurement were conducted in 1.5 m3 and 30 m3 test chamber. The natural decay rate was determined by acquiring formaldehyde concentration data at 15 minute intervals for 2.5 hours. The measured decay rate was determined by acquiring formaldehyde concentration data at 5 minute intervals for 1.2 hours. When the wind power of air cleaner is smaller than 30 m3/h or measuring performance of no wind power air clearing product, the 1.5 m3 test chamber can be used. Both the natural decay rate and the measured decay rate are determined by acquiring formaldehyde concentration data at 8 minute intervals for 64 minutes. There were different measuring time and measuring intervals to evaluate different type of air cleaner performance to remove formaldehyde.
Evaluation schemes for video and image anomaly detection algorithms

NASA Astrophysics Data System (ADS)

Parameswaran, Shibin; Harguess, Josh; Barngrover, Christopher; Shafer, Scott; Reese, Michael

2016-05-01

Video anomaly detection is a critical research area in computer vision. It is a natural first step before applying object recognition algorithms. There are many algorithms that detect anomalies (outliers) in videos and images that have been introduced in recent years. However, these algorithms behave and perform differently based on differences in domains and tasks to which they are subjected. In order to better understand the strengths and weaknesses of outlier algorithms and their applicability in a particular domain/task of interest, it is important to measure and quantify their performance using appropriate evaluation metrics. There are many evaluation metrics that have been used in the literature such as precision curves, precision-recall curves, and receiver operating characteristic (ROC) curves. In order to construct these different metrics, it is also important to choose an appropriate evaluation scheme that decides when a proposed detection is considered a true or a false detection. Choosing the right evaluation metric and the right scheme is very critical since the choice can introduce positive or negative bias in the measuring criterion and may favor (or work against) a particular algorithm or task. In this paper, we review evaluation metrics and popular evaluation schemes that are used to measure the performance of anomaly detection algorithms on videos and imagery with one or more anomalies. We analyze the biases introduced by these by measuring the performance of an existing anomaly detection algorithm.
An Examination of Performance-Based Teacher Evaluation Systems in Five States. Summary. Issues & Answers. REL 2012-No. 129

ERIC Educational Resources Information Center

Shakman, Karen; Riordan, Julie; Sanchez, Maria Teresa; Cook, Kyle DeMeo; Fournier, Richard; Brett, Jessica

2012-01-01

This study reports on performance-based teacher evaluation systems in five states that have implemented such systems. It investigates two primary research questions: (1) What are the key characteristics of state-level performance-based teacher evaluation systems in the study states?; and (2) How do state teacher evaluation measures, the teaching…
A Safety Index and Method for Flightdeck Evaluation

NASA Technical Reports Server (NTRS)

Latorella, Kara A.

2000-01-01

If our goal is to improve safety through machine, interface, and training design, then we must define a metric of flightdeck safety that is usable in the design process. Current measures associated with our notions of "good" pilot performance and ultimate safety of flightdeck performance fail to provide an adequate index of safe flightdeck performance for design evaluation purposes. The goal of this research effort is to devise a safety index and method that allows us to evaluate flightdeck performance holistically and in a naturalistic experiment. This paper uses Reason's model of accident causation (1990) as a basis for measuring safety, and proposes a relational database system and method for 1) defining a safety index of flightdeck performance, and 2) evaluating the "safety" afforded by flightdeck performance for the purpose of design iteration. Methodological considerations, limitations, and benefits are discussed as well as extensions to this work.
Research Frontiers in Public Sector Performance Measurement

NASA Astrophysics Data System (ADS)

Zhonghua, Cai; Ye, Wang

In "New Public Management" era, performance measurement has been widely used in managerial practices of public sectors. From the content and features of performance measurement, this paper aims to explore inspirations on Chinese public sector performance measurement, which based on a review of prior literatures including influencial factors, methods and indicators of public sector performance evaluation. In the end, arguments are presented in this paper pointed out the direction of future researches in this field.
Study of Adaptive Mathematical Models for Deriving Automated Pilot Performance Measurement Techniques. Volume I. Model Development.

ERIC Educational Resources Information Center

Connelly, Edward A.; And Others

A new approach to deriving human performance measures and criteria for use in automatically evaluating trainee performance is documented in this report. The ultimate application of the research is to provide methods for automatically measuring pilot performance in a flight simulator or from recorded in-flight data. An efficient method of…
Are Improvements in Measured Performance Driven by Better Treatment or "Denominator Management"?

PubMed

Harris, Alex H S; Chen, Cheng; Rubinsky, Anna D; Hoggatt, Katherine J; Neuman, Matthew; Vanneman, Megan E

2016-04-01

Process measures of healthcare quality are usually formulated as the number of patients who receive evidence-based treatment (numerator) divided by the number of patients in the target population (denominator). When the systems being evaluated can influence which patients are included in the denominator, it is reasonable to wonder if improvements in measured quality are driven by expanding numerators or contracting denominators. In 2003, the US Department of Veteran Affairs (VA) based executive compensation in part on performance on a substance use disorder (SUD) continuity-of-care quality measure. The first goal of this study was to evaluate if implementing the measure in this way resulted in expected improvements in measured performance. The second goal was to examine if the proportion of patients with SUD who qualified for the denominator contracted after the quality measure was implemented, and to describe the facility-level variation in and correlates of denominator contraction or expansion. Using 40 quarters of data straddling the implementation of the performance measure, an interrupted time series design was used to evaluate changes in two outcomes. All veterans with an SUD diagnosis in all VA facilities from fiscal year 2000 to 2009. The two outcomes were 1) measured performance-patients retained/patients qualified and 2) denominator prevalence-patients qualified/patients with SUD program contact. Measured performance improved over time (P < 0.001). Notably, the proportion of patients with SUD program contact who qualified for the denominator decreased more rapidly after the measure was implemented (p = 0.02). Facilities with higher pre-implementation denominator prevalence had steeper declines in denominator prevalence after implementation (p < 0.001). These results should motivate the development of measures that are less vulnerable to denominator management, and also the exploration of "shadow measures" to monitor and reduce undesirable denominator management.
Report on the State of Development, Availability, Evaluation, and Future use of Test Kits for the Measurement of Lead in Paint

EPA Science Inventory

The purpose of this issue paper is to address the availability and performance characteristics of portable lead test kits especially suited for lead in paint, procedures for evaluating the performance of these test kits, and the availability of performance evaluation (PE) materia...
MTF Database: A Repository of Students' Academic Performance Measurements for the Development of Techniques for Evaluating Team Functioning

ERIC Educational Resources Information Center

Hsiung, Chin-Min; Zheng, Xiang-Xiang

2015-01-01

The Measurements for Team Functioning (MTF) database contains a series of student academic performance measurements obtained at a national university in Taiwan. The measurements are acquired from unit tests and homework tests performed during a core mechanical engineering course, and provide an objective means of assessing the functioning of…
Characteristic Evaluation on Cooling Performance of Thermoelectric Modules.

PubMed

Seo, Sae Rom; Han, Seungwoo

2015-10-01

The aim of this work is to develop a performance evaluation system for thermoelectric cooling modules. We describe the design of such a system, composed of a vacuum chamber with a heat sink along with a metal block to measure the absorbed heat Qc. The system has a simpler structure than existing water-cooled or air-cooled systems. The temperature difference between the cold and hot sides of the thermoelectric module ΔT can be accurately measured without any effects due to convection, and the temperature equilibrium time is minimized compared to a water-cooled system. The evaluation system described here can be used to measure characteristic curves of Qc as a function of ΔT, as well as the current-voltage relations. High-performance thermoelectric systems can therefore be developed using optimal modules evaluated with this system.
Workload - An examination of the concept

NASA Technical Reports Server (NTRS)

Gopher, Daniel; Donchin, Emanuel

1986-01-01

The relations between task difficulty and workload and workload and performance are examined. The architecture and limitations of the central processor are discussed. Various procedures for measuring workload are described and evaluated. Consideration is given to normative and descriptive approaches; subjective, performance, and arousal measures; performance operating characteristics; and psychophysiological measures of workload.
77 FR 51762 - Proposed Information Collection; Comment Request; Economic Surveys for U.S. Commercial Fisheries

Federal Register 2010, 2011, 2012, 2013, 2014

2012-08-27

... through primary processing; (2) to analyze the economic performance effects of current management measures; and (3) to analyze the economic performance effects of alternative management measures. The measures... used to track economic performance and to evaluate the economic effects of alternative management...
Using satellite observations in performance evaluation for regulatory air quality modeling: Comparison with ground-level measurements

NASA Astrophysics Data System (ADS)

Odman, M. T.; Hu, Y.; Russell, A.; Chai, T.; Lee, P.; Shankar, U.; Boylan, J.

2012-12-01

Regulatory air quality modeling, such as State Implementation Plan (SIP) modeling, requires that model performance meets recommended criteria in the base-year simulations using period-specific, estimated emissions. The goal of the performance evaluation is to assure that the base-year modeling accurately captures the observed chemical reality of the lower troposphere. Any significant deficiencies found in the performance evaluation must be corrected before any base-case (with typical emissions) and future-year modeling is conducted. Corrections are usually made to model inputs such as emission-rate estimates or meteorology and/or to the air quality model itself, in modules that describe specific processes. Use of ground-level measurements that follow approved protocols is recommended for evaluating model performance. However, ground-level monitoring networks are spatially sparse, especially for particulate matter. Satellite retrievals of atmospheric chemical properties such as aerosol optical depth (AOD) provide spatial coverage that can compensate for the sparseness of ground-level measurements. Satellite retrievals can also help diagnose potential model or data problems in the upper troposphere. It is possible to achieve good model performance near the ground, but have, for example, erroneous sources or sinks in the upper troposphere that may result in misleading and unrealistic responses to emission reductions. Despite these advantages, satellite retrievals are rarely used in model performance evaluation, especially for regulatory modeling purposes, due to the high uncertainty in retrievals associated with various contaminations, for example by clouds. In this study, 2007 was selected as the base year for SIP modeling in the southeastern U.S. Performance of the Community Multiscale Air Quality (CMAQ) model, at a 12-km horizontal resolution, for this annual simulation is evaluated using both recommended ground-level measurements and non-traditional satellite retrievals. Evaluation results are assessed against recommended criteria and peer studies in the literature. Further analysis is conducted, based upon these assessments, to discover likely errors in model inputs and potential deficiencies in the model itself. Correlations as well as differences in input errors and model deficiencies revealed by ground-level measurements versus satellite observations are discussed. Additionally, sensitivity analyses are employed to investigate errors in emission-rate estimates using either ground-level measurements or satellite retrievals, and the results are compared against each other considering observational uncertainties. Recommendations are made for how to effectively utilize satellite retrievals in regulatory air quality modeling.

Use of Latent Class Analysis to define groups based on validity, cognition, and emotional functioning.

PubMed

Morin, Ruth T; Axelrod, Bradley N

Latent Class Analysis (LCA) was used to classify a heterogeneous sample of neuropsychology data. In particular, we used measures of performance validity, symptom validity, cognition, and emotional functioning to assess and describe latent groups of functioning in these areas. A data-set of 680 neuropsychological evaluation protocols was analyzed using a LCA. Data were collected from evaluations performed for clinical purposes at an urban medical center. A four-class model emerged as the best fitting model of latent classes. The resulting classes were distinct based on measures of performance validity and symptom validity. Class A performed poorly on both performance and symptom validity measures. Class B had intact performance validity and heightened symptom reporting. The remaining two Classes performed adequately on both performance and symptom validity measures, differing only in cognitive and emotional functioning. In general, performance invalidity was associated with worse cognitive performance, while symptom invalidity was associated with elevated emotional distress. LCA appears useful in identifying groups within a heterogeneous sample with distinct performance patterns. Further, the orthogonal nature of performance and symptom validities is supported.
Gum-compliant uncertainty propagations for Pu and U concentration measurements using the 1st-prototype XOS/LANL hiRX instrument; an SRNL H-Canyon Test Bed performance evaluation project

DOE Office of Scientific and Technical Information (OSTI.GOV)

Holland, Michael K.; O'Rourke, Patrick E.

An SRNL H-Canyon Test Bed performance evaluation project was completed jointly by SRNL and LANL on a prototype monochromatic energy dispersive x-ray fluorescence instrument, the hiRX. A series of uncertainty propagations were generated based upon plutonium and uranium measurements performed using the alpha-prototype hiRX instrument. Data reduction and uncertainty modeling provided in this report were performed by the SRNL authors. Observations and lessons learned from this evaluation were also used to predict the expected uncertainties that should be achievable at multiple plutonium and uranium concentration levels provided instrument hardware and software upgrades being recommended by LANL and SRNL are performed.
Objective measures of situation awareness in a simulated medical environment

PubMed Central

Wright, M; Taekman, J; Endsley, M

2004-01-01

One major limitation in the use of human patient simulators is a lack of objective, validated measures of human performance. Objective measures are necessary if simulators are to be used to evaluate the skills and training of medical practitioners and teams or to evaluate the impact of new processes or equipment design on overall system performance. Situation awareness (SA) refers to a person's perception and understanding of their dynamic environment. This awareness and comprehension is critical in making correct decisions that ultimately lead to correct actions in medical care settings. An objective measure of SA may be more sensitive and diagnostic than traditional performance measures. This paper reviews a theory of SA and discusses the methods required for developing an objective measure of SA within the context of a simulated medical environment. Analysis and interpretation of SA data for both individual and team performance in health care are also presented. PMID:15465958
Skylab experiment performance evaluation manual. Appendix S: Experiment T027 contamination measurement sample array (MSFC)

NASA Technical Reports Server (NTRS)

Tonetti, B. B.

1973-01-01

Analyses for Experiment T027, Contamination Measurement Sample Array (MSFC), to be used for evaluating the performance of the Skylab corrollary experiments under preflight, inflight, and post-flight conditions are presented. Experiment contingency plan workaround procedure and malfunction analyses are presented in order to assist in making the experiment operationally successful.
Measurement issues in the evaluation of chronic disease self-management programs.

PubMed

Nolte, Sandra; Elsworth, Gerald R; Newman, Stanton; Osborne, Richard H

2013-09-01

To provide an in-depth analysis of outcome measures used in the evaluation of chronic disease self-management programs consistent with the Stanford curricula. Based on a systematic review on self-management programs, effect sizes derived from reported outcome measures are categorized according to the quality of life appraisal model developed by Schwartz and Rapkin which classifies outcomes from performance-based measures (e.g., clinical outcomes) to evaluation-based measures (e.g., emotional well-being). The majority of outcomes assessed in self-management trials are based on evaluation-based methods. Overall, effects on knowledge--the only performance-based measure observed in selected trials--are generally medium to large. In contrast, substantially more inconsistent results are found for both perception- and evaluation-based measures that mostly range between nil and small positive effects. Effectiveness of self-management interventions and resulting recommendations for health policy makers are most frequently derived from highly variable evaluation-based measures, that is, types of outcomes that potentially carry a substantial amount of measurement error and/or bias such as response shift. Therefore, decisions regarding the value and efficacy of chronic disease self-management programs need to be interpreted with care. More research, especially qualitative studies, is needed to unravel cognitive processes and the role of response shift bias in the measurement of change.
Evaluating Innovations in Home Care for Performance Accountability.

PubMed

Collister, Barbara; Gutscher, Abram; Ambrogiano, Jana

2016-01-01

Concerns about rising costs and the sustainability of our healthcare system have led to a drive for innovative solutions and accountability for performance. Integrated Home Care, Calgary Zone, Alberta Health Services went beyond traditional accountability measures to use evaluation methodology to measure the progress of complex innovations to its organization structure and service delivery model. This paper focuses on the first two phases of a three-phase evaluation. The results of the first two phases generated learning about innovation adoption and sustainability, and performance accountability at the program-level of a large publicly funded healthcare organization.
Evaluating Robotic Surgical Skills Performance Under Distractive Environment Using Objective and Subjective Measures.

PubMed

Suh, Irene H; LaGrange, Chad A; Oleynikov, Dmitry; Siu, Ka-Chun

2016-02-01

Distractions are recognized as a significant factor affecting performance in safety critical domains. Although operating rooms are generally full of distractions, the effect of distractions on robot-assisted surgical (RAS) performance is unclear. Our aim was to investigate the effect of distractions on RAS performance using both objective and subjective measures. Fifteen participants performed a knot-tying task using the da Vinci Surgical System and were exposed to 3 distractions: (1) passive distraction entailed listening to noise with a constant heart rate, (2) active distraction included listening to noise and acknowledging a change of random heart rate from 60 to 120 bpm, and (3) interactive distraction consisted of answering math questions. The objective kinematics of the surgical instrument tips were used to evaluate performance. Electromyography (EMG) of the forearm and hand muscles of the participants were collected. The median EMG frequency (EMG(fmed)) and the EMG envelope (EMG(env)) were analyzed. NASA Task Load Index and Fundamentals of Laparoscopic Surgery score were used to evaluate the subjective performance. One-way repeated analysis of variance was applied to examine the effects of distraction on skills performance. Spearman's correlations were conducted to compare objective and subjective measures. Significant distraction effect was found for all objective kinematics measures (P < .05). There were significant distraction effects for EMG measures (EMG(env), P < .004; EMG(fmed), P = .031). Significant distraction effects were also found for subjective measurements. Distraction impairs surgical skills performance and increases muscle work. Understanding how the surgeons cope with distractions is important in developing surgical education. © The Author(s) 2015.
Pulmonary tumor measurements from x-ray computed tomography in one, two, and three dimensions.

PubMed

Villemaire, Lauren; Owrangi, Amir M; Etemad-Rezai, Roya; Wilson, Laura; O'Riordan, Elaine; Keller, Harry; Driscoll, Brandon; Bauman, Glenn; Fenster, Aaron; Parraga, Grace

2011-11-01

We evaluated the accuracy and reproducibility of three-dimensional (3D) measurements of lung phantoms and patient tumors from x-ray computed tomography (CT) and compared these to one-dimensional (1D) and two-dimensional (2D) measurements. CT images of three spherical and three irregularly shaped tumor phantoms were evaluated by three observers who performed five repeated measurements. Additionally, three observers manually segmented 29 patient lung tumors five times each. Follow-up imaging was performed for 23 tumors and response criteria were compared. For a single subject, imaging was performed on nine occasions over 2 years to evaluate multidimensional tumor response. To evaluate measurement accuracy, we compared imaging measurements to ground truth using analysis of variance. For estimates of precision, intraobserver and interobserver coefficients of variation and intraclass correlations (ICC) were used. Linear regression and Pearson correlations were used to evaluate agreement and tumor response was descriptively compared. For spherical shaped phantoms, all measurements were highly accurate, but for irregularly shaped phantoms, only 3D measurements were in high agreement with ground truth measurements. All phantom and patient measurements showed high intra- and interobserver reproducibility (ICC >0.900). Over a 2-year period for a single patient, there was disagreement between tumor response classifications based on 3D measurements and those generated using 1D and 2D measurements. Tumor volume measurements were highly reproducible and accurate for irregular, spherical phantoms and patient tumors with nonuniform dimensions. Response classifications obtained from multidimensional measurements suggest that 3D measurements provide higher sensitivity to tumor response. Copyright © 2011 AUR. Published by Elsevier Inc. All rights reserved.
[Assessment comparison between area sampling and personal sampling noise measurement in new thermal power plant].

PubMed

Zhang, Hua; Chen, Qing-song; Li, Nan; Hua, Yan; Zeng, Lin; Xu, Guo-yang; Tao, Li-yuan; Zhao, Yi-ming

2013-05-01

To compare the results of noise hazard evaluations based on area sampling and personal sampling in a new thermal power plant and to analyze the similarities and differences between the two measurement methods. According to Measurement of Physical agents in Workplace Part 8: Noise(GBZff 189.8-2007), area sampling was performed at various operating points for noise measurement, and meanwhile the workers under different types of work wore noise dosimeters for personal noise exposure measurement. The two measurement methods were used to evaluate the level of noise hazards in the enterprise according to the corresponding occupational health standards, and the evaluation results were compared. Area sampling was performed at 99 operating points, the mean noise level was 88.9 ± 11.1 dB (A)(range, 51.3-107.0 dB (A)), with an over-standard rate of 75.8%. Personal sampling was performed (73 person times),and the mean noise level was 79.3 ± 6.3 dB (A), with an over-standard rate of 6.6% ( 16/241 ). There was a statistically significant difference in the over-standard rate between the evaluation results of the two measurement methods ( x2=53.869, ?<0.001 ). Because of the characteristics of the work in new thermal power plants, the noise hazard evaluation based on area sampling cannot be used instead of personal noise exposure measurement among workers. Personal sampling should be used in the noise measurement in new thermal power plant.
Employee Performance in the Context of the Problems of Measurement and Evaluation in Practice

NASA Astrophysics Data System (ADS)

Szabó, Peter; Mĺkva, Miroslava; Vaňová, Jaromíra; Marková, Petra

2017-09-01

Employee performance is a condition and an assumption for the performance and success of a company on the market. In order to ensure competitive ability, the quality of human resources, their management, and related measurement and performance assessment are at the forefront of company interest. Employee assessment affects the performance, development and motivation of people and also provides the necessary information about the employees. It allows the organization to monitor employee performance and compare their work with other collaborators. Many companies have the problem of setting up evaluation system so that it carried itself elements of responsibility and objectivity. The result of conceptual work in this area is the ultimate use of tools whose deployment, if possible, motivates employees to perform better. The aim of the paper is to refer to problems that arise in companies in evaluating the performance of employees.
Value-Added Measures of Education Performance: Clearing Away the Smoke and Mirrors. Policy Brief 10-4

ERIC Educational Resources Information Center

Harris, Douglas N.

2010-01-01

In this policy brief, the author explores the problems with attainment measures when it comes to evaluating performance at the school level, and explores the best uses of value-added measures. These value-added measures, the author writes, are useful for sorting out-of-school influences from school influences or from teacher performance, giving…
Experimental evaluations of wearable ECG monitor.

PubMed

Ha, Kiryong; Kim, Youngsung; Jung, Junyoung; Lee, Jeunwoo

2008-01-01

Healthcare industry is changing with ubiquitous computing environment and wearable ECG measurement is one of the most popular approaches in this healthcare industry. Reliability and performance of healthcare device is fundamental issue for widespread adoptions, and interdisciplinary perspectives of wearable ECG monitor make this more difficult. In this paper, we propose evaluation criteria considering characteristic of both ECG measurement and ubiquitous computing. With our wearable ECG monitors, various levels of experimental analysis are performed based on evaluation strategy.
Reliability and Validity of the Professional Counseling Performance Evaluation

ERIC Educational Resources Information Center

Shepherd, J. Brad; Britton, Paula J.; Kress, Victoria E.

2008-01-01

The definition and measurement of counsellor trainee competency is an issue that has received increased attention yet lacks quantitative study. This research evaluates item responses, scale reliability and intercorrelations, interrater agreement, and criterion-related validity of the Professional Performance Fitness Evaluation/Professional…
Minimum detectable gas concentration performance evaluation method for gas leak infrared imaging detection systems.

PubMed

Zhang, Xu; Jin, Weiqi; Li, Jiakun; Wang, Xia; Li, Shuo

2017-04-01

Thermal imaging technology is an effective means of detecting hazardous gas leaks. Much attention has been paid to evaluation of the performance of gas leak infrared imaging detection systems due to several potential applications. The minimum resolvable temperature difference (MRTD) and the minimum detectable temperature difference (MDTD) are commonly used as the main indicators of thermal imaging system performance. This paper establishes a minimum detectable gas concentration (MDGC) performance evaluation model based on the definition and derivation of MDTD. We proposed the direct calculation and equivalent calculation method of MDGC based on the MDTD measurement system. We build an experimental MDGC measurement system, which indicates the MDGC model can describe the detection performance of a thermal imaging system to typical gases. The direct calculation, equivalent calculation, and direct measurement results are consistent. The MDGC and the minimum resolvable gas concentration (MRGC) model can effectively describe the performance of "detection" and "spatial detail resolution" of thermal imaging systems to gas leak, respectively, and constitute the main performance indicators of gas leak detection systems.
45 CFR 2522.620 - How do I report my performance measures to the Corporation?

Code of Federal Regulations, 2012 CFR

2012-10-01

... 45 Public Welfare 4 2012-10-01 2012-10-01 false How do I report my performance measures to the Corporation? 2522.620 Section 2522.620 Public Welfare Regulations Relating to Public Welfare (Continued) CORPORATION FOR NATIONAL AND COMMUNITY SERVICE AMERICORPS PARTICIPANTS, PROGRAMS, AND APPLICANTS Evaluation Requirements Performance Measures:...
Early seizure detection in an animal model of temporal lobe epilepsy

NASA Astrophysics Data System (ADS)

Talathi, Sachin S.; Hwang, Dong-Uk; Ditto, William; Carney, Paul R.

2007-11-01

The performance of five seizure detection schemes, i.e., Nonlinear embedding delay, Hurst scaling, Wavelet Scale, autocorrelation and gradient of accumulated energy, in their ability to detect EEG seizures close to the seizure onset time were evaluated to determine the feasibility of their application in the development of a real time closed loop seizure intervention program (RCLSIP). The criteria chosen for the performance evaluation were, high statistical robustness as determined through the predictability index, the sensitivity and the specificity of a given measure to detect an EEG seizure, the lag in seizure detection with respect to the EEG seizure onset time, as determined through visual inspection and the computational efficiency for each detection measure. An optimality function was designed to evaluate the overall performance of each measure dependent on the criteria chosen. While each of the above measures analyzed for seizure detection performed very well in terms of the statistical parameters, the nonlinear embedding delay measure was found to have the highest optimality index due to its ability to detect seizure very close to the EEG seizure onset time, thereby making it the most suitable dynamical measure in the development of RCLSIP in rat model with chronic limbic epilepsy.
Measurements and Predictions for a Distributed Exhaust Nozzle

NASA Technical Reports Server (NTRS)

Kinzie, Kevin W.; Brown, Martha C.; Schein, David B.; Solomon, W. David, Jr.

2001-01-01

The acoustic and aerodynamic performance characteristics of a distributed exhaust nozzle (DEN) design concept were evaluated experimentally and analytically with the purpose of developing a design methodology for developing future DEN technology. Aerodynamic and acoustic measurements were made to evaluate the DEN performance and the CFD design tool. While the CFD approach did provide an excellent prediction of the flowfield and aerodynamic performance characteristics of the DEN and 2D reference nozzle, the measured acoustic suppression potential of this particular DEN was low. The measurements and predictions indicated that the mini-exhaust jets comprising the distributed exhaust coalesced back into a single stream jet very shortly after leaving the nozzles. Even so, the database provided here will be useful for future distributed exhaust designs with greater noise reduction and aerodynamic performance potential.
Development of System-level Performance Measures for Evaluation of Models of Care for Inflammatory Arthritis in Canada.

PubMed

Barber, Claire E H; Marshall, Deborah A; Mosher, Dianne P; Akhavan, Pooneh; Tucker, Lori; Houghton, Kristin; Batthish, Michelle; Levy, Deborah M; Schmeling, Heinrike; Ellsworth, Janet; Tibollo, Heidi; Grant, Sean; Khodyakov, Dmitry; Lacaille, Diane

2016-03-01

To develop system-level performance measures for evaluating the care of patients with inflammatory arthritis (IA), including rheumatoid arthritis (RA), psoriatic arthritis, ankylosing spondylitis, and juvenile idiopathic arthritis. This study involved several methodological phases. Over multiple rounds, various participants were asked to help define a set of candidate measurement themes. A systematic search was conducted of existing guidelines and measures. A set of 6 performance measures was defined and presented to 50 people, including patients with IA, rheumatologists, allied health professionals, and researchers using a 3-round, online, modified Delphi process. Participants rated the validity, feasibility, relevance, and likelihood of use of the measures. Measures with median ratings ≥ 7 for validity and relevance were included in the final set. Six performance measures were developed evaluating the following aspects of care, with each measure being applied separately for each type of IA except where specified: waiting times for rheumatology consultation for patients with new onset IA, percentage of patients with IA seen by a rheumatologist, percentage of patients with IA seen in yearly followup by a rheumatologist, percentage of patients with RA treated with a disease-modifying antirheumatic drug (DMARD), time to DMARD therapy in RA, and number of rheumatologists per capita. The first set of system-level performance measures for IA care in Canada has been developed with broad input. The measures focus on timely access to care and initiation of appropriate treatment for patients with IA, and are likely to be of interest to other arthritis care systems internationally.
Developing and applying mobility performance measures for freight transportation in urban areas.

DOT National Transportation Integrated Search

2010-12-01

This report summarizes the activities performed in a one-year study with the objective to develop an : understanding of the interrelationships of urban goods movement and congestion and identify performance : measures that will help evaluate the impa...
Questionnaire Evaluating Teaching Competencies in the University Environment. Evaluation of Teaching Competencies in the University

ERIC Educational Resources Information Center

Moreno-Murcia, Juan Antonio; Silveira Torregrosa, Yolanda; Belando Pedreño, Noelia

2015-01-01

The objective of this study was to design and validate a measuring instrument to evaluate the performance of university professors. The Evaluation of Teaching Performance (CEID [Centro de Estudios e Investigaciones Docentes (Center for Teaching Studies and Research)]) questionnaire was administered to 1297 university students. Various factor…

Performance Measurement for Substance Abuse Treatment Services. Integrated Evaluation Methods. Revised.

ERIC Educational Resources Information Center

Harwood, Henrick; Bazron, Barbara; Fountain, Douglas

This paper presents state-of-the-art models addressing issues related to coordination of treatment and evaluation activities, and integration of clinical, performance, and evaluation information. Specifically, this concept paper contains a discussion of the need for and types of cost analyses for CSAT treatment evaluation and knowledge-generating…
Validation and Evaluation of Army Aviation Collective Performance Measures

DTIC Science & Technology

2014-01-01

Research Report 1972 Validation and Evaluation of Army Aviation Collective Performance Measures Martin L. Bink U.S. Army...United States Army Research Institute for the Behavioral and Social Sciences Approved for public release; distribution is unlimited. U.S. Army...Research Institute for the Behavioral and Social Sciences Department of the Army Deputy Chief of Staff, G1 Authorized and approved for
Teachers as Strategic Classroom Leaders: The Relationship of Their Cognitive and Behavioral Agility to Student Outcomes and Performance Evaluations

ERIC Educational Resources Information Center

Warkentien, Michael

2016-01-01

The purpose of this non-experimental study was to determine whether teacher cognitive and behavioral agility relates to student achievement as measured by their value-added model (VAM) score and their performance evaluation measured through the Marzano instructional practice (IP) framework, and whether that relationship is moderated by contextual…
Transportation performance measures for outcome based system management and monitoring.

DOT National Transportation Integrated Search

2014-09-01

The Oregon Department of Transportation (ODOT) is mature in its development and use of : performance measures, however there was not a standard approach for selecting measures nor : evaluating if existing ones were used to inform decision-making. Thi...
More than a score: a qualitative study of ancillary benefits of performance measurement.

PubMed

Powell, Adam A; White, Katie M; Partin, Melissa R; Halek, Krysten; Hysong, Sylvia J; Zarling, Edwin; Kirsh, Susan R; Bloomfield, Hanna E

2014-08-01

Prior research has examined clinical effects of performance measurement systems. To the extent that non-clinical effects have been researched, the focus has been on negative unintended consequences. Yet, these same systems may also have ancillary benefits for patients and providers--that is, benefits that extend beyond improvements on clinical measures. The purpose of this study is to identify and describe potential ancillary benefits of performance measures as perceived by primary care staff and facility leaders in a large US healthcare system. In-person individual semistructured interviews were conducted with 59 primary care staff and facility leaders at four Veterans Health Administration facilities. Transcribed interviews were coded and organised into thematic categories. Interviewed staff observed that local performance measurement implementation practices can result in increased patient knowledge and motivation. These effects on patients can lead to improved performance scores and additional ancillary benefits. Performance measurement implementation can also directly result in ancillary benefits for the patients and providers. Patients may experience greater satisfaction with care and psychosocial benefits associated with increased provider-patient communication. Ancillary benefits of performance measurement for providers include increased pride in individual or organisational performance and greater confidence that one's practice is grounded in evidence-based medicine. A comprehensive understanding of the effects of performance measurement systems needs to incorporate ancillary benefits as well as effects on clinical performance scores and negative unintended consequences. Although clinical performance has been the focus of most evaluations of performance measurement to date, both patient care and provider satisfaction may improve more rapidly if all three categories of effects are considered when designing and evaluating performance measurement systems. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Crew Selection and Training

NASA Technical Reports Server (NTRS)

Helmreich, Robert L.

1996-01-01

This research addressed a number of issues relevant to the performance of teams in demanding environments. Initial work, conducted in the aviation analog environment, focused on developing new measures of performance related attitudes and behaviors. The attitude measures were used to assess acceptance of concepts related to effective teamwork and personal capabilities under stress. The behavioral measures were used to evaluate the effectiveness of flight crews operating in commercial aviation. Assessment of team issues in aviation led further to the evaluation and development of training to enhance team performance. Much of the work addressed evaluation of the effectiveness of such training, which has become known as Crew Resource Management (CRM). A second line of investigation was into personality characteristics that predict performance in challenging environments such as aviation and space. A third line of investigation of team performance grew out of the study of flight crews in different organizations. This led to the development of a theoretical model of crew performance that included not only individual attributes such as personality and ability, but also organizational and national culture. A final line of investigation involved beginning to assess whether the methodologies and measures developed for the aviation analog could be applied to another domain -- the performance of medical teams working in the operating room.
Model Performance Evaluation and Scenario Analysis (MPESA) Tutorial

EPA Pesticide Factsheets

The model performance evaluation consists of metrics and model diagnostics. These metrics provides modelers with statistical goodness-of-fit measures that capture magnitude only, sequence only, and combined magnitude and sequence errors.
Orchestra Festival Evaluations: Interjudge Agreement and Relationships between Performance Categories and Final Ratings.

ERIC Educational Resources Information Center

Garman, Barry R.; And Others

1991-01-01

Band, orchestra, and choir festival evaluations are a regular part of many secondary school music programs, and most such festivals engage adjudicators who rate each group's performance. Because music ensemble performance is complex and multi-dimensional, it does not lend itself readily to precise measurement; generally, musical performances are…
Implementation and evaluation of a dilation and evacuation simulation training curriculum.

PubMed

York, Sloane L; McGaghie, William C; Kiley, Jessica; Hammond, Cassing

2016-06-01

To evaluate obstetrics and gynecology resident physicians' performance following a simulation curriculum on dilation and evacuation (D&E) procedures. This study included two phases: simulation curriculum development and resident physician performance evaluation following training on a D&E simulator. Trainees participated in two evaluations. Simulation training evaluated participants performing six cases on a D&E simulator, measuring procedural time and a 26-step checklist of D&E steps. The operative training portion evaluated residents' performance after training on the simulator using mastery learning techniques. Intra-operative evaluation was based on a 21-step checklist score, Objective Structured Assessment of Technical Skills (OSATS), and percentage of cases completed. Twenty-two residents participated in simulation training, demonstrating improved performance from cases one and two to cases five and six, as measured by checklist score and procedural time (p<.001 and p=.001, respectively). Of 10 participants in the operative training, all performed at least three D&Es, while seven performed at least six cases. While checklist scores did not change significantly from the first to sixth case (mean for first case: 18.3; for sixth case: 19.6; p=.593), OSATS ratings improved from case one (19.7) to case three (23.5; p=.001) and to case six (26.8; p=.005). Trainees completed approximately 71.6% of their first case (range: 21.4-100%). By case six, the six participants performed 81.2% of the case (range: 14.3-100%). D&E simulation using a newly-developed uterine model and simulation curriculum improves resident technical skills. Simulation training with mastery learning techniques transferred to high level of performance in OR using checklist. The OSATS measured skills and showed improvement in performance with subsequent cases. Implementation of a D&E simulation curriculum offers potential for improved surgical training and abortion provision. Copyright © 2016 Elsevier Inc. All rights reserved.
Faculty performance evaluation in accredited U.S. public health graduate schools and programs: a national study.

PubMed

Gimbel, Ronald W; Cruess, David F; Schor, Kenneth; Hooper, Tomoko I; Barbour, Galen L

2008-10-01

To provide baseline data on evaluation of faculty performance in U.S. schools and programs of public health. The authors administered an anonymous Internet-based questionnaire using PHP Surveyor. The invited sample consisted of individuals listed in the Council on Education for Public Health (CEPH) Directory of Accredited Schools and Programs of Public Health. The authors explored performance measures in teaching, research, and service, and assessed how faculty performance measures are used. A total of 64 individuals (60.4%) responded to the survey, with 26 (40.6%) reporting accreditation/reaccreditation by CEPH within the preceding 24 months. Although all schools and programs employ faculty performance evaluations, a significant difference exists between schools and programs in the use of results for merit pay increases and mentoring purposes. Thirty-one (48.4%) of the organizations published minimum performance expectations. Fifty-nine (92.2%) of the respondents counted number of publications, but only 22 (34.4%) formally evaluated their quality. Sixty-two (96.9%) evaluated teaching through student course evaluations, and only 29 (45.3%) engaged in peer assessment. Although aggregate results of teaching evaluation are available to faculty and administrators, this information is often unavailable to students and the public. Most schools and programs documented faculty service activities qualitatively but neither assessed it quantitatively nor evaluated its impact. This study provides insight into how schools and programs of public health evaluate faculty performance. Results suggest that although schools and programs do evaluate faculty performance on a basic level, many do not devote substantial attention to this process.
Position Measurement Standard Evaluation

DOT National Transportation Integrated Search

1975-02-01

The objectives of the Position Measurement Standard Program were to collect navigation data from three DME receivers and a low-frequency GLOBAL Navigation system, and evaluate their relative performance against a reference radar. Flight test data dur...
A comprehensive performance evaluation on the prediction results of existing cooperative transcription factors identification algorithms.

PubMed

Lai, Fu-Jou; Chang, Hong-Tsun; Huang, Yueh-Min; Wu, Wei-Sheng

2014-01-01

Eukaryotic transcriptional regulation is known to be highly connected through the networks of cooperative transcription factors (TFs). Measuring the cooperativity of TFs is helpful for understanding the biological relevance of these TFs in regulating genes. The recent advances in computational techniques led to various predictions of cooperative TF pairs in yeast. As each algorithm integrated different data resources and was developed based on different rationales, it possessed its own merit and claimed outperforming others. However, the claim was prone to subjectivity because each algorithm compared with only a few other algorithms and only used a small set of performance indices for comparison. This motivated us to propose a series of indices to objectively evaluate the prediction performance of existing algorithms. And based on the proposed performance indices, we conducted a comprehensive performance evaluation. We collected 14 sets of predicted cooperative TF pairs (PCTFPs) in yeast from 14 existing algorithms in the literature. Using the eight performance indices we adopted/proposed, the cooperativity of each PCTFP was measured and a ranking score according to the mean cooperativity of the set was given to each set of PCTFPs under evaluation for each performance index. It was seen that the ranking scores of a set of PCTFPs vary with different performance indices, implying that an algorithm used in predicting cooperative TF pairs is of strength somewhere but may be of weakness elsewhere. We finally made a comprehensive ranking for these 14 sets. The results showed that Wang J's study obtained the best performance evaluation on the prediction of cooperative TF pairs in yeast. In this study, we adopted/proposed eight performance indices to make a comprehensive performance evaluation on the prediction results of 14 existing cooperative TFs identification algorithms. Most importantly, these proposed indices can be easily applied to measure the performance of new algorithms developed in the future, thus expedite progress in this research field.
Hardware Demonstration: Frequency Spectra of Transients

NASA Technical Reports Server (NTRS)

McCloskey, John; Dimov, Jen

2017-01-01

Radiated emissions measurements as specified by MIL-STD-461 are performed in the frequency domain, which is best suited to continuous wave (CW) types of signals. However, many platforms implement signals that are single event pulses or transients. Such signals can potentially generate momentary radiated emissions that can cause interference in the system, but they may be missed with traditional measurement techniques. This demonstration provides measurement and analysis techniques that effectively evaluate the potential emissions from such signals in order to evaluate their potential impacts to system performance.
Scout: An Impact Analysis Tool for Building Energy-Efficiency Technologies

DOE Office of Scientific and Technical Information (OSTI.GOV)

Harris, Chioke; Langevin, Jared; Roth, Amir

Evaluating the national impacts of candidate U.S. building energy-efficiency technologies has historically been difficult for organizations with large energy efficiency portfolios. In particular, normalizing results from technology-specific impact studies is time-consuming when those studies do not use comparable assumptions about the underlying building stock. To equitably evaluate its technology research, development, and deployment portfolio, the U.S. Department of Energy's Building Technologies Office has developed Scout, a software tool that quantitatively assesses the energy and CO2 impacts of building energy-efficiency measures on the national building stock. Scout efficiency measures improve upon the unit performance and/or lifetime operational costs of an equipmentmore » stock baseline that is determined from the U.S. Energy Information Administration Annual Energy Outlook (AEO). Scout measures are characterized by a market entry and exit year, unit performance level, cost, and lifetime. To evaluate measures on a consistent basis, Scout uses EnergyPlus simulation on prototype building models to translate measure performance specifications to whole-building energy savings; these savings impacts are then extended to a national scale using floor area weighting factors. Scout represents evolution in the building stock over time using AEO projections for new construction, retrofit, and equipment replacements, and competes technologies within market segments under multiple adoption scenarios. Scout and its efficiency measures are open-source, as is the EnergyPlus whole building simulation framework that is used to evaluate measure performance. The program is currently under active development and will be formally released once an initial set of measures has been analyzed and reviewed.« less
Formal implementation of a performance evaluation model for the face recognition system.

PubMed

Shin, Yong-Nyuo; Kim, Jason; Lee, Yong-Jun; Shin, Woochang; Choi, Jin-Young

2008-01-01

Due to usability features, practical applications, and its lack of intrusiveness, face recognition technology, based on information, derived from individuals' facial features, has been attracting considerable attention recently. Reported recognition rates of commercialized face recognition systems cannot be admitted as official recognition rates, as they are based on assumptions that are beneficial to the specific system and face database. Therefore, performance evaluation methods and tools are necessary to objectively measure the accuracy and performance of any face recognition system. In this paper, we propose and formalize a performance evaluation model for the biometric recognition system, implementing an evaluation tool for face recognition systems based on the proposed model. Furthermore, we performed evaluations objectively by providing guidelines for the design and implementation of a performance evaluation system, formalizing the performance test process.
Spectral contents readout of birefringent sensor

NASA Technical Reports Server (NTRS)

Redner, Alex S.

1989-01-01

The technical objective of this research program was to develop a birefringent sensor, capable of measuring strain/stress up to 2000 F and a readout system based on Spectral Contents analysis. As a result of the research work, a data acquisition system was developed, capable of measuring strain birefringence in a sensor at 2000 F, with multi-point static and dynamic capabilities. The system uses a dedicated spectral analyzer for evaluation of stress-birefringence and a PC-based readout. Several sensor methods were evaluated. Fused silica was found most satisfactory. In the final evaluation, measurements were performed up to 2000 F and the system performance exceeded expectations.
Blinded evaluation of interrater reliability of an operative competency assessment tool for direct laryngoscopy and rigid bronchoscopy.

PubMed

Ishman, Stacey L; Benke, James R; Johnson, Kaalan Erik; Zur, Karen B; Jacobs, Ian N; Thorne, Marc C; Brown, David J; Lin, Sandra Y; Bhatti, Nasir; Deutsch, Ellen S

2012-10-01

OBJECTIVES To confirm interrater reliability using blinded evaluation of a skills-assessment instrument to assess the surgical performance of resident and fellow trainees performing pediatric direct laryngoscopy and rigid bronchoscopy in simulated models. DESIGN Prospective, paired, blinded observational validation study. SUBJECTS Paired observers from multiple institutions simultaneously evaluated residents and fellows who were performing surgery in an animal laboratory or using high-fidelity manikins. The evaluators had no previous affiliation with the residents and fellows and did not know their year of training. INTERVENTIONS One- and 2-page versions of an objective structured assessment of technical skills (OSATS) assessment instrument composed of global and a task-specific surgical items were used to evaluate surgical performance. RESULTS Fifty-two evaluations were completed by 17 attending evaluators. The instrument agreement for the 2-page assessment was 71.4% when measured as a binary variable (ie, competent vs not competent) (κ = 0.38; P = .08). Evaluation as a continuous variable revealed a 42.9% percentage agreement (κ = 0.18; P = .14). The intraclass correlation was 0.53, considered substantial/good interrater reliability (69% reliable). For the 1-page instrument, agreement was 77.4% when measured as a binary variable (κ = 0.53, P = .0015). Agreement when evaluated as a continuous measure was 71.0% (κ = 0.54, P < .001). The intraclass correlation was 0.73, considered high interrater reliability (85% reliable). CONCLUSIONS The OSATS assessment instrument is an effective tool for evaluating surgical performance among trainees with acceptable interrater reliability in a simulator setting. Reliability was good for both the 1- and 2-page OSATS checklists, and both serve as excellent tools to provide immediate formative feedback on operational competency.
Quality of protection evaluation of security mechanisms.

PubMed

Ksiezopolski, Bogdan; Zurek, Tomasz; Mokkas, Michail

2014-01-01

Recent research indicates that during the design of teleinformatic system the tradeoff between the systems performance and the system protection should be made. The traditional approach assumes that the best way is to apply the strongest possible security measures. Unfortunately, the overestimation of security measures can lead to the unreasonable increase of system load. This is especially important in multimedia systems where the performance has critical character. In many cases determination of the required level of protection and adjustment of some security measures to these requirements increase system efficiency. Such an approach is achieved by means of the quality of protection models where the security measures are evaluated according to their influence on the system security. In the paper, we propose a model for QoP evaluation of security mechanisms. Owing to this model, one can quantify the influence of particular security mechanisms on ensuring security attributes. The methodology of our model preparation is described and based on it the case study analysis is presented. We support our method by the tool where the models can be defined and QoP evaluation can be performed. Finally, we have modelled TLS cryptographic protocol and presented the QoP security mechanisms evaluation for the selected versions of this protocol.
Objective Fidelity Evaluation in Multisensory Virtual Environments: Auditory Cue Fidelity in Flight Simulation

PubMed Central

Meyer, Georg F.; Wong, Li Ting; Timson, Emma; Perfect, Philip; White, Mark D.

2012-01-01

We argue that objective fidelity evaluation of virtual environments, such as flight simulation, should be human-performance-centred and task-specific rather than measure the match between simulation and physical reality. We show how principled experimental paradigms and behavioural models to quantify human performance in simulated environments that have emerged from research in multisensory perception provide a framework for the objective evaluation of the contribution of individual cues to human performance measures of fidelity. We present three examples in a flight simulation environment as a case study: Experiment 1: Detection and categorisation of auditory and kinematic motion cues; Experiment 2: Performance evaluation in a target-tracking task; Experiment 3: Transferrable learning of auditory motion cues. We show how the contribution of individual cues to human performance can be robustly evaluated for each task and that the contribution is highly task dependent. The same auditory cues that can be discriminated and are optimally integrated in experiment 1, do not contribute to target-tracking performance in an in-flight refuelling simulation without training, experiment 2. In experiment 3, however, we demonstrate that the auditory cue leads to significant, transferrable, performance improvements with training. We conclude that objective fidelity evaluation requires a task-specific analysis of the contribution of individual cues. PMID:22957068
Logic Modeling as a Tool to Prepare to Evaluate Disaster and Emergency Preparedness, Response, and Recovery in Schools

ERIC Educational Resources Information Center

Zantal-Wiener, Kathy; Horwood, Thomas J.

2010-01-01

The authors propose a comprehensive evaluation framework to prepare for evaluating school emergency management programs. This framework involves a logic model that incorporates Government Performance and Results Act (GPRA) measures as a foundation for comprehensive evaluation that complements performance monitoring used by the U.S. Department of…

Performance Evaluation of Target Detection with a Near-Space Vehicle-Borne Radar in Blackout Condition.

PubMed

Li, Yanpeng; Li, Xiang; Wang, Hongqiang; Deng, Bin; Qin, Yuliang

2016-01-06

Radar is a very important sensor in surveillance applications. Near-space vehicle-borne radar (NSVBR) is a novel installation of a radar system, which offers many benefits, like being highly suited to the remote sensing of extremely large areas, having a rapidly deployable capability and having low vulnerability to electronic countermeasures. Unfortunately, a target detection challenge arises because of complicated scenarios, such as nuclear blackout, rain attenuation, etc. In these cases, extra care is needed to evaluate the detection performance in blackout situations, since this a classical problem along with the application of an NSVBR. However, the existing evaluation measures are the probability of detection and the receiver operating curve (ROC), which cannot offer detailed information in such a complicated application. This work focuses on such requirements. We first investigate the effect of blackout on an electromagnetic wave. Performance evaluation indexes are then built: three evaluation indexes on the detection capability and two evaluation indexes on the robustness of the detection process. Simulation results show that the proposed measure will offer information on the detailed performance of detection. These measures are therefore very useful in detecting the target of interest in a remote sensing system and are helpful for both the NSVBR designers and users.
Performance Evaluation of Target Detection with a Near-Space Vehicle-Borne Radar in Blackout Condition

PubMed Central

Li, Yanpeng; Li, Xiang; Wang, Hongqiang; Deng, Bin; Qin, Yuliang

2016-01-01

Radar is a very important sensor in surveillance applications. Near-space vehicle-borne radar (NSVBR) is a novel installation of a radar system, which offers many benefits, like being highly suited to the remote sensing of extremely large areas, having a rapidly deployable capability and having low vulnerability to electronic countermeasures. Unfortunately, a target detection challenge arises because of complicated scenarios, such as nuclear blackout, rain attenuation, etc. In these cases, extra care is needed to evaluate the detection performance in blackout situations, since this a classical problem along with the application of an NSVBR. However, the existing evaluation measures are the probability of detection and the receiver operating curve (ROC), which cannot offer detailed information in such a complicated application. This work focuses on such requirements. We first investigate the effect of blackout on an electromagnetic wave. Performance evaluation indexes are then built: three evaluation indexes on the detection capability and two evaluation indexes on the robustness of the detection process. Simulation results show that the proposed measure will offer information on the detailed performance of detection. These measures are therefore very useful in detecting the target of interest in a remote sensing system and are helpful for both the NSVBR designers and users. PMID:26751445
Northeast corridor passenger transportation data study

DOT National Transportation Integrated Search

1976-08-31

Fourteen measures of performance are recommended for use in Northeast Corridor rail system evaluation and multimodal comparisons. These include performance measures in the categories of system configuration (e.g., daily available-seat miles by vehicl...
Summary of ORSphere critical and reactor physics measurements

NASA Astrophysics Data System (ADS)

Marshall, Margaret A.; Bess, John D.

2017-09-01

In the early 1970s Dr. John T. Mihalczo (team leader), J.J. Lynn, and J.R. Taylor performed experiments at the Oak Ridge Critical Experiments Facility (ORCEF) with highly enriched uranium (HEU) metal (called Oak Ridge Alloy or ORALLOY) to recreate GODIVA I results with greater accuracy than those performed at Los Alamos National Laboratory in the 1950s. The purpose of the Oak Ridge ORALLOY Sphere (ORSphere) experiments was to estimate the unreflected and unmoderated critical mass of an idealized sphere of uranium metal corrected to a density, purity, and enrichment such that it could be compared with the GODIVA I experiments. This critical configuration has been evaluated. Preliminary results were presented at ND2013. Since then, the evaluation was finalized and judged to be an acceptable benchmark experiment for the International Criticality Safety Benchmark Experiment Project (ICSBEP). Additionally, reactor physics measurements were performed to determine surface button worths, central void worth, delayed neutron fraction, prompt neutron decay constant, fission density and neutron importance. These measurements have been evaluated and found to be acceptable experiments and are discussed in full detail in the International Handbook of Evaluated Reactor Physics Benchmark Experiments. The purpose of this paper is to summarize all the evaluated critical and reactor physics measurements evaluations.
Language disturbance and functioning in first episode psychosis.

PubMed

Roche, Eric; Segurado, Ricardo; Renwick, Laoise; McClenaghan, Aisling; Sexton, Sarah; Frawley, Timothy; Chan, Carol K; Bonar, Maurice; Clarke, Mary

2016-01-30

Language disturbance has a central role in the presentation of psychotic disorders however its relationship with functioning requires further clarification, particularly in first episode psychosis (FEP). Both language disturbance and functioning can be evaluated with clinician-rated and performance-based measures. We aimed to investigate the concurrent association between clinician-rated and performance-based measures of language disturbance and functioning in FEP. We assessed 108 individuals presenting to an Early Intervention in Psychosis Service in Ireland. Formal thought disorder (FTD) dimensions and bizarre idiosyncratic thinking (BIT) were rated with structured assessment tools. Functioning was evaluated with a performance-based instrument, a clinician-rated measure and indicators of real-world functioning. The disorganisation dimension of FTD was significantly associated with clinician-rated measures of occupational and social functioning (Beta=-0.19, P<0.05 and Beta=-0.31, P<0.01, respectively). BIT was significantly associated with the performance-based measure of functioning (Beta=-0.22, P<0.05). Language disturbance was of less value in predicting real-world measures of functioning. Clinician-rated and performance-based assessments of language disturbance are complementary and each has differential associations with functioning. Communication disorders should be considered as a potential target for intervention in FEP, although further evaluation of the longitudinal relationship between language disturbance and functioning should be undertaken. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
The Relationship of Teacher Evaluation Scores Generated by a Process-Product Evaluation Instrument to Selected Variables.

ERIC Educational Resources Information Center

Tadlock, James; Nesbit, Lamar

The Jackson Municipal Separate School District, Mississippi, has instituted a mixed-criteria reduction-in-force procedure emphasizing classroom performance to a greater degree than seniority, certification, and staff development participation. The district evaluation process--measuring classroom teaching performance--generated data for the present…
MULTI-SITE PERFORMANCE EVALUATIONS OF CANDIDATE METHODOLOGIES FOR DETERMINING COARSE PARTICULATE MATTER (PMC) CONCENTRATIONS

EPA Science Inventory

Comprehensive field studies were conducted to evaluate the performance of sampling methods for measuring the coarse fraction of PM10 in ambient air. Five separate sampling approaches were evaluated at each of three sampling sites. As the primary basis of comparison, a discret...
Interrater Reliability of mHealth App Rating Measures: Analysis of Top Depression and Smoking Cessation Apps.

PubMed

Powell, Adam C; Torous, John; Chan, Steven; Raynor, Geoffrey Stephen; Shwarts, Erik; Shanahan, Meghan; Landman, Adam B

2016-02-10

There are over 165,000 mHealth apps currently available to patients, but few have undergone an external quality review. Furthermore, no standardized review method exists, and little has been done to examine the consistency of the evaluation systems themselves. We sought to determine which measures for evaluating the quality of mHealth apps have the greatest interrater reliability. We identified 22 measures for evaluating the quality of apps from the literature. A panel of 6 reviewers reviewed the top 10 depression apps and 10 smoking cessation apps from the Apple iTunes App Store on these measures. Krippendorff's alpha was calculated for each of the measures and reported by app category and in aggregate. The measure for interactiveness and feedback was found to have the greatest overall interrater reliability (alpha=.69). Presence of password protection (alpha=.65), whether the app was uploaded by a health care agency (alpha=.63), the number of consumer ratings (alpha=.59), and several other measures had moderate interrater reliability (alphas>.5). There was the least agreement over whether apps had errors or performance issues (alpha=.15), stated advertising policies (alpha=.16), and were easy to use (alpha=.18). There were substantial differences in the interrater reliabilities of a number of measures when they were applied to depression versus smoking apps. We found wide variation in the interrater reliability of measures used to evaluate apps, and some measures are more robust across categories of apps than others. The measures with the highest degree of interrater reliability tended to be those that involved the least rater discretion. Clinical quality measures such as effectiveness, ease of use, and performance had relatively poor interrater reliability. Subsequent research is needed to determine consistent means for evaluating the performance of apps. Patients and clinicians should consider conducting their own assessments of apps, in conjunction with evaluating information from reviews.
Interrater Reliability of mHealth App Rating Measures: Analysis of Top Depression and Smoking Cessation Apps

PubMed Central

Chan, Steven; Raynor, Geoffrey Stephen; Shwarts, Erik; Shanahan, Meghan; Landman, Adam B

2016-01-01

Background There are over 165,000 mHealth apps currently available to patients, but few have undergone an external quality review. Furthermore, no standardized review method exists, and little has been done to examine the consistency of the evaluation systems themselves. Objective We sought to determine which measures for evaluating the quality of mHealth apps have the greatest interrater reliability. Methods We identified 22 measures for evaluating the quality of apps from the literature. A panel of 6 reviewers reviewed the top 10 depression apps and 10 smoking cessation apps from the Apple iTunes App Store on these measures. Krippendorff’s alpha was calculated for each of the measures and reported by app category and in aggregate. Results The measure for interactiveness and feedback was found to have the greatest overall interrater reliability (alpha=.69). Presence of password protection (alpha=.65), whether the app was uploaded by a health care agency (alpha=.63), the number of consumer ratings (alpha=.59), and several other measures had moderate interrater reliability (alphas>.5). There was the least agreement over whether apps had errors or performance issues (alpha=.15), stated advertising policies (alpha=.16), and were easy to use (alpha=.18). There were substantial differences in the interrater reliabilities of a number of measures when they were applied to depression versus smoking apps. Conclusions We found wide variation in the interrater reliability of measures used to evaluate apps, and some measures are more robust across categories of apps than others. The measures with the highest degree of interrater reliability tended to be those that involved the least rater discretion. Clinical quality measures such as effectiveness, ease of use, and performance had relatively poor interrater reliability. Subsequent research is needed to determine consistent means for evaluating the performance of apps. Patients and clinicians should consider conducting their own assessments of apps, in conjunction with evaluating information from reviews. PMID:26863986
MEASUREMENT OF VOLATILE ORGANIC COMPOUNDS BY THE US ENVIRONMENTAL PROTECTION AGENCY COMPENDIUM METHOD TO-17 - EVALUATION OF PERFORMANCE CRITERIA

EPA Science Inventory

An evaluation of performance criteria for US Environmental Protection Agency Compendium Method TO-17 for monitoring volatile organic compounds (VOCs) in air has been accomplished. The method is a solid adsorbent-based sampling and analytical procedure including performance crit...
Calibration of automatic performance measures - speed and volume data : volume 1, evaluation of the accuracy of traffic volume counts collected by microwave sensors.

DOT National Transportation Integrated Search

2015-09-01

Over the past few years, the Utah Department of Transportation (UDOT) has developed a system called the : Signal Performance Metrics System (SPMS) to evaluate the performance of signalized intersections. This system : currently provides data summarie...
Non-parametric early seizure detection in an animal model of temporal lobe epilepsy

NASA Astrophysics Data System (ADS)

Talathi, Sachin S.; Hwang, Dong-Uk; Spano, Mark L.; Simonotto, Jennifer; Furman, Michael D.; Myers, Stephen M.; Winters, Jason T.; Ditto, William L.; Carney, Paul R.

2008-03-01

The performance of five non-parametric, univariate seizure detection schemes (embedding delay, Hurst scale, wavelet scale, nonlinear autocorrelation and variance energy) were evaluated as a function of the sampling rate of EEG recordings, the electrode types used for EEG acquisition, and the spatial location of the EEG electrodes in order to determine the applicability of the measures in real-time closed-loop seizure intervention. The criteria chosen for evaluating the performance were high statistical robustness (as determined through the sensitivity and the specificity of a given measure in detecting a seizure) and the lag in seizure detection with respect to the seizure onset time (as determined by visual inspection of the EEG signal by a trained epileptologist). An optimality index was designed to evaluate the overall performance of each measure. For the EEG data recorded with microwire electrode array at a sampling rate of 12 kHz, the wavelet scale measure exhibited better overall performance in terms of its ability to detect a seizure with high optimality index value and high statistics in terms of sensitivity and specificity.
In vivo MRS and MRSI: Performance analysis, measurement considerations and evaluation of metabolite concentration images

NASA Astrophysics Data System (ADS)

Vikhoff-Baaz, Barbro

2000-10-01

The doctoral thesis concerns development, evaluation and performance of quality assessment methods for volume- selection methods in 31P and 1H MR spectroscopy (MRS). It also contains different aspects of the measurement procedure for 1H MR spectroscopic imaging (MRSI) with application on the human brain, image reconstruction of the MRSI images and evaluation methods for lateralization of temporal lobe epilepsy (TLE). Two complementary two-compartment phantoms and evaluation methods for quality assessment of 31P MRS in small-bore MR systems were presented. The first phantom consisted of an inner cube inside a sphere phantom where measurements with and without volume selection where compared for various VOI sizes. The multi-centre showed that the evaluated parameters provide useful information of the performance of volume-selective MRS at the MR system. The second phantom consisted of two compartments divided by a very thin wall and was found useful for measurements of the appearance and position of the VOI profile in specific gradient directions. The second part concerned 1H MRS and MRSI of whole-body MR systems. Different factors that may degrade or complicate the measurement procedure like for MRSI were evaluated, e.g. the volume selection performance, contamination, susceptibility and motion. Two interpolation methods for reconstruction of MRSI images were compared. Measurements and computer simulations showed that Fourier interpolation correctly visualizes the information inherent in the data set, while the results were dependent on the position of the object relative the original matrix using Cubic spline interpolation. Application of spatial filtering may improve the image representation of the data. Finally, 1H MRSI was performed on healthy volunteers and patients with temporal lobe epilepsy (TLE). Metabolite concentration images were used for lateralization of TLE, where the signal intensity in the two hemispheres were compared. Visual analysis of the metabolite concentration images can, with high accuracy, be used for lateralization in routine examinations. Analysis from measurements with region-of-interests (ROI) in different locations gives quantitative information about the degree of signal loss and the spatial distribution.
SU-F-T-552: A One-Year Evaluation of the QABeamChecker+ for Use with the CyberKnife System

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gersh, J; Spectrum Medical Physics, LLC, Greenville, SC

Purpose: By attaching an adapter plate with fiducial markers to the QA BeamChecker+ (Standard Imaging, Inc., Middleton, WI), the output of the CyberKnife can be accurately, efficiently, and consistently evaluated. The adapter plate, known as the Cutting Board, allows for automated alignment of the QABC+ using the CK’s stereoscopic kV image-based treatment localization system (TLS). Described herein is an evaluation of the system following a year of clinical utilization. Methods: Based on a CT scan of the QABC+ and CB, a treatment plan is generated which delivers a beam to each of the 5 plane-parallel ionization chambers. Following absolute calibrationmore » of the CK, the QA plan is delivered, and baseline measurements are acquired (and automatically corrected for temperature and pressure). This test was performed at the beginning of each treatment day for a year. A calibration evaluation (using a water-equivalent slab and short thimble chamber) is performed every four weeks, or whenever the QABC+ detects a deviation of more than 1.0%. Results: During baseline evaluation, repeat measurements (n=10) were performed, with an average output of 0.25% with an SD of 0.11%. As a test of the reposition of the QABC+ and CB, ten additional measurements were performed where between each acquisition, the entire system was removed and re-positioned using the TLS. The average output deviation was 0.30% with a SD of 0.13%. During the course of the year, 187 QABC+ measurements and 13 slab-based measurements were performed. The output measurements of the QABC+ correlated well with slab-based measurements (R2=0.909). Conclusion: By using the QABC+ and CB, daily output was evaluated accurately, efficiently, and consistently. From setup to break-down (including analysis), this test required 5 minutes instead of approximately 15 using traditional techniques (collimator-mounted ionization chambers). Additionally, by automatically saving resultant output deviation to a database, trend analysis was simplified. Spectrum Medical Physics, LLC of Greenville, SC has a consulting contract with Standard Imaging of Middleton, WI.« less
SELECTION OF ENDOCRINOLOGY SUBSPECIALTY TRAINEES: WHICH APPLICANT CHARACTERISTICS ARE ASSOCIATED WITH PERFORMANCE DURING FELLOWSHIP TRAINING?

PubMed

Natt, Neena; Chang, Alice Y; Berbari, Elie F; Kennel, Kurt A; Kearns, Ann E

2016-01-01

To determine which residency characteristics are associated with performance during endocrinology fellowship training as measured by competency-based faculty evaluation scores and faculty global ratings of trainee performance. We performed a retrospective review of interview applications from endocrinology fellows who graduated from a single academic institution between 2006 and 2013. Performance measures included competency-based faculty evaluation scores and faculty global ratings. The association between applicant characteristics and measures of performance during fellowship was examined by linear regression. The presence of a laudatory comparative statement in the residency program director's letter of recommendation (LoR) or experience as a chief resident was significantly associated with competency-based faculty evaluation scores (β = 0.22, P = .001; and β = 0.24, P = .009, respectively) and faculty global ratings (β = 0.85, P = .006; and β = 0.96, P = .015, respectively). The presence of a laudatory comparative statement in the residency program director's LoR or experience as a chief resident were significantly associated with overall performance during subspecialty fellowship training. Future studies are needed in other cohorts to determine the broader implications of these findings in the application and selection process.
Detailed performance and environmental monitoring of aquifer heating and cooling systems

NASA Astrophysics Data System (ADS)

Acuna, José; Ahlkrona, Malva; Zandin, Hanna; Singh, Ashutosh

2016-04-01

The project intends to quantify the performance and environmental impact of large scale aquifer thermal energy storage, as well as point at recommendations for operating and estimating the environmental footprint of future systems. Field measurements, test of innovative equipment as well as advanced modelling work and analysis will be performed. The following aspects are introduced and covered in the presentation: -Thermal, chemical and microbiological influence of akvifer thermal energy storage systems: measurement and evaluation of real conditions and the influence of one system in operation. -Follow up of energy extraction from aquifer as compared to projected values, recommendations for improvements. -Evaluation of the most used thermal modeling tool for design and calculation of groundwater temperatures, calculations with MODFLOW/MT3DMS -Test and evaluation of optical fiber cables as a way to measure temperatures in aquifer thermal energy storages
Evaluation of audit-based performance measures for dental care plans.

PubMed

Bader, J D; Shugars, D A; White, B A; Rindal, D B

1999-01-01

Although a set of clinical performance measures, i.e., a report card for dental plans, has been designed for use with administrative data, most plans do not have administrative data systems containing the data needed to calculate the measures. Therefore, we evaluated the use of a set of proxy clinical performance measures calculated from data obtained through chart audits. Chart audits were conducted in seven dental programs--three public health clinics, two dental health maintenance organizations (DHMO), and two preferred provider organizations (PPO). In all instances audits were completed by clinical staff who had been trained using telephone consultation and a self-instructional audit manual. The performance measures were calculated for the seven programs, audit reliability was assessed in four programs, and for one program the audit-based proxy measures were compared to the measures calculated using administrative data. The audit-based measures were sensitive to known differences in program performance. The chart audit procedures yielded reasonably reliable data. However, missing data in patient charts rendered the calculation of some measures problematic--namely, caries and periodontal disease assessment and experience. Agreement between administrative and audit-based measures was good for most, but not all, measures in one program. The audit-based proxy measures represent a complex but feasible approach to the calculation of performance measures for those programs lacking robust administrative data systems. However, until charts contain more complete diagnostic information (i.e., periodontal charting and diagnostic codes or reason-for-treatment codes), accurate determination of these aspects of clinical performance will be difficult.
Research performance measures and program evaluation.

DOT National Transportation Integrated Search

2003-10-01

The Iowa Department of Transportation hosted a Peer Exchange on October 8-9, 2003. The : purpose of this exchange was to give research managers from several state departments of : transportation the opportunity to discuss research performance measure...
Aggregate Interview Method of ranking orthopedic applicants predicts future performance.

PubMed

Geissler, Jacqueline; VanHeest, Ann; Tatman, Penny; Gioe, Terence

2013-07-01

This article evaluates and describes a process of ranking orthopedic applicants using what the authors term the Aggregate Interview Method. The authors hypothesized that higher-ranking applicants using this method at their institution would perform better than those ranked lower using multiple measures of resident performance. A retrospective review of 115 orthopedic residents was performed at the authors' institution. Residents were grouped into 3 categories by matching rank numbers: 1-5, 6-14, and 15 or higher. Each rank group was compared with resident performance as measured by faculty evaluations, the Orthopaedic In-Training Examination (OITE), and American Board of Orthopaedic Surgery (ABOS) test results. Residents ranked 1-5 scored significantly better on patient care, behavior, and overall competence by faculty evaluation (P<.05). Residents ranked 1-5 scored higher on the OITE compared with those ranked 6-14 during postgraduate years 2 and 3 (P⩽.5). Graduates who had been ranked 1-5 had a 100% pass rate on the ABOS part 1 examination on the first attempt. The most favorably ranked residents performed at or above the level of other residents in the program; they did not score inferiorly on any measure. These results support the authors' method of ranking residents. The rigorous Aggregate Interview Method for ranking applicants consistently identified orthopedic resident candidates who scored highly on the Accreditation Council for Graduate Medical Education resident core competencies as measured by faculty evaluations, performed above the national average on the OITE, and passed the ABOS part 1 examination at rates exceeding the national average. Copyright 2013, SLACK Incorporated.
The Performance Blueprint: An Integrated Logic Model Developed To Enhance Performance Measurement Literacy: The Case of Performance-Based Contract Management.

ERIC Educational Resources Information Center

Longo, Paul J.

This study explored the mechanics of using an enhanced, comprehensive multipurpose logic model, the Performance Blueprint, as a means of building evaluation capacity, referred to in this paper as performance measurement literacy, to facilitate the attainment of both service-delivery oriented and community-oriented outcomes. The application of this…

Laboratory Performance Evaluation Report of SEL 421 Phasor Measurement Unit

DOE Office of Scientific and Technical Information (OSTI.GOV)

Huang, Zhenyu; faris, Anthony J.; Martin, Kenneth E.

2007-12-01

PNNL and BPA have been in close collaboration on laboratory performance evaluation of phasor measurement units for over ten years. A series of evaluation tests are designed to confirm accuracy and determine measurement performance under a variety of conditions that may be encountered in actual use. Ultimately the testing conducted should provide parameters that can be used to adjust all measurements to a standardized basis. These tests are performed with a standard relay test set using recorded files of precisely generated test signals. The test set provides test signals at a level and in a format suitable for input tomore » a PMU that accurately reproduces the signals in both signal amplitude and timing. Test set outputs are checked to confirm the accuracy of the output signal. The recorded signals include both current and voltage waveforms and a digital timing track used to relate the PMU measured value with the test signal. Test signals include steady-state waveforms to test amplitude, phase, and frequency accuracy, modulated signals to determine measurement and rejection bands, and step tests to determine timing and response accuracy. Additional tests are included as necessary to fully describe the PMU operation. Testing is done with a BPA phasor data concentrator (PDC) which provides communication support and monitors data input for dropouts and data errors.« less
Performance analysis and evaluation of direct phase measuring deflectometry

NASA Astrophysics Data System (ADS)

Zhao, Ping; Gao, Nan; Zhang, Zonghua; Gao, Feng; Jiang, Xiangqian

2018-04-01

Three-dimensional (3D) shape measurement of specular objects plays an important role in intelligent manufacturing applications. Phase measuring deflectometry (PMD)-based methods are widely used to obtain the 3D shapes of specular surfaces because they offer the advantages of a large dynamic range, high measurement accuracy, full-field and noncontact operation, and automatic data processing. To enable measurement of specular objects with discontinuous and/or isolated surfaces, a direct PMD (DPMD) method has been developed to build a direct relationship between phase and depth. In this paper, a new virtual measurement system is presented and is used to optimize the system parameters and evaluate the system's performance in DPMD applications. Four system parameters are analyzed to obtain accurate measurement results. Experiments are performed using simulated and actual data and the results confirm the effects of these four parameters on the measurement results. Researchers can therefore select suitable system parameters for actual DPMD (including PMD) measurement systems to obtain the 3D shapes of specular objects with high accuracy.
Performance evaluation of an agent-based occupancy simulation model

DOE PAGES

Luo, Xuan; Lam, Khee Poh; Chen, Yixing; ...

2017-01-17

Occupancy is an important factor driving building performance. Static and homogeneous occupant schedules, commonly used in building performance simulation, contribute to issues such as performance gaps between simulated and measured energy use in buildings. Stochastic occupancy models have been recently developed and applied to better represent spatial and temporal diversity of occupants in buildings. However, there is very limited evaluation of the usability and accuracy of these models. This study used measured occupancy data from a real office building to evaluate the performance of an agent-based occupancy simulation model: the Occupancy Simulator. The occupancy patterns of various occupant types weremore » first derived from the measured occupant schedule data using statistical analysis. Then the performance of the simulation model was evaluated and verified based on (1) whether the distribution of observed occupancy behavior patterns follows the theoretical ones included in the Occupancy Simulator, and (2) whether the simulator can reproduce a variety of occupancy patterns accurately. Results demonstrated the feasibility of applying the Occupancy Simulator to simulate a range of occupancy presence and movement behaviors for regular types of occupants in office buildings, and to generate stochastic occupant schedules at the room and individual occupant levels for building performance simulation. For future work, model validation is recommended, which includes collecting and using detailed interval occupancy data of all spaces in an office building to validate the simulated occupant schedules from the Occupancy Simulator.« less
Performance evaluation of an agent-based occupancy simulation model

DOE Office of Scientific and Technical Information (OSTI.GOV)

Luo, Xuan; Lam, Khee Poh; Chen, Yixing

Occupancy is an important factor driving building performance. Static and homogeneous occupant schedules, commonly used in building performance simulation, contribute to issues such as performance gaps between simulated and measured energy use in buildings. Stochastic occupancy models have been recently developed and applied to better represent spatial and temporal diversity of occupants in buildings. However, there is very limited evaluation of the usability and accuracy of these models. This study used measured occupancy data from a real office building to evaluate the performance of an agent-based occupancy simulation model: the Occupancy Simulator. The occupancy patterns of various occupant types weremore » first derived from the measured occupant schedule data using statistical analysis. Then the performance of the simulation model was evaluated and verified based on (1) whether the distribution of observed occupancy behavior patterns follows the theoretical ones included in the Occupancy Simulator, and (2) whether the simulator can reproduce a variety of occupancy patterns accurately. Results demonstrated the feasibility of applying the Occupancy Simulator to simulate a range of occupancy presence and movement behaviors for regular types of occupants in office buildings, and to generate stochastic occupant schedules at the room and individual occupant levels for building performance simulation. For future work, model validation is recommended, which includes collecting and using detailed interval occupancy data of all spaces in an office building to validate the simulated occupant schedules from the Occupancy Simulator.« less
APPLICATION OF EYE TRACKING FOR MEASUREMENT AND EVALUATION IN HUMAN FACTORS STUDIES IN CONTROL ROOM MODERNIZATION

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kovesdi, C.; Spielman, Z.; LeBlanc, K.

An important element of human factors engineering (HFE) pertains to measurement and evaluation (M&E). The role of HFE-M&E should be integrated throughout the entire control room modernization (CRM) process and be used for human-system performance evaluation and diagnostic purposes with resolving potential human engineering deficiencies (HEDs) and other human machine interface (HMI) design issues. NUREG-0711 describes how HFE in CRM should employ a hierarchical set of measures, particularly during integrated system validation (ISV), including plant performance, personnel task performance, situation awareness, cognitive workload, and anthropometric/ physiological factors. Historically, subjective measures have been primarily used since they are easier to collectmore » and do not require specialized equipment. However, there are pitfalls with relying solely on subjective measures in M&E such that negatively impact reliability, sensitivity, and objectivity. As part of comprehensively capturing a diverse set of measures that strengthen findings and inferences made of the benefits from emerging technologies like advanced displays, this paper discusses the value of using eye tracking as an objective method that can be used in M&E. A brief description of eye tracking technology and relevant eye tracking measures is provided. Additionally, technical considerations and the unique challenges with using eye tracking in full-scaled simulations are addressed. Finally, this paper shares preliminary findings regarding the use of a wearable eye tracking system in a full-scale simulator study. These findings should help guide future full-scale simulator studies using eye tracking as a methodology to evaluate human-system performance.« less
Skylab experiment performance evaluation manual. Appendix T: Experiment T027/S073 contamination measurement, photometer and Gegenschein/zodiacal light (MSFC)

NASA Technical Reports Server (NTRS)

Meyers, J. E.

1973-01-01

A series of analyses for Experiment T027/S073, contamination measurement, photometer and gegenschein/zodiacal light (MSFC), to be used for evaluating the performance of the Skylab corollary experiments under preflight, inflight, and post-flight conditons is presented. Experiment contingency plan workaround procedure and malfunction analyses are presented in order to assist in making the experiment operationally successful.
Teachers' Perceptions of Evaluation and Teachers' Sense of Self-Efficacy in High-Performing High Schools

ERIC Educational Resources Information Center

McCall, James P.

2011-01-01

The evaluation, improvement, and accountability of teachers has been the topic of the nation throughout the era of No Child Left Behind. Where some critics point to a business model of measuring outputs (i.e., student achievement scores on standardized tests) to evaluate teacher performance, others will advocate for a fair evaluation system that…
Performance measurement of commercial electronic still picture cameras

NASA Astrophysics Data System (ADS)

Hsu, Wei-Feng; Tseng, Shinn-Yih; Chiang, Hwang-Cheng; Cheng, Jui-His; Liu, Yuan-Te

1998-06-01

Commercial electronic still picture cameras need a low-cost, systematic method for evaluating the performance. In this paper, we present a measurement method to evaluating the dynamic range and sensitivity by constructing the opto- electronic conversion function (OECF), the fixed pattern noise by the peak S/N ratio (PSNR) and the image shading function (ISF), and the spatial resolution by the modulation transfer function (MTF). The evaluation results of individual color components and the luminance signal from a PC camera using SONY interlaced CCD array as the image sensor are then presented.
Reliability and Validity of the Turkish Version of the Job Performance Scale Instrument.

PubMed

Harmanci Seren, Arzu Kader; Tuna, Rujnan; Eskin Bacaksiz, Feride

2018-02-01

Objective measurement of the job performance of nursing staff using valid and reliable instruments is important in the evaluation of healthcare quality. A current, valid, and reliable instrument that specifically measures the performance of nurses is required for this purpose. The aim of this study was to determine the validity and reliability of the Turkish version of the Job Performance Instrument. This study used a methodological design and a sample of 240 nurses working at different units in four hospitals in Istanbul, Turkey. A descriptive data form, the Job Performance Scale, and the Employee Performance Scale were used to collect data. Data were analyzed using IBM SPSS Statistics Version 21.0 and LISREL Version 8.51. On the basis of the data analysis, the instrument was revised. Some items were deleted, and subscales were combined. The Turkish version of the Job Performance Instrument was determined to be valid and reliable to measure the performance of nurses. The instrument is suitable for evaluating current nursing roles.
Evaluation of the Langley 4- by 7-meter tunnel for propeller noise measurements

NASA Technical Reports Server (NTRS)

Block, P. J. W.; Gentry, G. L., Jr.

1984-01-01

An experimental and theoretical evaluation of the Langley 4- by 7- Meter Tunnel was conducted to determine its suitability for obtaining propeller noise data. The tunnel circuit and open test section are described. An experimental evaluation is performed using microphones placed in and on the tunnel floor. The reflection characteristics and background noise are determined. The predicted source (propeller) near-field/far-field boundary is given using a first-principles method. The effect of the tunnel-floor boundry layer on the noise from the propeller is also predicted. A propeller test stand used for part of his evaluation is also described. The measured propeller performance characteristics are compared with those obtained at a larger scale, and the effect of the test-section configuration on the propeller performance is examined. Finally, propeller noise measurements were obtained on an eight-bladed SR-2 propeller operating at angles of attack -8 deg, 0 deg, and 4.6 deg to give an indication of attainable signal-to-noise ratios.
Effect of patient positions on measurement errors of the knee-joint space on radiographs

NASA Astrophysics Data System (ADS)

Gilewska, Grazyna

2001-08-01

Osteoarthritis (OA) is one of the most important health problems these days. It is one of the most frequent causes of pain and disability of middle-aged and old people. Nowadays the radiograph is the most economic and available tool to evaluate changes in OA. Error of performance of radiographs of knee joint is the basic problem of their evaluation for clinical research. The purpose of evaluation of such radiographs in my study was measuring the knee-joint space on several radiographs performed at defined intervals. Attempt at evaluating errors caused by a radiologist of a patient was presented in this study. These errors resulted mainly from either incorrect conditions of performance or from a patient's fault. Once we have information about size of the errors, we will be able to assess which of these elements have the greatest influence on accuracy and repeatability of measurements of knee-joint space. And consequently we will be able to minimize their sources.
Proposed Performance Measures and Strategies for Implementation of the Fatigue Risk Management Guidelines for Emergency Medical Services.

PubMed

Martin-Gill, Christian; Higgins, J Stephen; Van Dongen, Hans P A; Buysse, Daniel J; Thackery, Ronald W; Kupas, Douglas F; Becker, David S; Dean, Bradley E; Lindbeck, George H; Guyette, Francis X; Penner, Josef H; Violanti, John M; Lang, Eddy S; Patterson, P Daniel

2018-02-15

Performance measures are a key component of implementation, dissemination, and evaluation of evidence-based guidelines (EBGs). We developed performance measures for Emergency Medical Services (EMS) stakeholders to enable the implementation of guidelines for fatigue risk management in the EMS setting. Panelists associated with the Fatigue in EMS Project, which was supported by the National Highway Traffic Safety Administration (NHTSA), used an iterative process to develop a draft set of performance measures linked to 5 recommendations for fatigue risk management in EMS. We used a cross-sectional survey design and the Content Validity Index (CVI) to quantify agreement among panelists on the wording and content of draft measures. An anonymous web-based tool was used to solicit the panelists' perceptions of clarity and relevance of draft measures. Panelists rated the clarity and relevance separately for each draft measure on a 4-point scale. CVI scores ≥0.78 for clarity and relevance were specified a priori to signify agreement and completion of measurement development. Panelists judged 5 performance measures for fatigue risk management as clear and relevant. These measures address use of fatigue and/or sleepiness survey instruments, optimal duration of shifts, access to caffeine as a fatigue countermeasure, use of napping during shift work, and the delivery of education and training on fatigue risk management for EMS personnel. Panelists complemented performance measures with suggestions for implementation by EMS agencies. Performance measures for fatigue risk management in the EMS setting will facilitate the implementation and evaluation of the EBG for Fatigue in EMS.
Evaluation of full depth asphaltic concrete pavements : final report.

DOT National Transportation Integrated Search

1982-10-01

the aim of this study was to evaluate the full depth asphaltic concrete pavement design concept by observing the performance characteristics of two 13-inch pavements constructed in 1970. Pavement performance measurements, over an 11-year period, incl...
ASUPT Automated Objective Performance Measurement System.

ERIC Educational Resources Information Center

Waag, Wayne L.; And Others

To realize its full research potential, a need exists for the development of an automated objective pilot performance evaluation system for use in the Advanced Simulation in Undergraduate Pilot Training (ASUPT) facility. The present report documents the approach taken for the development of performance measures and also presents data collected…
Lessons Learned from Military Performance Assessment.

ERIC Educational Resources Information Center

Wise, Lauress L.

Lessons derived from the Job Performance Measurement (JPM) Project, which is overseen by the Office of the Assistant Secretary of Defense for Force Management and Personnel, for educational assessment are explored. The JPM Project was initiated to develop high fidelity measures of performance on the job that can be used to evaluate personnel…
Performance Evaluation of the Educational Leader (PEEL): Another Breakthrough in Competency Based Educational Administration.

ERIC Educational Resources Information Center

Metzger, Christa; Lynch, Steven B.

1974-01-01

This paper describes the Performance Evaluation of the Education Leader (PEEL) program, initiated from a study to define the competent school administrator and to develop an instrument to measure administrative competence objectively and accurately. The resulting PEEL materials include the following: (a) "Guidelines for Evaluation: The School…
Analysis of seasonal strain measurements in asphalt materials under accelerated pavement testing and comparing field performance and laboratory measured binder tension properties.

DOT National Transportation Integrated Search

2009-06-01

Seasonal variation of measured pavement responses with temperature and its relationship to pavement performance has not been : thoroughly evaluated for ALF Experiments II and III. Such information may be used to improve instrumentation strategies in ...
Subjective Performance Evaluation in the Public Sector: Evidence from School Inspections. CEE DP 135

ERIC Educational Resources Information Center

Hussain, Iftikhar

2012-01-01

Performance measurement in the public sector is largely based on objective metrics, which may be subject to gaming behaviour. This paper investigates a novel subjective performance evaluation system where independent inspectors visit schools at very short notice, publicly disclose their findings and sanction schools rated fail. First, I…
Development and Validation of a Clarinet Performance Adjudication Scale

ERIC Educational Resources Information Center

Abeles, Harold F.

1973-01-01

A basic assumption of this study is that there are generally agreed upon performance standards as evidenced by the use of adjudicators for evaluations at contests and festivals. An evaluation instrument was developed to enable raters to measure effectively those aspects of performance that have common standards of proficiency. (Author/RK)
Mining Formative Evaluation Rules Using Web-Based Learning Portfolios for Web-Based Learning Systems

ERIC Educational Resources Information Center

Chen, Chih-Ming; Hong, Chin-Ming; Chen, Shyuan-Yi; Liu, Chao-Yu

2006-01-01

Learning performance assessment aims to evaluate what knowledge learners have acquired from teaching activities. Objective technical measures of learning performance are difficult to develop, but are extremely important for both teachers and learners. Learning performance assessment using learning portfolios or web server log data is becoming an…

Unrealistic Optimism in the Pursuit of Academic Success

ERIC Educational Resources Information Center

Lewine, Rich; Sommers, Alison A.

2016-01-01

Although the ability to evaluate one's own knowledge and performance is critical to learning, the correlation between students' self-evaluation and actual performance measures is modest at best. In this study we examine the effect of offering extra credit for students' accurate prediction (self-accuracy) of their performance on four exams in two…
Automating Performance Measures and Clinical Practice Guidelines: Differences and Complementarities.

PubMed

Tu, Samson W; Martins, Susana; Oshiro, Connie; Yuen, Kaeli; Wang, Dan; Robinson, Amy; Ashcraft, Michael; Heidenreich, Paul A; Goldstein, Mary K

2016-01-01

Through close analysis of two pairs of systems that implement the automated evaluation of performance measures (PMs) and guideline-based clinical decision support (CDS), we contrast differences in their knowledge encoding and necessary changes to a CDS system that provides management recommendations for patients failing performance measures. We trace the sources of differences to the implementation environments and goals of PMs and CDS.
Design and Testing of a Tool for Evaluating the Quality of Diabetes Consumer-Information Web Sites

PubMed Central

Steinwachs, Donald; Rubin, Haya R

2003-01-01

Background Most existing tools for measuring the quality of Internet health information focus almost exclusively on structural criteria or other proxies for quality information rather than evaluating actual accuracy and comprehensiveness. Objective This research sought to develop a new performance-measurement tool for evaluating the quality of Internet health information, test the validity and reliability of the tool, and assess the variability in diabetes Web site quality. Methods An objective, systematic tool was developed to evaluate Internet diabetes information based on a quality-of-care measurement framework. The principal investigator developed an abstraction tool and trained an external reviewer on its use. The tool included 7 structural measures and 34 performance measures created by using evidence-based practice guidelines and experts' judgments of accuracy and comprehensiveness. Results Substantial variation existed in all categories, with overall scores following a normal distribution and ranging from 15% to 95% (mean was 50% and median was 51%). Lin's concordance correlation coefficient to assess agreement between raters produced a rho of 0.761 (Pearson's r of 0.769), suggesting moderate to high agreement. The average agreement between raters for the performance measures was 0.80. Conclusions Diabetes Web site quality varies widely. Alpha testing of this new tool suggests that it could become a reliable and valid method for evaluating the quality of Internet health sites. Such an instrument could help lay people distinguish between beneficial and misleading information. PMID:14713658
Evaluating Curriculum-Based Measurement from a Behavioral Assessment Perspective

ERIC Educational Resources Information Center

Ardoin, Scott P.; Roof, Claire M.; Klubnick, Cynthia; Carfolite, Jessica

2008-01-01

Curriculum-based measurement Reading (CBM-R) is an assessment procedure used to evaluate students' relative performance compared to peers and to evaluate their growth in reading. Within the response to intervention (RtI) model, CBM-R data are plotted in time series fashion as a means modeling individual students' response to varying levels of…
A Comparison of Evaluation Practices Based on E-Learning and Mobile Learning Delivery Rates

ERIC Educational Resources Information Center

Marshall, James

2018-01-01

Learning and performance professionals are increasingly pressed to measure the results of their learning program design efforts, and ultimately prove their worth. However, evaluation efforts are often limited to measuring participant reaction. This study sought to quantify evaluation practices in organizations and investigate how the use of…
Surviving annual performance reviews.

PubMed

Lazarus, Arthur

2008-01-01

Physicians who work in organizational settings can expect to be evaluated at least twice a year. Yet physicians are accustomed to functioning autonomously, and they may resist having their performance measured or become anxious at the thought of it. Several recommendations are made to help physicians survive the ordeal: (1) establish measurable goals and objectives for the year; (2) perform at your very best at all times; (3) obtain feedback about your performance from your colleagues; (4) ask for a mentor if you lack experience; (5) learn to manage upward; (6) let your boss know when other people have praised your work; (7) insist on face-to-face evaluations; and (8) sign your annual performance review and indicate agreement or disagreement.
The Ling 6(HL) test: typical pediatric performance data and clinical use evaluation.

PubMed

Glista, Danielle; Scollie, Susan; Moodie, Sheila; Easwar, Vijayalakshmi

2014-01-01

The Ling 6(HL) test offers a calibrated version of naturally produced speech sounds in dB HL for evaluation of detection thresholds. Aided performance has been previously characterized in adults. The purpose of this work was to evaluate and refine the Ling 6(HL) test for use in pediatric hearing aid outcome measurement. This work is presented across two studies incorporating an integrated knowledge translation approach in the characterization of normative and typical performance, and in the evaluation of clinical feasibility, utility, acceptability, and implementation. A total of 57 children, 28 normally hearing and 29 with binaural sensorineural hearing loss, were included in Study 1. Children wore their own hearing aids fitted using Desired Sensation Level v5.0. Nine clinicians from The Network of Pediatric Audiologists participated in Study 2. A CD-based test format was used in the collection of unaided and aided detection thresholds in laboratory and clinical settings; thresholds were measured clinically as part of routine clinical care. Confidence intervals were derived to characterize normal performance and typical aided performance according to hearing loss severity. Unaided-aided performance was analyzed using a repeated-measures analysis of variance. The audiologists completed an online questionnaire evaluating the quality, feasibility/executability, utility/comparative value/relative advantage, acceptability/applicability, and interpretability, in addition to recommendation and general comments sections. Ling 6(HL) thresholds were reliably measured with children 3-18 yr old. Normative and typical performance ranges were translated into a scoring tool for use in pediatric outcome measurement. In general, questionnaire respondents generally agreed that the Ling 6(HL) test was a high-quality outcome evaluation tool that can be implemented successfully in clinical settings. By actively collaborating with pediatric audiologists and using an integrated knowledge translation framework, this work supported the creation of an evidence-based clinical tool that has the potential to be implemented in, and useful to, clinical practice. More research is needed to characterize performance in alternative listening conditions to facilitate use with infants, for example. Future efforts focused on monitoring the use of the Ling 6(HL) test in daily clinical practice may help describe whether clinical use has been maintained across time and if any additional adaptations are necessary to facilitate clinical uptake. American Academy of Audiology.
Evaluating Cross-Cutting Approaches to Chronic Disease Prevention and Management: Developing a Comprehensive Evaluation

PubMed Central

Jernigan, Jan; Barnes, Seraphine Pitt; Shea, Pat; Davis, Rachel; Rutledge, Stephanie

2017-01-01

We provide an overview of the comprehensive evaluation of State Public Health Actions to Prevent and Control Diabetes, Heart Disease, Obesity and Associated Risk Factors and Promote School Health (State Public Health Actions). State Public Health Actions is a program funded by the Centers for Disease Control and Prevention to support the statewide implementation of cross-cutting approaches to promote health and prevent and control chronic diseases. The evaluation addresses the relevance, quality, and impact of the program by using 4 components: a national evaluation, performance measures, state evaluations, and evaluation technical assistance to states. Challenges of the evaluation included assessing the extent to which the program contributed to changes in the outcomes of interest and the variability in the states’ capacity to conduct evaluations and track performance measures. Given the investment in implementing collaborative approaches at both the state and national level, achieving meaningful findings from the evaluation is critical. PMID:29215974
Demonstration of subsidence monitoring system

NASA Astrophysics Data System (ADS)

Conroy, P. J.; Gyarmaty, J. H.; Pearson, M. L.

1981-06-01

Data on coal mine subsidence were studied as a basis for the development of subsidence control technology. Installation, monitoring, and evaluation of three subsidence monitoring instrument systems were examined: structure performance, performance of supported systems, and performance of caving systems. Objectives of the instrument program were: (1) to select, test, assemble, install, monitor, and maintain all instrumentation required for implementing the three subsidence monitoring systems; and (2) to evaluate performance of each instrument individually and as part of the appropriate monitoring system or systems. The use of an automatic level and a rod extensometer for measuring structure performance, and the automatic level, steel tape extensometer, FPBX, FPBI, USBM borehole deformation gauge, and vibrating wire stressmeters for measuring the performance of caving systems are recommended.
Evaluating the effect of online data compression on the disk cache of a mass storage system

NASA Technical Reports Server (NTRS)

Pentakalos, Odysseas I.; Yesha, Yelena

1994-01-01

A trace driven simulation of the disk cache of a mass storage system was used to evaluate the effect of an online compression algorithm on various performance measures. Traces from the system at NASA's Center for Computational Sciences were used to run the simulation and disk cache hit ratios, number of files and bytes migrating to tertiary storage were measured. The measurements were performed for both an LRU and a size based migration algorithm. In addition to seeing the effect of online data compression on the disk cache performance measure, the simulation provided insight into the characteristics of the interactive references, suggesting that hint based prefetching algorithms are the only alternative for any future improvements to the disk cache hit ratio.
Evaluation of AAFE apparatus to measure residual and transient convection in zero-gravity

NASA Technical Reports Server (NTRS)

Ruff, R. C.; Facemire, B. R.; Witherow, W. K.

1978-01-01

An evaluation apparatus which photographs convective and diffusive flows in crystal growth experiments is presented. Results in the following catagories are reported: (1) Human factors; (2) Electrical and mechanical; (3) Optical performance; and (4) Thermal performance.
Holistic rubric vs. analytic rubric for measuring clinical performance levels in medical students.

PubMed

Yune, So Jung; Lee, Sang Yeoup; Im, Sun Ju; Kam, Bee Sung; Baek, Sun Yong

2018-06-05

Task-specific checklists, holistic rubrics, and analytic rubrics are often used for performance assessments. We examined what factors evaluators consider important in holistic scoring of clinical performance assessment, and compared the usefulness of applying holistic and analytic rubrics respectively, and analytic rubrics in addition to task-specific checklists based on traditional standards. We compared the usefulness of a holistic rubric versus an analytic rubric in effectively measuring the clinical skill performances of 126 third-year medical students who participated in a clinical performance assessment conducted by Pusan National University School of Medicine. We conducted a questionnaire survey of 37 evaluators who used all three evaluation methods-holistic rubric, analytic rubric, and task-specific checklist-for each student. The relationship between the scores on the three evaluation methods was analyzed using Pearson's correlation. Inter-rater agreement was analyzed by Kappa index. The effect of holistic and analytic rubric scores on the task-specific checklist score was analyzed using multiple regression analysis. Evaluators perceived accuracy and proficiency to be major factors in objective structured clinical examinations evaluation, and history taking and physical examination to be major factors in clinical performance examinations evaluation. Holistic rubric scores were highly related to the scores of the task-specific checklist and analytic rubric. Relatively low agreement was found in clinical performance examinations compared to objective structured clinical examinations. Meanwhile, the holistic and analytic rubric scores explained 59.1% of the task-specific checklist score in objective structured clinical examinations and 51.6% in clinical performance examinations. The results show the usefulness of holistic and analytic rubrics in clinical performance assessment, which can be used in conjunction with task-specific checklists for more efficient evaluation.
Modeling instructor preferences for CPR and AED competence estimation.

PubMed

Birnbaum, Alice; McBurnie, Mary Ann; Powell, Judy; Ottingham, Lois Van; Riegel, Barbara; Potts, Jerry; Hedges, Jerris R

2005-03-01

Cardiopulmonary resuscitation (CPR) and automated external defibrillator (AED) skills competency can be tested using a checklist of component skills, individually graded "pass" or "fail." Scores are typically calculated as the percentage of skills passed, but may differ from an instructor's overall subjective assessment of simulated CPR or AED adequacy. To identify and evaluate composite measures (methods for scoring checklists) that reflect instructors' subjective assessments of CPR or AED skills performance best. Associations between instructor assessment and lay-volunteer skill performance were made using 6380 CPR and 3313 AED skill retention tests collected in the Public Access Defibrillation Trial. Checklists included CPR skills (e.g., calling 911, administering compressions) and AED skills (e.g., positioning electrodes, shocking within 90 s of AED arrival). The instructor's subjective overall assessment (adequate/inadequate) of CPR performance (perfusion) or AED competence (effective shock) was compared to composite measures. We evaluated the traditional composite measure (assigning equal weights to individual skills) and several nontraditional composite measures (assigning variable weights). Skills performed out of sequence were further weighted from 0% (no credit) to 100% (full credit). Composite measures providing full credit for skills performed out of sequence and down-weighting process skills (e.g., calling 911, clearing oneself from the AED) had the strongest association with the instructor's subjective assessment; the traditional CPR composite measure had the weakest association. Our findings suggest that instructors in public CPR and AED classes may tend to down-weight process skills and to excuse step sequencing errors when evaluating CPR and AED skills subjectively for overall proficiency. Testing methods that relate classroom performance to actual performance in the field and to clinical outcomes require further research.
Operational Test and Evaluation Handbook for Aircrew Training Devices. Volume II. Operational Effectiveness Evaluation

DTIC Science & Technology

1982-02-01

should also convey an understanding of the differ- ences in learning behavior between initial learning activity and later skill maintenance and...refinement might then be, ATTACK MANEUVERS * Pop-up attack # Loft/ LADO type attack * Level/laydown attack Figure 5-4 showe diagrammatically the...sensitive to differ- ences in performance. Severai criteria should be used to guide the selection/development of performance measures, i.e., measure validity
Solar energy system performance evaluation. Seasonal report for Wormser, Columbia, South Carolina

NASA Technical Reports Server (NTRS)

1980-01-01

The Wormser Solar Energy System's operational performance from April 1979 through March 1980 was evaluated. The space heating subsystem met 42 percent of the measured space heating load and the hot water subsystem met 23 percent of the measured hot water demand. Net electrical energy savings were 4.36 million Btu's or 1277 kwh. Fossil energy savings will increase considerably if the uncontrolled solar energy input to the building is considered.
Models and techniques for evaluating the effectiveness of aircraft computing systems

NASA Technical Reports Server (NTRS)

Meyer, J. F.

1977-01-01

Models, measures and techniques were developed for evaluating the effectiveness of aircraft computing systems. The concept of effectiveness involves aspects of system performance, reliability and worth. Specifically done was a detailed development of model hierarchy at mission, functional task, and computational task levels. An appropriate class of stochastic models was investigated which served as bottom level models in the hierarchial scheme. A unified measure of effectiveness called 'performability' was defined and formulated.
Space shuttle main engine computed tomography applications

NASA Technical Reports Server (NTRS)

Sporny, Richard F.

1990-01-01

For the past two years the potential applications of computed tomography to the fabrication and overhaul of the Space Shuttle Main Engine were evaluated. Application tests were performed at various government and manufacturer facilities with equipment produced by four different manufacturers. The hardware scanned varied in size and complexity from a small temperature sensor and turbine blades to an assembled heat exchanger and main injector oxidizer inlet manifold. The evaluation of capabilities included the ability to identify and locate internal flaws, measure the depth of surface cracks, measure wall thickness, compare manifold design contours to actual part contours, perform automatic dimensional inspections, generate 3D computer models of actual parts, and image the relationship of the details in a complex assembly. The capabilities evaluated, with the exception of measuring the depth of surface flaws, demonstrated the existing and potential ability to perform many beneficial Space Shuttle Main Engine applications.
Quality of Protection Evaluation of Security Mechanisms

PubMed Central

Ksiezopolski, Bogdan; Zurek, Tomasz; Mokkas, Michail

2014-01-01

Recent research indicates that during the design of teleinformatic system the tradeoff between the systems performance and the system protection should be made. The traditional approach assumes that the best way is to apply the strongest possible security measures. Unfortunately, the overestimation of security measures can lead to the unreasonable increase of system load. This is especially important in multimedia systems where the performance has critical character. In many cases determination of the required level of protection and adjustment of some security measures to these requirements increase system efficiency. Such an approach is achieved by means of the quality of protection models where the security measures are evaluated according to their influence on the system security. In the paper, we propose a model for QoP evaluation of security mechanisms. Owing to this model, one can quantify the influence of particular security mechanisms on ensuring security attributes. The methodology of our model preparation is described and based on it the case study analysis is presented. We support our method by the tool where the models can be defined and QoP evaluation can be performed. Finally, we have modelled TLS cryptographic protocol and presented the QoP security mechanisms evaluation for the selected versions of this protocol. PMID:25136683
A New Method for the Evaluation and Prediction of Base Stealing Performance.

PubMed

Bricker, Joshua C; Bailey, Christopher A; Driggers, Austin R; McInnis, Timothy C; Alami, Arya

2016-11-01

Bricker, JC, Bailey, CA, Driggers, AR, McInnis, TC, and Alami, A. A new method for the evaluation and prediction of base stealing performance. J Strength Cond Res 30(11): 3044-3050, 2016-The purposes of this study were to evaluate a new method using electronic timing gates to monitor base stealing performance in terms of reliability, differences between it and traditional stopwatch-collected times, and its ability to predict base stealing performance. Twenty-five healthy collegiate baseball players performed maximal effort base stealing trials with a right and left-handed pitcher. An infrared electronic timing system was used to calculate the reaction time (RT) and total time (TT), whereas coaches' times (CT) were recorded with digital stopwatches. Reliability of the TGM was evaluated with intraclass correlation coefficients (ICCs) and coefficient of variation (CV). Differences between the TGM and traditional CT were calculated with paired samples t tests Cohen's d effect size estimates. Base stealing performance predictability of the TGM was evaluated with Pearson's bivariate correlations. Acceptable relative reliability was observed (ICCs 0.74-0.84). Absolute reliability measures were acceptable for TT (CVs = 4.4-4.8%), but measures were elevated for RT (CVs = 32.3-35.5%). Statistical and practical differences were found between TT and CT (right p = 0.00, d = 1.28 and left p = 0.00, d = 1.49). The TGM TT seems to be a decent predictor of base stealing performance (r = -0.49 to -0.61). The authors recommend using the TGM used in this investigation for athlete monitoring because it was found to be reliable, seems to be more precise than traditional CT measured with a stopwatch, provides an additional variable of value (RT), and may predict future performance.
Validity of a verbal incidental learning measure from the WAIS-IV in older adults.

PubMed

Hammers, Dustin B; Kucera, Amanda M; Card, Stephanie J; Tolle, Kathryn A; Atkinson, Taylor J; Duff, Kevin; Spencer, Robert J

2018-01-01

Incidental memory may reflect a form of learning in everyday life, although it is not consistently evaluated during standard neuropsychological evaluations. Further validation of a recently created measure of verbal Incidental Learning (IL) from the Wechsler Adult Intelligence Scale-IV is necessary to understand the utility of such a measure in clinical settings. Sixty-eight adults aged 50 to 89 were recruited from a Cognitive Disorders Clinic while receiving a standard neuropsychological assessment, along with two additional measures of IL. IL-Total Score was significantly correlated with immediate and delayed memory trials from standard neuropsychological tests (rs = .43 to .73, ps < .001, ds = 0.94-2.14), with worse IL performance being associated with lower memory abilities. Participants with probable Alzheimer's disease performed worse on the IL-Total Score than participants with Mild Cognitive Impairment, t(39.997) = 5.46, p < .001, d = 1.13. Given the strong relationships between this IL task and traditional memory measures in our sample, and the discrimination of IL-Total Score performance among diagnostic groups despite its short administration time, this IL task may play a role as a measure of memory in brief cognitive evaluations.

Association between Measures of Academic Performance and Psychosocial Adjustment for Asian/Pacific-Islander Adolescents.

ERIC Educational Resources Information Center

Hishinuma, Earl S.; Foster, Judy E.; Miyamoto, Robin H.; Nishimura, Stephanie T.; Andrade, Naleen N.; Nahulu, Linda B.; Goebert, Deborah A.; Yuen, Noelle Y. C.; Makini, George K., Jr.; Kim, S. Peter; Carlton, Barry S.

2001-01-01

Examines the association between different measures of academic performance and psychological adjustment for a sample of under-researched Asian/Pacific Islander adolescents from Hawaii. Results support the use of the actual quantification of academic performance (i.e. cumulative grade point average or self reported evaluation) in predicting…
Evaluating Performance Measurement Systems in Nonprofit Agencies: The Program Accountability Quality Scale (PAQS).

ERIC Educational Resources Information Center

Poole, Dennis L.; Nelson, Joan; Carnahan, Sharon; Chepenik, Nancy G.; Tubiak, Christine

2000-01-01

Developed and field tested the Performance Accountability Quality Scale (PAQS) on 191 program performance measurement systems developed by nonprofit agencies in central Florida. Preliminary findings indicate that the PAQS provides a structure for obtaining expert opinions based on a theory-driven model about the quality of proposed measurement…
The Relationship between Emotional Intelligence and Student Teacher Performance

ERIC Educational Resources Information Center

Drew, Todd L.

2006-01-01

The purpose of this mixed methods study (N = 40) was to determine whether Student Teacher Performance (STP), as measured by a behavior-based performance evaluation process, is associated with Emotional Intelligence (EI), as measured by a personality assessment instrument. The study is an important contribution to the literature in that it appears…
Race to the Paycheck: Merit Pay and Theories of Teacher Motivation

ERIC Educational Resources Information Center

Horne, Jason; Foley, Virginia P.; Flora, Bethany H.

2014-01-01

Recent reforms in teacher evaluation tie these evaluations to student performance as measured by test scores and merit pay has been offered as a way to reward high test scores and improve teacher performance. Thus, the federal Race to the Top program has led several states toward teacher evaluation instruments that incorporate outcome data in the…
Production and evaluation of measuring equipment for share viscosity of polymer melts included nanofiller with injection molding machine

NASA Astrophysics Data System (ADS)

Kameda, Takao; Sugino, Naoto; Takei, Satoshi

2016-10-01

Shear viscosity measurement device was produced to evaluate the injection molding workability for high-performance resins. Observation was possible in shear rate from 10 to 10000 [1/sec] that were higher than rotary rheometer by measuring with a plasticization cylinder of the injection molding machine. The result of measurements extrapolated result of a measurement of the rotary rheometer.
Development of ultrasonic methods for hemodynamic measurements

NASA Technical Reports Server (NTRS)

Histand, M. B.; Miller, C. W.; Wells, M. K.; Mcleod, F. D.; Greene, E. R.; Winter, D.

1975-01-01

A transcutanous method to measure instantaneous mean blood flow in peripheral arteries of the human body was defined. Transcutanous and implanted cuff ultrasound velocity measurements were evaluated, and the accuracies of velocity, flow, and diameter measurements were assessed for steady flow. Performance criteria were established for the pulsed Doppler velocity meter (PUDVM), and performance tests were conducted. Several improvements are suggested.
Relation between measures of speech-in-noise performance and measures of efferent activity

NASA Astrophysics Data System (ADS)

Smith, Brad; Harkrider, Ashley; Burchfield, Samuel; Nabelek, Anna

2003-04-01

Individual differences in auditory perceptual abilities in noise are well documented but the factors causing such variability are unclear. The purpose of this study was to determine if individual differences in responses measured from the auditory efferent system were correlated to individual variations in speech-in-noise performance. The relation between behavioral performance on three speech-in-noise tasks and two objective measures of the efferent auditory system were examined in thirty normal-hearing, young adults. Two of the speech-in-noise tasks measured an acceptable noise level, the maximum level of speech-babble noise that a subject is willing to accept while listening to a story. For these, the acceptable noise level was evaluated using both an ipsilateral (story and noise in same ear) and a contralateral (story and noise in opposite ears) paradigm. The third speech-in-noise task evaluated speech recognition using monosyllabic words presented in competing speech babble. Auditory efferent activity was assessed by examining the resulting suppression of click-evoked otoacoustic emissions following the introduction of a contralateral, broad-band stimulus and the activity of the ipsilateral and contralateral acoustic reflex arc was evaluated using tones and broad-band noise. Results will be discussed relative to current theories of speech in noise performance and auditory inhibitory processes.
USING BIOASSAYS TO EVALUATE THE PERFORMANCE OF RISK MANAGEMENT TECHNIQUES

EPA Science Inventory

Often, the performance of risk management techniques is evaluated by measuring the concentrations of the chemials of concern before and after risk management effoprts. However, using bioassays and chemical data provides a more robust understanding of the effectiveness of risk man...
Evaluating Comparability in the Scoring of Performance Assessments for Accountability Purposes

ERIC Educational Resources Information Center

Lyons, Susan; Evans, Carla

2017-01-01

This brief summarizes "Comparability in Balanced Assessment Systems for State Accountability," published in "Educational Measurement: Issues and Practice" (Evans & Lyons 2017). The study evaluated comparability claims in local scoring of performance assessments across districts participating in New Hampshire's Performance…
78 FR 79697 - Statement of Organization, Functions, and Delegations of Authority

Federal Register 2010, 2011, 2012, 2013, 2014

2013-12-31

... new initiatives based on emerging issues, science, and policy; (6) supports the harmonization and..., and programmatic efforts; (10) manages evaluation fellowship; (11) guides performance-based strategic... improvement based on effective program evaluation, and performance measurement; (14) supports evidence-driven...
USING BIOASSAYS TO EVALUATE THE PERFORMANCE OF EDC RISK MANAGEMENT METHODS

EPA Science Inventory

In Superfund risk management research, the performance of risk management techniques is typically evaluated by measuring "the concentrations of the chemicals of concern before and after risk management efforts. However, using bioassays and chemical data provides a more robust und...
48 CFR 1552.216-77 - Award term incentive.

Code of Federal Regulations, 2010 CFR

2010-10-01

... performance measures for the corresponding evaluation period; or (iii) The Government notifies the contractor....216-77 Section 1552.216-77 Federal Acquisition Regulations System ENVIRONMENTAL PROTECTION AGENCY...) based on overall contractor performance as evaluated in accordance with the Clause entitled “Award Term...
Classroom Composition and Measured Teacher Performance: What Do Teacher Observation Scores Really Measure?

ERIC Educational Resources Information Center

Steinberg, Matthew P.; Garrett, Rachel

2016-01-01

As states and districts implement more rigorous teacher evaluation systems, measures of teacher performance are increasingly being used to support instruction and inform retention decisions. Classroom observations take a central role in these systems, accounting for the majority of teacher ratings upon which accountability decisions are based.…
Factors affecting construction performance: exploratory factor analysis

NASA Astrophysics Data System (ADS)

Soewin, E.; Chinda, T.

2018-04-01

The present work attempts to develop a multidimensional performance evaluation framework for a construction company by considering all relevant measures of performance. Based on the previous studies, this study hypothesizes nine key factors, with a total of 57 associated items. The hypothesized factors, with their associated items, are then used to develop questionnaire survey to gather data. The exploratory factor analysis (EFA) was applied to the collected data which gave rise 10 factors with 57 items affecting construction performance. The findings further reveal that the items constituting ten key performance factors (KPIs) namely; 1) Time, 2) Cost, 3) Quality, 4) Safety & Health, 5) Internal Stakeholder, 6) External Stakeholder, 7) Client Satisfaction, 8) Financial Performance, 9) Environment, and 10) Information, Technology & Innovation. The analysis helps to develop multi-dimensional performance evaluation framework for an effective measurement of the construction performance. The 10 key performance factors can be broadly categorized into economic aspect, social aspect, environmental aspect, and technology aspects. It is important to understand a multi-dimension performance evaluation framework by including all key factors affecting the construction performance of a company, so that the management level can effectively plan to implement an effective performance development plan to match with the mission and vision of the company.
Measuring Cooperative Biological Engagement Program (CBEP) Performance: Capacities, Capabilities, and Sustainability Enablers for Biorisk Management and Biosurveillance

DTIC Science & Technology

2014-01-01

valid OMB control number. 1. REPORT DATE 2014 2. REPORT TYPE 3. DATES COVERED 00-00-2014 to 00-00-2014 4. TITLE AND SUBTITLE Measuring...should approaches to monitoring program performance. Recognizing this, Congress requested that the Department of Defense improve metrics for measuring...Cooperative Biological Engagement Program Performance broader community of program evaluation practitioners, the work advances innovative approaches
Deployment of a tool for measuring freeway safety performance.

DOT National Transportation Integrated Search

2011-12-01

This project updated and deployed a freeway safety performance measurement tool, building upon a previous project that developed the core methodology. The tool evaluates the cumulative risk over time of an accident or a particular kind of accident. T...
SELECTION OF ENDOCRINOLOGY SUBSPECIALTY TRAINEES: WHICH APPLICANT CHARACTERISTICS ARE ASSOCIATED WITH PERFORMANCE DURING FELLOWSHIP TRAINING?

PubMed Central

Natt, Neena; Chang, Alice Y.; Berbari, Elie F.; Kennel, Kurt A.; Kearns, Ann E.

2016-01-01

Objective To determine which residency characteristics are associated with performance during endocrinology fellowship training as measured by competency-based faculty evaluation scores and faculty global ratings of trainee performance. Method We performed a retrospective review of interview applications from endocrinology fellows who graduated from a single academic institution between 2006 and 2013. Performance measures included competency-based faculty evaluation scores and faculty global ratings. The association between applicant characteristics and measures of performance during fellowship was examined by linear regression. Results The presence of a laudatory comparative statement in the residency program director’s letter of recommendation (LoR) or experience as a chief resident was significantly associated with competency-based faculty evaluation scores (β = 0.22, P = 0.001; and β = 0.24, P = 0.009, respectively) and faculty global ratings (β = 0.85, P = 0.006; and β = 0.96, P = 0.015, respectively). Conclusion The presence of a laudatory comparative statement in the residency program director’s LoR or experience as a chief resident were significantly associated with overall performance during subspecialty fellowship training. Future studies are needed in other cohorts to determine the broader implications of these findings in the application and selection process. PMID:26437219
GATEWAY Report Brief: Evaluating OLED Lighting in the Accounting Office of DeJoy, Knauf & Blood LLP

DOE Office of Scientific and Technical Information (OSTI.GOV)

None

Summary of GATEWAY report evaluating a new lighting system, at the offices of the accounting firm of DeJoy, Knauf & Blood, LLP in Rochester, NY, that incorporates a number of different OLED luminaires. Evaluation of the OLED products included efficacy performance, field measurements of panel color, flicker measurements, and staff feedback.
Engineering evaluation of SSME dynamic data from engine tests and SSV flights

NASA Technical Reports Server (NTRS)

1986-01-01

An engineering evaluation of dynamic data from SSME hot firing tests and SSV flights is summarized. The basic objective of the study is to provide analyses of vibration, strain and dynamic pressure measurements in support of MSFC performance and reliability improvement programs. A brief description of the SSME test program is given and a typical test evaluation cycle reviewed. Data banks generated to characterize SSME component dynamic characteristics are described and statistical analyses performed on these data base measurements are discussed. Analytical models applied to define the dynamic behavior of SSME components (such as turbopump bearing elements and the flight accelerometer safety cut-off system) are also summarized. Appendices are included to illustrate some typical tasks performed under this study.
Ductless Mini-Split Heat Pump Comfort Evaluation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Roth, K.; Sehgal, N.; Akers, C.

2013-03-01

Field tests were conducted in two homes in Austin, TX, to evaluate the comfort performance of ductless minisplit heat pumps (DMSHPs), measuring temperature and relative humidity measurements in four rooms in each home before and after retrofitting a central HVAC system with DMSHPs.

Ductless Mini-Split Heat Pump Comfort Evaluation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Roth, K.; Sehgal, N.; Akers, C.

2013-03-01

Field tests were conducted in two homes in Austin, TX to evaluate the comfort performance of ductless mini-split heat pumps (DMSHPs), measuring temperature and relative humidity measurements in four rooms in each home before and after retrofitting a central HVAC system with DMSHPs.
Kinematic Analysis and Performance Evaluation of Novel PRS Parallel Mechanism

NASA Astrophysics Data System (ADS)

Balaji, K.; Khan, B. Shahul Hamid

2018-02-01

In this paper, a 3 DoF (Degree of Freedom) novel PRS (Prismatic-Revolute- Spherical) type parallel mechanisms has been designed and presented. The combination of striaght and arc type linkages for 3 DOF parallel mechanism is introduced for the first time. The performances of the mechanisms are evaluated based on the indices such as Minimum Singular Value (MSV), Condition Number (CN), Local Conditioning Index (LCI), Kinematic Configuration Index (KCI) and Global Conditioning Index (GCI). The overall reachable workspace of all mechanisms are presented. The kinematic measure, dexterity measure and workspace analysis for all the mechanism have been evaluated and compared.
Evaluating the use of key performance indicators to evidence the patient experience.

PubMed

McCance, Tanya; Hastings, Jack; Dowler, Hilda

2015-11-01

To test eight person-centred key performance indicators and the feasibility of an appropriate measurement framework as an approach to evidencing the patient experience. The value of measuring the quality of patient care is undisputed in the international literature, however, the type of measures that can be used to generate data that is meaningful for practice continues to be debated. This paper offers a different perspective to the 'measurement' of the nursing and midwifery contribution to the patient experience. Fourth generation evaluation was the methodological approach used to evaluate the implementation of the key performance indicators and measurement framework across three participating organisations involving nine practice settings. Data were collected by repeated use of claims, concerns and issues with staff working across nine participating sites (n = 18) and the senior executives from the three partner organisations (n = 12). Data were collected during the facilitated sessions with stakeholders and analysed in conjunction with the data generated from the measurement framework. The data reveal the inherent value placed on the evidence generated from the implementation of the key performance indicators as reflected in the following themes: measuring what matters; evidencing the patient experience; engaging staff; a focus for improving practice; and articulating and demonstrating the positive contribution of nursing and midwifery. The implementation of the key performance indicators and the measurement framework has been effective in generating evidence that demonstrates the patient experience. The nature of the data generated not only privileges the patient voice but also offers feedback to nurses and midwives that can inform the development of person-centred cultures. The use of these indicators will produce evidence of patient experience that can be used by nurse and midwives to celebrate and further inform person-centred practice. © 2015 John Wiley & Sons Ltd.
Self Evaluation of Organizations.

ERIC Educational Resources Information Center

Pooley, Richard C.

Evaluation within human service organizations is defined in terms of accepted evaluation criteria, with reasonable expectations shown and structured into a model of systematic evaluation practice. The evaluation criteria of program effort, performance, adequacy, efficiency and process mechanisms are discussed, along with measurement information…
Virtual tape measure for the operating microscope: system specifications and performance evaluation.

PubMed

Kim, M Y; Drake, J M; Milgram, P

2000-01-01

The Virtual Tape Measure for the Operating Microscope (VTMOM) was created to assist surgeons in making accurate 3D measurements of anatomical structures seen in the surgical field under the operating microscope. The VTMOM employs augmented reality techniques by combining stereoscopic video images with stereoscopic computer graphics, and functions by relying on an operator's ability to align a 3D graphic pointer, which serves as the end-point of the virtual tape measure, with designated locations on the anatomical structure being measured. The VTMOM was evaluated for its baseline and application performances as well as its application efficacy. Baseline performance was determined by measuring the mean error (bias) and standard deviation of error (imprecision) in measurements of non-anatomical objects. Application performance was determined by comparing the error in measuring the dimensions of aneurysm models with and without the VTMOM. Application efficacy was determined by comparing the error in selecting the appropriate aneurysm clip size with and without the VTMOM. Baseline performance indicated a bias of 0.3 mm and an imprecision of 0.6 mm. Application bias was 3.8 mm and imprecision was 2.8 mm for aneurysm diameter. The VTMOM did not improve aneurysm clip size selection accuracy. The VTMOM is a potentially accurate tool for use under the operating microscope. However, its performance when measuring anatomical objects is highly dependent on complex visual features of the object surfaces. Copyright 2000 Wiley-Liss, Inc.
A knowledge based search tool for performance measures in health care systems.

PubMed

Beyan, Oya D; Baykal, Nazife

2012-02-01

Performance measurement is vital for improving the health care systems. However, we are still far from having accepted performance measurement models. Researchers and developers are seeking comparable performance indicators. We developed an intelligent search tool to identify appropriate measures for specific requirements by matching diverse care settings. We reviewed the literature and analyzed 229 performance measurement studies published after 2000. These studies are evaluated with an original theoretical framework and stored in the database. A semantic network is designed for representing domain knowledge and supporting reasoning. We have applied knowledge based decision support techniques to cope with uncertainty problems. As a result we designed a tool which simplifies the performance indicator search process and provides most relevant indicators by employing knowledge based systems.
Measurement properties of performance-based measures to assess physical function in hip and knee osteoarthritis: a systematic review.

PubMed

Dobson, F; Hinman, R S; Hall, M; Terwee, C B; Roos, E M; Bennell, K L

2012-12-01

To systematically review the measurement properties of performance-based measures to assess physical function in people with hip and/or knee osteoarthritis (OA). Electronic searches were performed in MEDLINE, CINAHL, Embase, and PsycINFO up to the end of June 2012. Two reviewers independently rated measurement properties using the consensus-based standards for the selection of health status measurement instrument (COSMIN). "Best evidence synthesis" was made using COSMIN outcomes and the quality of findings. Twenty-four out of 1792 publications were eligible for inclusion. Twenty-one performance-based measures were evaluated including 15 single-activity measures and six multi-activity measures. Measurement properties evaluated included internal consistency (three measures), reliability (16 measures), measurement error (14 measures), validity (nine measures), responsiveness (12 measures) and interpretability (three measures). A positive rating was given to only 16% of possible measurement ratings. Evidence for the majority of measurement properties of tests reported in the review has yet to be determined. On balance of the limited evidence, the 40 m self-paced test was the best rated walk test, the 30 s-chair stand test and timed up and go test were the best rated sit to stand tests, and the Stratford battery, Physical Activity Restrictions and Functional Assessment System were the best rated multi-activity measures. Further good quality research investigating measurement properties of performance measures, including responsiveness and interpretability in people with hip and/or knee OA, is needed. Consensus on which combination of measures will best assess physical function in people with hip/and or knee OA is urgently required. Crown Copyright © 2012. Published by Elsevier Ltd. All rights reserved.
Solar energy system performance evaluation: Seasonal report for Fern Lansing, Lansing, Michigan

NASA Technical Reports Server (NTRS)

1980-01-01

A solar space heating and hot water system's operational performance from April 1979 through March 1980 is evaluated. Solar energy satisfied 15 percent of the total measured load (hot water plus space heating). Net savings were approximately 21 million BTUs.
COMPARING THE SOLID PHASE AND SALINE EXTRACT MICROTOX(R) ASSAYS FOR TWO PAH CONTAMINATED SOILS

EPA Science Inventory

The performance of remedial treatments is typically evaluated by measuring the concentration of specific chemicals. By adding toxicity bioassays to treatment evaluations, a fuller understanding of treatment performance is obtained. The solid phase Microtox assay is one potenti...
New approach to enhance and evaluate the performance of vehicle-infrastructure integration and its communication systems, final report.

DOT National Transportation Integrated Search

2010-09-01

Initial research studied the use of wireless local area networks (WLAN) protocols in Inter-Vehicle Communications : (IVC) environments. The protocols performance was evaluated in terms of measuring throughput, jitter time and : delay time. This re...
34 CFR 645.32 - How does the Secretary evaluate prior experience?

Code of Federal Regulations, 2010 CFR

2010-07-01

... performance under its expiring Upward Bound grant. This information includes information derived from annual performance reports, audit reports, site visit reports, project evaluation reports, and any other verifiable... project participants have demonstrated improvement in academic skills and competencies as measured by...
Using Alternative Student Growth Measures for Evaluating Teacher Performance: What the Literature Says. REL 2013-002

ERIC Educational Resources Information Center

Gill, Brian; Bruch, Julie; Booker, Kevin

2013-01-01

States are increasingly interested in including measures of student achievement growth, or "value- added," in evaluating teachers. Annual state assessments, however, which are the typical measure of student growth, usually cover only reading and math teachers and only in grades 4-8. These state assessments thus cannot …
Perception of premenstrual syndrome and attitude of evaluations of work performance among incoming university female students.

PubMed

Cheng, Shu Hui; Sun, Zih-Jie; Lee, I Hui; Shih, Chi-Chen; Chen, Kao Chin; Lin, Shih-Hsien; Lu, Feng-Hwa; Yang, Yi-Ching; Yang, Yen Kuang

2015-01-01

Premenstrual syndrome (PMS) is a common condition, and for 5% of women, the influence is so severe as to interfere with their mental health, interpersonal relationships, or studies. Severe PMS may result in decreased occupational productivity. The aim of this study was to investigate the influence of perception of PMS on evaluation of work performance. A total of 1971 incoming female university students were recruited in September 2009. A simulated clinical scenario was used, with a test battery including measurement of psychological symptoms and the Chinese Premenstrual Symptom Questionnaire. When evaluating employee performance in the simulated scenario, 1565 (79.4%) students neglected the impact of PMS, while 136 (6.9%) students considered it. Multivariate logistic regression showed that perception of daily function impairment due to PMS and frequency of measuring body weight were significantly associated with consideration of the influence of PMS on evaluation of work performance. It is important to increase the awareness of functional impairments related to severe PMS.
Time to antibiotics for septic shock: evaluating a proposed performance measure.

PubMed

Venkatesh, Arjun K; Avula, Umakanth; Bartimus, Holly; Reif, Justin; Schmidt, Michael J; Powell, Emilie S

2013-04-01

International guidelines recommend antibiotics within 1 hour of septic shock recognition; however, a recently proposed performance measure is focused on measuring antibiotic administration within 3 hours of emergency department (ED) arrival. Our objective was to describe the time course of septic shock and subsequent implications for performance measurement. Cross-sectional study of consecutive ED patients ultimately diagnosed with septic shock. All patients were evaluated at an urban, academic ED in 2006 to 2008. Primary outcomes included time to definition of septic shock and performance on 2 measures: antibiotics within 3 hours of ED arrival vs antibiotics within 1 hour of septic shock definition. Of 267 patients with septic shock, the median time to definition was 88 minutes (interquartile range, 37-156), and 217 patients (81.9%) met the definition within 3 hours of arrival. Of 221 (83.4%) of patients who received antibiotics within 3 hours of arrival, 38 (17.2%) did not receive antibiotics within 1 hour of definition. Of 207 patients who received antibiotics within 1 hour of definition, 11.6% (n = 24) did not receive antibiotics within 3 hours of arrival. The arrival measure did not accurately classify performance in 23.4% of patients. Nearly 1 of 5 patients cannot be captured for performance measurement within 3 hours of ED arrival due to the variable progression of septic shock. Use of this measure would misclassify performance in 23% of patients. Measuring antibiotic administration based on the clinical course of septic shock rather than from ED arrival would be more appropriate. Copyright © 2013 Elsevier Inc. All rights reserved.
Guiding Principles and Checklist for Population-Based Quality Metrics

PubMed Central

Brunelli, Steven M.; Maddux, Franklin W.; Parker, Thomas F.; Johnson, Douglas; Nissenson, Allen R.; Collins, Allan; Lacson, Eduardo

2014-01-01

The Centers for Medicare and Medicaid Services oversees the ESRD Quality Incentive Program to ensure that the highest quality of health care is provided by outpatient dialysis facilities that treat patients with ESRD. To that end, Centers for Medicare and Medicaid Services uses clinical performance measures to evaluate quality of care under a pay-for-performance or value-based purchasing model. Now more than ever, the ESRD therapeutic area serves as the vanguard of health care delivery. By translating medical evidence into clinical performance measures, the ESRD Prospective Payment System became the first disease-specific sector using the pay-for-performance model. A major challenge for the creation and implementation of clinical performance measures is the adjustments that are necessary to transition from taking care of individual patients to managing the care of patient populations. The National Quality Forum and others have developed effective and appropriate population-based clinical performance measures quality metrics that can be aggregated at the physician, hospital, dialysis facility, nursing home, or surgery center level. Clinical performance measures considered for endorsement by the National Quality Forum are evaluated using five key criteria: evidence, performance gap, and priority (impact); reliability; validity; feasibility; and usability and use. We have developed a checklist of special considerations for clinical performance measure development according to these National Quality Forum criteria. Although the checklist is focused on ESRD, it could also have broad application to chronic disease states, where health care delivery organizations seek to enhance quality, safety, and efficiency of their services. Clinical performance measures are likely to become the norm for tracking performance for health care insurers. Thus, it is critical that the methodologies used to develop such metrics serve the payer and the provider and most importantly, reflect what represents the best care to improve patient outcomes. PMID:24558050
User Performance Evaluation of Four Blood Glucose Monitoring Systems Applying ISO 15197:2013 Accuracy Criteria and Calculation of Insulin Dosing Errors.

PubMed

Freckmann, Guido; Jendrike, Nina; Baumstark, Annette; Pleus, Stefan; Liebing, Christina; Haug, Cornelia

2018-04-01

The international standard ISO 15197:2013 requires a user performance evaluation to assess if intended users are able to obtain accurate blood glucose measurement results with a self-monitoring of blood glucose (SMBG) system. In this study, user performance was evaluated for four SMBG systems on the basis of ISO 15197:2013, and possibly related insulin dosing errors were calculated. Additionally, accuracy was assessed in the hands of study personnel. Accu-Chek ® Performa Connect (A), Contour ® plus ONE (B), FreeStyle Optium Neo (C), and OneTouch Select ® Plus (D) were evaluated with one test strip lot. After familiarization with the systems, subjects collected a capillary blood sample and performed an SMBG measurement. Study personnel observed the subjects' measurement technique. Then, study personnel performed SMBG measurements and comparison measurements. Number and percentage of SMBG measurements within ± 15 mg/dl and ± 15% of the comparison measurements at glucose concentrations < 100 and ≥ 100 mg/dl, respectively, were calculated. In addition, insulin dosing errors were modelled. In the hands of lay-users three systems fulfilled ISO 15197:2013 accuracy criteria with the investigated test strip lot showing 96% (A), 100% (B), and 98% (C) of results within the defined limits. All systems fulfilled minimum accuracy criteria in the hands of study personnel [99% (A), 100% (B), 99.5% (C), 96% (D)]. Measurements with all four systems were within zones of the consensus error grid and surveillance error grid associated with no or minimal risk. Regarding calculated insulin dosing errors, all 99% ranges were between dosing errors of - 2.7 and + 1.4 units for measurements in the hands of lay-users and between - 2.5 and + 1.4 units for study personnel. Frequent lay-user errors were not checking the test strips' expiry date and applying blood incorrectly. Data obtained in this study show that not all available SMBG systems complied with ISO 15197:2013 accuracy criteria when measurements were performed by lay-users. The study was registered at ClinicalTrials.gov (NCT02916576). Ascensia Diabetes Care Deutschland GmbH.
Measurements methodology for evaluation of Digital TV operation in VHF high-band

NASA Astrophysics Data System (ADS)

Pudwell Chaves de Almeida, M.; Vladimir Gonzalez Castellanos, P.; Alfredo Cal Braz, J.; Pereira David, R.; Saboia Lima de Souza, R.; Pereira da Soledade, A.; Rodrigues Nascimento Junior, J.; Ferreira Lima, F.

2016-07-01

This paper describes the experimental setup of field measurements carried out for evaluating the operation of the ISDB-TB (Integrated Services Digital Broadcasting, Terrestrial, Brazilian version) standard digital TV in the VHF-highband. Measurements were performed in urban and suburban areas in a medium-sized Brazilian city. Besides the direct measurements of received power and environmental noise, a measurement procedure involving the injection of Gaussian additive noise was employed to achieve the signal to noise ratio threshold at each measurement site. The analysis includes results of static reception measurements for evaluating the received field strength and the signal to noise ratio thresholds for correct signal decoding.
NERC Policy 10: Measurement of two generation and load balancing IOS

DOE Office of Scientific and Technical Information (OSTI.GOV)

Spicer, P.J.; Galow, G.G.

1999-11-01

Policy 10 will describe specific standards and metrics for most of the reliability functions described in the Interconnected Operations Services Working Group (IOS WG) report. The purpose of this paper is to discuss, in detail, the proposed metrics for two generation and load balancing IOSs: Regulation; Load Following. For purposes of this paper, metrics include both measurement and performance evaluation. The measurement methods discussed are included in the current draft of the proposed Policy 10. The performance evaluation method discussed is offered by the authors for consideration by the IOS ITF (Implementation Task Force) for inclusion into Policy 10.
A pump monitoring approach to irrigation pumping plant testing

USDA-ARS?s Scientific Manuscript database

The conventional approach for evaluating irrigation pumping plant performance has been an instantaneous spot measurement approach. Using this method, the tester measures the necessary work and energy use parameters to determine overall pumping plant performance. The primary limitation of this appr...
Flight performance measurement utilizing a figure of merit (FOM)

NASA Technical Reports Server (NTRS)

Mosier, Kathleen L.; Zacharias, Greg L.

1993-01-01

One of the goals of the NASA Strategic Behavior/Workload Management Program is to develop standardized procedures for constructing figures of merit (FOMs) that describe minimal criteria for flight task performance, as well as summarize overall performance quality. Such a measure could be utilized for evaluating flight crew performance, for assessing the effectiveness of new equipment or technological innovations, or for measuring performance at a particular airport. In this report, we describe the initial phases in the creation of a FOM to be employed in examining crew performance in NASA-Ames Air Ground Compatibility and Strategic Behavior/Workload Management programs.

Using lagging and leading indicators for the evaluation of occupational safety and health performance in industry.

PubMed

Pawłowska, Zofia

2015-01-01

Improvement of occupational safety and health (OSH) management is closely related to the development of OSH performance measurement, which should include OSH outcomes (e.g., occupational accidents), OSH inputs (including working conditions) and OSH-related activities. The indicators used to measure the OSH outcomes are often called lagging indicators, and the indicators of inputs and OSH activities are leading indicators. A study was conducted in 60 companies in order to determine what kinds of indicators were used for OSH performance measurement by companies with different levels of OSH performance. The results reveal that the indicators most commonly used in all of the companies are those related to ensuring compliance with the statutory requirements. At the same time, the leading indicators are much more often adopted in companies with a higher performance level. These companies also much more often monitor on a regular basis the indicators adopted for the evaluation of their OSH performance.
Using lagging and leading indicators for the evaluation of occupational safety and health performance in industry

PubMed Central

Pawłowska, Zofia

2015-01-01

Improvement of occupational safety and health (OSH) management is closely related to the development of OSH performance measurement, which should include OSH outcomes (e.g., occupational accidents), OSH inputs (including working conditions) and OSH-related activities. The indicators used to measure the OSH outcomes are often called lagging indicators, and the indicators of inputs and OSH activities are leading indicators. A study was conducted in 60 companies in order to determine what kinds of indicators were used for OSH performance measurement by companies with different levels of OSH performance. The results reveal that the indicators most commonly used in all of the companies are those related to ensuring compliance with the statutory requirements. At the same time, the leading indicators are much more often adopted in companies with a higher performance level. These companies also much more often monitor on a regular basis the indicators adopted for the evaluation of their OSH performance. PMID:26647949
MULTI-SITE FIELD EVALUATION OF CANDIDATE SAMPLERS FOR MEASURING COARSE-MODE PM

EPA Science Inventory

In response to expected changes to the National Ambient Air Quality Standards for particulate matter, comprehensive field studies were conducted to evaluate the performance of sampling methods for measuring coarse mode aerosols (i.e. PMc). Five separate PMc sampling approaches w...
COMPUTERIZED NEEDS-ORIENTED QUALITY MEASUREMENT EVALUATION SYSTEM (CONQUEST)

EPA Science Inventory

CONQUEST is an easy-to-use quality improvement software tool that uses a common structure and language to help users identity, understand, compare, evaluate, and select among 1,200 clinical performance measures that can be used to assess and improve quality of care. CONQUEST's in...
A Statistical Evaluation of the Diagnostic Performance of MEDAS-The Medical Emergency Decision Assistance System

PubMed Central

Georgakis, D. Christine; Trace, David A.; Naeymi-Rad, Frank; Evens, Martha

1990-01-01

Medical expert systems require comprehensive evaluation of their diagnostic accuracy. The usefulness of these systems is limited without established evaluation methods. We propose a new methodology for evaluating the diagnostic accuracy and the predictive capacity of a medical expert system. We have adapted to the medical domain measures that have been used in the social sciences to examine the performance of human experts in the decision making process. Thus, in addition to the standard summary measures, we use measures of agreement and disagreement, and Goodman and Kruskal's λ and τ measures of predictive association. This methodology is illustrated by a detailed retrospective evaluation of the diagnostic accuracy of the MEDAS system. In a study using 270 patients admitted to the North Chicago Veterans Administration Hospital, diagnoses produced by MEDAS are compared with the discharge diagnoses of the attending physicians. The results of the analysis confirm the high diagnostic accuracy and predictive capacity of the MEDAS system. Overall, the agreement of the MEDAS system with the “gold standard” diagnosis of the attending physician has reached a 90% level.
Laboratory evaluation of an OTT acoustic digital current meter and a SonTek Laboratory acoustic Doppler velocimeter

USGS Publications Warehouse

Vermeyen, T.B.; Oberg, Kevin A.; Jackson, Patrick Ryan

2009-01-01

Recently, an acoustic current meter known as the OTT * acoustic digital current meter (ADC) was introduced as an alternative instrument for stream gaging measurements. The Bureau of Reclamation and the U.S. Geological Survey collaborated on a side- by-side evaluation of the ADC and a SonTek/YSI acoustic Doppler velocimeter (ADV). Measurements were carried out in a laboratory flume to evaluate the performance characteristics of the ADC under a range of flow and boundary conditions. The flume contained a physical model of a mountain river with a diversion dam and variety of bed materials ranging from smooth mortar to a cobble bed. The instruments were installed on a trolley system that allowed them to be easily moved within the flume while maintaining a consistent probe orientation. More than 50 comparison measurements were made in an effort to verify the manufacturer’s performance specifications and to evaluate potential boundary disturbance for near-bed and vertical boundary measurements. Data and results from this evaluation are presented and discussed.
Use of the Marshall Space Flight Center solar simulator in collector performance evaluation

NASA Technical Reports Server (NTRS)

Humphries, W. R.

1978-01-01

Actual measured values from simulator checkout tests are detailed. Problems encountered during initial startup are discussed and solutions described. Techniques utilized to evaluate collector performance from simulator test data are given. Performance data generated in the simulator are compared to equivalent data generated during natural outdoor testing. Finally, a summary of collector performance parameters generated to date as a result of simulator testing are given.
Surgeon-tool force/torque signatures--evaluation of surgical skills in minimally invasive surgery.

PubMed

Rosen, J; MacFarlane, M; Richards, C; Hannaford, B; Sinanan, M

1999-01-01

The best method of training for laparoscopic surgical skills is controversial. Some advocate observation in the operating room, while others promote animal and simulated models or a combination of surgical related tasks. The mode of proficiency evaluation common to all of these methods has been subjective evaluation by a skilled surgeon. In order to define an objective means of evaluating performance, an instrumented laparoscopic grasper was developed measuring the force/torque at the surgeon hand/tool interface. The measured database demonstrated substantial differences between experienced and novice surgeon groups. Analyzing forces and torques combined with the state transition during surgical procedures allows an objective measurement of skill in MIS. Teaching the novice surgeon to limit excessive loads and improve movement efficiency during surgical procedures can potentially result in less injury to soft tissues and less wasted time during laparoscopic surgery. Moreover the force/torque database measured in this study may be used for developing realistic virtual reality simulators and optimization of medical robots performance.
Preferences of Training Performance Measurement: A Comparative Study of Training Professionals and Non-Training Managers

ERIC Educational Resources Information Center

Chapman, Diane D.

2004-01-01

This survey-based study addressed a perceived gap between training performance evaluation practice and decision-making criteria required in business. Training professionals and non-training managers in North Carolina were surveyed. The study found that the groups differ in the performance measures that motivate them to act on training issues.…
A framework for improving the cost-effectiveness of DSM program evaluations

DOE Office of Scientific and Technical Information (OSTI.GOV)

Sonnenblick, R.; Eto, J.

The prudence of utility demand-side management (DSM) investments hinges on their performance, yet evaluating performance is complicated because the energy saved by DSM programs can never be observed directly but only inferred. This study frames and begins to answer the following questions: (1) how well do current evaluation methods perform in improving confidence in the measurement of energy savings produced by DSM programs; (2) in view of this performance, how can limited evaluation resources be best allocated to maximize the value of the information they provide? The authors review three major classes of methods for estimating annual energy savings: trackingmore » database (sometimes called engineering estimates), end-use metering, and billing analysis and examine them in light of the uncertainties in current estimates of DSM program measure lifetimes. The authors assess the accuracy and precision of each method and construct trade-off curves to examine the costs of increases in accuracy or precision. Several approaches for improving evaluations for the purpose of assessing program cost effectiveness are demonstrated. The methods can be easily generalized to other evaluation objectives, such as shared savings incentive payments.« less
Gate frequency sweep: An effective method to evaluate the dynamic performance of AlGaN/GaN power heterojunction field effect transistors

DOE Office of Scientific and Technical Information (OSTI.GOV)

Santi, C. de; Meneghini, M., E-mail: matteo.meneghini@dei.unipd.it; Meneghesso, G.

2014-08-18

With this paper we propose a test method for evaluating the dynamic performance of GaN-based transistors, namely, gate-frequency sweep measurements: the effectiveness of the method is verified by characterizing the dynamic performance of Gate Injection Transistors. We demonstrate that this method can provide an effective description of the impact of traps on the transient performance of Heterojunction Field Effect Transistors, and information on the properties (activation energy and cross section) of the related defects. Moreover, we discuss the relation between the results obtained by gate-frequency sweep measurements and those collected by conventional drain current transients and double pulse characterization.
Documenting Teacher Candidates' Professional Growth through Performance Evaluation

ERIC Educational Resources Information Center

Brown, Elizabeth Levine; Suh, Jennifer; Parsons, Seth A.; Parker, Audra K.; Ramirez, Erin M.

2015-01-01

In the United States, colleges of education are responding to demands for increased accountability. The purpose of this article is to describe one teacher education program's implementation of a performance evaluation tool during final internship that measures teacher candidates' development across four domains: Planning and Preparation,…
Performance Validity Testing in Neuropsychology: Scientific Basis and Clinical Application-A Brief Review.

PubMed

Greher, Michael R; Wodushek, Thomas R

2017-03-01

Performance validity testing refers to neuropsychologists' methodology for determining whether neuropsychological test performances completed in the course of an evaluation are valid (ie, the results of true neurocognitive function) or invalid (ie, overly impacted by the patient's effort/engagement in testing). This determination relies upon the use of either standalone tests designed for this sole purpose, or specific scores/indicators embedded within traditional neuropsychological measures that have demonstrated this utility. In response to a greater appreciation for the critical role that performance validity issues play in neuropsychological testing and the need to measure this variable to the best of our ability, the scientific base for performance validity testing has expanded greatly over the last 20 to 30 years. As such, the majority of current day neuropsychologists in the United States use a variety of measures for the purpose of performance validity testing as part of everyday forensic and clinical practice and address this issue directly in their evaluations. The following is the first article of a 2-part series that will address the evolution of performance validity testing in the field of neuropsychology, both in terms of the science as well as the clinical application of this measurement technique. The second article of this series will review performance validity tests in terms of methods for development of these measures, and maximizing of diagnostic accuracy.
Approaches to chronic disease management evaluation in use in Europe: a review of current methods and performance measures.

PubMed

Conklin, Annalijn; Nolte, Ellen; Vrijhoef, Hubertus

2013-01-01

An overview was produced of approaches currently used to evaluate chronic disease management in selected European countries. The study aims to describe the methods and metrics used in Europe as a first to help advance the methodological basis for their assessment. A common template for collection of evaluation methods and performance measures was sent to key informants in twelve European countries; responses were summarized in tables based on template evaluation categories. Extracted data were descriptively analyzed. Approaches to the evaluation of chronic disease management vary widely in objectives, designs, metrics, observation period, and data collection methods. Half of the reported studies used noncontrolled designs. The majority measure clinical process measures, patient behavior and satisfaction, cost and utilization; several also used a range of structural indicators. Effects are usually observed over 1 or 3 years on patient populations with a single, commonly prevalent, chronic disease. There is wide variation within and between European countries on approaches to evaluating chronic disease management in their objectives, designs, indicators, target audiences, and actors involved. This study is the first extensive, international overview of the area reported in the literature.
Assessing the validity and reliability of the Malagasy version of Oral Impacts on Daily Performance (OIDP): a cross-sectional study.

PubMed

Razanamihaja, Noeline; Ranivoharilanto, Eva

2017-01-01

Evaluating health needs includes measures of the impact of state of health on the quality of life. This entails evaluating the psychosocial aspects of health. To achieve this, several tools for measuring the quality of life related to oral health have been developed. However, it is vital to evaluate the psychometric properties of these tools so they can be used in a new context and on a new population. The purpose of this study was to evaluate the reliability and validity of the Malagasy version of a questionnaire for studying the impacts of oral-dental health on daily activities (Oral Impacts on Daily Performance), and analyse the interrelations between the scores obtained and the oral health indicators. A cross-sectional study was performed for the transcultural adaptation of the Oral Impacts on Daily Performance questionnaire forward translated and back-translated from English to Malagasy and from Malagasy to English, respectively. The psychometric characteristics of the Malagasy version of the Oral Impacts on Daily Performance were then evaluated in terms of internal reliability, test-retest, and construct, criteria and discriminant validity. Four hundred and six adults responded in face-to-face interviews to the Malagasy version of the Oral Impacts on Daily Performance questionnaire. Nearly 74% of the participants indicated impacts of their oral health on their performance in their daily lives during the 6 months prior to the survey. The activities most affected were: "smiling", "eating" and "sleeping and relaxing". Cronbach's alpha was 0.87. The construct validity was demonstrated by a significant association between the Oral Impacts on Daily Performance scores and the subjective evaluation of oral health ( p <0.001). Discriminant validity was demonstrated by the fact that the Oral Impacts on Daily Performance scores were significantly higher in subjects with more than ten missing teeth, compared to those with fewer than ten missing teeth ( p < 0.001). The Malagasy version of the Oral Impacts on Daily Performance index is a valid and reliable measure for use in Malagasy adults over 55 years old.
A Primer on Building Teacher Evaluation Instruments.

ERIC Educational Resources Information Center

Bitner, Ted; Kratzner, Ron

This paper presents a primer on building a scientifically oriented teacher evaluation instrument. It stresses the importance of accurate measures and accepts the presupposition that scientific approaches provide the most accurate measures of student teacher performance. The paper discusses the scientific concepts of validity and reliability, and…
An evaluation of the performance of concretes containing fly ash and ground slag in bridge decks.

DOT National Transportation Integrated Search

2006-01-01

Cores from 36 bridge decks were evaluated to assess the condition and quality of the concrete by petrographic methods and direct and indirect measures of the transport properties. Transport properties were measured by a rate of absorption test (ASTM ...
Quantification of error associated with stormwater and wastewater flow measurement devices

EPA Science Inventory

A novel flow testbed has been designed to evaluate the performance of flumes as flow measurement devices. The newly constructed testbed produces both steady and unsteady flows ranging from 10 to 1500 gpm. Two types of flumes (Parshall and trapezoidal) are evaluated under differen...
Summary of ORSphere Critical and Reactor Physics Measurements

DOE Office of Scientific and Technical Information (OSTI.GOV)

Marshall, Margaret A.; Bess, John D.

In the early 1970s Dr. John T. Mihalczo (team leader), J. J. Lynn, and J. R. Taylor performed experiments at the Oak Ridge Critical Experiments Facility (ORCEF) with highly enriched uranium (HEU) metal (called Oak Ridge Alloy or ORALLOY) to recreate GODIVA I results with greater accuracy than those performed at Los Alamos National Laboratory in the 1950s. The purpose of the Oak Ridge ORALLOY Sphere (ORSphere) experiments was to estimate the unreflected and unmoderated critical mass of an idealized sphere of uranium metal corrected to a density, purity, and enrichment such that it could be compared with the GODIVAmore » I experiments. This critical configuration has been evaluated. Preliminary results were presented at ND2013. Since then, the evaluation was finalized and judged to be an acceptable benchmark experiment for the International Criticality Safety Benchmark Experiment Project (ICSBEP). Additionally, reactor physics measurements were performed to determine surface button worths, central void worth, delayed neutron fraction, prompt neutron decay constant, fission density and neutron importance. These measurements have been evaluated and found to be acceptable experiments and are discussed in full detail in the International Handbook of Evaluated Reactor Physics Benchmark Experiments. The purpose of this paper is summary summarize all the critical and reactor physics measurements evaluations and, when possible, to compare them to GODIVA experiment results.« less
The PATH project in eight European countries: an evaluation.

PubMed

Veillard, Jeremy Henri Maurice; Schiøtz, Michaela Louise; Guisset, Ann-Lise; Brown, Adalsteinn Davidson; Klazinga, Niek S

2013-01-01

This paper's aim is to evaluate the perceived impact and the enabling factors and barriers experienced by hospital staff participating in an international hospital performance measurement project focused on internal quality improvement. Semi-structured interviews involving international hospital performance measurement project coordinators, including 140 hospitals from eight European countries (Belgium, Estonia, France, Germany, Hungary, Poland, Slovakia and Slovenia). Inductively analyzing the interview transcripts was carried out using the grounded theory approach. Even when public reporting is absent, the project was perceived as having stimulated performance measurement and quality improvement initiatives in participating hospitals. Attention should be paid to leadership/ownership, context, content (project intrinsic features) and processes supporting elements. Generalizing the findings is limited by the study's small sample size. Possible implications for the WHO European Regional Office and for participating hospitals would be to assess hospital preparedness to participate in the PATH project, depending on context, process and structural elements; and enhance performance and practice benchmarking through suggested approaches. This research gathered rich and unique material related to an international performance measurement project. It derived actionable findings.

Sofia Observatory Performance and Characterization

NASA Technical Reports Server (NTRS)

Temi, Pasquale; Miller, Walter; Dunham, Edward; McLean, Ian; Wolf, Jurgen; Becklin, Eric; Bida, Tom; Brewster, Rick; Casey, Sean; Collins, Peter;

2012-01-01

The Stratospheric Observatory for Infrared Astronomy (SOFIA) has recently concluded a set of engineering flights for Observatory performance evaluation. These in-flight opportunities have been viewed as a first comprehensive assessment of the Observatory's performance and will be used to address the development activity that is planned for 2012, as well as to identify additional Observatory upgrades. A series of 8 SOFIA Characterization And Integration (SCAI) flights have been conducted from June to December 2011. The HIPO science instrument in conjunction with the DSI Super Fast Diagnostic Camera (SFDC) have been used to evaluate pointing stability, including the image motion due to rigid-body and flexible-body telescope modes as well as possible aero-optical image motion. We report on recent improvements in pointing stability by using an Active Mass Damper system installed on Telescope Assembly. Measurements and characterization of the shear layer and cavity seeing, as well as image quality evaluation as a function of wavelength have been performed using the HIPO+FLITECAM Science Instrument configuration (FLIPO). A number of additional tests and measurements have targeted basic Observatory capabilities and requirements including, but not limited to, pointing accuracy, chopper evaluation and imager sensitivity. SCAI activities included in-flight partial Science Instrument commissioning prior to the use of the instruments as measuring engines. This paper reports on the data collected during the SCAI flights and presents current SOFIA Observatory performance and characterization.

A Scoping Review of Physical Performance Outcome Measures Used in Exercise Interventions for Older Adults With Alzheimer Disease and Related Dementias.

PubMed

McGough, Ellen L; Lin, Shih-Yin; Belza, Basia; Becofsky, Katie M; Jones, Dina L; Liu, Minhui; Wilcox, Sara; Logsdon, Rebecca G

2017-11-28

There is growing evidence that exercise interventions can mitigate functional decline and reduce fall risk in older adults with Alzheimer disease and related dementias (ADRD). Although physical performance outcome measures have been successfully used in older adults without cognitive impairment, additional research is needed regarding their use with individuals who have ADRD, and who may have difficulty following instructions regarding performance of these measures. The purpose of this scoping review was to identify commonly used physical performance outcome measures, for exercise interventions, that are responsive and reliable in older adults with ADRD. Ultimately, we aimed to provide recommendations regarding the use of outcome measures for individuals with ADRD across several domains of physical performance. A scoping review was conducted to broadly assess physical performance outcome measures used in exercise interventions for older adults with ADRD. Exercise intervention studies that included at least 1 measure of physical performance were included. All physical performance outcome measures were abstracted, coded, and categorized into 5 domains of physical performance: fitness, functional mobility, gait, balance, and strength. Criteria for recommendations were based on (1) the frequency of use, (2) responsiveness, and (3) reliability. Frequency was determined by the number of studies that used the outcome measure per physical performance domain. Responsiveness was assessed via calculated effect size of the outcome measures across studies within physical performance domains. Reliability was evaluated via published studies of psychometric properties. A total of 20 physical performance outcome measures were extracted from 48 articles that met study inclusion criteria. The most frequently used outcome measures were the 6-minute walk test, Timed Up and Go, repeated chair stand tests, short-distance gait speed, the Berg Balance Scale, and isometric strength measures. These outcome measures demonstrated a small, medium, or large effect in at least 50% of the exercise intervention studies. Good to excellent reliability was reported in samples of older adults with mild to moderate dementia. Fitness, functional mobility, gait, balance, and strength represent important domains of physical performance for older adults. The 6-minute walk test, Timed Up and Go, repeated chair stand tests, short-distance gait speed, Berg Balance Scale, and isometric strength are recommended as commonly used and reliable physical performance outcome measures for exercise interventions in older adults with mild to moderate ADRD. Further research is needed on optimal measures for individuals with severe ADRD. The results of this review will aid clinicians and researchers in selecting reliable measures to evaluate physical performance outcomes in response to exercise interventions in older adults with ADRD.
Multidimensional assessment of homework: an analysis of students with ADHD.

PubMed

Mautone, Jennifer A; Marshall, Stephen A; Costigan, Tracy E; Clarke, Angela T; Power, Thomas J

2012-10-01

Homework can have beneficial effects for students; however, it presents challenges, particularly for students with attention problems. Although effective homework interventions exist, intervention development and evaluation has been hampered by the lack of psychometrically sound measures. The primary purpose of this study was to evaluate the construct validity of the Homework Performance Questionnaire (HPQ), Parent and Teacher Versions, in a sample of children with ADHD. A secondary purpose was to examine variations in homework performance as a function of individual characteristics, such as academic achievement, quality of the family-school relationship, and child's diagnostic status. The sample included 91 children (34% female) with ADHD in Grades 2 to 6. Measures included parent and teacher ratings of homework performance and the quality of the parent-teacher relationship as well as direct assessment of child academic achievement and homework performance (i.e., samples of completed assignments). Correlational analyses were used to examine construct validity, and ANOVAs were used to evaluate group differences. Each factor of the HPQ had a significant relationship with other measures of relevant constructs. There were no significant differences in homework performance between groups for ADHD subtype, medication status, or comorbidity, with the exception of learning disability. Children with ADHD and learning disabilities had significantly lower teacher ratings of academic competence. Results of the present study suggest that HPQ scores may be used to make valid inferences about the homework performance of children with attention problems. These rating scales may be helpful in progress monitoring and evaluating intervention effectiveness.
A novel measure and significance testing in data analysis of cell image segmentation.

PubMed

Wu, Jin Chu; Halter, Michael; Kacker, Raghu N; Elliott, John T; Plant, Anne L

2017-03-14

Cell image segmentation (CIS) is an essential part of quantitative imaging of biological cells. Designing a performance measure and conducting significance testing are critical for evaluating and comparing the CIS algorithms for image-based cell assays in cytometry. Many measures and methods have been proposed and implemented to evaluate segmentation methods. However, computing the standard errors (SE) of the measures and their correlation coefficient is not described, and thus the statistical significance of performance differences between CIS algorithms cannot be assessed. We propose the total error rate (TER), a novel performance measure for segmenting all cells in the supervised evaluation. The TER statistically aggregates all misclassification error rates (MER) by taking cell sizes as weights. The MERs are for segmenting each single cell in the population. The TER is fully supported by the pairwise comparisons of MERs using 106 manually segmented ground-truth cells with different sizes and seven CIS algorithms taken from ImageJ. Further, the SE and 95% confidence interval (CI) of TER are computed based on the SE of MER that is calculated using the bootstrap method. An algorithm for computing the correlation coefficient of TERs between two CIS algorithms is also provided. Hence, the 95% CI error bars can be used to classify CIS algorithms. The SEs of TERs and their correlation coefficient can be employed to conduct the hypothesis testing, while the CIs overlap, to determine the statistical significance of the performance differences between CIS algorithms. A novel measure TER of CIS is proposed. The TER's SEs and correlation coefficient are computed. Thereafter, CIS algorithms can be evaluated and compared statistically by conducting the significance testing.
Apollo 15 mission report, supplement 4: Descent propulsion system final flight evaluation

NASA Technical Reports Server (NTRS)

Avvenire, A. T.; Wood, S. C.

1972-01-01

The results of a postflight analysis of the LM-10 Descent Propulsion System (DPS) during the Apollo 15 Mission are reported. The analysis determined the steady state performance of the DPS during the descent phase of the manned lunar landing. Flight measurement discrepancies are discussed. Simulated throttle performance results are cited along with overall performance results. Evaluations of the propellant quantity gaging system, propellant loading, pressurization system, and engine are reported. Graphic illustrations of the evaluations are included.
Using Student Test Scores to Measure Teacher Performance: Some Problems in the Design and Implementation of Evaluation Systems

ERIC Educational Resources Information Center

Ballou, Dale; Springer, Matthew G.

2015-01-01

Our aim in this article is to draw attention to some underappreciated problems in the design and implementation of evaluation systems that incorporate value-added measures. We focus on four: (1) taking into account measurement error in teacher assessments, (2) revising teachers' scores as more information becomes available about their students,…
National Quality Forum Colon Cancer Quality Metric Performance: How Are Hospitals Measuring Up?

PubMed

Mason, Meredith C; Chang, George J; Petersen, Laura A; Sada, Yvonne H; Tran Cao, Hop S; Chai, Christy; Berger, David H; Massarweh, Nader N

2017-12-01

To evaluate the impact of care at high-performing hospitals on the National Quality Forum (NQF) colon cancer metrics. The NQF endorses evaluating ≥12 lymph nodes (LNs), adjuvant chemotherapy (AC) for stage III patients, and AC within 4 months of diagnosis as colon cancer quality indicators. Data on hospital-level metric performance and the association with survival are unclear. Retrospective cohort study of 218,186 patients with resected stage I to III colon cancer in the National Cancer Data Base (2004-2012). High-performing hospitals (>75% achievement) were identified by the proportion of patients achieving each measure. The association between hospital performance and survival was evaluated using Cox shared frailty modeling. Only hospital LN performance improved (15.8% in 2004 vs 80.7% in 2012; trend test, P < 0.001), with 45.9% of hospitals performing well on all 3 measures concurrently in the most recent study year. Overall, 5-year survival was 75.0%, 72.3%, 72.5%, and 69.5% for those treated at hospitals with high performance on 3, 2, 1, and 0 metrics, respectively (log-rank, P < 0.001). Care at hospitals with high metric performance was associated with lower risk of death in a dose-response fashion [0 metrics, reference; 1, hazard ratio (HR) 0.96 (0.89-1.03); 2, HR 0.92 (0.87-0.98); 3, HR 0.85 (0.80-0.90); 2 vs 1, HR 0.96 (0.91-1.01); 3 vs 1, HR 0.89 (0.84-0.93); 3 vs 2, HR 0.95 (0.89-0.95)]. Performance on metrics in combination was associated with lower risk of death [LN + AC, HR 0.86 (0.78-0.95); AC + timely AC, HR 0.92 (0.87-0.98); LN + AC + timely AC, HR 0.85 (0.80-0.90)], whereas individual measures were not [LN, HR 0.95 (0.88-1.04); AC, HR 0.95 (0.87-1.05)]. Less than half of hospitals perform well on these NQF colon cancer metrics concurrently, and high performance on individual measures is not associated with improved survival. Quality improvement efforts should shift focus from individual measures to defining composite measures encompassing the overall multimodal care pathway and capturing successful transitions from one care modality to another.
Aluminum Data Measurements and Evaluation for Criticality Safety Applications

NASA Astrophysics Data System (ADS)

Leal, L. C.; Guber, K. H.; Spencer, R. R.; Derrien, H.; Wright, R. Q.

2002-12-01

The Defense Nuclear Facility Safety Board (DNFSB) Recommendation 93-2 motivated the US Department of Energy (DOE) to develop a comprehensive criticality safety program to maintain and to predict the criticality of systems throughout the DOE complex. To implement the response to the DNFSB Recommendation 93-2, a Nuclear Criticality Safety Program (NCSP) was created including the following tasks: Critical Experiments, Criticality Benchmarks, Training, Analytical Methods, and Nuclear Data. The Nuclear Data portion of the NCSP consists of a variety of differential measurements performed at the Oak Ridge Electron Linear Accelerator (ORELA) at the Oak Ridge National Laboratory (ORNL), data analysis and evaluation using the generalized least-squares fitting code SAMMY in the resolved, unresolved, and high energy ranges, and the development and benchmark testing of complete evaluations for a nuclide for inclusion into the Evaluated Nuclear Data File (ENDF/B). This paper outlines the work performed at ORNL to measure, evaluate, and test the nuclear data for aluminum for applications in criticality safety problems.
Cultural values and performance appraisal: assessing the effects of rater self-construal on performance ratings.

PubMed

Mishra, Vipanchi; Roch, Sylvia G

2013-01-01

Much of the prior research investigating the influence of cultural values on performance ratings has focused either on conducting cross-national comparisons among raters or using cultural level individualism/collectivism scales to measure the effects of cultural values on performance ratings. Recent research has shown that there is considerable within country variation in cultural values, i.e. people in one country can be more individualistic or collectivistic in nature. Taking the latter perspective, the present study used Markus and Kitayama's (1991) conceptualization of independent and interdependent self-construals as measures of individual variations in cultural values to investigate within culture variations in performance ratings. Results suggest that rater self-construal has a significant influence on overall performance evaluations; specifically, raters with a highly interdependent self-construal tend to show a preference for interdependent ratees, whereas raters high on independent self-construal do not show a preference for specific type of ratees when making overall performance evaluations. Although rater self-construal significantly influenced overall performance evaluations, no such effects were observed for specific dimension ratings. Implications of these results for performance appraisal research and practice are discussed.
Performance evaluation and accuracy of passive capillary samplers (PCAPs) for estimating real-time drainage water fluxes

USDA-ARS?s Scientific Manuscript database

Successful monitoring of pollutant transport through the soil profile requires accurate, reliable, and appropriate instrumentation to measure amount of drainage water or flux within the vadose layer. We evaluated the performance and accuracy of automated passive capillary wick samplers (PCAPs) for ...
Development of a Rubric for Collegiate Jazz Improvisation Performance Assessment

ERIC Educational Resources Information Center

Moore, Kendall Ryan

2016-01-01

The purpose of this study was to develop a jazz improvisation rubric for the evaluation of collegiate jazz improvisation. To create this measure, research objectives were devised to investigate the aurally-observed performer-controlled components of improvisation, which aurally-observed components should be evaluated in an improvisatory…
Implementation and Performance Evaluation Using the Fuzzy Network Balanced Scorecard

ERIC Educational Resources Information Center

Tseng, Ming-Lang

2010-01-01

The balanced scorecard (BSC) is a multi-criteria evaluation concept that highlights the importance of performance measurement. However, although there is an abundance of literature on the BSC framework, there is a scarcity of literature regarding how the framework with dependence and interactive relationships should be properly implemented in…
Performance Evaluation of Automated Passive Capillary Sampler for Estimating Water Drainage in the Vadose Zone

USDA-ARS?s Scientific Manuscript database

Passive capillary samplers (PCAPs) are widely used to monitor, measure and sample drainage water under saturated and unsaturated soil conditions in the vadose zone. The objective of this study was to evaluate the performance and accuracy of automated passive capillary sampler for estimating drainage...
A Practical Approach to Sex Fair Performance Evaluation in Secondary Physical Education.

ERIC Educational Resources Information Center

McGonagle, Kenneth; Stevens, Ann

A method of sex-fair performance evaluation is presented which can be used in coeducational secondary school physical education classes. This method tallies specific skill areas associated with athletic activities, disregarding such concepts as student improvement, level of competition, participation, effort, and exact skill measurement.…
Effect of time span and task load on pilot mental workload

NASA Technical Reports Server (NTRS)

Berg, S. L.; Sheridan, T. B.

1986-01-01

Two sets of simulations designed to examine how a pilot's mental workload is affected by continuous manual-control activity versus discrete mental tasks that included the length of time between receiving an assignment and executing it are described. The first experiment evaluated two types of measures: objective performance indicators and subjective ratings. Subjective ratings for the two missions were different, but the objective performance measures were similar. In the second experiments, workload levels were increased and a second performance measure was taken. Mental workload had no influence on either performance-based workload measure. Subjective ratings discriminated among the scenarios and correlated with performance measures for high-workload flights. The number of mental tasks performed did not influence error rates, although high manual workloads did increase errors.
A Novel Method for Assessing Task Complexity in Outpatient Clinical-Performance Measures.

PubMed

Hysong, Sylvia J; Amspoker, Amber B; Petersen, Laura A

2016-04-01

Clinical-performance measurement has helped improve the quality of health-care; yet success in attaining high levels of quality across multiple domains simultaneously still varies considerably. Although many sources of variability in care quality have been studied, the difficulty required to complete the clinical work itself has received little attention. We present a task-based methodology for evaluating the difficulty of clinical-performance measures (CPMs) by assessing the complexity of their component requisite tasks. Using Functional Job Analysis (FJA), subject-matter experts (SMEs) generated task lists for 17 CPMs; task lists were rated on ten dimensions of complexity, and then aggregated into difficulty composites. Eleven outpatient work SMEs; 133 VA Medical Centers nationwide. Clinical Performance: 17 outpatient CPMs (2000-2008) at 133 VA Medical Centers nationwide. Measure Difficulty: for each CPM, the number of component requisite tasks and the average rating across ten FJA complexity scales for the set of tasks comprising the measure. Measures varied considerably in the number of component tasks (M = 10.56, SD = 6.25, min = 5, max = 25). Measures of chronic care following acute myocardial infarction exhibited significantly higher measure difficulty ratings compared to diabetes or screening measures, but not to immunization measures ([Formula: see text] = 0.45, -0.04, -0.05, and -0.06 respectively; F (3, 186) = 3.57, p = 0.015). Measure difficulty ratings were not significantly correlated with the number of component tasks (r = -0.30, p = 0.23). Evaluating the difficulty of achieving recommended CPM performance levels requires more than simply counting the tasks involved; using FJA to assess the complexity of CPMs' component tasks presents an alternate means of assessing the difficulty of primary-care CPMs and accounting for performance variation among measures and performers. This in turn could be used in designing performance reward programs, or to match workflow to clinician time and effort.
Evaluation of 16 measures of mental workload using a simulated flight task emphasizing mediational activity

NASA Technical Reports Server (NTRS)

Wierwille, W. W.; Rahimi, M.; Casali, J. G.

1985-01-01

As aircraft and other systems become more automated, a shift is occurring in human operator participation in these systems. This shift is away from manual control and toward activities that tap the higher mental functioning of human operators. Therefore, an experiment was performed in a moving-base flight simulator to assess mediational (cognitive) workload measurement. Specifically, 16 workload estimation techniques were evaluated as to their sensitivity and intrusion in a flight task emphasizing mediational behavior. Task loading, using navigation problems presented on a display, was treated as an independent variable, and workload-measure values were treated as dependent variables. Results indicate that two mediational task measures, two rating scale measures, time estimation, and two eye behavior measures were reliably sensitive to mediational loading. The time estimation measure did, however, intrude on mediational task performance. Several of the remaining measures were completely insensitive to mediational load.
Physician performance assessment using a composite quality index.

PubMed

Liu, Kaibo; Jain, Shabnam; Shi, Jianjun

2013-07-10

Assessing physician performance is important for the purposes of measuring and improving quality of service and reducing healthcare delivery costs. In recent years, physician performance scorecards have been used to provide feedback on individual measures; however, one key challenge is how to develop a composite quality index that combines multiple measures for overall physician performance evaluation. A controversy arises over establishing appropriate weights to combine indicators in multiple dimensions, and cannot be easily resolved. In this study, we proposed a generic unsupervised learning approach to develop a single composite index for physician performance assessment by using non-negative principal component analysis. We developed a new algorithm named iterative quadratic programming to solve the numerical issue in the non-negative principal component analysis approach. We conducted real case studies to demonstrate the performance of the proposed method. We provided interpretations from both statistical and clinical perspectives to evaluate the developed composite ranking score in practice. In addition, we implemented the root cause assessment techniques to explain physician performance for improvement purposes. Copyright © 2012 John Wiley & Sons, Ltd.
Model Performance Evaluation and Scenario Analysis ...

EPA Pesticide Factsheets

This tool consists of two parts: model performance evaluation and scenario analysis (MPESA). The model performance evaluation consists of two components: model performance evaluation metrics and model diagnostics. These metrics provides modelers with statistical goodness-of-fit measures that capture magnitude only, sequence only, and combined magnitude and sequence errors. The performance measures include error analysis, coefficient of determination, Nash-Sutcliffe efficiency, and a new weighted rank method. These performance metrics only provide useful information about the overall model performance. Note that MPESA is based on the separation of observed and simulated time series into magnitude and sequence components. The separation of time series into magnitude and sequence components and the reconstruction back to time series provides diagnostic insights to modelers. For example, traditional approaches lack the capability to identify if the source of uncertainty in the simulated data is due to the quality of the input data or the way the analyst adjusted the model parameters. This report presents a suite of model diagnostics that identify if mismatches between observed and simulated data result from magnitude or sequence related errors. MPESA offers graphical and statistical options that allow HSPF users to compare observed and simulated time series and identify the parameter values to adjust or the input data to modify. The scenario analysis part of the too
A new concept of feature-based gauge for coordinate measuring arm evaluation

NASA Astrophysics Data System (ADS)

Cuesta, E.; González-Madruga, D.; Alvarez, B. J.; Barreiro, J.

2014-06-01

Articulated arm coordinate measuring machines (AACMM or CMA) have conquered a market share in the actual dimensional metrology field, overall when their role implies the inspection of geometrical and dimensional tolerances in an accurate 3D environment for medium-size parts. However, the unavoidable fact of AACMM manual operation constrains its reliability to a great extent, avoiding rigorous evaluation and casting doubt upon the usefulness of external calibration. In this research, a dimensional gauge especially aimed at AACMM evaluation has been developed. Furthermore, the operator skill will be revealed through the use of this gauge. A set of geometrical features, some of them oriented to evaluate the operator and others the equipment, have been collected for the gauge. The proposed evaluation methodology clearly distinguishes between dimensional and geometrical tolerances (with or without datum references), whereas actual verification standards only consider the former. Next, quality indicators deduced from the measurement results are proposed in order to compare AACMM versus coordinate measuring machine (CMM) performance, assuming that CMM possess the maximum accuracy that AACMM could reach, because CMM combines maximum contact accuracy with minimum operator influence. As a result, AACMM evaluation time could be significantly reduced since this gauge allows us to perform a customized evaluation of only those specific tolerances of interest to the user.

Demystifying Results-Based Performance Measurement.

ERIC Educational Resources Information Center

Jorjani, Hamid

Many evaluators are convinced that Results-based Performance Measurement (RBPM) is an effective tool to improve service delivery and cost effectiveness in both public and private sectors. Successful RBPM requires self-directed and cross-functional work teams and the supporting infrastructure to make it work. There are many misconceptions and…
Measuring Performance in Child Welfare: Secondary Effects of Success.

ERIC Educational Resources Information Center

Usher, Charles L.; Gibbs, Deborah A.; Wildfire, Judith B.

1999-01-01

Draws on findings from evaluations of recent reform initiatives in Alabama, North Carolina, and Ohio to suggest that performance-measurement systems for state child-welfare programs must adapt to changing circumstances, especially when improvements in one area can influence standards and expectations in others. (Author/KB)
Front-end Electronics for Unattended Measurement (FEUM). Prototype Test Plan

DOE Office of Scientific and Technical Information (OSTI.GOV)

Conrad, Ryan C.; Morris, Scott J.; Smith, Leon E.

2015-09-16

The IAEA has requested that PNNL perform an initial set of tests on front-end electronics for unattended measurement (FEUM) prototypes. The FEUM prototype test plan details the tests to be performed, the criteria for evaluation, and the procedures used to execute the tests.
48 CFR 2937.602 - Elements of performance-based contracting.

Code of Federal Regulations, 2010 CFR

2010-10-01

... objectively measurable incentives (e.g., Firm-Fixed-Price, Fixed-Price-Incentive-Fee, or Cost-Plus-Incentive-Fee) is appropriate. However, when contractor performance (e.g., cost control, schedule, or quality/technical) is best evaluated subjectively using qualitative measures, a Cost-Plus-Award-Fee contract may be...
Are university rankings useful to improve research? A systematic review.

PubMed

Vernon, Marlo M; Balas, E Andrew; Momani, Shaher

2018-01-01

Concerns about reproducibility and impact of research urge improvement initiatives. Current university ranking systems evaluate and compare universities on measures of academic and research performance. Although often useful for marketing purposes, the value of ranking systems when examining quality and outcomes is unclear. The purpose of this study was to evaluate usefulness of ranking systems and identify opportunities to support research quality and performance improvement. A systematic review of university ranking systems was conducted to investigate research performance and academic quality measures. Eligibility requirements included: inclusion of at least 100 doctoral granting institutions, be currently produced on an ongoing basis and include both global and US universities, publish rank calculation methodology in English and independently calculate ranks. Ranking systems must also include some measures of research outcomes. Indicators were abstracted and contrasted with basic quality improvement requirements. Exploration of aggregation methods, validity of research and academic quality indicators, and suitability for quality improvement within ranking systems were also conducted. A total of 24 ranking systems were identified and 13 eligible ranking systems were evaluated. Six of the 13 rankings are 100% focused on research performance. For those reporting weighting, 76% of the total ranks are attributed to research indicators, with 24% attributed to academic or teaching quality. Seven systems rely on reputation surveys and/or faculty and alumni awards. Rankings influence academic choice yet research performance measures are the most weighted indicators. There are no generally accepted academic quality indicators in ranking systems. No single ranking system provides a comprehensive evaluation of research and academic quality. Utilizing a combined approach of the Leiden, Thomson Reuters Most Innovative Universities, and the SCImago ranking systems may provide institutions with a more effective feedback for research improvement. Rankings which extensively rely on subjective reputation and "luxury" indicators, such as award winning faculty or alumni who are high ranking executives, are not well suited for academic or research performance improvement initiatives. Future efforts should better explore measurement of the university research performance through comprehensive and standardized indicators. This paper could serve as a general literature citation when one or more of university ranking systems are used in efforts to improve academic prominence and research performance.
USDOT guidance summary for connected vehicle deployments evaluation support.

DOT National Transportation Integrated Search

2016-07-01

The document provides guidance to Pilot Deployers in the timely and successful completion of Concept DevelopmentPhase deliverables, specifically in developing the Performance Measurement and Evaluation Support Plan in Task 5,identifying evaluation-su...
Winter maintenance performance measure.

DOT National Transportation Integrated Search

2016-01-01

The Winter Performance Index is a method of quantifying winter storm events and the DOTs response to them. : It is a valuable tool for evaluating the States maintenance practices, performing post-storm analysis, training : maintenance personnel...
FY 2016 Annual Performance Report

EPA Pesticide Factsheets

Presents detailed performance results, as measured against the targets established in EPA’s FY 2016 Annual Plan and Budget. The Executive Overview section analyzes key performance outcomes and links to FY 2016 program evaluations.
FY 2017 Annual Performance Report (APR)

EPA Pesticide Factsheets

Presents detailed performance results, as measured against the targets established in EPA’s FY 2017 Annual Plan and Budget. The Executive Overview section analyzes key performance outcomes and links to FY 2017 program evaluations.
FY 2015 Annual Performance Report

EPA Pesticide Factsheets

Presents detailed performance results, as measured against the targets established in EPA’s FY 2015 Annual Plan and Budget. The Executive Overview section analyzes key performance outcomes and links to FY 2015 program evaluations.
Performance Evaluation of Phasor Measurement Systems

DOE Office of Scientific and Technical Information (OSTI.GOV)

Huang, Zhenyu; Kasztenny, Bogdan; Madani, Vahid

2008-07-20

After two decades of phasor network deployment, phasor measurements are now available at many major substations and power plants. The North American SynchroPhasor Initiative (NASPI), supported by both the US Department of Energy and the North American Electricity Reliability Council (NERC), provides a forum to facilitate the efforts in phasor technology in North America. Phasor applications have been explored and some are in today’s utility practice. IEEE C37.118 Standard is a milestone in standardizing phasor measurements and defining performance requirements. To comply with IEEE C37.118 and to better understand the impact of phasor quality on applications, the NASPI Performance andmore » Standards Task Team (PSTT) initiated and accomplished the development of two important documents to address characterization of PMUs and instrumentation channels, which leverage prior work (esp. in WECC) and international experience. This paper summarizes the accomplished PSTT work and presents the methods for phasor measurement evaluation.« less
Quality Measures for Dialysis: Time for a Balanced Scorecard

PubMed Central

2016-01-01

Recent federal legislation establishes a merit-based incentive payment system for physicians, with a scorecard for each professional. The Centers for Medicare and Medicaid Services evaluate quality of care with clinical performance measures and have used these metrics for public reporting and payment to dialysis facilities. Similar metrics may be used for the future merit-based incentive payment system. In nephrology, most clinical performance measures measure processes and intermediate outcomes of care. These metrics were developed from population studies of best practice and do not identify opportunities for individualizing care on the basis of patient characteristics and individual goals of treatment. The In-Center Hemodialysis (ICH) Consumer Assessment of Healthcare Providers and Systems (CAHPS) survey examines patients' perception of care and has entered the arena to evaluate quality of care. A balanced scorecard of quality performance should include three elements: population-based best clinical practice, patient perceptions, and individually crafted patient goals of care. PMID:26316622
A Computerized Evaluation of Sensory Memory and Short-term Memory Impairment After Rapid Ascent to 4280 m.

PubMed

Shi, Qing Hai; Ge, Di; Zhao, Wei; Ma, Xue; Hu, Ke Yan; Lu, Yao; Liu, Zheng Xiang; Ran, Ji Hua; Li, Xiao Ling; Zhou, Yu; Fu, Jian Feng

2016-06-01

To evaluate the effect of acute high-altitude exposure on sensory and short-term memory using interactive software, we transported 30 volunteers in a sport utility vehicle to a 4280 m plateau within 3 h. We measured their memory performance on the plain (initial arrival) and 3 h after arrival on the plateau using six measures. Memory performance was significantly poorer on the plateau by four of the six measures. Furthermore, memory performance was significantly poorer in the acute mountain sickness (AMS) group than in the non-AMS group by five of the six measures. These findings indicate that rapid ascent to 4280 m and remaining at this altitude for 3 h resulted in decreased sensory and short-term memory, particularly among participants who developed AMS. Copyright © 2016 The Editorial Board of Biomedical and Environmental Sciences. Published by China CDC. All rights reserved.
A new technique for measuring listening and reading literacy in developing countries

NASA Astrophysics Data System (ADS)

Greene, Barbara A.; Royer, James M.; Anzalone, Stephen

1990-03-01

One problem in evaluating educational interventions in developing countries is the absence of tests that adequately reflect the culture and curriculum. The Sentence Verification Technique is a new procedure for measuring reading and listening comprehension that allows for the development of tests based on materials indigenous to a given culture. The validity of using the Sentence Verification Technique to measure reading comprehension in Grenada was evaluated in the present study. The study involved 786 students at standards 3, 4 and 5. The tests for each standard consisted of passages that varied in difficulty. The students identified as high ability students in all three standards performed better than those identified as low ability. All students performed better with easier passages. Additionally, students in higher standards performed bettter than students in lower standards on a given passage. These results supported the claim that the Sentence Verification Technique is a valid measure of reading comprehension in Grenada.
Performance measures for transform data coding.

NASA Technical Reports Server (NTRS)

Pearl, J.; Andrews, H. C.; Pratt, W. K.

1972-01-01

This paper develops performance criteria for evaluating transform data coding schemes under computational constraints. Computational constraints that conform with the proposed basis-restricted model give rise to suboptimal coding efficiency characterized by a rate-distortion relation R(D) similar in form to the theoretical rate-distortion function. Numerical examples of this performance measure are presented for Fourier, Walsh, Haar, and Karhunen-Loeve transforms.
Metastable Radioxenon Verification Laboratory (MRVL) Year-End Report

DOE Office of Scientific and Technical Information (OSTI.GOV)

Cooper, Matthew W.; Hayes, James C.; Lidey, Lance S.

2014-11-07

This is the year end report that is due to the client. The MRVL system is designed to measure multiple radioxenon isotopes ( 135Xe, 133Xe, 133mXe and 133mXe) simultaneously. The system has 12 channels to load samples and make nuclear measurements. Although the MRVL system has demonstrated excellent stability in measurements of Xe-133 and Xe-135 over the year of evaluation prior to delivery, there has been concern about system stability over measurements performed on samples with orders of magnitude different radioactivity, and samples containing multiple isotopes. To address these concerns, a series of evaluation test have been performed at themore » end-user laboratory. The evaluation was performed in two separate phases. Phase 1 made measurements on isotopically pure Xe-133 from high radioactivity down to the system background levels of activity, addressing the potential count rate dependencies when activities change from extreme high to very low. The second phase performed measurements on samples containing multiple isotopes (Xe-135, Xe-133 and Xe-133m), and addressed concerns about the dependence of isotopic concentrations on the presence of additional isotopes. The MRVL showed a concentration dependence on the Xe-133 due to the amount of Xe-133m that was in the sample. The dependency is due to the decay of Xe-133m into Xe-133. This document focuses on the second phase and will address the analysis used to account for ingrowth of Xe-133 from Xe-133m.« less
45 CFR 2522.700 - How does evaluation differ from performance measurement?

Code of Federal Regulations, 2013 CFR

2013-10-01

... progress, evaluation uses scientifically-based research methods to assess the effectiveness of programs by... services from your program who increase their reading ability from “below grade level” to “at or above grade level”. This measure indicates something good is happening to your program's service beneficiaries...
45 CFR 2522.700 - How does evaluation differ from performance measurement?

Code of Federal Regulations, 2014 CFR

2014-10-01

... progress, evaluation uses scientifically-based research methods to assess the effectiveness of programs by... services from your program who increase their reading ability from “below grade level” to “at or above grade level”. This measure indicates something good is happening to your program's service beneficiaries...
45 CFR 2522.700 - How does evaluation differ from performance measurement?

Code of Federal Regulations, 2012 CFR

2012-10-01

... progress, evaluation uses scientifically-based research methods to assess the effectiveness of programs by... services from your program who increase their reading ability from “below grade level” to “at or above grade level”. This measure indicates something good is happening to your program's service beneficiaries...
45 CFR 2522.700 - How does evaluation differ from performance measurement?

Code of Federal Regulations, 2011 CFR

2011-10-01

... progress, evaluation uses scientifically-based research methods to assess the effectiveness of programs by... services from your program who increase their reading ability from “below grade level” to “at or above grade level”. This measure indicates something good is happening to your program's service beneficiaries...

Evaluating Math Recovery: Measuring Fidelity of Implementation

ERIC Educational Resources Information Center

Munter, Charles; Garrison, Anne; Cobb, Paul; Cordray, David

2010-01-01

In this paper, the authors describe a case of measuring implementation fidelity within an evaluation study of Math Recovery (MR), a pullout tutoring program aimed at increasing the mathematics achievement of low-performing first graders, thereby closing the school-entry achievement gap by enabling them to achieve at the level of their…
High-Stakes, Minimum-Competency Exams: How Competent Are They for Evaluating Teacher Competence?

ERIC Educational Resources Information Center

Goodman, Gay; Arbona, Consuelo; Dominguez de Rameriz, Romilia

2008-01-01

Increasingly, teacher educators recommend authentic, performance-related measures for evaluating teacher candidates. Nevertheless, more states are requiring teachers to pass high-stakes, minimum-competency exams. This study examined the relation between teacher candidate scores on authentic measures and their scores on certification exams required…
Test procedures and performance measures sensitive to automobile steering dynamics. [considering operator/vehicle responses

NASA Technical Reports Server (NTRS)

Klein, R. H.; Mcruer, D. T.; Weir, D.

1975-01-01

A maneuver complex and related performance measures used to evaluate driver/vehicle system responses as effected by variations in the directional response characteristics of passenger cars are described. The complex consists of normal and emergency maneuvers (including random and discrete disturbances) which, taken as a whole, represent all classes of steering functions and all modes of driver response behavior. Measures of driver/vehicle system response and performance in regulation tasks included direct describing function measurements and rms yaw velocity. In transient maneuvers, measures such as steering activity and cone strikes were used.
Performance testing accountability measurements

DOE Office of Scientific and Technical Information (OSTI.GOV)

Oldham, R.D.; Mitchell, W.G.; Spaletto, M.I.

The New Brunswick Laboratory (NBL) provides assessment support to the DOE Operations Offices in the area of Material Control and Accountability (MC and A). During surveys of facilities, the Operations Offices have begun to request from NBL either assistance in providing materials for performance testing of accountability measurements or both materials and personnel to do performance testing. To meet these needs, NBL has developed measurement and measurement control performance test procedures and materials. The present NBL repertoire of performance tests include the following: (1) mass measurement performance testing procedures using calibrated and traceable test weights, (2) uranium elemental concentration (assay)more » measurement performance tests which use ampulated solutions of normal uranyl nitrate containing approximately 7 milligrams of uranium per gram of solution, and (3) uranium isotopic measurement performance tests which use ampulated uranyl nitrate solutions with enrichments ranging from 4% to 90% U-235. The preparation, characterization, and packaging of the uranium isotopic and assay performance test materials were done in cooperation with the NBL Safeguards Measurements Evaluation Program since these materials can be used for both purposes.« less
Choice and Change of Measures in Performance-Measurement Models

DTIC Science & Technology

2005-05-01

associated costs . 3 Discussions of many current accounting and performance-measurement issues can be...change: an exploratory study. Accounting , Organizations, and Society, 24(3), 189-204. Adimando, C., Butler, R., Malley, S ., Ravid, S . A., Shepro, R...impact of contextual and process factors on the evaluation of activity-based costing systems. Accounting , Organizations and Society, 24, 525-559. Antle
Mindfulness, burnout, and effects on performance evaluations in internal medicine residents

PubMed Central

Braun, Sarah E; Auerbach, Stephen M; Rybarczyk, Bruce; Lee, Bennett; Call, Stephanie

2017-01-01

Purpose Burnout has been documented at high levels in medical residents with negative effects on performance. Some dispositional qualities, like mindfulness, may protect against burnout. The purpose of the present study was to assess burnout prevalence among internal medicine residents at a single institution, examine the relationship between mindfulness and burnout, and provide preliminary findings on the relation between burnout and performance evaluations in internal medicine residents. Methods Residents (n = 38) completed validated measures of burnout at three time points separated by 2 months and a validated measure of dispositional mindfulness at baseline. Program director end-of-year performance evaluations were also obtained on 22 milestones used to evaluate internal medicine resident performance; notably, these milestones have not yet been validated for research purposes; therefore, the investigation here is exploratory. Results Overall, 71.1% (n = 27) of the residents met criteria for burnout during the study. Lower scores on the “acting with awareness” facet of dispositional mindfulness significantly predicted meeting burnout criteria χ2(5) = 11.88, p = 0.04. Lastly, meeting burnout criteria significantly predicted performance on three of the performance milestones, with positive effects on milestones from the “system-based practices” and “professionalism” domains and negative effects on a milestone from the “patient care” domain. Conclusion Burnout rates were high in this sample of internal medicine residents and rates were consistent with other reports of burnout during medical residency. Dispositional mindfulness was supported as a protective factor against burnout. Importantly, results from the exploratory investigation of the relationship between burnout and resident evaluations suggested that burnout may improve performance on some domains of resident evaluations while compromising performance on other domains. Implications and directions for future research are discussed. PMID:28860889
Mindfulness, burnout, and effects on performance evaluations in internal medicine residents.

PubMed

Braun, Sarah E; Auerbach, Stephen M; Rybarczyk, Bruce; Lee, Bennett; Call, Stephanie

2017-01-01

Burnout has been documented at high levels in medical residents with negative effects on performance. Some dispositional qualities, like mindfulness, may protect against burnout. The purpose of the present study was to assess burnout prevalence among internal medicine residents at a single institution, examine the relationship between mindfulness and burnout, and provide preliminary findings on the relation between burnout and performance evaluations in internal medicine residents. Residents (n = 38) completed validated measures of burnout at three time points separated by 2 months and a validated measure of dispositional mindfulness at baseline. Program director end-of-year performance evaluations were also obtained on 22 milestones used to evaluate internal medicine resident performance; notably, these milestones have not yet been validated for research purposes; therefore, the investigation here is exploratory. Overall, 71.1% (n = 27) of the residents met criteria for burnout during the study. Lower scores on the "acting with awareness" facet of dispositional mindfulness significantly predicted meeting burnout criteria χ 2 (5) = 11.88, p = 0.04. Lastly, meeting burnout criteria significantly predicted performance on three of the performance milestones, with positive effects on milestones from the "system-based practices" and "professionalism" domains and negative effects on a milestone from the "patient care" domain. Burnout rates were high in this sample of internal medicine residents and rates were consistent with other reports of burnout during medical residency. Dispositional mindfulness was supported as a protective factor against burnout. Importantly, results from the exploratory investigation of the relationship between burnout and resident evaluations suggested that burnout may improve performance on some domains of resident evaluations while compromising performance on other domains. Implications and directions for future research are discussed.
Design logistics performance measurement model of automotive component industry for srengthening competitiveness of dealing AEC 2015

NASA Astrophysics Data System (ADS)

Amran, T. G.; Janitra Yose, Mindy

2018-03-01

As the free trade Asean Economic Community (AEC) causes the tougher competition, it is important that Indonesia’s automotive industry have high competitiveness as well. A model of logistics performance measurement was designed as an evaluation tool for automotive component companies to improve their logistics performance in order to compete in AEC. The design of logistics performance measurement model was based on the Logistics Scorecard perspectives, divided into two stages: identifying the logistics business strategy to get the KPI and arranging the model. 23 KPI was obtained. The measurement result can be taken into consideration of determining policies to improve the performance logistics competitiveness.
Performance, physiological, and oculometer evaluation of VTOL landing displays

NASA Technical Reports Server (NTRS)

North, R. A.; Stackhouse, S. P.; Graffunder, K.

1979-01-01

A methodological approach to measuring workload was investigated for evaluation of new concepts in VTOL aircraft displays. Physiological, visual response, and conventional flight performance measures were recorded for landing approaches performed in the NASA Visual Motion Simulator (VMS). Three displays (two computer graphic and a conventional flight director), three crosswind amplitudes, and two motion base conditions (fixed vs. moving base) were tested in a factorial design. Multivariate discriminant functions were formed from flight performance and/or visual response variables. The flight performance variable discriminant showed maximum differentation between crosswind conditions. The visual response measure discriminant maximized differences between fixed vs. motion base conditions and experimental displays. Physiological variables were used to attempt to predict the discriminant function values for each subject/condition trial. The weights of the physiological variables in these equations showed agreement with previous studies. High muscle tension, light but irregular breathing patterns, and higher heart rate with low amplitude all produced higher scores on this scale and thus represent higher workload levels.
Integration of Virtual Machine Technologies into Hastily Formed Networks in Support of Humanitarian Relief and Disaster Recovery Missions

DTIC Science & Technology

2011-12-01

and measures of effectiveness (MOE). New technologies that offer solid-state hard drives built into modular VDI devices known as appliances ...Joint Reconfigurable Vehicle LAN Local Area Network LOS Line of Sight LTE Long Term Evolution MB Megabyte MOP Measure of Performance MOE Measure ...re-usable measures of performance and measures of effectiveness (MOP and MOE) and evaluation procedures will be applied to this research. A
Contact Thermocouple Methodology and Evaluation for Temperature Measurement in the Laboratory

NASA Technical Reports Server (NTRS)

Brewer, Ethan J.; Pawlik, Ralph J.; Krause, David L.

2013-01-01

Laboratory testing of advanced aerospace components very often requires highly accurate temperature measurement and control devices, as well as methods to precisely analyze and predict the performance of such components. Analysis of test articles depends on accurate measurements of temperature across the specimen. Where possible, this task is accomplished using many thermocouples welded directly to the test specimen, which can produce results with great precision. However, it is known that thermocouple spot welds can initiate deleterious cracks in some materials, prohibiting the use of welded thermocouples. Such is the case for the nickel-based superalloy MarM-247, which is used in the high temperature, high pressure heater heads for the Advanced Stirling Converter component of the Advanced Stirling Radioisotope Generator space power system. To overcome this limitation, a method was developed that uses small diameter contact thermocouples to measure the temperature of heater head test articles with the same level of accuracy as welded thermocouples. This paper includes a brief introduction and a background describing the circumstances that compelled the development of the contact thermocouple measurement method. Next, the paper describes studies performed on contact thermocouple readings to determine the accuracy of results. It continues on to describe in detail the developed measurement method and the evaluation of results produced. A further study that evaluates the performance of different measurement output devices is also described. Finally, a brief conclusion and summary of results is provided.
Comparison of measured and modeled BRDF of natural targets

NASA Astrophysics Data System (ADS)

Boucher, Yannick; Cosnefroy, Helene; Petit, Alain D.; Serrot, Gerard; Briottet, Xavier

1999-07-01

The Bidirectional Reflectance Distribution Function (BRDF) plays a major role to evaluate or simulate the signatures of natural and artificial targets in the solar spectrum. A goniometer covering a large spectral and directional domain has been recently developed by the ONERA/DOTA. It was designed to allow both laboratory and outside measurements. The spectral domain ranges from 0.40 to 0.95 micrometer, with a resolution of 3 nm. The geometrical domain ranges 0 - 60 degrees for the zenith angle of the source and the sensor, and 0 - 180 degrees for the relative azimuth between the source and the sensor. The maximum target size for nadir measurements is 22 cm. The spatial target irradiance non-uniformity has been evaluated and then used to correct the raw measurements. BRDF measurements are calibrated thanks to a spectralon reference panel. Some BRDF measurements performed on sand and short grass and are presented here. Eight bidirectional models among the most popular models found in the literature have been tested on these measured data set. A code fitting the model parameters to the measured BRDF data has been developed. The comparative evaluation of the model performances is carried out, versus different criteria (root mean square error, root mean square relative error, correlation diagram . . .). The robustness of the models is evaluated with respect to the number of BRDF measurements, noise and interpolation.
Performance Evaluation and Community Application of Low-Cost Sensors for Ozone and Nitrogen Dioxide.

PubMed

Duvall, Rachelle M; Long, Russell W; Beaver, Melinda R; Kronmiller, Keith G; Wheeler, Michael L; Szykman, James J

2016-10-13

This study reports on the performance of electrochemical-based low-cost sensors and their use in a community application. CairClip sensors were collocated with federal reference and equivalent methods and operated in a network of sites by citizen scientists (community members) in Houston, Texas and Denver, Colorado, under the umbrella of the NASA-led DISCOVER-AQ Earth Venture Mission. Measurements were focused on ozone (O₃) and nitrogen dioxide (NO₂). The performance evaluation showed that the CairClip O₃/NO₂ sensor provided a consistent measurement response to that of reference monitors (r² = 0.79 in Houston; r² = 0.72 in Denver) whereas the CairClip NO₂ sensor measurements showed no agreement to reference measurements. The CairClip O₃/NO₂ sensor data from the citizen science sites compared favorably to measurements at nearby reference monitoring sites. This study provides important information on data quality from low-cost sensor technologies and is one of few studies that reports sensor data collected directly by citizen scientists.
Everyday memory impairment in patients with temporal lobe epilepsy caused by hippocampal sclerosis.

PubMed

Rzezak, Patrícia; Lima, Ellen Marise; Gargaro, Ana Carolina; Coimbra, Erica; de Vincentiis, Silvia; Velasco, Tonicarlo Rodrigues; Leite, João Pereira; Busatto, Geraldo F; Valente, Kette D

2017-04-01

Patients with temporal lobe epilepsy caused by hippocampal sclerosis (TLE-HS) have episodic memory impairment. Memory has rarely been evaluated using an ecologic measure, even though performance on these tests is more related to patients' memory complaints. We aimed to measure everyday memory of patients with TLE-HS to age- and gender-matched controls. We evaluated 31 patients with TLE-HS and 34 healthy controls, without epilepsy and psychiatric disorders, using the Rivermead Behavioral Memory Test (RBMT), Visual Reproduction (WMS-III) and Logical Memory (WMS-III). We evaluated the impact of clinical variables such as the age of onset, epilepsy duration, AED use, history of status epilepticus, and seizure frequency on everyday memory. Statistical analyses were performed using MANCOVA with years of education as a confounding factor. Patients showed worse performance than controls on traditional memory tests and in the overall score of RBMT. Patients had more difficulties to recall names, a hidden belonging, to deliver a message, object recognition, to remember a story full of details, a previously presented short route, and in time and space orientation. Clinical epilepsy variables were not associated with RBMT performance. Memory span and working memory were correlated with worse performance on RBMT. Patients with TLE-HS demonstrated deficits in everyday memory functions. A standard neuropsychological battery, designed to assess episodic memory, would not evaluate these impairments. Impairment in recalling names, routes, stories, messages, and space/time disorientation can adversely impact social adaptation, and we must consider these ecologic measures with greater attention in the neuropsychological evaluation of patients with memory complaints. Copyright © 2017 Elsevier Inc. All rights reserved.
The effect of various factors on the masticatory performance of removable denture wearer

NASA Astrophysics Data System (ADS)

Pratama, S.; Koesmaningati, H.; Kusdhany, L. S.

2017-08-01

An individual’s masticatory performance concerns his/her ability to break down food in order to facilitate digestion, and it therefore plays an important role in nutrition. Removable dentures are used to rehabilitate a loss of teeth, which could jeopardize masticatory performance. Further, there exist various other factors that can affect masticatory performance. The objective of this research is to analyze the relationship between various factors and masticatory performance. Thirty-four removable denture wearers (full dentures, single complete dentures, or partial dentures) participated in a cross-sectional study of masticatory performance using color-changeable chewing gum (Masticatory Performance Evaluating Gum Xylitol®). The volume of saliva was evaluated using measuring cups, while the residual ridge heights were measured using a modified mouth mirror no. 3 with metric measurements. The residual ridge height and removable-denture-wearing experience exhibited a significant relationship with masticatory performance. However, age, gender, saliva volume, denture type, and the number and location of the missing teeth did not have a statistically significant association with masticatory performance. The residual ridge height influences the masticatory performance of removable denture wearers, since the greater the ridge height, the better the performance. The experience of using dentures also has a statistically significant influence on masticatory performance.
Impact of reconstruction strategies on system performance measures : maximizing safety and mobility while minimizing life-cycle costs : final report, December 8, 2008.

DOT National Transportation Integrated Search

2008-12-08

The objective of this research is to develop a general methodological framework for planning and : evaluating the effectiveness of highway reconstruction strategies on the systems performance : measures, in particular safety, mobility, and the tot...
An Investigation into Specifying Service Level Agreements for Provisioning Cloud Computing Services

DTIC Science & Technology

2012-12-01

IT .................................................................................................... Information Technology KPI ...the service delivery be measured? 3. Key Performance Indicators ( KPIs ): Describe the KPIs and the responsible party for producing the KPIs . 4...level objectives (SLOs) that are evaluated according to measurable Key Performance Indicators ( KPIs ). Automatic SLA protection enables further
Evaluation and comparison of current fetal ultrasound image segmentation methods for biometric measurements: a grand challenge.

PubMed

Rueda, Sylvia; Fathima, Sana; Knight, Caroline L; Yaqub, Mohammad; Papageorghiou, Aris T; Rahmatullah, Bahbibi; Foi, Alessandro; Maggioni, Matteo; Pepe, Antonietta; Tohka, Jussi; Stebbing, Richard V; McManigle, John E; Ciurte, Anca; Bresson, Xavier; Cuadra, Meritxell Bach; Sun, Changming; Ponomarev, Gennady V; Gelfand, Mikhail S; Kazanov, Marat D; Wang, Ching-Wei; Chen, Hsiang-Chou; Peng, Chun-Wei; Hung, Chu-Mei; Noble, J Alison

2014-04-01

This paper presents the evaluation results of the methods submitted to Challenge US: Biometric Measurements from Fetal Ultrasound Images, a segmentation challenge held at the IEEE International Symposium on Biomedical Imaging 2012. The challenge was set to compare and evaluate current fetal ultrasound image segmentation methods. It consisted of automatically segmenting fetal anatomical structures to measure standard obstetric biometric parameters, from 2D fetal ultrasound images taken on fetuses at different gestational ages (21 weeks, 28 weeks, and 33 weeks) and with varying image quality to reflect data encountered in real clinical environments. Four independent sub-challenges were proposed, according to the objects of interest measured in clinical practice: abdomen, head, femur, and whole fetus. Five teams participated in the head sub-challenge and two teams in the femur sub-challenge, including one team who tackled both. Nobody attempted the abdomen and whole fetus sub-challenges. The challenge goals were two-fold and the participants were asked to submit the segmentation results as well as the measurements derived from the segmented objects. Extensive quantitative (region-based, distance-based, and Bland-Altman measurements) and qualitative evaluation was performed to compare the results from a representative selection of current methods submitted to the challenge. Several experts (three for the head sub-challenge and two for the femur sub-challenge), with different degrees of expertise, manually delineated the objects of interest to define the ground truth used within the evaluation framework. For the head sub-challenge, several groups produced results that could be potentially used in clinical settings, with comparable performance to manual delineations. The femur sub-challenge had inferior performance to the head sub-challenge due to the fact that it is a harder segmentation problem and that the techniques presented relied more on the femur's appearance.
Performance and life evaluation of advanced battery technologies for electric vehicle applications

NASA Astrophysics Data System (ADS)

Deluca, W. H.; Gillie, K. R.; Kulaga, J. E.; Smaga, J. A.; Tummillo, A. F.; Webster, C. E.

Advanced battery technology evaluations are performed under simulated electric vehicle (EV) operating conditions at the Argonne Analysis and Diagnostic Laboratory (ADL). The ADL provides a common basis for both performance characterization and life evaluation with unbiased application of tests and analyses. This paper summarizes the performance characterizations and life evaluations conducted in 1990 on nine single cells and fifteen 3- to 360-cell modules that encompass six technologies: (Na/S, Zn/Br, Ni/Fe, Ni/Cd, Ni-metal hydride, and lead-acid). These evaluations were performed for the Department of Energy and Electric Power Research Institute. The results provide battery users, developers, and program managers an interim measure of the progress being made in battery R and D programs, a comparison of battery technologies, and a source of basic data for modelling and continuing R and D.
Performance Evaluation Methods for Assistive Robotic Technology

NASA Astrophysics Data System (ADS)

Tsui, Katherine M.; Feil-Seifer, David J.; Matarić, Maja J.; Yanco, Holly A.

Robots have been developed for several assistive technology domains, including intervention for Autism Spectrum Disorders, eldercare, and post-stroke rehabilitation. Assistive robots have also been used to promote independent living through the use of devices such as intelligent wheelchairs, assistive robotic arms, and external limb prostheses. Work in the broad field of assistive robotic technology can be divided into two major research phases: technology development, in which new devices, software, and interfaces are created; and clinical, in which assistive technology is applied to a given end-user population. Moving from technology development towards clinical applications is a significant challenge. Developing performance metrics for assistive robots poses a related set of challenges. In this paper, we survey several areas of assistive robotic technology in order to derive and demonstrate domain-specific means for evaluating the performance of such systems. We also present two case studies of applied performance measures and a discussion regarding the ubiquity of functional performance measures across the sampled domains. Finally, we present guidelines for incorporating human performance metrics into end-user evaluations of assistive robotic technologies.

Memory awareness profiles differentiate mild cognitive impairment from early-stage dementia: evidence from assessments of performance monitoring and evaluative judgement.

PubMed

Clare, Linda; Whitaker, Christopher J; Roberts, Judith L; Nelis, Sharon M; Martyr, Anthony; Marková, Ivana S; Roth, Ilona; Woods, Robert T; Morris, Robin G

2013-01-01

Measures of memory awareness based on evaluative judgement and performance monitoring are often regarded as equivalent, but the Levels of Awareness Framework suggests they reflect different awareness phenomena. Examination of memory awareness among groups with differing degrees of impairment provides a test of this proposition. Ninety-nine people with dementia (PwD), 30 people with mild cognitive impairment (PwMCI), and their relatives completed isomorphic performance monitoring and evaluative judgement measures of memory awareness and were followed up at 12 and (PwD only) 20 months. In addition to the resulting awareness indices, comparative accuracy scores were calculated using the relatives' data to establish whether any inaccuracy was specific to self-ratings. When making evaluative judgements about their memory in general, both PwD and PwMCI tended to overestimate their own functioning relative to informant ratings made by relatives. When monitoring performance on memory tests, PwD again overestimated performance relative to test scores, but PwMCI were much more accurate. Comparative accuracy scores indicated that, unlike PwD, PwMCI do not show a specific inaccuracy in self-related appraisals. The results support the proposition that awareness indices at the levels of evaluative judgement and performance monitoring should be regarded as reflecting distinct awareness phenomena. Copyright © 2013 S. Karger AG, Basel.
A protocol for evaluating video trackers under real-world conditions.

PubMed

Nawaz, Tahir; Cavallaro, Andrea

2013-04-01

The absence of a commonly adopted performance evaluation framework is hampering advances in the design of effective video trackers. In this paper, we present a single-score evaluation measure and a protocol to objectively compare trackers. The proposed measure evaluates tracking accuracy and failure, and combines them for both summative and formative performance assessment. The proposed protocol is composed of a set of trials that evaluate the robustness of trackers on a range of test scenarios representing several real-world conditions. The protocol is validated on a set of sequences with a diversity of targets (head, vehicle and person) and challenges (occlusions, background clutter, pose changes and scale changes) using six state-of-the-art trackers, highlighting their strengths and weaknesses on more than 187000 frames. The software implementing the protocol and the evaluation results are made available online and new results can be included, thus facilitating the comparison of trackers.
Evaluation of confocal microscopy system performance.

PubMed

Zucker, R M; Price, O

2001-08-01

The confocal laser scanning microscope (CLSM) has been used by scientists to visualize three-dimensional (3D) biological samples. Although this system involves lasers, electronics, optics, and microscopes, there are few published tests that can be used to assess the performance of this equipment. Usually the CLSM is assessed by subjectively evaluating a biological/histological test slide for image quality. Although there is a use for the test slide, there are many other components in the CLSM that need to be assessed. It would be useful if tests existed that produced reference values for machine performance. The aim of this research was to develop quality assurance tests to ensure that the CLSM was stable while delivering reproducible intensity measurements with excellent image quality. Our ultimate research objective was to quantify fluorescence using a CLSM. To achieve this goal, it is essential that the CLSM be stable while delivering known parameters of performance. Using Leica TCS-SP1 and TCS-4D systems, a number of tests have been devised to evaluate equipment performance. Tests measuring dichroic reflectivity, field illumination, lens performance, laser power output, spectral registration, axial resolution, laser stability, photomultiplier tube (PMT) reliability, and system noise were either incorporated from the literature or derived in our laboratory to measure performance. These tests are also applicable to other manufacturer's systems with minor modifications. A preliminary report from our laboratory has addressed a number of the QA issues necessary to achieve CLSM performance. This report extends our initial work on the evaluation of CLSM system performance. Tests that were described previously have been modified and new tests involved in laser stability and sensitivity are described. The QA tests on the CLSM measured laser power, PMT function, dichroic reflection, spectral registration, axial registration, system noise and sensitivity, lens performance, and laser stability. Laser power stability varied between 3% and 30% due to various factors, which may include incompatibility of the fiber-optic polarization with laser polarization, thermal instability of the acoustical optical transmission filter (AOTF), and laser noise. The sensitivity of the system was measured using a 10-microm Spherotech bead and the PMTs were assessed with the CV concept (image noise). The maximum sensitivity obtainable on our TCS-SP1 system measured on the 10-microm Spherotech beads was approximately 4% for 488 nm, 2.5% for 568 nm, 20% for 647 nm, and 19% for 365 nm laser light. The values serve as a comparison to test machine sensitivity from the same or different manufacturers. QA tests are described on the CLSM to assess performance and ensure that reproducing data are obtained. It is suggested strongly that these tests be used in place of a biological/histological sample to evaluate system performance. The tests are more specific and can recognize instrument functionality and problems better than a biological/histological sample. Utilization of this testing approach will eliminate the subjective assessment of the CLSM and may allow the data from different machines to be compared. These tests are essential if one is interested in making intensity measurements on experimental samples as well as obtaining the best signal detection and image resolution from a CLSM. Published 2001 Wiley-Liss, Inc.
The feasibility of meta-cognitive strategy training in acute inpatient stroke rehabilitation: case report.

PubMed

Skidmore, Elizabeth R; Holm, Margo B; Whyte, Ellen M; Dew, Mary Amanda; Dawson, Deirdre; Becker, James T

2011-04-01

Meta-cognitive strategy training may be used to augment inpatient rehabilitation to promote active engagement and subsequent benefit for individuals with cognitive impairments after stroke. We examined the feasibility of administering a form of meta-cognitive strategy training, Cognitive Orientation to daily Occupational Performance (CO-OP), during inpatient rehabilitation. We trained an individual with cognitive impairments after right hemisphere stroke to identify performance problems, set self-selected goals, develop plans to address goals, and evaluate performance improvements. To assess feasibility, we examined the number of meta-cognitive training sessions attended, the number of self-selected goals, and changes in goal-related performance. We also examined changes in rehabilitation engagement and disability. The participant used the meta-cognitive strategy to set eight goals addressing physically oriented, instrumental, and work-related activities. Mean improvement in Canadian Occupational Performance Measure Performance Scale scores was 6.1. Pittsburgh Rehabilitation Participation Scale scores (measuring rehabilitation engagement) improved from 3.2 at admission to 4.9 at discharge. Functional Independence Measure scores (measuring disability) improved from 68 at admission, to 97 at discharge. Performance Assessment of Self-Care Skills scores improved from 1.1 at admission to 2.9 at discharge. The results indicate that meta-cognitive strategy training was feasible during inpatient rehabilitation and warrants further evaluation to determine its effectiveness.
7 CFR 1709.216 - Evaluation criteria and weights.

Code of Federal Regulations, 2013 CFR

2013-01-01

... announcement. (a) Program Design. Reviewers will consider the financial viability of the applicant's revolving... less severe physical and economic challenges. (c) Program evaluation and performance measures...
7 CFR 1709.216 - Evaluation criteria and weights.

Code of Federal Regulations, 2010 CFR

2010-01-01

... announcement. (a) Program Design. Reviewers will consider the financial viability of the applicant's revolving... less severe physical and economic challenges. (c) Program evaluation and performance measures...
7 CFR 1709.216 - Evaluation criteria and weights.

Code of Federal Regulations, 2012 CFR

2012-01-01

... announcement. (a) Program Design. Reviewers will consider the financial viability of the applicant's revolving... less severe physical and economic challenges. (c) Program evaluation and performance measures...
7 CFR 1709.216 - Evaluation criteria and weights.

Code of Federal Regulations, 2011 CFR

2011-01-01

... announcement. (a) Program Design. Reviewers will consider the financial viability of the applicant's revolving... less severe physical and economic challenges. (c) Program evaluation and performance measures...
7 CFR 1709.216 - Evaluation criteria and weights.

Code of Federal Regulations, 2014 CFR

2014-01-01

... announcement. (a) Program Design. Reviewers will consider the financial viability of the applicant's revolving... less severe physical and economic challenges. (c) Program evaluation and performance measures...
Evaluation of 12 blood glucose monitoring systems for self-testing: system accuracy and measurement reproducibility.

PubMed

Freckmann, Guido; Baumstark, Annette; Schmid, Christina; Pleus, Stefan; Link, Manuela; Haug, Cornelia

2014-02-01

Systems for self-monitoring of blood glucose (SMBG) have to provide accurate and reproducible blood glucose (BG) values in order to ensure adequate therapeutic decisions by people with diabetes. Twelve SMBG systems were compared in a standardized manner under controlled laboratory conditions: nine systems were available on the German market and were purchased from a local pharmacy, and three systems were obtained from the manufacturer (two systems were available on the U.S. market, and one system was not yet introduced to the German market). System accuracy was evaluated following DIN EN ISO (International Organization for Standardization) 15197:2003. In addition, measurement reproducibility was assessed following a modified TNO (Netherlands Organization for Applied Scientific Research) procedure. Comparison measurements were performed with either the glucose oxidase method (YSI 2300 STAT Plus™ glucose analyzer; YSI Life Sciences, Yellow Springs, OH) or the hexokinase method (cobas(®) c111; Roche Diagnostics GmbH, Mannheim, Germany) according to the manufacturer's measurement procedure. The 12 evaluated systems showed between 71.5% and 100% of the measurement results within the required system accuracy limits. Ten systems fulfilled with the evaluated test strip lot minimum accuracy requirements specified by DIN EN ISO 15197:2003. In addition, accuracy limits of the recently published revision ISO 15197:2013 were applied and showed between 54.5% and 100% of the systems' measurement results within the required accuracy limits. Regarding measurement reproducibility, each of the 12 tested systems met the applied performance criteria. In summary, 83% of the systems fulfilled with the evaluated test strip lot minimum system accuracy requirements of DIN EN ISO 15197:2003. Each of the tested systems showed acceptable measurement reproducibility. In order to ensure sufficient measurement quality of each distributed test strip lot, regular evaluations are required.
Calibration of automatic performance measures - speed and volume data: volume 2, evaluation of the accuracy of approach volume counts and speeds collected by microwave sensors.

DOT National Transportation Integrated Search

2016-05-01

This study evaluated the accuracy of approach volumes and free flow approach speeds collected by the Wavetronix : SmartSensor Advance sensor for the Signal Performance Metrics system of the Utah Department of Transportation (UDOT), : using the field ...
Remote control circuit breaker evaluation testing. [for space shuttles

NASA Technical Reports Server (NTRS)

Bemko, L. M.

1974-01-01

Engineering evaluation tests were performed on several models/types of remote control circuit breakers marketed in an attempt to gain some insight into their potential suitability for use on the space shuttle vehicle. Tests included the measurement of several electrical and operational performance parameters under laboratory ambient, space simulation, acceleration and vibration environmental conditions.
Grazing Incidence Wavefront Sensing and Verification of X-Ray Optics Performance

NASA Technical Reports Server (NTRS)

Saha, Timo T.; Rohrbach, Scott; Zhang, William W.

2011-01-01

Evaluation of interferometrically measured mirror metrology data and characterization of a telescope wavefront can be powerful tools in understanding of image characteristics of an x-ray optical system. In the development of soft x-ray telescope for the International X-Ray Observatory (IXO), we have developed new approaches to support the telescope development process. Interferometrically measuring the optical components over all relevant spatial frequencies can be used to evaluate and predict the performance of an x-ray telescope. Typically, the mirrors are measured using a mount that minimizes the mount and gravity induced errors. In the assembly and mounting process the shape of the mirror segments can dramatically change. We have developed wavefront sensing techniques suitable for the x-ray optical components to aid us in the characterization and evaluation of these changes. Hartmann sensing of a telescope and its components is a simple method that can be used to evaluate low order mirror surface errors and alignment errors. Phase retrieval techniques can also be used to assess and estimate the low order axial errors of the primary and secondary mirror segments. In this paper we describe the mathematical foundation of our Hartmann and phase retrieval sensing techniques. We show how these techniques can be used in the evaluation and performance prediction process of x-ray telescopes.
Metrics for Performance Evaluation of Patient Exercises during Physical Therapy.

PubMed

Vakanski, Aleksandar; Ferguson, Jake M; Lee, Stephen

2017-06-01

The article proposes a set of metrics for evaluation of patient performance in physical therapy exercises. Taxonomy is employed that classifies the metrics into quantitative and qualitative categories, based on the level of abstraction of the captured motion sequences. Further, the quantitative metrics are classified into model-less and model-based metrics, in reference to whether the evaluation employs the raw measurements of patient performed motions, or whether the evaluation is based on a mathematical model of the motions. The reviewed metrics include root-mean square distance, Kullback Leibler divergence, log-likelihood, heuristic consistency, Fugl-Meyer Assessment, and similar. The metrics are evaluated for a set of five human motions captured with a Kinect sensor. The metrics can potentially be integrated into a system that employs machine learning for modelling and assessment of the consistency of patient performance in home-based therapy setting. Automated performance evaluation can overcome the inherent subjectivity in human performed therapy assessment, and it can increase the adherence to prescribed therapy plans, and reduce healthcare costs.
Evaluation of a device for standardized measurements of reading performance in a prepresbyopic population.

PubMed

Arad, Tschingis; Baumeister, Martin; Bühren, Jens; Kohnen, Thomas

2017-04-20

Automated measurements of reading performance are required for clinical trials involving presbyopia-correcting surgery options. Repeatability of a testing device for reading (Salzburg Reading Desk) was evaluated in a prepresbyopic population. Subjective reading performance of 50 subjects divided into 2 age groups (23-30 years and 38-49 years) with distance-corrected eyes was investigated with different log-scaled reading charts. At study entry, refractive parameters were measured and distance visual acuity assessed. Two standardized binocular measurements were performed for each subject (32.24 ± 9.87 days apart [mean ± SD]). The repeatability of the tests was estimated using correlation coefficients, Wilcoxon signed-rank test, and Bland-Altman method. The test parameters at both maximum reading rate (MRR) measurements demonstrate a strong relationship of age group 2 subjects (correlation coefficient [r] = 0.74 p = 10-4) and of younger subjects (age group 1: r = 0.69, p = 10-4). Prepresbyopic subjects of age group 2 showed moderate results for near reading distance (r = 0.67, p = 10-4); by contrast, younger subjects had poorer results (r = 0.55, p = 10-3). The Wilcoxon signed-rank test revealed agreement between measurements and Bland-Altman plots showed a wide data spread for MRR and near reading distance in both groups. The device measures repeatedly selected reading performance parameters of near real world conditions, such as MRR, in prepresbyopic populations if several factors are taken into account. The option to choose preferred distance leads to more variance in measuring repeated reading performance. German Clinical Trials Register (DRKS) registration reference number: DRKS00000784.
SynchroPhasor Measurements: System Architecture and Performance Evaluation in Supporting Wide-Area Applications

DOE Office of Scientific and Technical Information (OSTI.GOV)

Huang, Zhenyu; Dagle, Jeffery E.

2008-07-31

The infrastructure of phasor measurements have evolved over the last two decades from isolated measurement units to networked measurement systems with footprints beyond individual utility companies. This is, to a great extent, a bottom-up self-evolving process except some local systems built by design. Given the number of phasor measurement units (PMUs) in the system is small (currently 70 each in western and eastern interconnections), current phasor network architecture works just fine. However, the architecture will become a bottleneck when large number of PMUs are installed (e.g. >1000~10000). The need for phasor architecture design has yet to be addressed. This papermore » reviews the current phasor networks and investigates future architectures, as related to the efforts undertaken by the North America SynchroPhasor Initiative (NASPI). Then it continues to present staged system tests to evaluate the performance of phasor networks, which is a common practice in the Western Electricity Coordinating Council (WECC) system. This is followed by field measurement evaluation and the implication of phasor quality issues on phasor applications.« less
Findings and Preliminary Recommendations from the Michigan State and Indiana University Research Study of Value-Added Models to Evaluate Teacher Performance

ERIC Educational Resources Information Center

Guarino, Cassandra M.

2013-01-01

The push for accountability in public schooling has extended to the measurement of teacher performance, accelerated by federal efforts through Race to the Top. Currently, a large number of states and districts across the country are computing measures of teacher performance based on the standardized test scores of their students and using them in…
Evaluating Organic Aerosol Model Performance: Impact of two Embedded Assumptions

NASA Astrophysics Data System (ADS)

Jiang, W.; Giroux, E.; Roth, H.; Yin, D.

2004-05-01

Organic aerosols are important due to their abundance in the polluted lower atmosphere and their impact on human health and vegetation. However, modeling organic aerosols is a very challenging task because of the complexity of aerosol composition, structure, and formation processes. Assumptions and their associated uncertainties in both models and measurement data make model performance evaluation a truly demanding job. Although some assumptions are obvious, others are hidden and embedded, and can significantly impact modeling results, possibly even changing conclusions about model performance. This paper focuses on analyzing the impact of two embedded assumptions on evaluation of organic aerosol model performance. One assumption is about the enthalpy of vaporization widely used in various secondary organic aerosol (SOA) algorithms. The other is about the conversion factor used to obtain ambient organic aerosol concentrations from measured organic carbon. These two assumptions reflect uncertainties in the model and in the ambient measurement data, respectively. For illustration purposes, various choices of the assumed values are implemented in the evaluation process for an air quality model based on CMAQ (the Community Multiscale Air Quality Model). Model simulations are conducted for the Lower Fraser Valley covering Southwest British Columbia, Canada, and Northwest Washington, United States, for a historical pollution episode in 1993. To understand the impact of the assumed enthalpy of vaporization on modeling results, its impact on instantaneous organic aerosol yields (IAY) through partitioning coefficients is analysed first. The analysis shows that utilizing different enthalpy of vaporization values causes changes in the shapes of IAY curves and in the response of SOA formation capability of reactive organic gases to temperature variations. These changes are then carried into the air quality model and cause substantial changes in the organic aerosol modeling results. In another aspect, using different assumed factors to convert measured organic carbon to organic aerosol concentrations cause substantial variations in the processed ambient data themselves, which are normally used as performance targets for model evaluations. The combination of uncertainties in the modeling results and in the moving performance targets causes major uncertainties in the final conclusion about the model performance. Without further information, the best thing that a modeler can do is to choose a combination of the assumed values from the sensible parameter ranges available in the literature, based on the best match of the modeling results with the processed measurement data. However, the best match of the modeling results with the processed measurement data may not necessarily guarantee that the model itself is rigorous and the model performance is robust. Conclusions on the model performance can only be reached with sufficient understanding of the uncertainties and their impact.
Quality of Care for PTSD and Depression in the Military Health System

DTIC Science & Technology

evaluate the receipt of recommended assessments and treatments. These measures draw on multiple data sources including administrative encounter data...services are effective in reducing symptoms. When comparing performance between 20122013 and 20132014, most measures demonstrated slight improvement ...in 20132014 for over 38,000 active-component service members with PTSD or depression. The assessment includes performance on 30 quality measures to
GATEWAY Demonstrations: OLED Lighting in the Offices of DeJoy, Knauf & Blood, LLP

DOE Office of Scientific and Technical Information (OSTI.GOV)

Miller, Naomi J.

At the offices of the accounting firm of DeJoy, Knauf & Blood, LLP in Rochester, NY, the GATEWAY program evaluated a new lighting system that incorporates a number of different OLED luminaires. Evaluation of the OLED products included efficacy performance, field measurements of panel color, flicker measurements, and staff feedback.

Evaluation of Self-Perceptions of Creativity: Is It a Useful Criterion?

ERIC Educational Resources Information Center

Reiter-Palmon, Roni; Robinson-Morral, Erika J.; Kaufman, James C.; Santo, Jonathan B.

2012-01-01

Self-evaluations or self-perceptions of creativity have been used in the past both as predictors of creative performance and as criteria. Four measures utilizing self-perceptions of creativity were assessed for their usefulness as criterion measures of creativity. Analyses provided evidence of domain specificity of self-perceptions. The scales…
Examining Teacher Effectiveness Using Classroom Observation Scores: Evidence from the Randomization of Teachers to Students

ERIC Educational Resources Information Center

Garrett, Rachel; Steinberg, Matthew P.

2015-01-01

Despite policy efforts to encourage multiple measures of performance in newly developing teacher evaluation systems, practical constraints often result in evaluations based predominantly on formal classroom observations. Yet there is limited knowledge of how these observational measures relate to student achievement. This article leverages the…
The Development of NOAA Education Common Outcome Performance Measures (Invited)

NASA Astrophysics Data System (ADS)

Baek, J.

2013-12-01

The National Oceanic and Atmospheric Administration (NOAA) Education Council has embarked on an ambitious Monitoring and Evaluation (M&E) project that will allow it to assess education program outcomes and impacts across the agency, line offices, and programs. The purpose of this internal effort is to link outcome measures to program efforts and to evaluate the success of the agency's education programs in meeting the strategic goals. Using an outcome-based evaluation approach, the NOAA Education Council is developing two sets of common outcome performance measures, environmental stewardship and professional development. This presentation will examine the benefits and tradeoffs of common outcome performance measures that collect program results across a portfolio of education programs focused on common outcomes. Common outcome performance measures have a few benefits to our agency and to the climate education field at large. The primary benefit is shared understanding, which comes from our process for writing common outcome performance measures. Without a shared and agreed upon set of definitions for the measure of an outcome, the reported results may not be measuring the same things and would incorrectly indicate levels of performance. Therefore, our writing process relies on a commitment to developing a shared set of definitions based on consensus. We hope that by taking the time to debate and coming to agreement across a diverse set of programs, the strength of our common measures can indicate real progress towards outcomes we care about. An additional benefit is that these common measures can be adopted and adapted by other agencies and organizations that share similar theories of change. The measures are not without their drawbacks, and we do make tradeoffs as part of our process in order to continue making progress. We know that any measure is necessarily a narrow slice of performance. A slice that may not best represent the unique and remarkable contribution of an individual program, but does reflect a variety of contributions along a single dimension across a large portfolio of programs. The process has ended up pushing our working group to call for even more measures, to capture an increasing number of dimensions that reflect the nature of the portfolio of programs. This past year we have been working on developing two sets of common outcome performance measures for professional development (PD) and stewardship education programs. The outcome we chose for PD programs was the use of what was learned in the educator's practice. The outcome we chose for stewardship programs was the stewardship behaviors that participants learn and practice. The measurement of these outcomes will inform whether our strategies are having their intended impact. By knowing how and how much these outcomes are occurring as a result of our program, we can improve program performance over time. The common outcome performance measures help demonstrate how these programs engage audiences in supporting NOAA's mission. As AGU climate literacy community continues to grow, it is important to consider an approach to demonstrate the community's contribution to the Nation's climate literacy. Development of common outcome performance measures is one approach that could help focus the community in meeting its goals.
Assessing resident's knowledge and communication skills using four different evaluation tools.

PubMed

Nuovo, Jim; Bertakis, Klea D; Azari, Rahman

2006-07-01

This study assesses the relationship between 4 Accreditation Council for Graduate Medical Education (ACGME) outcome project measures for interpersonal and communication skills and medical knowledge; specifically, monthly performance evaluations, objective structured clinical examinations (OSCEs), the American Board of Family Practice in-training examination (ABFP-ITE) and the Davis observation code (DOC) practice style profiles. Based on previous work, we have DOC scoring for 29 residents from the University of California, Davis Department of Family and Community Medicine. For all these residents we also had the results of monthly performance evaluations, 2 required OSCE exercises, and the results of 3 American Board of Family Medicine (ABFM) ITEs. Data for each of these measures were abstracted for each resident. The Pearson correlation coefficient was used to assess the presence or lack of correlation between each of these evaluation methods. There is little correlation between various evaluation methods used to assess medical knowledge, and there is also little correlation between various evaluation methods used to assess communication skills. The outcome project remains a 'work in progress', with the need for larger studies to assess the value of different assessment measures of resident competence. It is unlikely that DOC will become a useful evaluation tool.
Data envelopment analysis in service quality evaluation: an empirical study

NASA Astrophysics Data System (ADS)

Najafi, Seyedvahid; Saati, Saber; Tavana, Madjid

2015-09-01

Service quality is often conceptualized as the comparison between service expectations and the actual performance perceptions. It enhances customer satisfaction, decreases customer defection, and promotes customer loyalty. Substantial literature has examined the concept of service quality, its dimensions, and measurement methods. We introduce the perceived service quality index (PSQI) as a single measure for evaluating the multiple-item service quality construct based on the SERVQUAL model. A slack-based measure (SBM) of efficiency with constant inputs is used to calculate the PSQI. In addition, a non-linear programming model based on the SBM is proposed to delineate an improvement guideline and improve service quality. An empirical study is conducted to assess the applicability of the method proposed in this study. A large number of studies have used DEA as a benchmarking tool to measure service quality. These models do not propose a coherent performance evaluation construct and consequently fail to deliver improvement guidelines for improving service quality. The DEA models proposed in this study are designed to evaluate and improve service quality within a comprehensive framework and without any dependency on external data.
Evaluation of Nutritional Status in Children during Predialysis, or Treated By Peritoneal Dialysis or Hemodialysis.

PubMed

Yılmaz, Dilek; Sönmez, Ferah; Karakaş, Sacide; Yavaşcan, Önder; Aksu, Nejat; Ömürlü, İmran Kurt; Yenisey, Çiğdem

2016-06-01

Malnutrition is one of the major causes of morbidity and mortality in children with chronic kidney disease (CKD). The objective of this study was to evaluate nutritional status of children with stage 3-4 CKD and treated by peritoneal dialysis or hemodialysis using anthropometric measurements, biochemical parameters and bioelectrical impedance analysis. The study included a total of 52 patients and 46 healthy children. In anthropometric evaluation, the children with CKD had lower values for standard deviation score for weight, height, body mass index, skinfold thickness and mid-arm circumference than those of healthy children (p < 0.05). The fat mass (%) and the body cell mass (%) measurements performed by bioelectrical impedance analysis were lower compared with the control group (p < 0.05). It is considered that bioelectrical impedance analysis measurement should be used with anthropometric measurements, which are easy to perform, to achieve more accurate nutritional evaluation in children. © The Author [2016]. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Measurement of the 241Am neutron capture cross section at the n_TOF facility at CERN

NASA Astrophysics Data System (ADS)

Mendoza, E.; Cano-Ott, D.; Altstadt, S.; Andriamonje, S.; Andrzejewski, J.; Audouin, L.; Balibrea, J.; Bécares, V.; Barbagallo, M.; Bečvář, F.; Belloni, F.; Berthier, B.; Berthoumieux, E.; Billowes, J.; Boccone, V.; Bosnar, D.; Brugger, M.; Calviño, F.; Calviani, M.; Carrapiço, C.; Cerutti, F.; Chiaveri, E.; Chin, M.; Colonna, N.; Cortés, G.; Cortés-Giraldo, M. A.; Diakaki, M.; Dillmann, I.; Domingo-Pardo, C.; Durán, I.; Dzysiuk, N.; Eleftheriadis, C.; Fernández-Ordóñez, M.; Ferrari, A.; Fraval, K.; Furman, V.; Gómez-Hornillos, M. B.; Ganesan, S.; García, A. R.; Giubrone, G.; Gonçalves, I. F.; González, E.; Goverdovski, A.; Gramegna, F.; Griesmayer, E.; Guerrero, C.; Gunsing, F.; Gurusamy, P.; Heftrich, T.; Heinitz, S.; Hernández-Prieto, A.; Heyse, J.; Jenkins, D. G.; Jericha, E.; Käppeler, F.; Kadi, Y.; Karadimos, D.; Katabuchi, T.; Ketlerov, V.; Khryachkov, V.; Koehler, P.; Kokkoris, M.; Kroll, J.; Krtička, M.; Lampoudis, C.; Langer, C.; Leal-Cidoncha, E.; Lederer, C.; Leeb, H.; Leong, L. S.; Lerendegui-Marco, J.; Licata, M.; Losito, R.; Manousos, A.; Marganiec, J.; Martínez, T.; Massimi, C.; Mastinu, P.; Mastromarco, M.; Mengoni, A.; Milazzo, P. M.; Mingrone, F.; Mirea, M.; Mondelaers, W.; Paradela, C.; Pavlik, A.; Perkowski, J.; Plompen, A. J. M.; Praena, J.; Quesada, J. M.; Rauscher, T.; Reifarth, R.; Riego-Perez, A.; Robles, M.; Roman, F.; Rubbia, C.; Ryan, J. A.; Sabaté-Gilarte, M.; Sarmento, R.; Saxena, A.; Schillebeeckx, P.; Schmidt, S.; Schumann, D.; Sedyshev, P.; Tagliente, G.; Tain, J. L.; Tarifeño-Saldivia, A.; Tarrío, D.; Tassan-Got, L.; Tsinganis, A.; Valenta, S.; Vannini, G.; Variale, V.; Vaz, P.; Ventura, A.; Vermeulen, M. J.; Versaci, R.; Vlachoudis, V.; Vlastou, R.; Wallner, A.; Ware, T.; Weigand, M.; Weiss, C.; Wright, T.; Žugec, P.

2017-09-01

New neutron cross section measurements of minor actinides have been performed recently in order to reduce the uncertainties in the evaluated data, which is important for the design of advanced nuclear reactors and, in particular, for determining their performance in the transmutation of nuclear waste. We have measured the 241Am(n,γ) cross section at the n_TOF facility between 0.2 eV and 10 keV with a BaF2 Total Absorption Calorimeter, and the analysis of the measurement has been recently concluded. Our results are in reasonable agreement below 20 eV with the ones published by C. Lampoudis et al. in 2013, who reported a 22% larger capture cross section up to 110 eV compared to experimental and evaluated data published before. Our results also indicate that the 241Am(n,γ) cross section is underestimated in the present evaluated libraries between 20 eV and 2 keV by 25%, on average, and up to 35% for certain evaluations and energy ranges.
TECHNOLOGY EVALUATION REPORT, HYDROTECHNICS IN SITU FLOW SENSOR

EPA Science Inventory

The U.S. Environmental Protection Agency (EPA) Superfund Innovative Technology Evaluation (SITE) Program evaluated performance of HydroTechnics, Inc. flow sensors in measuring the three-dimensional flow pattern created by operation of the Wasatch Environmental, Inc. (WEI) ground...
TREATMENT PLANT EVALUATION FOR PARTICULATE CONTAMINANT REMOVAL

EPA Science Inventory

A general procedure is suggested for evaluating performance of water filtration plants. Plant operating records should be reviewed. Plant hydraulics should be evaluated. Chemical feed pumps, measuring, and additional points, plus control of chemical doses, are discussed. Rapid mi...
Solar power plant performance evaluation: simulation and experimental validation

NASA Astrophysics Data System (ADS)

Natsheh, E. M.; Albarbar, A.

2012-05-01

In this work the performance of solar power plant is evaluated based on a developed model comprise photovoltaic array, battery storage, controller and converters. The model is implemented using MATLAB/SIMULINK software package. Perturb and observe (P&O) algorithm is used for maximizing the generated power based on maximum power point tracker (MPPT) implementation. The outcome of the developed model are validated and supported by a case study carried out using operational 28.8kW grid-connected solar power plant located in central Manchester. Measurements were taken over 21 month's period; using hourly average irradiance and cell temperature. It was found that system degradation could be clearly monitored by determining the residual (the difference) between the output power predicted by the model and the actual measured power parameters. It was found that the residual exceeded the healthy threshold, 1.7kW, due to heavy snow in Manchester last winter. More important, the developed performance evaluation technique could be adopted to detect any other reasons that may degrade the performance of the P V panels such as shading and dirt. Repeatability and reliability of the developed system performance were validated during this period. Good agreement was achieved between the theoretical simulation and the real time measurement taken the online grid connected solar power plant.
Lessons from Five States: Public Sector Use of Washington Circle Performance Measures

PubMed Central

Garnick, Deborah W.; Lee, Margaret T.; Horgan, Constance; Acevedo, Andrea; Botticelli, Michael; Clark, Spencer; Davis, Steven; Gallati, Robert; Haberlin, Karin; Hanchett, Andrew; Lambert–Wacey, Dawn; Leeper, Tracy; Siemianowski, James; Tikoo, Minakshi

2011-01-01

Five states (Connecticut, Massachusetts, New York, North Carolina, and Oklahoma) have incorporated Washington Circle (WC) substance abuse performance measures in various ways into their quality improvement strategies. In this paper we focus on what other states and local providers might learn from these states’ experiences as they consider using WC performance measures. Using a case study approach, we report that the use of WC measures differs across these five states, although there are important common themes required for adoption and sustainability of performance measures which include: leadership, evaluation of specification and use of measures over time, state-specific adaptation of the WC measure specifications, collaboration with consultants and partners, inclusion of WC measures in the context of other initiatives, reporting to providers and the public, and data and resource requirements. As additional states adopt some of the WC measures, or adopt other performance measurement approaches, these states’ experiences could help them to develop implementations based on their particular needs. PMID:21257282
Analysis of Photovoltaic System Energy Performance Evaluation Method

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kurtz, S.; Newmiller, J.; Kimber, A.

2013-11-01

Documentation of the energy yield of a large photovoltaic (PV) system over a substantial period can be useful to measure a performance guarantee, as an assessment of the health of the system, for verification of a performance model to then be applied to a new system, or for a variety of other purposes. Although the measurement of this performance metric might appear to be straight forward, there are a number of subtleties associated with variations in weather and imperfect data collection that complicate the determination and data analysis. A performance assessment is most valuable when it is completed with amore » very low uncertainty and when the subtleties are systematically addressed, yet currently no standard exists to guide this process. This report summarizes a draft methodology for an Energy Performance Evaluation Method, the philosophy behind the draft method, and the lessons that were learned by implementing the method.« less
MODELING AND PERFORMANCE EVALUATION FOR AVIATION SECURITY CARGO INSPECTION QUEUING SYSTEM

DOE Office of Scientific and Technical Information (OSTI.GOV)

Allgood, Glenn O; Olama, Mohammed M; Rose, Terri A

Beginning in 2010, the U.S. will require that all cargo loaded in passenger aircraft be inspected. This will require more efficient processing of cargo and will have a significant impact on the inspection protocols and business practices of government agencies and the airlines. In this paper, we conduct performance evaluation study for an aviation security cargo inspection queuing system for material flow and accountability. The overall performance of the aviation security cargo inspection system is computed, analyzed, and optimized for the different system dynamics. Various performance measures are considered such as system capacity, residual capacity, and throughput. These metrics aremore » performance indicators of the system s ability to service current needs and response capacity to additional requests. The increased physical understanding resulting from execution of the queuing model utilizing these vetted performance measures will reduce the overall cost and shipping delays associated with the new inspection requirements.« less
Key results of battery performance and life tests at Argonne National Laboratory

NASA Astrophysics Data System (ADS)

Deluca, W. H.; Gillie, K. R.; Kulaga, J. E.; Smaga, J. A.; Tummillo, A. F.; Webster, C. E.

1991-12-01

Advanced battery technology evaluations are performed under simulated electric vehicle operating conditions at Argonne National Laboratory's & Diagnostic Laboratory (ADL). The ADL provide a common basis for both performance characterization and life evaluation with unbiased application of tests and analyses. This paper summarizes the performance characterizations and life evaluations conducted in 1991 on twelve single cells and eight 3- to 360-cell modules that encompass six battery technologies (Na/S, Li/MS, Ni/MH, Zn/Br, Ni/Fe, and Pb-Acid). These evaluations were performed for the Department of Energy, Office of Transportation Technologies, Electric and Hybrid Propulsion Division. The results measure progress in battery R & D programs, compare battery technologies, and provide basic data for modeling and continuing R & D to battery users, developers, and program managers.
Evaluating the performance of a fault detection and diagnostic system for vapor compression equipment

DOE Office of Scientific and Technical Information (OSTI.GOV)

Breuker, M.S.; Braun, J.E.

This paper presents a detailed evaluation of the performance of a statistical, rule-based fault detection and diagnostic (FDD) technique presented by Rossi and Braun (1997). Steady-state and transient tests were performed on a simple rooftop air conditioner over a range of conditions and fault levels. The steady-state data without faults were used to train models that predict outputs for normal operation. The transient data with faults were used to evaluate FDD performance. The effect of a number of design variables on FDD sensitivity for different faults was evaluated and two prototype systems were specified for more complete evaluation. Good performancemore » was achieved in detecting and diagnosing five faults using only six temperatures (2 input and 4 output) and linear models. The performance improved by about a factor of two when ten measurements (three input and seven output) and higher order models were used. This approach for evaluating and optimizing the performance of the statistical, rule-based FDD technique could be used as a design and evaluation tool when applying this FDD method to other packaged air-conditioning systems. Furthermore, the approach could also be modified to evaluate the performance of other FDD methods.« less
Psychophysical evaluation of the image quality of a dynamic flat-panel digital x-ray image detector using the threshold contrast detail detectability (TCDD) technique

NASA Astrophysics Data System (ADS)

Davies, Andrew G.; Cowen, Arnold R.; Bruijns, Tom J. C.

1999-05-01

We are currently in an era of active development of the digital X-ray imaging detectors that will serve the radiological communities in the new millennium. The rigorous comparative physical evaluations of such devices are therefore becoming increasingly important from both the technical and clinical perspectives. The authors have been actively involved in the evaluation of a clinical demonstration version of a flat-panel dynamic digital X-ray image detector (or FDXD). Results of objective physical evaluation of this device have been presented elsewhere at this conference. The imaging performance of FDXD under radiographic exposure conditions have been previously reported, and in this paper a psychophysical evaluation of the FDXD detector operating under continuous fluoroscopic conditions is presented. The evaluation technique employed was the threshold contrast detail detectability (TCDD) technique, which enables image quality to be measured on devices operating in the clinical environment. This approach addresses image quality in the context of both the image acquisition and display processes, and uses human observers to measure performance. The Leeds test objects TO[10] and TO[10+] were used to obtain comparative measurements of performance on the FDXD and two digital spot fluorography (DSF) systems, one utilizing a Plumbicon camera and the other a state of the art CCD camera. Measurements were taken at a range of detector entrance exposure rates, namely 6, 12, 25 and 50 (mu) R/s. In order to facilitate comparisons between the systems, all fluoroscopic image processing such as noise reduction algorithms, were disabled during the experiments. At the highest dose rate FDXD significantly outperformed the DSF comparison systems in the TCDD comparisons. At 25 and 12 (mu) R/s all three-systems performed in an equivalent manner and at the lowest exposure rate FDXD was inferior to the two DSF systems. At standard fluoroscopic exposures, FDXD performed in an equivalent manner to the DSF systems for the TCDD comparisons. This would suggest that FDXD would therefore perform adequately in a clinical fluoroscopic environment and our initial clinical experiences support this. Noise reduction processing of the fluoroscopic data acquired on FDXD was also found to further improve TCDD performance for FDXD. FDXD therefore combines acceptable fluoroscopic performance with excellent radiographic (snap shot) imaging fidelity, allowing the possibility of a universal x-ray detector to be developed, based on FDXD's technology. It is also envisaged that fluoroscopic performance will be improved by the development of digital image enhancement techniques specifically tailored to the characteristics of the FDXD detector.
Osseointegration of dental implants in Macaca fascicularis

NASA Astrophysics Data System (ADS)

Dewi, R. S.; Odang, R. W.; Odelia, L.

2017-08-01

Osseointegration is an important factor in determining the success of a dental implant. It can be assessed from the osseointegration that occurs between the implant and the bone. The implant stability is determined by the osseous support at the implant-bone interface, which is commonly evaluated by histomorphometric analysis. This study aimed to evaluate whether the osseointegration level measured by a Low Resonance Frequency Analyzer (LRFA) gave results as good as those obtained by histomorphometric examination. Six male Macaca fascicularis were used in this study. In each animal, two types of loading were performed: immediate and delayed loading. Clinical examination and LRFA measurement were performed to determine osseointegration at the first and second weeks and at the first, second, third, and fourth months. After four months, histomorphometric examination was performed. The relationship between the histomorphometric examination and LRFA measurement was compared using the Pearson correlation coefficient. There was no significant difference in the osseointegration between immediate loading and delayed loading (p > 0.05) The bone-implant contact percentage in the first group did not differ significantly from that in the second group. Statistical analysis showed that there was a strong correlation between LRFA measurement and histomorphometric examination. Osseointegration could be evaluated through LRFA measurement as well as through histomorphometric examination.
Guiding principles and checklist for population-based quality metrics.

PubMed

Krishnan, Mahesh; Brunelli, Steven M; Maddux, Franklin W; Parker, Thomas F; Johnson, Douglas; Nissenson, Allen R; Collins, Allan; Lacson, Eduardo

2014-06-06

The Centers for Medicare and Medicaid Services oversees the ESRD Quality Incentive Program to ensure that the highest quality of health care is provided by outpatient dialysis facilities that treat patients with ESRD. To that end, Centers for Medicare and Medicaid Services uses clinical performance measures to evaluate quality of care under a pay-for-performance or value-based purchasing model. Now more than ever, the ESRD therapeutic area serves as the vanguard of health care delivery. By translating medical evidence into clinical performance measures, the ESRD Prospective Payment System became the first disease-specific sector using the pay-for-performance model. A major challenge for the creation and implementation of clinical performance measures is the adjustments that are necessary to transition from taking care of individual patients to managing the care of patient populations. The National Quality Forum and others have developed effective and appropriate population-based clinical performance measures quality metrics that can be aggregated at the physician, hospital, dialysis facility, nursing home, or surgery center level. Clinical performance measures considered for endorsement by the National Quality Forum are evaluated using five key criteria: evidence, performance gap, and priority (impact); reliability; validity; feasibility; and usability and use. We have developed a checklist of special considerations for clinical performance measure development according to these National Quality Forum criteria. Although the checklist is focused on ESRD, it could also have broad application to chronic disease states, where health care delivery organizations seek to enhance quality, safety, and efficiency of their services. Clinical performance measures are likely to become the norm for tracking performance for health care insurers. Thus, it is critical that the methodologies used to develop such metrics serve the payer and the provider and most importantly, reflect what represents the best care to improve patient outcomes. Copyright © 2014 by the American Society of Nephrology.
Thrust stand evaluation of engine performance improvement algorithms in an F-15 airplane

NASA Technical Reports Server (NTRS)

Conners, Timothy R.

1992-01-01

An investigation is underway to determine the benefits of a new propulsion system optimization algorithm in an F-15 airplane. The performance seeking control (PSC) algorithm optimizes the quasi-steady-state performance of an F100 derivative turbofan engine for several modes of operation. The PSC algorithm uses an onboard software engine model that calculates thrust, stall margin, and other unmeasured variables for use in the optimization. As part of the PSC test program, the F-15 aircraft was operated on a horizontal thrust stand. Thrust was measured with highly accurate load cells. The measured thrust was compared to onboard model estimates and to results from posttest performance programs. Thrust changes using the various PSC modes were recorded. Those results were compared to benefits using the less complex highly integrated digital electronic control (HIDEC) algorithm. The PSC maximum thrust mode increased intermediate power thrust by 10 percent. The PSC engine model did very well at estimating measured thrust and closely followed the transients during optimization. Quantitative results from the evaluation of the algorithms and performance calculation models are included with emphasis on measured thrust results. The report presents a description of the PSC system and a discussion of factors affecting the accuracy of the thrust stand load measurements.
Multitask protocols to evaluate activities of daily living performance in people with COPD: a systematic review.

PubMed

Paes, Thaís; Machado, Felipe Vilaça Cavallari; Cavalheri, Vinícius; Pitta, Fabio; Hernandes, Nidia Aparecida

2017-07-01

People with chronic obstructive pulmonary disease (COPD) present symptoms such as dyspnea and fatigue, which hinder their performance in activities of daily living (ADL). A few multitask protocols have been developed to assess ADL performance in this population, although measurement properties of such protocols were not yet systematically reviewed. Areas covered: Studies were included if an assessment of the ability to perform ADL was conducted in people with COPD using a (objective) performance-based protocol. The search was conducted in the following databases: Pubmed, EMBASE, Cochrane Library, PEDro, CINAHL and LILACS. Furthermore, hand searches were conducted. Expert commentary: Up to this moment, only three protocols had measurement properties described: the Glittre ADL Test, the Monitored Functional Task Evaluation and the Londrina ADL Protocol were shown to be valid and reliable whereas only the Glittre ADL Test was shown to be responsive to change after pulmonary rehabilitation. These protocols can be used in laboratory settings and clinical practice to evaluate ADL performance in people with COPD, although there is need for more in-depth information on their validity, reliability and especially responsiveness due to the growing interest in the accurate assessment of ADL performance in this population.

The Evaluation of Teachers.

ERIC Educational Resources Information Center

National Education Association, Washington, DC. Div. of Instruction and Professional Development.

The several components of this package on the evaluation of teachers and educational programs are designed to help affiliates deal constructively with the subject. The issue of evaluation continues to intensify as state legislatures increasingly mandate that evaluation systems be imposed throughout the state to measure the performance of teachers…
An evaluation toolkit for Florida's Commuter Assistance Programs (CAP) : a companion to the 1999 CAP evaluation manual

DOT National Transportation Integrated Search

2001-01-01

This manual is a companion piece to the Commuter Assistance Program Evaluation Manual that was developed to assist Florida's Commuter Assistance Programs (CAP) in their efforts to measure and evaluate their performance. This manual is intended to pro...
Evaluation of a method for heat transfer measurements and thermal visualization using a composite of a heater element and liquid crystals. [thermal performance of turbine blade cooling configurations

NASA Technical Reports Server (NTRS)

Hippensteele, S. A.; Russell, L. M.; Stepka, F. S.

1981-01-01

Commercially available elements of a composite consisting of a plastic sheet coated with liquid crystal, another sheet with a thin layer of a conducting material (gold or carbon), and copper bus bar strips were evaluated and found to provide a simple, convenient, accurate, and low-cost measuring device for use in heat transfer research. The particular feature of the composite is its ability to obtain local heat transfer coefficients and isotherm patterns that provide visual evaluation of the thermal performances of turbine blade cooling configurations. Examples of the use of the composite are presented.
Measures to Evaluate the Effects of DBS on Speech Production

PubMed Central

Weismer, Gary; Yunusova, Yana; Bunton, Kate

2011-01-01

The purpose of this paper is to review and evaluate measures of speech production that could be used to document effects of Deep Brain Stimulation (DBS) on speech performance, especially in persons with Parkinson disease (PD). A small set of evaluative criteria for these measures is presented first, followed by consideration of several speech physiology and speech acoustic measures that have been studied frequently and reported on in the literature on normal speech production, and speech production affected by neuromotor disorders (dysarthria). Each measure is reviewed and evaluated against the evaluative criteria. Embedded within this review and evaluation is a presentation of new data relating speech motions to speech intelligibility measures in speakers with PD, amyotrophic lateral sclerosis (ALS), and control speakers (CS). These data are used to support the conclusion that at the present time the slope of second formant transitions (F2 slope), an acoustic measure, is well suited to make inferences to speech motion and to predict speech intelligibility. The use of other measures should not be ruled out, however, and we encourage further development of evaluative criteria for speech measures designed to probe the effects of DBS or any treatment with potential effects on speech production and communication skills. PMID:24932066
Study of magnetic perturbations on SEC vidicon tubes. [large space telescope

NASA Technical Reports Server (NTRS)

Long, D. C.; Zucchino, P.; Lowrance, J.

1973-01-01

A laboratory measurements program was conducted to determine the tolerances that must be imposed to achieve optimum performance from SEC-vidicon data sensors in the LST mission. These measurements along with other data were used to formulate recommendations regarding the necessary telemetry and remote control for the television data sensors when in orbit. The study encompassed the following tasks: (1) Conducted laboratory measurements of the perturbations which an external magnetic field produces on a magnetically focused, SEC-vidicon. Evaluated shielding approaches. (2) Experimentally evaluated the effects produced on overall performance by variations of the tube electrode potentials, and the focus, deflection and alignment fields. (3) Recommended the extent of ground control of camera parameters and camera parameter telemetry required for optimizing the performance of the television system in orbit. The experimental data are summarized in a set of graphs.
Instrument performance of a radon measuring system with the alpha-track detection technique.

PubMed

Tokonami, S; Zhuo, W; Ryuo, H; Yonehara, H; Yamada, Y; Shimo, M

2003-01-01

An instrument performance test has been carried out for a radon measuring system made in Hungary. The system measures radon using the alpha-track detection technique. It consists of three parts: the passive detector, the etching unit and the evaluation unit. A CR-39 detector is used as the radiation detector. Alpha-track reading and data analysis are carried out after chemical etching. The following subjects were examined in the present study: (1) radon sensitivity, (2) performance of etching and evaluation processes and (3) thoron sensitivity. The radon sensitivity of 6.9 x 10(-4) mm(-2) (Bq m(-3) d)(-1) was acceptable for practical application. The thoron sensitivity was estimated to be as low as 3.3 x 10(-5) mm(-2) (Bq m(-3) d)(-1) from the experimental study.
Evaluation of frailty in older adults with cardiovascular disease: incorporating physical performance measures.

PubMed

Gary, Rebecca

2012-01-01

Rapid growth in the numbers of older adults with cardiovascular disease (CVD) is raising awareness and concern of the impact that common geriatric syndromes such as frailty may have on clinical outcomes, health-related quality of life, and rising economic burden associated with healthcare. Increasingly, frailty is recognized to be a highly prevalent and important risk factor that is associated with adverse cardiovascular outcomes. A limitation of previous studies in patients with CVD has been the lack of a consistent definition and measures to evaluate frailty. In this review, building upon the work of Fried and colleagues, a definition of frailty is provided that is applicable for evaluating frailty in older adults with CVD. Simple, well-established performance-based measures widely used in comprehensive geriatric assessment are recommended that can be readily implemented by nurses in most practice settings. The limited studies conducted in older adults with CVD have shown physical performance measures to be highly predictive of clinical outcomes. Implications for practice and areas for future research are described for the growing numbers of elderly cardiac patients who are frail frailty and at risk for disability.
Gas-cell measurements for evaluating longwave-infrared passive-sensor performance

NASA Astrophysics Data System (ADS)

Cummings, Alan S.; Combs, Roger J.; Thomas, Mark J.; Curry, Timothy; Kroutil, Robert T.

2006-10-01

A longwave-infrared (LWIR) passive-spectrometer performance was evaluated with a short-pathlength gas cell. This cell was accurately positioned between the sensor and a NIST-traceable blackbody radiance source. Cell contents were varied over the Beer's Law absorbance range from the limit of detection to saturation for the gas analytes of sulfur hexafluoride and hexafluoroethane. The spectral impact of saturation on infrared absorbance was demonstrated for the passive sensor configuration. The gas-cell contents for all concentration-pathlength products was monitored with an active traditional-laboratory Fourier Transform Infrared (FTIR) spectrometer and was verified by comparison with the established PNNL/DOE vapor-phase infrared (IR) spectral database. For the passive FTIR measurements, the blackbody source employed a range of background temperatures from 5 °C to 50 °C. The passive measurements without the presence of a gas cell permitted a determination of the noise equivalent spectral noise (NESR) for each set of passive gas-cell measurements. In addition, the no-cell condition allowed the evaluation of the effect of gas cell window materials of low density poly(ethylene), potassium chloride, potassium bromide, and zinc selenide. The components of gas cell, different window materials, temperature differentials, and absorbances of target-analyte gases supplied the means of evaluating the LWIR performance of a passive FTIR spectrometer. The various LWIR-passive measurements were found to simulate those often encountered in open-air scenarios important to both industrial and environmental monitoring applications.
Evaluation of Low-Cost Mitigation Measures Implemented to Improve Air Quality in Nursery and Primary Schools.

PubMed

Sá, Juliana P; Branco, Pedro T B S; Alvim-Ferraz, Maria C M; Martins, Fernando G; Sousa, Sofia I V

2017-05-31

Indoor air pollution mitigation measures are highly important due to the associated health impacts, especially on children, a risk group that spends significant time indoors. Thus, the main goal of the work here reported was the evaluation of mitigation measures implemented in nursery and primary schools to improve air quality. Continuous measurements of CO₂, CO, NO₂, O₃, CH₂O, total volatile organic compounds (VOC), PM₁, PM 2.5 , PM 10 , Total Suspended Particles (TSP) and radon, as well as temperature and relative humidity were performed in two campaigns, before and after the implementation of low-cost mitigation measures. Evaluation of those mitigation measures was performed through the comparison of the concentrations measured in both campaigns. Exceedances to the values set by the national legislation and World Health Organization (WHO) were found for PM 2.5 , PM 10 , CO₂ and CH₂O during both indoor air quality campaigns. Temperature and relative humidity values were also above the ranges recommended by American Society of Heating, Refrigerating, and Air-Conditioning Engineers (ASHRAE). In general, pollutant concentrations measured after the implementation of low-cost mitigation measures were significantly lower, mainly for CO₂. However, mitigation measures were not always sufficient to decrease the pollutants' concentrations till values considered safe to protect human health.
[Balanced scorecard for performance measurement of a nursing organization in a Korean hospital].

PubMed

Hong, Yoonmi; Hwang, Kyung Ja; Kim, Mi Ja; Park, Chang Gi

2008-02-01

The purpose of this study was to develop a balanced scorecard (BSC) for performance measurement of a Korean hospital nursing organization and to evaluate the validity and reliability of performance measurement indicators. Two hundred fifty-nine nurses in a Korean hospital participated in a survey questionnaire that included 29-item performance evaluation indicators developed by investigators of this study based on the Kaplan and Norton's BSC (1992). Cronbach's alpha was used to test the reliability of the BSC. Exploratory and confirmatory factor analysis with a structure equation model (SEM) was applied to assess the construct validity of the BSC. Cronbach's alpha of 29 items was .948. Factor analysis of the BSC showed 5 principal components (eigen value >1.0) which explained 62.7% of the total variance, and it included a new one, community service. The SEM analysis results showed that 5 components were significant for the hospital BSC tool. High degree of reliability and validity of this BSC suggests that it may be used for performance measurements of a Korean hospital nursing organization. Future studies may consider including a balanced number of nurse managers and staff nurses in the study. Further data analysis on the relationships among factors is recommended.
Continuous performance measurement in flight systems. [sequential control model

NASA Technical Reports Server (NTRS)

Connelly, E. M.; Sloan, N. A.; Zeskind, R. M.

1975-01-01

The desired response of many man machine control systems can be formulated as a solution to an optimal control synthesis problem where the cost index is given and the resulting optimal trajectories correspond to the desired trajectories of the man machine system. Optimal control synthesis provides the reference criteria and the significance of error information required for performance measurement. The synthesis procedure described provides a continuous performance measure (CPM) which is independent of the mechanism generating the control action. Therefore, the technique provides a meaningful method for online evaluation of man's control capability in terms of total man machine performance.
Advanced Actuation Systems Development. Volume 2

DTIC Science & Technology

1989-08-01

and unloaded performance characteristics of a test specimen produced by General Dynamics Corporation as a feasibility model. The actuation system for...changing the camber of the test specimen is unique and was evaluated with a series of input/output measurements. The testing verified the general ...MAWS General ’rest Procedure........................................6 General Performance Measurements .................................... 10 Test
Factor Analysis of Aviation Training Measures and Post-Training Performance Evaluations.

ERIC Educational Resources Information Center

Booth, Richard F.; Berkshire, James R.

The purpose of this study was to relate the factor structure of naval air training measures to the performance of Marine pilots in operational squadrons. Five post-training criteria were developed; four were Commanding Officer (C.O.) nominations of junior officers for hypothetical special assignments, and the fifth was a general…
FBI fingerprint identification automation study. AIDS 3 evaluation report. Volume 4: Economic feasibility

NASA Technical Reports Server (NTRS)

Mulhall, B. D. L.

1980-01-01

The results of the economic analysis of the AIDS 3 system design are presented. AIDS 3 evaluated a set of economic feasibility measures including life cycle cost, implementation cost, annual operating expenditures and annual capital expenditures. The economic feasibility of AIDS 3 was determined by comparing the evaluated measures with the same measures, where applicable, evaluated for the current system. A set of future work load scenarios was constructed using JPL's environmental evaluation study of the fingerprint identification system. AIDS 3 and the current system were evaluated for each of the economic feasibility measures for each of the work load scenarios. They were compared for a set of performance measures, including response time and accuracy, and for a set of cost/benefit ratios, including cost per transaction and cost per technical search. Benefit measures related to the economic feasibility of the system are also presented, including the required number of employees and the required employee skill mix.
Development of performance measures based on visibility for effective placement of aids to navigation

NASA Astrophysics Data System (ADS)

Fang, Tae Hyun; Kim, Yeon-Gyu; Gong, In-Young; Park, Sekil; Kim, Ah-Young

2015-09-01

In order to develop the challenging process of placing Aids to Navigation (AtoN), we propose performance measures which quantifies the effect of such placement. The best placement of AtoNs is that from which the navigator can best recognize the information provided by an AtoN. The visibility of AtoNs depends mostly on light sources, the weather condition and the position of the navigator. Visual recognition is enabled by achieving adequate contrast between the AtoN light source and background light. Therefore, the performance measures can be formulated through the amount of differences between these two lights. For simplification, this approach is based on the values of the human factor suggested by International Association of Marine Aids to Navigation and Lighthouse Authorities (IALA). Performance measures for AtoN placement can be evaluated through AtoN Simulator, which has been being developed by KIOST/KRISO in Korea and has been launched by Korea National Research Program. Simulations for evaluation are carried out at waterway in Busan port in Korea.
Chief Complaint-Based Performance Measures: A New Focus For Acute Care Quality Measurement

PubMed Central

Griffey, Richard T; Pines, Jesse M.; Farley, Heather L.; Phelan, Michael P; Beach, Christopher; Schuur, Jeremiah D; Venkatesh, Arjun K.

2014-01-01

Performance measures are increasingly important to guide meaningful quality improvement efforts and value-based reimbursement. Populations included in most current hospital performance measures are defined by recorded diagnoses using International Disease Classification (ICD)-9 codes in administrative claims data. While the diagnosis-centric approach allows the assessment of disease-specific quality, it fails to measure one of the primary functions of emergency department (ED) care which involves diagnosing, risk-stratifying, and treating patients’ potentially life-threatening conditions based on symptoms (i.e. chief complaints). In this paper we propose chief complaint-based quality measures as a means to enhance the evaluation of quality and value in emergency care. We discuss the potential benefits of chief-complaint based measures, describe opportunities to mitigate challenges, propose an example measure set, and present several recommendations to advance this paradigm in ED-based performance measurement. PMID:25443989
Effects of a Velocity-Vector Based Command Augmentation System and Synthetic Vision System Terrain Portrayal and Guidance Symbology Concepts on Single-Pilot Performance

NASA Technical Reports Server (NTRS)

Liu, Dahai; Goodrich, Kenneth H.; Peak, Bob

2010-01-01

This study investigated the effects of synthetic vision system (SVS) concepts and advanced flight controls on the performance of pilots flying a light, single-engine general aviation airplane. We evaluated the effects and interactions of two levels of terrain portrayal, guidance symbology, and flight control response type on pilot performance during the conduct of a relatively complex instrument approach procedure. The terrain and guidance presentations were evaluated as elements of an integrated primary flight display system. The approach procedure used in the study included a steeply descending, curved segment as might be encountered in emerging, required navigation performance (RNP) based procedures. Pilot performance measures consisted of flight technical performance, perceived workload, perceived situational awareness and subjective preference. The results revealed that an elevation based generic terrain portrayal significantly improved perceived situation awareness without adversely affecting flight technical performance or workload. Other factors (pilot instrument rating, control response type, and guidance symbology) were not found to significantly affect the performance measures.
Simplified procedures for correlation of experimentally measured and predicted thrust chamber performance

NASA Technical Reports Server (NTRS)

Powell, W. B.

1973-01-01

Thrust chamber performance is evaluated in terms of an analytical model incorporating all the loss processes that occur in a real rocket motor. The important loss processes in the real thrust chamber were identified, and a methodology and recommended procedure for predicting real thrust chamber vacuum specific impulse were developed. Simplified equations for the calculation of vacuum specific impulse are developed to relate the delivered performance (both vacuum specific impulse and characteristic velocity) to the ideal performance as degraded by the losses corresponding to a specified list of loss processes. These simplified equations enable the various performance loss components, and the corresponding efficiencies, to be quantified separately (except that interaction effects are arbitrarily assigned in the process). The loss and efficiency expressions presented can be used to evaluate experimentally measured thrust chamber performance, to direct development effort into the areas most likely to yield improvements in performance, and as a basis to predict performance of related thrust chamber configurations.
Solar energy system performance evaluation. Seasonal report for Colt Pueblo, Pueblo, Colorado

NASA Technical Reports Server (NTRS)

1980-01-01

The Colt-Pueblo solar energy system, designed to provide space heating and hot water preheating, is described and its operational performance for a 12 month period from February 1979 through January 1980 is evaluated. The space heating subsystem met 31 percent of the measured space heating load which was close to the expected 34 percent solar fraction. Although the hot water solar fraction was 79 percent, the overall energy saving capability was reduced because of the low hot water demand. The measured heating subsystem performance would have improved considerably if the uncontrolled losses primarily from transport piping could have been reduced to an inconsequential level. Fossil energy savings of 70.31 million BTUs are estimated.
Evaluation of measurement data from a sensor system for breath control

NASA Astrophysics Data System (ADS)

Seifert, Rolf; Keller, Hubert B.; Conrad, Thorsten; Peter, Jens

2017-03-01

Binary ethanol-H2 gas samples were measured by an innovative mobile sensor system for the alcohol control in the respiratory air. The measurements were performed by a gas sensor operated by cyclic variation of the working temperature at the sensor head. The evaluation of the data, using an updated version of the evaluation procedure ProSens, results in a very good substance identification and concentration determination of the components of the gas mixture. The relative analysis errors were in all cases less than 9%.

A Post-Marketing Surveillance Study to Evaluate Performance of the EXIMO™ Blood Glucose Monitoring System.

PubMed

Chandnani, Sonia R; Ramakrishna, C D; Dave, Bhargav A; Kothavade, Pankaj S; Thakkar, Ashok S

2017-05-01

The performance of Blood Glucose Monitoring System (BGMS) is critical as the information provided by the system guide the patient or health care professional in making treatment decisions. However, besides evaluating accuracy of the BGMS in laboratory setting, it is equally important that the intended users (healthcare professionals and patients) should be able to achieve blood glucose measurements with similar level of high accuracy. To assess the performance of EXIMO™ (Meril Diagnostics Pvt. Ltd., Vapi, Gujarat, India) BGMS as per International Organization for Standardization (ISO) 15197:2013 section 8 user performance criteria. This was a non-randomized and post-marketing study conducted at a tertiary care centre of India. A total of 1005 patients with diabetes themselves performed fingertip blood glucose measurement using EXIMO™ BGMS. Immediately after capillary blood glucose measurement using the blood glucose monitoring system, venous blood sample from each patient was obtained by a trained technician which was assessed by reference laboratory method- Cobas Integra 400 plus (Roche Instrument Centre, Rotkreuz, Switzerland). All the blood glucose measurements assessed by EXIMO™ were compared with laboratory results. Performance of the system was assessed as per ISO 15197:2013 criteria using Bland-Altman plot, Parkes-Consensus Error Grid (CEG) and Surveillance Error Grid analyses (SEG). A total of 1005 patients participated in the study. Average age of the patients was 44.93±14.65 years. Evaluation of capillary fingertip blood glucose measurements demonstrated that 95.82% measurements fulfilled ISO 15197:2013 section 8 user performance criteria. All the results lie within clinically non-critical zones; Zone A (99.47%; n=1000) and Zone B (0.53%; n=05) of the CEG analysis. As per SEG analysis, majority of the results fell within "no-risk" zone (risk score 0 to 0.5; 90.42%). The result of the study confirmed that intended users are able to obtain accurate glucose measurements when operating EXIMO™ BGMS, given only the instructions and training materials routinely provided with the system, in clinical practice.
A Post-Marketing Surveillance Study to Evaluate Performance of the EXIMO™ Blood Glucose Monitoring System

PubMed Central

Chandnani, Sonia R.; Ramakrishna, C. D.; Dave, Bhargav A.; Kothavade, Pankaj S.

2017-01-01

Introduction The performance of Blood Glucose Monitoring System (BGMS) is critical as the information provided by the system guide the patient or health care professional in making treatment decisions. However, besides evaluating accuracy of the BGMS in laboratory setting, it is equally important that the intended users (healthcare professionals and patients) should be able to achieve blood glucose measurements with similar level of high accuracy. Aim To assess the performance of EXIMO™ (Meril Diagnostics Pvt. Ltd., Vapi, Gujarat, India) BGMS as per International Organization for Standardization (ISO) 15197:2013 section 8 user performance criteria. Materials and Methods This was a non-randomized and post-marketing study conducted at a tertiary care centre of India. A total of 1005 patients with diabetes themselves performed fingertip blood glucose measurement using EXIMO™ BGMS. Immediately after capillary blood glucose measurement using the blood glucose monitoring system, venous blood sample from each patient was obtained by a trained technician which was assessed by reference laboratory method- Cobas Integra 400 plus (Roche Instrument Centre, Rotkreuz, Switzerland). All the blood glucose measurements assessed by EXIMO™ were compared with laboratory results. Performance of the system was assessed as per ISO 15197:2013 criteria using Bland-Altman plot, Parkes-Consensus Error Grid (CEG) and Surveillance Error Grid analyses (SEG). Results A total of 1005 patients participated in the study. Average age of the patients was 44.93±14.65 years. Evaluation of capillary fingertip blood glucose measurements demonstrated that 95.82% measurements fulfilled ISO 15197:2013 section 8 user performance criteria. All the results lie within clinically non-critical zones; Zone A (99.47%; n=1000) and Zone B (0.53%; n=05) of the CEG analysis. As per SEG analysis, majority of the results fell within “no-risk” zone (risk score 0 to 0.5; 90.42%). Conclusion The result of the study confirmed that intended users are able to obtain accurate glucose measurements when operating EXIMO™ BGMS, given only the instructions and training materials routinely provided with the system, in clinical practice. PMID:28658800
Developing and evaluating a target-background similarity metric for camouflage detection.

PubMed

Lin, Chiuhsiang Joe; Chang, Chi-Chan; Liu, Bor-Shong

2014-01-01

Measurement of camouflage performance is of fundamental importance for military stealth applications. The goal of camouflage assessment algorithms is to automatically assess the effect of camouflage in agreement with human detection responses. In a previous study, we found that the Universal Image Quality Index (UIQI) correlated well with the psychophysical measures, and it could be a potentially camouflage assessment tool. In this study, we want to quantify the camouflage similarity index and psychophysical results. We compare several image quality indexes for computational evaluation of camouflage effectiveness, and present the results of an extensive human visual experiment conducted to evaluate the performance of several camouflage assessment algorithms and analyze the strengths and weaknesses of these algorithms. The experimental data demonstrates the effectiveness of the approach, and the correlation coefficient result of the UIQI was higher than those of other methods. This approach was highly correlated with the human target-searching results. It also showed that this method is an objective and effective camouflage performance evaluation method because it considers the human visual system and image structure, which makes it consistent with the subjective evaluation results.
Characterization and performance of injection molded poly(methylmethacrylate) microchips for capillary electrophoresis

PubMed Central

Nikcevic, Irena; Lee, Se Hwan; Piruska, Aigars; Ahn, Chong H.; Ridgway, Thomas H.; Limbach, Patrick A.; Wehmeyer, K. R.; Heineman, William R.; Seliskar, Carl J.

2009-01-01

Injection molded poly(methylmethacrylate) (IM-PMMA), chips were evaluated as potential candidates for capillary electrophoresis disposable chip applications. Mass production and usage of plastic microchips depends on chip-to-chip reproducibility and on analysis accuracy. Several important properties of IM-PMMA chips were considered: fabrication quality evaluated by environmental scanning electron microscope imaging, surface quality measurements, selected thermal/electrical properties as indicated by measurement of the current versus applied voltage (I–V) characteristic, and the influence of channel surface treatments. Electroosmotic flow was also evaluated for untreated and O2 reactive ion etching (RIE) treated surface microchips. The performance characteristics of single lane plastic microchip capillary electrophoresis (MCE) separations were evaluated using a mixture of two dyes - fluorescein (FL) and fluorescein isothiocyanate (FITC). To overcome non-wettability of the native IM-PMMA surface, a modifier, polyethylene oxide was added to the buffer as a dynamic coating. Chip performance reproducibility was studied for chips with and without surface modification via the process of RIE with O2 and by varying the hole position for the reservoir in the cover plate or on the pattern side of the chip. Additionally, the importance of reconditioning steps to achieve optimal performance reproducibility was also examined. It was found that more reproducible quantitative results were obtained when normalized values of migration time, peak area and peak height of FL and FITC were used instead of actual measured parameters PMID:17477932
An official American thoracic society workshop report: developing performance measures from clinical practice guidelines.

PubMed

Kahn, Jeremy M; Gould, Michael K; Krishnan, Jerry A; Wilson, Kevin C; Au, David H; Cooke, Colin R; Douglas, Ivor S; Feemster, Laura C; Mularski, Richard A; Slatore, Christopher G; Wiener, Renda Soylemez

2014-05-01

Many health care performance measures are either not based on high-quality clinical evidence or not tightly linked to patient-centered outcomes, limiting their usefulness in quality improvement. In this report we summarize the proceedings of an American Thoracic Society workshop convened to address this problem by reviewing current approaches to performance measure development and creating a framework for developing high-quality performance measures by basing them directly on recommendations from well-constructed clinical practice guidelines. Workshop participants concluded that ideally performance measures addressing care processes should be linked to clinical practice guidelines that explicitly rate the quality of evidence and the strength of recommendations, such as the Grading of Recommendations Assessment, Development, and Evaluation (GRADE) process. Under this framework, process-based performance measures would only be developed from strong recommendations based on high- or moderate-quality evidence. This approach would help ensure that clinical processes specified in performance measures are both of clear benefit to patients and supported by strong evidence. Although this approach may result in fewer performance measures, it would substantially increase the likelihood that quality-improvement programs based on these measures actually improve patient care.
A performance evaluation of the IBM 370/XT personal computer

NASA Technical Reports Server (NTRS)

Dominick, Wayne D. (Editor); Triantafyllopoulos, Spiros

1984-01-01

An evaluation of the IBM 370/XT personal computer is given. This evaluation focuses primarily on the use of the 370/XT for scientific and technical applications and applications development. A measurement of the capabilities of the 370/XT was performed by means of test programs which are presented. Also included is a review of facilities provided by the operating system (VM/PC), along with comments on the IBM 370/XT hardware configuration.
HDL-cholesterol and physical performance: results from the ageing and longevity study in the sirente geographic area (ilSIRENTE Study).

PubMed

Landi, Francesco; Russo, Andrea; Cesari, Matteo; Pahor, Marco; Bernabei, Roberto; Onder, Graziano

2007-09-01

High-density lipoprotein (HDL) cholesterol has been hypothesised to be a reliable marker of frailty and poor prognosis among the oldest elderly. We evaluate the relationship of HDL-cholesterol with measures of physical performance, muscle strength, and functional status in older persons aged 80years or older. Data are from baseline evaluation of the ageing and longevity study in the Sirente geographic area (ilSIRENTE study) (n = 364). Physical performance was assessed using the physical performance battery score [short physical performance battery (SPPB)], which is based on three-timed tests: 4-m walking-speed, balance, and chair-stand tests. Muscle strength was measured by hand-grip strength. Analyses of covariance were performed to evaluate the relationship of different HDL-cholesterol levels with physical function. In the unadjusted analyses, physical function (as measured by the 4-m walking-speed, theSPPB score, the basic and instrumental activities of daily living scales scores), but not hand-grip strength, improved significantly as HDL-cholesterol tertiles increased. After adjustment for potential confounders, which included age, gender, living alone, alcohol abuse, physical activity, congestive heart failure, diabetes, cerebrovascular diseases, osteoarthritis, albumin, urea, C-reactive protein and LDL cholesterol, the association of HDL-cholesterol tertiles with the 4-m walking-speed and the SPPB score was still consistent. The present study suggests that among very old subjects living in the community the higher levels of HDL-cholesterol are associated with better functional performance.
40 CFR Appendix A to Part 58 - Quality Assurance Requirements for SLAMS, SPMs and PSD Air Monitoring

Code of Federal Regulations, 2014 CFR

2014-07-01

... monitor. 3.3.4.4Pb Performance Evaluation Program (PEP) Procedures. Each year, one performance evaluation... Information 2. Quality System Requirements 3. Measurement Quality Check Requirements 4. Calculations for Data... 10 of this appendix) and at a national level in references 1, 2, and 3 of this appendix. 1...
40 CFR Appendix A to Part 58 - Quality Assurance Requirements for SLAMS, SPMs and PSD Air Monitoring

Code of Federal Regulations, 2013 CFR

2013-07-01

... monitor. 3.3.4.4Pb Performance Evaluation Program (PEP) Procedures. Each year, one performance evaluation... Information 2. Quality System Requirements 3. Measurement Quality Check Requirements 4. Calculations for Data... 10 of this appendix) and at a national level in references 1, 2, and 3 of this appendix. 1...
Improving Fifth Grade Students' Mathematics Self-Efficacy Calibration and Performance through Self-Regulation Training

ERIC Educational Resources Information Center

Ramdass, Darshanand H.

2009-01-01

This primary goal of this study was to investigate the effects of strategy training and self-reflection, two subprocesses of Zimmerman's cyclical model of self-regulation, on fifth grade students' mathematics performance, self-efficacy, self-evaluation, and calibration measures of self-efficacy bias, self-efficacy accuracy, self-evaluation bias,…
Measuring Information Security Performance with 10 by 10 Model for Holistic State Evaluation.

PubMed

Bernik, Igor; Prislan, Kaja

Organizations should measure their information security performance if they wish to take the right decisions and develop it in line with their security needs. Since the measurement of information security is generally underdeveloped in practice and many organizations find the existing recommendations too complex, the paper presents a solution in the form of a 10 by 10 information security performance measurement model. The model-ISP 10×10M is composed of ten critical success factors, 100 key performance indicators and 6 performance levels. Its content was devised on the basis of findings presented in the current research studies and standards, while its structure results from an empirical research conducted among information security professionals from Slovenia. Results of the study show that a high level of information security performance is mostly dependent on measures aimed at managing information risks, employees and information sources, while formal and environmental factors have a lesser impact. Experts believe that information security should evolve systematically, where it's recommended that beginning steps include technical, logical and physical security controls, while advanced activities should relate predominantly strategic management activities. By applying the proposed model, organizations are able to determine the actual level of information security performance based on the weighted indexing technique. In this manner they identify the measures they ought to develop in order to improve the current situation. The ISP 10×10M is a useful tool for conducting internal system evaluations and decision-making. It may also be applied to a larger sample of organizations in order to determine the general state-of-play for research purposes.
On the estimation algorithm used in adaptive performance optimization of turbofan engines

NASA Technical Reports Server (NTRS)

Espana, Martin D.; Gilyard, Glenn B.

1993-01-01

The performance seeking control algorithm is designed to continuously optimize the performance of propulsion systems. The performance seeking control algorithm uses a nominal model of the propulsion system and estimates, in flight, the engine deviation parameters characterizing the engine deviations with respect to nominal conditions. In practice, because of measurement biases and/or model uncertainties, the estimated engine deviation parameters may not reflect the engine's actual off-nominal condition. This factor has a necessary impact on the overall performance seeking control scheme exacerbated by the open-loop character of the algorithm. The effects produced by unknown measurement biases over the estimation algorithm are evaluated. This evaluation allows for identification of the most critical measurements for application of the performance seeking control algorithm to an F100 engine. An equivalence relation between the biases and engine deviation parameters stems from an observability study; therefore, it is undecided whether the estimated engine deviation parameters represent the actual engine deviation or whether they simply reflect the measurement biases. A new algorithm, based on the engine's (steady-state) optimization model, is proposed and tested with flight data. When compared with previous Kalman filter schemes, based on local engine dynamic models, the new algorithm is easier to design and tune and it reduces the computational burden of the onboard computer.
Teacher Evaluation Policy and Conflicting Theories of Motivation

ERIC Educational Resources Information Center

Firestone, William A.

2014-01-01

Current interest in teacher evaluation focuses disproportionately on measurement issues and performance-based pay without an overarching theory of how evaluation works. To develop such a theory, I contrast two motivation theories often used to guide thinking about teacher evaluation. External motivation theory relies on economics and extrinsic…
Application of robotic manipulability indices to evaluate thumb performance during smartphone touch operations.

PubMed

Endo, Hiroshi

2015-01-01

This study examined whether manipulability during smartphone thumb-based touch operations could be predicted by the following robotic manipulability indices: the volume and direction of the 'manipulability ellipsoid' (MEd), both of which evaluate the influence of kinematics on manipulability. Limits of the thumb's range of motion were considered in the MEd to improve predictability. Thumb postures at 25 key target locations were measured in 16 subjects. Though there was no correlation between subjective evaluation and the volume of the MEd, high correlation was obtained when motion range limits were taken into account. These limits changed the size of the MEd and improved the accuracy of the manipulability evaluation. Movement directions associated with higher performance could also be predicted. In conclusion, robotic manipulability indices with motion range limits were considered to be useful measures for quantitatively evaluating human hand operations.
[Comparison of Organ Dose Calculation Using Monte Carlo Simulation and In-phantom Dosimetry in CT Examination].

PubMed

Iriuchijima, Akiko; Fukushima, Yasuhiro; Ogura, Akio

Direct measurement of each patient organ dose from computed tomography (CT) is not possible. Most methods to estimate patient organ dose is using Monte Carlo simulation with dedicated software. However, the method and the relative differences between organ dose simulation and measurement is unclear. The purpose of this study was to compare organ doses evaluated by Monte Carlo simulation with doses evaluated by in-phantom dosimetry. The simulation software Radimetrics (Bayer) was used for the calculation of organ dose. Measurement was performed with radio-photoluminescence glass dosimeter (RPLD) set at various organ positions within RANDO phantom. To evaluate difference of CT scanner, two different CT scanners were used in this study. Angular dependence of RPLD and measurement of effective energy were performed for each scanner. The comparison of simulation and measurement was evaluated by relative differences. In the results, angular dependence of RPLD at two scanners was 31.6±0.45 mGy for SOMATOM Definition Flash and 29.2±0.18 mGy for LightSpeed VCT. The organ dose was 42.2 mGy (range, 29.9-52.7 mGy) by measurements and 37.7 mGy (range, 27.9-48.1 mGy) by simulations. The relative differences of organ dose between measurement and simulation were 13%, excluding of breast's 42%. We found that organ dose by simulation was lower than by measurement. In conclusion, the results of relative differences will be useful for evaluating organ doses for individual patients by simulation software Radimetrics.
Association between liver transplant center performance evaluations and transplant volume.

PubMed

Buccini, L D; Segev, D L; Fung, J; Miller, C; Kelly, D; Quintini, C; Schold, J D

2014-09-01

There has been increased oversight of transplant centers and stagnation in liver transplantation nationally in recent years. We hypothesized that centers that received low performance (LP) evaluations were more likely to alter protocols, resulting in reduced rates of transplants and patients placed on the waiting list. We evaluated the association of LP evaluations and transplant activity among liver transplant centers in the United States using national Scientific Registry of Transplant Recipients data (January 2007 to July 2012). We compared the average change in recipient and candidate volume and donor and patient characteristics based on whether the centers received LP evaluations. Of 92 eligible centers, 27 (29%) received at least one LP evaluation. Centers without an LP evaluation (n = 65) had an average increase of 9.3 transplants and 14.9 candidates while LP centers had an average decrease of 39.9 transplants (p < 0.01) and 67.3 candidates (p < 0.01). LP centers reduced the use of older donors, donations with longer cold ischemia, and donations after cardiac death (p-values < 0.01). There was no association between the change in transplant volume and measured performance (R(2) = 0.002, p = 0.91). Findings indicate a strong association between performance evaluations and changes in candidate listings and transplants among liver transplant centers, with no measurable improvement in outcomes associated with reduction in transplant volume. © Copyright 2014 The American Society of Transplantation and the American Society of Transplant Surgeons.
Modelling of different measures for improving removal in a stormwater pond.

PubMed

German, J; Jansons, K; Svensson, G; Karlsson, D; Gustafsson, L G

2005-01-01

The effect of retrofitting an existing pond on removal efficiency and hydraulic performance was modelled using the commercial software Mike21 and compartmental modelling. The Mike21 model had previously been calibrated on the studied pond. Installation of baffles, the addition of culverts under a causeway and removal of an existing island were all studied as possible improvement measures in the pond. The subsequent effect on hydraulic performance and removal of suspended solids was then evaluated. Copper, cadmium, BOD, nitrogen and phosphorus removal were also investigated for that specific improvement measure showing the best results. Outcomes of this study reveal that all measures increase the removal efficiency of suspended solids. The hydraulic efficiency is improved for all cases, except for the case where the island is removed. Compartmental modelling was also used to evaluate hydraulic performance and facilitated a better understanding of the way each of the different measures affected the flow pattern and performance. It was concluded that the installation of baffles is the best of the studied measures resulting in a reduction in the annual load on the receiving lake by approximately 8,000 kg of suspended solids (25% reduction of the annual load), 2 kg of copper (10% reduction of the annual load) and 600 kg of BOD (10% reduction of the annual load).
Symptom and performance validity with veterans assessed for attention-deficit/hyperactivity disorder (ADHD).

PubMed

Shura, Robert D; Denning, John H; Miskey, Holly M; Rowland, Jared A

2017-12-01

Little is known about attention-deficit/hyperactivity disorder (ADHD) in veterans. Practice standards recommend the use of both symptom and performance validity measures in any assessment, and there are salient external incentives associated with ADHD evaluation (stimulant medication access and academic accommodations). The purpose of this study was to evaluate symptom and performance validity measures in a clinical sample of veterans presenting for specialty ADHD evaluation. Patients without a history of a neurocognitive disorder and for whom data were available on all measures (n = 114) completed a clinical interview structured on DSM-5 ADHD symptoms, the Minnesota Multiphasic Personality Inventory-2-Restructured Form (MMPI-2-RF), and the Test of Memory Malingering Trial 1 (TOMM1) as part of a standardized ADHD diagnostic evaluation. Veterans meeting criteria for ADHD were not more likely to overreport symptoms on the MMPI-2-RF nor to fail TOMM1 (score ≤ 41) compared with those who did not meet criteria. Those who overreported symptoms did not endorse significantly more ADHD symptoms; however, those who failed TOMM1 did report significantly more ADHD symptoms (g = 0.90). In the total sample, 19.3% failed TOMM1, 44.7% overreported on the MMPI-2-RF, and 8.8% produced both an overreported MMPI-2-RF and invalid TOMM1. F-r had the highest correlation to TOMM1 scores (r = -.30). These results underscore the importance of assessing both symptom and performance validity in a clinical ADHD evaluation with veterans. In contrast to certain other conditions (e.g., mild traumatic brain injury), ADHD as a diagnosis is not related to higher rates of invalid report/performance in veterans. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Performance of electrolyte measurements assessed by a trueness verification program.

PubMed

Ge, Menglei; Zhao, Haijian; Yan, Ying; Zhang, Tianjiao; Zeng, Jie; Zhou, Weiyan; Wang, Yufei; Meng, Qinghui; Zhang, Chuanbao

2016-08-01

In this study, we analyzed frozen sera with known commutabilities for standardization of serum electrolyte measurements in China. Fresh frozen sera were sent to 187 clinical laboratories in China for measurement of four electrolytes (sodium, potassium, calcium, and magnesium). Target values were assigned by two reference laboratories. Precision (CV), trueness (bias), and accuracy [total error (TEa)] were used to evaluate measurement performance, and the tolerance limit derived from the biological variation was used as the evaluation criterion. About half of the laboratories used a homogeneous system (same manufacturer for instrument, reagent and calibrator) for calcium and magnesium measurement, and more than 80% of laboratories used a homogeneous system for sodium and potassium measurement. More laboratories met the tolerance limit of imprecision (coefficient of variation [CVa]) than the tolerance limits of trueness (biasa) and TEa. For sodium, calcium, and magnesium, the minimal performance criterion derived from biological variation was used, and the pass rates for total error were approximately equal to the bias (<50%). For potassium, the pass rates for CV and TE were more than 90%. Compared with the non homogeneous system, the homogeneous system was superior for all three quality specifications. The use of commutable proficiency testing/external quality assessment (PT/EQA) samples with values assigned by reference methods can monitor performance and provide reliable data for improving the performance of laboratory electrolyte measurement. The homogeneous systems were superior to the non homogeneous systems, whereas accuracy of assigned values of calibrators and assay stability remained challenges.
Standardization of Broadband UV Measurements for 365 nm LED Sources

PubMed Central

Eppeldauer, George P.

2012-01-01

Broadband UV measurements are evaluated when UV-A irradiance meters measure optical radiation from 365 nm UV sources. The CIE standardized rectangular-shape UV-A function can be realized only with large spectral mismatch errors. The spectral power-distribution of the 365 nm excitation source is not standardized. Accordingly, the readings made with different types of UV meters, even if they measure the same UV source, can be very different. Available UV detectors and UV meters were measured and evaluated for spectral responsivity. The spectral product of the source-distribution and the meter’s spectral-responsivity were calculated for different combinations to estimate broad-band signal-measurement errors. Standardization of both the UV source-distribution and the meter spectral-responsivity is recommended here to perform uniform broad-band measurements with low uncertainty. It is shown what spectral responsivity function(s) is needed for new and existing UV irradiance meters to perform low-uncertainty broadband 365 nm measurements. PMID:26900516

Insights into Education's Race to the Top: Correlational Survey Exploring Perceptions of Organizational Culture and Change Ambivalence during the Implementation of a Mandated Performance Evaluation System in a Northeast U.S. School District

ERIC Educational Resources Information Center

Schwamb, Andrea B.

2013-01-01

American public schools are currently facing a new mandated evaluation system that will create substantial change by requiring districts to evaluate professional staff based on two quantified measures: (a) state testing, and (b) a district determined measure. Although reforms have been at the forefront of policymakers' agendas, these initiatives…
Indirect Measures in Evaluation: On Not Knowing What We Don't Know

ERIC Educational Resources Information Center

Heath, Linda; DeHoek, Adam; Locatelli, Sara House

2012-01-01

Evaluators frequently make use of indirect measures of participant learning or skill mastery, with participants either being asked if they have learned material or mastered a skill or being asked to indicate how confident they are that they know the material or can perform the task in question. Unfortunately, myriad research in social psychology…
Performance optimisation of a new-generation orthogonal-acceleration quadrupole-time-of-flight mass spectrometer.

PubMed

Bristow, Tony; Constantine, Jill; Harrison, Mark; Cavoit, Fabien

2008-04-01

Orthogonal-acceleration quadrupole time-of-flight (oa-QTOF) mass spectrometers, employed for accurate mass measurement, have been commercially available for well over a decade. A limitation of the early instruments of this type was the narrow ion abundance range over which accurate mass measurements could be made with a high degree of certainty. Recently, a new generation of oa-QTOF mass spectrometers has been developed and these allow accurate mass measurements to be recorded over a much greater range of ion abundances. This development has resulted from new ion detection technology and improved electronic stability or by accurate control of the number of ions reaching the detector. In this report we describe the results from experiments performed to evaluate the mass measurement performance of the Bruker micrOTOF-Q, a member of the new-generation oa-QTOFs. The relationship between mass accuracy and ion abundance has been extensively evaluated and mass measurement accuracy remained stable (+/-1.5 m m/z units) over approximately 3-4 orders of magnitude of ion abundance. The second feature of the Bruker micrOTOF-Q that was evaluated was the SigmaFit function of the software. This isotope pattern-matching algorithm provides an exact numerical comparison of the theoretical and measured isotope patterns as an additional identification tool to accurate mass measurement. The smaller the value, the closer the match between theoretical and measured isotope patterns. This information is then employed to reduce the number of potential elemental formulae produced from the mass measurements. A relationship between the SigmaFit value and ion abundance has been established. The results from the study for both mass accuracy and SigmaFit were employed to define the performance criteria for the micrOTOF-Q. This provided increased confidence in the selection of elemental formulae resulting from accurate mass measurements.
Performance evaluation of nonhomogeneous hospitals: the case of Hong Kong hospitals.

PubMed

Li, Yongjun; Lei, Xiyang; Morton, Alec

2018-02-14

Throughout the world, hospitals are under increasing pressure to become more efficient. Efficiency analysis tools can play a role in giving policymakers insight into which units are less efficient and why. Many researchers have studied efficiencies of hospitals using data envelopment analysis (DEA) as an efficiency analysis tool. However, in the existing literature on DEA-based performance evaluation, a standard assumption of the constant returns to scale (CRS) or the variable returns to scale (VRS) DEA models is that decision-making units (DMUs) use a similar mix of inputs to produce a similar set of outputs. In fact, hospitals with different primary goals supply different services and provide different outputs. That is, hospitals are nonhomogeneous and the standard assumption of the DEA model is not applicable to the performance evaluation of nonhomogeneous hospitals. This paper considers the nonhomogeneity among hospitals in the performance evaluation and takes hospitals in Hong Kong as a case study. An extension of Cook et al. (2013) [1] based on the VRS assumption is developed to evaluated nonhomogeneous hospitals' efficiencies since inputs of hospitals vary greatly. Following the philosophy of Cook et al. (2013) [1], hospitals are divided into homogeneous groups and the product process of each hospital is divided into subunits. The performance of hospitals is measured on the basis of subunits. The proposed approach can be applied to measure the performance of other nonhomogeneous entities that exhibit variable return to scale.
Using hybrid method to evaluate the green performance in uncertainty.

PubMed

Tseng, Ming-Lang; Lan, Lawrence W; Wang, Ray; Chiu, Anthony; Cheng, Hui-Ping

2011-04-01

Green performance measure is vital for enterprises in making continuous improvements to maintain sustainable competitive advantages. Evaluation of green performance, however, is a challenging task due to the dependence complexity of the aspects, criteria, and the linguistic vagueness of some qualitative information and quantitative data together. To deal with this issue, this study proposes a novel approach to evaluate the dependence aspects and criteria of firm's green performance. The rationale of the proposed approach, namely green network balanced scorecard, is using balanced scorecard to combine fuzzy set theory with analytical network process (ANP) and importance-performance analysis (IPA) methods, wherein fuzzy set theory accounts for the linguistic vagueness of qualitative criteria and ANP converts the relations among the dependence aspects and criteria into an intelligible structural modeling used IPA. For the empirical case study, four dependence aspects and 34 green performance criteria for PCB firms in Taiwan were evaluated. The managerial implications are discussed.
Performance Evaluation and Analysis for Gravity Matching Aided Navigation.

PubMed

Wu, Lin; Wang, Hubiao; Chai, Hua; Zhang, Lu; Hsu, Houtse; Wang, Yong

2017-04-05

Simulation tests were accomplished in this paper to evaluate the performance of gravity matching aided navigation (GMAN). Four essential factors were focused in this study to quantitatively evaluate the performance: gravity database (DB) resolution, fitting degree of gravity measurements, number of samples in matching, and gravity changes in the matching area. Marine gravity anomaly DB derived from satellite altimetry was employed. Actual dynamic gravimetry accuracy and operating conditions were referenced to design the simulation parameters. The results verified that the improvement of DB resolution, gravimetry accuracy, number of measurement samples, or gravity changes in the matching area generally led to higher positioning accuracies, while the effects of them were different and interrelated. Moreover, three typical positioning accuracy targets of GMAN were proposed, and the conditions to achieve these targets were concluded based on the analysis of several different system requirements. Finally, various approaches were provided to improve the positioning accuracy of GMAN.
Performance Evaluation and Analysis for Gravity Matching Aided Navigation

PubMed Central

Wu, Lin; Wang, Hubiao; Chai, Hua; Zhang, Lu; Hsu, Houtse; Wang, Yong

2017-01-01

Simulation tests were accomplished in this paper to evaluate the performance of gravity matching aided navigation (GMAN). Four essential factors were focused in this study to quantitatively evaluate the performance: gravity database (DB) resolution, fitting degree of gravity measurements, number of samples in matching, and gravity changes in the matching area. Marine gravity anomaly DB derived from satellite altimetry was employed. Actual dynamic gravimetry accuracy and operating conditions were referenced to design the simulation parameters. The results verified that the improvement of DB resolution, gravimetry accuracy, number of measurement samples, or gravity changes in the matching area generally led to higher positioning accuracies, while the effects of them were different and interrelated. Moreover, three typical positioning accuracy targets of GMAN were proposed, and the conditions to achieve these targets were concluded based on the analysis of several different system requirements. Finally, various approaches were provided to improve the positioning accuracy of GMAN. PMID:28379178
Efficient Comparison between Windows and Linux Platform Applicable in a Virtual Architectural Walkthrough Application

NASA Astrophysics Data System (ADS)

Thubaasini, P.; Rusnida, R.; Rohani, S. M.

This paper describes Linux, an open source platform used to develop and run a virtual architectural walkthrough application. It proposes some qualitative reflections and observations on the nature of Linux in the concept of Virtual Reality (VR) and on the most popular and important claims associated with the open source approach. The ultimate goal of this paper is to measure and evaluate the performance of Linux used to build the virtual architectural walkthrough and develop a proof of concept based on the result obtain through this project. Besides that, this study reveals the benefits of using Linux in the field of virtual reality and reflects a basic comparison and evaluation between Windows and Linux base operating system. Windows platform is use as a baseline to evaluate the performance of Linux. The performance of Linux is measured based on three main criteria which is frame rate, image quality and also mouse motion.
Performance evaluation of 4 measuring methods of ground-glass opacities for predicting the 5-year relapse-free survival of patients with peripheral nonsmall cell lung cancer: a multicenter study.

PubMed

Kakinuma, Ryutaro; Kodama, Ken; Yamada, Kouzo; Yokoyama, Akira; Adachi, Shuji; Mori, Kiyoshi; Fukuyama, Yasuro; Fukuda, Yasuro; Kuriyama, Keiko; Oda, Junichi; Oda, Junji; Noguchi, Masayuki; Matsuno, Yoshihiro; Yokose, Tomoyuki; Ohmatsu, Hironobu; Nishiwaki, Yutaka

2008-01-01

To evaluate the performance of 4 methods of measuring the extent of ground-glass opacities as a means of predicting the 5-year relapse-free survival of patients with peripheral nonsmall cell lung cancer (NSLC). Ground-glass opacities on thin-section computed tomographic images of 120 peripheral NSLCs were measured at 7 medical institutions by the length, area, modified length, and vanishing ratio (VR) methods. The performance (Az) of each method in predicting the 5-year relapse-free survival was evaluated using receiver operating characteristic analysis. The mean Az value obtained by the length, area, modified length, and VR methods in the receiver operating characteristic analyses was 0.683, 0.702, 0.728, and 0.784, respectively. The differences between the mean Az value obtained by the VR method and by the other 3 methods were significant. Vanishing ratio method was the most accurate predictor of the 5-year relapse-free survival of patients with peripheral NSLC.
Emergency radiobioassay preparedness exercises through the NIST radiochemistry intercomparison program.

PubMed

Nour, Svetlana; LaRosa, Jerry; Inn, Kenneth G W

2011-08-01

The present challenge for the international emergency radiobioassay community is to analyze contaminated samples rapidly while maintaining high quality results. The National Institute of Standards and Technology (NIST) runs a radiobioassay measurement traceability testing program to evaluate the radioanalytical capabilities of participating laboratories. The NIST Radiochemistry Intercomparison Program (NRIP) started more than 10 years ago, and emergency performance testing was added to the program seven years ago. Radiobioassay turnaround times under the NRIP program for routine production and under emergency response scenarios are 60 d and 8 h, respectively. Because measurement accuracy and sample turnaround time are very critical in a radiological emergency, response laboratories' analytical systems are best evaluated and improved through traceable Performance Testing (PT) programs. The NRIP provides participant laboratories with metrology tools to evaluate their performance and to improve it. The program motivates the laboratories to optimize their methodologies and minimize the turnaround time of their results. Likewise, NIST has to make adjustments and periodical changes in the bioassay test samples in order to challenge the participating laboratories continually. With practice, radioanalytical measurements turnaround time can be reduced to 3-4 h.
Reliability of Performance-Based Clinical Measurements to Assess Shoulder Girdle Kinematics and Positioning: Systematic Review.

PubMed

D'hondt, Norman E; Kiers, Henri; Pool, Jan J M; Hacquebord, Sijmen T; Terwee, Caroline B; Veeger, Dirkjan H E J

2017-01-01

Deviant shoulder girdle movement is suggested as an eminent factor in the etiology of shoulder pain. Reliable measurements of shoulder girdle kinematics are a prerequisite for optimizing clinical management strategies. The purpose of this study was to evaluate the reliability, measurement error, and internal consistency of measurements with performance-based clinical tests for shoulder girdle kinematics and positioning in patients with shoulder pain. The MEDLINE, Embase, CINAHL, and SPORTDiscus databases were systematically searched from inception to August 2015. Articles published in Dutch, English, or German were included if they involved the evaluation of at least one of the measurement properties of interest. Two reviewers independently evaluated the methodological quality per studied measurement property with the 4-point-rating scale of the COSMIN (COnsensus-based Standards for the selection of health Measurement INstruments) checklist, extracted data, and assessed the adequacy of the measurement properties. Forty studies comprising more than 30 clinical tests were included. Actual reported measurements of the tests were categorized into: (1) positional measurement methods, (2) measurement methods to determine dynamic characteristics, and (3) tests to diagnose impairments of shoulder girdle function. Best evidence synthesis of the tests was performed per measurement for each measurement property. All studies had significant limitations, including incongruence between test description and actual reported measurements and a lack of reporting on minimal important change. In general, the methodological quality of the selected studies was fair to poor. High-quality evidence indicates that measurements obtained with the Modified Scapular Assistance Test are not reliable for clinical use. Sound recommendations for the use of other tests could not be made due to inadequate evidence. Across studies, diversity in description, performance, and interpretation of similar tests was present, and different criteria were used to establish similar diagnoses, mostly without taking into account a clinically meaningful context. Consequently, these tests lack face validity, which hampers their clinical use. Further research on validity and how to integrate a clinically meaningful context of movement into clinical tests is warranted. © 2017 American Physical Therapy Association
Development of a measure of student self-evaluation of physics exam performance

NASA Astrophysics Data System (ADS)

Hagedorn, Eric Anthony

The central purpose of this study was to provide preliminary evidence of the reliability and validity of the SEVSI - P (Self- evaluation scaled instrument - physics). This instrument, designed to measure student self-evaluation of physics exam performance, was developed in congruence with social cognitive theory. Self-evaluation in this study is defined to consist of two of the three subprocesses of self-regulation: self-observation and judgmental process. As such, the SEVSI - P consists of two subscales, one measuring the frequency and types of self-observations made during a physics exam and one measuring the frequency and types of judgmental comparisons made after an exam. Data from 621 completed surveys, voluntarily taken by first semester algebra/trigonometry based physics students at six Midwestern universities and one Southern university, were analyzed for reliability and factorial validity. Cronbach alphas of .71 and .83 for the self-observation and judgment subscales, respectively, indicate acceptable reliability for the instrument. Confirmatory factor analysis indicates the acceptability of the hypothesis that the data analyzed could have indeed been obtained from the proposed two factor model (self-observation and judgment). The results of this confirmatory factor analysis provide preliminary construct validity for this instrument. A number of theoretically related items were included on the SEVSI - P form to elicity information about the use of goals and pre-planned strategies, actions taken in response to previous poor performances, and emotional responses to performance. A correlational analysis of these items along with the self-observation and judgment subscale scores provided a limited degree of convergent validity for the two subscales. Analyses of variance were done to determine the presence of differences in scoring patterns based on gender or reported ethnic origin. These results indicate slightly higher judgment subscale scores for women and members of minority groups. The implications of these differences are suggested as warranting future research. Future uses of the SEVSI - P include classroom use to assist students self-evaluate their exam performances in order to increase their achievement. Future research using the SEVSI - P to determine the causal relationships between self-evaluation, actual achievement, and other social cognitive constructs such as self-efficacy are suggested.
Quality Measures for Dialysis: Time for a Balanced Scorecard.

PubMed

Kliger, Alan S

2016-02-05

Recent federal legislation establishes a merit-based incentive payment system for physicians, with a scorecard for each professional. The Centers for Medicare and Medicaid Services evaluate quality of care with clinical performance measures and have used these metrics for public reporting and payment to dialysis facilities. Similar metrics may be used for the future merit-based incentive payment system. In nephrology, most clinical performance measures measure processes and intermediate outcomes of care. These metrics were developed from population studies of best practice and do not identify opportunities for individualizing care on the basis of patient characteristics and individual goals of treatment. The In-Center Hemodialysis (ICH) Consumer Assessment of Healthcare Providers and Systems (CAHPS) survey examines patients' perception of care and has entered the arena to evaluate quality of care. A balanced scorecard of quality performance should include three elements: population-based best clinical practice, patient perceptions, and individually crafted patient goals of care. Copyright © 2016 by the American Society of Nephrology.
INNOVATIVE TECHNOLOGY VERIFICATION REPORT " ...

EPA Pesticide Factsheets

The EnSys Petro Test System developed by Strategic Diagnostics Inc. (SDI), was demonstrated under the U.S. Environmental Protection Agency Superfund Innovative Technology Evaluation Program in June 2000 at the Navy Base Ventura County site in Port Hueneme, California. The purpose of the demonstration was to collect reliable performance and cost data for the EnSys Petro Test System and six other field measurement devices for total petroleum hydrocarbons (TPH) in soil. In addition to assessing ease of device operation, the key objectives of the demonstration included determining the (1) method detection limit, (2) accuracy and precision, (3) effects of interferents and soil moisture content on TPH measurement, (4) sample throughput, and (5) TPH measurement costs for each device. The demonstration involved analysis of both performance evaluation samples and environmental samples collected in four areas contaminated with gasoline, diesel, or other petroleum products. The performance and cost results for a given field measurement device were compared to those for an off-site laboratory reference method,
INNOVATIVE TECHNOLOGY VERIFICATION REPORT " ...

EPA Pesticide Factsheets

The Synchronous Scanning Luminoscope (Luminoscope) developed by the Oak Ridge National Laboratory in collaboration with Environmental Systems Corporation (ESC) was demonstrated under the U.S. Environmental Protection Agency Superfund Innovative Technology Evaluation Program in June 2000 at the Navy Base Ventura County site in Port Hueneme, California. The purpose of the demonstration was to collect reliable performance and cost data for the Luminoscope and six other field measurement devices for total petroleum hydrocarbons (TPH) in soil. In addition to assessing ease of device operation, the key objectives of the demonstration included determining the (1) method detection limit, (2) accuracy and precision, (3) effects of interferents and soil moisture content on TPH measurement, (4) sample throughput, and (5) TPH measurement costs for each device. The demonstration involved analysis of both performance evaluation samples and environmental samples collected in five areas contaminated with gasoline, diesel, lubricating oil, or other petroleum products. The performance and cost results for a given field measurement device were compared to those for an off-site laboratory reference method,
Evaluating stereoscopic displays: both efficiency measures and perceived workload sensitive to manipulations in binocular disparity

NASA Astrophysics Data System (ADS)

van Beurden, Maurice H. P. H.; Ijsselsteijn, Wijnand A.; de Kort, Yvonne A. W.

2011-03-01

Stereoscopic displays are known to offer a number of key advantages in visualizing complex 3D structures or datasets. The large majority of studies that focus on evaluating stereoscopic displays for professional applications use completion time and/or the percentage of correct answers to measure potential performance advantages. However, completion time and accuracy may not fully reflect all the benefits of stereoscopic displays. In this paper, we argue that perceived workload is an additional valuable indicator reflecting the extent to which users can benefit from using stereoscopic displays. We performed an experiment in which participants were asked to perform a visual path-tracing task within a convoluted 3D wireframe structure, varying in level of complexity of the visualised structure and level of disparity of the visualisation. The results showed that an optimal performance (completion time, accuracy and workload), depend both on task difficulty and disparity level. Stereoscopic disparity revealed a faster and more accurate task performance, whereas we observed a trend that performance on difficult tasks stands to benefit more from higher levels of disparity than performance on easy tasks. Perceived workload (as measured using the NASA-TLX) showed a similar response pattern, providing evidence that perceived workload is sensitive to variations in disparity as well as task difficulty. This suggests that perceived workload could be a useful concept, in addition to standard performance indicators, in characterising and measuring human performance advantages when using stereoscopic displays.
Experimental Evaluation of Adaptive Modulation and Coding in MIMO WiMAX with Limited Feedback

NASA Astrophysics Data System (ADS)

Mehlführer, Christian; Caban, Sebastian; Rupp, Markus

2007-12-01

We evaluate the throughput performance of an OFDM WiMAX (IEEE 802.16-2004, Section 8.3) transmission system with adaptive modulation and coding (AMC) by outdoor measurements. The standard compliant AMC utilizes a 3-bit feedback for SISO and Alamouti coded MIMO transmissions. By applying a 6-bit feedback and spatial multiplexing with individual AMC on the two transmit antennas, the data throughput can be increased significantly for large SNR values. Our measurements show that at small SNR values, a single antenna transmission often outperforms an Alamouti transmission. We found that this effect is caused by the asymmetric behavior of the wireless channel and by poor channel knowledge in the two-transmit-antenna case. Our performance evaluation is based on a measurement campaign employing the Vienna MIMO testbed. The measurement scenarios include typical outdoor-to-indoor NLOS, outdoor-to-outdoor NLOS, as well as outdoor-to-indoor LOS connections. We found that in all these scenarios, the measured throughput is far from its achievable maximum; the loss is mainly caused by a too simple convolutional coding.
Optical Coherence Tomography Evaluation in the Multicenter Uveitis Steroid Treatment (MUST) Trial

PubMed Central

Domalpally, Amitha; Altaweel, Michael M.; Kempen, John H.; Myers, Dawn; Davis, Janet L; Foster, C Stephen; Latkany, Paul; Srivastava, Sunil K.; Stawell, Richard J.; Holbrook, Janet T.

2013-01-01

Purpose To describe the evaluation of optical coherence tomography (OCT) scans in the Muliticenter Uveitis Steroid Treatment (MUST) trial and report baseline OCT features of enrolled participants. Methods Time domain OCTs acquired by certified photographers using a standardized scan protocol were evaluated at a Reading Center. Accuracy of retinal thickness data was confirmed with quality evaluation and caliper measurement of centerpoint thickness (CPT) was performed when unreliable. Morphological evaluation included cysts, subretinal fluid,epiretinal membranes (ERMs),and vitreomacular traction. Results Of the 453 OCTs evaluated, automated retinal thickness was accurate in 69.5% of scans, caliper measurement was performed in 26%,and 4% were ungradable. Intraclass correlation was 0.98 for reproducibility of caliper measurement. Macular edema (centerpoint thickness ≥ 240um) was present in 36%. Cysts were present in 36.6% of scans and ERMs in 27.8%, predominantly central. Intergrader agreement ranged from 78 − 82% for morphological features. Conclusion Retinal thickness data can be retrieved in a majority of OCT scans in clinical trial submissions for uveitis studies. Small cysts and ERMs involving the center are common in intermediate and posterior/panuveitis requiring systemic corticosteroid therapy. PMID:23163490
77 FR 38071 - Council on Graduate Medical Education; Notice of Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2012-06-26

... graduate medical education, evaluation of teaching programs especially in terms of meeting community needs...' development of performance measures and methods of longitudinal evaluation specific to the training programs...
Estimating learning outcomes from pre- and posttest student self-assessments: a longitudinal study.

PubMed

Schiekirka, Sarah; Reinhardt, Deborah; Beißbarth, Tim; Anders, Sven; Pukrop, Tobias; Raupach, Tobias

2013-03-01

Learning outcome is an important measure for overall teaching quality and should be addressed by comprehensive evaluation tools. The authors evaluated the validity of a novel evaluation tool based on student self-assessments, which may help identify specific strengths and weaknesses of a particular course. In 2011, the authors asked 145 fourth-year students at Göttingen Medical School to self-assess their knowledge on 33 specific learning objectives in a pretest and posttest as part of a cardiorespiratory module. The authors compared performance gain calculated from self-assessments with performance gain derived from formative examinations that were closely matched to these 33 learning objectives. Eighty-three students (57.2%) completed the assessment. There was good agreement between performance gain derived from subjective data and performance gain derived from objective examinations (Pearson r=0.78; P<.0001) on the group level. The association between the two measures was much weaker when data were analyzed on the individual level. Further analysis determined a quality cutoff for performance gain derived from aggregated student self-assessments. When using this cutoff, the evaluation tool was highly sensitive in identifying specific learning objectives with favorable or suboptimal objective performance gains. The tool is easy to implement, takes initial performance levels into account, and does not require extensive pre-post testing. By providing valid estimates of actual performance gain obtained during a teaching module, it may assist medical teachers in identifying strengths and weaknesses of a particular course on the level of specific learning objectives.

Accuracy of force and center of pressure measures of the Wii Balance Board.

PubMed

Bartlett, Harrison L; Ting, Lena H; Bingham, Jeffrey T

2014-01-01

The Nintendo Wii Balance Board (WBB) is increasingly used as an inexpensive force plate for assessment of postural control; however, no documentation of force and COP accuracy and reliability is publicly available. Therefore, we performed a standard measurement uncertainty analysis on 3 lightly and 6 heavily used WBBs to provide future users with information about the repeatability and accuracy of the WBB force and COP measurements. Across WBBs, we found the total uncertainty of force measurements to be within ± 9.1N, and of COP location within ± 4.1mm. However, repeatability of a single measurement within a board was better (4.5 N, 1.5mm), suggesting that the WBB is best used for relative measures using the same device, rather than absolute measurement across devices. Internally stored calibration values were comparable to those determined experimentally. Further, heavy wear did not significantly degrade performance. In combination with prior evaluation of WBB performance and published standards for measuring human balance, our study provides necessary information to evaluate the use of the WBB for analysis of human balance control. We suggest the WBB may be useful for low-resolution measurements, but should not be considered as a replacement for laboratory-grade force plates. Published by Elsevier B.V.
Accuracy of force and center of pressure measures of the Wii Balance Board

PubMed Central

Bartlett, Harrison L.; Ting, Lena H.; Bingham, Jeffrey T.

2013-01-01

The Nintendo Wii Balance Board (WBB) is increasingly used as an inexpensive force plate for assessment of postural control; however, no documentation of force and COP accuracy and reliability is publicly available. Therefore, we performed a standard measurement uncertainty analysis on 3 lightly and 6 heavily used WBBs to provide future users with information about the repeatability and accuracy of the WBB force and COP measurements. Across WBBs, we found the total uncertainty of force measurements to be within ±9.1 N, and of COP location within ±4.1 mm. However, repeatability of a single measurement within a board was better (4.5 N, 1.5 mm), suggesting that the WBB is best used for relative measures using the same device, rather than absolute measurement across devices. Internally stored calibration values were comparable to those determined experimentally. Further, heavy wear did not significantly degrade performance. In combination with prior evaluation of WBB performance and published standards for measuring human balance, our study provides necessary information to evaluate the use of the WBB for analysis of human balance control. We suggest the WBB may be useful for low-resolution measurements, but should not be considered as a replacement for laboratory-grade force plates. PMID:23910725
Evaluation of a flow direction probe and a pitot-static probe on the F-14 airplane at high angles of attack and sideslip

NASA Technical Reports Server (NTRS)

Larson, T. J.

1984-01-01

The measurement performance of a hemispherical flow-angularity probe and a fuselage-mounted pitot-static probe was evaluated at high flow angles as part of a test program on an F-14 airplane. These evaluations were performed using a calibrated pitot-static noseboom equipped with vanes for reference flow direction measurements, and another probe incorporating vanes but mounted on a pod under the fuselage nose. Data are presented for angles of attack up to 63, angles of sideslip from -22 deg to 22 deg, and for Mach numbers from approximately 0.3 to 1.3. During maneuvering flight, the hemispherical flow-angularity probe exhibited flow angle errors that exceeded 2 deg. Pressure measurements with the pitot-static probe resulted in very inaccurate data above a Mach number of 0.87 and exhibited large sensitivities with flow angle.
Data warehouse model for monitoring key performance indicators (KPIs) using goal oriented approach

NASA Astrophysics Data System (ADS)

Abdullah, Mohammed Thajeel; Ta'a, Azman; Bakar, Muhamad Shahbani Abu

2016-08-01

The growth and development of universities, just as other organizations, depend on their abilities to strategically plan and implement development blueprints which are in line with their vision and mission statements. The actualizations of these statements, which are often designed into goals and sub-goals and linked to their respective actors are better measured by defining key performance indicators (KPIs) of the university. The proposes ReGADaK, which is an extended the GRAnD approach highlights the facts, dimensions, attributes, measures and KPIs of the organization. The measures from the goal analysis of this unit serve as the basis of developing the related university's KPIs. The proposed data warehouse schema is evaluated through expert review, prototyping and usability evaluation. The findings from the evaluation processes suggest that the proposed data warehouse schema is suitable for monitoring the University's KPIs.
Are university rankings useful to improve research? A systematic review

PubMed Central

Momani, Shaher

2018-01-01

Introduction Concerns about reproducibility and impact of research urge improvement initiatives. Current university ranking systems evaluate and compare universities on measures of academic and research performance. Although often useful for marketing purposes, the value of ranking systems when examining quality and outcomes is unclear. The purpose of this study was to evaluate usefulness of ranking systems and identify opportunities to support research quality and performance improvement. Methods A systematic review of university ranking systems was conducted to investigate research performance and academic quality measures. Eligibility requirements included: inclusion of at least 100 doctoral granting institutions, be currently produced on an ongoing basis and include both global and US universities, publish rank calculation methodology in English and independently calculate ranks. Ranking systems must also include some measures of research outcomes. Indicators were abstracted and contrasted with basic quality improvement requirements. Exploration of aggregation methods, validity of research and academic quality indicators, and suitability for quality improvement within ranking systems were also conducted. Results A total of 24 ranking systems were identified and 13 eligible ranking systems were evaluated. Six of the 13 rankings are 100% focused on research performance. For those reporting weighting, 76% of the total ranks are attributed to research indicators, with 24% attributed to academic or teaching quality. Seven systems rely on reputation surveys and/or faculty and alumni awards. Rankings influence academic choice yet research performance measures are the most weighted indicators. There are no generally accepted academic quality indicators in ranking systems. Discussion No single ranking system provides a comprehensive evaluation of research and academic quality. Utilizing a combined approach of the Leiden, Thomson Reuters Most Innovative Universities, and the SCImago ranking systems may provide institutions with a more effective feedback for research improvement. Rankings which extensively rely on subjective reputation and “luxury” indicators, such as award winning faculty or alumni who are high ranking executives, are not well suited for academic or research performance improvement initiatives. Future efforts should better explore measurement of the university research performance through comprehensive and standardized indicators. This paper could serve as a general literature citation when one or more of university ranking systems are used in efforts to improve academic prominence and research performance. PMID:29513762
A Case Report Examining the Feasibility of Meta-Cognitive Strategy Training in Acute Inpatient Stroke Rehabilitation

PubMed Central

Skidmore, Elizabeth R.; Holm, Margo B.; Whyte, Ellen M.; Dew, Mary Amanda; Dawson, Deirdre; Becker, James T.

2011-01-01

Meta-cognitive strategy training may be used to augment inpatient rehabilitation to promote active engagement and subsequent benefit for individuals with cognitive impairments after stroke. We examined the feasibility of administering a form of meta-cognitive strategy training, Cognitive Orientation to daily Occupational Performance, during inpatient rehabilitation. We trained an individual with cognitive impairments after right hemisphere stroke to identify performance problems, set self-selected goals, develop plans to address goals, and evaluate performance improvements. To assess feasibility, we examined the number of meta-cognitive training sessions attended, the number of self-selected goals, and changes in goal-related performance. We also examined changes in rehabilitation engagement and disability. The participant used the meta-cognitive strategy to set 8 goals addressing physically-oriented, instrumental, and work-related activities. Mean improvement in Canadian Occupational Performance Measure Performance Scale scores was 6.1. Pittsburgh Rehabilitation Participation Scale scores (measuring rehabilitation engagement) improved from 3.2 at admission to 4.9 at discharge. Functional Independence Measure scores (measuring disability) improved from 68 at admission, to 97 at discharge. Performance Assessment of Self-care Skills scores improved from 1.1 at admission to 2.9 at discharge. The results indicate that meta-cognitive strategy training was feasible during inpatient rehabilitation and warrants further evaluation to determine its effectiveness. PMID:21391121
Analysis of key technologies for virtual instruments metrology

NASA Astrophysics Data System (ADS)

Liu, Guixiong; Xu, Qingui; Gao, Furong; Guan, Qiuju; Fang, Qiang

2008-12-01

Virtual instruments (VIs) require metrological verification when applied as measuring instruments. Owing to the software-centered architecture, metrological evaluation of VIs includes two aspects: measurement functions and software characteristics. Complexity of software imposes difficulties on metrological testing of VIs. Key approaches and technologies for metrology evaluation of virtual instruments are investigated and analyzed in this paper. The principal issue is evaluation of measurement uncertainty. The nature and regularity of measurement uncertainty caused by software and algorithms can be evaluated by modeling, simulation, analysis, testing and statistics with support of powerful computing capability of PC. Another concern is evaluation of software features like correctness, reliability, stability, security and real-time of VIs. Technologies from software engineering, software testing and computer security domain can be used for these purposes. For example, a variety of black-box testing, white-box testing and modeling approaches can be used to evaluate the reliability of modules, components, applications and the whole VI software. The security of a VI can be assessed by methods like vulnerability scanning and penetration analysis. In order to facilitate metrology institutions to perform metrological verification of VIs efficiently, an automatic metrological tool for the above validation is essential. Based on technologies of numerical simulation, software testing and system benchmarking, a framework for the automatic tool is proposed in this paper. Investigation on implementation of existing automatic tools that perform calculation of measurement uncertainty, software testing and security assessment demonstrates the feasibility of the automatic framework advanced.
[Supply services at health facilities: measuring performance].

PubMed

Dacosta Claro, I

2001-01-01

Performance measurement, in their different meanings--either balance scorecard or outputs measurement--have become an essential tool in today's organizations (World-Class organizations) to improve service quality and reduce costs. This paper presents a performance measurement system for the hospital supply chain. The system is organized in different levels and groups of indicators in order to show a hierarchical, coherent and integrated vision of the processes. Thus, supply services performance is measured according to (1) financial aspects, (2) customers satisfaction aspects and (3) internal aspects of the processes performed. Since the informational needs of the managers vary within the administrative structure, the performance measurement system is defined in three hierarchical levels. Firstly, the whole supply chain, with the different interrelation of activities. Secondly, the three main processes of the chain--physical management of products, purchasing and negotiation processes and the local storage units. And finally, the performance measurement of each activity involved. The system and the indicators have been evaluated with the participation of 17 health services of Quebec (Canada), however, and due to the similarities of the operation, could be equally implemented in Spanish hospitals.
Mapping photopolarimeter spectrometer instrument feasibility study for future planetary flight missions

NASA Technical Reports Server (NTRS)

1990-01-01

Evaluations are summarized directed towards defining optimal instrumentation for performing planetary polarization measurements from a spacecraft platform. An overview of the science rationale for polarimetric measurements is given to point out the importance of such measurements for future studies and exploration of the outer planets. The key instrument features required to perform the needed measurements are discussed and applied to the requirements for the Cassini mission to Saturn. The resultant conceptual design of a spectro-polarimeter photometer for Cassini is described in detail.
Refining the Pediatric Evaluation of Disability Inventory-Patient-Reported Outcome (PEDI-PRO) item candidates: interpretation of a self-reported outcome measure of functional performance by young people with neurodevelopmental disabilities.

PubMed

Kramer, Jessica M; Schwartz, Ariel

2017-10-01

This study examined the item interpretability and rating scale use of the Pediatric Evaluation of Disability Inventory-Patient-Reported Outcome (PEDI-PRO) by young people with developmental disabilities. The PEDI-PRO assesses the functional performance of discrete functional tasks in the context of everyday life situations. A two-phase cognitive interview design was implemented with a convenience sample of 37 young people (mean age 19y, SD 2y 5mo; 13 males and 24 females; 68% with intellectual disability) with developmental disabilities. In phase I, 182 item candidates were each reviewed by an average of four young people. In phase II, 103 items were carried forward or revised and each reviewed by an average of seven additional young people. Two raters coded responses for intended item interpretation and performance quality; codes were analysed using descriptive statistics. Qualitative analysis explored young people's self-evaluation process. Items were interpreted as intended by most young people (mean 86%). Young people can use PEDI-PRO response categories appropriately to describe their performance: 94% of positive performance descriptions coincided with a positive response category choice; 73% of negative descriptions coincided with a negative response category choice. Young people interpreted items in a literal manner, and their self-evaluation incorporated the use of supports that facilitate functional performance. The PEDI-PRO's measurement framework appears to support the self-evaluation of functional performance of young people with developmental disabilities. © 2017 Mac Keith Press.
Experimental Evaluation of High Performance Integrated Heat Pump

DOE Office of Scientific and Technical Information (OSTI.GOV)

Miller, William A; Berry, Robert; Durfee, Neal

2016-01-01

Integrated heat pump (IHP) technology provides significant potential for energy savings and comfort improvement for residential buildings. In this study, we evaluate the performance of a high performance IHP that provides space heating, cooling, and water heating services. Experiments were conducted according to the ASHRAE Standard 206-2013 where 24 test conditions were identified in order to evaluate the IHP performance indices based on the airside performance. Empirical curve fits of the unit s compressor maps are used in conjunction with saturated condensing and evaporating refrigerant conditions to deduce the refrigerant mass flowrate, which, in turn was used to evaluate themore » refrigerant side performance as a check on the airside performance. Heat pump (compressor, fans, and controls) and water pump power were measured separately per requirements of Standard 206. The system was charged per the system manufacturer s specifications. System test results are presented for each operating mode. The overall IHP performance metrics are determined from the test results per the Standard 206 calculation procedures.« less
Evaluation of an interactive science publishing tool: toward enabling three-dimensional analysis of medical images.

PubMed

Rinewalt, Daniel; Williams, Betsy W; Reeves, Anthony P; Shah, Palmi; Hong, Edward; Mulshine, James L

2015-03-01

Higher resolution medical imaging platforms are rapidly emerging, but there is a challenge in applying these tools in a clinically meaningful way. The purpose of the current study was to evaluate a novel three-dimensional (3D) software imaging environment, known as interactive science publishing (ISP), in appraising 3D computed tomography images and to compare this approach with traditional planar (2D) imaging in a series of lung cancer cases. Twenty-four physician volunteers at different levels of training across multiple specialties were recruited to evaluate eight lung cancer-related clinical vignettes. The volunteers were asked to compare the performance of traditional 2D versus the ISP 3D imaging in assessing different visualization environments for diagnostic and measurement processes and to further evaluate the ISP tool in terms of general satisfaction, usability, and probable applicability. Volunteers were satisfied with both imaging methods; however, the 3D environment had significantly higher ratings. Measurement performance was comparable using both traditional 2D and 3D image evaluation. Physicians not trained in 2D measurement approaches versus those with such training demonstrated better performance with ISP and preferred working in the ISP environment. Recent postgraduates with only modest self-administered training performed equally well on 3D and 2D cases. This suggests that the 3D environment has no reduction in accuracy over the conventional 2D approach, while providing the advantage of a digital environment for cross-disciplinary interaction for shared problem solving. Exploration of more effective, efficient, self-directed training could potentially result in further improvement in image evaluation proficiency and potentially decrease training costs. Copyright © 2015. Published by Elsevier Inc.
Faustmann and the forestry tradition of outcome-based performance measures

Treesearch

Peter J. Ince

1999-01-01

The concept of land expectation value developed by Martin Faustmann may serve as a paradigm for outcome-based performance measures in public forest management if the concept of forest equity value is broadened to include social and environmental benefits and costs, and sustainability. However, anticipation and accurate evaluation of all benefits and costs appears to...
Performance Analysis and Experimental Validation of the Direct Strain Imaging Method

Treesearch

Athanasios Iliopoulos; John G. Michopoulos; John C. Hermanson

2013-01-01

Direct Strain Imaging accomplishes full field measurement of the strain tensor on the surface of a deforming body, by utilizing arbitrarily oriented engineering strain measurements originating from digital imaging. In this paper an evaluation of the methodâs performance with respect to its operating parameter space is presented along with a preliminary...
Measuring Longitudinal Student Performance on Student Learning Outcomes in Sustainability Education

ERIC Educational Resources Information Center

Jarchow, Meghann E.; Formisano, Paul; Nordyke, Shane; Sayre, Matthew

2018-01-01

Purpose: The purpose of this paper is to describe the student learning outcomes (SLOs) for a sustainability major, evaluate faculty incorporation of the SLOs into the courses in the sustainability major curriculum and measure student performance on the SLOs from entry into the major to the senior capstone course. Design/methodology/approach:…
The Accuracy of Aggregate Student Growth Percentiles as Indicators of Educator Performance

ERIC Educational Resources Information Center

Castellano, Katherine E.; McCaffrey, Daniel F.

2017-01-01

Mean or median student growth percentiles (MGPs) are a popular measure of educator performance, but they lack rigorous evaluation. This study investigates the error in MGP due to test score measurement error (ME). Using analytic derivations, we find that errors in the commonly used MGP are correlated with average prior latent achievement: Teachers…
Performance Evaluation and Community Application of Low-Cost Sensors for Ozone and Nitrogen Dioxide

PubMed Central

Duvall, Rachelle M.; Long, Russell W.; Beaver, Melinda R.; Kronmiller, Keith G.; Wheeler, Michael L.; Szykman, James J.

2016-01-01

This study reports on the performance of electrochemical-based low-cost sensors and their use in a community application. CairClip sensors were collocated with federal reference and equivalent methods and operated in a network of sites by citizen scientists (community members) in Houston, Texas and Denver, Colorado, under the umbrella of the NASA-led DISCOVER-AQ Earth Venture Mission. Measurements were focused on ozone (O3) and nitrogen dioxide (NO2). The performance evaluation showed that the CairClip O3/NO2 sensor provided a consistent measurement response to that of reference monitors (r2 = 0.79 in Houston; r2 = 0.72 in Denver) whereas the CairClip NO2 sensor measurements showed no agreement to reference measurements. The CairClip O3/NO2 sensor data from the citizen science sites compared favorably to measurements at nearby reference monitoring sites. This study provides important information on data quality from low-cost sensor technologies and is one of few studies that reports sensor data collected directly by citizen scientists. PMID:27754370
Intra- and Inter-observer Variability of Measurements of the Laxity Index on Stress Radiographs Performed with the Vezzoni-Modified Badertscher Hip Distension Device.

PubMed

Bertal, Mileva; Vezzoni, Aldo; Houdellier, Blandine; Bogaerts, Evelien; Stock, Emmelie; Polis, Ingeborgh; Deforce, Dieter; Saunders, Jimmy H; Broeckx, Bart J G

2018-06-02

To describe and evaluate the accuracy, intra- and inter-observer variability of the laxity index (LI), used to quantify hip laxity on stress radiographs obtained with the Vezzoni-modified Badertscher distension device (VMBDD). Stress radiographs of 10 dogs obtained with the VMBDD were measured three times by an experienced observer. Six participants with different backgrounds (two ECVDI residents, two PhD students, two veterinary assistants) followed a short presentation and performed subsequently the measurements four times in two separate sessions. The effect of self-learning, feedback and specialization on the accuracy of the measurements was assessed. While the intra- and inter-observer variability were in agreement with other studies, the results of the experienced observer indicated that the variability can be very low. Neither feedback nor self-learning improved the results. A high degree of experience in radiographic assessment was not necessary to perform the measurements correctly. As the LI measurements were acceptable after a short presentation, they support the use of VMBDD for a complete and correct in-house evaluation of the hip joint by trained clinicians. However, we propose that, in the context of screening, measurements should be performed by a limited number of experienced examiners, to limit the impact of the inter-observer variability. Schattauer GmbH Stuttgart.
A systematic review finds limited data on measurement properties of instruments measuring outcomes in adult intensive care unit survivors.

PubMed

Robinson, Karen A; Davis, Wesley E; Dinglas, Victor D; Mendez-Tellez, Pedro A; Rabiee, Anahita; Sukrithan, Vineeth; Yalamanchilli, Ramakrishna; Turnbull, Alison E; Needham, Dale M

2017-02-01

There is a growing number of studies evaluating the physical, cognitive, mental health, and health-related quality of life (HRQOL) outcomes of adults surviving critical illness. However, there is little consensus on the most appropriate instruments to measure these outcomes. To inform the development of such consensus, we conducted a systematic review of the performance characteristics of instruments measuring physical, cognitive, mental health, and HRQOL outcomes in adult intensive care unit (ICU) survivors. We searched PubMed, Embase, PsycInfo, Cumulative Index of Nursing and Allied Health Literature, and The Cochrane Library in March 2015. We also conducted manual searches of reference lists of eligible studies and relevant review articles. Two people independently selected studies, completed data abstraction, and assessed the quality of eligible studies using the COnsensus-based Standards for the selection of health Measurement Instruments (COSMIN) initiative checklist. We identified 20 studies which explicitly evaluated measurement properties for 21 different instruments assessing outcomes in ICU survivors. Eleven of the instruments assessed quality of life, with few instruments assessing other domains. Of the nine measurement properties evaluated on the COSMIN checklist, six were assessed in <10% of the evaluations. Overall quality of eligible studies was generally poor to fair based on the COSMIN checklist. Although an increasing number of studies measure physical, cognitive, mental health, and HRQOL outcomes in adult ICU survivors, data on the measurement properties of such instruments are sparse and generally of poor to fair quality. Empirical analyses evaluating the performance of instruments in adult ICU survivors are needed to advance research in this field. Copyright © 2016 Elsevier Inc. All rights reserved.
Evaluating Library Staff: A Performance Appraisal System.

ERIC Educational Resources Information Center

Belcastro, Patricia

This manual provides librarians and library managers with a performance appraisal system that measures staff fairly and objectively and links performance to the goals of the library. The following topics are addressed: (1) identifying expectations for quality service or standards of performance; (2) the importance of a library's code of service,…

Phonological awareness and writing skills in children with Down syndrome.

PubMed

Lavra-Pinto, Bárbara de; Lamprecht, Regina Ritter

2010-01-01

Down syndrome, phonological awareness, writing and working memory. to evaluate the phonological awareness of Brazilian children with Down syndrome; to analyze the relationship between the writing hypothesis and the phonological awareness scores of the participants; to compare the performance of children with Down syndrome to that of children with typical development according to the Phonological Awareness: Tool for sequential evaluation (PHONATSE), using the writing hypothesis as a matching criteria; to verify the correlation between the phonological awareness measurements and the phonological working memory. a group of eleven children aged between 7 and 14 years (average: 9 y 10 m) was selected for the study. Phonological awareness was evaluated using the PHONATSE. The phonological working memory was evaluated through an instrument developed by the researcher. all subjects presented measurable levels of phonological awareness through the PHONATSE. The phonological awareness scores and the writing hypothesis presented a significant positive association. The performance of children with Down syndrome was significantly lower than children with typical development who presented the same writing hypothesis. Measurements of phonological awareness and phonological working memory presented significant positive correlations. the phonological awareness of Brazilian children with Down syndrome can be evaluated through the PHONATSE. Syllable awareness improves with literacy, whereas phonemic awareness seems to result from written language learning. The phonological working memory influences the performance of children with Down syndrome in phonological awareness tasks.
The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets.

PubMed

Saito, Takaya; Rehmsmeier, Marc

2015-01-01

Binary classifiers are routinely evaluated with performance measures such as sensitivity and specificity, and performance is frequently illustrated with Receiver Operating Characteristics (ROC) plots. Alternative measures such as positive predictive value (PPV) and the associated Precision/Recall (PRC) plots are used less frequently. Many bioinformatics studies develop and evaluate classifiers that are to be applied to strongly imbalanced datasets in which the number of negatives outweighs the number of positives significantly. While ROC plots are visually appealing and provide an overview of a classifier's performance across a wide range of specificities, one can ask whether ROC plots could be misleading when applied in imbalanced classification scenarios. We show here that the visual interpretability of ROC plots in the context of imbalanced datasets can be deceptive with respect to conclusions about the reliability of classification performance, owing to an intuitive but wrong interpretation of specificity. PRC plots, on the other hand, can provide the viewer with an accurate prediction of future classification performance due to the fact that they evaluate the fraction of true positives among positive predictions. Our findings have potential implications for the interpretation of a large number of studies that use ROC plots on imbalanced datasets.
Assessing Therapist Competence: Development of a Performance-Based Measure and Its Comparison With a Web-Based Measure.

PubMed

Cooper, Zafra; Doll, Helen; Bailey-Straebler, Suzanne; Bohn, Kristin; de Vries, Dian; Murphy, Rebecca; O'Connor, Marianne E; Fairburn, Christopher G

2017-10-31

Recent research interest in how best to train therapists to deliver psychological treatments has highlighted the need for rigorous, but scalable, means of measuring therapist competence. There are at least two components involved in assessing therapist competence: the assessment of their knowledge of the treatment concerned, including how and when to use its strategies and procedures, and an evaluation of their ability to apply such knowledge skillfully in practice. While the assessment of therapists' knowledge has the potential to be completed efficiently on the Web, the assessment of skill has generally involved a labor-intensive process carried out by clinicians, and as such, may not be suitable for assessing training outcome in certain circumstances. The aims of this study were to develop and evaluate a role-play-based measure of skill suitable for assessing training outcome and to compare its performance with a highly scalable Web-based measure of applied knowledge. Using enhanced cognitive behavioral therapy (CBT-E) for eating disorders as an exemplar, clinical scenarios for role-play assessment were developed and piloted together with a rating scheme for assessing trainee therapists' performance. These scenarios were evaluated by examining the performance of 93 therapists from different professional backgrounds and at different levels of training in implementing CBT-E. These therapists also completed a previously developed Web-based measure of applied knowledge, and the ability of the Web-based measure to efficiently predict competence on the role-play measure was investigated. The role-play measure assessed performance at implementing a range of CBT-E procedures. The majority of the therapists rated their performance as moderately or closely resembling their usual clinical performance. Trained raters were able to achieve good-to-excellent reliability for averaged competence, with intraclass correlation coefficients ranging from .653 to 909. The measure was also sensitive to change, with scores being significantly higher after training than before as might be expected (mean difference 0.758, P<.001) even when taking account of repeated data (mean difference 0.667, P<.001). The major shortcoming of the role-play measure was that it required considerable time and resources. This shortcoming is inherent in the method. Given this, of most interest for assessing training outcome, scores on the Web-based measure efficiently predicted therapist competence, as judged by the role-play measure (with the Web-based measure having a positive predictive value of 77% and specificity of 78%). The results of this study suggest that while it was feasible and acceptable to assess performance using the newly developed role-play measure, the highly scalable Web-based measure could be used in certain circumstances as a substitute for the more labor-intensive, and hence, more costly role-play method. ©Zafra Cooper, Helen Doll, Suzanne Bailey-Straebler, Kristin Bohn, Dian de Vries, Rebecca Murphy, Marianne E O'Connor, Christopher G Fairburn. Originally published in JMIR Mental Health (http://mental.jmir.org), 31.10.2017.
Impact of an Activity-Based Program on Health, Quality of Life, and Occupational Performance of Women Diagnosed With Cancer.

PubMed

Maher, Colleen; Mendonca, Rochelle J

We evaluated the impact of a 1-wk activity program on the health, quality of life (QOL), and occupational performance of community-living women diagnosed with cancer. A one-group pretest-posttest repeated-measures design was used. Participants completed a functional health measure (36-Item Short Form Health Survey [SF-36]), a QOL measure (World Health Organization Quality of Life-Brief version [WHOQOL-BREF]), and an occupational performance and satisfaction measure (Canadian Occupational Performance Measure [COPM]) before and 6 wk after program completion. The COPM was also administered on Day 5. Paired t tests for the SF-36 and WHOQOL-BREF showed no significant differences, except for the WHOQOL-BREF's Social Relationships subscale (p < .008). Repeated-measures analyses of variance showed a significant difference in COPM performance and satisfaction scores (p < .001). The activity program effectively improved occupational performance and satisfaction and social relationships of community-living women diagnosed with cancer. Copyright © 2018 by the American Occupational Therapy Association, Inc.
Performance measures for lower gastrointestinal endoscopy: a European Society of Gastrointestinal Endoscopy (ESGE) quality improvement initiative

PubMed Central

Thomas-Gibson, Siwan; Bugajski, Marek; Bretthauer, Michael; Rees, Colin J; Dekker, Evelien; Hoff, Geir; Jover, Rodrigo; Suchanek, Stepan; Ferlitsch, Monika; Anderson, John; Roesch, Thomas; Hultcranz, Rolf; Racz, Istvan; Kuipers, Ernst J; Garborg, Kjetil; East, James E; Rupinski, Maciej; Seip, Birgitte; Bennett, Cathy; Senore, Carlo; Minozzi, Silvia; Bisschops, Raf; Domagk, Dirk; Valori, Roland; Spada, Cristiano; Hassan, Cesare; Dinis-Ribeiro, Mario; Rutter, Matthew D

2017-01-01

The European Society of Gastrointestinal Endoscopy and United European Gastroenterology present a short list of key performance measures for lower gastrointestinal endoscopy. We recommend that endoscopy services across Europe adopt the following seven key performance measures for lower gastrointestinal endoscopy for measurement and evaluation in daily practice at a center and endoscopist level: 1 rate of adequate bowel preparation (minimum standard 90%); 2 cecal intubation rate (minimum standard 90%); 3 adenoma detection rate (minimum standard 25%); 4 appropriate polypectomy technique (minimum standard 80%); 5 complication rate (minimum standard not set); 6 patient experience (minimum standard not set); 7 appropriate post-polypectomy surveillance recommendations (minimum standard not set). Other identified performance measures have been listed as less relevant based on an assessment of their importance, scientific acceptability, feasibility, usability, and comparison to competing measures. PMID:28507745
Physical evaluation of color and monochrome medical displays using an imaging colorimeter

NASA Astrophysics Data System (ADS)

Roehrig, Hans; Gu, Xiliang; Fan, Jiahua

2013-03-01

This paper presents an approach to physical evaluation of color and monochrome medical grade displays using an imaging colorimeter. The purpose of this study was to examine the influence of medical display types, monochrome or color at the same maximum luminance settings, on diagnostic performance. The focus was on the measurements of physical characteristics including spatial resolution and noise performance, which we believed could affect the clinical performance. Specifically, Modulation Transfer Function (MTF) and Noise Power Spectrum (NPS) were evaluated and compared at different digital driving levels (DDL) between two EIZO displays.
Effects of Newly Designed Hospital Buildings on Staff Perceptions: A Pre-Post Study to Validate Design Decisions.

PubMed

Schreuder, Eliane; van Heel, Liesbeth; Goedhart, Rien; Dusseldorp, Elise; Schraagen, Jan Maarten; Burdorf, Alex

2015-01-01

This study investigates effects of the newly built nonpatient-related buildings of a large university medical center on staff perceptions and whether the design objectives were achieved. The medical center is gradually renewing its hospital building area of 200,000 m.(2) This redevelopment is carefully planned and because lessons learned can guide design decisions of the next phase, the medical center is keen to evaluate the performance of the new buildings. A pre- and post-study with a control group was conducted. Prior to the move to the new buildings an occupancy evaluation was carried out in the old setting (n = 729) (pre-study). Post occupation of the new buildings another occupancy evaluation (post-study) was carried out in the new setting (intervention group) and again in some old settings (control group) (n = 664). The occupancy evaluation consisted of an online survey that measured the perceived performance of different aspects of the building. Longitudinal multilevel analysis was used to compare the performance of the old buildings with the new buildings. Significant improvements were found in indoor climate, perceived safety, working environment, well-being, facilities, sustainability, and overall satisfaction. Commitment to the employer, working atmosphere, orientation, work performance, and knowledge sharing did not improve. The results were interpreted by relating them to specific design choices. We showed that it is possible to measure the performance improvements of a complex intervention being a new building design and validate design decisions. A focused design process aiming for a safe, pleasant and sustainable building resulted in actual improvements in some of the related performance measures. © The Author(s) 2015.
Investigation of the reproducibility and reliability of sagittal vertebral inclination measurements from MR images of the spine.

PubMed

Vrtovec, Tomaž; Pernuš, Franjo; Likar, Boštjan

2014-10-01

In this study, sagittal vertebral inclination (SVI) was systematically evaluated for 28 vertebrae (segments between T4 and L5) in magnetic resonance (MR) images of one normal and one scoliotic subject to compare the performance of manual and computerized measurements, and identify the most reproducible and reliable measurements. Manual measurements were performed by three observers, who identified on two occasions the distinctive anatomical landmarks required to evaluate SVI by six measurement methods, i.e. the superior tangents, inferior tangents, anterior tangents, posterior tangents, mid-endplate lines and mid-wall lines. Computerized measurements were performed by automatically evaluating SVI from the symmetry of vertebral anatomical structures in two-dimensional (2D) sagittal cross-sections and in three-dimensional (3D) volumetric images. The mid-wall lines and posterior tangents proved to be the manual measurements with the lowest intra-observer (standard deviation, SD, of 1.4° and 1.7°, respectively) and inter-observer variability (SD of 1.9° and 2.4°, respectively). The strongest inter-method agreement was found between the mid-wall lines and posterior tangents (SD of 2.0°). Computerized measurements in 2D and in 3D resulted in intra-observer (SD of 2.8° and 3.1°, respectively) and inter-observer variability (SD of 3.8° and 5.2°, respectively) that were comparable to those of the superior tangents (SD of 2.6° and 3.7°) and inferior tangents (SD of 3.2° and 4.5°), which represent standard Cobb angle measurements. It can be concluded that computerized measurements of SVI should be based on the inclination of vertebral body walls. Copyright © 2014 Elsevier Ltd. All rights reserved.
Data analysis techniques used at the Oak Ridge Y-12 plant flywheel evaluation laboratory

NASA Astrophysics Data System (ADS)

Steels, R. S., Jr.; Babelay, E. F., Jr.

1980-07-01

Some of the more advanced data analysis techniques applied to the problem of experimentally evaluating the performance of high performance composite flywheels are presented. Real time applications include polar plots of runout with interruptions relating to balance and relative motions between parts, radial growth measurements, and temperature of the spinning part. The technique used to measure torque applied to a containment housing during flywheel failure is also presented. The discussion of pre and post test analysis techniques includes resonant frequency determination with modal analysis, waterfall charts, and runout signals at failure.
Description and Evaluation of a Measurement Technique for Assessment of Performing Gender

PubMed Central

Harris, Kathleen Mullan; Halpern, Carolyn Tucker

2016-01-01

The influence of masculinity and femininity on behaviors and outcomes has been extensively studied in social science research using various measurement strategies. In the present paper, we describe and evaluate a measurement technique that uses existing survey items to capture the extent to which an individual behaves similarly to their same-gender peers. We use data from the first four waves of The National Longitudinal Study of Adolescent to Adult Health (Add Health), a nationally representative sample of adolescents (age 12–18) in the United States who were re-interviewed at ages 13–19, 18–26, and 24–32. We estimate split-half reliability and provide evidence that supports the validity of this measurement technique. We demonstrate that the resulting measure does not perform as a trait measure and is associated with involvement in violent fights, a pattern consistent with theory and empirical findings. This measurement technique represents a novel approach for gender researchers with the potential for expanding our current knowledge base. PMID:28630528
Pre-Service Identification of Talented Teachers through Non-Traditional Measures: A Study of the Role of Affective Variables as Predictors of Success in Student Teaching.

ERIC Educational Resources Information Center

Basom, Margaret; And Others

1994-01-01

Researchers examined relationships between the SRI Gallup Pre-Professional Teacher Interview and performance-based student teaching evaluations and between SRI Interview and California Student Achievement Test (CAT) scores. A relationship between SRI Interview scores and performance-based student teaching evaluations surfaces. CAT scores did not…
Indicators of Program Quality, Measures of Performance & Standards. Adult Basic Education and ESL Programs in NJ. Summary Report.

ERIC Educational Resources Information Center

Merkel-Keller, Claudia; Streeter-Scrupski, Sandra

In 1992, adult education staff and adult literacy volunteer organizations developed 8 indicators of program quality to be used for evaluating adult basic education and English as a Second Language (ESL) programs in New Jersey. Performance standards were developed to match the standards. An evaluation was conducted to determine how the indicators…
LABORATORY EVALUATION OF SIX NEW/MODIFIED PORTABLE X-RAY FLUORESCENCE SPECTROMETERS FOR THE MEASUREMENT OF LEAD IN CHARACTERIZED PAINT FILMS AND RESEARCH MATERIAL BOARDS (TECHNICAL REPORT)

EPA Science Inventory

A laboratory study was performed in 1994-1995 to identify and estimate the influence of key characteristics for evaluating the performance of portable X-ray fluorescence (XRF) spectrometers. Six new/modified spectrometers, including HNU SEFA-Pb, Metorex X-MET, Niton X-L, Radiat...
LABORATORY EVALUATION OF SIX NEW/MODIFIED PORTABLE X-RAY FLUORESCENCE SPECTROMETERS FOR THE MEASUREMENT OF LEAD IN CHARACTERIZED PAINT FILMS AND RESEARCH MATERIAL BOARDS (APPENDICES)

EPA Science Inventory

A laboratory study was performed in 1994-1995 to identify and estimate the influence of key characteristics for evaluating the performance of portable X-ray fluorescence (XRF) spectrometers. Six new/modified spectrometers, including HNU SEFA-Pb, Metorex X-MET, Niton X-L, Radiat...
Development and Performance Evaluation of Image-Based Robotic Waxing System for Detailing Automobiles

PubMed Central

Hsu, Bing-Cheng

2018-01-01

Waxing is an important aspect of automobile detailing, aimed at protecting the finish of the car and preventing rust. At present, this delicate work is conducted manually due to the need for iterative adjustments to achieve acceptable quality. This paper presents a robotic waxing system in which surface images are used to evaluate the quality of the finish. An RGB-D camera is used to build a point cloud that details the sheet metal components to enable path planning for a robot manipulator. The robot is equipped with a multi-axis force sensor to measure and control the forces involved in the application and buffing of wax. Images of sheet metal components that were waxed by experienced car detailers were analyzed using image processing algorithms. A Gaussian distribution function and its parameterized values were obtained from the images for use as a performance criterion in evaluating the quality of surfaces prepared by the robotic waxing system. Waxing force and dwell time were optimized using a mathematical model based on the image-based criterion used to measure waxing performance. Experimental results demonstrate the feasibility of the proposed robotic waxing system and image-based performance evaluation scheme. PMID:29757940
Development and Performance Evaluation of Image-Based Robotic Waxing System for Detailing Automobiles.

PubMed

Lin, Chi-Ying; Hsu, Bing-Cheng

2018-05-14

Waxing is an important aspect of automobile detailing, aimed at protecting the finish of the car and preventing rust. At present, this delicate work is conducted manually due to the need for iterative adjustments to achieve acceptable quality. This paper presents a robotic waxing system in which surface images are used to evaluate the quality of the finish. An RGB-D camera is used to build a point cloud that details the sheet metal components to enable path planning for a robot manipulator. The robot is equipped with a multi-axis force sensor to measure and control the forces involved in the application and buffing of wax. Images of sheet metal components that were waxed by experienced car detailers were analyzed using image processing algorithms. A Gaussian distribution function and its parameterized values were obtained from the images for use as a performance criterion in evaluating the quality of surfaces prepared by the robotic waxing system. Waxing force and dwell time were optimized using a mathematical model based on the image-based criterion used to measure waxing performance. Experimental results demonstrate the feasibility of the proposed robotic waxing system and image-based performance evaluation scheme.
Uncertainty evaluation of thickness and warp of a silicon wafer measured by a spectrally resolved interferometer

NASA Astrophysics Data System (ADS)

Praba Drijarkara, Agustinus; Gergiso Gebrie, Tadesse; Lee, Jae Yong; Kang, Chu-Shik

2018-06-01

Evaluation of uncertainty of thickness and gravity-compensated warp of a silicon wafer measured by a spectrally resolved interferometer is presented. The evaluation is performed in a rigorous manner, by analysing the propagation of uncertainty from the input quantities through all the steps of measurement functions, in accordance with the ISO Guide to the Expression of Uncertainty in Measurement. In the evaluation, correlation between input quantities as well as uncertainty attributed to thermal effect, which were not included in earlier publications, are taken into account. The temperature dependence of the group refractive index of silicon was found to be nonlinear and varies widely within a wafer and also between different wafers. The uncertainty evaluation described here can be applied to other spectral interferometry applications based on similar principles.
Measured effects of coolant injection on the performance of a film cooled turbine

NASA Technical Reports Server (NTRS)

Mcdonel, J. D.; Eiswerth, J. E.

1977-01-01

Tests have been conducted on a 20-inch diameter single-stage air-cooled turbine designed to evaluate the effects of film cooling air on turbine aerodynamic performance. The present paper reports the results of five test configurations, including two different cooling designs and three combinations of cooled and solid airfoils. A comparison is made of the experimental results with a previously published analytical method of evaluating coolant injection effects on turbine performance.
Performance Evaluation of Glottal Inverse Filtering Algorithms Using a Physiologically Based Articulatory Speech Synthesizer

DTIC Science & Technology

2017-01-05

1 Performance Evaluation of Glottal Inverse Filtering Algorithms Using a Physiologically Based Articulatory Speech Synthesizer Yu-Ren Chien, Daryush...D. Mehta, Member, IEEE, Jón Guðnason, Matías Zañartu, Member, IEEE, and Thomas F. Quatieri, Fellow, IEEE Abstract—Glottal inverse filtering aims to...of inverse filtering performance has been challenging due to the practical difficulty in measuring the true glottal signals while speech signals are
40 CFR 125.98 - As the Director, what must I do to comply with the requirements of this subpart?

Code of Federal Regulations, 2011 CFR

2011-07-01

... technologies, operational measures, or restoration measures should be included in the permit to meet the... evaluate the performance of the design and construction technologies, operational measures, and/or... construction technologies, operational measure, and/or restoration measures, and/or improved operation and...

An experimental investigation of multi-element airfoil ice accretion and resulting performance degradation

NASA Technical Reports Server (NTRS)

Potapczuk, Mark G.; Berkowitz, Brian M.

1989-01-01

An investigation of the ice accretion pattern and performance characteristics of a multi-element airfoil was undertaken in the NASA Lewis 6- by 9-Foot Icing Research Tunnel. Several configurations of main airfoil, slat, and flaps were employed to examine the effects of ice accretion and provide further experimental information for code validation purposes. The text matrix consisted of glaze, rime, and mixed icing conditions. Airflow and icing cloud conditions were set to correspond to those typical of the operating environment anticipated tor a commercial transport vehicle. Results obtained included ice profile tracings, photographs of the ice accretions, and force balance measurements obtained both during the accretion process and in a post-accretion evaluation over a range of angles of attack. The tracings and photographs indicated significant accretions on the slat leading edge, in gaps between slat or flaps and the main wing, on the flap leading-edge surfaces, and on flap lower surfaces. Force measurments indicate the possibility of severe performance degradation, especially near C sub Lmax, for both light and heavy ice accretion and performance analysis codes presently in use. The LEWICE code was used to evaluate the ice accretion shape developed during one of the rime ice tests. The actual ice shape was then evaluated, using a Navier-Strokes code, for changes in performance characteristics. These predicted results were compared to the measured results and indicate very good agreement.
The partial coherence modulation transfer function in testing lithography lens

NASA Astrophysics Data System (ADS)

Huang, Jiun-Woei

2018-03-01

Due to the lithography demanding high performance in projection of semiconductor mask to wafer, the lens has to be almost free in spherical and coma aberration, thus, in situ optical testing for diagnosis of lens performance has to be established to verify the performance and to provide the suggesting for further improvement of the lens, before the lens has been build and integrated with light source. The measurement of modulation transfer function of critical dimension (CD) is main performance parameter to evaluate the line width of semiconductor platform fabricating ability for the smallest line width of producing tiny integrated circuits. Although the modulation transfer function (MTF) has been popularly used to evaluation the optical system, but in lithography, the contrast of each line-pair is in one dimension or two dimensions, analytically, while the lens stand along in the test bench integrated with the light source coherent or near coherent for the small dimension near the optical diffraction limit, the MTF is not only contributed by the lens, also by illumination of platform. In the study, the partial coherence modulation transfer function (PCMTF) for testing a lithography lens is suggested by measuring MTF in the high spatial frequency of in situ lithography lens, blended with the illumination of partial and in coherent light source. PCMTF can be one of measurement to evaluate the imperfect lens of lithography lens for further improvement in lens performance.
Measurement of Dam Deformations: Case Study of Obruk Dam (Turkey)

NASA Astrophysics Data System (ADS)

Gulal, V. Engin; Alkan, R. Metin; Alkan, M. Nurullah; İlci, Veli; Ozulu, I. Murat; Tombus, F. Engin; Kose, Zafer; Aladogan, Kayhan; Sahin, Murat; Yavasoglu, Hakan; Oku, Guldane

2016-04-01

In the literature, there is information regarding the first deformation and displacement measurements in dams that were conducted in 1920s Switzerland. Todays, deformation measurements in the dams have gained very different functions with improvements in both measurement equipment and evaluation of measurements. Deformation measurements and analysis are among the main topics studied by scientists who take interest in the engineering measurement sciences. The Working group of Deformation Measurements and Analysis, which was established under the International Federation of Surveyors (FIG), carries out its studies and activities with regard to this subject. At the end of the 1970s, the subject of the determination of fixed points in the deformation monitoring network was one of the main subjects extensively studied. Many theories arose from this inquiry, as different institutes came to differing conclusions. In 1978, a special commission with representatives of universities has been established within the FIG 6.1 working group; this commission worked on the issue of determining a general approach to geometric deformation analysis. The results gleaned from the commission were discussed at symposiums organized by the FIG. In accordance with these studies, scientists interested in the subject have begun to work on models that investigate cause and effect relations between the effects that cause deformation and deformation. As of the scientist who interest with the issue focused on different deformation methods, another special commission was established within the FIG engineering measurements commission in order to classify deformation models and study terminology. After studying this material for a long time, the official commission report was published in 2001. In this prepared report, studies have been carried out by considering the FIG Engineering Surveying Commission's report entitled, 'MODELS AND TERMINOLOGY FOR THE ANALYSIS OF GEODETIC MONITORING OBSERVATIONS'. In October of 2015, geodetic deformation measurements were conducted by considering FIG reports related to deformation measurements and German DIN 18710 Engineering Measurements norms in the Çorum province of Turkey. The main purpose of the study is to determine optimum measurement and evaluation methods that will be used to specify movements in the horizontal and vertical directions for the fill dam. For this purpose; • In reference networks consisting of 8 points, measurements were performed by using long-term dual-frequency GNSS receivers for duration of 8 hours. • GNSS measurements were conducted in varying times between 30 minutes and 120 minutes at the 44 units object points on the body of the dam. • Two repetitive measurements of real time kinematic (RTK) GNSS were conducted at the object points on dam. • Geometric leveling measurements were performed between reference and object points. • Trigonometric leveling measurements were performed between reference and object points. • Polar measurements were performed between references and object points. GNSS measurements performed at reference points of the monitoring network for 8 hours have been evaluated by using GAMIT software in accordance with the IGS points in the region. In this manner, regional and local movements in the network can be determined. It is aimed to determine measurement period which will provide 1-2mm accuracy that expected in local GNSS network by evaluating GNSS measurements performed on body of dam. Results will be compared by offsetting GNSS and terrestrial measurements. This study will investigate whether or not there is increased accuracy provided by GNSS measurements carried out among reference points without the possibility of vision.
Prediction and Stability of Mathematics Skill and Difficulty

PubMed Central

Martin, Rebecca B.; Cirino, Paul T.; Barnes, Marcia A.; Ewing-Cobbs, Linda; Fuchs, Lynn S.; Stuebing, Karla K.; Fletcher, Jack M.

2016-01-01

The present study evaluated the stability of math learning difficulties over a 2-year period and investigated several factors that might influence this stability (categorical vs. continuous change, liberal vs. conservative cut point, broad vs. specific math assessment); the prediction of math performance over time and by performance level was also evaluated. Participants were 144 students initially identified as having a math difficulty (MD) or no learning difficulty according to low achievement criteria in the spring of Grade 3 or Grade 4. Students were reassessed 2 years later. For both measure types, a similar proportion of students changed whether assessed categorically or continuously. However, categorical change was heavily dependent on distance from the cut point and so more common for MD, who started closer to the cut point; reliable change index change was more similar across groups. There were few differences with regard to severity level of MD on continuous metrics or in terms of prediction. Final math performance on a broad computation measure was predicted by behavioral inattention and working memory while considering initial performance; for a specific fluency measure, working memory was not uniquely related, and behavioral inattention more variably related to final performance, again while considering initial performance. PMID:22392890
Prediction and stability of mathematics skill and difficulty.

PubMed

Martin, Rebecca B; Cirino, Paul T; Barnes, Marcia A; Ewing-Cobbs, Linda; Fuchs, Lynn S; Stuebing, Karla K; Fletcher, Jack M

2013-01-01

The present study evaluated the stability of math learning difficulties over a 2-year period and investigated several factors that might influence this stability (categorical vs. continuous change, liberal vs. conservative cut point, broad vs. specific math assessment); the prediction of math performance over time and by performance level was also evaluated. Participants were 144 students initially identified as having a math difficulty (MD) or no learning difficulty according to low achievement criteria in the spring of Grade 3 or Grade 4. Students were reassessed 2 years later. For both measure types, a similar proportion of students changed whether assessed categorically or continuously. However, categorical change was heavily dependent on distance from the cut point and so more common for MD, who started closer to the cut point; reliable change index change was more similar across groups. There were few differences with regard to severity level of MD on continuous metrics or in terms of prediction. Final math performance on a broad computation measure was predicted by behavioral inattention and working memory while considering initial performance; for a specific fluency measure, working memory was not uniquely related, and behavioral inattention more variably related to final performance, again while considering initial performance.
Study on the application of ambient vibration tests to evaluate the effectiveness of seismic retrofitting

NASA Astrophysics Data System (ADS)

Liang, Li; Takaaki, Ohkubo; Guang-hui, Li

2018-03-01

In recent years, earthquakes have occurred frequently, and the seismic performance of existing school buildings has become particularly important. The main method for improving the seismic resistance of existing buildings is reinforcement. However, there are few effective methods to evaluate the effect of reinforcement. Ambient vibration measurement experiments were conducted before and after seismic retrofitting using wireless measurement system and the changes of vibration characteristics were compared. The changes of acceleration response spectrum, natural periods and vibration modes indicate that the wireless vibration measurement system can be effectively applied to evaluate the effect of seismic retrofitting. The method can evaluate the effect of seismic retrofitting qualitatively, it is difficult to evaluate the effect of seismic retrofitting quantitatively at this stage.
Reliable and valid tools for measuring surgeons' teaching performance: residents' vs. self evaluation.

PubMed

Boerebach, Benjamin C M; Arah, Onyebuchi A; Busch, Olivier R C; Lombarts, Kiki M J M H

2012-01-01

In surgical education, there is a need for educational performance evaluation tools that yield reliable and valid data. This paper describes the development and validation of robust evaluation tools that provide surgeons with insight into their clinical teaching performance. We investigated (1) the reliability and validity of 2 tools for evaluating the teaching performance of attending surgeons in residency training programs, and (2) whether surgeons' self evaluation correlated with the residents' evaluation of those surgeons. We surveyed 343 surgeons and 320 residents as part of a multicenter prospective cohort study of faculty teaching performance in residency training programs. The reliability and validity of the SETQ (System for Evaluation Teaching Qualities) tools were studied using standard psychometric techniques. We then estimated the correlations between residents' and surgeons' evaluations. The response rate was 87% among surgeons and 84% among residents, yielding 2625 residents' evaluations and 302 self evaluations. The SETQ tools yielded reliable and valid data on 5 domains of surgical teaching performance, namely, learning climate, professional attitude towards residents, communication of goals, evaluation of residents, and feedback. The correlations between surgeons' self and residents' evaluations were low, with coefficients ranging from 0.03 for evaluation of residents to 0.18 for communication of goals. The SETQ tools for the evaluation of surgeons' teaching performance appear to yield reliable and valid data. The lack of strong correlations between surgeons' self and residents' evaluations suggest the need for using external feedback sources in informed self evaluation of surgeons. Copyright © 2012 Association of Program Directors in Surgery. Published by Elsevier Inc. All rights reserved.
Ground truth data for test sites (SL-4). [thermal radiation brightness temperature and solar radiation measurments

NASA Technical Reports Server (NTRS)

1974-01-01

Field measurements performed simultaneous with Skylab overpass in order to provide comparative calibration and performance evaluation measurements for the EREP sensors are presented. Wavelength region covered include: solar radiation (400 to 1300 nanometer), and thermal radiation (8 to 14 micrometer). Measurements consisted of general conditions and near surface meteorology, atmospheric temperature and humidity vs altitude, the thermal brightness temperature, total and diffuse solar radiation, direct solar radiation (subsequently analyzed for optical depth/transmittance), and target reflectivity/radiance. The particular instruments used are discussed along with analyses performed. Detailed instrument operation, calibrations, techniques, and errors are given.
NASA Boeing 737 Aircraft Test Results from 1996 Joint Winter Runway Friction Measurement Program

NASA Technical Reports Server (NTRS)

Yager, Thomas J.

1996-01-01

A description of the joint test program objectives and scope is given together with the performance capability of the NASA Langley B-737 instrumented aircraft. The B-737 test run matrix conducted during the first 8 months of this 5-year program is discussed with a description of the different runway conditions evaluated. Some preliminary test results are discussed concerning the Electronic Recording Decelerometer (ERD) readings and a comparison of B-737 aircraft braking performance for different winter runway conditions. Detailed aircraft parameter time history records, analysis of ground vehicle friction measurements and harmonization with aircraft braking performance, assessment of induced aircraft contaminant drag, and evaluation of the effects of other factors on aircraft/ground vehicle friction performance will be documented in a NASA Technical Report which is being prepared for publication next year.
A comparison of spectral decorrelation techniques and performance evaluation metrics for a wavelet-based, multispectral data compression algorithm

NASA Technical Reports Server (NTRS)

Matic, Roy M.; Mosley, Judith I.

1994-01-01

Future space-based, remote sensing systems will have data transmission requirements that exceed available downlinks necessitating the use of lossy compression techniques for multispectral data. In this paper, we describe several algorithms for lossy compression of multispectral data which combine spectral decorrelation techniques with an adaptive, wavelet-based, image compression algorithm to exploit both spectral and spatial correlation. We compare the performance of several different spectral decorrelation techniques including wavelet transformation in the spectral dimension. The performance of each technique is evaluated at compression ratios ranging from 4:1 to 16:1. Performance measures used are visual examination, conventional distortion measures, and multispectral classification results. We also introduce a family of distortion metrics that are designed to quantify and predict the effect of compression artifacts on multi spectral classification of the reconstructed data.
Segmentation quality evaluation using region-based precision and recall measures for remote sensing images

NASA Astrophysics Data System (ADS)

Zhang, Xueliang; Feng, Xuezhi; Xiao, Pengfeng; He, Guangjun; Zhu, Liujun

2015-04-01

Segmentation of remote sensing images is a critical step in geographic object-based image analysis. Evaluating the performance of segmentation algorithms is essential to identify effective segmentation methods and optimize their parameters. In this study, we propose region-based precision and recall measures and use them to compare two image partitions for the purpose of evaluating segmentation quality. The two measures are calculated based on region overlapping and presented as a point or a curve in a precision-recall space, which can indicate segmentation quality in both geometric and arithmetic respects. Furthermore, the precision and recall measures are combined by using four different methods. We examine and compare the effectiveness of the combined indicators through geometric illustration, in an effort to reveal segmentation quality clearly and capture the trade-off between the two measures. In the experiments, we adopted the multiresolution segmentation (MRS) method for evaluation. The proposed measures are compared with four existing discrepancy measures to further confirm their capabilities. Finally, we suggest using a combination of the region-based precision-recall curve and the F-measure for supervised segmentation evaluation.
Modified Universal Design Survey: Enhancing Operability of Launch Vehicle Ground Crew Worksites

NASA Technical Reports Server (NTRS)

Blume, Jennifer L.

2010-01-01

Operability is a driving requirement for next generation space launch vehicles. Launch site ground operations include numerous operator tasks to prepare the vehicle for launch or to perform preflight maintenance. Ensuring that components requiring operator interaction at the launch site are designed for optimal human use is a high priority for operability. To promote operability, a Design Quality Evaluation Survey based on Universal Design framework was developed to support Human Factors Engineering (HFE) evaluation for NASA s launch vehicles. Universal Design per se is not a priority for launch vehicle processing however; applying principles of Universal Design will increase the probability of an error free and efficient design which promotes operability. The Design Quality Evaluation Survey incorporates and tailors the seven Universal Design Principles and adds new measures for Safety and Efficiency. Adapting an approach proven to measure Universal Design Performance in Product, each principle is associated with multiple performance measures which are rated with the degree to which the statement is true. The Design Quality Evaluation Survey was employed for several launch vehicle ground processing worksite analyses. The tool was found to be most useful for comparative judgments as opposed to an assessment of a single design option. It provided a useful piece of additional data when assessing possible operator interfaces or worksites for operability.
Smart ECG Monitoring Patch with Built-in R-Peak Detection for Long-Term HRV Analysis.

PubMed

Lee, W K; Yoon, H; Park, K S

2016-07-01

Since heart rate variability (HRV) analysis is widely used to evaluate the physiological status of the human body, devices specifically designed for such applications are needed. To this end, we developed a smart electrocardiography (ECG) patch. The smart patch measures ECG using three electrodes integrated into the patch, filters the measured signals to minimize noise, performs analog-to-digital conversion, and detects R-peaks. The measured raw ECG data and the interval between the detected R-peaks can be recorded to enable long-term HRV analysis. Experiments were performed to evaluate the performance of the built-in R-wave detection, robustness of the device under motion, and applicability to the evaluation of mental stress. The R-peak detection results obtained with the device exhibited a sensitivity of 99.29%, a positive predictive value of 100.00%, and an error of 0.71%. The device also exhibited less motional noise than conventional ECG recording, being stable up to a walking speed of 5 km/h. When applied to mental stress analysis, the device evaluated the variation in HRV parameters in the same way as a normal ECG, with very little difference. This device can help users better understand their state of health and provide physicians with more reliable data for objective diagnosis.
Handwriting features of children with developmental coordination disorder--results of triangular evaluation.

PubMed

Rosenblum, Sara; Margieh, Jumana Aassy; Engel-Yeger, Batya

2013-11-01

Developmental coordination disorders (DCD) is one of the most common disorders affecting school-aged children. The study aimed to characterize the handwriting performance of children with DCD who write in Arabic, based on triangular evaluation. Participants included 58 children aged 11-12 years, 29 diagnosed with DCD based on the DSM-IV criteria and the M-ABC, and 29 matched typically developed controls. Children were asked to copy a paragraph on a sheet of paper affixed to a digitizer supplying objective measures of the handwriting process. The handwriting proficiency screening questionnaire (HPSQ) was completed by their teachers while observing their performance and followed by evaluation of their final written product. Results indicated that compared to controls, children with DCD required significantly more on-paper and in-air time per stroke while copying. In addition, global legibility, unrecognizable letters and spatial arrangement measures of their written product were significantly inferior. Significant group differences were also found between the HPSQ subscales scores. Furthermore, 82.8% of all participants were correctly classified into groups based on one discriminate function which included two handwriting performance measures. These study results strongly propose application of triangular standardized evaluation to receive better insight of handwriting deficit features of individual children with DCD who write in Arabic. Copyright © 2013 Elsevier Ltd. All rights reserved.
Benchmarking. Issues in the Design and Implementation of a Benchmarking System for Employment and Training Programs for Young People.

ERIC Educational Resources Information Center

Coughlin, David C.; Bielen, Rhonda P.

This paper has been prepared to assist the United States Department of Labor to explore new approaches to evaluating and measuring the performance of employment and training activities for youth. As one of several tools for evaluating success of local youth training programs, "benchmarking" provides a system for measuring the development…
Bibliometrics as a Performance Measurement Tool for Research Evaluation: The Case of Research Funded by the National Cancer Institute of Canada

ERIC Educational Resources Information Center

Campbell, David; Picard-Aitken, Michelle; Cote, Gregoire; Caruso, Julie; Valentim, Rodolfo; Edmonds, Stuart; Williams, Gregory Thomas; Macaluso, Benoit; Robitaille, Jean-Pierre; Bastien, Nicolas; Laframboise, Marie-Claude; Lebeau, Louis-Michel; Mirabel, Philippe; Lariviere, Vincent; Archambault, Eric

2010-01-01

As bibliometric indicators are objective, reliable, and cost-effective measures of peer-reviewed research outputs, they are expected to play an increasingly important role in research assessment/management. Recently, a bibliometric approach was developed and integrated within the evaluation framework of research funded by the National Cancer…
Neuropsychological test performance and prediction of functional capacities among Spanish-speaking and English-speaking patients with dementia.

PubMed

Loewenstein, D A; Rubert, M P; Argüelles, T; Duara, R

1995-03-01

Neuropsychological measures have been widely used by clinicians to assist them in making judgments regarding a cognitively impaired patient's ability to independently perform important activities of daily living. However, important questions have been raised concerning the degree to which neuropsychological instruments can predict a broad array of specific functional capacities required in the home environment. In the present study, we examined 127 English-speaking and 56 Spanish-speaking patients with Alzheimer's disease (AD) and determined the extent to which various neuropsychological measures and demographic variables were predictive of performance on functional measures administered within the clinical setting. Among English-speaking AD patients, Block Design and Digit-Span of the WAIS-R, as well as tests of language were among the strongest predictors of functional performance. For Spanish-speakers, Block Design, The Mini-Mental State Evaluation (MMSE) and Digit Span had the optimal predictive power. When stepwise regression was conducted on the entire sample of 183 subjects, ethnicity emerged as a statistically significant predictor variable on one of the seven functional tests (writing a check). Despite the predictive power of several of the neuropsychological measures for both groups, most of the variability in objective functional performance could not be explained in our regression models. As a result, it would appear prudent to include functional measures as part of a comprehensive neuropsychological evaluation for dementia.
Evaluation of working conditions of workers engaged in tending horses.

PubMed

Nowakowicz-Dębek, Bożena; Pawlak, Halina; Wlazło, Łukasz; Kuna-Broniowska, Izabela; Bis-Wencel, Hanna; Buczaj, Agnieszka; Maksym, Piotr

2014-01-01

A growing interest in the horse business has resulted in the increased engagement of many people in this area, and the health problems occurring among workers create the need to search for prophylactic measures. The objective of the study was evaluation of the level of exposure to air pollution in a stable, and estimation of the degree of work load among workers engaged in tending horses. The study was conducted twice, during the winter season, in a stable maintaining race horses, and in a social room. In order to evaluate workers' exposure, air samples were collected by the aspiration method. After the incubation of material, the total number of bacteria and fungi in the air was determined, as well as the number of aerobic mesophilic and thermophilic bacteria, expressed as the number of colony forming units per cubic meter of air (CFU/m3). The measurement of total dust concentration in the air was also performed, simultaneously with the measurement of microclimatic parameters. The study of work load also covered the measurement of energy expenditure, evaluation of static physical load, and monotony of movements performed. The stable may be considered as a workplace with considerable risk of the occurrence of unfavourable health effects.
The relationships of waist and mid-thigh circumference with performance of college golfers

PubMed Central

Son, Seungbum; Han, Kunho; So, Wi-Young

2016-01-01

[Purpose] Our aim was to evaluate the relationships between waist and mid-thigh circumference, used as proxy measures of trunk and lower limb strengths, respectively, and selected parameters of driver and putting performance in Korean college golfers. [Subjects and Methods] The participants were 103 college golfers (81 male, 20 to 27 years old). Measurements of body composition, waist and mid-thigh circumference, and grip strength, as well as assessment of golf performance, including driver distance, driver swing speed, putting accuracy, and putting consistency, were performed at the golf performance laboratory at Konkuk University in Chungju-si, Republic of Korea. Average round score was obtained from 10 rounds of golf completed during the study period. The relationships between strength measures and golf performance were evaluated by partial correlation analysis, with adjustment for age, golf experience, and body mass index. [Results] Waist circumference did not correlate with any of the performance variables in both males and females. Mid-thigh circumference correlated with putting consistency (r = 0.364) in males and with putting consistency (r = 0.490) and accuracy (r = 0.547) in females. No other significant correlations between waist and mid-thigh circumference and golf performance were identified. [Conclusion] Lower limb strength may be an important component of putting performance. Further studies are needed to fully characterize the contributions of trunk strength to performance. PMID:27134346
The Influence of the Manner of Performing the Thyroid Ultrasound Examination on the Reliability of the Assessment of the Thyroid Size in School-Aged Children.

PubMed

Zygmunt, Arkadiusz; Adamczewski, Zbigniew; Zygmunt, Agnieszka; Karbownik-Lewinska, Malgorzata; Lewinski, Andrzej

2017-01-01

Goitre incidence in school-aged children evaluated using ultrasonography is one of the essential indicators of iodine intake in a given area. The aim of the study was to examine what the difference is between the volume of the thyroid gland measured in the supine and sitting position and to determine the intra-observer, inter-observer, and inter-position variations. The survey was conducted among 87 children (56 girls and 31 boys aged 7-13 years, mean age 10.44 ± 1.72 years). The thyroid volume measured in a sitting position was significantly lower than that measured in the supine position. The intra-observer variations for the total thyroid volume equalled 9.56-9.65%. The inter-observer variations were significantly higher and amounted to 34.5-35.7%. The way in which ultrasound evaluation is performed is important for the analysis of the results. It is crucial to aim for the smallest inter-observer variation, which can be achieved by strictly defining the methods of the thyroid measurement and comparing one's measuring techniques with the reference method. The use of standards in ultrasound evaluation performed in the supine position, as well as the use of standards without a strict determination of the study method, can lead to erro-neous conclusions. © 2017 S. Karger AG, Basel.

Familial aggregation patterns in mathematical ability.

PubMed

Wijsman, Ellen M; Robinson, Nancy M; Ainsworth, Kathryn H; Rosenthal, Elisabeth A; Holzman, Ted; Raskind, Wendy H

2004-01-01

Mathematical talent is an asset in modern society both at an individual and a societal level. Environmental factors such as quality of mathematics education undoubtedly affect an individual's performance, and there is some evidence that genetic factors also may play a role. The current study was performed to investigate the feasibility of undertaking genetics studies on mathematical ability. Because the etiology of low ability in mathematics is likely to be multifactorial and heterogeneous, we evaluated families ascertained through a proband with high mathematical performance in grade 7 on the SAT to eliminate, to some degree, adverse environmental factors. Families of sex-matched probands, selected for high verbal performance on the SAT, served as the comparison group. We evaluated a number of proxy measures for their usefulness in the study of clustering of mathematical talent. Given the difficulty of testing mathematics performance across developmental ages, especially with the added complexity of decreasing exposure to formal mathematics concepts post schooling, we also devised a semiquantitative scale that incorporated educational, occupational, and avocational information as a surrogate for an academic mathematics measure. Whereas several proxy measures showed no evidence of a genetic basis, we found that the semiquantitative scale of mathematical talent showed strong evidence of a genetic basis, with a differential response as a function of the performance measure used to select the proband. This observation suggests that there may be a genetic basis to specific mathematical talent, and that specific, as opposed to proxy, investigative measures that are designed to measure such talent in family members could be of benefit for this purpose.
Comparative evaluation of ultrasound scanner accuracy in distance measurement

NASA Astrophysics Data System (ADS)

Branca, F. P.; Sciuto, S. A.; Scorza, A.

2012-10-01

The aim of the present study is to develop and compare two different automatic methods for accuracy evaluation in ultrasound phantom measurements on B-mode images: both of them give as a result the relative error e between measured distances, performed by 14 brand new ultrasound medical scanners, and nominal distances, among nylon wires embedded in a reference test object. The first method is based on a least squares estimation, while the second one applies the mean value of the same distance evaluated at different locations in ultrasound image (same distance method). Results for both of them are proposed and explained.
APPLICATION OF THE AERIAL PROFILING OF TERRAIN SYSTEM.

USGS Publications Warehouse

Cyran, Edward J.

1985-01-01

The U. S. Geological Survey has completed the performance evaluation flight tests of the Aerial Profiling of Terrain System (APTS) and is now performing a series of application tests to determine its effectiveness and efficiency as an earth-science data collection tool. These tests are designed to evaluate the APTS at such tasks as positioning water wells, testing reliability of older maps, measuring elevations of kettle ponds, and profiling stream valleys for flood studies. The results of three application tests in Massachusetts are discussed: positioning water wells and measuring elevations along the Charles River; testing four older 1:24,000-scale quadrangle maps in the Plymouth area; and measuring elevations of several hundred kettle ponds near the Cape Cod Canal.
Function library programming to support B89 evaluation of Sheffield Apollo RS50 DCC (Direct Computer Control) CMM (Coordinate Measuring Machine)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Frank, R.N.

1990-02-28

The Inspection Shop at Lawrence Livermore Lab recently purchased a Sheffield Apollo RS50 Direct Computer Control Coordinate Measuring Machine. The performance of the machine was specified to conform to B89 standard which relies heavily upon using the measuring machine in its intended manner to verify its accuracy (rather than parametric tests). Although it would be possible to use the interactive measurement system to perform these tasks, a more thorough and efficient job can be done by creating Function Library programs for certain tasks which integrate Hewlett-Packard Basic 5.0 language and calls to proprietary analysis and machine control routines. This combinationmore » provides efficient use of the measuring machine with a minimum of keyboard input plus an analysis of the data with respect to the B89 Standard rather than a CMM analysis which would require subsequent interpretation. This paper discusses some characteristics of the Sheffield machine control and analysis software and my use of H-P Basic language to create automated measurement programs to support the B89 performance evaluation of the CMM. 1 ref.« less
Evaluation of Low-Cost Mitigation Measures Implemented to Improve Air Quality in Nursery and Primary Schools

PubMed Central

Sá, Juliana P.; Branco, Pedro T. B. S.; Alvim-Ferraz, Maria C. M.; Martins, Fernando G.; Sousa, Sofia I. V.

2017-01-01

Indoor air pollution mitigation measures are highly important due to the associated health impacts, especially on children, a risk group that spends significant time indoors. Thus, the main goal of the work here reported was the evaluation of mitigation measures implemented in nursery and primary schools to improve air quality. Continuous measurements of CO2, CO, NO2, O3, CH2O, total volatile organic compounds (VOC), PM1, PM2.5, PM10, Total Suspended Particles (TSP) and radon, as well as temperature and relative humidity were performed in two campaigns, before and after the implementation of low-cost mitigation measures. Evaluation of those mitigation measures was performed through the comparison of the concentrations measured in both campaigns. Exceedances to the values set by the national legislation and World Health Organization (WHO) were found for PM2.5, PM10, CO2 and CH2O during both indoor air quality campaigns. Temperature and relative humidity values were also above the ranges recommended by American Society of Heating, Refrigerating, and Air-Conditioning Engineers (ASHRAE). In general, pollutant concentrations measured after the implementation of low-cost mitigation measures were significantly lower, mainly for CO2. However, mitigation measures were not always sufficient to decrease the pollutants’ concentrations till values considered safe to protect human health. PMID:28561795
Trends Supporting the In-Field Use of Wearable Inertial Sensors for Sport Performance Evaluation: A Systematic Review.

PubMed

Camomilla, Valentina; Bergamini, Elena; Fantozzi, Silvia; Vannozzi, Giuseppe

2018-03-15

Recent technological developments have led to the production of inexpensive, non-invasive, miniature magneto-inertial sensors, ideal for obtaining sport performance measures during training or competition. This systematic review evaluates current evidence and the future potential of their use in sport performance evaluation. Articles published in English (April 2017) were searched in Web-of-Science, Scopus, Pubmed, and Sport-Discus databases. A keyword search of titles, abstracts and keywords which included studies using accelerometers, gyroscopes and/or magnetometers to analyse sport motor-tasks performed by athletes (excluding risk of injury, physical activity, and energy expenditure) resulted in 2040 papers. Papers and reference list screening led to the selection of 286 studies and 23 reviews. Information on sport, motor-tasks, participants, device characteristics, sensor position and fixing, experimental setting and performance indicators was extracted. The selected papers dealt with motor capacity assessment (51 papers), technique analysis (163), activity classification (19), and physical demands assessment (61). Focus was placed mainly on elite and sub-elite athletes (59%) performing their sport in-field during training (62%) and competition (7%). Measuring movement outdoors created opportunities in winter sports (8%), water sports (16%), team sports (25%), and other outdoor activities (27%). Indications on the reliability of sensor-based performance indicators are provided, together with critical considerations and future trends.
Travtek Global Evaluation And Executive Summary

DOT National Transportation Integrated Search

2000-09-01

Several measures have been carried out in the Long Term Pavement Performance (LTPP) Program to ensure uniform distress data collection and interpretation. However, no systematic evaluation has been done to quantify the variability (bias and precision...
Quantitative comparison of randomization designs in sequential clinical trials based on treatment balance and allocation randomness.

PubMed

Zhao, Wenle; Weng, Yanqiu; Wu, Qi; Palesch, Yuko

2012-01-01

To evaluate the performance of randomization designs under various parameter settings and trial sample sizes, and identify optimal designs with respect to both treatment imbalance and allocation randomness, we evaluate 260 design scenarios from 14 randomization designs under 15 sample sizes range from 10 to 300, using three measures for imbalance and three measures for randomness. The maximum absolute imbalance and the correct guess (CG) probability are selected to assess the trade-off performance of each randomization design. As measured by the maximum absolute imbalance and the CG probability, we found that performances of the 14 randomization designs are located in a closed region with the upper boundary (worst case) given by Efron's biased coin design (BCD) and the lower boundary (best case) from the Soares and Wu's big stick design (BSD). Designs close to the lower boundary provide a smaller imbalance and a higher randomness than designs close to the upper boundary. Our research suggested that optimization of randomization design is possible based on quantified evaluation of imbalance and randomness. Based on the maximum imbalance and CG probability, the BSD, Chen's biased coin design with imbalance tolerance method, and Chen's Ehrenfest urn design perform better than popularly used permuted block design, EBCD, and Wei's urn design. Copyright © 2011 John Wiley & Sons, Ltd.
Evaluation of performance of light-weight profilometers

DOT National Transportation Integrated Search

2003-10-01

Several lightweight, non-contact profilometers (LWP) are now available to measure profiles of newly constructed Portland Cement Concrete Pavement (PCCP). As constructed smoothness measurements by four LWP's and the California-type profilograph were c...
Occupant Motion Sensors

DOT National Transportation Integrated Search

1971-03-01

An analysis was made of methods for measuring vehicle occupant motion during crash or impact conditions. The purpose of the measurements is to evaluate restraint performance using human, anthropometric dummy, or animal occupants. A detailed Fourier f...
Diabetes-related emotional distress instruments: a systematic review of measurement properties.

PubMed

Lee, Jiyeon; Lee, Eun-Hyun; Kim, Chun-Ja; Moon, Seung Hei

2015-12-01

The objectives of this study were to identify all available diabetes-related emotional distress instruments and evaluate the evidence regarding their measurement properties to help in the selection of the most appropriate instrument for use in practice and research. A systematic literature search was performed. PubMed, Embase, CINAHL, and PsycINFO were searched systematically for articles on diabetes-related emotional distress instruments. The Consensus-based Standards for the Selection of Health Measurement Instruments checklist was used to evaluate the methodological quality of the identified studies. The quality of results with respect to the measurement properties of each study was evaluated using Terwee's quality criteria. An ancillary meta-analysis was performed. Of the 2345 articles yielded by the search, 19 full-text articles evaluating 6 diabetes-related emotional distress instruments were included in this study. No instrument demonstrated evidence for all measurement properties. The Problem Areas in Diabetes scale (PAID) was the most frequently studied and the best validated of the instruments. Pooled summary estimates of the correlation coefficient between the PAID and serum glycated hemoglobin revealed a positive but weak correlation. No diabetes-related emotional distress instrument demonstrated evidence for all measurement properties. No instrument was better than another, although the PAID was the best validated and is thus recommended for use. Further psychometric studies of the diabetes-related emotional distress instruments with rigorous methodologies are required. Copyright © 2015 Elsevier Ltd. All rights reserved.
Textractor: a hybrid system for medications and reason for their prescription extraction from clinical text documents.

PubMed

Meystre, Stéphane M; Thibault, Julien; Shen, Shuying; Hurdle, John F; South, Brett R

2010-01-01

OBJECTIVE To describe a new medication information extraction system-Textractor-developed for the 'i2b2 medication extraction challenge'. The development, functionalities, and official evaluation of the system are detailed. Textractor is based on the Apache Unstructured Information Management Architecture (UMIA) framework, and uses methods that are a hybrid between machine learning and pattern matching. Two modules in the system are based on machine learning algorithms, while other modules use regular expressions, rules, and dictionaries, and one module embeds MetaMap Transfer. The official evaluation was based on a reference standard of 251 discharge summaries annotated by all teams participating in the challenge. The metrics used were recall, precision, and the F(1)-measure. They were calculated with exact and inexact matches, and were averaged at the level of systems and documents. The reference metric for this challenge, the system-level overall F(1)-measure, reached about 77% for exact matches, with a recall of 72% and a precision of 83%. Performance was the best with route information (F(1)-measure about 86%), and was good for dosage and frequency information, with F(1)-measures of about 82-85%. Results were not as good for durations, with F(1)-measures of 36-39%, and for reasons, with F(1)-measures of 24-27%. The official evaluation of Textractor for the i2b2 medication extraction challenge demonstrated satisfactory performance. This system was among the 10 best performing systems in this challenge.
Functional Fitness Testing Results Following Long-Duration ISS Missions.

PubMed

Laughlin, Mitzi S; Guilliams, Mark E; Nieschwitz, Bruce A; Hoellen, David

2015-12-01

Long-duration spaceflight missions lead to the loss of muscle strength and endurance. Significant reduction in muscle function can be hazardous when returning from spaceflight. To document these losses, NASA developed medical requirements that include measures of functional strength and endurance. Results from this Functional Fitness Test (FFT) battery are also used to evaluate the effectiveness of in-flight exercise countermeasures. The purpose of this paper is to document results from the FFT and correlate this information with performance of in-flight exercise on board the International Space Station. The FFT evaluates muscular strength and endurance, flexibility, and agility and includes the following eight measures: sit and reach, cone agility, push-ups, pull-ups, sliding crunches, bench press, leg press, and hand grip dynamometry. Pre- to postflight functional fitness measurements were analyzed using dependent t-tests and correlation analyses were used to evaluate the relationship between functional fitness measurements and in-flight exercise workouts. Significant differences were noted post space flight with the sit and reach, cone agility, leg press, and hand grip measurements while other test scores were not significantly altered. The relationships between functional fitness and in-flight exercise measurements showed minimal to moderate correlations for most in-flight exercise training variables. The change in FFT results can be partially explained by in-flight exercise performance. Although there are losses documented in the FFT results, it is important to realize that the crewmembers are successfully performing activities of daily living and are considered functional for normal activities upon return to Earth.
Measuring Information Security Performance with 10 by 10 Model for Holistic State Evaluation

PubMed Central

2016-01-01

Organizations should measure their information security performance if they wish to take the right decisions and develop it in line with their security needs. Since the measurement of information security is generally underdeveloped in practice and many organizations find the existing recommendations too complex, the paper presents a solution in the form of a 10 by 10 information security performance measurement model. The model—ISP 10×10M is composed of ten critical success factors, 100 key performance indicators and 6 performance levels. Its content was devised on the basis of findings presented in the current research studies and standards, while its structure results from an empirical research conducted among information security professionals from Slovenia. Results of the study show that a high level of information security performance is mostly dependent on measures aimed at managing information risks, employees and information sources, while formal and environmental factors have a lesser impact. Experts believe that information security should evolve systematically, where it’s recommended that beginning steps include technical, logical and physical security controls, while advanced activities should relate predominantly strategic management activities. By applying the proposed model, organizations are able to determine the actual level of information security performance based on the weighted indexing technique. In this manner they identify the measures they ought to develop in order to improve the current situation. The ISP 10×10M is a useful tool for conducting internal system evaluations and decision-making. It may also be applied to a larger sample of organizations in order to determine the general state-of-play for research purposes. PMID:27655001
Teacher Effectiveness: An Update on Pennsylvania's Teacher Evaluation System. Issue Brief

ERIC Educational Resources Information Center

Research For Action, 2013

2013-01-01

Act 82 of 2012 established new standards for Pennsylvania's teacher evaluation system, including the incorporation of student performance measures in ratings decisions. Since 2009, approximately 35 states have amended teacher evaluation systems, with student achievement playing an increasingly prominent role. This count includes neighboring…
Evaluation of the measurement properties of self-reported health-related work-functioning instruments among workers with common mental disorders.

PubMed

Abma, Femke I; van der Klink, Jac J L; Terwee, Caroline B; Amick, Benjamin C; Bültmann, Ute

2012-01-01

During the past decade, common mental disorders (CMD) have emerged as a major public and occupational health problem in many countries. Several instruments have been developed to measure the influence of health on functioning at work. To select appropriate instruments for use in occupational health practice and research, the measurement properties (eg, reliability, validity, responsiveness) must be evaluated. The objective of this study is to appraise critically and compare the measurement properties of self-reported health-related work-functioning instruments among workers with CMD. A systematic review was performed searching three electronic databases. Papers were included that: (i) mainly focused on the development and/or evaluation of the measurement properties of a self-reported health-related work-functioning instrument; (ii) were conducted in a CMD population; and (iii) were fulltext original papers. Quality appraisal was performed using the consensus-based standards for the selection of health status measurement instruments (COSMIN) checklist. Five papers evaluating measurement properties of five self-reported health-related work-functioning instruments in CMD populations were included. There is little evidence available for the measurement properties of the identified instruments in this population, mainly due to low methodological quality of the included studies. The available evidence on measurement properties is based on studies of poor-to-fair methodological quality. Information on a number of measurement properties, such as measurement error, content validity, and cross-cultural validity is still lacking. Therefore, no evidence-based decisions and recommendations can be made for the use of health-related work functioning instruments. Studies of high methodological quality are needed to properly assess the existing instruments' measurement properties.
Entrepreneurship Education and Academic Performance

ERIC Educational Resources Information Center

Johansen, Vegard

2014-01-01

The significant increase of entrepreneurship education (EE) is a trend in Europe. Entrepreneurship education is supposed to promote general and specific entrepreneurial abilities and improve academic performance. This paper evaluates whether EE influences academic performance, measured by Grade Point Average. The main indicator used for EE is the…
Construct Validity of Three Clerkship Performance Assessments

ERIC Educational Resources Information Center

Lee, Ming; Wimmers, Paul F.

2010-01-01

This study examined construct validity of three commonly used clerkship performance assessments: preceptors' evaluations, OSCE-type clinical performance measures, and the NBME [National Board of Medical Examiners] medicine subject examination. Six hundred and eighty-six students taking the inpatient medicine clerkship from 2003 to 2007…
Impact of jammer side information on the performance of anti-jam systems

NASA Astrophysics Data System (ADS)

Lim, Samuel

1992-03-01

The Chernoff bound parameter, D, provides a performance measure for all coded communication systems. D can be used to determine upper-bounds on bit error probabilities (BEPs) of Viterbi decoded convolutional codes. The impact on BEP bounds of channel measurements that provide additional side information can also be evaluated with D. This memo documents the results of a Chernoff bound parameter evaluation in optimum partial-band noise jamming (OPBNJ) for both BPSK and DPSK modulation schemes. Hard and soft quantized receivers, with and without jammer side information (JSI), were examined. The results of this analysis indicate that JSI does improve decoding performance. However, a knowledge of jammer presence alone achieves a performance level comparable to soft decision decoding with perfect JSI. Furthermore, performance degradation due to the lack of JSI can be compensated for by increasing the number of levels of quantization. Therefore, an anti-jam system without JSI can be made to perform almost as well as a system with JSI.
Evaluation plan for space station network interface units

NASA Technical Reports Server (NTRS)

Weaver, Alfred C.

1990-01-01

Outlined here is a procedure for evaluating network interface units (NIUs) produced for the Space Station program. The procedures should be equally applicable to the data management system (DMS) testbed NIUs produced by Honeywell and IBM. The evaluation procedures are divided into four areas. Performance measurement tools are hardware and software that must be developed in order to evaluate NIU performance. Performance tests are a series of tests, each of which documents some specific characteristic of NIU and/or network performance. In general, these performance tests quantify the speed, capacity, latency, and reliability of message transmission under a wide variety of conditions. Functionality tests are a series of tests and code inspections that demonstrate the functionality of the particular subset of ISO protocols which have been implemented in a given NIU. Conformance tests are a series of tests which would expose whether or not selected features within the ISO protocols are present and interoperable.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.