Sample records for observed performance evaluation

  1. When the third party observer of a neuropsychological evaluation is an audio-recorder.

    PubMed

    Constantinou, Marios; Ashendorf, Lee; McCaffrey, Robert J

    2002-08-01

    The presence of third parties during neuropsychological evaluations is an issue of concern for contemporary neuropsychologists. Previous studies have reported that the presence of an observer during neuropsychological testing alters the performance of individuals under evaluation. The present study sought to investigate whether audio-recording affects the neuropsychological test performance of individuals in the same way that third party observation does. In the presence of an audio-recorder the performance of the participants on memory tests declined. Performance on motor tests, on the other hand, was not affected by the presence of an audio-recorder. The implications of these findings in forensic neuropsychological evaluations are discussed.

  2. Cluster signal-to-noise analysis for evaluation of the information content in an image.

    PubMed

    Weerawanich, Warangkana; Shimizu, Mayumi; Takeshita, Yohei; Okamura, Kazutoshi; Yoshida, Shoko; Yoshiura, Kazunori

    2018-01-01

    (1) To develop an observer-free method of analysing image quality related to the observer performance in the detection task and (2) to analyse observer behaviour patterns in the detection of small mass changes in cone-beam CT images. 13 observers detected holes in a Teflon phantom in cone-beam CT images. Using the same images, we developed a new method, cluster signal-to-noise analysis, to detect the holes by applying various cut-off values using ImageJ and reconstructing cluster signal-to-noise curves. We then evaluated the correlation between cluster signal-to-noise analysis and the observer performance test. We measured the background noise in each image to evaluate the relationship with false positive rates (FPRs) of the observers. Correlations between mean FPRs and intra- and interobserver variations were also evaluated. Moreover, we calculated true positive rates (TPRs) and accuracies from background noise and evaluated their correlations with TPRs from observers. Cluster signal-to-noise curves were derived in cluster signal-to-noise analysis. They yield the detection of signals (true holes) related to noise (false holes). This method correlated highly with the observer performance test (R² = 0.9296). In noisy images, increasing background noise resulted in higher FPRs and larger intra- and interobserver variations. TPRs and accuracies calculated from background noise had high correlation with actual TPRs from observers; R² was 0.9244 and 0.9338, respectively. Cluster signal-to-noise analysis can simulate the detection performance of observers and thus replace the observer performance test in the evaluation of image quality. Erroneous decision-making increased with increasing background noise.
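
The true positive rate, false positive rate, and accuracy named in this abstract can be computed from detection counts roughly as follows. This is a minimal sketch, not the authors' procedure: the function name `detection_rates` and the counts are hypothetical and are not taken from the study.

```python
def detection_rates(tp, fp, tn, fn):
    """Return TPR, FPR and accuracy from detection counts.

    tp: true holes marked as holes      fn: true holes missed
    fp: non-holes marked as holes       tn: non-holes correctly left unmarked
    """
    positives = tp + fn
    negatives = fp + tn
    if positives == 0 or negatives == 0:
        raise ValueError("need at least one positive and one negative case")
    return {
        "tpr": tp / positives,
        "fpr": fp / negatives,
        "accuracy": (tp + tn) / (positives + negatives),
    }


# Hypothetical counts for one observer, for illustration only.
rates = detection_rates(tp=18, fp=3, tn=27, fn=2)
print(rates)
```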

  3. Evaluation of Multiclass Model Observers in PET LROC Studies

    NASA Astrophysics Data System (ADS)

    Gifford, H. C.; Kinahan, P. E.; Lartizien, C.; King, M. A.

    2007-02-01

    A localization ROC (LROC) study was conducted to evaluate nonprewhitening matched-filter (NPW) and channelized NPW (CNPW) versions of a multiclass model observer as predictors of human tumor-detection performance with PET images. Target localization is explicitly performed by these model observers. Tumors were placed in the liver, lungs, and background soft tissue of a mathematical phantom, and the data simulation modeled a full-3D acquisition mode. Reconstructions were performed with the FORE+AWOSEM algorithm. The LROC study measured observer performance with 2D images consisting of either coronal, sagittal, or transverse views of the same set of cases. Versions of the CNPW observer based on two previously published difference-of-Gaussian channel models demonstrated good quantitative agreement with human observers. One interpretation of these results treats the CNPW observer as a channelized Hotelling observer with implicit internal noise

  4. Clinical Observed Performance Evaluation: A Prospective Study in Final Year Students of Surgery

    ERIC Educational Resources Information Center

    Markey, G. C.; Browne, K.; Hunter, K.; Hill, A. D.

    2011-01-01

    We report a prospective study of clinical observed performance evaluation (COPE) for 197 medical students in the pre-qualification year of clinical education. Psychometric quality was the main endpoint. Students were assessed in groups of 5 in 40-min patient encounters, with each student the focus of evaluation for 8 min. Each student had a series…

  5. Evaluation of CNN as anthropomorphic model observer

    NASA Astrophysics Data System (ADS)

    Massanes, Francesc; Brankov, Jovan G.

    2017-03-01

    Model observers (MO) are widely used in medical imaging to act as surrogates of human observers in task-based image quality evaluation, frequently towards optimization of reconstruction algorithms. In this paper, we explore the use of convolutional neural networks (CNN) as MO. We will compare CNN MO to alternative MO currently being proposed and used, such as the relevance vector machine based MO and the channelized Hotelling observer (CHO). As the success of the CNN, and other deep learning approaches, is rooted in the availability of large data sets, which is rarely the case in medical imaging systems task-performance evaluation, we will evaluate CNN performance on both large and small training data sets.

  6. A Study of the Associations between Conditions of Performance and Characteristics of Performers and New York State Solo Performance Ratings

    ERIC Educational Resources Information Center

    vonWurmb, Elizabeth C.

    2013-01-01

    This dissertation undertakes an analysis of 1,044 performance evaluations from New York State School Music Association (NYSSMA) Spring Festival solo adjudication ratings of student performers from a large suburban school district. It relies on results of evaluations of observed performances, and takes these evaluations as assessments of what the…

  7. Exploring Instructional Coaches' Attitudes and Use of the DataCapture Mobile Application to Collect Video-Based Evidence in Teacher Evaluation

    ERIC Educational Resources Information Center

    Shewell, Justin Reed

    2013-01-01

    Teacher observations are an integral part of teacher development. Many teachers are observed once or twice a year to evaluate their performance and hold them accountable for meeting standards. Instructional coaches, however, observe and work with teachers to help them reflect on their performance, with the goal of improving their practice.…

  8. Development of a Rubric for Collegiate Jazz Improvisation Performance Assessment

    ERIC Educational Resources Information Center

    Moore, Kendall Ryan

    2016-01-01

    The purpose of this study was to develop a jazz improvisation rubric for the evaluation of collegiate jazz improvisation. To create this measure, research objectives were devised to investigate the aurally-observed performer-controlled components of improvisation, which aurally-observed components should be evaluated in an improvisatory…

  9. 40 CFR 63.7342 - What records must I keep?

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... malfunction. (3) Records of performance tests, performance evaluations, and opacity observations as required...) Monitoring data for COMS during a performance evaluation as required in § 63.6(h)(7)(i) and (ii). (3) Previous (that is, superceded) versions of the performance evaluation plan as required in § 63.8(d)(3). (4...

  10. When 'just doing it' is not enough: assessing the fidelity of player performance of an injury prevention exercise program.

    PubMed

    Fortington, Lauren V; Donaldson, Alex; Lathlean, Tim; Young, Warren B; Gabbe, Belinda J; Lloyd, David; Finch, Caroline F

    2015-05-01

    To obtain benefits from sports injury prevention programs, players are instructed to perform the exercises as prescribed. We developed an observational checklist to measure the quality of exercise performance by players participating in FootyFirst, a coach-led, exercise-based, lower-limb injury prevention program in community Australian Football (AF). Observational. The essential performance criteria for each FootyFirst exercise were described in terms of the technique, volume and intensity required to perform each exercise. An observational checklist was developed to evaluate each criterion through direct visual observation of players at training. The checklist was trialled by two independent raters who observed the same 70 players completing the exercises at eight clubs. Agreement between observers was assessed by Kappa-statistics. Exercise fidelity was defined as the proportion of observed players who performed all aspects of their exercises correctly. The raters agreed on 61/70 observations (87%) (Kappa=0.72, 95% CI: 0.55; 0.89). Of the observations with agreed ratings, 41 (67%) players were judged as performing the exercises as prescribed. The observational checklist demonstrated high inter-rater reliability. Many players observed did not perform the exercises as prescribed, raising concern as to whether they would be receiving anticipated program benefits. Where quality of exercise performance is important, evaluation and reporting of program fidelity should include direct observations of participants. Copyright © 2014 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
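
Inter-rater agreement of the kind reported here is commonly summarised with Cohen's kappa, which corrects raw agreement for the agreement expected by chance. The sketch below assumes two raters and a binary rating. The rating vectors are hypothetical: they reproduce the reported 61/70 agreement and the 41 agreed "as prescribed" ratings, but the split of the nine disagreements is an assumption, not data from the study.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters labelling the same items."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(rater_a)
    # Observed proportion of items on which the raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance from each rater's marginal frequencies.
    count_a = Counter(rater_a)
    count_b = Counter(rater_b)
    categories = set(count_a) | set(count_b)
    expected = sum(count_a[c] * count_b[c] for c in categories) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used a single identical category
    return (observed - expected) / (1.0 - expected)


# Hypothetical ratings: 1 = performed as prescribed, 0 = not as prescribed.
# 41 + 20 agreements and 5 + 4 disagreements (the disagreement split is assumed).
rater_a = [1] * 41 + [0] * 20 + [1] * 5 + [0] * 4
rater_b = [1] * 41 + [0] * 20 + [0] * 5 + [1] * 4

kappa = cohens_kappa(rater_a, rater_b)
print(round(kappa, 2))
```

With these assumed vectors the raw agreement is 61/70 and kappa comes out close to the reported value. A different split of the disagreements would shift it slightly.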

  11. Self-Handicapping and Interpersonal Trade-Offs: The Effects of Claimed Self-Handicaps on Observers' Performance Evaluations and Feedback.

    ERIC Educational Resources Information Center

    Rhodewalt, Frederick; And Others

    1995-01-01

    Male subjects (n=130) evaluated performance of targets who, prior to and during the performance, offered no excuse, claimed intended low effort, claimed anxiety, or claimed drug impairment. Subjects evaluated objectively equivalent performances more negatively if they came from an excuse-making target than a no-excuse target. (JBJ)

  12. Evaluating performance of risk identification methods through a large-scale simulation of observational data.

    PubMed

    Ryan, Patrick B; Schuemie, Martijn J

    2013-10-01

    There has been only limited evaluation of statistical methods for identifying safety risks of drug exposure in observational healthcare data. Simulations can support empirical evaluation, but have not been shown to adequately model the real-world phenomena that challenge observational analyses. To design and evaluate a probabilistic framework (OSIM2) for generating simulated observational healthcare data, and to use this data for evaluating the performance of methods in identifying associations between drug exposure and health outcomes of interest. Seven observational designs, including case-control, cohort, self-controlled case series, and self-controlled cohort design were applied to 399 drug-outcome scenarios in 6 simulated datasets with no effect and injected relative risks of 1.25, 1.5, 2, 4, and 10, respectively. Longitudinal data for 10 million simulated patients were generated using a model derived from an administrative claims database, with associated demographics, periods of drug exposure derived from pharmacy dispensings, and medical conditions derived from diagnoses on medical claims. Simulation validation was performed through descriptive comparison with real source data. Method performance was evaluated using Area Under ROC Curve (AUC), bias, and mean squared error. OSIM2 replicates prevalence and types of confounding observed in real claims data. When simulated data are injected with relative risks (RR) ≥ 2, all designs have good predictive accuracy (AUC > 0.90), but when RR < 2, no methods achieve 100 % predictions. Each method exhibits a different bias profile, which changes with the effect size. OSIM2 can support methodological research. Results from simulation suggest method operating characteristics are far from nominal properties.
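
The three performance measures named in this abstract (AUC, bias, and mean squared error) can be sketched as below. This is an illustration under stated assumptions, not the OSIM2 evaluation code: the function names, the choice to measure error as a plain difference between estimate and injected value, and all numbers are hypothetical.

```python
def auc(scores, labels):
    """Area under the ROC curve via pairwise comparison (ties count one half).

    labels: 1 for a scenario with an injected effect, 0 for no effect.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for q in neg:
            if p > q:
                wins += 1.0
            elif p == q:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def bias(estimates, truths):
    """Mean signed error of the estimates."""
    return sum(e - t for e, t in zip(estimates, truths)) / len(estimates)


def mean_squared_error(estimates, truths):
    """Mean of the squared errors of the estimates."""
    return sum((e - t) ** 2 for e, t in zip(estimates, truths)) / len(estimates)


# Hypothetical method output for four drug-outcome scenarios.
example_auc = auc([0.9, 0.2, 0.8, 0.1], [1, 1, 0, 0])
example_bias = bias([2.5, 1.5], [2.0, 2.0])
example_mse = mean_squared_error([2.5, 1.5], [2.0, 2.0])
print(example_auc, example_bias, example_mse)
```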

  13. Weighting Mean and Variability during Confidence Judgments

    PubMed Central

    de Gardelle, Vincent; Mamassian, Pascal

    2015-01-01

    Humans can not only perform some visual tasks with great precision, they can also judge how good they are in these tasks. However, it remains unclear how observers produce such metacognitive evaluations, and how these evaluations might be dissociated from the performance in the visual task. Here, we hypothesized that some stimulus variables could affect confidence judgments above and beyond their impact on performance. In a motion categorization task on moving dots, we manipulated the mean and the variance of the motion directions, to obtain a low-mean low-variance condition and a high-mean high-variance condition with matched performances. Critically, in terms of confidence, observers were not indifferent between these two conditions. Observers exhibited marked preferences, which were heterogeneous across individuals, but stable within each observer when assessed one week later. Thus, confidence and performance are dissociable and observers’ confidence judgments put different weights on the stimulus variables that limit performance. PMID:25793275

  14. Predictive validity of driving-simulator assessments following traumatic brain injury: a preliminary study.

    PubMed

    Lew, Henry L; Poole, John H; Lee, Eun Ha; Jaffe, David L; Huang, Hsiu-Chen; Brodd, Edward

    2005-03-01

    To evaluate whether driving simulator and road test evaluations can predict long-term driving performance, we conducted a prospective study on 11 patients with moderate to severe traumatic brain injury. Sixteen healthy subjects were also tested to provide normative values on the simulator at baseline. At their initial evaluation (time-1), subjects' driving skills were measured during a 30-minute simulator trial using an automated 12-measure Simulator Performance Index (SPI), while a trained observer also rated their performance using a Driving Performance Inventory (DPI). In addition, patients were evaluated on the road by a certified driving evaluator. Ten months later (time-2), family members observed patients driving for at least 3 hours over 4 weeks and rated their driving performance using the DPI. At time-1, patients were significantly impaired on automated SPI measures of driving skill, including: speed and steering control, accidents, and vigilance to a divided-attention task. These simulator indices significantly predicted the following aspects of observed driving performance at time-2: handling of automobile controls, regulation of vehicle speed and direction, higher-order judgment and self-control, as well as a trend-level association with car accidents. Automated measures of simulator skill (SPI) were more sensitive and accurate than observational measures of simulator skill (DPI) in predicting actual driving performance. To our surprise, the road test results at time-1 showed no significant relation to driving performance at time-2. Simulator-based assessment of patients with brain injuries can provide ecologically valid measures that, in some cases, may be more sensitive than a traditional road test as predictors of long-term driving performance in the community.

  15. Is Beauty in the Eyes of the Beholder? Aesthetic Quality versus Technical Skill in Movement Evaluation of Tai Chi.

    PubMed

    Zamparo, Paola; Zorzi, Elena; Marcantoni, Sara; Cesari, Paola

    2015-01-01

    The aim of this study was to compare experts to naïve practitioners in rating the beauty and the technical quality of a Tai Chi sequence observed in video-clips (of high and middle level performances). Our hypotheses are: i) movement evaluation will correlate with the level of skill expressed in the kinematics of the observed action but ii) only experts will be able to unravel the technical component from the aesthetic component of the observed action. The judgments delivered indicate that both expert and non-expert observers are able to discern a good from a mediocre performance; however, as expected, only experts discriminate the technical from the aesthetic component of the action evaluated and do this independently of the level of skill shown by the model (high or middle level performances). Furthermore, the judgments delivered were strongly related to the kinematic variables measured in the observed model, indicating that observers rely on specific movement kinematics (e.g. movement amplitude, jerk and duration) for action evaluation. These results provide evidence of the complementary functional role of visual and motor action representation in movement evaluation and underline the role of expertise in judging the aesthetic quality of movements.

  16. Is Beauty in the Eyes of the Beholder? Aesthetic Quality versus Technical Skill in Movement Evaluation of Tai Chi

    PubMed Central

    2015-01-01

    The aim of this study was to compare experts to naïve practitioners in rating the beauty and the technical quality of a Tai Chi sequence observed in video-clips (of high and middle level performances). Our hypotheses are: i) movement evaluation will correlate with the level of skill expressed in the kinematics of the observed action but ii) only experts will be able to unravel the technical component from the aesthetic component of the observed action. The judgments delivered indicate that both expert and non-expert observers are able to discern a good from a mediocre performance; however, as expected, only experts discriminate the technical from the aesthetic component of the action evaluated and do this independently of the level of skill shown by the model (high or middle level performances). Furthermore, the judgments delivered were strongly related to the kinematic variables measured in the observed model, indicating that observers rely on specific movement kinematics (e.g. movement amplitude, jerk and duration) for action evaluation. These results provide evidence of the complementary functional role of visual and motor action representation in movement evaluation and underline the role of expertise in judging the aesthetic quality of movements. PMID:26047473

  17. Asynchronous threat awareness by observer trials using crowd simulation

    NASA Astrophysics Data System (ADS)

    Dunau, Patrick; Huber, Samuel; Stein, Karin U.; Wellig, Peter

    2016-10-01

    The last few years have shown that there is a high risk of asynchronous threats in everyday life. In large crowds especially, a high probability of asynchronous attacks is evident. High observational abilities to detect threats are desirable. Consequently, highly trained security and observation personnel are needed. This paper evaluates the effectiveness of a training methodology to enhance the performance of observation personnel engaging in a specific target identification task. For this purpose a crowd simulation video is utilized. The study first provides a measurement of the base performance before the training sessions. Furthermore, a training procedure will be performed. Base performance will then be compared to the after-training performance in order to look for a training effect. A thorough evaluation of both the training sessions and the overall performance will be done in this paper. A specific hypothesis-based metric is used. Results will be discussed in order to provide guidelines for the design of training for observational tasks.

  18. Performance Evaluation of the United Nations Environment Programme Air Quality Monitoring Unit

    EPA Pesticide Factsheets

    This report defines the specifics of the environmental test conditions used in the evaluation (systems and conditions), data observations, summarization of key performance evaluation findings, and ease of use features concerning the UNEP pod.

  19. Evaluation of medical command and control using performance indicators in a full-scale, major aircraft accident exercise.

    PubMed

    Gryth, Dan; Rådestad, Monica; Nilsson, Heléne; Nerf, Ola; Svensson, Leif; Castrén, Maaret; Rüter, Anders

    2010-01-01

    Large, functional, disaster exercises are expensive to plan and execute, and often are difficult to evaluate objectively. Command and control in disaster medicine organizations can benefit from objective results from disaster exercises to identify areas that must be improved. The objective of this pilot study was to examine if it is possible to use performance indicators for documentation and evaluation of medical command and control in a full-scale major incident exercise at two levels: (1) local level (scene of the incident and hospital); and (2) strategic level of command and control. Staff procedure skills also were evaluated. Trained observers were placed in each of the three command and control locations. These observers recorded and scored the performance of command and control using templates of performance indicators. The observers scored the level of performance by awarding 2, 1, or 0 points according to the template and evaluated content and timing of decisions. Results from 11 performance indicators were recorded at each template and scores greater than 11 were considered as acceptable. Prehospital command and control had the lowest score. This also was expressed by problems at the scene of the incident. The scores in management and staff skills were at the strategic level 15 and 17, respectively; and at the hospital level, 17 and 21, respectively. It is possible to use performance indicators in a full-scale, major incident exercise for evaluation of medical command and control. The results could be used to compare similar exercises and evaluate real incidents in the future.
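
The scoring rule described in this abstract (11 performance indicators, each awarded 2, 1, or 0 points, with a total above 11 considered acceptable) can be written down as a short sketch. The function names and the example scores are hypothetical. Only the point values, the number of indicators, and the threshold come from the abstract.

```python
N_INDICATORS = 11          # indicators per template, as stated in the abstract
ACCEPTABLE_ABOVE = 11      # totals greater than this are considered acceptable
ALLOWED_POINTS = {0, 1, 2}


def total_score(points):
    """Sum the points awarded for one template after validating them."""
    if len(points) != N_INDICATORS:
        raise ValueError(f"expected {N_INDICATORS} indicator scores")
    if any(p not in ALLOWED_POINTS for p in points):
        raise ValueError("each indicator must be scored 0, 1 or 2")
    return sum(points)


def is_acceptable(points):
    """True when the template total exceeds the acceptability threshold."""
    return total_score(points) > ACCEPTABLE_ABOVE


# Hypothetical observer scores for one command and control location.
example = [2, 2, 1, 1, 2, 1, 2, 1, 2, 1, 2]
print(total_score(example), is_acceptable(example))
```

The maximum possible total under this rule is 22, which is consistent with the reported scores of 15, 17, and 21.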

  20. 40 CFR 63.7842 - What records must I keep?

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ..., performance evaluations, and opacity observations as required in § 63.10(b)(2)(viii). (b) For each COMS, you... in § 63.10(b)(2)(vi) through (xi). (2) Monitoring data for a performance evaluation as required in § 63.6(h)(7)(i) and (ii). (3) Previous (that is, superceded) versions of the performance evaluation...

  1. Planning for an Evaluation of Teaching Performance. Volume IV. Summaries of Instruments for Use in Evaluating Teacher Performance.

    ERIC Educational Resources Information Center

    Yuzdepski, I., Comp.; Elliott, L., Comp.

    This document presents information, in the form of summary sheets, on 54 teacher evaluation instruments. Each summary contains pertinent information about the instrument regarding publishing company, author, criteria evaluated, subject of observation, category dimension, and coding units. The 19 criteria used in the evaluation tests, which were…

  2. Model Performance Evaluation and Scenario Analysis ...

    EPA Pesticide Factsheets

    This tool consists of two parts: model performance evaluation and scenario analysis (MPESA). The model performance evaluation consists of two components: model performance evaluation metrics and model diagnostics. These metrics provide modelers with statistical goodness-of-fit measures that capture magnitude only, sequence only, and combined magnitude and sequence errors. The performance measures include error analysis, coefficient of determination, Nash-Sutcliffe efficiency, and a new weighted rank method. These performance metrics only provide useful information about the overall model performance. Note that MPESA is based on the separation of observed and simulated time series into magnitude and sequence components. The separation of time series into magnitude and sequence components and the reconstruction back to time series provides diagnostic insights to modelers. For example, traditional approaches lack the capability to identify if the source of uncertainty in the simulated data is due to the quality of the input data or the way the analyst adjusted the model parameters. This report presents a suite of model diagnostics that identify if mismatches between observed and simulated data result from magnitude or sequence related errors. MPESA offers graphical and statistical options that allow HSPF users to compare observed and simulated time series and identify the parameter values to adjust or the input data to modify. The scenario analysis part of the too
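
Of the goodness-of-fit measures listed in this record, the Nash-Sutcliffe efficiency has a standard definition: one minus the ratio of the squared simulation error to the variance of the observations about their mean. The sketch below implements that textbook formula. It is not taken from MPESA, whose exact implementation may differ, and the time series are hypothetical.

```python
def nash_sutcliffe(observed, simulated):
    """Nash-Sutcliffe efficiency: 1 is a perfect fit, 0 matches the observed mean."""
    if len(observed) != len(simulated) or not observed:
        raise ValueError("series must be non-empty and of equal length")
    mean_obs = sum(observed) / len(observed)
    # Spread of the observations about their own mean.
    denominator = sum((o - mean_obs) ** 2 for o in observed)
    if denominator == 0:
        raise ValueError("observed series has zero variance")
    # Squared error between simulated and observed values.
    numerator = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    return 1.0 - numerator / denominator


# Hypothetical observed and simulated flows.
observed = [1.0, 2.0, 3.0, 4.0]
perfect = nash_sutcliffe(observed, observed)
mean_only = nash_sutcliffe(observed, [2.5, 2.5, 2.5, 2.5])
print(perfect, mean_only)
```

A simulation that always predicts the observed mean scores 0, and negative values indicate a fit worse than that baseline.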

  3. Objective and automated protocols for the evaluation of biomedical search engines using No Title Evaluation protocols.

    PubMed

    Campagne, Fabien

    2008-02-29

    The evaluation of information retrieval techniques has traditionally relied on human judges to determine which documents are relevant to a query and which are not. This protocol is used in the Text Retrieval Evaluation Conference (TREC), organized annually for the past 15 years, to support the unbiased evaluation of novel information retrieval approaches. The TREC Genomics Track has recently been introduced to measure the performance of information retrieval for biomedical applications. We describe two protocols for evaluating biomedical information retrieval techniques without human relevance judgments. We call these protocols No Title Evaluation (NT Evaluation). The first protocol measures performance for focused searches, where only one relevant document exists for each query. The second protocol measures performance for queries expected to have potentially many relevant documents per query (high-recall searches). Both protocols take advantage of the clear separation of titles and abstracts found in Medline. We compare the performance obtained with these evaluation protocols to results obtained by reusing the relevance judgments produced in the 2004 and 2005 TREC Genomics Track and observe significant correlations between performance rankings generated by our approach and TREC. Spearman's correlation coefficients in the range of 0.79-0.92 are observed comparing bpref measured with NT Evaluation or with TREC evaluations. For comparison, coefficients in the range 0.86-0.94 can be observed when evaluating the same set of methods with data from two independent TREC Genomics Track evaluations. We discuss the advantages of NT Evaluation over the TRels and the data fusion evaluation protocols introduced recently. Our results suggest that the NT Evaluation protocols described here could be used to optimize some search engine parameters before human evaluation. Further research is needed to determine if NT Evaluation or variants of these protocols can fully substitute for human evaluations.
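
Comparing two evaluation protocols by how similarly they rank the same retrieval methods is typically done with Spearman's rank correlation, as in this abstract. The sketch below computes it as the Pearson correlation of average ranks, which also handles ties. The function names and the per-method scores are hypothetical and are not values from the study.

```python
import math


def average_ranks(values):
    """Rank values from 1 upward, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


def spearman(x, y):
    """Spearman's rank correlation between two score lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical per-method scores under two evaluation protocols.
protocol_a = [0.31, 0.28, 0.25, 0.22, 0.20]
protocol_b = [0.40, 0.41, 0.35, 0.30, 0.33]
rho = spearman(protocol_a, protocol_b)
print(round(rho, 2))
```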

  4. Evaluation of medical management during a mass casualty incident exercise: an objective assessment tool to enhance direct observation.

    PubMed

    Ingrassia, Pier Luigi; Prato, Federico; Geddo, Alessandro; Colombo, Davide; Tengattini, Marco; Calligaro, Sara; La Mura, Fabrizio; Franc, Jeffrey Michael; Della Corte, Francesco

    2010-11-01

    Functional exercises represent an important link between disaster planning and disaster response. Although these exercises are widely performed, no standardized method exists for their evaluation. To describe a simple and objective method to assess medical performance during functional exercise events. An evaluation tool comprising three data fields (triage, clinical maneuvers, and radio usage), accompanied by direct anecdotal observational methods, was used to evaluate a large functional mass casualty incident exercise. Seventeen medical responders managed 112 victims of a simulated building explosion. Although 81% of the patients were assigned the appropriate triage codes, evacuation from the site did not follow in priority. Required maneuvers were performed correctly in 85.2% of airway maneuvers and 78.7% of breathing maneuvers, however, significant under-treatment occurred, possibly due to equipment shortages. Extensive use of radio communication was documented. In evaluating this tool, the structured markers were informative, but further information provided by direct observation was invaluable. A three-part tool (triage, medical maneuvers, and radio usage) can provide a method to evaluate functional mass casualty incident exercises, and is easily implemented. For the best results, it should be used in conjunction with direct observation. The evaluation tool has great potential as a reproducible and internationally recognized tool for evaluating disaster management exercises. Copyright © 2010 Elsevier Inc. All rights reserved.

  5. Clinical Performance Evaluations of Third-Year Medical Students and Association With Student and Evaluator Gender.

    PubMed

    Riese, Alison; Rappaport, Leah; Alverson, Brian; Park, Sangshin; Rockney, Randal M

    2017-06-01

    Clinical performance evaluations are major components of medical school clerkship grades. But are they sufficiently objective? This study aimed to determine whether student and evaluator gender is associated with assessment of overall clinical performance. This was a retrospective analysis of 4,272 core clerkship clinical performance evaluations by 829 evaluators of 155 third-year students, within the Alpert Medical School grading database for the 2013-2014 academic year. Overall clinical performance, assessed on a three-point scale (meets expectations, above expectations, exceptional), was extracted from each evaluation, as well as evaluator gender, age, training level, department, student gender and age, and length of observation time. Hierarchical ordinal regression modeling was conducted to account for clustering of evaluations. Female students were more likely to receive a better grade than males (adjusted odds ratio [AOR] 1.30, 95% confidence interval [CI] 1.13-1.50), and female evaluators awarded lower grades than males (AOR 0.72, 95% CI 0.55-0.93), adjusting for department, observation time, and student and evaluator age. The interaction between student and evaluator gender was significant (P = .03), with female evaluators assigning higher grades to female students, while male evaluators' grading did not differ by student gender. Students who spent a short time with evaluators were also more likely to get a lower grade. A one-year examination of all third-year clerkship clinical performance evaluations at a single institution revealed that male and female evaluators rated male and female students differently, even when accounting for other measured variables.

  6. Performance evaluation of automated segmentation software on optical coherence tomography volume data

    PubMed Central

    Tian, Jing; Varga, Boglarka; Tatrai, Erika; Fanni, Palya; Somfai, Gabor Mark; Smiddy, William E.

    2016-01-01

    Over the past two decades a significant number of OCT segmentation approaches have been proposed in the literature. Each methodology has been conceived for and/or evaluated using specific datasets that do not reflect the complexities of the majority of widely available retinal features observed in clinical settings. In addition, there does not exist an appropriate OCT dataset with ground truth that reflects the realities of everyday retinal features observed in clinical settings. While the need for unbiased performance evaluation of automated segmentation algorithms is obvious, the validation of segmentation algorithms has usually been performed by comparison with manual labelings from each study, and there has been a lack of common ground truth. Therefore, a performance comparison of different algorithms using the same ground truth has never been performed. This paper reviews research-oriented tools for automated segmentation of the retinal tissue on OCT images. It also evaluates and compares the performance of these software tools with a common ground truth. PMID:27159849

  7. Automation of immunohistochemical evaluation in breast cancer using image analysis

    PubMed Central

    Prasad, Keerthana; Tiwari, Avani; Ilanthodi, Sandhya; Prabhu, Gopalakrishna; Pai, Muktha

    2011-01-01

    AIM: To automate breast cancer diagnosis and to study the inter-observer and intra-observer variations in the manual evaluations. METHODS: Breast tissue specimens from sixty cases were stained separately for estrogen receptor (ER), progesterone receptor (PR) and human epidermal growth factor receptor-2 (HER-2/neu). All cases were assessed by manual grading as well as image analysis. The manual grading was performed by an experienced expert pathologist. To study inter-observer and intra-observer variations, we obtained readings from another pathologist as the second observer from a different laboratory who has a little less experience than the first observer. We also took a second reading from the second observer to study intra-observer variations. Image analysis was carried out using in-house developed software (TissueQuant). A comparison of the results from image analysis and manual scoring of ER, PR and HER-2/neu was also carried out. RESULTS: The performance of the automated analysis in the case of ER, PR and HER-2/neu expressions was compared with the manual evaluations. The performance of the automated system was found to correlate well with the manual evaluations. The inter-observer variations were measured using Spearman correlation coefficient r and 95% confidence interval. In the case of ER expression, Spearman correlation r = 0.53, in the case of PR expression, r = 0.63, and in the case of HER-2/neu expression, r = 0.68. Similarly, intra-observer variations were also measured. In the case of ER, PR and HER-2/neu expressions, r = 0.46, 0.66 and 0.70, respectively. CONCLUSION: The automation of breast cancer diagnosis from immunohistochemically stained specimens is very useful for providing objective and repeatable evaluations. PMID:21611095
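
    The inter-observer comparison described above relies on the Spearman rank correlation. A minimal sketch of that statistic follows, using only the Python standard library. The observer scores are hypothetical illustration values, not data from the study, and the function names are assumptions introduced here.

```python
from statistics import mean


def ranks(values):
    """Return 1-based ranks, averaging the ranks of tied values."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = avg_rank
        i = j + 1
    return result


def spearman_r(x, y):
    """Spearman correlation, computed as the Pearson correlation of the ranks."""
    rx, ry = ranks(x), ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


# Hypothetical scores from two observers for the same eight specimens.
observer_1 = [0, 3, 5, 7, 8, 2, 6, 4]
observer_2 = [1, 3, 4, 8, 7, 0, 6, 5]
print(round(spearman_r(observer_1, observer_2), 3))
```

    The same function applies to intra-observer variation by passing the first and second readings of a single observer.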

  8. Teacher Supervision and Evaluation: A Case Study of Administrators' and Teachers' Perceptions of Mini Observations

    ERIC Educational Resources Information Center

    Campbell, Thomas F.

    2013-01-01

    This case study will investigate teachers' and administrators' perceptions of the relationship between mini observations and teacher performance to understand what effect, if any, a system of mini observations has on teacher performance, and if mini observations influence a teacher's pedagogical practice differently than a…

  9. Empirical Performance of Covariates in Education Observational Studies

    ERIC Educational Resources Information Center

    Wong, Vivian C.; Valentine, Jeffrey C.; Miller-Bains, Kate

    2017-01-01

    This article summarizes results from 12 empirical evaluations of observational methods in education contexts. We look at the performance of three common covariate-types in observational studies where the outcome is a standardized reading or math test. They are: pretest measures, local geographic matching, and rich covariate sets with a strong…

  10. The Impact of Level of Performance on Feedback Strategy

    ERIC Educational Resources Information Center

    Beaulieu, R. P.; Love, Kevin G.

    2006-01-01

    The primary purpose of this study was to investigate the impact of the level of observed performance on the feedback strategy selected by a performance evaluator. One hundred and twenty-three actual performance evaluators from 15 different organizations and 123 college students reviewed, in groups which ranged from 2 to 20, a job description for…

  11. Nonparametric EROC analysis for observer performance evaluation on joint detection and estimation tasks

    NASA Astrophysics Data System (ADS)

    Wunderlich, Adam; Goossens, Bart

    2014-03-01

    The majority of the literature on task-based image quality assessment has focused on lesion detection tasks, using the receiver operating characteristic (ROC) curve, or related variants, to measure performance. However, since many clinical image evaluation tasks involve both detection and estimation (e.g., estimation of kidney stone composition, estimation of tumor size), there is a growing interest in performance evaluation for joint detection and estimation tasks. To evaluate observer performance on such tasks, Clarkson introduced the estimation ROC (EROC) curve, and the area under the EROC curve as a summary figure of merit. In the present work, we propose nonparametric estimators for practical EROC analysis from experimental data, including estimators for the area under the EROC curve and its variance. The estimators are illustrated with a practical example comparing MRI images reconstructed from different k-space sampling trajectories.
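
    The abstract does not state the estimator itself. As a rough illustration only, the sketch below uses a Mann-Whitney-style form: each signal-present rating is compared with each signal-absent rating, and every "win" is weighted by the utility of the estimate for that signal-present case. This form, the function names, and the sample values are assumptions made here and may differ from the authors' estimators. With all utilities set to 1 it reduces to the ordinary nonparametric area under the ROC curve.

```python
def step(d):
    """Step kernel: 1 if d > 0, 0.5 on a tie, 0 otherwise."""
    if d > 0:
        return 1.0
    return 0.5 if d == 0 else 0.0


def eroc_area(absent_ratings, present_ratings, utilities):
    """Utility-weighted Mann-Whitney estimate of the area under an EROC curve.

    absent_ratings:  observer ratings for signal-absent cases
    present_ratings: observer ratings for signal-present cases
    utilities:       utility (0 to 1) of the parameter estimate for each
                     signal-present case, in the same order
    """
    if len(present_ratings) != len(utilities):
        raise ValueError("one utility is needed per signal-present case")
    total = sum(
        u * step(y - x)
        for y, u in zip(present_ratings, utilities)
        for x in absent_ratings
    )
    return total / (len(absent_ratings) * len(present_ratings))


# Hypothetical ratings and utilities, for illustration only.
absent = [0.1, 0.4, 0.35]
present = [0.8, 0.3, 0.6]
utility = [1.0, 0.5, 0.9]
print(eroc_area(absent, present, utility))          # utility-weighted area
print(eroc_area(absent, present, [1.0, 1.0, 1.0]))  # ordinary ROC area
```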

  12. Performance of the SEAPROG prognosis variant of the forest vegetation simulator.

    Treesearch

    Michael H. McClellan; Frances E. Biles

    2003-01-01

    This paper reports the first phase of a recent effort to evaluate the performance and use of the FVS-SEAPROG vegetation growth model. In this paper, we present our evaluation of SEAPROG’s performance in modeling the growth of even-aged stands regenerated by clearcutting, windthrow, or fire. We evaluated the model by comparing model predictions to observed values from...

  13. Watch what happens: using a web-based multimedia platform to enhance intraoperative learning and development of clinical reasoning.

    PubMed

    Fingeret, Abbey L; Martinez, Rebecca H; Hsieh, Christine; Downey, Peter; Nowygrod, Roman

    2016-02-01

    We aim to determine whether observed operations or internet-based video review predict improved performance in the surgery clerkship. A retrospective review of students' usage of surgical videos, observed operations, evaluations, and examination scores was used to construct an exploratory principal component analysis. Multivariate regression was used to determine factors predictive of clerkship performance. Case log data for 231 students revealed a median of 25 observed cases. Students accessed the web-based video platform a median of 15 times. Principal component analysis yielded 4 factors contributing 74% of the variability with a Kaiser-Meyer-Olkin coefficient of .83. Multivariate regression predicted shelf score (P < .0001), internal clinical skills examination score (P < .0001), subjective evaluations (P < .001), and video website utilization (P < .001) but not observed cases to be significantly associated with overall performance. Utilization of a web-based operative video platform during a surgical clerkship is independently associated with improved clinical reasoning, fund of knowledge, and overall evaluation. Thus, this modality can serve as a useful adjunct to live observation. Copyright © 2016 Elsevier Inc. All rights reserved.

  14. Examining Teacher Effectiveness Using Classroom Observation Scores: Evidence from the Randomization of Teachers to Students

    ERIC Educational Resources Information Center

    Garrett, Rachel; Steinberg, Matthew P.

    2015-01-01

    Despite policy efforts to encourage multiple measures of performance in newly developing teacher evaluation systems, practical constraints often result in evaluations based predominantly on formal classroom observations. Yet there is limited knowledge of how these observational measures relate to student achievement. This article leverages the…

  15. A method to determine the impact of reduced visual function on nodule detection performance.

    PubMed

    Thompson, J D; Lança, C; Lança, L; Hogg, P

    2017-02-01

    In this study we aim to validate a method to assess the impact of reduced visual function and observer performance concurrently with a nodule detection task. Three consultant radiologists completed a nodule detection task under three conditions: without visual defocus (0.00 Dioptres; D), and with two different magnitudes of visual defocus (-1.00 D and -2.00 D). Defocus was applied with lenses and visual function was assessed prior to each image evaluation. Observers evaluated the same cases on each occasion; these comprised 50 abnormal cases containing 1-4 simulated nodules (5, 8, 10 and 12 mm spherical diameter, 100 HU) placed within a phantom, and 25 normal cases (images containing no nodules). Data was collected under the free-response paradigm and analysed using Rjafroc. A difference in nodule detection performance would be considered significant at p < 0.05. All observers had acceptable visual function prior to beginning the nodule detection task. Visual acuity was reduced to an unacceptable level for two observers when defocussed to -1.00 D and for one observer when defocussed to -2.00 D. Stereoacuity was unacceptable for one observer when defocussed to -2.00 D. Despite unsatisfactory visual function in the presence of defocus we were unable to find a statistically significant difference in nodule detection performance (F(2,4) = 3.55, p = 0.130). A method to assess visual function and observer performance is proposed. In this pilot evaluation we were unable to detect any difference in nodule detection performance when using lenses to reduce visual function. Copyright © 2016 The College of Radiographers. Published by Elsevier Ltd. All rights reserved.

  16. Evaluation of full depth asphaltic concrete pavements : final report.

    DOT National Transportation Integrated Search

    1982-10-01

    The aim of this study was to evaluate the full depth asphaltic concrete pavement design concept by observing the performance characteristics of two 13-inch pavements constructed in 1970. Pavement performance measurements, over an 11-year period, incl...

  17. Evaluation of bone marrow aspirates in patients with acute myeloid leukemia at day 14 of induction therapy.

    PubMed

    Souto Filho, João Tadeu D; Loureiro, Monique M; Pulcheri, Wolmar; Morais, José Carlos; Nucci, Marcio; Portugal, Rodrigo D

    2015-07-25

    Early assessment of response to chemotherapy in acute myeloid leukemia may be performed by examining bone marrow aspirate (BMA) or biopsy (BMB); a hypocellular bone marrow sample indicates adequate anti-leukemic activity. We sought to evaluate the quantitative and qualitative assessment of BMA performed on day 14 (D14) of chemotherapy, to verify the inter-observer agreement, to compare the results of BMA and BMB, and to evaluate the impact of D14 blast clearance on the overall survival (OS). A total of 107 patients who received standard induction chemotherapy and had bone marrow samples were included. BMA evaluation was performed by two observers using two methods: quantitative assessment and a qualitative (Likert) scale. ROC curves were obtained correlating the BMA quantification of blasts and the qualitative scale, by both observers, with BMB result as gold-standard. There was a significant agreement between the two observers in both the qualitative and quantitative assessments (Kw = 0.737, p < 0.001, and rs = 0.798, p < 0.001; ICC = 0.836, p < 0.001, respectively). The areas under the curve (AUC) were 0.924 and 0.946 for observer 1 and 0.867 and 0.870 for observer 2 for assessments of the percentage of blasts and qualitative scale, respectively. The best cutoff for blast percentage in BMA was 6% and 7% for observers 1 and 2, respectively. A similar analysis for the qualitative scale showed the best cutoff as "probably infiltrated". Patients who attained higher grades of cytoreduction on D14 had better OS. Evaluation of D14 BMA using both methods had a significant agreement with BMB and between observers, identifying a population of patients with poor outcome.

  18. Final postflight hardware evaluation report RSRM-28 (STS-53)

    NASA Technical Reports Server (NTRS)

    Starrett, William David, Jr.

    1993-01-01

    The final report for the Clearfield disassembly evaluation and a continuation of the KSC postflight assessment for the RSRM-28 (STS-53) RSRM flight set is presented. All observed hardware conditions were documented on PFOR's and are included in Appendices A through C. Appendices D and E contain the measurements and safety factor data for the nozzle and insulation components. This report, along with the KSC Ten-Day Postflight Hardware Evaluation Report (TWR-64215), represents a summary of the RSRM-28 hardware evaluation. The as-flown hardware configuration is documented in TWR-63638. Disassembly evaluation photograph numbers are logged in TWA-1989. The RSRM-28 flight set disassembly evaluations described were performed at the RSRM Refurbishment Facility in Clearfield, Utah. The final factory joint demate occurred on July 15, 1993. Additional time was required to perform the evaluation of the stiffener rings per special issue 4.1.5.2 because of the washout schedule. The release of this report was after completion of all special issues per program management direction. Detailed evaluations were performed in accordance with the Clearfield PEEP, TWR-50051, Revision A. All observations were compared against limits that are also defined in the PEEP. These limits outline the criteria for categorizing the observations as acceptable, reportable, or critical. Hardware conditions that were unexpected and/or determined to be reportable or critical were evaluated by the applicable team and tracked through the PFAR system.

  19. Intra-observer reproducibility and diagnostic performance of breast shear-wave elastography in Asian women.

    PubMed

    Park, Hye Young; Han, Kyung Hwa; Yoon, Jung Hyun; Moon, Hee Jung; Kim, Min Jung; Kim, Eun-Kyung

    2014-06-01

    Our aim was to evaluate intra-observer reproducibility of shear-wave elastography (SWE) in Asian women. Sixty-four breast masses (24 malignant, 40 benign) were examined with SWE in 53 consecutive Asian women (mean age, 44.9 y old). Two SWE images were obtained for each of the lesions. The intra-observer reproducibility was assessed by intra-class correlation coefficients (ICC). We also evaluated various clinicoradiologic factors that can influence reproducibility in SWE. The ICC of intra-observer reproducibility was 0.789. In clinicoradiologic factor evaluation, masses surrounded by mixed fatty and glandular tissue (ICC: 0.619) showed lower intra-observer reproducibility compared with lesions that were surrounded by glandular tissue alone (ICC: 0.937; p < 0.05). Overall, the intra-observer reproducibility of breast SWE was excellent in Asian women. However, it may decrease when breast tissue is in a heterogeneous background. Therefore, SWE should be performed carefully in these cases. Copyright © 2014 World Federation for Ultrasound in Medicine & Biology. Published by Elsevier Inc. All rights reserved.

  20. Exercising upper respiratory videoendoscopic evaluation of 100 nonracing performance horses with abnormal respiratory noise and/or poor performance.

    PubMed

    Davidson, E J; Martin, B B; Boston, R C; Parente, E J

    2011-01-01

    Although dynamic upper airway abnormalities are well documented in racehorses, there is a paucity of literature regarding their prevalence in nonracing performance horses. To describe upper airway function of nonracing performance horses with abnormal respiratory noise and/or poor performance via exercising upper airway videoendoscopy. Medical records of nonracing performance horses admitted for exercising evaluation with a chief complaint of abnormal respiratory noise and/or poor performance were reviewed. All horses had video recordings of resting and exercising upper airway endoscopy. Relationships between horse demographics, resting endoscopic findings, treadmill intensity and implementation of head and neck flexion during exercise with exercising endoscopic findings were examined. Dynamic upper airway obstructions were observed in 72% of examinations. Head and neck flexion was necessary to obtain a diagnosis in 21 horses. Pharyngeal wall collapse was the most prevalent upper airway abnormality, observed in 31% of the examinations. Complex abnormalities were noted in 27% of the examinations. Resting laryngeal dysfunction was significantly associated with dynamic arytenoid collapse and the odds of detecting intermittent dorsal displacement of the soft palate (DDSP) during exercise in horses with resting DDSP was only 7.7%. Exercising endoscopic observations were different from the resting observations in 54% of examinations. Dynamic upper airway obstructions were common in nonracing performance horses with respiratory noise and/or poor performance. Resting endoscopy was only helpful in determining exercising abnormalities with recurrent laryngeal neuropathy. This study emphasises the importance of exercising endoscopic evaluation in nonracing performance horses with abnormal respiratory noise and/or poor performance for accurate assessment of dynamic upper airway function. © 2010 EVJ Ltd.

  1. Interpersonal relationship modulates brain responses to outcome evaluation when gambling for/against others: an electrophysiological analysis.

    PubMed

    Leng, Yue; Zhou, Xiaolin

    2014-10-01

    When individuals play a gambling task and their actions have consequences for observers, how are the brain responses of the performers modulated by their interpersonal relationship with the observers? To address this issue, we examined the event-related potential (ERP) responses in performers while they played two gambling games: one during which they tried to earn money for the observers instead of themselves (i.e., Experiment 1) and another gambling game during which they attempted to earn money from the observers (i.e., Experiment 2). In Experiment 1, ERP results showed that when gambling for either the friends or the strangers, the feedback-related negativity (FRN) responses were more negative-going to the losses than to the gains. The FRN effect (loss minus gain) was significantly larger when gambling for the friends than for the strangers. The general P300 response was more positive-going when gambling for the friends than for the strangers. These results suggested that gambling for others enables individuals to assess the outcome from the interests of the other people; consequently, the FRN response may be driven by the evaluative process related to interests of the others. Because one's own economic interests were not involved, the performers' brain responses during both the early, semi-automatic stage (i.e., the FRN) and the later, controlled stage (i.e., the P300) of outcome evaluation were modulated by the interpersonal relationship between the performers and the observers. In Experiment 2, ERP results revealed that when gambling against others, the FRN response was more negative-going to the losses than to the gains, as well. However, neither the FRN effect nor the general FRN response was modulated by interpersonal relationship. The general P300 response was more positive-going when gambling against the stranger than against the friend. These results suggested that when gambling against others, the performers' FRN response may be driven by two evaluative processes: one is related to the interests of their own, and another is related to the interests of the other people; and the former one plays a dominant role. Because of high self-involvement, only the performers' brain responses during the later controlled stage of outcome evaluation were modulated by interpersonal relationship. The present study extended previous research on brain responses to outcome evaluation when decision making actions have consequences for the other people by suggesting that the FRN response in the performer could also be driven by two evaluative processes. In addition, whether the FRN in the performer was modulated by interpersonal relationship depends on which evaluative process plays a dominant role. However, the P300 in the performer could always be modulated by interpersonal relationship. These findings provide evidence on outcome evaluation being composed of an early semi-automatic primitive process and a later controlled cognitive/affective appraisal process. Copyright © 2014 Elsevier Ltd. All rights reserved.

  2. Using satellite observations in performance evaluation for regulatory air quality modeling: Comparison with ground-level measurements

    NASA Astrophysics Data System (ADS)

    Odman, M. T.; Hu, Y.; Russell, A.; Chai, T.; Lee, P.; Shankar, U.; Boylan, J.

    2012-12-01

    Regulatory air quality modeling, such as State Implementation Plan (SIP) modeling, requires that model performance meets recommended criteria in the base-year simulations using period-specific, estimated emissions. The goal of the performance evaluation is to assure that the base-year modeling accurately captures the observed chemical reality of the lower troposphere. Any significant deficiencies found in the performance evaluation must be corrected before any base-case (with typical emissions) and future-year modeling is conducted. Corrections are usually made to model inputs such as emission-rate estimates or meteorology and/or to the air quality model itself, in modules that describe specific processes. Use of ground-level measurements that follow approved protocols is recommended for evaluating model performance. However, ground-level monitoring networks are spatially sparse, especially for particulate matter. Satellite retrievals of atmospheric chemical properties such as aerosol optical depth (AOD) provide spatial coverage that can compensate for the sparseness of ground-level measurements. Satellite retrievals can also help diagnose potential model or data problems in the upper troposphere. It is possible to achieve good model performance near the ground, but have, for example, erroneous sources or sinks in the upper troposphere that may result in misleading and unrealistic responses to emission reductions. Despite these advantages, satellite retrievals are rarely used in model performance evaluation, especially for regulatory modeling purposes, due to the high uncertainty in retrievals associated with various contaminations, for example by clouds. In this study, 2007 was selected as the base year for SIP modeling in the southeastern U.S. Performance of the Community Multiscale Air Quality (CMAQ) model, at a 12-km horizontal resolution, for this annual simulation is evaluated using both recommended ground-level measurements and non-traditional satellite retrievals. Evaluation results are assessed against recommended criteria and peer studies in the literature. Further analysis is conducted, based upon these assessments, to discover likely errors in model inputs and potential deficiencies in the model itself. Correlations as well as differences in input errors and model deficiencies revealed by ground-level measurements versus satellite observations are discussed. Additionally, sensitivity analyses are employed to investigate errors in emission-rate estimates using either ground-level measurements or satellite retrievals, and the results are compared against each other considering observational uncertainties. Recommendations are made for how to effectively utilize satellite retrievals in regulatory air quality modeling.

  3. Dribble Files: Methodologies to Evaluate Learning and Performance in Complex Environments

    ERIC Educational Resources Information Center

    Schrader, P. G.; Lawless, Kimberly A.

    2007-01-01

    Research in the area of technology learning environments is tremendously complex. Tasks performed in these contexts are highly cognitive and mostly invisible to the observer. The nature of performance in these contexts is explained not only by the outcome but also by the process. However, evaluating the learning process with respect to tasks…

  4. Lessons from cross-fleet/cross-airline observations - Evaluating the impact of CRM/LOFT training

    NASA Technical Reports Server (NTRS)

    Butler, Roy E.

    1991-01-01

    A review is presented of the crew resource management/line oriented flight training (CRM/LOFT) program to help determine the level of standardization across fleets and airlines in the critical area of evaluating crew behavior and performance. One of the goals of the project is to verify that check airmen and LOFT instructors within organizations are evaluating CRM issues consistently and that differences observed between fleets are not a function of idiosyncrasies on the part of observers. Attention is given to the research tools for crew evaluation.

  5. Skylab program earth resources experiment package. Volume 4: Sensor performance evaluation (S193 R/S). [radiometer/scatterometer]

    NASA Technical Reports Server (NTRS)

    Kenney, G. P.

    1975-01-01

    The results of the sensor performance evaluation of the 13.9 GHz radiometer/scatterometer, which was part of the earth resources experiment package on Skylab, are presented. Findings are presented in the areas of housekeeping parameters, antenna gain and scanning performance, dynamic range, linearity, precision, resolution, stability, integration time, and transmitter output. Supplementary analyses covering performance anomalies, data stream peculiarities, aircraft sensor data comparisons, scatterometer saturation characteristics, and RF heating effects are reported. Results of the evaluation show that instrument performance was generally as expected, but capability degradations were observed to result from three major anomalies. Conclusions are drawn from the evaluation results, and recommendations for improving the effectiveness of a future program are offered. An addendum describes the special evaluation techniques developed and applied in the sensor performance evaluation tasks.

  6. 49 CFR 192.805 - Qualification program.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... include provisions to: (a) Identify covered tasks; (b) Ensure through evaluation that individuals performing covered tasks are qualified; (c) Allow individuals that are not qualified pursuant to this subpart to perform a covered task if directed and observed by an individual that is qualified; (d) Evaluate...

  7. Evaluation of camouflage pattern performance of textiles by human observers and CAMAELEON

    NASA Astrophysics Data System (ADS)

    Heinrich, Daniela H.; Selj, Gorm K.

    2017-10-01

    Military textiles with camouflage patterns are an important part of the protection measures for soldiers. Military operational environments differ a lot depending on climate and vegetation. This requires very different camouflage patterns to achieve good protection. To find the best performing pattern for given environments we have in earlier evaluations mainly applied observer trials as the evaluation method. In these camouflage evaluation tests, human observers were asked to search for targets (in natural settings) presented on a high resolution PC screen, and the corresponding detection times were recorded. Another possibility is to base the evaluation on simulations. CAMAELEON is a licensed tool that ranks camouflaged targets by their similarity with local backgrounds. The similarity is estimated through the parameters local contrast, orientation of structures in the pattern and spatial frequency, by mimicking the response and signal processing in the visual cortex of the human eye. Simulations have a number of advantages over observer trials, for example, that they are more flexible, cheaper, and faster. Applying these two methods to the same images of camouflaged targets, we found that CAMAELEON simulation results didn't match observer trial results for targets with disruptive patterns. This finding now calls for follow-up studies in order to learn more about the advantages and pitfalls of CAMAELEON. During recent observer trials we studied new camouflage patterns and the effect of additional equipment, such as combat vests. In this paper we will present the results from a study comparing evaluation results of human based observer trials and CAMAELEON.

  8. The Independence and Interdependence of Coacting Observers in Regard to Performance Efficiency, Workload, and Stress in a Vigilance Task

    DTIC Science & Technology

    2016-09-01

    independence/dependence, evaluation apprehension, workload, stress ... Introduction: Vigilance or sustained attention tasks require observers to maintain

  9. Application of Wavelet Filters in an Evaluation of Photochemical Model Performance

    EPA Science Inventory

    Air quality model evaluation can be enhanced with time-scale specific comparisons of outputs and observations. For example, high-frequency (hours to one day) time scale information in observed ozone is not well captured by deterministic models and its incorporation into model pe...

  10. Performance Evaluation of Particle Sampling Probes for Emission Measurements of Aircraft Jet Engines

    NASA Technical Reports Server (NTRS)

    Lee, Poshin; Chen, Da-Ren; Sanders, Terry (Technical Monitor)

    2001-01-01

    Considerable attention has recently been given to the impact of aircraft-produced aerosols upon the global climate. Sampling particles directly from jet engines has been performed by different research groups in the U.S. and Europe. However, a large variation has been observed among published data on the conversion efficiency and emission indexes of jet engines. The variation surely results from the differences in test engine types, engine operation conditions, and environmental conditions. The other factor that could result in the observed variation is the performance of the sampling probes used. Unfortunately, it is often neglected in the jet engine community. Particle losses during the sampling, transport, and dilution processes are often not discussed or considered in the literature. To address this issue, we evaluated the performance of one sampling probe by challenging it with monodisperse particles. A significant performance difference was observed for the sampling probe evaluated under different temperature conditions. Thermophoretic effect, nonisokinetic sampling and turbulence loss contribute to the loss of particles in sampling probes. The results of this study show that particle loss can be dramatic if the sampling probe is not well designed. Further, the result allows one to recover the actual size distributions emitted from jet engines.

  11. Final postflight hardware evaluation report RSRM-32 (STS-57)

    NASA Technical Reports Server (NTRS)

    Nielson, Greg

    1993-01-01

    This document is the final report for the postflight assessment of the RSRM-32 (STS-57) flight set. This report presents the disassembly evaluations performed at the Thiokol facilities in Utah and is a continuation of the evaluations performed at KSC (TWR-64239). The PEEP for this assessment is outlined in TWR-50051, Revision B. The PEEP defines the requirements for evaluating RSRM hardware. Special hardware issues pertaining to this flight set requiring additional or modified assessment are outlined in TWR-64237. All observed hardware conditions were documented on PFOR's which are included in Appendix A. Observations were compared against limits defined in the PEEP. Any observation that was categorized as reportable or had no defined limits was documented on a preliminary PFAR by the assessment engineers. Preliminary PFAR's were reviewed by the Thiokol SPAT Executive Board to determine if elevation to PFAR's was required.

  12. Intra- and inter-observer reliability of ten major histological scoring systems used for the evaluation of in vivo cartilage repair.

    PubMed

    Bonasia, Davide Edoardo; Marmotti, Antongiulio; Massa, Alessandro Domenico Felice; Ferro, Andrea; Blonna, Davide; Castoldi, Filippo; Rossi, Roberto

    2015-09-01

    In the last two decades, many surgical techniques have been described for articular cartilage repair. Reliable histological scoring systems are fundamental tools to evaluate new procedures. Several histological scoring systems have been described, and these can be divided into elementary and comprehensive scores, according to the number of sub-items. The aim of this study was to test the inter- and intra-observer reliability of ten main scores used for the histological evaluation of in vivo cartilage repair. The authors tested the starting hypothesis that elementary scores would show superior intra- and inter-observer reliability compared with comprehensive scores. Fifty histological sections obtained from the trochlea of New Zealand rabbits and stained with Safranin-O fast green were used. The histological sections were analysed by 4 observers: 2 experienced in cartilage histology and 2 inexperienced. Histological evaluations were performed at time 1 and time 2, separated by a 30-day interval. The following scores were used: Mankin, O'Driscoll, Pineda, Wakitani, Fortier, Sellers, ICRS, ICRSII, Oswestry (OsScore) and modified O'Driscoll. Intra- and inter-observer reliability were evaluated for each score. In addition, the pavement-ceiling effect and the Bland-Altman Coefficient of Repeatability were evaluated for each sub-item of every score. Intra-observer reliability was high for all observers in every score, even though the reliability was significantly lower for non-expert observers compared with expert counterparts. In terms of Coefficient of Repeatability, some scores performed better (O'Driscoll, Modified O'Driscoll and ICRSII) than others (Fortier, Sellers). Inter-observer reliability was high for all observers in every score, but significantly lower for non-expert compared with expert observers. In expert hands, all the scores showed high intra- and inter-observer reliability, independently of the complexity. Although every score has advantages and disadvantages, ICRSII, O'Driscoll and Modified O'Driscoll scores should be preferred for the evaluation of in vivo cartilage repair in animal models.
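
    The abstract does not say how the Coefficient of Repeatability was computed. As a minimal sketch, assuming the common Bland-Altman convention of 1.96 times the standard deviation of the paired differences between two scoring sessions, it could look like this (the function name and the scores are hypothetical):

```python
from statistics import stdev


def coefficient_of_repeatability(time1, time2):
    """Return 1.96 * SD of the paired differences between two sessions.

    Assumption: one score per histological section in each session,
    listed in the same order in both lists.
    """
    if len(time1) != len(time2) or len(time1) < 2:
        raise ValueError("need two equally long lists with at least two scores")
    diffs = [a - b for a, b in zip(time1, time2)]
    return 1.96 * stdev(diffs)


# Hypothetical scores given by one observer to five sections, 30 days apart.
session1 = [10, 12, 8, 14, 9]
session2 = [11, 12, 7, 13, 10]
cr = coefficient_of_repeatability(session1, session2)
```

    A smaller value means that repeated readings by the same observer lie closer together.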

  13. A four-alternative forced choice (4AFC) software for observer performance evaluation in radiology

    NASA Astrophysics Data System (ADS)

    Zhang, Guozhi; Cockmartin, Lesley; Bosmans, Hilde

    2016-03-01

    The four-alternative forced choice (4AFC) test is a psychophysical method that can be adopted for observer performance evaluation in radiological studies. While the concept of this method is well established, difficulties in handling large image data, performing unbiased sampling, and keeping track of the choices made by the observer have restricted its application in practice. In this work, we propose an easy-to-use software that can help perform 4AFC tests with DICOM images. The software suits any experimental design that follows the 4AFC approach. It has a powerful image viewing system that favorably simulates the clinical reading environment. The graphical interface allows the observer to adjust various viewing parameters and perform the selection with very simple operations. The sampling process involved in 4AFC as well as the speed and accuracy of the choices made by the observer are precisely monitored in the background and can be easily exported for test analysis. The software also has a defensive mechanism for data management and operation control that minimizes the possibility of user mistakes during the test. This software can largely facilitate the use of the 4AFC approach in radiological observer studies and is expected to have widespread applicability.
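
    The bookkeeping behind a 4AFC test can be illustrated with a short sketch. This is not the software described above; the function names are hypothetical. It assumes that the signal position is drawn uniformly from four positions on every trial and that performance is summarized as the proportion of correct choices, for which guessing alone gives 0.25 on average:

```python
import random


def draw_target_positions(n_trials, seed=None):
    """Draw the signal position (0-3) uniformly at random for each trial."""
    rng = random.Random(seed)
    return [rng.randrange(4) for _ in range(n_trials)]


def proportion_correct(target_positions, choices):
    """Fraction of trials in which the observer picked the signal position."""
    if len(target_positions) != len(choices) or not choices:
        raise ValueError("need one recorded choice per trial")
    hits = sum(t == c for t, c in zip(target_positions, choices))
    return hits / len(choices)


positions = draw_target_positions(20, seed=1)
# Hypothetical observer who always picks position 0.
score = proportion_correct(positions, [0] * len(positions))
```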

  14. Evaluation of Eco-Efficiency and Performance of Retrofit Materials

    NASA Astrophysics Data System (ADS)

    Gopinath, Smitha; Rama Chandra Murthy, A.; Iyer, Nagesh R.; Kokila, S.

    2015-12-01

    In this work, three materials, namely Fiber Reinforced Polymer (FRP), ferrocement and Textile Reinforced Concrete (TRC), have been evaluated for their performance efficiency and eco-effectiveness for sustainable retrofitting applications. Investigations have been carried out for flexural strengthening of RC beams with FRP, ferrocement and TRC. It is observed that in the case of FRP, it is not possible to tailor the material according to design requirements and most of the time the strengthened structure becomes overly stiff. Eco-effectiveness of these retrofitting materials has been evaluated by computing the embodied energy. It is observed that the amount of CO2 emitted by TRC is lower than that of other retrofit materials. Further, the performance point of retrofitted RC frames has been evaluated and a damage index has been calculated to find out the effective retrofit material. It is concluded that, if an RC frame is retrofitted with FRP and TRC, it undergoes less damage compared to ferrocement.

  15. Evaluation of modern cotton harvest systems on irrigated cotton: harvester performance

    USDA-ARS's Scientific Manuscript database

    Picker and stripper harvest systems were evaluated on production-scale irrigated cotton on the High Plains of Texas over three harvest seasons. Observations on harvester performance, including time-in-motion, harvest loss, seed cotton composition, and turnout, were conducted at seven locations with...

  16. Evaluation of Immediate Actions Taken to Deal with Cracking Problems Observed in Wheels of Rail Commuter Cars

    DOT National Transportation Integrated Search

    1993-07-01

    The report is the first in a series of engineering studies on railroad vehicle wheel performance. Preliminary studies are summarized, involving evaluation of actions taken to respond to high rates of crack occurrence observed in the wheels of certain...

  17. Administrators' Perceptions Regarding the Effectiveness of the Teacher Observation Evaluation System

    ERIC Educational Resources Information Center

    Williams, Kathleen Riley

    2015-01-01

    This phenomenological narrative study was designed to explore public school administrators' perceptions regarding Louisiana's Compass teacher observation evaluation system as a method for assessing teacher performance. Participants were administrators with at least two years of experience as a public school administrator at the secondary level,…

  18. Guidelines for reporting evaluations based on observational methodology.

    PubMed

    Portell, Mariona; Anguera, M Teresa; Chacón-Moscoso, Salvador; Sanduvete-Chaves, Susana

    2015-01-01

    Observational methodology is one of the most suitable research designs for evaluating fidelity of implementation, especially in complex interventions. However, the conduct and reporting of observational studies is hampered by the absence of specific guidelines, such as those that exist for other evaluation designs. This lack of specific guidance poses a threat to the quality and transparency of these studies and also constitutes a considerable publication hurdle. The aim of this study thus was to draw up a set of proposed guidelines for reporting evaluations based on observational methodology. The guidelines were developed by triangulating three sources of information: observational studies performed in different fields by experts in observational methodology, reporting guidelines for general studies and studies with similar designs to observational studies, and proposals from experts in observational methodology at scientific meetings. We produced a list of guidelines grouped into three domains: intervention and expected outcomes, methods, and results. The result is a useful, carefully crafted set of simple guidelines for conducting and reporting observational studies in the field of program evaluation.

  19. Development of a measure of student self-evaluation of physics exam performance

    NASA Astrophysics Data System (ADS)

    Hagedorn, Eric Anthony

    The central purpose of this study was to provide preliminary evidence of the reliability and validity of the SEVSI-P (Self-evaluation scaled instrument - physics). This instrument, designed to measure student self-evaluation of physics exam performance, was developed in congruence with social cognitive theory. Self-evaluation in this study is defined to consist of two of the three subprocesses of self-regulation: self-observation and judgmental process. As such, the SEVSI-P consists of two subscales, one measuring the frequency and types of self-observations made during a physics exam and one measuring the frequency and types of judgmental comparisons made after an exam. Data from 621 completed surveys, voluntarily taken by first semester algebra/trigonometry based physics students at six Midwestern universities and one Southern university, were analyzed for reliability and factorial validity. Cronbach alphas of .71 and .83 for the self-observation and judgment subscales, respectively, indicate acceptable reliability for the instrument. Confirmatory factor analysis indicates the acceptability of the hypothesis that the data analyzed could have indeed been obtained from the proposed two factor model (self-observation and judgment). The results of this confirmatory factor analysis provide preliminary construct validity for this instrument. A number of theoretically related items were included on the SEVSI-P form to elicit information about the use of goals and pre-planned strategies, actions taken in response to previous poor performances, and emotional responses to performance. A correlational analysis of these items along with the self-observation and judgment subscale scores provided a limited degree of convergent validity for the two subscales. Analyses of variance were done to determine the presence of differences in scoring patterns based on gender or reported ethnic origin. These results indicate slightly higher judgment subscale scores for women and members of minority groups. The implications of these differences are suggested as warranting future research. Future uses of the SEVSI-P include classroom use to help students self-evaluate their exam performances in order to increase their achievement. Future research using the SEVSI-P to determine the causal relationships between self-evaluation, actual achievement, and other social cognitive constructs such as self-efficacy is suggested.
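
    The reliability figures above are Cronbach alphas. A minimal sketch of the standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), is given here for illustration. The data layout and the sample responses are hypothetical and are not taken from the study:

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(responses[0])
    if k < 2 or len(responses) < 2:
        raise ValueError("need at least two items and two respondents")
    items = list(zip(*responses))  # one tuple of scores per item
    item_var_sum = sum(variance(col) for col in items)
    total_var = variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - item_var_sum / total_var)


# Hypothetical answers of five students to a four-item subscale.
sample = [
    [3, 4, 3, 5],
    [2, 2, 3, 3],
    [4, 5, 4, 4],
    [1, 2, 2, 1],
    [3, 3, 4, 4],
]
alpha = cronbach_alpha(sample)
```

    Values closer to 1 indicate that the items of a subscale vary together.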

  20. Multi-phenomenology Observation Network Evaluation Tool (MONET)

    NASA Astrophysics Data System (ADS)

    Oltrogge, D.; North, P.; Vallado, D.

    2014-09-01

    Evaluating overall performance of an SSA "system-of-systems" observational network collecting against thousands of Resident Space Objects (RSO) is very difficult for typical tasking or scheduling-based analysis tools. This is further complicated by networks that have a wide variety of sensor types and phenomena, to include optical, radar and passive RF types, each having unique resource, ops tempo, competing customer and detectability constraints. We present details of the Multi-phenomenology Observation Network Evaluation Tool (MONET), which circumvents these difficulties by assessing the ideal performance of such a network via a digitized supply-vs-demand approach. Cells of each sensor's supply time are distributed among RSO targets of interest to determine the average performance of the network against that set of RSO targets. Orbit Determination heuristics are invoked to represent observation quantity and geometry notionally required to obtain the desired orbit estimation quality. To feed this approach, we derive the detectability and collection rate performance of optical, radar and passive RF sensor physical and performance characteristics. We then prioritize the selected RSO targets according to object size, active/inactive status, orbit regime, and/or other considerations. Finally, the OD-derived tracking demands of each RSO of interest are levied against remaining sensor supply until either (a) all sensor time is exhausted; or (b) the list of RSO targets is exhausted. The outputs from MONET include overall network performance metrics delineated by sensor type, objects and orbits tracked, along with likely orbit accuracies which might result from the conglomerate network tracking.
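
    The final supply-vs-demand step can be sketched in a few lines. This is a simplified illustration and not MONET itself. It assumes a single pooled budget of sensor time, ignores sensor type and detectability, and uses hypothetical names and numbers:

```python
def allocate_supply(supply_seconds, targets):
    """Levy prioritized tracking demands against a pooled sensor-time supply.

    targets: list of (name, priority, demand_seconds). A lower priority
    number is served first (an assumption of this sketch).
    Returns (list of (name, granted_seconds), remaining_supply).
    """
    remaining = supply_seconds
    served = []
    for name, _priority, demand in sorted(targets, key=lambda t: t[1]):
        if remaining <= 0:
            break  # stop condition (a): all sensor time is exhausted
        granted = min(demand, remaining)
        served.append((name, granted))
        remaining -= granted
    # Leaving the loop without the break is stop condition (b):
    # the list of targets is exhausted.
    return served, remaining


# Hypothetical demands, in seconds of sensor time.
served, left = allocate_supply(
    100, [("RSO-A", 2, 60), ("RSO-B", 1, 70), ("RSO-C", 3, 10)]
)
```

    In this example the highest-priority target is fully served, the second is partly served, and the third receives no time.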

  1. Kalman-Filter-Based Orientation Determination Using Inertial/Magnetic Sensors: Observability Analysis and Performance Evaluation

    PubMed Central

    Sabatini, Angelo Maria

    2011-01-01

    In this paper we present a quaternion-based Extended Kalman Filter (EKF) for estimating the three-dimensional orientation of a rigid body. The EKF exploits the measurements from an Inertial Measurement Unit (IMU) that is integrated with a tri-axial magnetic sensor. Magnetic disturbances and gyro bias errors are modeled and compensated by including them in the filter state vector. We employ the observability rank criterion based on Lie derivatives to verify the conditions under which the nonlinear system that describes the process of motion tracking by the IMU is observable, namely it may provide sufficient information for performing the estimation task with bounded estimation errors. The observability conditions are that the magnetic field, perturbed by first-order Gauss-Markov magnetic variations, and the gravity vector are not collinear and that the IMU is subject to some angular motions. Computer simulations and experimental testing are presented to evaluate the algorithm performance, including when the observability conditions are critical. PMID:22163689
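
    One of the observability conditions stated above, that the magnetic field and the gravity vector are not collinear, can be checked numerically. The sketch below is only an illustration of that geometric condition and is not part of the paper's Lie-derivative analysis. The tolerance, the function name and the vectors are assumptions:

```python
import math


def are_collinear(u, v, tol_deg=1.0):
    """Return True if 3-D vectors u and v are parallel or anti-parallel
    to within tol_deg degrees."""
    cross = (
        u[1] * v[2] - u[2] * v[1],
        u[2] * v[0] - u[0] * v[2],
        u[0] * v[1] - u[1] * v[0],
    )
    dot = sum(a * b for a, b in zip(u, v))
    cross_norm = math.sqrt(sum(c * c for c in cross))
    angle = math.degrees(math.atan2(cross_norm, dot))  # in [0, 180]
    return angle < tol_deg or angle > 180.0 - tol_deg


gravity = (0.0, 0.0, -9.81)        # m/s^2, hypothetical sensor-frame reading
magnetic = (20.0, 0.0, -40.0)      # microtesla, hypothetical
degenerate = are_collinear(gravity, magnetic)
```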

  2. Using cloud-based mobile technology for assessment of competencies among medical students.

    PubMed

    Ferenchick, Gary S; Solomon, David

    2013-01-01

    Valid, direct observation of medical student competency in clinical settings remains challenging and limits the opportunity to promote performance-based student advancement. The rationale for direct observation is to ascertain that students have acquired the core clinical competencies needed to care for patients. Too often student observation results in highly variable evaluations which are skewed by factors other than the student's actual performance. Barriers to effective direct observation and assessment include the lack of effective tools and strategies for assuring that transparent standards are used for judging clinical competency in authentic clinical settings. We developed a web-based content management system under the name Just in Time Medicine (JIT) to address many of these issues. The goals of JIT were fourfold: First, to create a self-service interface allowing faculty with average computing skills to author customizable content and criterion-based assessment tools displayable on internet enabled devices, including mobile devices; second, to create an assessment and feedback tool capable of capturing learner progress related to hundreds of clinical skills; third, to enable easy access and utilization of these tools by faculty for learner assessment in authentic clinical settings as a means of just in time faculty development; fourth, to create a permanent record of the trainees' observed skills useful for both learner and program evaluation. From July 2010 through October 2012, we implemented a JIT enabled clinical evaluation exercise (CEX) among 367 third year internal medicine students. Observers (attending physicians and residents) performed CEX assessments using JIT to guide and document their observations, record their time observing and providing feedback to the students, and their overall satisfaction. Inter-rater reliability and validity were assessed with 17 observers who viewed six videotaped student-patient encounters and by measuring the correlation between student CEX scores and their scores on subsequent standardized-patient OSCE exams. A total of 3567 CEXs were completed by 516 observers. The average number of evaluations per student was 9.7 (±1.8 SD) and the average number of CEXs completed per observer was 6.9 (±15.8 SD). Observers spent less than 10 min on 43-50% of the CEXs and 68.6% on feedback sessions. A majority of observers (92%) reported satisfaction with the CEX. Inter-rater reliability was measured at 0.69 among all observers viewing the videotapes and these ratings adequately discriminated competent from non-competent performance. The measured CEX grades correlated with subsequent student performance on an end-of-year OSCE. We conclude that the use of JIT is feasible in capturing discrete clinical performance data with a high degree of user satisfaction. Our embedded checklists had adequate inter-rater reliability and concurrent and predictive validity.

  3. A Self-Evaluation Instrument for Work Performance and Support Needs

    ERIC Educational Resources Information Center

    Brady, Michael P.; Rosenberg, Howard; Frain, Michael P.

    2008-01-01

    Involvement of students and adult employees into the decisions that affect their education and employment can improve their transition into supported employment. One means for increasing involvement into these decisions is to gain their input into performance evaluations and support needs. The "Job Observation and Behavior Scale: Opportunity for…

  4. Creation of an ensemble of simulated cardiac cases and a human observer study: tools for the development of numerical observers for SPECT myocardial perfusion imaging

    NASA Astrophysics Data System (ADS)

    O'Connor, J. Michael; Pretorius, P. Hendrik; Gifford, Howard C.; Licho, Robert; Joffe, Samuel; McGuiness, Matthew; Mehurg, Shannon; Zacharias, Michael; Brankov, Jovan G.

    2012-02-01

    Our previous Single Photon Emission Computed Tomography (SPECT) myocardial perfusion imaging (MPI) research explored the utility of numerical observers. We recently created two hundred and eighty simulated SPECT cardiac cases using Dynamic MCAT (DMCAT) and SIMIND Monte Carlo tools. All simulated cases were then processed with two reconstruction methods: iterative ordered subset expectation maximization (OSEM) and filtered back-projection (FBP). Observer study sets were assembled for both OSEM and FBP methods. Five physicians performed an observer study on one hundred and seventy-nine images from the simulated cases. The observer task was to indicate detection of any myocardial perfusion defect using the American Society of Nuclear Cardiology (ASNC) 17-segment cardiac model and the ASNC five-scale rating guidelines. Human observer Receiver Operating Characteristic (ROC) studies established the guidelines for the subsequent evaluation of numerical model observer (NO) performance. Several NOs were formulated and their performance was compared with the human observer performance. One type of NO was based on evaluation of a cardiac polar map that had been pre-processed using a gradient-magnitude watershed segmentation algorithm. The second type of NO was also based on analysis of a cardiac polar map but with use of a priori calculated average image derived from an ensemble of normal cases.

  5. Clinical performance of a glass ionomer restorative system: a 6-year evaluation.

    PubMed

    Gurgan, Sevil; Kutuk, Zeynep Bilge; Ergin, Esra; Oztas, Sema Seval; Cakir, Filiz Yalcin

    2017-09-01

    The aim of this study is to evaluate the long-term clinical performance of a glass ionomer (GI) restorative system in the restoration of posterior teeth compared with a micro-filled hybrid posterior composite. A total of 140 (80 Cl1 and 60 Cl2) lesions in 59 patients were restored with a GI system (Equia) or a micro hybrid composite (Gradia Direct). Restorations were evaluated at baseline and yearly during 6 years according to the modified-USPHS criteria. Negative replicas at each recall were observed under SEM to evaluate surface characteristics. Data were analyzed with Cochran's Q and McNemar's tests (p < 0.05). One hundred fifteen (70 Cl1 and 45 Cl2) restorations were evaluated in 47 patients with a recall rate of 79.6% at 6 years. Significant differences were found in marginal adaptation and marginal discoloration for both restorative materials for Cl1 and Cl2 restorations (p < 0.05). However, neither material was superior to the other (p > 0.05). A significant decrease in color match was observed in Equia restorations (p < 0.05). Only one Cl2 Equia restoration was missing at 3 years and another one at 4 years. No failures were observed at 5 and 6 years. Both materials exhibited clinically successful performance after 6 years. SEM evaluations were in accordance with the clinical findings. Both materials showed a good clinical performance for the restoration of posterior teeth during the 6-year evaluation. The clinical effectiveness of Equia and Gradia Direct Posterior was acceptable in Cl1 and Cl2 cavities subsequent to 6-year evaluation.

  6. Performance evaluation of contrast-detail in full field digital mammography systems using ideal (Hotelling) observer vs. conventional automated analysis of CDMAM images for quality control of contrast-detail characteristics.

    PubMed

    Delakis, Ioannis; Wise, Robert; Morris, Lauren; Kulama, Eugenia

    2015-11-01

    The purpose of this work was to evaluate the contrast-detail performance of full field digital mammography (FFDM) systems using ideal (Hotelling) observer Signal-to-Noise Ratio (SNR) methodology and ascertain whether it can be considered an alternative to the conventional, automated analysis of CDMAM phantom images. Five FFDM units currently used in the national breast screening programme were evaluated, which differed with respect to age, detector, Automatic Exposure Control (AEC) and target/filter combination. Contrast-detail performance was analysed using CDMAM and ideal observer SNR methodology. The ideal observer SNR was calculated for input signal originating from gold discs of varying thicknesses and diameters, and then used to estimate the threshold gold thickness for each diameter as per CDMAM analysis. The variability of both methods and the dependence of CDMAM analysis on phantom manufacturing discrepancies were also investigated. Results from both CDMAM and ideal observer methodologies were informative differentiators of FFDM systems' contrast-detail performance, displaying comparable patterns with respect to the FFDM systems' type and age. CDMAM results suggested higher threshold gold thickness values compared with the ideal observer methodology, especially for small-diameter details, which can be attributed to the behaviour of the CDMAM phantom used in this study. In addition, ideal observer methodology results showed lower variability than CDMAM results. The ideal observer SNR methodology can provide a useful metric of the FFDM systems' contrast-detail characteristics and could be considered a surrogate for conventional, automated analysis of CDMAM images. Copyright © 2015 Associazione Italiana di Fisica Medica. Published by Elsevier Ltd. All rights reserved.

  7. Observational uncertainty and regional climate model evaluation: A pan-European perspective

    NASA Astrophysics Data System (ADS)

    Kotlarski, Sven; Szabó, Péter; Herrera, Sixto; Räty, Olle; Keuler, Klaus; Soares, Pedro M.; Cardoso, Rita M.; Bosshard, Thomas; Pagé, Christian; Boberg, Fredrik; Gutiérrez, José M.; Jaczewski, Adam; Kreienkamp, Frank; Liniger, Mark A.; Lussana, Cristian; Szepszo, Gabriella

    2017-04-01

    Local and regional climate change assessments based on downscaling methods crucially depend on the existence of accurate and reliable observational reference data. In dynamical downscaling via regional climate models (RCMs) observational data can influence model development itself and, later on, model evaluation, parameter calibration and added value assessment. In empirical-statistical downscaling, observations serve as predictand data and directly influence model calibration with corresponding effects on downscaled climate change projections. Focusing on the evaluation of RCMs, we here analyze the influence of uncertainties in observational reference data on evaluation results in a well-defined performance assessment framework and on a European scale. For this purpose we employ three different gridded observational reference grids, namely (1) the well-established EOBS dataset, (2) the recently developed EURO4M-MESAN regional re-analysis, and (3) several national high-resolution and quality-controlled gridded datasets that recently became available. In terms of climate models, five reanalysis-driven experiments carried out by five different RCMs within the EURO-CORDEX framework are used. Two variables (temperature and precipitation) and a range of evaluation metrics that reflect different aspects of RCM performance are considered. We furthermore include an illustrative model ranking exercise and relate observational spread to RCM spread. The results obtained indicate a varying influence of observational uncertainty on model evaluation depending on the variable, the season, the region and the specific performance metric considered. Over most parts of the continent, the influence of the choice of the reference dataset for temperature is rather small for seasonal mean values and inter-annual variability. Here, model uncertainty (as measured by the spread between the five RCM simulations considered) is typically much larger than reference data uncertainty. For parameters of the daily temperature distribution and for the spatial pattern correlation, however, important dependencies on the reference dataset can arise. The related evaluation uncertainties can be as large as or even larger than model uncertainty. For precipitation the influence of observational uncertainty is, in general, larger than for temperature. It often dominates model uncertainty, especially for the evaluation of the wet day frequency, the spatial correlation and the shape and location of the distribution of daily values. But even the evaluation of large-scale seasonal mean values can be considerably affected by the choice of the reference. When employing a simple and illustrative model ranking scheme on these results it is found that RCM ranking in many cases depends on the reference dataset employed.

  8. Assessing Multi-year Changes in Modeled and Observed Urban NOx Concentrations from a Dynamic Model Evaluation Perspective

    EPA Science Inventory

    An investigation of the concentrations of nitrogen oxides (NOx) from an air quality model and observations at monitoring sites was performed to assess the changes in NOx levels attributable to changes in mobile emissions. This evaluation effort focused on weekday morning rush hou...

  9. 10 CFR 712.37 - Evaluation for hallucinogen use.

    Code of Federal Regulations, 2011 CFR

    2011-01-01

    ... 10 Energy 4 2011-01-01 2011-01-01 false Evaluation for hallucinogen use. 712.37 Section 712.37 Energy DEPARTMENT OF ENERGY HUMAN RELIABILITY PROGRAM Medical Standards § 712.37 Evaluation for... performance and observed behavior. ...

  10. 10 CFR 712.37 - Evaluation for hallucinogen use.

    Code of Federal Regulations, 2013 CFR

    2013-01-01

    ... 10 Energy 4 2013-01-01 2013-01-01 false Evaluation for hallucinogen use. 712.37 Section 712.37 Energy DEPARTMENT OF ENERGY HUMAN RELIABILITY PROGRAM Medical Standards § 712.37 Evaluation for... performance and observed behavior. ...

  11. 10 CFR 712.37 - Evaluation for hallucinogen use.

    Code of Federal Regulations, 2014 CFR

    2014-01-01

    ... 10 Energy 4 2014-01-01 2014-01-01 false Evaluation for hallucinogen use. 712.37 Section 712.37 Energy DEPARTMENT OF ENERGY HUMAN RELIABILITY PROGRAM Medical Standards § 712.37 Evaluation for... performance and observed behavior. ...

  12. 10 CFR 712.37 - Evaluation for hallucinogen use.

    Code of Federal Regulations, 2010 CFR

    2010-01-01

    ... 10 Energy 4 2010-01-01 2010-01-01 false Evaluation for hallucinogen use. 712.37 Section 712.37 Energy DEPARTMENT OF ENERGY HUMAN RELIABILITY PROGRAM Medical Standards § 712.37 Evaluation for... performance and observed behavior. ...

  13. 10 CFR 712.37 - Evaluation for hallucinogen use.

    Code of Federal Regulations, 2012 CFR

    2012-01-01

    ... 10 Energy 4 2012-01-01 2012-01-01 false Evaluation for hallucinogen use. 712.37 Section 712.37 Energy DEPARTMENT OF ENERGY HUMAN RELIABILITY PROGRAM Medical Standards § 712.37 Evaluation for... performance and observed behavior. ...

  14. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Neuhauser, K.

    Through discussion of five case studies (test homes), this project evaluates strategies to elevate the performance of existing homes to a level commensurate with best-in-class implementation of high-performance new construction homes. The test homes featured in this research activity participated in Deep Energy Retrofit (DER) Pilot Program sponsored by the electric and gas utility National Grid in Massachusetts and Rhode Island. Building enclosure retrofit strategies are evaluated for impact on durability and indoor air quality in addition to energy performance. Evaluation of strategies is structured around the critical control functions of water, airflow, vapor flow, and thermal control. The aim of the research project is to develop guidance that could serve as a foundation for wider adoption of high performance, 'deep' retrofit work. The project will identify risk factors endemic to advanced retrofit in the context of the general building type, configuration and vintage encountered in the National Grid DER Pilot. Results for the test homes are based on observation and performance testing of recently completed projects. Additional observation would be needed to fully gauge long-term energy performance, durability, and occupant comfort.

  15. A Gold Standards Approach to Training Instructors to Evaluate Crew Performance

    NASA Technical Reports Server (NTRS)

    Baker, David P.; Dismukes, R. Key

    2003-01-01

    The Advanced Qualification Program requires that airlines evaluate crew performance in Line Oriented Simulation. For this evaluation to be meaningful, instructors must observe relevant crew behaviors and evaluate those behaviors consistently and accurately against standards established by the airline. The airline industry has largely settled on an approach in which instructors evaluate crew performance on a series of event sets, using standardized grade sheets on which behaviors specific to each event set are listed. Typically, new instructors are given a class in which they learn to use the grade sheets and practice evaluating crew performance observed on videotapes. These classes emphasize reliability, providing detailed instruction and practice in scoring so that all instructors within a given class will give similar scores to similar performance. This approach has value but also has important limitations: (1) ratings within one class of new instructors may differ from those of other classes; (2) ratings may not be driven primarily by the specific behaviors on which the company wanted the crews to be scored; and (3) ratings may not be calibrated to company standards for level of performance skill required. In this paper we provide a method to extend the existing method of training instructors to address these three limitations. We call this method the "gold standards" approach because it uses ratings from the company's most experienced instructors as the basis for training rater accuracy. This approach ties the training to the specific behaviors on which the experienced instructors based their ratings.

  16. The Role of Scheduling in Observing Teacher-Child Interactions

    ERIC Educational Resources Information Center

    Cash, Anne H.; Pianta, Robert C.

    2014-01-01

    Observational assessment is being used on a large scale to evaluate the quality of interactions between teachers and children in classroom environments. When one performs observations at scale, features of the protocol such as the scheduling of observations can potentially influence observed scores. In this study interactions were observed for 88…

  17. Statistical Evaluation of CRM-Simulated Cloud and Precipitation Structures Using Multi-sensor TRMM Measurements and Retrievals

    NASA Astrophysics Data System (ADS)

    Posselt, D.; L'Ecuyer, T.; Matsui, T.

    2009-05-01

    Cloud resolving models are typically used to examine the characteristics of clouds and precipitation and their relationship to radiation and the large-scale circulation. As such, they are not required to reproduce the exact location of each observed convective system, much less each individual cloud. Some of the most relevant information about clouds and precipitation is provided by instruments located on polar-orbiting satellite platforms, but these observations are intermittent "snapshots" in time, making assessment of model performance challenging. In contrast to direct comparison, model results can be evaluated statistically. This avoids the requirement for the model to reproduce the observed systems, while returning valuable information on the performance of the model in a climate-relevant sense. The focus of this talk is a model evaluation study, in which updates to the microphysics scheme used in a three-dimensional version of the Goddard Cumulus Ensemble (GCE) model are evaluated using statistics of observed clouds, precipitation, and radiation. We present the results of multiday (non-equilibrium) simulations of organized deep convection using single- and double-moment versions of the model's cloud microphysical scheme. Statistics of TRMM multi-sensor derived clouds, precipitation, and radiative fluxes are used to evaluate the GCE results, as are simulated TRMM measurements obtained using a sophisticated instrument simulator suite. We present advantages and disadvantages of performing model comparisons in retrieval and measurement space and conclude by motivating the use of data assimilation techniques for analyzing and improving model parameterizations.

  18. Comparing masked target transform volume (MTTV) clutter metric to human observer evaluation of visual clutter

    NASA Astrophysics Data System (ADS)

    Camp, H. A.; Moyer, Steven; Moore, Richard K.

    2010-04-01

    The Night Vision and Electronic Sensors Directorate's current time-limited search (TLS) model, which makes use of the targeting task performance (TTP) metric to describe image quality, does not explicitly account for the effects of visual clutter on observer performance. The TLS model is currently based on empirical fits to describe human performance for a time of day, spectrum and environment. Incorporating a clutter metric into the TLS model may reduce the number of these empirical fits needed. The masked target transform volume (MTTV) clutter metric has been previously presented and compared to other clutter metrics. Using real infrared imagery of rural scenes with varying levels of clutter, NVESD is currently evaluating the appropriateness of the MTTV metric. NVESD had twenty subject matter experts (SMEs) rank the amount of clutter in each scene in a series of pair-wise comparisons. MTTV metric values were calculated and then compared to the SME observers' rankings. The MTTV metric ranked the clutter in a similar manner to the SME evaluation, suggesting that the MTTV metric may emulate SME response. This paper is a first step in quantifying clutter and measuring the agreement to subjective human evaluation.

  19. Evaluation of Oral Performance in Outsourced Call Centres: An Exploratory Case Study

    ERIC Educational Resources Information Center

    Friginal, Eric

    2013-01-01

    This case study discusses the development and use of an oral performance assessment instrument intended to evaluate Filipino agents' customer service transactions with callers from the United States (US). The design and applications of the instrument were based on a longitudinal, qualitative observation of language training and customer service…

  20. A Descriptive-Comparative Study of Teacher Performance Evaluation on Student Achievement in a Public School District

    ERIC Educational Resources Information Center

    Christensen, William Howard

    2013-01-01

    In 2010, the federal government increased accountability expectations by placing more emphasis on monitoring teacher performance. Using a model that focuses on the New York State teacher evaluation system, that is comprised of a rubric for observation, local student assessment scores, and student state assessment scores, this…

  1. Differences between Employees' and Supervisors' Evaluations of Work Performance and Support Needs

    ERIC Educational Resources Information Center

    Bennett, Kyle; Frain, Michael; Brady, Michael P.; Rosenberg, Howard; Surinak, Tricia

    2009-01-01

    Assessment systems are needed that are sensitive to employees' work performance as well as their need for support, while incorporating the input from both employees and their supervisors. This study examined the correspondence of one such evaluation system, the Job Observation and Behavior Scale (JOBS) and the JOBS: Opportunity for…

  2. Using Summative and Formative Assessments to Evaluate EFL Teachers' Teaching Performance

    ERIC Educational Resources Information Center

    Wei, Wei

    2015-01-01

    Using classroom observations (formative) and student course experience survey results (summative) to evaluate English lecturers' teaching performances is not new in practice, but surprisingly only a few studies have investigated this issue in a higher education context. This study was conducted in an English department of a large university in…

  3. Administrators' Views on Teacher Evaluation: Examining Ontario's Teacher Performance Appraisal

    ERIC Educational Resources Information Center

    Maharaj, Sachin

    2014-01-01

    This study examines the views of administrators (i.e., principals and vice-principals) in Ontario, Canada, with regard to the province's Teacher Performance Appraisal process. A total of 178 responses were collected from a survey that examined five areas: 1) preparation and training; 2) classroom observations; 3) preparing the formal evaluation;…

  4. Classroom Composition and Measured Teacher Performance: What Do Teacher Observation Scores Really Measure?

    ERIC Educational Resources Information Center

    Steinberg, Matthew P.; Garrett, Rachel

    2016-01-01

    As states and districts implement more rigorous teacher evaluation systems, measures of teacher performance are increasingly being used to support instruction and inform retention decisions. Classroom observations take a central role in these systems, accounting for the majority of teacher ratings upon which accountability decisions are based.…

  5. Postflight hardware evaluation 360T026 (RSRM-26, STS-47)

    NASA Technical Reports Server (NTRS)

    Nielson, Greg

    1993-01-01

    The final report for the Clearfield disassembly evaluation and a continuation of the KSC postflight assessment for the 360T026 (STS-47) Redesigned Solid Rocket Motor (RSRM) flight set is provided. All observed hardware conditions were documented on PFOR's and are included in Appendices A, B, and C. Appendices D and E contain the measurements and safety factor data for the nozzle and insulation components. This report, along with the KSC Ten-Day Postflight Hardware Evaluation Report (TWR-64203), represents a summary of the 360T026 hardware evaluation. The as-flown hardware configuration is documented in TWR-60472. Disassembly evaluation photograph numbers are logged in TWA-1987. The 360T026 flight set disassembly evaluations described were performed at the RSRM Refurbishment Facility in Clearfield, Utah. The final factory joint demate occurred on 12 April 1993. Detailed evaluations were performed in accordance with the Clearfield Postflight Engineering Evaluation Plan (PEEP), TWR-50051, Revision A. All observations were compared against limits that are also defined in the PEEP. These limits outline the criteria for categorizing the observations as acceptable, reportable, or critical. Hardware conditions that were unexpected and/or determined to be reportable or critical were evaluated by the applicable CPT and tracked through the PFAR system.

  6. Initial Clinical Trial of Robot of Endovascular Treatment with Force Feedback and Cooperating of Catheter and Guidewire.

    PubMed

    Jiang, Yuhua; Liu, Keyun; Li, Youxiang

    2018-01-01

    To evaluate the feasibility and safety of the robot of endovascular treatment (RobEnt) in clinical practice, we carried out a cerebral angiography using this robot system. We evaluated the performance of the robot system in clinical practice by using it to perform digital subtraction angiography on a patient suspected of having an intracranial aneurysm. We also evaluated the safety of the robot system by comparing the postoperative head nuclear magnetic resonance and routine blood examinations with the preoperative examinations. We used the robot system to complete bilateral carotid artery and bilateral vertebral arteriography. The results indicate that there was no obvious abnormality in the patient's cerebral arteries. No obvious abnormality was observed in the patient's check-up, head nuclear magnetic resonance examination, or routine blood examination after the digital subtraction angiography. This clinical trial shows that the robot system can perform cerebral angiography. The robot system can basically complete the related observation indexes, and its accuracy, effectiveness, stability, and safety basically meet the requirements of clinical application in neurointerventional surgery.

  7. A Focused Observation Tool Using Dreyfus Stages of Skill Acquisition as an Evaluative Scale.

    PubMed

    Driver, Richard; Grose, Brian; Serafini, Mario; Cottrell, Scott; Sizemore, Daniel; Vallejo, Manuel

    2017-01-01

    Focused Observation (FO) is associated with assessing complex skills and differs from generalized observations and evaluations. We developed an FO tool for assessing clinical procedural skills using Hubert Dreyfus's Stages of Skill Acquisition as descriptive anchors. This study sought to analyze the effectiveness of this measure of skill progression. During week 1 and week 4 of training, FO was performed repeatedly on 6 residents during endotracheal intubation. Skill stage ratings were converted to numerical scores. A dependent, paired samples t-test was calculated using total mean score (dependent variable), and an effect size (Cohen's d) was calculated to ascertain the standardized mean difference between observations. A significant improvement in mean scores occurred between Week 1 (AVG 1.2, STDV ± 0.1) and Week 4 (AVG 2.0, STDV ± 0.1) (t = -3.9, p < .05). The calculated Cohen's d indicates that this difference was meaningful. This study demonstrates success in adapting a Focused Observation technique and an innovative evaluative scale based upon Dreyfus stages of skill acquisition.
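
    The paired comparison described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' analysis: the scores are hypothetical, the function name `paired_t_and_d` is invented here, and Cohen's d is computed as the mean difference divided by the standard deviation of the differences, which is only one of several variants (the abstract does not say which was used). The sign of t depends on the order of subtraction.

```python
import math
import statistics


def paired_t_and_d(before, after):
    """Return (t, d) for paired samples.

    t is the paired-samples t statistic; d is Cohen's d computed on the
    per-subject differences (assumed variant, see lead-in).
    """
    if len(before) != len(after) or len(before) < 2:
        raise ValueError("need two equal-length samples with at least 2 pairs")
    diffs = [a - b for a, b in zip(after, before)]
    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)  # sample SD (n - 1 denominator)
    if sd_diff == 0:
        raise ValueError("differences have zero variance")
    t = mean_diff / (sd_diff / math.sqrt(len(diffs)))
    d = mean_diff / sd_diff
    return t, d


# Hypothetical skill-stage scores for 6 residents (NOT the study's data).
week1 = [1.0, 1.5, 1.0, 1.0, 1.5, 1.0]
week4 = [2.0, 2.0, 1.5, 2.5, 2.0, 2.0]
t_stat, cohen_d = paired_t_and_d(week1, week4)
print(f"t = {t_stat:.2f}, d = {cohen_d:.2f}, df = {len(week1) - 1}")
```

    The p-value would then be read from a t distribution with n - 1 degrees of freedom.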

  8. The evaluation of ASOS for the Kennedy Space Center's Shuttle Landing Facility

    NASA Technical Reports Server (NTRS)

    Yersavich, Ann; Wheeler, Mark; Taylor, Gregory; Schumann, Robin; Manobianco, John

    1994-01-01

    This report documents the Applied Meteorology Unit's (AMU) evaluation of the effectiveness and utility of the Automated Surface Observing System (ASOS) in terms of spaceflight operations and user requirements. In particular, the evaluation determines which of the Shuttle Landing Facility (SLF) observation requirements can be satisfied by ASOS. This report also includes a summary of ASOS' background, current configuration and specifications, system performance, and the possible concepts of operations for use of ASOS at the SLF. This evaluation stems from a desire by the Air Force to determine if ASOS units could be used to reduce the cost of SLF meteorological observations.

  9. Correlation between human observer performance and model observer performance in differential phase contrast CT

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Li, Ke; Garrett, John; Chen, Guang-Hong

    2013-11-15

    Purpose: With the recently expanding interest and developments in x-ray differential phase contrast CT (DPC-CT), the evaluation of its task-specific detection performance and comparison with the corresponding absorption CT under a given radiation dose constraint become increasingly important. Mathematical model observers are often used to quantify the performance of imaging systems, but their correlations with actual human observers need to be confirmed for each new imaging method. This work is an investigation of the effects of stochastic DPC-CT noise on the correlation of detection performance between model and human observers with signal-known-exactly (SKE) detection tasks. Methods: The detectabilities of different objects (five disks with different diameters and two breast lesion masses) embedded in an experimental DPC-CT noise background were assessed using both model and human observers. The detectability of the disk and lesion signals was then measured using five types of model observers including the prewhitening ideal observer, the nonprewhitening (NPW) observer, the nonprewhitening observer with eye filter and internal noise (NPWEi), the prewhitening observer with eye filter and internal noise (PWEi), and the channelized Hotelling observer (CHO). The same objects were also evaluated by four human observers using the two-alternative forced choice method. The results from the model observer experiment were quantitatively compared to the human observer results to assess the correlation between the two techniques. Results: The contrast-to-detail (CD) curve generated by the human observers for the disk-detection experiments shows that the required contrast to detect a disk is inversely proportional to the square root of the disk size. Based on the CD curves, the ideal and NPW observers tend to systematically overestimate the performance of the human observers. The NPWEi and PWEi observers did not predict human performance well either, as the slopes of their CD curves tended to be steeper. The CHO generated the best quantitative agreement with human observers, with its CD curve overlapping that of the human observers. Statistical equivalence between CHO and humans can be claimed within 11% of the human observer results, including both the disk and lesion detection experiments. Conclusions: The model observer method can be used to accurately represent human observer performance with the stochastic DPC-CT noise for SKE tasks with sizes ranging from 8 to 128 pixels. The incorporation of the anatomical noise remains to be studied.
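
    Two quantities mentioned in this abstract lend themselves to a short sketch. The first function encodes the reported contrast-detail relation (threshold contrast inversely proportional to the square root of disk size). The second converts a two-alternative forced choice proportion correct into a detectability index using the standard equal-variance Gaussian relation d' = sqrt(2) * z(PC). The function names and the constant `k` are hypothetical, and the abstract does not state how the human results were actually converted, so this is an assumption rather than the authors' procedure.

```python
import math
from statistics import NormalDist


def threshold_contrast(size_pixels, k=1.0):
    """Contrast-detail relation: contrast = k / sqrt(size).

    k is a hypothetical proportionality constant.
    """
    if size_pixels <= 0:
        raise ValueError("size must be positive")
    return k / math.sqrt(size_pixels)


def dprime_from_2afc(proportion_correct):
    """Detectability index from 2AFC proportion correct.

    Assumes the equal-variance Gaussian model: d' = sqrt(2) * z(PC).
    """
    if not 0.0 < proportion_correct < 1.0:
        raise ValueError("proportion correct must lie strictly between 0 and 1")
    return math.sqrt(2.0) * NormalDist().inv_cdf(proportion_correct)


# Sizes spanning the 8 to 128 pixel range quoted in the abstract.
for size in (8, 32, 128):
    print(size, round(threshold_contrast(size), 4))
print(round(dprime_from_2afc(0.85), 3))
```

    Under this relation, a 16-fold increase in disk size (8 to 128 pixels) lowers the required contrast by a factor of 4.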

  10. Blinded evaluation of interrater reliability of an operative competency assessment tool for direct laryngoscopy and rigid bronchoscopy.

    PubMed

    Ishman, Stacey L; Benke, James R; Johnson, Kaalan Erik; Zur, Karen B; Jacobs, Ian N; Thorne, Marc C; Brown, David J; Lin, Sandra Y; Bhatti, Nasir; Deutsch, Ellen S

    2012-10-01

    OBJECTIVES To confirm interrater reliability using blinded evaluation of a skills-assessment instrument to assess the surgical performance of resident and fellow trainees performing pediatric direct laryngoscopy and rigid bronchoscopy in simulated models. DESIGN Prospective, paired, blinded observational validation study. SUBJECTS Paired observers from multiple institutions simultaneously evaluated residents and fellows who were performing surgery in an animal laboratory or using high-fidelity manikins. The evaluators had no previous affiliation with the residents and fellows and did not know their year of training. INTERVENTIONS One- and 2-page versions of an objective structured assessment of technical skills (OSATS) assessment instrument composed of global and task-specific surgical items were used to evaluate surgical performance. RESULTS Fifty-two evaluations were completed by 17 attending evaluators. The instrument agreement for the 2-page assessment was 71.4% when measured as a binary variable (ie, competent vs not competent) (κ = 0.38; P = .08). Evaluation as a continuous variable revealed a 42.9% percentage agreement (κ = 0.18; P = .14). The intraclass correlation was 0.53, considered substantial/good interrater reliability (69% reliable). For the 1-page instrument, agreement was 77.4% when measured as a binary variable (κ = 0.53, P = .0015). Agreement when evaluated as a continuous measure was 71.0% (κ = 0.54, P < .001). The intraclass correlation was 0.73, considered high interrater reliability (85% reliable). CONCLUSIONS The OSATS assessment instrument is an effective tool for evaluating surgical performance among trainees with acceptable interrater reliability in a simulator setting. Reliability was good for both the 1- and 2-page OSATS checklists, and both serve as excellent tools to provide immediate formative feedback on operational competency.
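
    The abstract reports both percentage agreement and Cohen's kappa, which can differ considerably because kappa discounts agreement expected by chance. The sketch below shows the standard two-rater calculation on hypothetical binary ratings (1 = competent, 0 = not competent). The ratings and function names are invented for illustration and are not the study's data.

```python
from collections import Counter


def percent_agreement(rater_a, rater_b):
    """Fraction of items on which the two raters gave the same rating."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("need two non-empty, equal-length rating lists")
    return sum(a == b for a, b in zip(rater_a, rater_b)) / len(rater_a)


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa: (p_o - p_e) / (1 - p_e) for two raters."""
    n = len(rater_a)
    p_o = percent_agreement(rater_a, rater_b)
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    # Chance agreement from each rater's marginal category frequencies.
    categories = set(counts_a) | set(counts_b)
    p_e = sum(counts_a[c] * counts_b[c] for c in categories) / (n * n)
    if p_e == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (p_o - p_e) / (1.0 - p_e)


# Hypothetical paired ratings for 10 trainees (NOT the study's data).
rater_a = [1, 1, 1, 0, 0, 1, 0, 1, 1, 0]
rater_b = [1, 1, 0, 0, 0, 1, 1, 1, 1, 0]
agreement = percent_agreement(rater_a, rater_b)
kappa = cohens_kappa(rater_a, rater_b)
print(f"agreement = {agreement:.1%}, kappa = {kappa:.2f}")
```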

  11. Mirror neuron system and observational learning: behavioral and neurophysiological evidence.

    PubMed

    Lago-Rodriguez, Angel; Lopez-Alonso, Virginia; Fernández-del-Olmo, Miguel

    2013-07-01

    Three experiments were performed to study observational learning using behavioral, perceptual, and neurophysiological data. Experiment 1 investigated whether observing an execution model, during physical practice of a transitive task that only presented one execution strategy, led to performance improvements compared with physical practice alone. Experiment 2 investigated whether performing an observational learning protocol improves subjects' action perception. In experiment 3 we evaluated whether the type of practice performed determined the activation of the Mirror Neuron System during action observation. Results showed that, compared with physical practice, observing an execution model during a task that only showed one execution strategy does not provide behavioral benefits. However, an observational learning protocol allows subjects to predict more precisely the outcome of the learned task. Finally, interspersing observation of an execution model with physical practice results in changes of primary motor cortex activity during the observation of the motor pattern previously practiced, whereas modulations in the connectivity between primary and nonprimary motor areas (PMv-M1; PPC-M1) were not affected by the practice protocol performed by the observer. Copyright © 2013 Elsevier B.V. All rights reserved.

  12. Milestone-specific, Observed data points for evaluating levels of performance (MODEL) assessment strategy for anesthesiology residency programs.

    PubMed

    Nagy, Christopher J; Fitzgerald, Brian M; Kraus, Gregory P

    2014-01-01

    Anesthesiology residency programs will be expected to have Milestones-based evaluation systems in place by July 2014 as part of the Next Accreditation System. The San Antonio Uniformed Services Health Education Consortium (SAUSHEC) anesthesiology residency program developed and implemented a Milestones-based feedback and evaluation system a year ahead of schedule. It has been named the Milestone-specific, Observed Data points for Evaluating Levels of performance (MODEL) assessment strategy. The "MODEL Menu" and the "MODEL Blueprint" are tools that other anesthesiology residency programs can use in developing their own Milestones-based feedback and evaluation systems prior to ACGME-required implementation. Data from our early experience with the streamlined MODEL blueprint assessment strategy showed substantially improved faculty compliance with reporting requirements. The MODEL assessment strategy provides programs with a workable assessment method for residents, and important Milestones data points to programs for ACGME reporting.

  13. Reliability of the Cardiff Test of basic life support and automated external defibrillation version 3.1.

    PubMed

    Whitfield, Richard H; Newcombe, Robert G; Woollard, Malcolm

    2003-12-01

    The introduction of the European Resuscitation Guidelines (2000) for cardiopulmonary resuscitation (CPR) and automated external defibrillation (AED) prompted the development of an up-to-date and reliable method of assessing the quality of performance of CPR in combination with the use of an AED. The Cardiff Test of basic life support (BLS) and AED version 3.1 was developed to meet this need and uses standardised checklists to retrospectively evaluate performance from analyses of video recordings and data drawn from a laptop computer attached to a training manikin. This paper reports the inter- and intra-observer reliability of this test. Data used to assess reliability were obtained from an investigation of CPR and AED skill acquisition in a lay responder AED training programme. Six observers were recruited to evaluate performance in 33 data sets, repeating their evaluation after a minimum interval of 3 weeks. More than 70% of the 42 variables considered in this study had a kappa score of 0.70 or above for inter-observer reliability or were drawn from computer data and therefore not subject to evaluator variability. 85% of the 42 variables had kappa scores for intra-observer reliability of 0.70 or above or were drawn from computer data. The standard deviations for inter- and intra-observer measures of time to first shock were 11.6 and 7.7 s, respectively. The inter- and intra-observer reliability for the majority of the variables in the Cardiff Test of BLS and AED version 3.1 is satisfactory. However, reliability is less acceptable with respect to shaking when checking for responsiveness, initial check/clearing of the airway, checks for signs of circulation, time to first shock and performance of interventions in the correct sequence. Further research is required to determine if modifications to the method of assessing these variables can increase reliability.

  14. Using a human patient simulator to study the relationship between communication and nursing students' team performance.

    PubMed

    Hirokawa, Randy Y; Daub, Katharyn; Lovell, Eileen; Smith, Sarah; Davis, Alice; Beck, Christine

    2012-11-01

    This study examined the relationship between communication and nursing students' team performance by determining whether variations in team performance are related to differences in communication regarding five task-relevant functions: assessment, diagnosis, planning, implementation, and evaluation. The study results indicate a positive relationship between nursing students' team performance and comments focused on the implementation of treatment(s) and the evaluation of treatment options. A negative relationship between nursing students' team performance and miscellaneous comments made by team members was also observed. Copyright 2012, SLACK Incorporated.

  15. DSS range delay calibrations: Current performance level

    NASA Technical Reports Server (NTRS)

    Spradlin, G. L.

    1976-01-01

    A means for evaluating Deep Space Station (DSS) range delay calibration performance was developed. Inconsistencies frequently noted in these data are resolved. Development of the DSS range delay data base is described. The data base is presented with comments regarding apparent discontinuities. Data regarding the exciter frequency dependence of the delay values are presented. The improvement observed in the consistency of current DSS range delay calibration data over the performance previously observed is noted.

  16. Balancing the Role of Priors in Multi-Observer Segmentation Evaluation

    PubMed Central

    Huang, Xiaolei; Wang, Wei; Lopresti, Daniel; Long, Rodney; Antani, Sameer; Xue, Zhiyun; Thoma, George

    2009-01-01

    Comparison of a group of multiple observer segmentations is known to be a challenging problem. A good segmentation evaluation method would allow different segmentations not only to be compared, but to be combined to generate a “true” segmentation with higher consensus. Numerous multi-observer segmentation evaluation approaches have been proposed in the literature, and STAPLE in particular probabilistically estimates the true segmentation by optimal combination of observed segmentations and a prior model of the truth. STAPLE is an Expectation–Maximization (EM) algorithm, and its convergence to the desired local minima depends on good initializations for the truth prior and the observer-performance prior. However, accurate modeling of the initial truth prior is nontrivial. Moreover, among the two priors, the truth prior always dominates so that in certain scenarios when meaningful observer-performance priors are available, STAPLE cannot take advantage of that information. In this paper, we propose a Bayesian decision formulation of the problem that permits the two types of prior knowledge to be integrated in a complementary manner in four cases with differing application purposes: (1) with known truth prior; (2) with observer prior; (3) with neither truth prior nor observer prior; and (4) with both truth prior and observer prior. The third and fourth cases are not discussed (or effectively ignored) by STAPLE, and in our research we propose a new method to combine multiple-observer segmentations based on the maximum a posteriori (MAP) principle, which respects the observer prior regardless of the availability of the truth prior. Based on the four scenarios, we have developed a web-based software application that implements the flexible segmentation evaluation framework for digitized uterine cervix images. Experiment results show that our framework has flexibility in effectively integrating different priors for multi-observer segmentation evaluation and it also generates results comparing favorably to those by the STAPLE algorithm and the Majority Vote Rule. PMID:20523759
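
    Of the methods named in this abstract, the Majority Vote Rule is the simplest baseline and can be sketched in a few lines. The code below is only that baseline, not STAPLE and not the authors' MAP formulation. The masks are hypothetical, and resolving ties to background (0) is an assumption made here.

```python
def majority_vote(masks):
    """Combine binary segmentation masks pixel by pixel.

    A pixel is labeled 1 when strictly more than half of the observers
    marked it; ties resolve to 0 (an assumption of this sketch).
    """
    if not masks:
        raise ValueError("need at least one mask")
    rows, cols = len(masks[0]), len(masks[0][0])
    for mask in masks:
        if len(mask) != rows or any(len(row) != cols for row in mask):
            raise ValueError("all masks must have the same shape")
    n_observers = len(masks)
    return [
        [1 if 2 * sum(mask[r][c] for mask in masks) > n_observers else 0
         for c in range(cols)]
        for r in range(rows)
    ]


# Three hypothetical 3x3 observer segmentations (NOT data from the paper).
observer_1 = [[0, 1, 1], [0, 1, 1], [0, 0, 0]]
observer_2 = [[0, 1, 0], [0, 1, 1], [0, 0, 1]]
observer_3 = [[0, 0, 1], [1, 1, 1], [0, 0, 0]]
consensus = majority_vote([observer_1, observer_2, observer_3])
for row in consensus:
    print(row)
```

    Majority voting weights every observer equally, which is the limitation that prior-based methods such as those discussed in the abstract aim to address.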

  17. The Influence of the Manner of Performing the Thyroid Ultrasound Examination on the Reliability of the Assessment of the Thyroid Size in School-Aged Children.

    PubMed

    Zygmunt, Arkadiusz; Adamczewski, Zbigniew; Zygmunt, Agnieszka; Karbownik-Lewinska, Malgorzata; Lewinski, Andrzej

    2017-01-01

    Goitre incidence in school-aged children evaluated using ultrasonography is one of the essential indicators of iodine intake in a given area. The aim of the study was to examine what the difference is between the volume of the thyroid gland measured in the supine and sitting position and to determine the intra-observer, inter-observer, and inter-position variations. The survey was conducted among 87 children (56 girls and 31 boys aged 7-13 years, mean age 10.44 ± 1.72 years). The thyroid volume measured in a sitting position was significantly lower than that measured in the supine position. The intra-observer variations for the total thyroid volume equalled 9.56-9.65%. The inter-observer variations were significantly higher and amounted to 34.5-35.7%. The way in which ultrasound evaluation is performed is important for the analysis of the results. It is crucial to aim for the smallest inter-observer variation, which can be achieved by strictly defining the methods of the thyroid measurement and comparing one's measuring techniques with the reference method. The use of standards in ultrasound evaluation performed in the supine position, as well as the use of standards without a strict determination of the study method, can lead to erroneous conclusions. © 2017 S. Karger AG, Basel.

  18. Influence of visual observational conditions on tongue motor learning.

    PubMed

    Kothari, Mohit; Liu, Xuemei; Baad-Hansen, Lene; Kumar, Abhishek; Bin, Guo; Svensson, Peter

    2016-12-01

    The aim of this study was to investigate the impact of visual observational conditions on performance during a standardized tongue-protrusion training (TPT) task and to evaluate subject-based reports of helpfulness, disturbance, pain, and fatigue, due to the observational conditions on 0-10 numerical rating scales. Forty-eight healthy participants performed a 1-h standard TPT task. Participants were randomly assigned to one of the following three groups with different observational conditions: group 1, model observation (participants watched a prerecorded video showing standard TPT before optimal TPT being performed); group 2, self-observation (participants watched live video feedback of their own TPT performance); and group 3, control group (participants performed the TPT with no conditioning). There was no overall difference between groups but TPT performance increased over time. A significant group×time interaction indicated that the self-observation group performed significantly better than the model-observation group in the last 20 min of TPT. The subject-based reports of video helpfulness showed that the model-observation group rated the prerecorded video as more helpful for TPT performance compared with the other groups but there was no significant difference between groups regarding the level of disturbance, pain, or fatigue. Self-observation of tongue-training facilitated behavioral aspects of tongue motor learning compared with model observation but not compared with control. © 2016 Eur J Oral Sci.

  19. Evaluating Simulation Methodologies to Determine Best Strategies to Maximize Student Learning.

    PubMed

    Scherer, Yvonne K; Foltz-Ramos, Kelly; Fabry, Donna; Chao, Ying-Yu

    2016-01-01

    Limited evidence exists as to the most effective ways to provide simulation experiences to maximize student learning. This quasi-experimental study investigated the effect of 2 different strategies (repeated versus 1 exposure, and participation versus observation) on student outcomes following exposure to a high-fidelity acute asthma exacerbation scenario. Immediate repeated exposure resulted in significantly higher scores on knowledge, student satisfaction and self-confidence, and clinical performance measures than a single exposure. Significant intergroup differences were found on participants' satisfaction and self-confidence as compared with observers. Implications for nurse educators include expanding the observer role when designing repeated exposure to simulations and integrating technical, cognitive, and behavioral outcomes as a way for faculty to evaluate students' clinical performance. Published by Elsevier Inc.

  20. Values of a Patient and Observer Scar Assessment Scale to Evaluate the Facial Skin Graft Scar.

    PubMed

    Chae, Jin Kyung; Kim, Jeong Hee; Kim, Eun Jung; Park, Kun

    2016-10-01

    The patient and observer scar assessment scale (POSAS) recently emerged as a promising method, reflecting both observer's and patient's opinions in evaluating scars. This tool was shown to be consistent and reliable in burn scar assessment, but it has not been tested in the setting of skin graft scars in skin cancer patients. The aim was to evaluate facial skin graft scars using the POSAS and to compare it with objective scar assessment tools. Twenty-three patients who were diagnosed with facial cutaneous malignancy and received skin grafts after Mohs micrographic surgery were recruited. Observer assessment was performed by three independent raters using the observer component of the POSAS and the Vancouver scar scale (VSS). Patient self-assessment was performed using the patient component of the POSAS. To quantify scar color and scar thickness more objectively, a spectrophotometer and ultrasonography were applied. Inter-observer reliability was substantial with both the VSS and the observer component of the POSAS (average measure intraclass correlation coefficient, 0.76 and 0.80, respectively). The observer component consistently showed significant correlations with patients' ratings for the parameters of the POSAS (all p-values < 0.05). The correlation between subjective assessment using the POSAS and objective assessment using the spectrophotometer and ultrasonography was low. In facial skin graft scar assessment in skin cancer patients, the POSAS showed acceptable inter-observer reliability. This tool was more comprehensive and had higher correlation with patient's opinion.

  1. Performance Evaluation of New-Generation Pulse Oximeters in the NICU: Observational Study.

    PubMed

    Nizami, Shermeen; Greenwood, Kim; Barrowman, Nick; Harrold, JoAnn

    2015-09-01

    This crossover observational study compares the data characteristics and performance of new-generation Nellcor OXIMAX and Masimo SET SmartPod pulse oximeter technologies. The study was conducted independent of either original equipment manufacturer (OEM) across eleven preterm infants in a Neonatal Intensive Care Unit (NICU). The SmartPods were integrated with Dräger Infinity Delta monitors. The Delta monitor measured the heart rate (HR) using an independent electrocardiogram sensor, and the two SmartPods collected arterial oxygen saturation (SpO2) and pulse rate (PR). All patient data were non-Gaussian. Nellcor PR showed a higher correlation with the HR as compared to Masimo PR. The statistically significant difference found in their median values (1% for SpO2, 1 bpm for PR) was deemed clinically insignificant. SpO2 alarms generated by both SmartPods were observed and categorized for performance evaluation. Results for sensitivity, positive predictive value, accuracy and false alarm rates were Nellcor (80.3, 50, 44.5, 50%) and Masimo (72.2, 48.2, 40.6, 51.8%) respectively. These metrics were not statistically significantly different between the two pulse oximeters. Despite claims by OEMs, both pulse oximeters exhibited high false alarm rates, with no statistically or clinically significant difference in performance. These findings have a direct impact on alarm fatigue in the NICU. Performance evaluation studies can also impact medical device purchase decisions made by hospital administrators.
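
    The four alarm metrics quoted in this abstract can be illustrated with a small sketch. The abstract gives neither the alarm counts nor the formulas, so both are assumptions here: the counts are invented, and accuracy is taken as TP / (TP + FP + FN), a definition without true negatives that appears numerically consistent with the reported percentages. False alarm rate is taken as FP / (TP + FP), the complement of positive predictive value.

```python
def alarm_metrics(true_pos, false_pos, false_neg):
    """Return alarm performance metrics as fractions in [0, 1].

    Definitions are assumptions of this sketch (see lead-in); true
    negatives are not used.
    """
    if min(true_pos, false_pos, false_neg) < 0 or true_pos == 0:
        raise ValueError("counts must be non-negative with at least one true alarm")
    return {
        "sensitivity": true_pos / (true_pos + false_neg),
        "ppv": true_pos / (true_pos + false_pos),
        "accuracy": true_pos / (true_pos + false_pos + false_neg),
        "false_alarm_rate": false_pos / (true_pos + false_pos),
    }


# Hypothetical counts chosen only to land near the first set of reported
# percentages; they are NOT taken from the study.
metrics = alarm_metrics(true_pos=500, false_pos=500, false_neg=123)
for name, value in metrics.items():
    print(f"{name}: {value:.1%}")
```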

  2. Empirical evaluation of spatial and non-spatial European-scale multimedia fate models: results and implications for chemical risk assessment.

    PubMed

    Armitage, James M; Cousins, Ian T; Hauck, Mara; Harbers, Jasper V; Huijbregts, Mark A J

    2007-06-01

    Multimedia environmental fate models are commonly-applied tools for assessing the fate and distribution of contaminants in the environment. Owing to the large number of chemicals in use and the paucity of monitoring data, such models are often adopted as part of decision-support systems for chemical risk assessment. The purpose of this study was to evaluate the performance of three multimedia environmental fate models (spatially- and non-spatially-explicit) at a European scale. The assessment was conducted for four polycyclic aromatic hydrocarbons (PAHs) and hexachlorobenzene (HCB) and compared predicted and median observed concentrations using monitoring data collected for air, water, sediments and soils. Model performance in the air compartment was reasonable for all models included in the evaluation exercise as predicted concentrations were typically within a factor of 3 of the median observed concentrations. Furthermore, there was good correspondence between predictions and observations in regions that had elevated median observed concentrations for both spatially-explicit models. On the other hand, all three models consistently underestimated median observed concentrations in sediment and soil by 1-3 orders of magnitude. Although regions with elevated median observed concentrations in these environmental media were broadly identified by the spatially-explicit models, the magnitude of the discrepancy between predicted and median observed concentrations is of concern in the context of chemical risk assessment. These results were discussed in terms of factors influencing model performance such as the steady-state assumption, inaccuracies in emission estimates and the representativeness of monitoring data.

  3. Evaluation on surface current observing network of high frequency ground wave radars in the Gulf of Thailand

    NASA Astrophysics Data System (ADS)

    Yin, Xunqiang; Shi, Junqiang; Qiao, Fangli

    2018-05-01

    Owing to the high cost of ocean observation systems, the scientific design of observation networks is very important. The current network of the high frequency radar system in the Gulf of Thailand has been studied using a three-dimensional coastal ocean model. First, the observations from current radars were assimilated into this coastal model, and the forecast results improved as a result of the data assimilation. However, the results also show that further optimization of the observing network is necessary. Then, a series of experiments was carried out to assess the performance of the existing high frequency ground wave radar surface current observation system. The simulated surface current data in three regions were assimilated sequentially using an efficient ensemble Kalman filter data assimilation scheme. The experimental results showed that the coastal surface current observation system plays a positive role in improving the numerical simulation of the currents. Compared with the control experiment without assimilation, the simulation precision of surface and subsurface currents improved after assimilating the surface currents observed at the current network. However, the improvement differed considerably among the three observing regions, and the current observing network in the Gulf of Thailand is not effective, so further optimization is required. Based on these evaluations, a manual scheme was designed by discarding the redundant and inefficient locations and adding new stations where the performance after data assimilation is still low. For comparison, an objective scheme based on the idea of data assimilation was obtained. Results show that both schemes perform better than the original network and that the optimal scheme based on data assimilation is much superior to the manual scheme based on the evaluation of the original observing network in the Gulf of Thailand. The distribution of the optimal radar network could provide useful guidance for the future design of the observing system in this region.

  4. Model helicopter performance degradation with simulated ice shapes

    NASA Technical Reports Server (NTRS)

    Tinetti, Ana F.; Korkan, Kenneth D.

    1987-01-01

    An experimental program using a commercially available model helicopter has been conducted in the Texas A&M University Subsonic Wind Tunnel to investigate main rotor performance degradation due to generic ice. The simulated ice, including both primary and secondary formations, was scaled by chord from previously documented artificial ice accretions. Base and iced performance data were gathered as functions of fuselage incidence, blade collective pitch, main rotor rotational velocity, and freestream velocity. It was observed that the presence of simulated ice tends to decrease the lift to equivalent drag ratio, as well as thrust coefficient for the range of velocity ratios tested. Also, increases in torque coefficient due to the generic ice formations were observed. Evaluation of the data has indicated that the addition of roughness due to secondary ice formations is crucial for proper evaluation of the degradation in main rotor performance.

  5. [Inferential evaluation of intimacy based on observation of interpersonal communication].

    PubMed

    Kimura, Masanori

    2015-06-01

    How do people inferentially evaluate others' levels of intimacy with friends? We examined the inferential evaluation of intimacy based on the observation of interpersonal communication. In Experiment 1, participants (N = 41) responded to questions after observing conversations between friends. Results indicated that participants inferentially evaluated not only goodness of communication, but also intimacy between friends, using an expressivity heuristic approach. In Experiment 2, we investigated how inferential evaluation of intimacy was affected by prior information about relationships and by individual differences in face-to-face interactional ability. Participants (N = 64) were divided into prior- and no-prior-information groups and all performed the same task as in Experiment 1. Additionally, their interactional ability was assessed. In the prior-information group, individual differences had no effect on inferential evaluation of intimacy. On the other hand, in the no-prior-information group, face-to-face interactional ability partially influenced evaluations of intimacy. Finally, we discuss the fact that to understand one's social environment, it is important to observe others' interpersonal communications.

  6. Intra- and Inter-observer Variability of Measurements of the Laxity Index on Stress Radiographs Performed with the Vezzoni-Modified Badertscher Hip Distension Device.

    PubMed

    Bertal, Mileva; Vezzoni, Aldo; Houdellier, Blandine; Bogaerts, Evelien; Stock, Emmelie; Polis, Ingeborgh; Deforce, Dieter; Saunders, Jimmy H; Broeckx, Bart J G

    2018-06-02

    To describe and evaluate the accuracy and the intra- and inter-observer variability of the laxity index (LI), used to quantify hip laxity on stress radiographs obtained with the Vezzoni-modified Badertscher distension device (VMBDD). Stress radiographs of 10 dogs obtained with the VMBDD were measured three times by an experienced observer. Six participants with different backgrounds (two ECVDI residents, two PhD students, two veterinary assistants) followed a short presentation and subsequently performed the measurements four times in two separate sessions. The effect of self-learning, feedback and specialization on the accuracy of the measurements was assessed. While the intra- and inter-observer variability were in agreement with other studies, the results of the experienced observer indicated that the variability can be very low. Neither feedback nor self-learning improved the results. A high degree of experience in radiographic assessment was not necessary to perform the measurements correctly. As the LI measurements were acceptable after a short presentation, they support the use of VMBDD for a complete and correct in-house evaluation of the hip joint by trained clinicians. However, we propose that, in the context of screening, measurements should be performed by a limited number of experienced examiners, to limit the impact of the inter-observer variability. Schattauer GmbH Stuttgart.

  7. Toward Meaningful Evaluation of Medical Trainees: The Influence of Participants' Perceptions of the Process

    ERIC Educational Resources Information Center

    Watling, Christopher J.; Lingard, Lorelei

    2012-01-01

    An essential goal of evaluation is to foster learning. Across the medical education spectrum, evaluation of clinical performance is dominated by subjective feedback to learners based on observation by expert supervisors. Research in non-medical settings has suggested that participants' perceptions of evaluation processes exert considerable…

  8. The Reliability of Classification Decisions for the Furtado-Gallagher Computerized Observational Movement Pattern Assessment System--FG-COMPASS

    ERIC Educational Resources Information Center

    Furtado, Ovande, Jr.; Gallagher, Jere D.

    2012-01-01

    Mastery of fundamental movement skills (FMS) is an important factor in preventing weight gain and increasing physical activity. To master FMS, performance evaluation is necessary. In this study, we investigated the reliability of a new observational assessment tool. In Phase I, 110 video clips of children performing five locomotor, and six…

  9. Development of an ideal observer that incorporates nuisance parameters and processes list-mode data

    DOE PAGES

    MacGahan, Christopher Jonathan; Kupinski, Matthew Alan; Hilton, Nathan R.; ...

    2016-02-01

    Observer models were developed to process data in list-mode format in order to perform binary discrimination tasks for use in an arms-control-treaty context. Data used in this study were generated using GEANT4 Monte Carlo simulations for photons using custom models of plutonium inspection objects and a radiation imaging system. Observer model performance was evaluated and presented using the area under the receiver operating characteristic curve. Lastly, we studied the ideal observer under both signal-known-exactly conditions and in the presence of unknowns such as object orientation and absolute count-rate variability; when these additional sources of randomness were present, their incorporation into the observer yielded superior performance.
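
As a rough illustration of the figure of merit mentioned above, the area under the ROC curve for a binary discrimination task can be estimated directly from observer test statistics. The sketch below uses the standard pairwise (Mann-Whitney) estimate. The function name and the scores are hypothetical and are not taken from the study.

```python
def empirical_auc(signal_scores, background_scores):
    """Estimate the area under the ROC curve from two sets of observer scores.

    Counts the fraction of (signal, background) pairs in which the signal
    case receives the higher score; ties count as half a win.
    """
    if not signal_scores or not background_scores:
        raise ValueError("both score lists must be non-empty")
    wins = 0.0
    for s in signal_scores:
        for b in background_scores:
            if s > b:
                wins += 1.0
            elif s == b:
                wins += 0.5
    return wins / (len(signal_scores) * len(background_scores))


# Hypothetical observer outputs for signal-present and signal-absent cases.
auc = empirical_auc([0.9, 0.8, 0.4], [0.5, 0.3, 0.2])
```

An AUC of 0.5 corresponds to chance performance and 1.0 to perfect discrimination, so observers can be ranked by this single number.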

  10. Evaluating health worker performance in Benin using the simulated client method with real children.

    PubMed

    Rowe, Alexander K; Onikpo, Faustin; Lama, Marcel; Deming, Michael S

    2012-10-08

    The simulated client (SC) method for evaluating health worker performance utilizes surveyors who pose as patients to make surreptitious observations during consultations. Compared to conspicuous observation (CO) by surveyors, which is commonly done in developing countries, SC data better reflect usual health worker practices. This information is important because CO can cause performance to be better than usual. Despite this advantage of SCs, the method's full potential has not been realized for evaluating performance for pediatric illnesses because real children have not been utilized as SCs. Previous SC studies used scenarios of ill children that were not actually brought to health workers. During a trial that evaluated a quality improvement intervention in Benin (the Integrated Management of Childhood Illness [IMCI] strategy), we conducted an SC survey with adult caretakers as surveyors and real children to evaluate the feasibility of this approach and used the results to assess the validity of CO. We conducted an SC survey and a CO survey (one right after the other) of health workers in the same 55 health facilities. A detailed description of the SC survey process was produced. Results of the two surveys were compared for 27 performance indicators using logistic regression modeling. SC and CO surveyors observed 54 and 185 consultations, respectively. No serious problems occurred during the SC survey. Performance levels measured by CO were moderately higher than those measured by SCs (median CO - SC difference = 16.4 percentage-points). Survey differences were sometimes much greater for IMCI-trained health workers (median difference = 29.7 percentage-points) than for workers without IMCI training (median difference = 3.1 percentage-points). SC surveys can be done safely with real children if appropriate precautions are taken. 
    CO can introduce moderately large positive biases, and these biases might be greater for health workers exposed to quality improvement interventions. http://clinicaltrials.gov Identifier NCT00510679.

  11. An Opinion of the Relationship Between Outpatient Department Organization and Evaluation of Clinical Competency

    ERIC Educational Resources Information Center

    Scherer, Paul R.

    1978-01-01

    Observations of a departmental and non-departmental outpatient service organization are compared regarding student attitude and evaluation, patient care, clinical exposure control, and faculty attitude and performance. Tables show the evaluation topics of each rotation in the departmental system. (LBH)

  12. GUM-compliant uncertainty propagations for Pu and U concentration measurements using the 1st-prototype XOS/LANL hiRX instrument; an SRNL H-Canyon Test Bed performance evaluation project

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Holland, Michael K.; O'Rourke, Patrick E.

    An SRNL H-Canyon Test Bed performance evaluation project was completed jointly by SRNL and LANL on a prototype monochromatic energy dispersive x-ray fluorescence instrument, the hiRX. A series of uncertainty propagations were generated based upon plutonium and uranium measurements performed using the alpha-prototype hiRX instrument. Data reduction and uncertainty modeling provided in this report were performed by the SRNL authors. Observations and lessons learned from this evaluation were also used to predict the expected uncertainties that should be achievable at multiple plutonium and uranium concentration levels provided instrument hardware and software upgrades being recommended by LANL and SRNL are performed.
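
For orientation, the general form of an uncertainty propagation in the sense of the Guide to the Expression of Uncertainty in Measurement (GUM) is shown below for uncorrelated input quantities. This is the generic textbook relation, not the specific measurement model of the report. The symbols $f$, $x_i$, $u$, and $k$ are the usual GUM notation and do not appear in the abstract.

```latex
% Measurement model: y = f(x_1, ..., x_N), inputs assumed uncorrelated
u_c^2(y) = \sum_{i=1}^{N} \left( \frac{\partial f}{\partial x_i} \right)^{2} u^2(x_i),
\qquad
U = k \, u_c(y)
```

Here $u_c(y)$ is the combined standard uncertainty of the result and $U$ is the expanded uncertainty for a chosen coverage factor $k$ (commonly $k = 2$).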

  13. Evaluation of Knowledge Development in a Healthcare Setting

    NASA Astrophysics Data System (ADS)

    Schaffer, Scott P.

    Healthcare organizations worldwide have recently increased efforts to improve performance, quality, and knowledge transfer using information and communication technologies. Evaluation of the effectiveness and quality of such efforts is challenging. A macro and micro-level system evaluation conducted with a 14000 member US hospital administrative services organization examined the appropriateness of a blended face-to-face and technology-enabled performance improvement and knowledge development system. Furthermore, a successful team or microsystem in a high performing hospital was studied in-depth. Several types of data methods including interview, observation, and questionnaire were used to address evaluation questions within a knowledge development framework created for the study. Results of this preliminary study focus on how this organization attempted to organize clinical improvement efforts around quality and performance improvement processes supported by networked technologies.

  14. How accurately do drivers evaluate their own driving behavior? An on-road observational study.

    PubMed

    Amado, Sonia; Arıkan, Elvan; Kaça, Gülin; Koyuncu, Mehmet; Turkan, B Nilay

    2014-02-01

    Self-assessment of driving skills has become a noteworthy research subject in traffic psychology, since by knowing their strengths and weaknesses, drivers can take efficient compensatory action to moderate risk and ensure safety in hazardous environments. The current study aims to investigate drivers' self-conception of their own driving skills and behavior in relation to expert evaluations of their actual driving, using a naturalistic and systematic observation method during an actual on-road driving session, and to assess the different aspects of driving via comprehensive scales sensitive to specific aspects of driving. Male participants aged 19-63 years (N=158) attended an on-road driving session lasting approximately 80 min (45 km). During the driving session, drivers' errors and violations were recorded by an expert observer. At the end of the driving session, observers completed the driver evaluation questionnaire, while drivers completed the driving self-evaluation questionnaire and the Driver Behavior Questionnaire (DBQ). Low to moderate correlations were found between driver and observer evaluations of driving skills and behavior, mainly on errors and violations of speed and traffic lights. Furthermore, the robust finding that drivers evaluate their driving performance as better than the expert does was replicated. Over-positive appraisal was higher among drivers with higher error/violation scores and among those evaluated by the expert as "unsafe". We suggest that the traffic environment might be regulated by increasing feedback indicators of errors and violations, which in turn might increase insight into driving performance. Improving self-awareness through training and feedback sessions might play a key role in reducing the probability of risk in driving. Copyright © 2013 Elsevier Ltd. All rights reserved.

  15. Evaluation of concrete pavements with materials-related distress : appendix G.

    DOT National Transportation Integrated Search

    2010-03-02

    An evaluation of cores sampled from six concrete pavements was performed. Factors contributing to pavement distress observed in the field were determined, including expansive alkali-silica reactivity and freeze-thaw deterioration related to poor entr...

  16. Evaluation of concrete pavements with materials-related distress : final report.

    DOT National Transportation Integrated Search

    2010-03-02

    An evaluation of cores sampled from six concrete pavements was performed. Factors contributing to pavement distress observed in the field were determined, including expansive alkali-silica reactivity and freeze-thaw deterioration related to poor ...

  17. Evaluation of concrete pavements with materials-related distress : appendix F.

    DOT National Transportation Integrated Search

    2010-03-02

    An evaluation of cores sampled from six concrete pavements was performed. Factors contributing to pavement distress observed in the field were determined, including expansive alkali-silica reactivity and freeze-thaw deterioration related to poor entr...

  18. Evaluation of concrete pavements with materials-related distress : appendix E.

    DOT National Transportation Integrated Search

    2010-03-02

    An evaluation of cores sampled from six concrete pavements was performed. Factors contributing to pavement distress observed in the field were determined, including expansive alkali-silica reactivity and freeze-thaw deterioration related to poor entr...

  19. Evaluation of concrete pavements with materials-related distress : appendix D.

    DOT National Transportation Integrated Search

    2010-03-02

    An evaluation of cores sampled from six concrete pavements was performed. Factors contributing to pavement distress observed in the field were determined, including expansive alkali-silica reactivity and freeze-thaw deterioration related to poor entr...

  20. Evaluation of concrete pavements with materials-related distress : appendix B.

    DOT National Transportation Integrated Search

    2010-02-02

    An evaluation of cores sampled from six concrete pavements was performed. Factors contributing to pavement distress observed in the field were determined, including expansive alkali-silica reactivity and freeze-thaw deterioration related to poor entr...

  1. Evaluation of concrete pavements with materials-related distress : appendix C.

    DOT National Transportation Integrated Search

    2010-03-02

    An evaluation of cores sampled from six concrete pavements was performed. Factors contributing to pavement distress observed in the field were determined, including expansive alkali-silica reactivity and freeze-thaw deterioration related to poor entr...

  2. A short-term clinical evaluation of IPS Empress 2 crowns.

    PubMed

    Toksavul, Suna; Toman, Muhittin

    2007-01-01

    The aim of this study was to evaluate the clinical performance of all-ceramic crowns made with the IPS Empress 2 system after an observation period of 12 to 60 months. Seventy-nine IPS Empress 2 crowns were placed in 21 patients. The all-ceramic crowns were evaluated clinically, radiographically, and using clinical photographs. The evaluations took place at baseline (2 days after cementation) and at 6-month intervals for 12 to 60 months. Survival rate of the crowns was determined using Kaplan-Meier statistical analysis. Based on the US Public Health Service criteria, 95.24% of the crowns were rated satisfactory after a mean follow-up period of 58 months. Fracture was registered in only 1 crown. One endodontically treated tooth failed as a result of fracture at the cervical margin area. In this in vivo study, IPS Empress 2 crowns exhibited a satisfactory clinical performance during an observation period ranging from 12 to 60 months.

  3. Rural Principals and the North Carolina Teacher Evaluation Process: How Has the Transition from the TPAI-R to the New Evaluation Process Changed Principals' Evaluative Practices?

    ERIC Educational Resources Information Center

    Fuller, Charles Avery

    2016-01-01

    Beginning with the 2010-2011 school year the North Carolina State Board of Education (SBE) mandated the use of the North Carolina Teacher Evaluation Process (Evaluation Process) for use in all public school systems in the state to conduct teacher observations and evaluations. The Evaluation Process replaced the Teacher Performance Appraisal…

  4. Getting Classroom Observations Right

    ERIC Educational Resources Information Center

    Whitehurst, Grover; Chingos, Matthew M.; Lindquist, Katharine

    2015-01-01

    This article contributes to the body of knowledge on teacher evaluation systems by examining the actual design and performance of new teacher-evaluation systems in four school districts that are at the forefront of the effort to evaluate teachers meaningfully. The authors find first that the ratings assigned teachers by the districts'…

  5. A Literature Survey and Experimental Evaluation of the State-of-the-Art in Uplift Modeling: A Stepping Stone Toward the Development of Prescriptive Analytics.

    PubMed

    Devriendt, Floris; Moldovan, Darie; Verbeke, Wouter

    2018-03-01

    Prescriptive analytics extends predictive analytics by estimating an outcome as a function of control variables, which makes it possible to establish the level of control variables required to realize a desired outcome. Uplift modeling is at the heart of prescriptive analytics and aims at estimating the net difference in an outcome resulting from a specific action or treatment that is applied. In this article, a structured and detailed literature survey on uplift modeling is provided by identifying and contrasting various groups of approaches. In addition, evaluation metrics for assessing the performance of uplift models are reviewed. An experimental evaluation on four real-world data sets provides further insight into their use. Uplift random forests are found to be consistently among the best performing techniques in terms of the Qini and Gini measures, although considerable variability in performance across the various data sets of the experiments is observed. In addition, uplift models are frequently observed to be unstable and display a strong variability in terms of performance across different folds in the cross-validation experimental setup. This potentially threatens their actual use for business applications. Moreover, it is found that the available evaluation metrics do not provide an intuitively understandable indication of the actual use and performance of a model. Specifically, existing evaluation metrics do not facilitate a comparison of uplift models and predictive models and evaluate performance either at an arbitrary cutoff or over the full spectrum of potential cutoffs. In conclusion, we highlight the instability of uplift models and the need for an application-oriented approach to assess uplift models as prime topics for further research.
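
The quantity that uplift modeling targets, the net difference in an outcome attributable to a treatment, can be illustrated at its simplest as a difference in response rates between a treated and a control group. The sketch below is only that baseline estimate and not one of the surveyed techniques (it is not an uplift random forest and does not compute the Qini measure). The function name and the records are hypothetical.

```python
def observed_uplift(records):
    """Estimate uplift as response rate (treated) minus response rate (control).

    records: iterable of (treated, outcome) pairs, where treated is a bool
    and outcome is 1 for a positive response and 0 otherwise.
    """
    treated = [outcome for was_treated, outcome in records if was_treated]
    control = [outcome for was_treated, outcome in records if not was_treated]
    if not treated or not control:
        raise ValueError("need at least one treated and one control record")
    return sum(treated) / len(treated) - sum(control) / len(control)


# Hypothetical campaign data: four treated customers, four controls.
records = [
    (True, 1), (True, 1), (True, 0), (True, 1),
    (False, 1), (False, 0), (False, 0), (False, 0),
]
uplift = observed_uplift(records)
```

An uplift model would estimate this difference per individual or segment instead of for the whole population, which is what allows treatments to be targeted.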

  6. 49 CFR 195.505 - Qualification program.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... that the individual's performance of a covered task contributed to an accident as defined in Part 195... tasks; (b) Ensure through evaluation that individuals performing covered tasks are qualified; (c) Allow individuals that are not qualified pursuant to this subpart to perform a covered task if directed and observed...

  7. Evaluation of Administrators: Issues and Practices. OSSC Bulletin Vol. 19, No. 10.

    ERIC Educational Resources Information Center

    Wills, Lewis A.

    In this review of current practices it is observed that administrators are evaluated for two major purposes--(1) to provide a basis for school districts' decisions at the conclusion of the evaluation period, and (2) to provide feedback on performance to allow administrator improvement. A comparison is made of evaluation material from five school…

  8. A Skill Score of Trajectory Model Evaluation Using Reinitialized Series of Normalized Cumulative Lagrangian Separation

    NASA Astrophysics Data System (ADS)

    Liu, Y.; Weisberg, R. H.

    2017-12-01

    The Lagrangian separation distance between the endpoints of simulated and observed drifter trajectories is often used to assess the performance of numerical particle trajectory models. However, the separation distance fails to indicate relative model performance in weak and strong current regions, such as a continental shelf and its adjacent deep ocean. A skill score is proposed based on the cumulative Lagrangian separation distances normalized by the associated cumulative trajectory lengths. The new metric correctly indicates the relative performance of the Global HYCOM in simulating the strong currents of the Gulf of Mexico Loop Current and the weaker currents of the West Florida Shelf in the eastern Gulf of Mexico. In contrast, the Lagrangian separation distance alone gives a misleading result. Also, the observed drifter position series can be used to reinitialize the trajectory model and evaluate its performance along the observed trajectory, not just at the drifter end position. The proposed dimensionless skill score is particularly useful when the number of drifter trajectories is limited and neither a conventional Eulerian-based velocity nor a Lagrangian-based probability density function may be estimated.
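
One plausible reading of the skill score described above is sketched here: the separation distances between simulated and observed positions are summed over time, divided by the sum of the cumulative observed trajectory lengths, and the resulting index is mapped to a score between 0 and 1. The exact definition should be checked against the paper. The tolerance threshold, the use of planar (x, y) coordinates instead of geodesic distances, and all names are assumptions made for illustration.

```python
import math


def trajectory_skill_score(simulated, observed, tolerance=1.0):
    """Skill score from normalized cumulative Lagrangian separation.

    simulated, observed: equal-length lists of (x, y) positions sampled at
    the same times and sharing the same starting point.
    tolerance: assumed threshold on the normalized separation above which
    the model is considered to have no skill.
    """
    if len(simulated) != len(observed) or len(observed) < 2:
        raise ValueError("need two equal-length trajectories with >= 2 points")
    cumulative_length = 0.0  # observed path length up to the current time
    sum_separation = 0.0
    sum_length = 0.0
    for i in range(1, len(observed)):
        cumulative_length += math.dist(observed[i - 1], observed[i])
        sum_separation += math.dist(simulated[i], observed[i])
        sum_length += cumulative_length
    if sum_length == 0.0:
        raise ValueError("observed drifter did not move")
    normalized = sum_separation / sum_length
    return max(0.0, 1.0 - normalized / tolerance)


# Hypothetical trajectories in arbitrary planar units.
observed_track = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)]
simulated_track = [(0.0, 0.0), (1.0, 0.5), (2.0, 1.0)]
score = trajectory_skill_score(simulated_track, observed_track)
```

Because the separation is divided by the distance the drifter actually travelled, the same absolute error is penalized more heavily in a weak-current region than in a strong-current one, which is the behaviour the abstract describes.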

  9. Syndromic craniosynostosis: neuropsycholinguistic abilities and imaging analysis of the central nervous system.

    PubMed

    Maximino, Luciana Paula; Ducati, Luis Gustavo; Abramides, Dagma Venturini Marques; Corrêa, Camila de Castro; Garcia, Patrícia Fernandes; Fernandes, Adriano Yacubian

    2017-12-01

    To characterize patients with syndromic craniosynostosis with respect to their neuropsycholinguistic abilities and to present these findings together with the brain abnormalities. Eighteen patients with a diagnosis of syndromic craniosynostosis were studied. Eight patients had Apert syndrome and 10 had Crouzon syndrome. They were submitted to phonological evaluation, neuropsychological evaluation and magnetic resonance imaging of the brain. The phonological evaluation was done by behavioral observation of the language, the Peabody test, Token test and a school achievement test. The neuropsychological evaluation included the WISC III and WAIS tests. Abnormalities in language abilities were observed and the school achievement test showed abnormalities in 66.67% of the patients. A normal intelligence quotient was observed in 39.3% of the patients, and congenital abnormalities of the central nervous system were observed in 46.4% of the patients. Abnormalities of language abilities were observed in the majority of patients with syndromic craniosynostosis, and low cognitive performance was also observed.

  10. Influence of bromide on the performance of the amphipod Hyalella azteca in reconstituted waters

    USGS Publications Warehouse

    Ivey, Chris D.; Ingersoll, Christopher G.

    2016-01-01

    Poor performance of the amphipod Hyalella azteca has been observed in exposures using reconstituted waters. Previous studies have reported success in H. azteca water-only exposures with the addition of relatively high concentrations of bromide. The present study evaluated the influence of lower environmentally representative concentrations of bromide on the response of H. azteca in 42-d water-only exposures. Improved performance of H. azteca was observed in reconstituted waters with >0.02 mg Br/L.

  11. Evaluating supplier quality performance using analytical hierarchy process

    NASA Astrophysics Data System (ADS)

    Kalimuthu Rajoo, Shanmugam Sundram; Kasim, Maznah Mat; Ahmad, Nazihah

    2013-09-01

    This paper elaborates on the importance of evaluating supplier quality performance to an organization. Supplier quality performance evaluation reflects the actual performance of the supplier as exhibited at the customer's end. It is critical in enabling the organization to determine areas for improvement and thereafter work with the supplier to close the gaps. The success of the customer partly depends on the supplier's quality performance. Key criteria such as quality, cost, delivery, technology support and customer service are categorized as the main factors contributing to a supplier's quality performance. Eighteen suppliers that manufactured automotive application parts were evaluated in 2010 using a weight point system. Several suppliers received the same rating, which led to shared rankings. The Analytical Hierarchy Process (AHP), a user-friendly decision-making tool for complex, multi-criteria problems, was used to evaluate the suppliers' quality performance, challenging the weight point system that had been used for the 18 suppliers. The consistency ratio was checked for criteria and sub-criteria. The final AHP results contained no overlapping ratings, and AHP therefore yielded a better decision-making methodology than the weight point rating system.
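
The consistency check mentioned above can be sketched as follows: criteria weights are taken from the principal eigenvector of a pairwise comparison matrix, and the consistency ratio compares the principal eigenvalue against Saaty's random index. This is a generic AHP sketch, not the authors' calculation. The comparison values are hypothetical, and only three of the five criteria are used to keep the example short.

```python
# Saaty's random consistency index for matrices of order 3 to 9.
RANDOM_INDEX = {3: 0.58, 4: 0.90, 5: 1.12, 6: 1.24, 7: 1.32, 8: 1.41, 9: 1.45}


def _multiply(matrix, vector):
    """Multiply a square matrix by a vector."""
    return [sum(a * v for a, v in zip(row, vector)) for row in matrix]


def ahp_weights(matrix, iterations=100):
    """Return (weights, lambda_max) via power iteration on the matrix."""
    n = len(matrix)
    weights = [1.0 / n] * n
    for _ in range(iterations):
        product = _multiply(matrix, weights)
        total = sum(product)
        weights = [p / total for p in product]
    product = _multiply(matrix, weights)
    lambda_max = sum(p / w for p, w in zip(product, weights)) / n
    return weights, lambda_max


def consistency_ratio(matrix):
    """Consistency ratio CR = CI / RI, with CI = (lambda_max - n) / (n - 1)."""
    n = len(matrix)
    _, lambda_max = ahp_weights(matrix)
    consistency_index = (lambda_max - n) / (n - 1)
    return consistency_index / RANDOM_INDEX[n]


# Hypothetical pairwise comparisons for quality, cost and delivery:
# quality is judged 2x as important as cost and 4x as important as delivery.
comparisons = [
    [1.0, 2.0, 4.0],
    [0.5, 1.0, 2.0],
    [0.25, 0.5, 1.0],
]
weights, _ = ahp_weights(comparisons)
ratio = consistency_ratio(comparisons)
```

A consistency ratio below about 0.10 is conventionally treated as acceptable. The example matrix is perfectly consistent, so its ratio is effectively zero.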

  12. Evaluation of a numerical model's ability to predict bed load transport observed in braided river experiments

    NASA Astrophysics Data System (ADS)

    Javernick, Luke; Redolfi, Marco; Bertoldi, Walter

    2018-05-01

    New data collection techniques offer numerical modelers the ability to gather and utilize high quality data sets with high spatial and temporal resolution. Such data sets are currently needed for calibration, verification, and to fuel future model development, particularly morphological simulations. This study explores the use of high quality spatial and temporal data sets of observed bed load transport in braided river flume experiments to evaluate the ability of a two-dimensional model, Delft3D, to predict bed load transport. This study uses a fixed bed model configuration and examines the model's shear stress calculations, which are the foundation to predict the sediment fluxes necessary for morphological simulations. The evaluation is conducted for three flow rates, and model setup used highly accurate Structure-from-Motion (SfM) topography and discharge boundary conditions. The model was hydraulically calibrated using bed roughness, and performance was evaluated based on depth and inundation agreement. Model bed load performance was evaluated in terms of critical shear stress exceedance area compared to maps of observed bed mobility in a flume. Following the standard hydraulic calibration, bed load performance was tested for sensitivity to horizontal eddy viscosity parameterization and bed morphology updating. Simulations produced depth errors equal to the SfM inherent errors, inundation agreement of 77-85%, and critical shear stress exceedance in agreement with 49-68% of the observed active area. This study provides insight into the ability of physically based, two-dimensional simulations to accurately predict bed load as well as the effects of horizontal eddy viscosity and bed updating. Further, this study highlights how using high spatial and temporal data to capture the physical processes at work during flume experiments can help to improve morphological modeling.

  13. Visual-search model observer for assessing mass detection in CT

    NASA Astrophysics Data System (ADS)

    Karbaschi, Zohreh; Gifford, Howard C.

    2017-03-01

    Our aim is to devise model observers (MOs) to evaluate acquisition protocols in medical imaging. To optimize protocols for human observers, an MO must reliably interpret images containing quantum and anatomical noise under aliasing conditions. In this study of sampling parameters for simulated lung CT, the lesion-detection performance of human observers was compared with that of visual-search (VS) observers, a channelized nonprewhitening (CNPW) observer, and a channelized Hotelling (CH) observer. Scans of a mathematical torso phantom modeled single-slice parallel-hole CT with varying numbers of detector pixels and angular projections. Circular lung lesions had a fixed radius. Two-dimensional FBP reconstructions were performed. A localization ROC study was conducted with the VS, CNPW and human observers, while the CH observer was applied in a location-known ROC study. Changing the sampling parameters had negligible effect on the CNPW and CH observers, whereas several VS observers demonstrated a sensitivity to sampling artifacts that was in agreement with how the humans performed.

  14. Group 3: Performance evaluation and assessment

    NASA Technical Reports Server (NTRS)

    Frink, A.

    1981-01-01

    Line-oriented flight training provides a unique learning experience and an opportunity to look at aspects of performance that other types of training did not provide. Areas such as crew coordination, resource management, leadership, and so forth, can be readily evaluated in such a format. While individual performance is of the utmost importance, crew performance deserves equal emphasis; therefore, these areas should be carefully observed by the instructors as an area for discussion in the same way that individual performance is observed. To be effective, it must be accepted by the crew members, and administered by the instructors as pure training-learning through experience. To keep open minds, to benefit most from the experience, both in the doing and in the follow-on discussion, it is essential that it be entered into with a feeling of freedom, openness, and enthusiasm. Reserve or defensiveness because of concern for failure must not inhibit participation.

  15. Program management model study

    NASA Technical Reports Server (NTRS)

    Connelly, J. J.; Russell, J. E.; Seline, J. R.; Sumner, N. R., Jr.

    1972-01-01

    Two models, a system performance model and a program assessment model, have been developed to assist NASA management in the evaluation of development alternatives for the Earth Observations Program. Two computer models were developed and demonstrated on the Goddard Space Flight Center Computer Facility. Procedures have been outlined to guide the user of the models through specific evaluation processes, and the preparation of inputs describing earth observation needs and earth observation technology. These models are intended to assist NASA in increasing the effectiveness of the overall Earth Observation Program by providing a broader view of system and program development alternatives.

  16. Modulation of the brain activity in outcome evaluation by the presence of an audience: An electrophysiological investigation.

    PubMed

    Tian, Tengxiang; Feng, Xue; Gu, Ruolei; Broster, Lucas S; Feng, Chunliang; Wang, Lili; Guan, Qing; Luo, Yue-Jia

    2015-07-30

    The audience effect refers to the phenomenon that one's performance on a task is affected by the presence of others. Here we investigated how the audience effect modulates the neurocognitive signatures underlying people's evaluation of their own task performance/outcome. Participants in our study played a gambling game in two social contexts: an "audience" condition and an "alone" condition. The presence of others modulated the feedback-related negativity (FRN), which might reflect enhanced motivational significance or increased reward processing when participants were watched compared to when they were alone. We also observed increased P300 responses to outcome feedback in the audience condition, presumably reflecting more elaborative and sustained evaluation of outcomes in the audience than alone context. This audience effect on the evaluative processes complements previous observations on the social nature of outcome evaluation and extends a traditional topic in social psychology to the neuroscientific field. Copyright © 2015 Elsevier B.V. All rights reserved.

  17. Comparison of human observer and algorithmic target detection in nonurban forward-looking infrared imagery

    NASA Astrophysics Data System (ADS)

    Weber, Bruce A.

    2005-07-01

    We have performed an experiment that compares the performance of human observers with that of a robust algorithm for the detection of targets in difficult, nonurban forward-looking infrared imagery. Our purpose was to benchmark the comparison and document performance differences for future algorithm improvement. The scale-insensitive detection algorithm, used as a benchmark by the Night Vision Electronic Sensors Directorate for algorithm evaluation, employed a combination of contrastlike features to locate targets. Detection receiver operating characteristic curves and observer-confidence analyses were used to compare human and algorithmic responses and to gain insight into differences. The test database contained ground targets, in natural clutter, whose detectability, as judged by human observers, ranged from easy to very difficult. In general, as compared with human observers, the algorithm detected most of the same targets, but correlated confidence with correct detections poorly and produced many more false alarms at any useful level of performance. Though characterizing human performance was not the intent of this study, results suggest that previous observational experience was not a strong predictor of human performance, and that combining individual human observations by majority vote significantly reduced false-alarm rates.
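
    The majority-vote combination mentioned in this abstract can be illustrated with a minimal sketch. The data layout (one 0/1 flag per observer per candidate location), the strict-majority tie rule, and the definition of false-alarm rate as the fraction of non-target candidates flagged are all assumptions made for illustration; the abstract does not state how the study implemented them. The function names are hypothetical.

```python
def majority_vote(detections_by_observer):
    """Combine per-observer 0/1 detection flags by strict majority.

    detections_by_observer: list of equal-length lists, one per observer.
    Assumption: a tie counts as "not detected".
    """
    n_obs = len(detections_by_observer)
    if n_obs == 0:
        raise ValueError("need at least one observer")
    n_items = len(detections_by_observer[0])
    if any(len(d) != n_items for d in detections_by_observer):
        raise ValueError("all observers must rate the same candidates")
    # zip(*...) regroups the flags so each tuple holds all votes for one candidate
    return [sum(votes) * 2 > n_obs for votes in zip(*detections_by_observer)]


def false_alarm_rate(decisions, truth):
    """Fraction of non-target candidates that were flagged (assumed definition)."""
    flagged_negatives = [bool(d) for d, t in zip(decisions, truth) if not t]
    if not flagged_negatives:
        raise ValueError("no non-target candidates to score")
    return sum(flagged_negatives) / len(flagged_negatives)


# Hypothetical data: three observers, four candidates, only the first is a target.
observers = [[1, 1, 0, 0], [1, 0, 1, 0], [1, 0, 0, 1]]
truth = [1, 0, 0, 0]
combined = majority_vote(observers)
```

    In this made-up example each observer raises one false alarm on a different candidate, so no false alarm reaches a majority and the combined decision keeps only the true target.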

  18. Evaluation of concrete pavements with materials-related distress : appendix A, part 1.

    DOT National Transportation Integrated Search

    2010-03-02

    An evaluation of cores sampled from six concrete pavements was performed. Factors contributing to pavement distress observed in the field were determined, including expansive alkali-silica reactivity and freeze-thaw deterioration related to poor entr...

  19. Evaluation of concrete pavements with materials-related distress : appendix A, part 3.

    DOT National Transportation Integrated Search

    2010-03-02

    An evaluation of cores sampled from six concrete pavements was performed. Factors contributing to pavement distress observed in the field were determined, including expansive alkali-silica reactivity and freeze-thaw deterioration related to poor entr...

  20. Evaluation of concrete pavements with materials-related distress : appendix A, part 2.

    DOT National Transportation Integrated Search

    2010-03-02

    An evaluation of cores sampled from six concrete pavements was performed. Factors contributing to pavement distress observed in the field were determined, including expansive alkali-silica reactivity and freeze-thaw deterioration related to poor entr...

  1. Noninvasive evaluation system of fractured bone based on speckle interferometry

    NASA Astrophysics Data System (ADS)

    Yamanada, Shinya; Murata, Shigeru; Tanaka, Yohsuke

    2010-11-01

    This paper presents a noninvasive evaluation system of fractured bone based on speckle interferometry using a modified evaluation index for higher performance. Experiments are carried out to examine its feasibility in evaluating bone fracture healing and the influence of some system parameters on the performance. The experimental results show that the presence of a fractured part of the bone and the state of bone fracture healing are successfully estimated by observing fine speckle fringes on the object surface. The proposed evaluation index can also successfully express the difference between the cases with and without a cut. Since most system parameters are found not to affect the performance of the present technique, it is expected to be applicable to various patients who have considerable individual variability.

  2. Values of a Patient and Observer Scar Assessment Scale to Evaluate the Facial Skin Graft Scar

    PubMed Central

    Chae, Jin Kyung; Kim, Eun Jung; Park, Kun

    2016-01-01

    Background: The patient and observer scar assessment scale (POSAS) recently emerged as a promising method, reflecting both observer's and patient's opinions in evaluating scars. This tool was shown to be consistent and reliable in burn scar assessment, but it has not been tested in the setting of skin graft scars in skin cancer patients. Objective: To evaluate facial skin graft scars using the POSAS and to compare it with objective scar assessment tools. Methods: Twenty-three patients who were diagnosed with facial cutaneous malignancy and received skin grafts after Mohs micrographic surgery were recruited. Observer assessment was performed by three independent raters using the observer component of the POSAS and the Vancouver scar scale (VSS). Patient self-assessment was performed using the patient component of the POSAS. To quantify scar color and scar thickness more objectively, a spectrophotometer and ultrasonography were applied. Results: Inter-observer reliability was substantial with both the VSS and the observer component of the POSAS (average-measure intraclass correlation coefficient, 0.76 and 0.80, respectively). The observer component consistently showed significant correlations with patients' ratings for the parameters of the POSAS (all p-values<0.05). The correlation between subjective assessment using the POSAS and objective assessment using the spectrophotometer and ultrasonography was low. Conclusion: In facial skin graft scar assessment in skin cancer patients, the POSAS showed acceptable inter-observer reliability. This tool was more comprehensive and had higher correlation with patient's opinion. PMID:27746642

  3. Quality Lies in the Eyes of the Beholder: A Mismatch between Student Evaluation and Peer Observation of Teaching

    ERIC Educational Resources Information Center

    Hassan, Salochana; Wium, Wouter

    2014-01-01

    The study described in this article was prompted by the poor performance of students in an "at risk subject" in a science faculty at a university in South Africa. Teacher performance could contribute to poor performance among students, therefore the performance of one of the science teachers whose students were performing poorly was…

  4. Self-rated and observer-rated measures of well-being and distress in adolescence: an exploratory study.

    PubMed

    Vescovelli, Francesca; Albieri, Elisa; Ruini, Chiara

    2014-01-01

    The evaluation of eudaimonic well-being in adolescence is hampered by the lack of specific assessment tools. Moreover, with younger populations, the assessment of positive functioning may be biased when it relies on self-report data only, and may be more accurate when significant adults' evaluations are added. The objective of this research was to measure adolescents' well-being and prosocial behaviours using self-rated and observer-rated instruments, and their pattern of associations. The sample included 150 Italian high school adolescents. Observer evaluation was performed by their school teachers using the Strengths and Difficulties Questionnaire. Adolescents completed Ryff's Psychological Well-being Scales and the Symptom Questionnaire. Pearson's r correlations and linear regression were performed. Self-rated dimensions of psychological well-being significantly correlated with all observer-rated dimensions except the Strengths and Difficulties Emotional Symptoms scale. Multiple linear regression showed that the self-rated dimensions Environmental Mastery and Personal Growth, and surprisingly not Positive Relations, are related to the observer-rated dimension Prosocial Behaviour. Adolescents with higher levels of well-being in specific dimensions tend to be perceived as less problematic by their teachers. However, some dimensions of positive functioning present discrepancies between self-rated and observer-rated instruments. Thus, the combined use of self-reports and observer-rated tools for a more comprehensive assessment of students' eudaimonic well-being is recommended.

  5. Technical Evaluation Motor No. 7 (TEM-7)

    NASA Technical Reports Server (NTRS)

    Hughes, Phil

    1991-01-01

    The Technical Evaluation Motor No. 7 (TEM-7) test was a full-scale, full duration static test firing of a high performance motor-configuration solid rocket motor with nozzle vectoring. The final test report documents the procedures, performance, and results of the static test firing of TEM-7. All observations, discussions, conclusions, and recommendations included in the report are complete and final except for the TEM-7 fixed housing unbond investigation. A presentation and discussion of TEM-7 performance, anomalies, and test result concurrence with the objectives outlined in CTP-0107, Rev A, Space Shuttle Technical Evaluation Motor No. 7 (TEM-7) Static Fire Test Plan are included.

  6. An evaluation of training effectiveness of an intelligent tutoring system

    NASA Technical Reports Server (NTRS)

    Johnson, Debra Steele; Pieper, Kalen F.; Culbert, Chris

    1992-01-01

    The study evaluated the training effectiveness of an intelligent tutoring system (ITS) for the Remote Manipulator System (RMS). The study examined how well individuals learn the training content and skills from the RMS ITS and to what extent the content and skills learned using the ITS transfer to RMS task performance in the SES, a high fidelity simulator. Three astronauts completed eight 2-hour ITS sessions addressing movement in three coordinate systems; grapple, ungrapple, berth, and unberth procedures; and singularities and reach limits. Their performance was also observed in an SES training session. Performance data were collected using multiple measures: ITS task performance, transfer performance on the SES, a conceptual knowledge test, an opinion survey completed by astronauts, and comments and observations from astronauts and trainers. Results indicated the RMS ITS to be moderately effective and provided evidence of the efficacy of ITSs in general. Comments and suggestions are provided on how the ITS could be improved and to enable decision makers to judge the effectiveness of the RMS ITS.

  7. Keto analogue and amino acid supplementation and its effects on ammonemia and performance under thermoneutral conditions.

    PubMed

    Camerino, Saulo Rodrigo Alves e Silva; Lima, Rafaela Carvalho Pereira; França, Thássia Casado Lima; Herculano, Edla de Azevedo; Rodrigues, Daniela Souza Araújo; Gouveia, Marcos Guilherme de Sousa; Cameron, L C; Prado, Eduardo Seixas

    2016-02-01

    Alterations of cerebral function, fatigue and disturbance in cognitive-motor performance can be caused by hyperammonemia and/or hot environmental conditions during exercise. Exercise-induced hyperammonemia can be reduced through supplementation with either amino acids or combined keto analogues and amino acids (KAAA) to improve exercise tolerance. In the present study, we evaluated KAAA supplementation on ammonia metabolism and cognitive-motor performance after high-intensity exercise under a low heat stress environment. Sixteen male cyclists received a ketogenic diet for 2 d and were divided into two groups, KAAA (KEx) or placebo (CEx) supplementation. The athletes performed a 2 h cycling session followed by a maximum test (MAX), and blood samples were obtained at rest and during exercise. Cognitive-motor tasks were performed before and after the protocol, and the exhaustion time was used to evaluate physical performance. The hydration status was also evaluated. The CEx group showed a significant increase (∼ 70%) in ammonia concentration at MAX, which did not change in the KEx group. The non-supplemented group showed a significant increase in uremia. Both the groups had a significant increase in blood urate concentrations at 120 min, and an early significant increase from 120 min was observed in the CEx group. There was no change in the glucose concentrations of the two groups. A significant increase in lactate was observed at the MAX moment in both groups. There was no significant difference in the exhaustion times between the groups. No changes were observed in the cognitive-motor tasks after the protocol. We suggest that KAAA supplementation decreases ammonia concentration during high-intensity exercise but does not affect physical or cognitive-motor performances under a low heat stress environment.

  8. Properties of the Multiple Measures in Arizona's Teacher Evaluation Model. REL 2015-050

    ERIC Educational Resources Information Center

    Lazarev, Valeriy; Newman, Denis; Sharp, Alyssa

    2014-01-01

    This study explored the relationships among the components of the Arizona Department of Education's new teacher evaluation model, with a particular focus on the extent to which ratings from the state model's teacher observation instrument differentiated higher and lower performance. The study used teacher-level evaluation data collected by the…

  9. Evaluation of climatic changes in South-Asia

    NASA Astrophysics Data System (ADS)

    Kjellstrom, Erik; Rana, Arun; Grigory, Nikulin; Renate, Wilcke; Hansson, Ulf; Kolax, Michael

    2016-04-01

    The literature provides sufficient evidence of climate change impact all over the world and across various sectors. In light of new advancements made in climate modeling, the availability of several climate downscaling approaches, and more robust bias correction methods of varying complexity and strength, in the present study we performed a systematic evaluation of climate change impact over the South-Asia region. We used different Regional Climate Models (RCMs) (from the CORDEX domain), Global Climate Models (GCMs) and gridded observations for the study area to evaluate the models in the historical/control period (1980-2010) and changes in the future period (2010-2099). Firstly, GCMs and RCMs are evaluated against the gridded observational datasets in the area using precipitation and temperature as indicative variables. Observational datasets are also evaluated against a reliable set of observational datasets, as pointed out in the literature. Bias, correlation, and changes (among other statistical measures) are calculated for the entire region and both variables. Eventually, the region was subdivided into various smaller domains based on homogeneous precipitation zones to evaluate the average changes over the time period. Spatial and temporal changes for the region are then finally calculated to evaluate the future changes in the region. Future changes are calculated for 2 Representative Concentration Pathways (RCPs), the middle emission (RCP4.5) and high emission (RCP8.5), and for both climatic variables, precipitation and temperature. Lastly, evaluation of extremes is performed based on precipitation- and temperature-based indices for the whole region in the future dataset. Results indicate that the whole study region is under extreme stress in future climate scenarios for both climatic variables, i.e. precipitation and temperature. Precipitation variability depends on the location in the area, leading to droughts and floods in various regions in the future. Temperature shows a consistent increase throughout the region regardless of location.

  10. The role of peer-assisted learning in building evaluative judgement: opportunities in clinical medical education.

    PubMed

    Tai, Joanna Hong-Meng; Canny, Benedict J; Haines, Terry P; Molloy, Elizabeth K

    2016-08-01

    This study explored the contribution of peer-assisted learning (PAL) in the development of evaluative judgement capacity; the ability to understand work quality and apply those standards to appraising performance. The study employed a mixed methods approach, collecting self-reported survey data, observations of, and reflective interviews with, the medical students observed. Participants were in their first year of clinical placements. Data were thematically analysed. Students indicated that PAL contributed to both the comprehension of notions of quality, and the practice of making comparisons between a given performance and the standards. Emergent themes included peer story-telling, direct observation of performance, and peer-based feedback, all of which helped students to define 'work quality'. By participating in PAL, students were required to make comparisons, therefore using the standards of practice and gaining a deeper understanding of them. The data revealed tensions in that peers were seen as less threatening than supervisors with the advantage of increasing learners' appetites for thoughtful 'intellectual risk taking'. Despite this reported advantage of peer engagement, learners still expressed a preference for feedback from senior teachers as more trusted sources of clinical knowledge. While this study suggests that PAL already contributes to the development of evaluative judgement, further steps could be taken to formalise PAL in clinical placements to improve learners' capacity to make accurate judgements on the performance of self and others. Further experimental studies are necessary to confirm the best methods of using PAL to develop evaluative judgement. This may include both students and educators as instigators of PAL in the workplace.

  11. Testing and Evaluating C3I Systems That Employ AI. Volume 4. Published Articles

    DTIC Science & Technology

    1991-01-31

    development performance in an naming, design and actions The system ’ s & Sophisticated system organizational setting) evaluated in a classroom setting by...observing designed as an intelligent the system in use and administering training aid in r questionnaires . Observers videotape and tave classroom setting...notes to assess how both students and instructors use the system in an actual classroom setting. Questionnaires are administered to both students and

  12. Technical Evaluation Motor No. 10 (TEM-10)

    NASA Technical Reports Server (NTRS)

    1993-01-01

    Technical Evaluation Motor No. 10 (TEM-10) was static fired on 27 Apr. 1993 at the Thiokol Corporation full-scale motor static test bay, T-24. This final test report documents the procedures, performance, and results of the static test firing of TEM-10. All observations, discussions, conclusions, and recommendations contained are final. Included is a presentation and discussion of TEM-10 performance, anomalies, and test results in concurrence with the objectives outlined in CTP-0110, Revision D, Space Shuttle Technical Evaluation Motor No. 10 (TEM-10) Static Fire Test Plan.

  13. Does Digital Game Interactivity Always Promote Self-Efficacy?

    PubMed

    Lee, Yu-Hao

    2015-11-01

    Interactive digital games can promote self-efficacy by engaging players in enactive and observational learning. However, interactivity does not always lead to greater self-efficacy. Important constructs in social cognitive theory, such as performance outcome and perceived similarity, are often not accounted for in studies that have tested the effect of digital game interactivity on self-efficacy. This study assessed the effects of interactive digital games compared with passive digital games based on video comparison, a common experimental design used to test the effect of digital game interactivity on self-efficacy. In addition, this study also evaluated player performance and measured perceived similarity to the observed player. Findings suggested that in general, digital game interactivity predicted higher self-efficacy compared with noninteractive passive games. However, in the noninteractive conditions, the effects of performance on self-efficacy were moderated by perceived similarity between the observer and the observed player. When the observed player was perceived to be similar to the observer, the effects of performance on self-efficacy were comparable to the interactive game, but when the observed player was perceived as dissimilar to the observer, observing the dissimilar player failed to increase observer self-efficacy. Implications for interactivity manipulations and game developers are discussed.

  14. Evaluating Tasks for Performance-Based Assessments: Advice for Music Teachers

    ERIC Educational Resources Information Center

    Scott, Sheila

    2004-01-01

    Performance-based assessments allow teachers to systematically observe skills used or demonstrated by students when they create a product, construct a response, or make a presentation (McMillan 2001). These assessments are grounded in performance-based tasks that elicit students' responses in relation to the outcomes of instruction. The criteria…

  15. Motor and cognitive performances of parkinsonian patients in the on and off phases of the disease.

    PubMed Central

    Girotti, F; Carella, F; Grassi, M P; Soliveri, P; Marano, R; Caraceni, T

    1986-01-01

    Twenty-one Parkinsonian patients were tested in on and off phases during chronic levodopa therapy for cognitive function, affective status, and evaluation of motor performance with reaction and movement times. A worsening of mood was observed from the on to the off phase. No variation in cognitive performance was observed from the on to the off phase in spite of evident motor changes. Mood changes during on-off variations may reflect involvement of mesocortical and mesolimbic dopaminergic systems. PMID:3734822

  16. Comprehensive evaluation of lung allograft function in infants after lung and heart-lung transplantation.

    PubMed

    Hayes, Don; Naguib, Aymen; Kirkby, Stephen; Galantowicz, Mark; McConnell, Patrick I; Baker, Peter B; Kopp, Benjamin T; Lloyd, Eric A; Astor, Todd L

    2014-05-01

    Limited data exist on methods to evaluate allograft function in infant recipients of lung and heart-lung transplants. At our institution, we developed a procedural protocol in coordination with pediatric anesthesia where infants were sedated to perform infant pulmonary function testing, computed tomography imaging of the chest, and flexible fiberoptic bronchoscopy with transbronchial biopsies. A retrospective review was performed of children aged younger than 1 year who underwent lung or heart-lung transplantation at our institution to assess the effect of this procedural protocol in the evaluation of infant lung allografts. Since 2005, 5 infants have undergone thoracic transplantation (3 heart-lung, 2 lung). At time of transplant, the mean ± standard deviation age was 7.2 ± 2.8 months (range, 3-11 months). Of 24 procedural sessions performed to evaluate lung allografts, 83% (20 of 24) were considered surveillance where the patients were completely asymptomatic. Of the surveillance procedures, 80% were performed as an outpatient, whereas 20% were done as inpatients during the lung or heart-lung transplant post-operative period before discharge home. Sedation was performed with propofol alone (23 of 24) or in addition to ketamine (1 of 24) infusion; mean sedation time was 141 ± 39 minutes (range, 70-214 minutes). Of the 16 outpatient procedures, patients were discharged on the same day after 14 (88%) and were admitted for observation after 2 (12%), with 1 being due to transportation issues and the other due to fever during the observation period. A comprehensive procedural protocol to evaluate allograft function in infant lung and heart-lung transplant recipients was performed safely as an outpatient. Copyright © 2014 International Society for Heart and Lung Transplantation. Published by Elsevier Inc. All rights reserved.

  17. QRS complex detection based on continuous density hidden Markov models using univariate observations

    NASA Astrophysics Data System (ADS)

    Sotelo, S.; Arenas, W.; Altuve, M.

    2018-04-01

    In the electrocardiogram (ECG), the detection of QRS complexes is a fundamental step in the ECG signal processing chain since it allows the determination of other characteristic waves of the ECG and provides information about heart rate variability. In this work, an automatic QRS complex detector based on continuous density hidden Markov models (HMMs) is proposed. HMMs were trained using univariate observation sequences taken either from QRS complexes or their derivatives. The detection approach is based on the log-likelihood comparison of the observation sequence with a fixed threshold. A sliding window was used to obtain the observation sequence to be evaluated by the model. The threshold was optimized by receiver operating characteristic curves. Sensitivity (Sen), specificity (Spc) and F1 score were used to evaluate the detection performance. The approach was validated using ECG recordings from the MIT-BIH Arrhythmia database. A 6-fold cross-validation shows that the best detection performance was achieved with 2-state HMMs trained with QRS complex sequences (Sen = 0.668, Spc = 0.360 and F1 = 0.309). We concluded that these univariate sequences provide enough information to characterize the QRS complex dynamics from HMMs. Future work is directed to the use of multivariate observations to increase the detection performance.
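
    For readers unfamiliar with the three figures of merit named in this abstract, the sketch below computes them from confusion-matrix counts using their standard definitions. The function name and the example counts are hypothetical and are not taken from the study.

```python
def detection_metrics(tp, fp, tn, fn):
    """Return (sensitivity, specificity, F1) from confusion-matrix counts.

    tp/fn: true QRS windows detected / missed.
    tn/fp: non-QRS windows correctly rejected / wrongly flagged.
    """
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative window")
    sen = tp / (tp + fn)               # fraction of true events detected
    spc = tn / (tn + fp)               # fraction of non-events rejected
    denom = 2 * tp + fp + fn
    # F1 is the harmonic mean of precision and sensitivity;
    # by convention it is set to 0 here when there are no true positives.
    f1 = 2 * tp / denom if tp > 0 else 0.0
    return sen, spc, f1


# Hypothetical counts, chosen only to make the arithmetic easy to follow.
sen, spc, f1 = detection_metrics(tp=8, fp=2, tn=6, fn=4)
```

    With these made-up counts the sensitivity is 8/12, the specificity is 6/8, and F1 is 16/22.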

  18. Extensions to the visual predictive check to facilitate model performance evaluation.

    PubMed

    Post, Teun M; Freijer, Jan I; Ploeger, Bart A; Danhof, Meindert

    2008-04-01

    The Visual Predictive Check (VPC) is a valuable and supportive instrument for evaluating model performance. However, in its most commonly applied form, the method largely depends on a subjective comparison of the distribution of the simulated data with the observed data, without explicitly quantifying and relating the information in both. In recent adaptations to the VPC this drawback is taken into consideration by presenting the observed and predicted data as percentiles. In addition, in some of these adaptations the uncertainty in the predictions is represented visually. However, it is not assessed whether the expected random distribution of the observations around the predicted median trend is realised in relation to the number of observations. Moreover, the influence of and the information residing in missing data at each time point is not taken into consideration. Therefore, in this investigation the VPC is extended with two methods to support a less subjective and thereby more adequate evaluation of model performance: (i) the Quantified Visual Predictive Check (QVPC) and (ii) the Bootstrap Visual Predictive Check (BVPC). The QVPC presents the distribution of the observations as a percentage, and thus regardless of the density of the data, above and below the predicted median at each time point, while also visualising the percentage of unavailable data. The BVPC weighs the predicted median against the 5th, 50th and 95th percentiles resulting from a bootstrap of the observed data median at each time point, while accounting for the number and the theoretical position of unavailable data. The proposed extensions to the VPC are illustrated by a pharmacokinetic simulation example and applied to a pharmacodynamic disease progression example.
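
    The per-time-point summary that the abstract attributes to the QVPC can be sketched roughly as follows. This is only an illustration under stated assumptions: missing observations are encoded as None, percentages are taken over the expected number of observations (available plus missing), and values equal to the predicted median are counted as "below". None of these choices is specified in the abstract, and the function name is hypothetical.

```python
def qvpc_point(observations, predicted_median):
    """Percentages of observations above/below a predicted median at one time point.

    observations: list of floats, with None marking an unavailable value.
    Assumptions: denominator is the expected count (including missing values);
    ties with the median are counted as "below".
    """
    expected = len(observations)
    if expected == 0:
        raise ValueError("no observations expected at this time point")
    available = [x for x in observations if x is not None]
    above = sum(1 for x in available if x > predicted_median)
    below = len(available) - above
    missing = expected - len(available)
    return {
        "above_pct": 100.0 * above / expected,
        "below_pct": 100.0 * below / expected,
        "missing_pct": 100.0 * missing / expected,
    }


# Hypothetical time point: four expected observations, one unavailable.
summary = qvpc_point([1.0, 3.0, None, 5.0], predicted_median=2.0)
```

    For an adequate model one would expect the available observations to split roughly evenly around the predicted median; a persistent imbalance across time points, or a growing missing share, is the kind of pattern this summary is meant to expose.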

  19. Temporal and spatial evaluation of satellite rainfall estimates over different regions in Latin-America.

    NASA Astrophysics Data System (ADS)

    Villanueva, O. M. B.; Zambrano-Bigiarini, M.; Ribbe, L.; Nauditt, A.; Rebolledo Coy, M. A.; Xuan Thinh, N.; Bartz-Beielstein, T.

    2017-12-01

    In developing countries an accurate representation of the spatio-temporal variability of catchment rainfall inputs is currently severely limited. This issue can be overcame with the use of satellite rainfall estimates (SREs), which provide rainfall data in such environments for a wide range of hydrological applications, such as extreme events analysis and water accounting. Three different basins in Latin-America (Imperial Basin in Chile, Paraiba do Sul in Brazil and Magdalena in Colombia) were evaluated with a point-to-pixel analysis to determine the best SRE for further hydrological modelling. For this purpose, daily values of six state-of-the-art SRE products (TMPA 3B42v7, TMPA 3B42RT, CHIRPSv2, CMORPH, PERSIANN-CDR and MSWEPv1.2) were evaluated at annual and seasonal scales. The modified Kling-Gupta Efficiency (KGE') was used to evaluate the linear correlation, variability and bias relationship between satellite data and observations. Also, two categorical indices (POD and fBias) were used to assess product performance for different rainfall intensities. The results showed that for the southern Imperial River Basin PERSIANN-CDR presented the best performance at the annual scale, while TRMM 3B42v7 and PERSIANN-CDR had the best performance in a seasonal basis. In the Brazilian Paraiba do Sul, MSWEP performed the best in annual and seasonal basis. For the Magdalena Basin, CHIRPS and TRMM 3B42RT presented the highest performance in the seasonal analysis, while CHIRPS showed the best annual performance. When the bias term of the modified KGE' was removed from KGE', it was observed that the best evaluated SRE was not necessarily the one that have the highest linear correlation and variability relation with the observed data. In the categorical indices, all SREs showed a good detection in no-rain events, but low skill classifying days with precipitation. Nevertheless, all SREs performed relatively well identifying moderate rain events in all regions. 
We finally conclude that there is no single best-performing SRE overall; a specific assessment is required to determine which SRE is the most suitable for each region. However, SREs show promising potential for hydrological studies, and they must be taken into account in order to derive better rainfall estimates.
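
    As an illustration of the metrics named in this abstract, the following minimal Python sketch computes the modified Kling-Gupta Efficiency in its commonly published form, KGE' = 1 - sqrt((r - 1)^2 + (beta - 1)^2 + (gamma - 1)^2), where beta is the ratio of means and gamma the ratio of coefficients of variation, together with the two categorical indices. The function names, the 1 mm/day rain threshold and the sample values are assumptions made for illustration only; they are not taken from the study.

```python
import numpy as np

def modified_kge(sim, obs):
    """Return (KGE', r, beta, gamma) for simulated vs. observed series."""
    sim = np.asarray(sim, dtype=float)
    obs = np.asarray(obs, dtype=float)
    r = np.corrcoef(sim, obs)[0, 1]                # linear correlation
    beta = sim.mean() / obs.mean()                 # bias ratio
    gamma = (sim.std() / sim.mean()) / (obs.std() / obs.mean())  # variability ratio (CV)
    kge = 1.0 - np.sqrt((r - 1) ** 2 + (beta - 1) ** 2 + (gamma - 1) ** 2)
    return kge, r, beta, gamma

def pod_and_fbias(sim, obs, threshold=1.0):
    """Probability of detection and frequency bias for events >= threshold.

    The threshold value is a hypothetical choice; both indices are undefined
    when no observed event exceeds it.
    """
    sim_event = np.asarray(sim) >= threshold
    obs_event = np.asarray(obs) >= threshold
    hits = np.sum(sim_event & obs_event)
    misses = np.sum(~sim_event & obs_event)
    false_alarms = np.sum(sim_event & ~obs_event)
    pod = hits / (hits + misses)
    fbias = (hits + false_alarms) / (hits + misses)
    return pod, fbias

# Made-up daily rainfall values (mm/day) at one gauge and its pixel.
gauge = np.array([0.0, 2.0, 5.0, 0.0, 3.0])
satellite = np.array([0.0, 3.0, 0.0, 2.0, 4.0])
print(modified_kge(satellite, gauge))
print(pod_and_fbias(satellite, gauge))
```

    Returning r, beta and gamma alongside KGE' makes it straightforward to inspect the score with or without the bias term, which is the comparison the abstract describes.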

  20. Non-technical skills evaluation in the critical care air ambulance environment: introduction of an adapted rating instrument--an observational study.

    PubMed

    Myers, Julia A; Powell, David M C; Psirides, Alex; Hathaway, Karyn; Aldington, Sarah; Haney, Michael F

    2016-03-08

    In the isolated and dynamic health-care setting of critical care air ambulance transport, the quality of clinical care is strongly influenced by non-technical skills such as anticipating, recognising and understanding, decision making, and teamwork. However there are no published reports identifying or applying a non-technical skills framework specific to an intensive care air ambulance setting. The objective of this study was to adapt and evaluate a non-technical skills rating framework for the air ambulance clinical environment. In the first phase of the project the anaesthetists' non-technical skills (ANTS) framework was adapted to the air ambulance setting, using data collected directly from clinician groups, published literature, and field observation. In the second phase experienced and inexperienced inter-hospital transport clinicians completed a simulated critical care air transport scenario, and their non-technical skills performance was independently rated by two blinded assessors. Observed and self-rated general clinical performance ratings were also collected. Rank-based statistical tests were used to examine differences in the performance of experienced and inexperienced clinicians, and relationships between different assessment approaches and assessors. The framework developed during phase one was referred to as an aeromedical non-technical skills framework, or AeroNOTS. During phase two 16 physicians from speciality training programmes in intensive care, emergency medicine and anaesthesia took part in the clinical simulation study. Clinicians with inter-hospital transport experience performed more highly than those without experience, according to both AeroNOTS non-technical skills ratings (p = 0.001) and general performance ratings (p = 0.003). 
Self-ratings did not distinguish experienced from inexperienced transport clinicians (p = 0.32) and were not strongly associated with either observed general performance (r(s) = 0.4, p = 0.11) or observed non-technical skills performance (r(s) = 0.4, p = 0.1). This study describes a framework which characterises the non-technical skills required by critical care air ambulance clinicians, and distinguishes higher and lower levels of performance. The AeroNOTS framework could be used to facilitate education and training in non-technical skills for air ambulance clinicians, and further evaluation of this rating system is merited.

  1. Measuring teacher effectiveness in physical education.

    PubMed

    Rink, Judith E

    2013-12-01

    This article summarizes the research base on teacher effectiveness in physical education from a historical perspective and explores the implications of the recent emphasis on student performance and teacher observation systems to evaluate teachers for physical education. The problems and the potential positive effects of using student performance scores as well as establishing a comprehensive evaluation program are explored with supportive evidence that some level of accountability is necessary in our field to make significant change.

  2. Correlation analysis between 2D and quasi-3D gamma evaluations for both intensity-modulated radiation therapy and volumetric modulated arc therapy

    PubMed Central

    Kim, Jung-in; Choi, Chang Heon; Wu, Hong-Gyun; Kim, Jin Ho; Kim, Kyubo; Park, Jong Min

    2017-01-01

    The aim of this work was to investigate correlations between 2D and quasi-3D gamma passing rates. A total of 20 patients (10 prostate cases and 10 head and neck cases, H&N) were retrospectively selected. For each patient, both intensity-modulated radiation therapy (IMRT) and volumetric modulated arc therapy (VMAT) plans were generated. For each plan, 2D gamma evaluation with radiochromic films and quasi-3D gamma evaluation with fluence measurements were performed with both 2%/2 mm and 3%/3 mm criteria. Gamma passing rates were grouped together according to delivery techniques and treatment sites. Statistical analyses were performed to examine the correlation between 2D and quasi-3D gamma evaluations. A statistically significant difference between delivery techniques was observed only in the quasi-3D gamma passing rates with 2%/2 mm. Statistically significant differences were observed between treatment sites in the 2D gamma passing rates (differences of less than 8%). No statistically significant correlations were observed between 2D and quasi-3D gamma passing rates except for the VMAT group and the group including both IMRT and VMAT with 3%/3 mm (r = 0.564 with p = 0.012 for the VMAT group and r = 0.372 with p = 0.020 for the group including both IMRT and VMAT); however, those correlations were not strong. No strong correlations were observed between 2D and quasi-3D gamma evaluations. PMID:27690300
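
    To make the "3%/3 mm" style of criterion concrete, the sketch below computes a gamma passing rate for a one-dimensional dose profile using the widely used textbook definition of the gamma index with global dose normalisation. It is a simplified illustration, not the 2D film or quasi-3D fluence procedure used in the study; the function name, the 1D geometry and the flat test profiles are assumptions.

```python
import numpy as np

def gamma_passing_rate(x_mm, ref_dose, eval_dose, dose_tol=0.03, dta_mm=3.0):
    """Return (passing rate in %, gamma per reference point) for 1D profiles.

    dose_tol is a fraction of the maximum reference dose (global normalisation)
    and dta_mm is the distance-to-agreement criterion in millimetres.
    """
    x = np.asarray(x_mm, dtype=float)
    ref = np.asarray(ref_dose, dtype=float)
    ev = np.asarray(eval_dose, dtype=float)
    dose_norm = dose_tol * ref.max()
    # Rows index reference points, columns index evaluated points.
    dist = (x[None, :] - x[:, None]) / dta_mm
    diff = (ev[None, :] - ref[:, None]) / dose_norm
    gamma = np.sqrt(dist ** 2 + diff ** 2).min(axis=1)
    rate = float((gamma <= 1.0).mean() * 100.0)
    return rate, gamma

# Hypothetical flat profiles sampled every 1 mm.
x = np.arange(20.0)
reference = np.ones(20)
print(gamma_passing_rate(x, reference, 1.02 * reference)[0])  # within 3% everywhere
print(gamma_passing_rate(x, reference, 1.10 * reference)[0])  # 10% off everywhere
```

    A point passes when its gamma value is at most 1, so the passing rate is simply the percentage of reference points that find an evaluated point close enough in combined dose and distance.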

  3. Solubility and bacterial sealing ability of MTA and root-end filling materials.

    PubMed

    Espir, Camila Galletti; Guerreiro-Tanomaru, Juliane Maria; Spin-Neto, Rubens; Chávez-Andrade, Gisselle Moraima; Berbert, Fabio Luiz Camargo Villela; Tanomaru-Filho, Mario

    2016-04-01

    Objective: To evaluate solubility and sealing ability of Mineral Trioxide Aggregate (MTA) and root-end filling materials. Material and Methods: The materials evaluated were: MTA, Calcium Silicate Cement with zirconium oxide (CSC/ZrO2), and zinc oxide/eugenol (ZOE). Solubility test was performed according to ANSI/ADA. The difference between initial and final mass of the materials was analyzed after immersion in distilled water for 7 and 30 days. Retrograde cavities in human teeth with single straight root canal were performed by using ultrasonic tip CVD 9.5107-8. The cavities were filled with the evaluated materials to evaluate sealing ability using the bacterial leakage test with Enterococcus faecalis. Bacterial leakage was evaluated every 24 hours for six weeks by observing the turbidity of Brain Heart Infusion (BHI) medium in contact with the root apex. Data were submitted to ANOVA followed by Tukey tests (solubility), and Kruskal-Wallis and Dunn tests (sealing ability) at a 5% significance level. Results: For the 7-day period, ZOE presented the highest solubility when compared with the other groups (p<0.05). For the 30-day period, no difference was observed among the materials. Lower bacterial leakage was observed for MTA and CSC/ZrO2, and both presented better results than ZOE (p<0.05). Conclusion: MTA and CSC/ZrO2 presented better bacterial sealing capacity, which may be related to the lower initial solubility observed for these materials in relation to ZOE.

  4. Investigating Physicians' Views on Soft Signals in the Context of Their Peers' Performance.

    PubMed

    van den Goor, Myra; Silkens, Milou; Heineman, Maas Jan; Lombarts, Kiki

    2017-11-14

    Physicians are responsible for delivering high quality of care. In cases of underperformance, hindsight knowledge indicates forewarning being potentially available in terms of concerns, signs, or signals. It is not known how the physicians involved perceive such signals. To openly explore how physicians perceive soft signals and react to them. In-depth interviews with 12 hospital-based physicians from various specialties and institutions following the interpretative phenomenological analysis approach. Physicians perceive soft signals as an observable deviation from a colleague's normal behavior, appearance, or communication. Once observed, they evaluate the signal by reflecting on it personally and/or by consulting others, resulting in either an active (i.e., speaking up) or passive (i.e., avoidance) reaction. Observer sensitivity, closeness to the peer, and cohesion of the physician group influence this observation-evaluation-reaction process. Physicians perceive soft signals as indicators of well-being and collegiality, not as concerns about performance or patient safety. They feel it is their responsibility to be sensitive to and deal with expressed signals. Creating a psychologically safe culture could foster such an environment. Because a threat to physicians' well-being may indirectly affect their professional performance, soft signals require serious follow-up.

  5. Evaluation of an institutional project to improve venous thromboembolism prevention.

    PubMed

    Minami, Christina A; Yang, Anthony D; Ju, Mila; Culver, Eckford; Seifert, Kathryn; Kreutzer, Lindsey; Halverson, Terri; O'Leary, Kevin J; Bilimoria, Karl Y

    2016-12-01

    Northwestern Memorial Hospital (NMH) was historically a poor performer on the venous thromboembolism (VTE) outcome measure. As this measure has been shown to be flawed by surveillance bias, NMH embraced process-of-care measures to ensure appropriate VTE prophylaxis to assess healthcare-associated VTE prevention efforts. To evaluate the impact of an institution-wide project aimed at improving hospital performance on VTE prophylaxis measures. A retrospective observational study. NMH, an 885-bed academic medical center in Chicago, Illinois. Patients: Inpatients admitted to NMH from January 1, 2013 to May 1, 2013 and from October 1, 2014 to April 1, 2015 were eligible for evaluation. Using the define-measure-analyze-improve-control (DMAIC) process-improvement methodology, a multidisciplinary team implemented and iteratively improved 15 data-driven interventions in 4 broad areas: (1) electronic medical record (EMR) alerts, (2) education initiatives, (3) new EMR order sets, and (4) other EMR changes. The Joint Commission's 6 core measures and the Surgical Care Improvement Project (SCIP) SCIP-VTE-2 measure. Based on 3103 observations (1679 from January 1, 2013 to May 1, 2013, and 1424 from October 1, 2014 to April 1, 2015), performance on the core measures improved. Performance on measure 1 (chemoprophylaxis) improved from 82.5% to 90.2% on medicine services, and from 94.4% to 97.6% on surgical services. The largest improvements were seen in measure 4 (platelet monitoring), with a performance increase from 76.7% adherence to 100%, and measure 5 (warfarin discharge instructions), with a performance increase from 27.4% to 88.8%. A systematic hospital-wide DMAIC project improved VTE prophylaxis measure performance. Sustained performance has been observed, and novel control mechanisms for continued performance surveillance have been embedded in the hospital system. Journal of Hospital Medicine 2016;11:S29-S37. © 2016 Society of Hospital Medicine.

  6. Classification of small lesions on dynamic breast MRI: Integrating dimension reduction and out-of-sample extension into CADx methodology

    PubMed Central

    Nagarajan, Mahesh B.; Huber, Markus B.; Schlossbauer, Thomas; Leinsinger, Gerda; Krol, Andrzej; Wismüller, Axel

    2014-01-01

    Objective: While dimension reduction has been previously explored in computer aided diagnosis (CADx) as an alternative to feature selection, previous implementations of its integration into CADx do not ensure strict separation between training and test data required for the machine learning task. This compromises the integrity of the independent test set, which serves as the basis for evaluating classifier performance. Methods and Materials: We propose, implement and evaluate an improved CADx methodology where strict separation is maintained. This is achieved by subjecting the training data alone to dimension reduction; the test data is subsequently processed with out-of-sample extension methods. Our approach is demonstrated in the research context of classifying small diagnostically challenging lesions annotated on dynamic breast magnetic resonance imaging (MRI) studies. The lesions were dynamically characterized through topological feature vectors derived from Minkowski functionals. These feature vectors were then subject to dimension reduction with different linear and non-linear algorithms applied in conjunction with out-of-sample extension techniques. This was followed by classification through supervised learning with support vector regression. Area under the receiver-operating characteristic curve (AUC) was evaluated as the metric of classifier performance. Results: Of the feature vectors investigated, the best performance was observed with Minkowski functional 'perimeter' while comparable performance was observed with 'area'. Of the dimension reduction algorithms tested with 'perimeter', the best performance was observed with Sammon's mapping (0.84 ± 0.10) while comparable performance was achieved with exploratory observation machine (0.82 ± 0.09) and principal component analysis (0.80 ± 0.10). 
Conclusions: The results reported in this study with the proposed CADx methodology present a significant improvement over previous results reported with such small lesions on dynamic breast MRI. In particular, non-linear algorithms for dimension reduction exhibited better classification performance than linear approaches, when integrated into our CADx methodology. We also note that while dimension reduction techniques may not necessarily provide an improvement in classification performance over feature selection, they do allow for a higher degree of feature compaction. PMID:24355697
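
    The strict train/test separation described in this abstract can be sketched for the simplest case, principal component analysis, where the out-of-sample extension is just a projection of the test vectors onto components learned from the training data alone. This is a minimal illustration under assumed names and random stand-in data; it does not reproduce the authors' Minkowski-functional features, Sammon's mapping, the exploratory observation machine, or the support vector regression step.

```python
import numpy as np

def fit_pca(train, n_components):
    """Learn the mean and principal axes from the training data only."""
    mean = train.mean(axis=0)
    _, _, vt = np.linalg.svd(train - mean, full_matrices=False)
    return mean, vt[:n_components]

def project(data, mean, components):
    """Out-of-sample extension for PCA: reuse the training mean and axes."""
    return (data - mean) @ components.T

# Random stand-in feature vectors (60 lesions x 8 features); purely hypothetical.
rng = np.random.default_rng(0)
features = rng.normal(size=(60, 8))
train, test = features[:40], features[40:]

mean, components = fit_pca(train, n_components=3)   # test data never seen here
train_low = project(train, mean, components)
test_low = project(test, mean, components)
print(train_low.shape, test_low.shape)
```

    Non-linear methods such as Sammon's mapping have no closed-form projection, which is why the abstract pairs them with dedicated out-of-sample extension techniques.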

  7. Making Performance-Based Evaluation Work for You: A Recipe for Personal Learning

    ERIC Educational Resources Information Center

    Church, Audrey

    2012-01-01

    Teacher observation and teacher evaluation are a given in American schools, and Charlotte Danielson's work in teacher effectiveness and professional practice has guided evaluation efforts for many years. There is a new, big kid in town, however. As Race to the Top requires documentation of student growth, and research shows that teacher…

  8. Interrater Reliability among Elementary Principals Using the North Carolina Teacher Evaluation Process

    ERIC Educational Resources Information Center

    Mazurek, Sharon Ann

    2012-01-01

    Teacher observation remains one of the primary data collection methods for analyzing teaching behaviors. States use various evaluation instruments and current trends across the United States show that more states are working to tie teacher evaluation to student performance. The purpose of this study was to determine to what extent there was…

  9. Observing System Forecast Experiments at the DAO

    NASA Technical Reports Server (NTRS)

    Atlas, Robert

    2001-01-01

    Since the advent of meteorological satellites in the 1960's, numerous experiments have been conducted in order to evaluate the impact of these and other data on atmospheric analysis and prediction. Such studies have included both OSE's and OSSE's. The OSE's were conducted to evaluate the impact of specific observations or classes of observations on analyses and forecasts. Such experiments have been performed for selected types of conventional data and for various satellite data sets as they became available. (See for example the 1989 ECMWF/EUMETSAT workshop proceedings on "The use of satellite data in operational numerical weather prediction" and the references contained therein.) The OSSE's were conducted to evaluate the potential for future observing systems to improve Numerical Weather Prediction (NWP) and to plan for the Global Weather Experiment and more recently for EVANS (Atlas et al., 1985a; Arnold and Day, 1986; Hoffman et al., 1990). In addition, OSSE's have been run to evaluate trade-offs in the design of observing systems and observing networks (Atlas and Emmitt, 1991; Rohaly and Krishnamurti, 1993), and to test new methodology for data assimilation (Atlas and Bloom, 1989).

  10. Solid Polymer Electrolyte (SPE) fuel cell technology program

    NASA Technical Reports Server (NTRS)

    1978-01-01

    Many previously demonstrated improved fuel cell features were consolidated to (1) obtain a better understanding of the observed characteristics of the operating laboratory-sized cells; (2) evaluate appropriate improved fuel cell features in 0.7 sq ft cell hardware; and (3) study the resultant fuel cell capability and determine its impact on various potential fuel cell space missions. The observed performance characteristics of the fuel cell at high temperatures and high current densities were matched with a theoretical model based on the change in Gibbs free energy voltage with respect to temperature and internal resistance change with current density. Excellent agreement between the observed and model performance was obtained. The observed performance decay with operational time on cells with very low noble metal loadings (0.05 mg/sq cm) was shown to be related to loss in surface area. Cells with the baseline amount of noble catalyst electrode loading demonstrated over 40,000 hours of stable performance.

  11. The Impact of Teacher Observations with Coordinated Professional Development on Student Performance: A 27-State Program Evaluation

    ERIC Educational Resources Information Center

    Shaha, Steven H.; Glassett, Kelly F.; Copas, Aimee

    2015-01-01

    The impact of teacher observations in alignment with professional development (PD) on teacher efficacy was quantified for 292 schools in 110 districts within 27 U.S. States. Teacher observations conducted by school leaders or designated internal coaches were coordinated with PD offerings aligned with intended teacher improvements. The PD involved…

  12. Does Class Size in First Grade Relate to Children's Academic and Social Performance or Observed Classroom Processes?

    ERIC Educational Resources Information Center

    Allhusen, Virginia; Belsky, Jay; Booth-LaForce, Cathryn L.; Bradley, Robert; Brownwell, Celia A; Burchinal, Margaret; Campbell, Susan B.; Clarke-Stewart, K. Alison; Cox, Martha; Friedman, Sarah L.; Hirsh-Pasek, Kathryn; Houts, Renate M.; Huston, Aletha; Jaeger, Elizabeth; Johnson, Deborah J.; Kelly, Jean F.; Knoke, Bonnie; Marshall, Nancy; McCartney, Kathleen; Morrison, Frederick J.; O'Brien, Marion; Tresch Owen, Margaret; Payne, Chris; Phillips, Deborah; Pianta, Robert; Randolph, Suzanne M.; Robeson, Wendy W.; Spieker, Susan; Lowe Vandell, Deborah; Weinraub, Marsha

    2004-01-01

    This study evaluated the extent to which first-grade class size predicted child outcomes and observed classroom processes for 651 children (in separate classrooms). Analyses examined observed child-adult ratios and teacher-reported class sizes. Smaller classrooms showed higher quality instructional and emotional support, although children were…

  13. Visual performance-based image enhancement methodology: an investigation of contrast enhancement algorithms

    NASA Astrophysics Data System (ADS)

    Neriani, Kelly E.; Herbranson, Travis J.; Reis, George A.; Pinkus, Alan R.; Goodyear, Charles D.

    2006-05-01

    While vast numbers of image enhancing algorithms have already been developed, the majority of these algorithms have not been assessed in terms of their visual performance-enhancing effects using militarily relevant scenarios. The goal of this research was to apply a visual performance-based assessment methodology to evaluate six algorithms that were specifically designed to enhance the contrast of digital images. The image enhancing algorithms used in this study included three different histogram equalization algorithms, the Autolevels function, the Recursive Rational Filter technique described in Marsi, Ramponi, and Carrato1 and the multiscale Retinex algorithm described in Rahman, Jobson and Woodell2. The methodology used in the assessment has been developed to acquire objective human visual performance data as a means of evaluating the contrast enhancement algorithms. Objective performance metrics, response time and error rate, were used to compare algorithm-enhanced images versus two baseline conditions, original non-enhanced images and contrast-degraded images. Observers completed a visual search task using a spatial forced-choice paradigm. Observers searched images for a target (a military vehicle) hidden among foliage and then indicated in which quadrant of the screen the target was located. Response time and percent correct were measured for each observer. Results of the study and future directions are discussed.
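
    For readers unfamiliar with the simplest of the techniques named above, the sketch below shows textbook global histogram equalization for an 8-bit grayscale image. The abstract does not say which three histogram equalization variants were tested, so this is only an assumed, generic example; the function name and the tiny test image are hypothetical.

```python
import numpy as np

def equalize_histogram(image):
    """Global histogram equalization of an 8-bit grayscale image."""
    img = np.asarray(image, dtype=np.uint8)
    hist = np.bincount(img.ravel(), minlength=256)
    cdf = hist.cumsum()
    cdf_min = cdf[cdf > 0][0]          # cumulative count at the darkest level present
    denom = img.size - cdf_min
    if denom == 0:                     # constant image: nothing to stretch
        return img.copy()
    # Map each gray level through the normalised cumulative distribution.
    lut = np.round((cdf - cdf_min) / denom * 255).clip(0, 255).astype(np.uint8)
    return lut[img]

# Hypothetical low-contrast 2 x 2 image.
low_contrast = np.array([[50, 50], [100, 150]], dtype=np.uint8)
print(equalize_histogram(low_contrast))
```

    The lookup table spreads the occupied gray levels across the full 0 to 255 range, which is the contrast-stretching effect such algorithms aim for.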

  14. Video-Based Method of Quantifying Performance and Instrument Motion During Simulated Phonosurgery

    PubMed Central

    Conroy, Ellen; Surender, Ketan; Geng, Zhixian; Chen, Ting; Dailey, Seth; Jiang, Jack

    2015-01-01

    Objectives/Hypothesis: To investigate the use of the Video-Based Phonomicrosurgery Instrument Tracking System to collect instrument position data during simulated phonomicrosurgery and calculate motion metrics using these data. We used this system to determine if novice subject motion metrics improved over 1 week of training. Study Design: Prospective cohort study. Methods: Ten subjects performed simulated surgical tasks once per day for 5 days. Instrument position data were collected and used to compute motion metrics (path length, depth perception, and motion smoothness). Data were analyzed to determine if motion metrics improved with practice time. Task outcome was also determined each day, and relationships between task outcome and motion metrics were used to evaluate the validity of motion metrics as indicators of surgical performance. Results: Significant decreases over time were observed for path length (P < .001), depth perception (P < .001), and task outcome (P < .001). No significant change was observed for motion smoothness. Significant relationships were observed between task outcome and path length (P < .001), depth perception (P < .001), and motion smoothness (P < .001). Conclusions: Our system can estimate instrument trajectory and provide quantitative descriptions of surgical performance. It may be useful for evaluating phonomicrosurgery performance. Path length and depth perception may be particularly useful indicators. PMID:24737286
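
    The three motion metrics named in this abstract can be sketched from a sequence of tracked instrument positions. The definitions below are common ones from the surgical motion-analysis literature and are assumptions here, since the abstract does not give formulas: path length as total distance travelled, depth perception as total travel along the depth axis, and motion smoothness as root-mean-square jerk. The function name, the choice of the third coordinate as depth, and the sample trajectory are hypothetical.

```python
import numpy as np

def motion_metrics(positions, dt):
    """Return (path_length, depth_perception, rms_jerk) for an (n, 3) trajectory.

    positions: x, y, z samples, with z taken as the depth axis (an assumption).
    dt: sampling interval in seconds; at least 4 samples are needed for jerk.
    """
    p = np.asarray(positions, dtype=float)
    steps = np.diff(p, axis=0)                        # displacement per frame
    path_length = np.linalg.norm(steps, axis=1).sum()
    depth_perception = np.abs(steps[:, 2]).sum()      # travel along depth only
    jerk = np.diff(p, n=3, axis=0) / dt ** 3          # third finite difference
    rms_jerk = np.sqrt((np.linalg.norm(jerk, axis=1) ** 2).mean())
    return path_length, depth_perception, rms_jerk

# Hypothetical straight, constant-velocity movement sampled at 30 frames/s.
trajectory = [[0, 0, 0], [3, 0, 4], [6, 0, 8], [9, 0, 12], [12, 0, 16]]
print(motion_metrics(trajectory, dt=1 / 30))
```

    Under these definitions a shorter, more direct and less jerky movement yields lower values for all three metrics, consistent with the abstract's reading of decreases over time as improvement.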

  15. Psychobiological responses to critically evaluated multitasking.

    PubMed

    Wetherell, Mark A; Craw, Olivia; Smith, Kenny; Smith, Michael A

    2017-12-01

    In order to understand psychobiological responses to stress it is necessary to observe how people react to controlled stressors. A range of stressors exist for this purpose; however, laboratory stressors that are representative of real life situations provide more ecologically valid opportunities for assessing stress responding. The current study assessed psychobiological responses to an ecologically valid laboratory stressor involving multitasking and critical evaluation. The stressor elicited significant increases in psychological and cardiovascular stress reactivity; however, no cortisol reactivity was observed. Other socially evaluative laboratory stressors that lead to cortisol reactivity typically require a participant to perform tasks that involve verbal responses, whilst standing in front of evaluative others. The current protocol contained critical evaluation of cognitive performance; however, this was delivered from behind a seated participant. The salience of social evaluation may therefore be related to the response format of the task and the method of evaluation. That is, the current protocol did not involve the additional vulnerability associated with in person, face-to-face contact, and verbal delivery. Critical evaluation of multitasking provides an ecologically valid technique for inducing laboratory stress and provides an alternative tool for assessing psychological and cardiovascular reactivity. Future studies could additionally use this paradigm to investigate those components of social evaluation necessary for eliciting a cortisol response.

  16. Misstaging of ovarian cancer.

    PubMed

    McGowan, L; Lesher, L P; Norris, H J; Barnett, M

    1985-04-01

    The thoroughness of intraoperative evaluation of the extent of disease in 291 women with primary ovarian cancer was investigated. Notable differences among physician specialties but not types of hospitals where initial surgery was performed were observed. A review of medical record documentation revealed that 97% of the cases operated on by gynecologic oncologists had complete staging evaluations performed intraoperatively, but only 52% and 35% of cases operated on by obstetricians/gynecologists and general surgeons, respectively, were adequately evaluated. Roughly one-half of the cases diagnosed in community hospitals and in hospitals with teaching affiliations were found to be completely studied, and 66% of those operated on in university hospitals received complete intraoperative evaluations.

  17. Evaluation of a metering, mixing, and dispensing system for mixing polysulfide adhesive

    NASA Technical Reports Server (NTRS)

    Evans, Kurt B.

    1989-01-01

    Tests were performed to evaluate whether a metered mixing system can mix PR-1221 polysulfide adhesive as well as or better than batch-mixed adhesive; also, to evaluate the quality of meter-mixed PR-1860 and PS-875 polysulfide adhesives. These adhesives are candidate replacements for PR-1221 which will not be manufactured in the future. The following material properties were evaluated: peel strength, specific gravity and adhesive components of mixed adhesives, Shore A hardness, tensile adhesion strength, and flow rate. Finally, a visual test called the butterfly test was performed to observe for bubbles and unmixed adhesive. The results of these tests are reported and discussed.

  18. Campaign datasets for Two-Column Aerosol Project (TCAP)

    DOE Data Explorer

    Berg, Larry; Mei, Fan; Cairns, Brian; Chand, Duli; Comstock, Jennifer; Cziczo, Daniel; Hostetler, Chris; Hubbe, John; Long, Chuck; Michalsky, Joseph; Pekour, Mikhail; Russell, Phil; Scott, Herman; Sedlacek, Arthur; Shilling, John; Springston, Stephen; Tomlinson, Jason; Watson, Thomas; Zelenyuk-Imre, Alla

    2013-12-30

    This campaign was designed to provide a detailed set of observations with which to 1) perform radiative and cloud condensation nuclei (CCN) closure studies, 2) evaluate a new retrieval algorithm for aerosol optical depth (AOD) in the presence of clouds using passive remote sensing, 3) extend a previously developed technique to investigate aerosol indirect effects, and 4) evaluate the performance of a detailed regional-scale model and a more parameterized global-scale model in simulating particle activation and AOD associated with the aging of anthropogenic aerosols. To meet these science objectives, the ARM Mobile Facility (AMF) and the Mobile Aerosol Observing System (MAOS) were deployed on Cape Cod, Massachusetts for a 12-month period starting in the summer of 2012 in order to quantify aerosol properties, radiation and cloud characteristics at a location subject to both clear and cloudy conditions, and clean and polluted conditions. These observations were supplemented by two aircraft intensive observation periods (IOPs), one in the summer and a second in the winter. Each IOP required two aircraft.

  19. A general enhancement of autonomic and cortisol responses during social evaluative threat.

    PubMed

    Bosch, Jos A; de Geus, Eco J C; Carroll, Douglas; Goedhart, Annebet D; Anane, Leila A; van Zanten, Jet J Veldhuizen; Helmerhorst, Eva J; Edwards, Kate M

    2009-10-01

    To examine the Social Self Preservation Theory, which predicts that stressors involving social evaluative threat (SET) characteristically activate the hypothalamic-pituitary-adrenal (HPA) axis. The idea that distinct psychosocial factors may underlie specific patterns of neuroendocrine stress responses has been a topic of recurrent debate. Sixty-one healthy university students (n = 31 females) performed a challenging speech task in one of three conditions that aimed to impose increasing levels of SET: performing the task alone (no social evaluation), with one evaluating observer, or with four evaluating observers. Indices of sympathetic (preejection period) and parasympathetic (heart rate variability) cardiac drive were obtained by impedance- and electrocardiography. Salivary cortisol was used to index HPA activity. Questionnaires assessed affective responses. Affective responses (shame/embarrassment, anxiety, negative affect, and self-esteem), cortisol, heart rate, sympathetic and parasympathetic activation all differentiated evaluative from nonevaluative task conditions (p < .001). The largest effect sizes were observed for cardiac autonomic responses. Physiological reactivity increased in parallel with increasing audience size (p < .001). An increase in cortisol was predicted by sympathetic activation during the task (p < .001), but not by affective responses. It would seem that SET determines the magnitude, rather than the pattern, of physiological activation. This potential to perturb broadly multiple physiological systems may help explain why social stress has been associated with a range of health outcomes. We propose a threshold-activation model as a physiological explanation for why engaging stressors, such as those involving social evaluation or uncontrollability, may seem to induce selectively cortisol release.

  20. Global evaluation of runoff from 10 state-of-the-art hydrological models

    NASA Astrophysics Data System (ADS)

    Beck, Hylke E.; van Dijk, Albert I. J. M.; de Roo, Ad; Dutra, Emanuel; Fink, Gabriel; Orth, Rene; Schellekens, Jaap

    2017-06-01

    Observed streamflow data from 966 medium sized catchments (1000-5000 km²) around the globe were used to comprehensively evaluate the daily runoff estimates (1979-2012) of six global hydrological models (GHMs) and four land surface models (LSMs) produced as part of tier-1 of the eartH2Observe project. The models were all driven by the WATCH Forcing Data ERA-Interim (WFDEI) meteorological dataset, but used different datasets for non-meteorologic inputs and were run at various spatial and temporal resolutions, although all data were re-sampled to a common 0.5° spatial and daily temporal resolution. For the evaluation, we used a broad range of performance metrics related to important aspects of the hydrograph. We found pronounced inter-model performance differences, underscoring the importance of hydrological model uncertainty in addition to climate input uncertainty, for example in studies assessing the hydrological impacts of climate change. The uncalibrated GHMs were found to perform, on average, better than the uncalibrated LSMs in snow-dominated regions, while the ensemble mean was found to perform only slightly worse than the best (calibrated) model. The inclusion of less-accurate models did not appreciably degrade the ensemble performance. Overall, we argue that more effort should be devoted to calibrating and regionalizing the parameters of macro-scale models. We further found that, despite adjustments using gauge observations, the WFDEI precipitation data still contain substantial biases that propagate into the simulated runoff. The early bias in the spring snowmelt peak exhibited by most models is probably primarily due to the widespread precipitation underestimation at high northern latitudes.

  1. Observer efficiency in free-localization tasks with correlated noise.

    PubMed

    Abbey, Craig K; Eckstein, Miguel P

    2014-01-01

    The efficiency of visual tasks involving localization has traditionally been evaluated using forced choice experiments that capitalize on independence across locations to simplify the performance of the ideal observer. However, developments in ideal observer analysis have shown how an ideal observer can be defined for free-localization tasks, where a target can appear anywhere in a defined search region and subjects respond by localizing the target. Since these tasks are representative of many real-world search tasks, it is of interest to evaluate the efficiency of observer performance in them. The central question of this work is whether humans are able to effectively use the information in a free-localization task relative to a similar task where target location is fixed. We use a yes-no detection task at a cued location as the reference for this comparison. Each of the tasks is evaluated using a Gaussian target profile embedded in four different Gaussian noise backgrounds having power-law noise power spectra with exponents ranging from 0 to 3. The free-localization task had a square 6.7° search region. We report on two follow-up studies investigating efficiency in a detect-and-localize task, and the effect of processing the white-noise backgrounds. In the fixed-location detection task, we find average observer efficiency ranges from 35 to 59% for the different noise backgrounds. Observer efficiency improves dramatically in the tasks involving localization, ranging from 63 to 82% in the forced localization tasks and from 78 to 92% in the detect-and-localize tasks. Performance in white noise, the lowest efficiency condition, was improved by filtering to give them a power-law exponent of 2. Classification images, used to examine spatial frequency weights for the tasks, show better tuning to ideal weights in the free-localization tasks. The high absolute levels of efficiency suggest that observers are well-adapted to free-localization tasks.

  2. Observer efficiency in free-localization tasks with correlated noise

    PubMed Central

    Abbey, Craig K.; Eckstein, Miguel P.

    2014-01-01

    The efficiency of visual tasks involving localization has traditionally been evaluated using forced choice experiments that capitalize on independence across locations to simplify the performance of the ideal observer. However, developments in ideal observer analysis have shown how an ideal observer can be defined for free-localization tasks, where a target can appear anywhere in a defined search region and subjects respond by localizing the target. Since these tasks are representative of many real-world search tasks, it is of interest to evaluate the efficiency of observer performance in them. The central question of this work is whether humans are able to effectively use the information in a free-localization task relative to a similar task where target location is fixed. We use a yes-no detection task at a cued location as the reference for this comparison. Each of the tasks is evaluated using a Gaussian target profile embedded in four different Gaussian noise backgrounds having power-law noise power spectra with exponents ranging from 0 to 3. The free-localization task had a square 6.7° search region. We report on two follow-up studies investigating efficiency in a detect-and-localize task, and the effect of processing the white-noise backgrounds. In the fixed-location detection task, we find average observer efficiency ranges from 35 to 59% for the different noise backgrounds. Observer efficiency improves dramatically in the tasks involving localization, ranging from 63 to 82% in the forced localization tasks and from 78 to 92% in the detect-and-localize tasks. Performance in white noise, the lowest efficiency condition, was improved by filtering the backgrounds to give them a power-law exponent of 2. Classification images, used to examine spatial frequency weights for the tasks, show better tuning to ideal weights in the free-localization tasks. The high absolute levels of efficiency suggest that observers are well-adapted to free-localization tasks. PMID:24817854

  3. Perceptual evaluation of visual alerts in surveillance videos

    NASA Astrophysics Data System (ADS)

    Rogowitz, Bernice E.; Topkara, Mercan; Pfeiffer, William; Hampapur, Arun

    2015-03-01

    Visual alerts are commonly used in video monitoring and surveillance systems to mark events, presumably making them more salient to human observers. Surprisingly, the effectiveness of computer-generated alerts in improving human performance has not been widely studied. To address this gap, we have developed a tool for simulating different alert parameters in a realistic visual monitoring situation, and have measured human detection performance under conditions that emulated different set-points in a surveillance algorithm. In the High-Sensitivity condition, the simulated alerts identified 100% of the events with many false alarms. In the Lower-Sensitivity condition, the simulated alerts correctly identified 70% of the targets, with fewer false alarms. In the control condition, no simulated alerts were provided. To explore the effects of learning, subjects performed these tasks in three sessions, on separate days, in a counterbalanced, within subject design. We explore these results within the context of cognitive models of human attention and learning. We found that human observers were more likely to respond to events when marked by a visual alert. Learning played a major role in the two alert conditions. In the first session, observers generated almost twice as many False Alarms as in the No-Alert condition, as the observers responded pre-attentively to the computer-generated false alarms. However, this rate dropped equally dramatically in later sessions, as observers learned to discount the false cues. Highest observer Precision, Hits/(Hits + False Alarms), was achieved in the High Sensitivity condition, but only after training. The successful evaluation of surveillance systems depends on understanding human attention and performance.
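
    The Precision measure named in the abstract above, Hits/(Hits + False Alarms), can be sketched as follows. This is only an illustration of the formula; the function name and the counts are hypothetical and are not taken from the study.

```python
def precision(hits: int, false_alarms: int) -> float:
    """Precision = Hits / (Hits + False Alarms), as defined in the abstract."""
    responses = hits + false_alarms
    if responses == 0:
        raise ValueError("precision is undefined when no responses were made")
    return hits / responses


# Hypothetical counts for one observer in one session (not study data).
example = precision(hits=18, false_alarms=6)
print(f"precision = {example:.2f}")
```

    Under this definition an observer who responds to every false cue is penalized even when all true events are caught, which is consistent with the abstract's remark that the highest Precision was reached only after training.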

  4. Fuzzy model-based observers for fault detection in CSTR.

    PubMed

    Ballesteros-Moncada, Hazael; Herrera-López, Enrique J; Anzurez-Marín, Juan

    2015-11-01

    Among the wide variety of fuzzy model-based observers reported in the literature, which would be the proper one to use for fault detection in a class of chemical reactors? In this study four fuzzy model-based observers for sensor fault detection of a Continuous Stirred Tank Reactor were designed and compared. The designs include (i) a Luenberger fuzzy observer, (ii) a Luenberger fuzzy observer with sliding modes, (iii) a Walcott-Zak fuzzy observer, and (iv) an Utkin fuzzy observer. A negative fault signal, an oscillating fault signal, and a bounded random noise signal with a maximum value of ±0.4 were used to evaluate and compare the performance of the fuzzy observers. The Utkin fuzzy observer showed the best performance under the tested conditions. Copyright © 2015 ISA. Published by Elsevier Ltd. All rights reserved.

  5. Design and implementation of a controlled clinical trial to evaluate the effectiveness and efficiency of routine opt-out rapid human immunodeficiency virus screening in the emergency department.

    PubMed

    Haukoos, Jason S; Hopkins, Emily; Byyny, Richard L; Conroy, Amy A; Silverman, Morgan; Eisert, Sheri; Thrun, Mark; Wilson, Michael; Boyett, Brian; Heffelfinger, James D

    2009-08-01

    In 2006, the Centers for Disease Control and Prevention (CDC) released revised recommendations for performing human immunodeficiency virus (HIV) testing in health care settings, including implementing routine rapid HIV screening, the use of an integrated opt-out consent, and limited prevention counseling. Emergency departments (EDs) have been a primary focus of these efforts. These revised CDC recommendations were primarily based on feasibility studies and have not been evaluated through the application of rigorous research methods. This article describes the design and implementation of a large prospective controlled clinical trial to evaluate the CDC's recommendations in an ED setting. From April 15, 2007, through April 15, 2009, a prospective quasi-experimental equivalent time-samples clinical trial was performed to compare the clinical effectiveness and efficiency of routine (nontargeted) opt-out rapid HIV screening (intervention) to physician-directed diagnostic rapid HIV testing (control) in a high-volume urban ED. In addition, three nested observational studies were performed to evaluate the cost-effectiveness and patient and staff acceptance of the two rapid HIV testing methods. This article describes the rationale, methodologies, and study design features of this program evaluation clinical trial. It also provides details regarding the integration of the principal clinical trial and its nested observational studies. Such ED-based trials are rare, but serve to provide valid comparisons between testing approaches. Investigators should consider similar methodology when performing future ED-based health services research.

  6. Alpha1 LASSO data bundles Lamont, OK

    DOE Data Explorer

    Gustafson, William Jr; Vogelmann, Andrew; Endo, Satoshi; Toto, Tami; Xiao, Heng; Li, Zhijin; Cheng, Xiaoping; Krishna, Bhargavi (ORCID:000000018828528X)

    2016-08-03

    A data bundle is a unified package consisting of LASSO LES input and output, observations, evaluation diagnostics, and model skill scores. LES input includes model configuration information and forcing data. LES output includes profile statistics and full domain fields of cloud and environmental variables. Model evaluation data consists of LES output and ARM observations co-registered on the same grid and sampling frequency. Model performance is quantified by skill scores and diagnostics in terms of cloud and environmental variables.

  7. We're all in this together now: group performance feedback to increase classroom team data collection.

    PubMed

    Pellecchia, Melanie; Connell, James E; Eisenhart, Donald; Kane, Meghan; Schoener, Christine; Turkel, Kimberly; Riley, Megan; Mandell, David S

    2011-08-01

    This study's primary goal was to evaluate the use of performance feedback procedures delivered to a classroom team to increase daily data collection. Performance feedback (PFB) was delivered to four classroom teams responsible for the daily collection of data representing student performance during prescribed instructional activities. Using a multiple-baseline design, the effects of the team performance-feedback were evaluated for the target student, and for generalization to data collection for all classroom students. A secondary question evaluated if student on-task behavior correlated with increased data collection. Finally, social validity was investigated to evaluate team satisfaction with the PFB intervention. The results demonstrate improved data collection across all four classroom teams for the target student in each classroom and generalization within classrooms to all remaining students. Slight increases in student on-task behavior were observed in three of the four classrooms, and teacher satisfaction ratings were high. Copyright © 2011 Society for the Study of School Psychology. Published by Elsevier Ltd. All rights reserved.

  8. Regime-based evaluation of cloudiness in CMIP5 models

    NASA Astrophysics Data System (ADS)

    Jin, Daeho; Oreopoulos, Lazaros; Lee, Dongmin

    2017-01-01

    The concept of cloud regimes (CRs) is used to develop a framework for evaluating the cloudiness of 12 fifth Coupled Model Intercomparison Project (CMIP5) models. Reference CRs come from existing global International Satellite Cloud Climatology Project (ISCCP) weather states. The evaluation is made possible by the implementation in several CMIP5 models of the ISCCP simulator generating in each grid cell daily joint histograms of cloud optical thickness and cloud top pressure. Model performance is assessed with several metrics such as CR global cloud fraction (CF), CR relative frequency of occurrence (RFO), their product [long-term average total cloud amount (TCA)], cross-correlations of CR RFO maps, and a metric of resemblance between model and ISCCP CRs. In terms of CR global RFO, arguably the most fundamental metric, the models perform unsatisfactorily overall, except for CRs representing thick storm clouds. Because model CR CF is internally constrained by our method, RFO discrepancies yield also substantial TCA errors. Our results support previous findings that CMIP5 models underestimate cloudiness. The multi-model mean performs well in matching observed RFO maps for many CRs, but is still not the best for this or other metrics. When overall performance across all CRs is assessed, some models, despite shortcomings, apparently outperform Moderate Resolution Imaging Spectroradiometer cloud observations evaluated against ISCCP like another model output. Lastly, contrasting cloud simulation performance against each model's equilibrium climate sensitivity in order to gain insight on whether good cloud simulation pairs with particular values of this parameter, yields no clear conclusions.

  9. Detecting ecosystem performance anomalies for land management in the Upper Colorado River Basin using satellite observations, climate data, and ecosystem models

    USGS Publications Warehouse

    Gu, Yingxin; Wylie, B.K.

    2010-01-01

    This study identifies areas with ecosystem performance anomalies (EPA) within the Upper Colorado River Basin (UCRB) during 2005-2007 using satellite observations, climate data, and ecosystem models. The final EPA maps with 250-m spatial resolution were categorized as normal performance, underperformance, and overperformance (observed performance relative to weather-based predictions) at the 90% level of confidence. The EPA maps were validated using "percentage of bare soil" ground observations. The validation results at locations with comparable site potential showed that regions identified as persistently underperforming (overperforming) tended to have a higher (lower) percentage of bare soil, suggesting that our preliminary EPA maps are reliable and agree with ground-based observations. The 3-year (2005-2007) persistent EPA map from this study provides the first quantitative evaluation of ecosystem performance anomalies within the UCRB and will help the Bureau of Land Management (BLM) identify potentially degraded lands. Results from this study can be used as a prototype by BLM and other land managers for making optimal land management decisions. © 2010 by the authors.

  10. Detecting Ecosystem Performance Anomalies for Land Management in the Upper Colorado River Basin Using Satellite Observations, Climate Data, and Ecosystem Models

    USGS Publications Warehouse

    Gu, Yingxin; Wylie, Bruce K.

    2010-01-01

    This study identifies areas with ecosystem performance anomalies (EPA) within the Upper Colorado River Basin (UCRB) during 2005–2007 using satellite observations, climate data, and ecosystem models. The final EPA maps with 250-m spatial resolution were categorized as normal performance, underperformance, and overperformance (observed performance relative to weather-based predictions) at the 90% level of confidence. The EPA maps were validated using “percentage of bare soil” ground observations. The validation results at locations with comparable site potential showed that regions identified as persistently underperforming (overperforming) tended to have a higher (lower) percentage of bare soil, suggesting that our preliminary EPA maps are reliable and agree with ground-based observations. The 3-year (2005–2007) persistent EPA map from this study provides the first quantitative evaluation of ecosystem performance anomalies within the UCRB and will help the Bureau of Land Management (BLM) identify potentially degraded lands. Results from this study can be used as a prototype by BLM and other land managers for making optimal land management decisions.

  11. On the Fidelity of Semi-distributed Hydrologic Model Simulations for Large Scale Catchment Applications

    NASA Astrophysics Data System (ADS)

    Ajami, H.; Sharma, A.; Lakshmi, V.

    2017-12-01

    Application of semi-distributed hydrologic modeling frameworks is a viable alternative to fully distributed hyper-resolution hydrologic models owing to their computational efficiency and their ability to resolve the fine-scale spatial structure of hydrologic fluxes and states. However, fidelity of semi-distributed model simulations is impacted by (1) formulation of hydrologic response units (HRUs), and (2) aggregation of catchment properties for formulating simulation elements. Here, we evaluate the performance of a recently developed Soil Moisture and Runoff simulation Toolkit (SMART) for large catchment scale simulations. In SMART, topologically connected HRUs are delineated using thresholds obtained from topographic and geomorphic analysis of a catchment, and simulation elements are equivalent cross sections (ECS) representative of a hillslope in first order sub-basins. Earlier investigations have shown that formulation of ECSs at the scale of a first order sub-basin reduces computational time significantly without compromising simulation accuracy. However, the implementation of this approach has not been fully explored for catchment scale simulations. To assess SMART performance, we set up the model over the Little Washita watershed in Oklahoma. Model evaluations using in-situ soil moisture observations show satisfactory model performance. In addition, we evaluated the performance of a number of soil moisture disaggregation schemes recently developed to provide spatially explicit soil moisture outputs at fine scale resolution. Our results illustrate that the statistical disaggregation scheme performs significantly better than the methods based on topographic data. Future work is focused on assessing the performance of SMART with remotely sensed soil moisture observations and spatially based model evaluation metrics.

  12. A new global and comprehensive model for ICU ventilator performances evaluation.

    PubMed

    Marjanovic, Nicolas S; De Simone, Agathe; Jegou, Guillaume; L'Her, Erwan

    2017-12-01

    This study aimed to provide a new global and comprehensive evaluation of recent ICU ventilators taking into account both technical performances and ergonomics. Six recent ICU ventilators were evaluated. Technical performances were assessed under two FIO2 levels (100%, 50%), three respiratory mechanics combinations (Normal: compliance [C] = 70 mL cmH2O⁻¹/resistance [R] = 5 cmH2O L⁻¹ s⁻¹; Restrictive: C = 30/R = 10; Obstructive: C = 120/R = 20), four exponential levels of leaks (from 0 to 12.5 L min⁻¹) and three levels of inspiratory effort (P0.1 = 2, 4 and 8 cmH2O), using an automated test lung. Ergonomics were evaluated by 20 ICU physicians using a global and comprehensive model involving physiological response to stress measurements (heart rate, respiratory rate, tidal volume variability and eye tracking), psycho-cognitive scales (SUS and NASA-TLX) and objective tasks completion. Few differences in terms of technical performance were observed between devices. Non-invasive ventilation modes had a huge influence on asynchrony occurrence. Using our global model, either objective tasks completion, psycho-cognitive scales and/or physiological measurements were able to depict significant differences in terms of devices' usability. The level of failure that was observed with some devices depicted the lack of adaptation of device's development to end users' requests. Despite similar technical performance, some ICU ventilators exhibit low ergonomics performance and a high risk of misusage.

  13. Three-phase bone scintigraphy for diagnosis of Charcot neuropathic osteoarthropathy in the diabetic foot - does quantitative data improve diagnostic value?

    PubMed

    Fosbøl, M; Reving, S; Petersen, E H; Rossing, P; Lajer, M; Zerahn, B

    2017-01-01

    To investigate whether inclusion of quantitative data on blood flow distribution, compared with visual qualitative evaluation, improves the reliability and diagnostic performance of 99mTc-hydroxymethylene diphosphate three-phase bone scintigraphy (TPBS) in patients with suspected Charcot neuropathic osteoarthropathy (CNO) of the foot. A retrospective cohort study of TPBS performed on 148 patients with suspected acute CNO referred from a single specialized diabetes care centre. The quantitative blood flow distribution was calculated based on the method described by Deutsch et al. All scintigraphies were re-evaluated by independent, blinded observers twice with and without quantitative data on blood flow distribution at ankle and focus level, respectively. The diagnostic validity of TPBS was determined by subsequent review of clinical data and radiological examinations. A total of 90 patients (61%) had confirmed diagnosis of CNO. The sensitivity, specificity and accuracy of three-phase bone scintigraphy without/with quantitative data were 89%/88%, 58%/62% and 77%/78%, respectively. The intra-observer agreement improved significantly by adding quantitative data in the evaluation (Kappa value 0.79/0.94). The interobserver agreement was not significantly improved. Adding quantitative data on blood flow distribution in the interpretation of TPBS improves intra-observer variation, whereas no difference in interobserver variation was observed. The sensitivity of TPBS in the diagnosis of CNO is high, but holds limited specificity. Diagnostic performance does not improve using quantitative data in the evaluation. This may be due to the reference intervals applied in the study or the absence of a proper gold standard diagnostic procedure for comparison. © 2015 Scandinavian Society of Clinical Physiology and Nuclear Medicine. Published by John Wiley & Sons Ltd.
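
    The agreement figures in the abstract above are kappa values. As a minimal sketch, assuming the statistic is Cohen's kappa for two binary readings of the same scans (the abstract does not say which variant was used), it can be computed from a 2x2 agreement table. The function name and counts below are hypothetical.

```python
def cohen_kappa(both_pos: int, first_only: int, second_only: int, both_neg: int) -> float:
    """Cohen's kappa for two binary readings summarized as a 2x2 agreement table."""
    n = both_pos + first_only + second_only + both_neg
    if n == 0:
        raise ValueError("no cases to compare")
    observed = (both_pos + both_neg) / n          # proportion of agreeing cases
    first_pos = both_pos + first_only             # positives in the first reading
    second_pos = both_pos + second_only           # positives in the second reading
    # Agreement expected by chance from the marginal totals of each reading.
    expected = (first_pos * second_pos + (n - first_pos) * (n - second_pos)) / n**2
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1.0 - expected)


# Hypothetical table: 100 scans read twice (not study data).
kappa = cohen_kappa(both_pos=40, first_only=5, second_only=5, both_neg=50)
print(f"kappa = {kappa:.3f}")
```

    Kappa discounts the agreement expected by chance, so it is lower than the raw proportion of agreeing readings (0.90 in this hypothetical table).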

  14. Application of Lidar Data to the Performance Evaluations of ...

    EPA Pesticide Factsheets

    The Tropospheric Ozone (O3) Lidar Network (TOLNet) provides time/height O3 measurements from near the surface to the top of the troposphere to describe in high-fidelity spatial-temporal distributions, which is uniquely useful to evaluate the temporal evolution of O3 profiles in air quality models. This presentation describes the application of the Lidar data to the performance evaluation of CMAQ simulated O3 vertical profiles during the summer, 2014. Two-way coupled WRF-CMAQ simulations with 12km and 4km domains centered over Boulder, Colorado were performed during this time period. The analysis on the time series of observed and modeled O3 mixing ratios at different vertical layers indicates that the model frequently underestimated the observed values, and the underestimation was amplified in the middle model layers (~1km above the ground). When the lightning strikes detected by the National Lightning Detection Network (NLDN) were analyzed along with the observed O3 time series, it was found that the daily maximum O3 mixing ratios correlated well with the lightning strikes in the vicinity of the Lidar station. The analysis on temporal vertical profiles of both observed and modeled O3 mixing ratios on episodic days suggests that the model resolutions (12km and 4km) do not make any significant difference for this analysis (at this specific location and simulation period), but high O3 levels in the middle layers were linked to lightning activity that occurred in t

  15. Performance of Lung Ultrasound in Detecting Peri-Operative Atelectasis after General Anesthesia.

    PubMed

    Yu, Xin; Zhai, Zhenping; Zhao, Yongfeng; Zhu, Zhiming; Tong, Jianbin; Yan, Jianqin; Ouyang, Wen

    2016-12-01

    The aim of this prospective observational study was to evaluate the performance of lung ultrasound (LUS) in detecting post-operative atelectasis in adult patients under general anesthesia. Forty-six patients without pulmonary comorbidities who were scheduled for elective neurosurgery were enrolled in the study. A total of 552 pairs of LUS clips and thoracic computed tomography (CT) images were ultimately analyzed to determine the presence of atelectasis in 12 prescribed lung regions. The accuracy of LUS in detecting peri-operative atelectasis was evaluated with thoracic CT as gold standard. Levels of agreement between the two observers for LUS and the two observers for thoracic CT were analyzed using the κ reliability test. The quantitative correlation between LUS scores of aeration and the volumetric data of atelectasis in thoracic CT were further evaluated. LUS had reliable performance in post-operative atelectasis, with a sensitivity of 87.7%, specificity of 92.1% and diagnostic accuracy of 90.8%. The levels of agreement between the two observers for LUS and for thoracic CT were both satisfactory, with κ coefficients of 0.87 (p < 0.0001) and 0.93 (p < 0.0001), respectively. In patients in the supine position, LUS scores were highly correlated with the atelectasis volume of CT (r = 0.58, p < 0.0001). Thus, LUS provides a fast, reliable and radiation-free method to identify peri-operative atelectasis in adults. Copyright © 2016. Published by Elsevier Inc.
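
    Sensitivity, specificity and diagnostic accuracy, as reported above against a CT gold standard, follow from the four cells of a confusion matrix. The sketch below shows the standard definitions only; the class name and counts are hypothetical and do not reproduce the study's data.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Counts of index-test results against the reference standard."""
    tp: int  # test positive, reference positive
    fp: int  # test positive, reference negative
    tn: int  # test negative, reference negative
    fn: int  # test negative, reference positive


def sensitivity(c: ConfusionCounts) -> float:
    return c.tp / (c.tp + c.fn)


def specificity(c: ConfusionCounts) -> float:
    return c.tn / (c.tn + c.fp)


def accuracy(c: ConfusionCounts) -> float:
    return (c.tp + c.tn) / (c.tp + c.fp + c.tn + c.fn)


# Hypothetical counts for 100 lung regions (not study data).
counts = ConfusionCounts(tp=40, fp=5, tn=45, fn=10)
print(f"sensitivity={sensitivity(counts):.2f} "
      f"specificity={specificity(counts):.2f} "
      f"accuracy={accuracy(counts):.2f}")
```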

  16. Online model evaluation of large-eddy simulations covering Germany with a horizontal resolution of 156 m

    NASA Astrophysics Data System (ADS)

    Hansen, Akio; Ament, Felix; Lammert, Andrea

    2017-04-01

    Large-eddy simulations have been performed for several decades, but due to computational limits most studies were restricted to small domains or idealised initial/boundary conditions. Within the High definition clouds and precipitation for advancing climate prediction (HD(CP)2) project, realistic, weather-forecast-like LES simulations were performed with the newly developed ICON LES model for several days. The domain covers central Europe with a horizontal resolution down to 156 m. The setup consists of more than 3 billion grid cells, so that one 3D dump requires roughly 500 GB. A newly developed online evaluation toolbox was created to check instantaneously for realistic model simulations. The toolbox automatically combines model results with observations and generates several quicklooks for various variables. So far temperature and humidity profiles, cloud cover, integrated water vapour, precipitation and many more are included. All kinds of observations, such as aircraft observations, soundings or precipitation radar networks, are used. For each dataset, a specific module is created, which allows easy handling and enhancement of the toolbox. Most of the observations are automatically downloaded from the Standardized Atmospheric Measurement Database (SAMD). The evaluation tool should support scientists in monitoring computationally costly model simulations and give a first overview of the model's performance. The structure of the toolbox as well as the SAMD database are presented. Furthermore, the toolbox was applied to an ICON LES sensitivity study, for which example results are shown.

  17. 40 CFR 63.8694 - What records must I keep?

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ...)(iii) through (v) related to startup, shutdown, and malfunction. (3) Records of performance tests, performance evaluations, and opacity and visible emission observations as required in § 63.10(b)(2)(viii). (b... Section 63.8694 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) AIR PROGRAMS...

  18. A statistical evaluation and comparison of VISSR Atmospheric Sounder (VAS) data

    NASA Technical Reports Server (NTRS)

    Jedlovec, G. J.

    1984-01-01

    In order to account for the temporal and spatial discrepancies between the VAS and rawinsonde soundings, the rawinsonde data were adjusted to a common hour of release where the new observation time corresponded to the satellite scan time. Both the satellite and rawinsonde observations of the basic atmospheric parameters (T, Td, and Z) were objectively analyzed to a uniform grid maintaining the same mesoscale structure in each data set. The performance of each retrieval algorithm in producing accurate and representative soundings was evaluated using statistical parameters such as the mean, standard deviation, and root mean square of the difference fields for each parameter and grid level. Horizontal structure was also qualitatively evaluated by examining atmospheric features on constant pressure surfaces. An analysis of the vertical structure of the atmosphere was also performed by looking at colocated and grid mean vertical profiles of both the satellite and rawinsonde data sets. Highlights of these results are presented.
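
    The statistical parameters named above (mean, standard deviation, and root mean square of the difference field) can be sketched for one parameter at one grid level as follows. The population form of the standard deviation is an assumption, since the abstract does not specify it, and the function name and temperature values are hypothetical.

```python
import math
import statistics


def difference_stats(satellite, rawinsonde):
    """Return (mean, standard deviation, RMS) of satellite minus rawinsonde values."""
    if len(satellite) != len(rawinsonde) or len(satellite) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    diffs = [s - r for s, r in zip(satellite, rawinsonde)]
    mean = statistics.fmean(diffs)      # systematic offset (bias)
    std = statistics.pstdev(diffs)      # spread about the bias (population form, assumed)
    rms = math.sqrt(statistics.fmean([d * d for d in diffs]))
    return mean, std, rms


# Hypothetical gridded temperatures in kelvin at one level (not data from the report).
sat_t = [271.0, 268.5, 265.0, 262.5]
raob_t = [270.0, 269.0, 264.0, 263.0]
mean, std, rms = difference_stats(sat_t, raob_t)
print(f"mean={mean:.3f} K  std={std:.3f} K  rms={rms:.3f} K")
```

    With the population standard deviation the three quantities are linked by rms² = mean² + std², so the RMS difference combines the bias and the scatter in a single number.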

  19. Real-Time Point Positioning Performance Evaluation of Single-Frequency Receivers Using NASA's Global Differential GPS System

    NASA Technical Reports Server (NTRS)

    Muellerschoen, Ronald J.; Iijima, Byron; Meyer, Robert; Bar-Sever, Yoaz; Accad, Elie

    2004-01-01

    This paper evaluates the performance of a single-frequency receiver using the 1-Hz differential corrections as provided by NASA's global differential GPS system. While the dual-frequency user has the ability to eliminate the ionosphere error by taking a linear combination of observables, the single-frequency user must remove or calibrate this error by other means. To remove the ionosphere error we take advantage of the fact that the group delay in the range observable and the carrier phase advance have the same magnitude but are opposite in sign. A way to calibrate this error is to use a real-time database of grid points computed by JPL's RTI (Real-Time Ionosphere) software. In both cases we evaluate the positional accuracy of a kinematic carrier phase based point positioning method on a global extent.
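
    Because the ionospheric group delay and the carrier phase advance are equal in magnitude and opposite in sign, averaging the code range and the carrier phase (both in metres) cancels the first-order ionospheric term. The sketch below illustrates that cancellation with a simplified measurement model; it is not the paper's processing chain. It ignores noise, multipath and clock errors, and all numbers are hypothetical.

```python
def ionosphere_free_range(code_range_m: float, carrier_phase_m: float) -> float:
    """Average of code range and carrier phase, both expressed in metres.

    Assumed model (simplification): code = rho + I, phase = rho - I + N,
    where rho is the geometric range, I the ionospheric delay and N the
    carrier phase ambiguity in metres. The average is rho + N / 2.
    """
    return 0.5 * (code_range_m + carrier_phase_m)


# Hypothetical values in metres (not from the paper).
true_range = 20_000_000.0
iono_delay = 5.0       # delays the code, advances the phase
ambiguity = 3.0        # constant bias on the carrier phase

code = true_range + iono_delay
phase = true_range - iono_delay + ambiguity

combined = ionosphere_free_range(code, phase)
print(f"residual after combination: {combined - true_range:.2f} m")
```

    The ionospheric term drops out, but half of the phase ambiguity remains as a constant bias, so a positioning filter using this combination still has to estimate that bias.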

  20. Measurement of fog and haze extinction characteristics and availability evaluation of free space optical link under the sea surface environment.

    PubMed

    Wu, Xiaojun; Wang, Hongxing; Song, Bo

    2015-02-10

    Fog and haze can lead to changes in extinction characteristics. Therefore, the performance of the free space optical link is highly influenced by severe weather conditions. Considering the influential behavior of weather conditions, a state-of-the-art solution for the observation of fog and haze over the sea surface is presented in this paper. A Mie scattering laser radar, with a wavelength of 532 nm, is used to observe the weather conditions of the sea surface environment. The horizontal extinction coefficients and visibilities are obtained from the observation data, and the results are presented in the paper. The changes in the characteristics of extinction coefficients and visibilities are analyzed based on both the short-term (6 days) severe weather data and long-term (6 months) data. Finally, the availability performance of the free space optical communication link is evaluated under the sea surface environment.

  1. [Evaluation of the surface of the new intraocular lenses in the scanning electron microscope].

    PubMed

    Kałuzny, B J; Szatkowski, J; Kałuzny, J J

    2001-01-01

    To evaluate the surface of the new PC IOLs commercially available in Poland in 2000. Representative samples of new posterior chamber IOLs produced by 6 different companies (Alcon, Lensita, Medicontur, Opsia, Rayner, Storz), 5 of each, underwent surface examination with a Novoscan 30 scanning electron microscope. Although smooth surfaces of the optic and haptic parts were observed in general, three samples with irregularities were found. Compared with the previous evaluation performed in 1994, significant improvement in the quality of IOL surfaces was noted. No considerable differences in this respect between the above-mentioned producers were observed.

  2. Statistically Comparing the Performance of Multiple Automated Raters across Multiple Items

    ERIC Educational Resources Information Center

    Kieftenbeld, Vincent; Boyer, Michelle

    2017-01-01

    Automated scoring systems are typically evaluated by comparing the performance of a single automated rater item-by-item to human raters. This presents a challenge when the performance of multiple raters needs to be compared across multiple items. Rankings could depend on specifics of the ranking procedure; observed differences could be due to…

  3. Mariner Mars 1971 attitude control subsystem flight performance

    NASA Technical Reports Server (NTRS)

    Schumacher, L.

    1973-01-01

    The flight performance of the Mariner 71 attitude control subsystem is discussed. Each phase of the mission is delineated and the attitude control subsystem is evaluated within the observed operational environment. Performance anomalies are introduced and discussed within the context of general performance. Problems such as the sun sensor interface incompatibility, gas valve leaks, and scan platform dynamic coupling effects are given analytical considerations.

  4. An evaluation of Dynamic TOPMODEL for low flow simulation

    NASA Astrophysics Data System (ADS)

    Coxon, G.; Freer, J. E.; Quinn, N.; Woods, R. A.; Wagener, T.; Howden, N. J. K.

    2015-12-01

    Hydrological models are essential tools for drought risk management, often providing input to water resource system models, aiding our understanding of low flow processes within catchments and providing low flow predictions. However, simulating low flows and droughts is challenging as hydrological systems often demonstrate threshold effects in connectivity, non-linear groundwater contributions and a greater influence of water resource system elements during low flow periods. These dynamic processes are typically not well represented in commonly used hydrological models due to data and model limitations. Furthermore, calibrated or behavioural models may not be effectively evaluated during more extreme drought periods. A better understanding of the processes that occur during low flows and how these are represented within models is thus required if we want to be able to provide robust and reliable predictions of future drought events. In this study, we assess the performance of dynamic TOPMODEL for low flow simulation. Dynamic TOPMODEL was applied to a number of UK catchments in the Thames region using time series of observed rainfall and potential evapotranspiration data that captured multiple historic droughts over a period of several years. The model performance was assessed against the observed discharge time series using a limits of acceptability framework, which included uncertainty in the discharge time series. We evaluate the models against multiple signatures of catchment low-flow behaviour and investigate differences in model performance between catchments, model diagnostics and for different low flow periods. We also considered the impact of surface water and groundwater abstractions and discharges on the observed discharge time series and how this affected the model evaluation. From analysing the model performance, we suggest future improvements to Dynamic TOPMODEL to improve the representation of low flow processes within the model structure.

  5. Comprehensive Performance Evaluation for Hydrological and Nutrients Simulation Using the Hydrological Simulation Program–Fortran in a Mesoscale Monsoon Watershed, China

    PubMed Central

    Luo, Chuan; Jiang, Kaixia; Wan, Rongrong; Li, Hengpeng

    2017-01-01

    The Hydrological Simulation Program–Fortran (HSPF) is a hydrological and water quality computer model that was developed by the United States Environmental Protection Agency. Comprehensive performance evaluations were carried out for hydrological and nutrient simulation using the HSPF model in the Xitiaoxi watershed in China. Streamflow simulation was calibrated from 1 January 2002 to 31 December 2007 and then validated from 1 January 2008 to 31 December 2010 using daily observed data, and nutrient simulation was calibrated and validated using monthly observed data during the period from July 2009 to July 2010. The results of the model performance evaluation showed that the streamflows were well simulated over the study period. The determination coefficient (R²) was 0.87, 0.77 and 0.63, and the Nash-Sutcliffe coefficient of efficiency (Ens) was 0.82, 0.76 and 0.65 for the streamflow simulation in annual, monthly and daily time-steps, respectively. Although limited to monthly observed data, satisfactory performance was still achieved during the quantitative evaluation for nutrients. The R² was 0.73, 0.82 and 0.92, and the Ens was 0.67, 0.74 and 0.86 for nitrate, ammonium and orthophosphate simulation, respectively. Some issues that may affect the application of HSPF, such as input data quality and parameter values, were also discussed. Overall, the HSPF model can be successfully used to describe streamflow and nutrient transport in the mesoscale watershed located in the East Asian monsoon climate area. This study is expected to serve as a comprehensive and systematic documentation of understanding the HSPF model for wide application and avoiding possible misuses. PMID:29257117
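
    The two goodness-of-fit statistics reported in this abstract can be illustrated with a minimal sketch. It assumes the standard textbook definitions: the Nash-Sutcliffe efficiency as one minus the ratio of squared simulation error to the variance of the observations about their mean, and R² as the squared Pearson correlation. The function names and the toy series are hypothetical and are not taken from the study.

```python
def nash_sutcliffe(observed, simulated):
    """Nash-Sutcliffe efficiency: 1 - SSE / variance of observations about their mean."""
    mean_obs = sum(observed) / len(observed)
    sse = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    sst = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - sse / sst


def r_squared(observed, simulated):
    """Coefficient of determination as the squared Pearson correlation."""
    n = len(observed)
    mean_obs = sum(observed) / n
    mean_sim = sum(simulated) / n
    cov = sum((o - mean_obs) * (s - mean_sim) for o, s in zip(observed, simulated))
    var_obs = sum((o - mean_obs) ** 2 for o in observed)
    var_sim = sum((s - mean_sim) ** 2 for s in simulated)
    return cov ** 2 / (var_obs * var_sim)


# Toy data (hypothetical): the simulation is perfectly correlated with the
# observations but biased by a factor of two.
obs = [1.0, 2.0, 3.0, 4.0]
sim = [2.0, 4.0, 6.0, 8.0]
print(r_squared(obs, sim))       # correlation ignores the bias
print(nash_sutcliffe(obs, sim))  # efficiency penalises it
```

    The toy series shows why the two statistics are usually reported together: R² rewards correlation only, whereas the Nash-Sutcliffe efficiency also penalises systematic bias and can fall below zero.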

  6. Comprehensive Performance Evaluation for Hydrological and Nutrients Simulation Using the Hydrological Simulation Program-Fortran in a Mesoscale Monsoon Watershed, China.

    PubMed

    Li, Zhaofu; Luo, Chuan; Jiang, Kaixia; Wan, Rongrong; Li, Hengpeng

    2017-12-19

    The Hydrological Simulation Program-Fortran (HSPF) is a hydrological and water quality computer model that was developed by the United States Environmental Protection Agency. Comprehensive performance evaluations were carried out for hydrological and nutrient simulation using the HSPF model in the Xitiaoxi watershed in China. Streamflow simulation was calibrated from 1 January 2002 to 31 December 2007 and then validated from 1 January 2008 to 31 December 2010 using daily observed data, and nutrient simulation was calibrated and validated using monthly observed data during the period from July 2009 to July 2010. The results of the model performance evaluation showed that the streamflows were well simulated over the study period. The determination coefficient (R²) was 0.87, 0.77 and 0.63, and the Nash-Sutcliffe coefficient of efficiency (Ens) was 0.82, 0.76 and 0.65 for the streamflow simulation in annual, monthly and daily time-steps, respectively. Although limited to monthly observed data, satisfactory performance was still achieved during the quantitative evaluation for nutrients. The R² was 0.73, 0.82 and 0.92, and the Ens was 0.67, 0.74 and 0.86 for nitrate, ammonium and orthophosphate simulation, respectively. Some issues that may affect the application of HSPF, such as input data quality and parameter values, were also discussed. Overall, the HSPF model can be successfully used to describe streamflow and nutrient transport in the mesoscale watershed located in the East Asian monsoon climate area. This study is expected to serve as a comprehensive and systematic documentation of understanding the HSPF model for wide application and avoiding possible misuses.

  7. A PERFORMANCE EVALUATION OF THE ETA-CMAQ AIR QUALITY FORECAST SYSTEM FOR THE SUMMER OF 2005

    EPA Science Inventory

    This poster presents an evaluation of the Eta-CMAQ Air Quality Forecast System's experimental domain using O3 observations obtained from EPA's AIRNOW program and a suite of statistical metrics examining both discrete and categorical forecasts.

  8. The Role of Performance Quality in Adolescents' Self-Evaluation and Rumination after a Speech: Is it Contingent on Social Anxiety Level?

    PubMed

    Blöte, Anke W; Miers, Anne C; Van den Bos, Esther; Westenberg, P Michiel

    2018-05-17

    Cognitive behavioural therapy (CBT) has relatively poor outcomes for youth with social anxiety, possibly because broad-based CBT is not tailored to their specific needs. Treatment of social anxiety in youth may need to pay more attention to negative social cognitions that are considered a key factor in social anxiety development and maintenance. The aim of the present study was to learn more about the role of performance quality in adolescents' cognitions about their social performance and, in particular, the moderating role social anxiety plays in the relationship between performance quality and self-cognitions. A community sample of 229 participants, aged 11 to 18 years, gave a speech and filled in questionnaires addressing social anxiety, depression, expected and self-evaluated performance, and post-event rumination. Independent observers rated the quality of the speech. The data were analysed using moderated mediation analysis. Performance quality mediated the link between expected and self-evaluated performance in adolescents with low and medium levels of social anxiety. For adolescents with high levels of social anxiety, only a direct link between expected and self-evaluated performance was found. Their self-evaluation was not related to the quality of their performance. Performance quality also mediated the link between expected performance and rumination, but social anxiety did not moderate this mediation effect. Results suggest that a good performance does not help socially anxious adolescents to replace their negative self-evaluations with more realistic ones. Specific cognitive intervention strategies should be tailored to the needs of socially anxious adolescents who perform well.

  9. Evaluation and Optimization of China's Anthropogenic CO2 Emissions using Observations from Northern China (2005-2009).

    NASA Astrophysics Data System (ADS)

    Dayalu, A.; Munger, J. W.; Wang, Y.; Wofsy, S.; Zhao, Y.; Nielsen, C. P.; Nehrkorn, T.; McElroy, M. B.; Chang, R.

    2017-12-01

    China has pledged to peak carbon emissions by 2030, but there continues to be significant uncertainty in estimates of its anthropogenic carbon dioxide (CO2) emissions. In this study, we evaluate the performance of three anthropogenic CO2 inventories, two global and one regional, using five years of continuous hourly observations from a site in Northern China. We model five years of continuous hourly observations (2005 to 2009) using the Stochastic Time-Inverted Lagrangian Transport Model (STILT) run in backward time mode driven by high resolution meteorology from the Weather Research and Forecasting Model version 3.6.1 (WRF) with vegetation fluxes prescribed by a simple biosphere model. We calculate regional enhancements to advected background CO2 derived from NOAA CarbonTracker on seasonal and annual bases and use observations to optimize emissions inventories within the site's influence region at these timescales. Finally, we use annual enhancements to examine carbon intensity of provinces in and adjacent to Northern China as CO2 per unit of the region's GDP to evaluate the effects of local and global economic events on CO2 emissions. With the exception of peak growing season where discrepancies are confounded by errors in the vegetation model, we find the regional inventory agrees significantly better with observations than the global inventories at all timescales. Here we use a single measurement site; significant improvements in inventory optimizations can be achieved with a network of measurement stations. This study highlights the importance of China-specific data over global averages in emissions evaluation and demonstrates the value of top-down studies in independently evaluating inventory performance. We demonstrate the framework's ability to resolve differences of at least 20% among inventories, establishing a benchmark for ongoing efforts to decrease uncertainty in China's reported CO2 emissions estimates.

  10. Effects of structured written feedback by cards on medical students' performance at Mini Clinical Evaluation Exercise (Mini-CEX) in an outpatient clinic.

    PubMed

    Haghani, Fariba; Hatef Khorami, Mohammad; Fakhari, Mohammad

    2016-07-01

    Feedback cards are recommended as a feasible tool for structured written feedback delivery in clinical education, while the effectiveness of this tool on medical students' performance is still questionable. The purpose of this study was to compare the effects of structured written feedback by cards as well as verbal feedback versus verbal feedback alone on the clinical performance of medical students at the Mini Clinical Evaluation Exercise (Mini-CEX) test in an outpatient clinic. This is a quasi-experimental study with pre- and post-test comprising four groups in two terms of medical students' externship. The students' performance was assessed through the Mini-Clinical Evaluation Exercise (Mini-CEX) as a clinical performance evaluation tool. Structured written feedback was given to two experimental groups by designed feedback cards as well as verbal feedback, while in the two control groups feedback was delivered verbally as a routine approach in clinical education. By consecutive sampling method, 62 externship students were enrolled in this study and seven students were excluded from the final analysis due to their absence for three days. According to ANOVA and the post hoc Tukey test, no statistically significant difference was observed among the four groups at the pre-test, whereas a statistically significant difference was observed between the experimental and control groups at the post-test (F = 4.023, p = 0.012). The effect size of the structured written feedback on clinical performance was 0.19. Structured written feedback by cards could improve the performance of medical students in a statistical sense. Further studies must be conducted in other clinical courses with longer durations.
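
    The abstract does not say which effect-size measure was used. As a hedged illustration only, the sketch below assumes a one-way ANOVA over the 55 analysed students (62 enrolled minus 7 excluded) in four groups and computes eta squared from the reported F statistic. The function name is hypothetical, and the design assumptions are ours, not the authors'.

```python
def eta_squared_from_f(f_value, df_between, df_within):
    """Eta squared recovered from a one-way ANOVA F statistic and its degrees of freedom."""
    return (f_value * df_between) / (f_value * df_between + df_within)


# Assumed design (not stated in the abstract): all analysed students in one
# four-group, one-way ANOVA at the post-test.
n_analysed = 62 - 7               # enrolled minus excluded
n_groups = 4
df_between = n_groups - 1         # 3
df_within = n_analysed - n_groups # 51

eta_sq = eta_squared_from_f(4.023, df_between, df_within)
print(round(eta_sq, 2))  # prints 0.19
```

    Under these assumptions the result agrees with the reported effect size of 0.19, which is consistent with, but does not confirm, eta squared being the measure the authors reported.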

  11. Blind Demodulation of Pass Band OFDMA Signals and Jamming Battle Damage Assessment Utilizing Link Adaptation

    DTIC Science & Technology

    2014-03-27

    Access (OFDMA) signal so that jamming effectiveness can be assessed; referred to in this research as Battle Damage Assessment ( BDA ). The research extends...the 802.16 Wireless Metropolitan Area Network (MAN) OFDMA standard, and presents a novel method for performing BDA via observation of Sub Carrier (SC...interferer is also evaluated where the blind demodulator’s performance is degraded. BDA is achieved via observing SC LA modulation behavior of the

  12. Transient and steady-state performance of a single turbojet combustor with four different fuel nozzles

    NASA Technical Reports Server (NTRS)

    Mccafferty, Richard J; Donlon, Richard H

    1955-01-01

    Acceleration and steady-state performance of a tubular combustor was evaluated at two simulated altitudes with four different fuel nozzles. Temperature response lag was observed with all the nozzles. Except for rich-limit blowout, the only combustion failures observed during acceleration were with a fuel nozzle that gave an interrupted flow delivery during the acceleration. This same nozzle, because of superior fuel atomization, gave the highest steady-state combustion efficiencies.

  13. A general enhancement of autonomic and cortisol responses during social evaluative threat

    PubMed Central

    Bosch, Jos A.; de Geus, Eco J.C.; Carroll, Douglas; Goedhart, Annebet D.; Anane, Leila A.; van Zanten, Jet J Veldhuizen; Helmerhorst, Eva J.; Edwards, Kate M.

    2013-01-01

    Objective The idea that distinct psychosocial factors may underlie specific patterns of neuroendocrine stress responses has been a topic of recurrent debate. We examined a recent contribution to this debate, the Social Self Preservation Theory, which predicts that stressors involving social evaluative threat (SET) characteristically activate the hypothalamic-pituitary-adrenal (HPA) axis. Methods Sixty-one healthy university students (31 females) performed a challenging speech task in one of three conditions that aimed to impose increasing levels of SET: performing the task alone (no social evaluation), with 1 evaluating observer, or with 4 evaluating observers. Indices of sympathetic (pre-ejection period) and parasympathetic (heart rate variability) cardiac drive were obtained by impedance- and electrocardiography. Salivary cortisol was used to index HPA activity. Questionnaires assessed affective responses. Results Affective responses (shame/embarrassment, anxiety, negative affect, and self-esteem), cortisol, heart rate, sympathetic, and parasympathetic activation all differentiated evaluative from non-evaluative task conditions (p<.001). The largest effect-sizes were observed for cardiac autonomic responses. Physiological reactivity increased in parallel with increasing audience size (p<.001). A rise in cortisol was predicted by sympathetic activation during the task (p<.001), but not by affective responses. Conclusion It would appear that SET determines the magnitude, rather than the pattern, of physiological activation. This potential to broadly perturb multiple physiological systems may help explain why social stress has been associated with a range of health outcomes. We propose a threshold-activation model as a physiological explanation for why engaging stressors, such as those involving social evaluation or uncontrollability, may appear to selectively induce cortisol release. PMID:19779143

  14. The Impact of Current and Future Polar Orbiting Satellite Data on Numerical Weather Prediction at NASA/GSFC

    NASA Technical Reports Server (NTRS)

    Atlas, Robert

    2004-01-01

    The lack of adequate observational data continues to be recognized as a major factor limiting both atmospheric research and numerical prediction on a variety of temporal and spatial scales. Since the advent of meteorological satellites in the 1960s, a considerable research effort has been directed toward the design of space-borne meteorological sensors, the development of optimal methods for the utilization of these data, and an assessment of the influence of existing satellite data and the potential influence of future satellite observations on numerical weather prediction. This has included both Observing System Experiments (OSEs) and Observing System Simulation Experiments (OSSEs). OSEs are conducted to evaluate the impact of specific observations or classes of observations on analyses and forecasts. While OSEs are performed with existing data, OSSEs are conducted to evaluate the potential for future observing systems to improve NWP, as well as to evaluate trade-offs in observing system design, and to develop and test improved methods for data assimilation. At the conference, results from OSEs to evaluate satellite data sets that have recently become available to the global observing system, such as AIRS and Seawinds, and results from OSSEs to determine the potential impact of space-based lidar winds will be presented.

  15. Foveated model observers to predict human performance in 3D images

    NASA Astrophysics Data System (ADS)

    Lago, Miguel A.; Abbey, Craig K.; Eckstein, Miguel P.

    2017-03-01

    We evaluate whether 3D search requires model observers that take into account peripheral human visual processing (foveated models) to predict human observer performance. We show that two different 3D tasks, free search and location-known detection, influence the relative human visual detectability of two signals of different sizes in synthetic backgrounds mimicking the noise found in 3D digital breast tomosynthesis. One of the signals resembled a microcalcification (a small and bright sphere), while the other one was designed to look like a mass (a larger Gaussian blob). We evaluated current standard model observers (Hotelling; Channelized Hotelling; non-prewhitening matched filter with eye filter, NPWE; and non-prewhitening matched filter model, NPW) and showed that they incorrectly predict the relative detectability of the two signals in 3D search. We propose a new model observer (3D Foveated Channelized Hotelling Observer) that incorporates the properties of the visual system over a large visual field (fovea and periphery). We show that the foveated model observer can accurately predict the rank order of detectability of the signals in 3D images for each task. Together, these results motivate the use of a new generation of foveated model observers for predicting image quality for search tasks in 3D imaging modalities such as digital breast tomosynthesis or computed tomography.

  16. Approaches to chronic disease management evaluation in use in Europe: a review of current methods and performance measures.

    PubMed

    Conklin, Annalijn; Nolte, Ellen; Vrijhoef, Hubertus

    2013-01-01

    An overview was produced of approaches currently used to evaluate chronic disease management in selected European countries. The study aims to describe the methods and metrics used in Europe as a first step to help advance the methodological basis for their assessment. A common template for collection of evaluation methods and performance measures was sent to key informants in twelve European countries; responses were summarized in tables based on template evaluation categories. Extracted data were descriptively analyzed. Approaches to the evaluation of chronic disease management vary widely in objectives, designs, metrics, observation period, and data collection methods. Half of the reported studies used noncontrolled designs. The majority assess clinical process measures, patient behavior and satisfaction, cost and utilization; several also used a range of structural indicators. Effects are usually observed over 1 or 3 years on patient populations with a single, commonly prevalent, chronic disease. There is wide variation within and between European countries on approaches to evaluating chronic disease management in their objectives, designs, indicators, target audiences, and actors involved. This study is the first extensive, international overview of the area reported in the literature.

  17. Objective evaluation of acute adverse events and image quality of gadolinium-based contrast agents (gadobutrol and gadobenate dimeglumine) by blinded evaluation. Pilot study.

    PubMed

    Semelka, Richard C; Hernandes, Mateus de A; Stallings, Clifton G; Castillo, Mauricio

    2013-01-01

    The purpose was to objectively evaluate a recently FDA-approved gadolinium-based contrast agent (GBCA) in comparison to our standard GBCA for acute adverse events and image quality by blinded evaluation. Evaluation was made of a recently FDA-approved GBCA, gadobutrol (Gadavist; Bayer), in comparison to our standard GBCA, gadobenate dimeglumine (MultiHance; Bracco), in an IRB- and HIPAA-compliant study. Both the imaging technologist and patient were not aware of the brand of the GBCA used. A total of 59 magnetic resonance studies were evaluated (59 patients, 31 men, 28 women, age range of 5-85 years, mean age of 52 years). Twenty-nine studies were performed with gadobutrol (22 abdominal and 7 brain studies), and 30 studies were performed with gadobenate dimeglumine (22 abdominal and 8 brain studies). Assessment was made of acute adverse events focusing on objective observations of vomiting, hives, and moderate and severe reactions. Adequacy of enhancement was rated as poor, fair and good by one of two experienced radiologists who were blinded to the type of agent evaluated. No patient experienced acute adverse events with either agent. The target minor adverse events of vomiting or hives, and moderate and severe reactions were not observed in any patient. Adequacy of enhancement was rated as good for both agents in all patients. Objective, blinded evaluation is feasible and readily performable for the evaluation of GBCAs. This proof-of-concept study showed that both GBCAs evaluated exhibited consistent good image quality and no noteworthy adverse events. Copyright © 2013 Elsevier Inc. All rights reserved.

  18. Tuberculosis control program in the municipal context: performance evaluation

    PubMed Central

    Arakawa, Tiemi; Magnabosco, Gabriela Tavares; Andrade, Rubia Laine de Paula; Brunello, Maria Eugenia Firmino; Monroe, Aline Aparecida; Ruffino-Netto, Antonio; Scatena, Lucia Marina; Villa, Tereza Cristina Scatena

    2017-01-01

    ABSTRACT OBJECTIVE The objective of this study is to evaluate the performance of the Tuberculosis Control Program in municipalities of the State of São Paulo. METHODS This is a program evaluation research, with ecological design, which uses three non-hierarchical groups of the municipalities of the State of São Paulo according to their performance in relation to operational indicators. We have selected 195 municipalities with at least five new cases of tuberculosis notified in the Notification System of the State of São Paulo and with 20,000 inhabitants or more in 2010. The multiple correspondence analysis was used to identify the association between the groups of different performances, the epidemiological and demographic characteristics, and the characteristics of the health systems of the municipalities. RESULTS The group with the worst performance showed the highest rates of abandonment (average [avg] = 10.4, standard deviation [sd] = 9.4) and the lowest rates of supervision of Directly Observed Treatment (avg = 6.1, sd = 12.9), and it was associated with low incidence of tuberculosis, high tuberculosis and HIV, small population, high coverage of the Family Health Strategy/Program of Community Health Agents, and being located on the countryside. The group with the best performance presented the highest cure rate (avg = 83.7, sd = 10.5) and the highest rate of cases in Directly Observed Treatment (avg = 83.0, sd = 12.7); the group of regular performance showed regular results for outcome (avg cure = 79.8, sd = 13.2; abandonment avg = 9.5, sd = 8.3) and supervision of the Directly Observed Treatment (avg = 42.8, sd = 18.8). Large population, low coverage of the Family Health Strategy/Program of Community Health Agents, high incidence of tuberculosis and AIDS, and being located on the coast and in metropolitan areas were associated with these groups. 
    CONCLUSIONS The findings highlight the importance of the Directly Observed Treatment in relation to the outcome for treatment and raise reflections on the structural and managerial capacity of municipalities in the implementation of the Tuberculosis Control Program. PMID:28380207

  19. GUIDANCE FOR THE PERFORMANCE EVALUATION OF THREE-DIMENSIONAL AIR QUALITY MODELING SYSTEMS FOR PARTICULATE MATTER AND VISIBILITY

    EPA Science Inventory

    The National Ambient Air Quality Standards for particulate matter (PM) and the federal regional haze regulations place some emphasis on the assessment of fine particle (PM2.5) concentrations. Current air quality models need to be improved and evaluated against observations to a...

  20. Control of large flexible space structures

    NASA Technical Reports Server (NTRS)

    Vandervelde, W. E.

    1986-01-01

    Progress in robust design of generalized parity relations, design of failure sensitive observers using the geometric system theory of Wonham, computational techniques for evaluation of the performance of control systems with fault tolerance and redundancy management features, and the design and evaluation of control systems for structures having nonlinear joints are described.

  1. 33 CFR 154.2010 - Qualifications for acceptance as a certifying entity.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... qualifications: (a) The ability to review and evaluate design drawings and failure analyses for compliance to... incorporated by reference; (c) The ability to monitor and evaluate test procedures and results for compliance with the operational requirements of this subpart; (d) The ability to perform inspections and observe...

  2. Evaluation of decadal predictions using a satellite simulator for the Special Sensor Microwave Imager (SSM/I)

    NASA Astrophysics Data System (ADS)

    Spangehl, Thomas; Schröder, Marc; Bodas-Salcedo, Alejandro; Glowienka-Hense, Rita; Hense, Andreas; Hollmann, Rainer; Dietzsch, Felix

    2017-04-01

    Decadal climate predictions are commonly evaluated focusing on geophysical parameters such as temperature, precipitation or wind speed using observational datasets and reanalysis. Alternatively, satellite based radiance measurements combined with satellite simulator techniques to deduce virtual satellite observations from the numerical model simulations can be used. The latter approach enables an evaluation in the instrument's parameter space and has the potential to reduce uncertainties on the reference side. Here we present evaluation methods focusing on forward operator techniques for the Special Sensor Microwave Imager (SSM/I). The simulator is developed as an integrated part of the CFMIP Observation Simulator Package (COSP). On the observational side the SSM/I and SSMIS Fundamental Climate Data Record (FCDR) released by CM SAF (http://dx.doi.org/10.5676/EUM_SAF_CM/FCDR_MWI/V002) is used, which provides brightness temperatures for different channels and covers the period from 1987 to 2013. The simulator is applied to hindcast simulations performed within the MiKlip project (http://fona-miklip.de) which is funded by the BMBF (Federal Ministry of Education and Research in Germany). Probabilistic evaluation results are shown based on a subset of the hindcast simulations covering the observational period.

  3. Performance assessment through pre- and post-training evaluation of continuing medical education courses in prevention and management of cardio-vascular diseases in primary health care facilities of Armenia.

    PubMed

    Khachatryan, Lilit; Balalian, Arin

    2013-12-01

    To assess the difference of pre- and post-training performance evaluation of continuing medical education (CME) courses in cardio-vascular diseases (CVD) management among physicians at primary health care facilities of Armenian regions we conducted an evaluation survey. 212 medical records were surveyed on assessment of performance before and after the training courses through a self-employed structured questionnaire. Analysis of survey revealed statistically significant differences (p < 0.05) in a number of variables: threefold increased recording of lipids and body mass index (p = 0.001); moderate increased recording of comorbidities and aspirin prescription (p < 0.012); eightfold increased recording of dyslipidemia management plan, twofold increased recording for CVD management plan and fivefold increased recording for CVD absolute risk (p = 0.000). Missing records of electrocardiography and urine/creatinine analyses decreased statistically significantly (p < 0.05). Statistically significant decrease was observed in prescription of thiazides and angiotensin receptor blockers/angiotensin converting enzyme inhibitors (p < 0.005), while prescription of statins and statins with diet for dyslipidemia management showed increased recording (p < 0.05). Similarly, we observed increased records for counseling of rehabilitation physical activity (p = 0.006). In this survey most differences in pre- and post-evaluation of performance assessment may be explained by improved and interactive training modes, more advanced methods of demonstration of modeling. Current findings may serve a basis for future planning of CME courses for physicians of remote areas facing challenges in upgrading their knowledge, as well as expand the experience of performance assessment along with evaluation of knowledge scores.

  4. Feedback-giving behaviour in performance evaluations during clinical clerkships.

    PubMed

    Bok, Harold G J; Jaarsma, Debbie A D C; Spruijt, Annemarie; Van Beukelen, Peter; Van Der Vleuten, Cees P M; Teunissen, Pim W

    2016-01-01

    Narrative feedback documented in performance evaluations by the teacher, i.e. the clinical supervisor, is generally accepted to be essential for workplace learning. Many studies have examined factors of influence on the usage of mini-clinical evaluation exercise (mini-CEX) instruments and provision of feedback, but little is known about how these factors influence teachers' feedback-giving behaviour. In this study, we investigated teachers' use of mini-CEX in performance evaluations to provide narrative feedback in undergraduate clinical training. We designed an exploratory qualitative study using an interpretive approach. Focusing on the usage of mini-CEX instruments in clinical training, we conducted semi-structured interviews to explore teachers' perceptions. Between February and June 2013, we conducted interviews with 14 clinicians who participated as teachers during undergraduate clinical clerkships. Informed by concepts from the literature, we coded interview transcripts and iteratively reduced and displayed data using template analysis. We identified three main themes of interrelated factors that influenced teachers' practice with regard to mini-CEX instruments: teacher-related factors; teacher-student interaction-related factors; and teacher-context interaction-related factors. Four issues (direct observation, relationship between teacher and student, verbal versus written feedback, formative versus summative purposes) that are pertinent to workplace-based performance evaluations were presented to clarify how different factors interact with each other and influence teachers' feedback-giving behaviour. Embedding performance observation in clinical practice and establishing trustworthy teacher-student relationships in more longitudinal clinical clerkships were considered important in creating a learning environment that supports and facilitates the feedback exchange. Teachers' feedback-giving behaviour within the clinical context results from the interaction between personal, interpersonal and contextual factors. Increasing insight into how teachers use mini-CEX instruments in daily practice may offer strategies for creating a professional learning culture in which feedback giving and seeking would be enhanced.

  5. Investigating the feasibility of using partial least squares as a method of extracting salient information for the evaluation of digital breast tomosynthesis

    NASA Astrophysics Data System (ADS)

    Zhang, George Z.; Myers, Kyle J.; Park, Subok

    2013-03-01

    Digital breast tomosynthesis (DBT) has shown promise for improving the detection of breast cancer, but it has not yet been fully optimized due to a large space of system parameters to explore. A task-based statistical approach [1] is a rigorous method for evaluating and optimizing this promising imaging technique with the use of optimal observers such as the Hotelling observer (HO). However, the high data dimensionality found in DBT has been the bottleneck for the use of a task-based approach in DBT evaluation. To reduce data dimensionality while extracting salient information for performing a given task, efficient channels have to be used for the HO. In the past few years, 2D Laguerre-Gauss (LG) channels, which are a complete basis for stationary backgrounds and rotationally symmetric signals, have been utilized for DBT evaluation [2, 3]. But since background and signal statistics from DBT data are neither stationary nor rotationally symmetric, LG channels may not be efficient in providing reliable performance trends as a function of system parameters. Recently, partial least squares (PLS) has been shown to generate efficient channels for the Hotelling observer in detection tasks involving random backgrounds and signals [4]. In this study, we investigate the use of PLS as a method for extracting salient information from DBT in order to better evaluate such systems.

  6. Reliability and reproducibility of disc-foveal angle measurements by non-mydriatic fundus photography.

    PubMed

    Le Jeune, Caroline; Chebli, Fayçal; Leon, Lorette; Anthoine, Emmanuelle; Weber, Michel; Péchereau, Alain; Lebranchu, Pierre

    2018-01-01

    Abnormal torsion could be associated with cyclovertical strabismus, but torsion measurements are not reliable in children. To assess an objective fundus torsion evaluation in a paediatric population, we used non-mydriatic fundus photography (NMFP) in healthy children and patients with cyclovertical strabismus to evaluate the disc-foveal angle over time and across observers. We used a retrospective set of NMFP including 24 children with A- or V-pattern strabismus and 27 age-matched normal children (mean age 6.4 and 6.7 years respectively), taken during 2 distinct follow-up consultations (separated by 251 and 479 days respectively). Each disc-foveal angle measurement (from which the ocular torsion can be assessed) was performed by 5 different observers, using graphical software and based on reproducible fundus anatomical marks. Statistical analysis was performed with a multivariate ANOVA using group, time and observers as factors, in addition to the intraclass correlation coefficient (ICC) to assess measurement reproducibility. A significant difference in disc-foveal angle measures was observed between groups (p < 0.001): 18.73° (SD = 6.42), -3.25° (SD = 5.51) and 6.89° (SD = 4.41) respectively for V-pattern, A-pattern and normal subjects. Neither observers (F = 0.2028, p = 0.9369) nor time between the 1st and 2nd NMFP (F = 0.6312, p = 0.4271) seems to influence the measure of disc-foveal angle. The evaluation of disc-foveal angle was very reproducible between observers (ICC > 0.97). An abnormal amount of objective torsion could be associated with alphabet-pattern strabismus. Disc-foveal angle evaluation by NMFP in a paediatric population appears to be a non-invasive, reliable and reproducible method.

  7. Donabedian's structure-process-outcome quality of care model: Validation in an integrated trauma system.

    PubMed

    Moore, Lynne; Lavoie, André; Bourgeois, Gilles; Lapointe, Jean

    2015-06-01

    According to Donabedian's health care quality model, improvements in the structure of care should lead to improvements in clinical processes that should in turn improve patient outcome. This model has been widely adopted by the trauma community but has not yet been validated in a trauma system. The objective of this study was to assess the performance of an integrated trauma system in terms of structure, process, and outcome and evaluate the correlation between quality domains. Quality of care was evaluated for patients treated in a Canadian provincial trauma system (2005-2010; 57 centers, n = 63,971) using quality indicators (QIs) developed and validated previously. Structural performance was measured by transposing on-site accreditation visit reports onto an evaluation grid according to American College of Surgeons criteria. The composite process QI was calculated as the average sum of proportions of conformity to 15 process QIs derived from literature review and expert opinion. Outcome performance was measured using risk-adjusted rates of mortality, complications, and readmission as well as hospital length of stay (LOS). Correlation was assessed with Pearson's correlation coefficients. Statistically significant correlations were observed between structure and process QIs (r = 0.33), and process and outcome QIs (r = -0.33 for readmission, r = -0.27 for LOS). Significant positive correlations were also observed between outcome QIs (r = 0.37 for mortality-readmission; r = 0.39 for mortality-LOS and readmission-LOS; r = 0.45 for mortality-complications; r = 0.34 for readmission-complications; 0.63 for complications-LOS). Significant correlations between quality domains observed in this study suggest that Donabedian's structure-process-outcome model is a valid model for evaluating trauma care. 
Trauma centers that perform well in terms of structure also tend to perform well in terms of clinical processes, which in turn has a favorable influence on patient outcomes. Prognostic study, level III.
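
The two calculations this abstract relies on can be sketched as follows. The sketch assumes that the "average sum of proportions" means the arithmetic mean of the per-indicator conformity proportions, which is one reading of the abstract and not a confirmed definition. Function names and sample values are hypothetical.

```python
import math


def composite_process_qi(conformity_proportions):
    """Mean of per-indicator conformity proportions (assumed definition)."""
    if not conformity_proportions:
        raise ValueError("at least one process QI is required")
    return sum(conformity_proportions) / len(conformity_proportions)


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for constant input")
    return sxy / math.sqrt(sxx * syy)


if __name__ == "__main__":
    # Hypothetical centre-level scores, for illustration only.
    structure = [0.60, 0.75, 0.80, 0.90]
    process = [0.55, 0.70, 0.65, 0.85]
    print(composite_process_qi([0.5, 1.0, 0.75]))
    print(pearson_r(structure, process))
```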

  8. Patient-based and clinical outcomes of implant telescopic attachment-retained mandibular overdentures: a 1-year longitudinal prospective study.

    PubMed

    Yunus, Norsiah; Saub, Roslan; Taiyeb Ali, Tara Bai; Salleh, Nosizana Mohd; Baig, Mirza Rustum

    2014-01-01

    The purpose of this study was to evaluate and compare Oral Health-Related Quality of Life (OHRQoL), denture satisfaction, and masticatory performance in edentulous patients provided with mandibular implant-supported overdentures (ISODs) retained with telescopic attachments and those of conventional complete dentures (CCDs). Peri-implant soft tissue changes were also evaluated at various intervals during a 1-year observation period. Participating patients received new CCDs and later received two mandibular interforaminal implants and had their mandibular CCDs converted into ISODs with telescopic attachments. Questionnaires were used to assess OHRQoL (Shortened Oral Health Impact Profile-14, Malaysian version) and denture satisfaction at different stages of treatment with CCDs and ISODs. Objective masticatory performance with the CCDs and ISODs was recorded with a mixing ability test. Evaluations were carried out at 3 months with the new CCDs, 3 months after mandibular ISOD provision, and 1 year after receiving the ISOD. Peri-implant parameters were additionally assessed at specific intervals during the treatment period. The data obtained were statistically analyzed and compared. In the 17 patients who completed the protocol, significant improvements were observed in OHRQoL and patient satisfaction when CCDs were modified to ISODs, after 3 months, and at 1 year. Significantly better mixing ability with the ISOD was noted, with the highest values observed at 1 year. Statistically insignificant differences were observed for all the peri-implant parameters, except for gingival recession, for which significant changes were observed 6 months after ISOD delivery (values had stabilized by 1 year). Telescopic crown attachment-retained mandibular ISODs improved OHRQoL, dental prosthesis satisfaction, and masticatory performance compared to CCDs. Peri-implant soft tissue response and implant stability were found to be favorable after 1 year.

  9. Investigation of the reproducibility and reliability of sagittal vertebral inclination measurements from MR images of the spine.

    PubMed

    Vrtovec, Tomaž; Pernuš, Franjo; Likar, Boštjan

    2014-10-01

    In this study, sagittal vertebral inclination (SVI) was systematically evaluated for 28 vertebrae (segments between T4 and L5) in magnetic resonance (MR) images of one normal and one scoliotic subject to compare the performance of manual and computerized measurements, and identify the most reproducible and reliable measurements. Manual measurements were performed by three observers, who identified on two occasions the distinctive anatomical landmarks required to evaluate SVI by six measurement methods, i.e. the superior tangents, inferior tangents, anterior tangents, posterior tangents, mid-endplate lines and mid-wall lines. Computerized measurements were performed by automatically evaluating SVI from the symmetry of vertebral anatomical structures in two-dimensional (2D) sagittal cross-sections and in three-dimensional (3D) volumetric images. The mid-wall lines and posterior tangents proved to be the manual measurements with the lowest intra-observer (standard deviation, SD, of 1.4° and 1.7°, respectively) and inter-observer variability (SD of 1.9° and 2.4°, respectively). The strongest inter-method agreement was found between the mid-wall lines and posterior tangents (SD of 2.0°). Computerized measurements in 2D and in 3D resulted in intra-observer (SD of 2.8° and 3.1°, respectively) and inter-observer variability (SD of 3.8° and 5.2°, respectively) that were comparable to those of the superior tangents (SD of 2.6° and 3.7°) and inferior tangents (SD of 3.2° and 4.5°), which represent standard Cobb angle measurements. It can be concluded that computerized measurements of SVI should be based on the inclination of vertebral body walls. Copyright © 2014 Elsevier Ltd. All rights reserved.

  10. Assessment of spontaneous resolution of idiopathic bone cavity.

    PubMed

    Battisti, Maíra de Paula Leite; Soares, Mariana Quirino Silveira; Rubira, Cássia Maria Fischer; Bullen, Izabel Regina Fischer Rubira de; Lauris, José Roberto Pereira; Damante, José Humberto

    2018-01-01

    Idiopathic Bone Cavity (IBC) or Simple Bone Cyst (SBC) is a non-epithelialized bone cavity that is either empty or contains serosanguinous fluid. Its pathogenesis remains unclear and is debated in the literature. The main treatment option is surgical exploration, although successful cases have been described in the literature in which only follow-up with clinical and radiographic evaluation was performed. Objective: This study aimed to assess the spontaneous resolution of idiopathic bone cavity untreated by surgery. Material and Methods: Twenty-one patients diagnosed with surgically untreated IBC were submitted to a follow-up protocol modified from Damante, Guerra, and Ferreira [5] (2002). A clinical and radiographic evaluation was performed in 13 patients (13/21), while eight patients (8/21) were only radiographically evaluated. Three observers evaluated the panoramic radiographs of the 21 patients, and the Kappa test was used to assess intra- and inter-examiner agreement. Inductive and descriptive statistics were applied to the results. Results: Only one patient had a positive response to palpation and percussion of the teeth in the cyst area. Most of the cysts evaluated were rated as 3 (lesion "in involution"), 4 (lesion "almost completely resolved"), or 5 ("completely resolved"). Conclusions: We observed progressive spontaneous resolution of IBC. Most cysts were found to be in the recovery process at different follow-up periods. Follow-up without surgery may be considered after a diagnosis based on the epidemiological, clinical, and radiographic features of the lesion.

  11. Triangulation of written assessments from patients, teachers and students: useful for students and teachers?

    PubMed

    Gran, Sarah Frandsen; Braend, Anja Maria; Lindbaek, Morten

    2010-01-01

    Many medical students in general practice clerkships experience lack of observation-based feedback. The StudentPEP project combined written feedback from patients, observing teachers and students. This study analyzes the perceived usefulness of triangulated written feedback. A total of 71 general practitioners and 79 medical students at the University of Oslo completed project evaluation forms after a 6-week clerkship. A principal component analysis was performed to find structures within the questionnaire. Regression analysis was performed regarding students' answers to whether StudentPEP was worthwhile. Free-text answers were analyzed qualitatively. Student and teacher responses were mixed within six subscales, with highest agreement on 'Teachers oral and written feedback' and 'Attitude to patient evaluation'. Fifty-four per cent of the students agreed that the triangulation gave concrete feedback on their weaknesses, and 59% valued the teachers' feedback provided. Two statements regarding the teacher's attitudes towards StudentPEP were significantly associated with the student's perception of worthwhileness. Qualitative analysis showed that patient evaluations were encouraging or distrusted. Some students thought that StudentPEP ensured observation and feedback. The patient evaluations increased the students' awareness of the patient perspective. A majority of the students considered the triangulated written feedback beneficial, although time-consuming. The teacher's attitudes strongly influenced how the students perceived the usefulness of StudentPEP.

  12. Systematic Evaluation of Molecular Networks for Discovery of Disease Genes. | Office of Cancer Genomics

    Cancer.gov

    Gene networks are rapidly growing in size and number, raising the question of which networks are most appropriate for particular applications. Here, we evaluate 21 human genome-wide interaction networks for their ability to recover 446 disease gene sets identified through literature curation, gene expression profiling, or genome-wide association studies. While all networks have some ability to recover disease genes, we observe a wide range of performance with STRING, ConsensusPathDB, and GIANT networks having the best performance overall.

  13. Using a Systems Engineering Initiative for Patient Safety to Evaluate a Hospital-wide Daily Chlorhexidine Bathing Intervention.

    PubMed

    Caya, Teresa; Musuuza, Jackson; Yanke, Eric; Schmitz, Michelle; Anderson, Brooke; Carayon, Pascale; Safdar, Nasia

    2015-01-01

    We undertook a systems engineering approach to evaluate housewide implementation of daily chlorhexidine bathing. We performed direct observations of the bathing process and conducted provider and patient surveys. The main outcome was compliance with bathing using a checklist. Fifty-seven percent of baths had full compliance with the chlorhexidine bathing protocol. Additional time was the main barrier. Institutions undertaking daily chlorhexidine bathing should perform a rigorous assessment of implementation to optimize the benefits of this intervention.

  14. Learning Molecular Structures in a Tangible Augmented Reality Environment

    ERIC Educational Resources Information Center

    Asai, Kikuo; Takase, Norio

    2011-01-01

    This article presents the characteristics of using a tangible table top environment produced by augmented reality (AR), aimed at improving the environment in which learners observe three-dimensional molecular structures. The authors perform two evaluation experiments. A performance test for a user interface demonstrates that learners with a…

  15. 40 CFR 63.7132 - What records must I keep?

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ...) Records of performance tests, performance evaluations, and opacity and VE observations as required in § 63... 40 Protection of Environment 13 2010-07-01 2010-07-01 false What records must I keep? 63.7132 Section 63.7132 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) AIR PROGRAMS...

  16. Accuracy evaluation of ClimGen weather generator and daily to hourly disaggregation methods in tropical conditions

    NASA Astrophysics Data System (ADS)

    Safeeq, Mohammad; Fares, Ali

    2011-12-01

    Daily and sub-daily weather data are often required for hydrological and environmental modeling. Various weather generator programs have been used to generate synthetic climate data where observed climate data are limited. In this study, a weather data generator, ClimGen, was evaluated for generating information on daily precipitation, temperature, and wind speed at four tropical watersheds located in Hawai`i, USA. We also evaluated different daily to sub-daily weather data disaggregation methods for precipitation, air temperature, dew point temperature, and wind speed at Mākaha watershed. The hydrologic significance values of the different disaggregation methods were evaluated using Distributed Hydrology Soil Vegetation Model. MuDRain and diurnal method performed well over uniform distribution in disaggregating daily precipitation. However, the diurnal method is more consistent if accurate estimates of hourly precipitation intensities are desired. All of the air temperature disaggregation methods performed reasonably well, but goodness-of-fit statistics were slightly better for sine curve model with 2 h lag. Cosine model performed better than random model in disaggregating daily wind speed. The largest differences in annual water balance were related to wind speed followed by precipitation and dew point temperature. Simulated hourly streamflow, evapotranspiration, and groundwater recharge were less sensitive to the method of disaggregating daily air temperature. ClimGen performed well in generating the minimum and maximum temperature and wind speed. However, for precipitation, it clearly underestimated the number of extreme rainfall events with an intensity of >100 mm/day in all four locations. ClimGen was unable to replicate the distribution of observed precipitation at three locations (Honolulu, Kahului, and Hilo). ClimGen was able to reproduce the distributions of observed minimum temperature at Kahului and wind speed at Kahului and Hilo. 
Although the evaluation of the weather data generation and disaggregation methods was concentrated in a few Hawaiian watersheds, the results presented can be applied to similar mountainous settings, as well as to any specific location where the aim is to further the site-specific performance evaluation of these tested models.

  17. Multiple model analysis with discriminatory data collection (MMA-DDC): A new method for improving measurement selection

    NASA Astrophysics Data System (ADS)

    Kikuchi, C.; Ferre, P. A.; Vrugt, J. A.

    2011-12-01

    Hydrologic models are developed, tested, and refined based on the ability of those models to explain available hydrologic data. The optimization of model performance based upon mismatch between model outputs and real world observations has been extensively studied. However, identification of plausible models is sensitive not only to the models themselves - including model structure and model parameters - but also to the location, timing, type, and number of observations used in model calibration. Therefore, careful selection of hydrologic observations has the potential to significantly improve the performance of hydrologic models. In this research, we seek to reduce prediction uncertainty through optimization of the data collection process. A new tool - multiple model analysis with discriminatory data collection (MMA-DDC) - was developed to address this challenge. In this approach, multiple hydrologic models are developed and treated as competing hypotheses. Potential new data are then evaluated on their ability to discriminate between competing hypotheses. MMA-DDC is well-suited for use in recursive mode, in which new observations are continuously used in the optimization of subsequent observations. This new approach was applied to a synthetic solute transport experiment, in which ranges of parameter values constitute the multiple hydrologic models, and model predictions are calculated using likelihood-weighted model averaging. MMA-DDC was used to determine the optimal location, timing, number, and type of new observations. From comparison with an exhaustive search of all possible observation sequences, we find that MMA-DDC consistently selects observations which lead to the highest reduction in model prediction uncertainty. We conclude that using MMA-DDC to evaluate potential observations may significantly improve the performance of hydrologic models while reducing the cost associated with collecting new data.
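
The general idea of treating models as competing hypotheses can be sketched as below. The abstract does not state the discrimination criterion, so the sketch substitutes a simple stand-in: each candidate observation is scored by the likelihood-weighted variance of the competing models' predictions there, and the candidate with the largest spread is chosen. All names and numbers are hypothetical, and this is not the MMA-DDC algorithm itself.

```python
def model_weights(likelihoods):
    """Normalize model likelihoods so that the weights sum to 1."""
    total = sum(likelihoods)
    if total <= 0:
        raise ValueError("likelihoods must have a positive sum")
    return [value / total for value in likelihoods]


def weighted_average(predictions, weights):
    """Likelihood-weighted model average of one predicted quantity."""
    return sum(w * p for w, p in zip(weights, predictions))


def discrimination_score(predictions, weights):
    """Weighted variance of model predictions (assumed stand-in criterion)."""
    mean = weighted_average(predictions, weights)
    return sum(w * (p - mean) ** 2 for w, p in zip(weights, predictions))


def most_discriminating(candidates, weights):
    """Pick the candidate observation on which the models disagree most."""
    return max(
        candidates,
        key=lambda name: discrimination_score(candidates[name], weights),
    )


if __name__ == "__main__":
    weights = model_weights([1.0, 1.0, 2.0])
    # Hypothetical predictions of three models at two candidate locations.
    candidates = {
        "well_A": [1.0, 1.0, 1.1],
        "well_B": [0.0, 2.0, 4.0],
    }
    print(most_discriminating(candidates, weights))
```

In a recursive mode, the weights would be updated with each new observation before the next candidate is scored.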

  18. Performance measurement for supply chain management and evaluation criteria determination for reverse supply chain management

    NASA Astrophysics Data System (ADS)

    Kongar, N. Elif

    2004-12-01

    Today, since customers are able to obtain similar-quality products for similar prices, the lead time has become the only preference criterion for most of the consumers. Therefore, it is crucial that the lead time, i.e., the time spent from the raw material phase till the manufactured good reaches the customer, is minimized. This issue can be investigated under the title of Supply Chain Management (SCM). An efficiently managed supply chain can lead to reduced response time for customers. To achieve this, continuous observation of supply chain efficiency, i.e., a constant performance evaluation of the current SCM is required. Widely used conventional performance measurement methods lack the ability to evaluate a SCM since the supply chain is a dynamic system that requires a more thorough and flexible performance measurement technique. Balanced Scorecard (BS) is an efficient tool for measuring the performance of dynamic systems and has a proven capability of providing the decision makers with the appropriate feedback data. In addition to SCM, a relatively new management field, namely reverse supply chain management (RSCM), also necessitates an appropriate evaluation approach. RSCM differs from SCM in many aspects, i.e., the criteria used for evaluation, the high level of uncertainty involved etc., not allowing the usage of identical evaluation techniques used for SCM. This study proposes a generic Balanced Scorecard to measure the performance of supply chain management while defining the appropriate performance measures for SCM. A scorecard prototype, ESCAPE, is presented to demonstrate the evaluation process.

  19. Evaluating the Ocean Component of the US Navy Earth System Model

    NASA Astrophysics Data System (ADS)

    Zamudio, L.

    2017-12-01

    Ocean currents, temperature, and salinity observations are used to evaluate the ocean component of the US Navy Earth System Model. The ocean and atmosphere components of the system are an eddy-resolving (1/12.5° equatorial resolution) version of the HYbrid Coordinate Ocean Model (HYCOM), and a T359L50 version of the NAVy Global Environmental Model (NAVGEM), respectively. The system was integrated in hindcast mode and the ocean results are compared against unassimilated observations, a stand-alone version of HYCOM, and the Generalized Digital Environment Model ocean climatology. The different observation types used in the system evaluation are: drifting buoys, temperature profiles, salinity profiles, and acoustical proxies (mixed layer depth, sonic layer depth, below layer gradient, and acoustical trapping). To evaluate the system's performance in each different metric, a scorecard is used to translate the system's errors into scores, which provide an indication of the system's skill in both space and time.

  20. Apices of maxillary premolars observed by swept source optical coherence tomography

    NASA Astrophysics Data System (ADS)

    Ebihara, Arata; Iino, Yoshiko; Yoshioka, Toshihiko; Hanada, Takahiro; Sunakawa, Mitsuhiro; Sumi, Yasunori; Suda, Hideaki

    2015-02-01

    Apicoectomy is performed for the management of apical periodontitis when orthograde root canal treatment is not possible or is ineffective. Prior to the surgery, cone beam computed tomography (CBCT) examination is often performed to evaluate the lesion and the adjacent tissues. During the surgical procedure, the root apex is resected and the resected surface is usually observed under dental operating microscope (DOM). However, it is difficult to evaluate the details and the subsurface structure of the root using CBCT and DOM. A new diagnostic system, swept source optical coherence tomography (SS-OCT), has been developed to observe the subsurface anatomical structure. The aim of this study was to observe resected apical root canals of human maxillary premolars using SS-OCT and compare the findings with those observed using CBCT and DOM. Six extracted human maxillary premolars were used. After microfocus computed tomography (Micro CT; for gold standard) and CBCT scanning of the root, 1 mm of the apex was cut perpendicular to the long axis of the tooth. Each resected surface was treated with EDTA, irrigated with saline solution, and stained with methylene blue dye. The resected surface was observed with DOM and SS-OCT. This sequence was repeated three times. The number of root canals was counted and statistically evaluated. There was no significant difference in the accuracy of detecting root canals among CBCT, DOM and SS-OCT (p > 0.05, Wilcoxon test). Because SS-OCT can be used in real time during surgery, it would be a useful tool for observing resected apical root canals.

  1. Evolution of short cognitive test performance in stroke patients with vascular cognitive impairment and vascular dementia: Baseline evaluation and follow-up.

    PubMed

    Custodio, Nilton; Montesinos, Rosa; Lira, David; Herrera-Perez, Eder; Bardales, Yadira; Valeriano-Lorenzo, Lucia

    2017-01-01

    There is limited evidence about the progression of cognitive performance during the post-stroke stage. We aimed to assess the evolution of cognitive performance in stroke patients without vascular cognitive impairment (VCI), patients with vascular mild cognitive impairment (MCI), and patients with vascular dementia (VD). A prospective cohort of stroke outpatients from two secondary medical centers in Lima, Peru was studied. We performed standardized evaluations at definitive diagnosis (baseline evaluation), and control follow-ups at 6 and 12 months, including a battery of short cognitive tests: Clinical Dementia Rating (CDR), Addenbrooke's Cognitive Examination (ACE), and INECO Frontal Screening (IFS). 152 outpatients completed the follow-up, showing a progressive increase in mean score on the CDR (0.34 to 0.46), contrary to the pattern observed on the ACE and IFS (78.18 to 76.48 and 23.63 to 22.24). The box plot for the CDR test showed that VCI patients had progressive worsening (0.79 to 0.16). Conversely, this trend was not observed in subjects without VCI. The box plot for the ACE and IFS showed that, for the majority of the differentiated stroke types, both non-VCI and VCI patients had progressive worsening. According to both ACE and IFS results during a 1-year follow-up, the cognitive performance of stroke patients worsened, a trend which was particularly consistent in infarction-type stroke patients.

  2. Cognitive-evaluative features of childhood social anxiety in a performance task.

    PubMed

    Tuschen-Caffier, Brunna; Kühl, Sigrid; Bender, Caroline

    2011-06-01

    Using an experimental design, we analysed differences in the occurrence of cognitive-evaluative distortions and performance deficits across children with social anxiety disorder, with subclinical anxiety and without any anxiety symptoms. Twenty-one children with full syndrome social phobia, 18 children with partial syndrome social phobia and 20 children without any symptoms of social phobia were compared with respect to their degree of anxiety, negative thinking and task performance during two social-evaluative tasks. In addition, self-ratings of task performance, performance estimations for other children and objective behavioural ratings by two independent observers were obtained. Children with social anxiety disorder and subclinical social anxiety showed higher degrees of experienced anxiety and negative thinking than healthy control children. There was no group difference in respect to actual task performance. Findings are discussed with regard to the continuum assumption of childhood social anxiety disorder and the need of well-adapted early interventions. Copyright © 2010. Published by Elsevier Ltd.

  3. Evaluation of Student Performance through a Multidimensional Finite Mixture IRT Model.

    PubMed

    Bacci, Silvia; Bartolucci, Francesco; Grilli, Leonardo; Rampichini, Carla

    2017-01-01

    In the Italian academic system, a student can enroll for an exam immediately after the end of the teaching period or can postpone it; in this second case the exam result is missing. We propose an approach for the evaluation of a student performance throughout the course of study, accounting also for nonattempted exams. The approach is based on an item response theory model that includes two discrete latent variables representing student performance and priority in selecting the exams to take. We explicitly account for nonignorable missing observations as the indicators of attempted exams also contribute to measure the performance (within-item multidimensionality). The model also allows for individual covariates in its structural part.

  4. Evaluation of integrated assessment model hindcast experiments: a case study of the GCAM 3.0 land use module

    NASA Astrophysics Data System (ADS)

    Snyder, Abigail C.; Link, Robert P.; Calvin, Katherine V.

    2017-11-01

    Hindcasting experiments (conducting a model forecast for a time period in which observational data are available) are being undertaken increasingly often by the integrated assessment model (IAM) community, across many scales of models. When they are undertaken, the results are often evaluated using global aggregates or otherwise highly aggregated skill scores that mask deficiencies. We select a set of deviation-based measures that can be applied on different spatial scales (regional versus global) to make evaluating the large number of variable-region combinations in IAMs more tractable. We also identify performance benchmarks for these measures, based on the statistics of the observational dataset, that allow a model to be evaluated in absolute terms rather than relative to the performance of other models at similar tasks. An ideal evaluation method for hindcast experiments in IAMs would feature both absolute measures for evaluation of a single experiment for a single model and relative measures to compare the results of multiple experiments for a single model or the same experiment repeated across multiple models, such as in community intercomparison studies. The performance benchmarks highlight the use of this scheme for model evaluation in absolute terms, providing information about the reasons a model may perform poorly on a given measure and therefore identifying opportunities for improvement. To demonstrate the use of and types of results possible with the evaluation method, the measures are applied to the results of a past hindcast experiment focusing on land allocation in the Global Change Assessment Model (GCAM) version 3.0. The question of how to more holistically evaluate models as complex as IAMs is an area for future research. We find quantitative evidence that global aggregates alone are not sufficient for evaluating IAMs that require global supply to equal global demand at each time period, such as GCAM. 
The results of this work indicate it is unlikely that a single evaluation measure for all variables in an IAM exists, and therefore sector-by-sector evaluation may be necessary.
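
The abstract does not name its deviation-based measures, so the following is only one plausible example of an absolute benchmark derived from the observations themselves: the RMSE normalized by the standard deviation of the observed series. Under this assumed measure, a value of 1 matches a trivial model that always predicts the observed mean, and values below 1 indicate added skill. The function names and data are hypothetical.

```python
import math


def rmse(model, observed):
    """Root-mean-square error between model output and observations."""
    if len(model) != len(observed) or not observed:
        raise ValueError("need two non-empty series of equal length")
    squared = [(m - o) ** 2 for m, o in zip(model, observed)]
    return math.sqrt(sum(squared) / len(squared))


def normalized_rmse(model, observed):
    """RMSE divided by the population standard deviation of observations."""
    mean_obs = sum(observed) / len(observed)
    variance = sum((o - mean_obs) ** 2 for o in observed) / len(observed)
    if variance == 0:
        raise ValueError("observations are constant; benchmark undefined")
    return rmse(model, observed) / math.sqrt(variance)


if __name__ == "__main__":
    # Hypothetical regional land-allocation series, for illustration only.
    observed = [10.0, 12.0, 15.0, 19.0]
    hindcast = [11.0, 12.5, 14.0, 18.0]
    print(normalized_rmse(hindcast, observed))
```

Because the benchmark depends only on the observational dataset, the same measure can be computed per region and per variable without reference to other models.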

  5. Evaluation of internal noise methods for Hotelling observers

    NASA Astrophysics Data System (ADS)

    Zhang, Yani; Pham, Binh T.; Eckstein, Miguel P.

    2005-04-01

    Including internal noise in computer model observers to degrade model observer performance to human levels is a common method to allow for quantitative comparisons of human and model performance. In this paper, we studied two different types of methods for injecting internal noise into Hotelling model observers. The first method adds internal noise to the output of the individual channels: a) independent non-uniform channel noise, b) independent uniform channel noise. The second method adds internal noise to the decision variable arising from the combination of channel responses: a) internal noise standard deviation proportional to the decision variable's standard deviation due to the external noise, b) internal noise standard deviation proportional to the decision variable's variance caused by the external noise. We tested the square window Hotelling observer (HO), channelized Hotelling observer (CHO), and Laguerre-Gauss Hotelling observer (LGHO). The studied task was detection of a filling defect of varying size/shape in one of four simulated arterial segment locations with real x-ray angiography backgrounds. Results show that the internal noise method that leads to the best prediction of human performance differs across the studied model observers. The CHO model best predicts human observer performance with the channel internal noise. The HO and LGHO best predict human observer performance with the decision variable internal noise. These results might help explain why previous studies have found different results on the ability of each Hotelling model to predict human performance. Finally, the present results might guide researchers with the choice of method to include internal noise into their Hotelling models.
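
The decision-variable variant (a) described in this abstract can be illustrated with a short calculation. The sketch assumes a two-class task with equal decision-variable variance in both classes, and zero-mean Gaussian internal noise that is independent of the external noise, so the variances add. The proportionality constant alpha and the function names are hypothetical.

```python
import math


def dprime(mean_signal, mean_noise, sd_external):
    """Detectability index without internal noise."""
    return (mean_signal - mean_noise) / sd_external


def dprime_with_internal_noise(mean_signal, mean_noise, sd_external, alpha):
    """Detectability when internal noise has sd = alpha * sd_external.

    Independent noise sources add in variance, so the total standard
    deviation is sd_external * sqrt(1 + alpha**2).
    """
    sd_internal = alpha * sd_external
    sd_total = math.sqrt(sd_external ** 2 + sd_internal ** 2)
    return (mean_signal - mean_noise) / sd_total


if __name__ == "__main__":
    # Hypothetical decision-variable statistics, for illustration only.
    for alpha in (0.0, 0.5, 1.0, 2.0):
        print(alpha, dprime_with_internal_noise(2.0, 0.0, 1.0, alpha))
```

In practice alpha would be tuned until the model's detectability matches the measured human performance.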

  6. Triadic instruction of chained food preparation responses: acquisition and observational learning.

    PubMed Central

    Griffen, A K; Wolery, M; Schuster, J W

    1992-01-01

    This research examined whether constant time delay would be effective in teaching students with moderate mental retardation in triads to perform chained tasks and whether observational learning would occur. Three chained snack preparation tasks were identified, and each student was directly taught one task. The other 2 students observed the instruction. The instructed student told the observers to watch and to turn pages of a pictorial recipe book. The teacher provided frequent praise to the instructed student based on performance and to the observers for watching the instruction and turning pages. A multiple probe design across students and tasks was used to evaluate the instruction. The results indicated that each student learned the skill he or she was taught directly, and the observers learned nearly all of the steps of the chains they observed. The implications for classroom instruction and future research in observational learning are discussed. PMID:1533856

  7. Empirical performance of the self-controlled case series design: lessons for developing a risk identification and analysis system.

    PubMed

    Suchard, Marc A; Zorych, Ivan; Simpson, Shawn E; Schuemie, Martijn J; Ryan, Patrick B; Madigan, David

    2013-10-01

    The self-controlled case series (SCCS) offers potential as a statistical method for risk identification involving medical products from large-scale observational healthcare data. However, analytic design choices remain in encoding the longitudinal health records into the SCCS framework, and its risk identification performance across real-world databases is unknown. The objective was to evaluate the performance of SCCS and its design choices as a tool for risk identification in observational healthcare data. We examined the risk identification performance of SCCS across five design choices using 399 drug-health outcome pairs in five real observational databases (four administrative claims and one electronic health records). In these databases, the pairs involve 165 positive controls and 234 negative controls. We also consider several synthetic databases with known relative risks between drug-outcome pairs. We evaluate risk identification performance through estimating the area under the receiver operating characteristic curve (AUC) and bias and coverage probability in the synthetic examples. The SCCS achieves strong predictive performance. Twelve of the twenty health outcome-database scenarios return AUCs >0.75 across all drugs. Including all adverse events instead of just the first per patient and applying a multivariate adjustment for concomitant drug use are the most important design choices. However, the SCCS as applied here returns relative risk point-estimates biased towards the null value of 1 with low coverage probability. The SCCS recently extended to apply a multivariate adjustment for concomitant drug use offers promise as a statistical tool for risk identification in large-scale observational healthcare databases. Poor estimator calibration dampens enthusiasm, but ongoing work should correct this shortcoming.
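
    The AUC used in this abstract can be read as the probability that a randomly chosen positive control receives a higher score than a randomly chosen negative control. The sketch below computes it by direct pairwise comparison. The function name and the example scores are hypothetical; the abstract does not say which quantity was used as the score or how the authors computed the AUC.

```python
def auc_from_controls(positive_scores, negative_scores):
    """Area under the ROC curve via pairwise comparison (ties count one half)."""
    if not positive_scores or not negative_scores:
        raise ValueError("both groups need at least one score")
    wins = 0.0
    for pos in positive_scores:
        for neg in negative_scores:
            if pos > neg:
                wins += 1.0
            elif pos == neg:
                wins += 0.5
    return wins / (len(positive_scores) * len(negative_scores))


if __name__ == "__main__":
    # Hypothetical effect estimates for drug-outcome pairs.
    positives = [0.9, 0.4]
    negatives = [0.5, 0.1]
    print(auc_from_controls(positives, negatives))
```

    An AUC of 0.5 corresponds to a method that cannot separate positive from negative controls, and 1.0 to perfect separation.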

  8. CVD2014-A Database for Evaluating No-Reference Video Quality Assessment Algorithms.

    PubMed

    Nuutinen, Mikko; Virtanen, Toni; Vaahteranoksa, Mikko; Vuori, Tero; Oittinen, Pirkko; Hakkinen, Jukka

    2016-07-01

    In this paper, we present a new video database: CVD2014-Camera Video Database. In contrast to previous video databases, this database uses real cameras rather than introducing distortions via post-processing, which results in a complex distortion space in regard to the video acquisition process. CVD2014 contains a total of 234 videos that are recorded using 78 different cameras. Moreover, this database contains the observer-specific quality evaluation scores rather than only providing mean opinion scores. We have also collected open-ended quality descriptions that are provided by the observers. These descriptions were used to define the quality dimensions for the videos in CVD2014. The dimensions included sharpness, graininess, color balance, darkness, and jerkiness. At the end of this paper, a performance study of image and video quality algorithms for predicting the subjective video quality is reported. For this performance study, we proposed a new performance measure that accounts for observer variance. The performance study revealed that there is room for improvement regarding the video quality assessment algorithms. The CVD2014 video database has been made publicly available for the research community. All video sequences and corresponding subjective ratings can be obtained from the CVD2014 project page (http://www.helsinki.fi/psychology/groups/visualcognition/).

  9. Regime-Based Evaluation of Cloudiness in CMIP5 Models

    NASA Technical Reports Server (NTRS)

    Jin, Daeho; Oraiopoulos, Lazaros; Lee, Dong Min

    2016-01-01

    The concept of Cloud Regimes (CRs) is used to develop a framework for evaluating the cloudiness of 12 fifth Coupled Model Intercomparison Project (CMIP5) models. Reference CRs come from existing global International Satellite Cloud Climatology Project (ISCCP) weather states. The evaluation is made possible by the implementation in several CMIP5 models of the ISCCP simulator generating for each gridcell daily joint histograms of cloud optical thickness and cloud top pressure. Model performance is assessed with several metrics such as CR global cloud fraction (CF), CR relative frequency of occurrence (RFO), their product (long-term average total cloud amount [TCA]), cross-correlations of CR RFO maps, and a metric of resemblance between model and ISCCP CRs. In terms of CR global RFO, arguably the most fundamental metric, the models perform unsatisfactorily overall, except for CRs representing thick storm clouds. Because model CR CF is internally constrained by our method, RFO discrepancies yield also substantial TCA errors. Our findings support previous studies showing that CMIP5 models underestimate cloudiness. The multi-model mean performs well in matching observed RFO maps for many CRs, but is not the best for this or other metrics. When overall performance across all CRs is assessed, some models, despite their shortcomings, apparently outperform Moderate Resolution Imaging Spectroradiometer (MODIS) cloud observations evaluated against ISCCP as if they were another model output. Lastly, cloud simulation performance is contrasted with each model's equilibrium climate sensitivity (ECS) in order to gain insight on whether good cloud simulation pairs with particular values of this parameter.

  10. Benefits of cardiac sonography performed by a non-expert sonographer in patients with non-traumatic cardiopulmonary arrest.

    PubMed

    Zengin, Suat; Yavuz, Erdal; Al, Behçet; Cindoruk, Şener; Altunbaş, Gökhan; Gümüşboğa, Hasan; Yıldırım, Cuma

    2016-05-01

    The purpose of this study was to evaluate a rapid cardiac ultrasound assessment performed by trained non-expert sonographers integrated into the advanced cardiac life support (ACLS). This study was prospectively performed in 179 patients (104 males and 75 females) who underwent cardiopulmonary resuscitation (CPR) in an emergency department (ED) during two calendar years (2013 and 2014). Two senior doctors, who had received emergency cardiac ultrasonography training, performed cardiac ultrasound through the apical, subxiphoid, or parasternal windows. Ultrasound evaluation and pulse controls were performed simultaneously. SPSS 18.0 was used for statistical analysis. A total of 63.7% (114) of the cardiopulmonary arrest incidents occurred out of the hospital. Only 13 patients had a femoral pulse during the initial evaluation, while 166 showed no femoral pulse. Initial monitoring showed a regular rhythm in 53 patients, ventricular fibrillation in 18 patients, and no rhythms in 108 patients. The first evaluation with ultrasound detected an effective heart rate in 26 patients and ventricular fibrillation in 14 patients, while no effective heart rate was observed in 139 patients. In addition, ultrasound revealed pericardial tamponade in seven patients and right ventricular enlargement in four cases. Global hypokinesia was detected in four patients and hypovolemia was observed in another four patients. The use of real-time ultrasonography during resuscitation with a real-time femoral pulse check can help distinguish PEA-type arrest, ascertain the cause of the arrest, infer a suitable treatment, and optimize medical management decisions regarding CPR termination. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  11. Do prehospital discharge pacemaker checks provide any additional clinical benefit?

    PubMed

    Wheelan, Kevin R; Legge, Darlene M; Sakowski, Brent C; Bruce, Susan S; Roberts, David C; Johnston, L Murphy; Moore, B Jane; Beveridge, Thomas P; Wells, Peter J; Vallabahn, Ravi; Donsky, Michael S; Franklin, Jay O

    2005-08-01

    We performed a retrospective analysis of 250 records of consecutive, newly implanted, pacemaker patients from a single center to determine the rate of postimplant complications and observations discovered before and during the prehospital discharge evaluation. No observations occurred in 246 of 250 patients (98.4%) (1-sided 95% confidence interval 96.4%). Of the 250 patients, 4 had observations that were discovered at the prehospital discharge check and required reprogramming to increase the sensitivity safety margin (3 atrial and 1 ventricular). We documented only 1 complication that was discovered before the predischarge evaluation through telemetry and resulted in an atrial lead revision.

  12. "New Space Explosion" and Earth Observing System Capabilities

    NASA Astrophysics Data System (ADS)

    Stensaas, G. L.; Casey, K.; Snyder, G. I.; Christopherson, J.

    2017-12-01

    This presentation will describe recent developments in spaceborne remote sensing, including an introduction to some of the increasing number of new firms entering the market, along with new systems and successes from established players, as well as industry consolidation and reactions to these developments from communities of users. The information in this presentation will include inputs from the results of the Joint Agency Commercial Imagery Evaluation (JACIE) 2017 Civil Commercial Imagery Evaluation Workshop and the use of the US Geological Survey's Requirements Capabilities and Analysis for Earth Observation (RCA-EO) centralized Earth observing systems database and how system performance parameters are used with user science applications requirements.

  13. Distributed Space Mission Design for Earth Observation Using Model-Based Performance Evaluation

    NASA Technical Reports Server (NTRS)

    Nag, Sreeja; LeMoigne-Stewart, Jacqueline; Cervantes, Ben; DeWeck, Oliver

    2015-01-01

    Distributed Space Missions (DSMs) are gaining momentum in their application to earth observation missions owing to their unique ability to increase observation sampling in multiple dimensions. DSM design is a complex problem with many design variables, multiple objectives determining performance and cost and emergent, often unexpected, behaviors. There are very few open-access tools available to explore the tradespace of variables, minimize cost and maximize performance for pre-defined science goals, and therefore select the most optimal design. This paper presents a software tool that can generate multiple DSM architectures based on pre-defined design variable ranges and size those architectures in terms of predefined science and cost metrics. The tool will help a user select Pareto optimal DSM designs based on design of experiments techniques. The tool will be applied to some earth observation examples to demonstrate its applicability in making some key decisions between different performance metrics and cost metrics early in the design lifecycle.

  14. Teaching three-dimensional surgical concepts of inguinal hernia in a time-effective manner using a two-dimensional paper-cut.

    PubMed

    Mann, B D; Seidman, A; Haley, T; Sachdeva, A K

    1997-06-01

    Because inguinal hernia repair is difficult for third-year students to comprehend, a 2-dimensional paper-cut was developed to teach the concepts of inguinal hernia in a time-effective manner before students' observation of herniorrhaphy in the operating room. Using Adobe Illustrator 5.5 for Macintosh, a 2-dimensional inexpensively printed paper-cut was created to allow students to perform their own simulated hernia repair before observing surgery. The exercise was performed using a no. 15 scalpel or an iris scissors and was evaluated by comparing 10-question pre-tests and post-tests. Seventy-five students performed the exercise, most completing it within 15 minutes. The mean pre-test score was 7.4/10 and the mean post-test score was 9.1/10. Students performing the paper-cut reported better understanding when observing actual herniorrhaphy. A 2-dimensional paper-cut ("surgical origami") may be a time-effective method to prepare students for the observation of hernia repair.

  15. Evaluation of Integrated Multi-satellitE Retrievals for GPM with All Weather Gauge Observations over CONUS

    NASA Astrophysics Data System (ADS)

    Chen, S.; Qi, Y.; Hu, B.; Hu, J.; Hong, Y.

    2015-12-01

    The Global Precipitation Measurement (GPM) mission is composed of an international network of satellites that provide the next-generation global observations of rain and snow. Integrated Multi-satellitE Retrievals for GPM (IMERG) is a state-of-the-art precipitation product with a high spatio-temporal resolution of 0.1°/30 min. IMERG unifies precipitation measurements from a constellation of research and operational satellites with the core sensors dual-frequency precipitation radar (DPR) and microwave imager (GMI) on board a "Core" satellite. Additionally, IMERG blends the advantages of the currently most popular satellite-based quantitative precipitation estimate (QPE) algorithms, i.e. TRMM Multi-satellite Precipitation Analysis (TMPA), Climate Prediction Center morphing technique (CMORPH), and Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks-Cloud Classification System (PERSIANN-CCS). The real-time and post real-time IMERG products are now available online at https://stormpps.gsfc.nasa.gov/storm. In this study, the final run post real-time IMERG is evaluated with all-weather manual gauge observations over CONUS from June 2014 through May 2015. Relative Bias (RB), Root-Mean-Squared Error (RMSE), Correlation Coefficient (CC), Probability Of Detection (POD), False Alarm Ratio (FAR), and Critical Success Index (CSI) are used to quantify the performance of IMERG. The performance of IMERG in estimating snowfall precipitation is highlighted in the study. This timely evaluation with all-weather gauge observations is expected to offer insights into the performance of IMERG and thus provide useful feedback to the algorithm developers as well as the GPM data users.
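
    The six scores named in this abstract can be sketched in Python using their commonly used definitions. The abstract does not state the exact formulas or the rain/no-rain threshold the authors applied, so the definitions, the function names, and the `threshold` value below are assumptions for illustration only.

```python
import math


def continuous_scores(estimates, observations):
    """Return (RB, RMSE, CC) for paired satellite estimates and gauge values.

    Assumes equal-length, non-constant series with a non-zero observed total.
    """
    n = len(observations)
    pairs = list(zip(estimates, observations))
    # Relative bias: total error relative to the observed total.
    rb = sum(e - o for e, o in pairs) / sum(observations)
    rmse = math.sqrt(sum((e - o) ** 2 for e, o in pairs) / n)
    mean_e = sum(estimates) / n
    mean_o = sum(observations) / n
    cov = sum((e - mean_e) * (o - mean_o) for e, o in pairs)
    var_e = sum((e - mean_e) ** 2 for e in estimates)
    var_o = sum((o - mean_o) ** 2 for o in observations)
    cc = cov / math.sqrt(var_e * var_o)  # Pearson correlation coefficient
    return rb, rmse, cc


def categorical_scores(estimates, observations, threshold=0.1):
    """Return (POD, FAR, CSI); `threshold` is a hypothetical rain/no-rain cut-off.

    Assumes at least one hit, so that no denominator is zero.
    """
    hits = misses = false_alarms = 0
    for e, o in zip(estimates, observations):
        if e >= threshold and o >= threshold:
            hits += 1
        elif o >= threshold:
            misses += 1
        elif e >= threshold:
            false_alarms += 1
    pod = hits / (hits + misses)
    far = false_alarms / (hits + false_alarms)
    csi = hits / (hits + misses + false_alarms)
    return pod, far, csi


if __name__ == "__main__":
    est = [0.0, 1.0, 2.0, 0.5]  # hypothetical satellite estimates (mm)
    obs = [0.0, 2.0, 2.0, 0.0]  # hypothetical gauge observations (mm)
    print(continuous_scores(est, obs))
    print(categorical_scores(est, obs))
```

    RB, RMSE and CC describe how well the amounts agree, whereas POD, FAR and CSI describe only whether precipitation events were detected.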

  16. Evaluation of the channelized Hotelling observer with an internal-noise model in a train-test paradigm for cardiac SPECT defect detection.

    PubMed

    Brankov, Jovan G

    2013-10-21

    The channelized Hotelling observer (CHO) has become a widely used approach for evaluating medical image quality, acting as a surrogate for human observers in early-stage research on assessment and optimization of imaging devices and algorithms. The CHO is typically used to measure lesion detectability. Its popularity stems from experiments showing that the CHO's detection performance can correlate well with that of human observers. In some cases, CHO performance overestimates human performance; to counteract this effect, an internal-noise model is introduced, which allows the CHO to be tuned to match human-observer performance. Typically, this tuning is achieved using example data obtained from human observers. We argue that this internal-noise tuning step is essentially a model training exercise; therefore, just as in supervised learning, it is essential to test the CHO with an internal-noise model on a set of data that is distinct from that used to tune (train) the model. Furthermore, we argue that, if the CHO is to provide useful insights about new imaging algorithms or devices, the test data should reflect such potential differences from the training data; it is not sufficient simply to use new noise realizations of the same imaging method. Motivated by these considerations, the novelty of this paper is the use of new model selection criteria to evaluate ten established internal-noise models, utilizing four different channel models, in a train-test approach. Though not the focus of the paper, a new internal-noise model is also proposed that outperformed the ten established models in the cases tested. The results, using cardiac perfusion SPECT data, show that the proposed train-test approach is necessary, as judged by the newly proposed model selection criteria, to avoid spurious conclusions. The results also demonstrate that, in some models, the optimal internal-noise parameter is very sensitive to the choice of training data; therefore, these models are prone to overfitting, and will not likely generalize well to new data. In addition, we present an alternative interpretation of the CHO as a penalized linear regression wherein the penalization term is defined by the internal-noise model.

  17. Evaluation of radiographic interpretation competence of veterinary students in Finland.

    PubMed

    Koskinen, Heli I; Snellman, Marjatta

    2009-01-01

    In the evaluation of the clinical competence of veterinary students, many different definitions and methods are approved. Due to the increasing discussion of the quality of outcomes produced by newly graduated veterinarians, methods for the evaluation of clinical competencies should also be evaluated. In this study, this was done by comparing two qualitative evaluation schemes: the well-known structure of observed learning outcome (SOLO) taxonomy and a modification of this taxonomy. A case-based final radiologic examination was selected and the investigation was performed by classifying students' outcomes. These classes were finally put next to original (quantitative) scores and the statistical calculations were initiated. Significant correlations between taxonomies (0.53) and the modified taxonomy and original scores (0.66) were found and some qualitative similarities between evaluation methods were observed. In addition, some supplements were recommended for the structure of evaluation schemes, especially for the structure of the modified SOLO taxonomy.

  18. Doctor performance assessment in daily practise: does it help doctors or not? A systematic review.

    PubMed

    Overeem, Karlijn; Faber, Marjan J; Arah, Onyebuchi A; Elwyn, Glyn; Lombarts, Kiki M J M H; Wollersheim, Hub C; Grol, Richard P T M

    2007-11-01

    Continuous assessment of individual performance of doctors is crucial for life-long learning and quality of care. Policy-makers and health educators should have good insights into the strengths and weaknesses of the methods available. The aim of this study was to systematically evaluate the feasibility of methods, the psychometric properties of instruments that are especially important for summative assessments, and the effectiveness of methods serving formative assessments used in routine practise to assess the performance of individual doctors. We searched the MEDLINE (1966-January 2006), PsychINFO (1972-January 2006), CINAHL (1982-January 2006), EMBASE (1980-January 2006) and Cochrane (1966-2006) databases for English language articles, and supplemented this with a hand-search of reference lists of relevant studies and bibliographies of review articles. Studies that aimed to assess the performance of individual doctors in routine practise were included. Two reviewers independently abstracted data regarding study design, setting and findings related to reliability, validity, feasibility and effectiveness using a standard data abstraction form. A total of 64 articles met our inclusion criteria. We observed 6 different methods of evaluating performance: simulated patients; video observation; direct observation; peer assessment; audit of medical records, and portfolio or appraisal. Peer assessment is the most feasible method in terms of costs and time. Little psychometric assessment of the instruments has been undertaken so far. Effectiveness of formative assessments is poorly studied. All systems but 2 rely on a single method to assess performance. There is substantial potential to assess performance of doctors in routine practise. The long-term impact and effectiveness of formative performance assessments on education and quality of care remain largely unknown. Future research designs need to pay special attention to unmasking effectiveness in terms of performance improvement.

  19. Teaching childbirth with high-fidelity simulation. Is it better observing the scenario during the briefing session?

    PubMed

    Cuerva, Marcos J; Piñel, Carlos S; Martin, Lourdes; Espinosa, Jose A; Corral, Octavio J; Mendoza, Nicolás

    2018-02-12

    The design of optimal courses for obstetric undergraduate teaching is a relevant question. This study evaluates two different designs of simulator-based learning activity on childbirth with regard to respect for the patient, obstetric manoeuvres, interpretation of cardiotocography tracings (CTG) and infection prevention. This randomised experimental study consisted of two groups of undergraduate students who performed two simulator-based learning activities on childbirth that differed in the content of their briefing sessions. For the first group, the briefing session included the observation of a scenario properly performed by the teachers according to Spanish clinical practice guidelines on care in normal childbirth, whereas for the second group the briefing session did not include this observation, and the students observed the properly performed scenario only after the simulation process. The group that observed a properly performed scenario after the simulation obtained worse grades during the simulation, but better grades during the debriefing and evaluation. Simulator use in childbirth may be more fruitful when the medical students observe correct performance at the completion of the scenario compared to that at the start of the scenario. Impact statement: What is already known on this subject? There is a scarcity of literature about the design of optimal high-fidelity simulation training in childbirth. It is known that preparing simulator-based learning activities is a complex process. Simulator-based learning includes the following steps: briefing, simulation, debriefing and evaluation. The most important part of high-fidelity simulations is the debriefing. A good briefing and simulation are of high relevance in order to have a fruitful debriefing session. What do the results of this study add? Our study describes a full simulator-based learning activity on childbirth that can be reproduced in similar facilities. The findings of this study add that high-fidelity simulation training in childbirth is favoured by a short briefing session and an abrupt start to the scenario, rather than a long briefing session that includes direct instruction in the scenario. What are the implications of these findings for clinical practice and/or further research? The findings of this study reveal what to include in the briefing of simulator-based learning activities on childbirth. These findings have implications in medical teaching and in medical practice.

  20. A Reflective Learning Framework to Evaluate CME Effects on Practice Reflection

    ERIC Educational Resources Information Center

    Leung, Kit H.; Pluye, Pierre; Grad, Roland; Weston, Cynthia

    2010-01-01

    Introduction: The importance of reflective practice is recognized by the adoption of a reflective learning model in continuing medical education (CME), but little is known about how to evaluate reflective learning in CME. Reflective learning seldom is defined in terms of specific cognitive processes or observable performances. Competency-based…

  1. EVALUATING THE PERFORMANCE OF REGIONAL-SCALE PHOTOCHEMICAL MODELING SYSTEMS: PART I--METEOROLOGICAL PREDICTIONS. (R825260)

    EPA Science Inventory

    In this study, the concept of scale analysis is applied to evaluate two state-of-science meteorological models, namely MM5 and RAMS3b, currently being used to drive regional-scale air quality models. To this end, seasonal time series of observations and predictions for temperatur...

  2. Measuring and Promoting Inter-Rater Agreement of Teacher and Principal Performance Ratings

    ERIC Educational Resources Information Center

    Graham, Matthew; Milanowski, Anthony; Miller, Jackson

    2012-01-01

    As states, districts, and schools transition toward more rigorous educator evaluation systems, they are placing additional weight on judgments about educator practice. Since teacher and principal observation ratings inherently rely on evaluators' professional judgment, there is always a question of how much the ratings depend on the particular…

  3. Evaluating Rater Accuracy in Rater-Mediated Assessments Using an Unfolding Model

    ERIC Educational Resources Information Center

    Wang, Jue; Engelhard, George, Jr.; Wolfe, Edward W.

    2016-01-01

    The number of performance assessments continues to increase around the world, and it is important to explore new methods for evaluating the quality of ratings obtained from raters. This study describes an unfolding model for examining rater accuracy. Accuracy is defined as the difference between observed and expert ratings. Dichotomous accuracy…

  4. An Overview on Evaluation of E-Learning/Training Response Time Considering Artificial Neural Networks Modeling

    ERIC Educational Resources Information Center

    Mustafa, Hassan M. H.; Tourkia, Fadhel Ben; Ramadan, Ramadan Mohamed

    2017-01-01

    The objective of this piece of research is to interpret and investigate systematically an observed brain functional phenomenon which is associated with proceeding of e-learning processes. More specifically, this work addresses an interesting and challenging educational issue concerned with dynamical evaluation of elearning performance considering…

  5. Dynamic Evaluation of Two Decades of WRF-CMAQ Ozone Simulations over the Contiguous United States (2017 MAC-MAQ Conference Presentation)

    EPA Science Inventory

    Dynamic evaluation of two decades of ozone simulations performed with the fully coupled Weather Research and Forecasting (WRF)–Community Multi-scale Air Quality (CMAQ) model over the contiguous United States is conducted to assess how well the changes in observed ozone air ...

  6. Evaluating the Performance of Repeated Measures Approaches in Replicating Experimental Benchmark Results

    ERIC Educational Resources Information Center

    McConeghy, Kevin; Wing, Coady; Wong, Vivian C.

    2015-01-01

    Randomized experiments have long been established as the gold standard for addressing causal questions. However, experiments are not always feasible or desired, so observational methods are also needed. When multiple observations on the same variable are available, a repeated measures design may be used to assess whether a treatment administered…

  7. A fundamental study of cryoablation on normal bone: diagnostic imaging and histopathology.

    PubMed

    Yoshimoto, Yuta; Azuma, Kazuo; Miya, Atsushi; Makino, Eiichi; Nakamoto, Hidekazu; Abe, Nobutaka; Kaburagi, Masashi; Ueda, Hisaki; Kuroda, Kohei; Tsuka, Takeshi; Sugiyama, Akihiko; Imagawa, Tomohiro; Murahata, Yusuke; Itoh, Norihiko; Osaki, Tomohiro; Shimizu, Tadashi; Okamoto, Yoshiharu

    2014-10-01

    Cryoablation is a minimally invasive cancer treatment. In this study, the effects of cryoablation on normal rabbit bone were evaluated using imaging and histopathological examinations. Cryoablation was performed using a Cryo-Hit (Galil Medical, Yokneam, Israel). Under anesthesia, one cryoablation needle was inserted at the center of the femur (day 0). To create an ice ball (2 x 3 cm), two 10-min freeze cycles were performed, separated by a 5-min thaw cycle. During cryoablation, changes in the bone and regional tissue were monitored using magnetic resonance imaging (MRI). MRI scans, computed tomography (CT) scans, and collections from the femur (for histopathological evaluation) were performed on days 7, 14, 28, and 56. Regarding the rabbits' general condition, we did not observe lameness, decreased appetite, or any other side effects during the experimental periods. Histopathological evaluations of the femur were performed using hematoxylin and eosin staining. MRI indicated inflammation around the ice ball on day 7. Subsequently, the area of inflammation gradually decreased from days 14 to 56. In the histopathological examination, necrosis of bone marrow cells and endosteum was observed from days 7 to 56. No regeneration of bone marrow cells was observed during the experimental period. On the other hand, cryoablation did not influence osteoblasts. Furthermore, there was no pathologic fracture during the experimental period. Our results suggest that cryoablation does not induce severe adverse effects on normal bone, and therefore has potential as a therapeutic option for bone tumors, including metastatic tumors to bone. Copyright © 2014 Elsevier Inc. All rights reserved.

  8. Evaluation of interpolation techniques for the creation of gridded daily precipitation (1 × 1 km2); Cyprus, 1980-2010

    NASA Astrophysics Data System (ADS)

    Camera, Corrado; Bruggeman, Adriana; Hadjinicolaou, Panos; Pashiardis, Stelios; Lange, Manfred A.

    2014-01-01

    High-resolution gridded daily data sets are essential for natural resource management and the analyses of climate changes and their effects. This study aims to evaluate the performance of 15 simple or complex interpolation techniques in reproducing daily precipitation at a resolution of 1 km2 over topographically complex areas. Methods are tested considering two different sets of observation densities and different rainfall amounts. We used rainfall data that were recorded at 74 and 145 observational stations, respectively, spread over the 5760 km2 of the Republic of Cyprus, in the Eastern Mediterranean. Regression analyses utilizing geographical copredictors and neighboring interpolation techniques were evaluated both in isolation and combined. Linear multiple regression (LMR) and geographically weighted regression methods (GWR) were tested. These included a step-wise selection of covariables, as well as inverse distance weighting (IDW), kriging, and 3D-thin plate splines (TPS). The relative rank of the different techniques changes with different station density and rainfall amounts. Our results indicate that TPS performs well for low station density and large-scale events and also when coupled with regression models. It performs poorly for high station density. The opposite is observed when using IDW. Simple IDW performs best for local events, while a combination of step-wise GWR and IDW proves to be the best method for large-scale events and high station density. This study indicates that the use of step-wise regression with a variable set of geographic parameters can improve the interpolation of large-scale events because it facilitates the representation of local climate dynamics.
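
    Of the techniques compared in this abstract, inverse distance weighting (IDW) is the simplest to illustrate. The sketch below estimates precipitation at one grid point from surrounding stations. The function name, the planar (x, y) coordinates and the `power` exponent of 2 are assumptions, since the abstract does not give the settings used in the study.

```python
import math


def idw_estimate(stations, target, power=2.0):
    """Inverse-distance-weighted estimate at `target`.

    stations: list of (x, y, value) tuples, e.g. daily rainfall in mm.
    target:   (x, y) location of the grid point.
    power:    hypothetical distance exponent; larger values favour near stations.
    """
    weighted_sum = 0.0
    weight_total = 0.0
    for x, y, value in stations:
        distance = math.hypot(x - target[0], y - target[1])
        if distance == 0.0:
            # The grid point coincides with a station: use its value directly.
            return value
        weight = 1.0 / distance ** power
        weighted_sum += weight * value
        weight_total += weight
    return weighted_sum / weight_total


if __name__ == "__main__":
    # Two hypothetical stations 2 km apart with 10 mm and 20 mm of rain.
    gauges = [(0.0, 0.0, 10.0), (2.0, 0.0, 20.0)]
    print(idw_estimate(gauges, (1.0, 0.0)))  # midway between the stations
    print(idw_estimate(gauges, (0.5, 0.0)))  # closer to the first station
```

    Because the weights depend only on distance, plain IDW ignores covariates such as elevation, which is one reason the study also evaluates it in combination with regression models.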

  9. Performance evaluation of haptic hand-controllers in a robot-assisted surgical system.

    PubMed

    Zareinia, Kourosh; Maddahi, Yaser; Ng, Canaan; Sepehri, Nariman; Sutherland, Garnette R

    2015-12-01

    This paper presents the experimental evaluation of three commercially available haptic hand-controllers to evaluate which was more suitable to the participants. Two surgeons and seven engineers performed two peg-in-hole tasks with different levels of difficulty. Each operator guided the end-effector of a Kuka manipulator that held surgical forceps and was equipped with a surgical microscope. Sigma 7, HD(2) and PHANToM Premium 3.0 hand-controllers were compared. Ten measures were adopted to evaluate operators' performances with respect to effort, speed and accuracy in completing a task, operator improvement during the tests, and the force applied by each haptic device. The best performance was observed with the Premium 3.0; the hand-piece was able to be held in a similar way to that used by surgeons to hold conventional tools. Hand-controllers with a linkage structure similar to the human upper extremity take advantage of the inherent human brain connectome, resulting in improved surgeon performance during robotic-assisted surgery. Copyright © 2015 John Wiley & Sons, Ltd.

  10. Compatibility of pedigree-based and marker-based relationship matrices for single-step genetic evaluation.

    PubMed

    Christensen, Ole F

    2012-12-03

    Single-step methods provide a coherent and conceptually simple approach to incorporate genomic information into genetic evaluations. An issue with single-step methods is compatibility between the marker-based relationship matrix for genotyped animals and the pedigree-based relationship matrix. Therefore, it is necessary to adjust the marker-based relationship matrix to the pedigree-based relationship matrix. Moreover, with data from routine evaluations, this adjustment should in principle be based on both observed marker genotypes and observed phenotypes, but until now this has been overlooked. In this paper, I propose a new method to address this issue by 1) adjusting the pedigree-based relationship matrix to be compatible with the marker-based relationship matrix instead of the reverse and 2) extending the single-step genetic evaluation using a joint likelihood of observed phenotypes and observed marker genotypes. The performance of this method is then evaluated using two simulated datasets. The method derived here is a single-step method in which the marker-based relationship matrix is constructed assuming all allele frequencies equal to 0.5 and the pedigree-based relationship matrix is constructed using the unusual assumption that animals in the base population are related and inbred with a relationship coefficient γ and an inbreeding coefficient γ / 2. Taken together, this γ parameter and a parameter that scales the marker-based relationship matrix can handle the issue of compatibility between marker-based and pedigree-based relationship matrices. The full log-likelihood function used for parameter inference contains two terms. The first term is the REML-log-likelihood for the phenotypes conditional on the observed marker genotypes, whereas the second term is the log-likelihood for the observed marker genotypes. 
Analyses of the two simulated datasets with this new method showed that 1) the parameters involved in adjusting marker-based and pedigree-based relationship matrices can depend on both observed phenotypes and observed marker genotypes and 2) a strong association between these two parameters exists. Finally, this method performed at least as well as a method based on adjusting the marker-based relationship matrix. Using the full log-likelihood and adjusting the pedigree-based relationship matrix to be compatible with the marker-based relationship matrix provides a new and interesting approach to handle the issue of compatibility between the two matrices in single-step genetic evaluation.
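
    The abstract states that the marker-based relationship matrix is built with all allele frequencies set to 0.5. A minimal sketch of one common construction under that assumption follows: genotypes coded 0/1/2 are centred by 2p = 1 and the cross-products are scaled by 2 Σ p(1 − p) = m/2 for m markers. This particular scaling is an assumption here (the abstract does not give the formula), and the function name and toy genotypes are hypothetical.

```python
def marker_relationship_matrix(genotypes):
    """Marker-based relationship matrix assuming allele frequency 0.5.

    genotypes: list of rows, one per animal, each a list of 0/1/2 allele counts.
    Returns a square list-of-lists matrix G = Z Z' / (m / 2), where Z is the
    genotype matrix centred by 2p = 1 and m is the number of markers.
    """
    n_markers = len(genotypes[0])
    centred = [[count - 1.0 for count in row] for row in genotypes]
    scale = n_markers / 2.0  # equals 2 * sum of p * (1 - p) when every p is 0.5
    return [
        [sum(a * b for a, b in zip(row_i, row_j)) / scale for row_j in centred]
        for row_i in centred
    ]


# Three made-up animals genotyped at two markers.
toy_genotypes = [[0, 2], [2, 0], [1, 1]]
G = marker_relationship_matrix(toy_genotypes)
```

    In this toy case the first two animals carry opposite homozygous genotypes, so their off-diagonal entry is negative, while the fully heterozygous third animal has zero entries throughout.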

  11. Pre-admission criteria and pre-clinical achievement: Can they predict medical students performance in the clinical phase?

    PubMed

    Salem, Raneem O; Al-Mously, Najwa; AlFadil, Sara; Baalash, Amal

    2016-01-01

    Various factors affect medical students' performance during the clinical phase. Identifying these factors would help in mentoring weak students and in the selection process for residency programmes. Our study objective is to evaluate the impact of pre-admission criteria and pre-clinical grade point average (GPA) on undergraduate medical students' performance during the clinical phase. This study has a cross-sectional design that includes 71 fifth- and sixth-year female medical students. Data on clinical and pre-clinical GPA in medical school and on pre-admission test scores were collected. A significant correlation between clinical GPA and pre-clinical GPA was observed (p < 0.05). No such significant correlation was seen with the other variables under study. A regression analysis was performed, and the only significant predictor of students' clinical performance was the pre-clinical GPA (p < 0.001). However, no significant difference between students' clinical and pre-clinical GPA for both cohorts was observed (p > 0.05). Pre-clinical GPA is strongly correlated with and can predict medical students' performance during clinical years. Our study highlighted the importance of evaluating the academic performances of students in pre-clinical years before they move into clinical years in order to identify weak students to mentor them and monitor their progress.

  12. Failures to change stimulus evaluations by means of subliminal approach and avoidance training.

    PubMed

    Van Dessel, Pieter; De Houwer, Jan; Roets, Arne; Gast, Anne

    2016-01-01

    Previous research suggests that the repeated performance of approach and avoidance (AA) actions in response to a stimulus causes changes in stimulus evaluations. Kawakami, Phills, Steele, and Dovidio (2007) and Jones, Vilensky, Vasey, and Fazio (2013) provided evidence that these AA training effects occur even when stimuli are presented only subliminally. We also examined whether reliable AA training effects can be observed with subliminal stimulus presentations but added more sensitive checks of perceptual stimulus discriminability. Three experiments, including a direct replication of the study by Kawakami et al. (2007), failed to provide any evidence for effects of subliminal AA training on implicit or explicit evaluations. Bayesian analyses indicated that our data provide robust evidence that subliminal AA training does not cause changes in evaluations. In contrast, we observed changes in evaluations when participants were provided with (either correct or incorrect) information about the stimulus-action contingencies in the subliminal AA training task and when participants performed a supraliminal AA training task that allowed participants to detect these contingencies. These findings support the idea that contingency awareness is necessary for the occurrence of AA training effects. (c) 2016 APA, all rights reserved.

  13. Evaluation of the usefulness of color digital summation radiography in temporally sequential digital radiographs: a phantom study.

    PubMed

    Ogata, Yuji; Naito, Hiroaki; Tomiyama, Noriyuki; Hamada, Seiki; Kozuka, Takenori; Koyama, Mitsuhiro; Tsubamoto, Mitusko; Murai, Sachiko; Ueguchi, Takashi; Matsumoto, Mitsuhiro; Tamura, Shinichi; Nakamura, Hironobu; Johkoh, Takeshi

    2006-04-01

    The purpose of this study was to assess the usefulness of color digital summation radiography (CDSR) for detection of nodules on chest radiographs by observers with different levels of experience. A total of 30 radiographs of chest phantoms with abnormalities and 30 normal ones were arranged at random. Set A was conventional radiographs only. Set B consisted of both conventional radiographs and CDSR images, which were colored with magenta. Five chest radiologists and five residents evaluated both image sets on a TFT monitor. The observers were asked to rate each image set using a continuous rating scale. The reading time for each set was also recorded. In set A, the performance of chest radiologists was significantly superior to that of the residents (P < 0.05). However, in set B, there was no significant difference in the performance of the chest radiologists and the residents. In both observer groups, the mean reading time per case in set B was significantly shorter than that in set A (P < 0.01). By using CDSR, the detection capability of observers with little experience improves and is comparable to that of experienced observers. Moreover, the reading time becomes much shorter using CDSR.

  14. Risk-adjusted performance evaluation in three academic thoracic surgery units using the Eurolung risk models.

    PubMed

    Pompili, Cecilia; Shargall, Yaron; Decaluwe, Herbert; Moons, Johnny; Chari, Madhu; Brunelli, Alessandro

    2018-01-03

    The objective of this study was to evaluate the performance of 3 thoracic surgery centres using the Eurolung risk models for morbidity and mortality. This was a retrospective analysis performed on data collected from 3 academic centres (2014-2016). Seven hundred and twenty-one patients in Centre 1, 857 patients in Centre 2 and 433 patients in Centre 3 who underwent anatomical lung resections were analysed. The Eurolung1 and Eurolung2 models were used to predict risk-adjusted cardiopulmonary morbidity and 30-day mortality rates. Observed and risk-adjusted outcomes were compared within each centre. The observed morbidity of Centre 1 was in line with the predicted morbidity (observed 21.1% vs predicted 22.7%, P = 0.31). Centre 2 performed better than expected (observed morbidity 20.2% vs predicted 26.7%, P < 0.001), whereas the observed morbidity of Centre 3 was higher than the predicted morbidity (observed 41.1% vs predicted 24.3%, P < 0.001). Centre 1 had higher observed mortality when compared with the predicted mortality (3.6% vs 2.1%, P = 0.005), whereas Centre 2 had an observed mortality rate significantly lower than the predicted mortality rate (1.2% vs 2.5%, P = 0.013). Centre 3 had an observed mortality rate in line with the predicted mortality rate (observed 1.4% vs predicted 2.4%, P = 0.17). The observed mortality rates in the patients with major complications were 30.8% in Centre 1 (versus predicted mortality rate 3.8%, P < 0.001), 8.2% in Centre 2 (versus predicted mortality rate 4.1%, P = 0.030) and 9.0% in Centre 3 (versus predicted mortality rate 3.5%, P = 0.014). The Eurolung models were successfully used as risk-adjusting instruments to internally audit the outcomes of 3 different centres, showing their applicability for future quality improvement initiatives. © The Author(s) 2018. Published by Oxford University Press on behalf of the European Association for Cardio-Thoracic Surgery. All rights reserved.
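
    The observed-versus-predicted comparison reported above can be sketched with a one-sample test of a proportion. This is an assumption for illustration only: the abstract does not say which test the authors used, and the function name is hypothetical. The patient counts and rates are the morbidity figures quoted in the abstract.

```python
import math


def observed_vs_predicted(n_patients, observed_rate, predicted_rate):
    """Compare an observed event rate with a model-predicted rate.

    Returns (observed/predicted ratio, z statistic, two-sided p-value) using a
    normal approximation with the predicted rate as the null proportion.
    """
    standard_error = math.sqrt(predicted_rate * (1.0 - predicted_rate) / n_patients)
    z = (observed_rate - predicted_rate) / standard_error
    # Two-sided tail probability of a standard normal: 2 * (1 - Phi(|z|)).
    p_two_sided = math.erfc(abs(z) / math.sqrt(2.0))
    return observed_rate / predicted_rate, z, p_two_sided


# Morbidity figures from the abstract: (patients, observed, predicted).
centres = {
    "Centre 1": (721, 0.211, 0.227),
    "Centre 2": (857, 0.202, 0.267),
    "Centre 3": (433, 0.411, 0.243),
}
results = {name: observed_vs_predicted(*values) for name, values in centres.items()}
```

    A ratio below 1 means fewer events than the risk model predicted (Centre 2), and a ratio above 1 means more (Centre 3); the p-value indicates whether the gap is larger than chance variation at that sample size would explain.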

  15. Management of children exposed to Mycobacterium tuberculosis: a public health evaluation in West Java, Indonesia.

    PubMed

    Rutherford, Merrin E; Ruslami, Rovina; Anselmo, Melissa; Alisjahbana, Bachti; Yulianti, Neti; Sampurno, Hedy; van Crevel, Reinout; Hill, Philip C

    2013-12-01

    To investigate qualitatively and quantitatively the performance of a programme for managing the child contacts of adult tuberculosis patients in Indonesia. A public health evaluation framework was used to assess gaps in a child contact management programme at a lung clinic. Targets for programme performance indicators were derived from established programme indicator targets, the scientific literature and expert opinion. Compliance with tuberculosis screening, the initiation of isoniazid preventive therapy in children younger than 5 years, the accuracy of tuberculosis diagnosis and adherence to preventive therapy were assessed in 755 child contacts in two cohorts. In addition, 22 primary caregivers and 34 clinic staff were interviewed to evaluate knowledge and acceptance of child contact management. The cost to caregivers was recorded. Gaps between observed and target indicator values were quantified. The gaps between observed and target performance indicators were: 82% for screening compliance; 64 to 100% for diagnostic accuracy; 50% for the initiation of preventive therapy; 54% for adherence to therapy; and 50% for costs. Many staff did not have adequate knowledge of, or an appropriate attitude towards, child contact management, especially regarding isoniazid preventive therapy. Caregivers had good knowledge of screening but not of preventive therapy and had difficulty travelling to the clinic and paying costs. The study identified widespread gaps in the performance of a child contact management system in Indonesia, all of which appear amenable to intervention. The public health evaluation framework used could be applied in other settings where child contact management is failing.

  16. In vitro and in vivo evaluation of diamond-coated strips.

    PubMed

    Lione, Roberta; Gazzani, Francesca; Pavoni, Chiara; Guarino, Stefano; Tagliaferri, Vincenzo; Cozza, Paola

    2017-05-01

    To test in vitro and in vivo the wear performance of diamond-coated strips by means of tribological testing and scanning electronic microscope (SEM). To evaluate the in vitro wear performance, a tribological test was performed by a standard tribometer. The abrasive strips slid against stationary, freshly extracted premolars fixed in resin blocks, at a 2-newton load. At the end of the tribological test, the residual surface of the strip was observed by means of SEM analysis, which was performed every 50 meters until reaching 300 meters. For the in vivo analysis, the strip was used for 300 seconds, corresponding to 250 meters. The strips presented a fenestrated structure characterized by diamond granules alternating with voids. After the first 50 meters, it was possible to observe tooth material deposited on the surface of the strips and a certain number of abrasive grains detached. The surface of the strip after 250 meters appeared smoother and therefore less effective in its abrasive power. After 300 seconds of in vivo utilization of the strip, it was possible to observe the detachment of diamond abrasive grains, the near absence of the grains and, therefore, loss of abrasive power. Under ideal conditions, after 5 minutes (30 meters) of use, the strip loses its abrasive capacity by about 60%. In vivo, a more rapid loss of abrasive power was observed due to the greater load applied by the clinician in forcing the strip into the contact point.

  17. Improving the interview skills of college students using behavioral skills training.

    PubMed

    Stocco, Corey S; Thompson, Rachel H; Hart, John M; Soriano, Heidi L

    2017-07-01

    Obtaining a job as a college graduate is partly dependent on interview performance. We used a multiple baseline design across skills to evaluate the effects of behavioral skills training with self-evaluation for five college students. Training effects were evaluated using simulated interviews as baseline and posttraining assessments. All participants acquired targeted skills, but we observed some individual differences. Participants were satisfied with training outcomes and rated the procedures as acceptable. Furthermore, ratings from university staff who provide interview training indicated that training improved performance across several skills for the majority of participants. © 2017 Society for the Experimental Analysis of Behavior.

  18. Electroacoustic verification of frequency modulation systems in cochlear implant users.

    PubMed

    Fidêncio, Vanessa Luisa Destro; Jacob, Regina Tangerino de Souza; Tanamati, Liége Franzini; Bucuvic, Érika Cristina; Moret, Adriane Lima Mortari

    2017-12-26

    The frequency modulation system is a device that helps to improve speech perception in noise and is considered the most beneficial approach to improve speech recognition in noise in cochlear implant users. According to guidelines, there is a need to perform a check before fitting the frequency modulation system. Although there are recommendations regarding the behavioral tests that should be performed at the fitting of the frequency modulation system to cochlear implant users, there are no published recommendations regarding the electroacoustic test that should be performed. To perform and determine the validity of an electroacoustic verification test for frequency modulation systems coupled to different cochlear implant speech processors. The sample included 40 participants between 5 and 18 years of age, users of four different models of speech processors. For the electroacoustic evaluation, we used the Audioscan Verifit device with the HA-1 coupler and the listening check devices corresponding to each speech processor model. In cases where transparency was not achieved, a modification was made in the frequency modulation gain adjustment and we used the Brazilian version of the "Phrases in Noise Test" to evaluate speech perception in competitive noise. It was observed that there was transparency between the frequency modulation system and the cochlear implant in 85% of the participants evaluated. After adjusting the gain of the frequency modulation receiver in the other participants, the devices showed transparency when the electroacoustic verification test was repeated. It was also observed that patients demonstrated better performance in speech perception in noise after a new adjustment; that is, in these cases, the electroacoustic transparency caused behavioral transparency. The electroacoustic evaluation protocol suggested was effective in evaluation of transparency between the frequency modulation system and the cochlear implant. 
Performing the adjustment of the speech processor and the frequency modulation system gain are essential when fitting this device. Copyright © 2017 Associação Brasileira de Otorrinolaringologia e Cirurgia Cérvico-Facial. Published by Elsevier Editora Ltda. All rights reserved.

  19. Adapting the McMaster-Ottawa scale and developing behavioral anchors for assessing performance in an interprofessional Team Observed Structured Clinical Encounter.

    PubMed

    Lie, Désirée; May, Win; Richter-Lagha, Regina; Forest, Christopher; Banzali, Yvonne; Lohenry, Kevin

    2015-01-01

    Current scales for interprofessional team performance do not provide adequate behavioral anchors for performance evaluation. The Team Observed Structured Clinical Encounter (TOSCE) provides an opportunity to adapt and develop an existing scale for this purpose. We aimed to test the feasibility of using a retooled scale to rate performance in a standardized patient encounter and to assess faculty ability to accurately rate both individual students and teams. The 9-point McMaster-Ottawa Scale developed for a TOSCE was converted to a 3-point scale with behavioral anchors. Students from four professions were trained a priori to perform in teams of four at three different levels as individuals and teams. Blinded faculty raters were trained to use the scale to evaluate individual and team performances. G-theory was used to analyze ability of faculty to accurately rate individual students and teams using the retooled scale. Sixteen faculty, in groups of four, rated four student teams, each participating in the same TOSCE station. Faculty expressed comfort rating up to four students in a team within a 35-min timeframe. Accuracy of faculty raters varied (38-81% individuals, 50-100% teams), with errors in the direction of over-rating individual, but not team performance. There was no consistent pattern of error for raters. The TOSCE can be administered as an evaluation method for interprofessional teams. However, faculty demonstrate a 'leniency error' in rating students, even with prior training using behavioral anchors. To improve consistency, we recommend two trained faculty raters per station.

  20. Performance of the European System for Cardiac Operative Risk Evaluation II: a meta-analysis of 22 studies involving 145,592 cardiac surgery procedures.

    PubMed

    Guida, Pietro; Mastro, Florinda; Scrascia, Giuseppe; Whitlock, Richard; Paparella, Domenico

    2014-12-01

    A systematic review of the European System for Cardiac Operative Risk Evaluation (euroSCORE) II performance for prediction of operative mortality after cardiac surgery has not been performed. We conducted a meta-analysis of studies based on the predictive accuracy of the euroSCORE II. We searched the Embase and PubMed databases for all English-only articles reporting performance characteristics of the euroSCORE II. The area under the receiver operating characteristic curve, the observed/expected mortality ratio, and observed-expected mortality difference with their 95% confidence intervals were analyzed. Twenty-two articles were selected, including 145,592 procedures. Operative mortality occurred in 4293 (2.95%), whereas the expected events according to euroSCORE II were 4802 (3.30%). Meta-analysis of these studies provided an area under the receiver operating characteristic curve of 0.792 (95% confidence interval, 0.773-0.811), an estimated observed/expected ratio of 1.019 (95% confidence interval, 0.899-1.139), and observed-expected difference of 0.125 (95% confidence interval, -0.269 to 0.519). Statistical heterogeneity was detected among retrospective studies including less recent procedures. Subgroups analysis confirmed the robustness of combined estimates for isolated valve procedures and those combined with revascularization surgery. A significant overestimation of the euroSCORE II with an observed/expected ratio of 0.829 (95% confidence interval, 0.677-0.982) was observed in isolated coronary artery bypass grafting and a slight underestimation of predictions in high-risk patients (observed/expected ratio 1.253 and observed-expected difference 1.859). Despite the heterogeneity, the results from this meta-analysis show a good overall performance of the euroSCORE II in terms of discrimination and accuracy of model predictions for operative mortality. 
Validation of the euroSCORE II in prospective populations needs to be further studied for a continuous improvement of patients' risk stratification before cardiac surgery. Copyright © 2014 The American Association for Thoracic Surgery. Published by Elsevier Inc. All rights reserved.
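
    The headline counts in this abstract can be checked with simple arithmetic. The sketch below recomputes the crude observed and expected mortality rates and their ratio from the totals quoted above; the variable names are hypothetical. Note that the crude ratio of the totals need not equal the meta-analytic estimate of 1.019, presumably because the latter pools per-study ratios with study-level weights.

```python
# Totals quoted in the abstract.
procedures = 145_592
observed_deaths = 4_293
expected_deaths = 4_802  # expected events according to euroSCORE II

observed_rate = observed_deaths / procedures
expected_rate = expected_deaths / procedures

# Crude observed/expected ratio over the pooled totals (not the
# meta-analytic estimate, which weights each study separately).
crude_oe_ratio = observed_deaths / expected_deaths

observed_percent = round(observed_rate * 100, 2)
expected_percent = round(expected_rate * 100, 2)
```

    A ratio below 1 indicates that the model predicted more deaths than occurred (overestimation), which is how the abstract interprets the value of 0.829 reported for isolated coronary artery bypass grafting.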

  1. Performance and scalability evaluation of "Big Memory" on Blue Gene Linux.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yoshii, K.; Iskra, K.; Naik, H.

    2011-05-01

    We address memory performance issues observed in Blue Gene Linux and discuss the design and implementation of 'Big Memory' - an alternative, transparent memory space introduced to eliminate the memory performance issues. We evaluate the performance of Big Memory using custom memory benchmarks, NAS Parallel Benchmarks, and the Parallel Ocean Program, at a scale of up to 4,096 nodes. We find that Big Memory successfully resolves the performance issues normally encountered in Blue Gene Linux. For the ocean simulation program, we even find that Linux with Big Memory provides better scalability than does the lightweight compute node kernel designed solely for high-performance applications. Originally intended exclusively for compute node tasks, our new memory subsystem dramatically improves the performance of certain I/O node applications as well. We demonstrate this performance using the central processor of the LOw Frequency ARray radio telescope as an example.

  2. Analyses on hydrophobicity and attractiveness of all-atom distance-dependent potentials

    PubMed Central

    Shirota, Matsuyuki; Ishida, Takashi; Kinoshita, Kengo

    2009-01-01

    Accurate model evaluation is a crucial step in protein structure prediction. For this purpose, statistical potentials, which evaluate a model structure based on the observed atomic distance frequencies in comparison with those in reference states, have been widely used. The reference state is a virtual state where all of the atomic interactions are turned off, and it provides a standard to measure the observed frequencies. In this study, we examined seven all-atom distance-dependent potentials with different reference states. As a result, we observed that the variations of atom pair composition and those of distance distributions in the reference states produced systematic changes in the hydrophobic and attractive characteristics of the potentials. The performance evaluations with the CASP7 structures indicated that the preference of hydrophobic interactions improved the correlation between the energy and the GDT-TS score, but decreased the Z-score of the native structure. The attractiveness of potential improved both the correlation and Z-score for template-based modeling targets, but the benefit was smaller in free modeling targets. These results indicated that the performances of the potentials were more strongly influenced by their characteristics than by the accuracy of the definitions of the reference states. PMID:19588493

  3. Comparison of AERMOD and CALPUFF models for simulating SO2 concentrations in a gas refinery.

    PubMed

    Atabi, Farideh; Jafarigol, Farzaneh; Moattar, Faramarz; Nouri, Jafar

    2016-09-01

    In this study, the concentration of SO2 from a gas refinery located in complex terrain was calculated by the steady-state AERMOD model and the non-steady-state CALPUFF model. First, SO2 concentrations emitted from 16 refinery stacks were obtained by field measurements at nine receptors in four seasons, and then the performance of both models was evaluated. The SO2 ambient concentrations simulated by each model were compared with the observed concentrations, and the model results were compared with each other. The evaluation of the two models' ability to simulate SO2 concentrations was based on statistical analysis and Q-Q plots. Review of the statistical parameters and Q-Q plots showed that the performance of both models in simulating the concentration of SO2 in the region can be considered acceptable. The composite ratio between simulated and observed values across the receptors for all four averaging times is 0.72 for AERMOD, whereas CALPUFF's ratio is 0.89. However, in the complex conditions of topography, CALPUFF offers better agreement with the observed concentrations.

  4. Presentation of the EURODELTA III intercomparison exercise - evaluation of the chemistry transport models' performance on criteria pollutants and joint analysis with meteorology

    NASA Astrophysics Data System (ADS)

    Bessagnet, Bertrand; Pirovano, Guido; Mircea, Mihaela; Cuvelier, Cornelius; Aulinger, Armin; Calori, Giuseppe; Ciarelli, Giancarlo; Manders, Astrid; Stern, Rainer; Tsyro, Svetlana; García Vivanco, Marta; Thunis, Philippe; Pay, Maria-Teresa; Colette, Augustin; Couvidat, Florian; Meleux, Frédérik; Rouïl, Laurence; Ung, Anthony; Aksoyoglu, Sebnem; María Baldasano, José; Bieser, Johannes; Briganti, Gino; Cappelletti, Andrea; D'Isidoro, Massimo; Finardi, Sandro; Kranenburg, Richard; Silibello, Camillo; Carnevale, Claudio; Aas, Wenche; Dupont, Jean-Charles; Fagerli, Hilde; Gonzalez, Lucia; Menut, Laurent; Prévôt, André S. H.; Roberts, Pete; White, Les

    2016-10-01

    The EURODELTA III exercise has facilitated a comprehensive intercomparison and evaluation of chemistry transport model performances. Participating models performed calculations for four 1-month periods in different seasons in the years 2006 to 2009, allowing the influence of different meteorological conditions on model performances to be evaluated. The exercise was performed with strict requirements for the input data, with few exceptions. As a consequence, most of the differences in the outputs will be attributed to the differences in model formulations of chemical and physical processes. The models were evaluated mainly for background rural stations in Europe. The performance was assessed in terms of bias, root mean square error and correlation with respect to the concentrations of air pollutants (NO2, O3, SO2, PM10 and PM2.5), as well as key meteorological variables. Though most of the meteorological parameters were prescribed, some variables like the planetary boundary layer (PBL) height and the vertical diffusion coefficient were derived in the model preprocessors and can partly explain the spread in model results. In general, the daytime PBL height is underestimated by all models. The largest variability of predicted PBL is observed over the ocean and seas. For ozone, this study shows the importance of proper boundary conditions for accurate model calculations and then on the regime of the gas and particle chemistry. The models show similar and quite good performance for nitrogen dioxide, whereas they struggle to accurately reproduce measured sulfur dioxide concentrations (for which the agreement with observations is the poorest). In general, the models provide a close-to-observations map of particulate matter (PM2.5 and PM10) concentrations over Europe, with correlations in the range 0.4-0.7 and a systematic underestimation reaching -10 µg m-3 for PM10. The highest concentrations are much more underestimated, particularly in wintertime. 
Further evaluation of the mean diurnal cycles of PM reveals a general model tendency to overestimate the effect of the PBL height rise on PM levels in the morning, while the intensity of afternoon chemistry causes the formation of secondary species to be underestimated. This results in larger modelled PM diurnal variations than the observations for all seasons. The models tend to be too sensitive to the daily variation of the PBL. All in all, in most cases model performances are more influenced by the model setup than the season. The good representation of temporal evolution of wind speed is most responsible for the models' skillfulness in reproducing the daily variability of pollutant concentrations (e.g. the development of peak episodes), while the reconstruction of the PBL diurnal cycle seems to play a larger role in driving the corresponding pollutant diurnal cycle and hence determines the presence of systematic positive and negative biases detectable on a daily basis.
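
    The three scores named in this abstract (bias, root mean square error and correlation) can be sketched as follows. The function name and the sample concentrations are hypothetical, and the sign convention (model minus observation, so a negative bias means underestimation) is an assumption consistent with the "-10 µg m-3" underestimation quoted above.

```python
import math


def evaluation_scores(modelled, observed):
    """Return (bias, RMSE, Pearson correlation) for paired model/observation values."""
    n = len(observed)
    differences = [m - o for m, o in zip(modelled, observed)]
    bias = sum(differences) / n  # negative when the model underestimates
    rmse = math.sqrt(sum(d * d for d in differences) / n)

    mean_m = sum(modelled) / n
    mean_o = sum(observed) / n
    covariance = sum((m - mean_m) * (o - mean_o) for m, o in zip(modelled, observed))
    spread_m = sum((m - mean_m) ** 2 for m in modelled)
    spread_o = sum((o - mean_o) ** 2 for o in observed)
    correlation = covariance / math.sqrt(spread_m * spread_o)
    return bias, rmse, correlation


# Made-up daily PM10 values (µg m-3): the model tracks the observations
# perfectly in time but sits 2 units too low throughout.
modelled_pm10 = [8.0, 10.0, 12.0]
observed_pm10 = [10.0, 12.0, 14.0]
bias, rmse, correlation = evaluation_scores(modelled_pm10, observed_pm10)
```

    The toy case shows why the scores are reported together: a model can correlate perfectly with observations and still carry a systematic bias.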

  5. Performance of biometric quality measures.

    PubMed

    Grother, Patrick; Tabassi, Elham

    2007-04-01

    We document methods for the quantitative evaluation of systems that produce a scalar summary of a biometric sample's quality. We are motivated by a need to test claims that quality measures are predictive of matching performance. We regard a quality measurement algorithm as a black box that converts an input sample to an output scalar. We evaluate it by quantifying the association between those values and observed matching results. We advance detection error trade-off and error versus reject characteristics as metrics for the comparative evaluation of sample quality measurement algorithms. We precede this with a definition of sample quality and a description of the operational use of quality measures. We emphasize the performance goal by including a procedure for annotating the samples of a reference corpus with quality values derived from empirical recognition scores.
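
    The error-versus-reject idea mentioned above can be sketched as follows: discard the lowest-quality fraction of samples and recompute the false non-match rate on what remains. This is a simplified illustration under stated assumptions, not the authors' procedure: each genuine comparison is given a single quality value, and the match threshold, function name and data are all hypothetical.

```python
def error_vs_reject(samples, threshold, reject_fractions):
    """False non-match rate after rejecting the lowest-quality samples.

    samples:          list of (quality, genuine_match_score) pairs (hypothetical).
    threshold:        scores below this count as false non-matches.
    reject_fractions: fractions of the lowest-quality samples to discard.
    Returns one false non-match rate per reject fraction.
    """
    by_quality = sorted(samples, key=lambda pair: pair[0])
    rates = []
    for fraction in reject_fractions:
        n_rejected = int(fraction * len(by_quality))
        kept = by_quality[n_rejected:]
        false_non_matches = sum(1 for _, score in kept if score < threshold)
        rates.append(false_non_matches / len(kept))
    return rates


# Made-up data in which low quality goes with low genuine scores.
toy_samples = [(1, 0.2), (2, 0.4), (3, 0.9), (4, 0.8)]
curve = error_vs_reject(toy_samples, threshold=0.5, reject_fractions=[0.0, 0.25, 0.5])
```

    If a quality measure is predictive of matching performance, the error rate should fall as more low-quality samples are rejected; a flat curve would suggest the measure carries little information.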

  6. A framework for improving the cost-effectiveness of DSM program evaluations

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sonnenblick, R.; Eto, J.

    The prudence of utility demand-side management (DSM) investments hinges on their performance, yet evaluating performance is complicated because the energy saved by DSM programs can never be observed directly but only inferred. This study frames and begins to answer the following questions: (1) how well do current evaluation methods perform in improving confidence in the measurement of energy savings produced by DSM programs; (2) in view of this performance, how can limited evaluation resources be best allocated to maximize the value of the information they provide? The authors review three major classes of methods for estimating annual energy savings: tracking database (sometimes called engineering estimates), end-use metering, and billing analysis and examine them in light of the uncertainties in current estimates of DSM program measure lifetimes. The authors assess the accuracy and precision of each method and construct trade-off curves to examine the costs of increases in accuracy or precision. Several approaches for improving evaluations for the purpose of assessing program cost effectiveness are demonstrated. The methods can be easily generalized to other evaluation objectives, such as shared savings incentive payments.

  7. Performance and Evaluation of the Global Modeling and Assimilation Office Observing System Simulation Experiment

    NASA Technical Reports Server (NTRS)

    Prive, Nikki; Errico, R. M.; Carvalho, D.

    2018-01-01

    The National Aeronautics and Space Administration Global Modeling and Assimilation Office (NASA/GMAO) has spent more than a decade developing and implementing a global Observing System Simulation Experiment framework for use in evaluating both new observation types and the behavior of data assimilation systems. The NASA/GMAO OSSE has constantly evolved to reflect changes in the Gridpoint Statistical Interpolation data assimilation system, the Global Earth Observing System model, version 5 (GEOS-5), and the real world observational network. Software and observational datasets for the GMAO OSSE are publicly available, along with a technical report. Substantial modifications have recently been made to the NASA/GMAO OSSE framework, including the character of synthetic observation errors, new instrument types, and more sophisticated atmospheric wind vectors. These improvements will be described, along with the overall performance of the current OSSE. Lessons learned from investigations into correlated errors and model error will be discussed.

  8. Study of Integrated USV/UUV Observation System Performance in Monterey Bay

    DTIC Science & Technology

    2017-09-01

    5 IV. EXPERIMENTAL SETUP... quasi-stationary at depth in low-current environments. This thesis evaluates the performance of deep sensors in determining behavior of a moving source...acoustic sensors that would be quasi-stationary receivers when in drift mode at depth in low current environments. One key advantage to this technique is

  9. Least-Squares Models to Correct for Rater Effects in Performance Assessment.

    ERIC Educational Resources Information Center

    Raymond, Mark R.; Viswesvaran, Chockalingam

    This study illustrates the use of three least-squares models to control for rater effects in performance evaluation: (1) ordinary least squares (OLS); (2) weighted least squares (WLS); and (3) OLS subsequent to applying a logistic transformation to observed ratings (LOG-OLS). The three models were applied to ratings obtained from four…

  10. Cognitive Correlates of Functional Abilities in Individuals with Mild Cognitive Impairment: Comparison of Questionnaire, Direct Observation and Performance-based Measures

    PubMed Central

    Schmitter-Edgecombe, Maureen; Parsey, Carolyn M.

    2014-01-01

    The relationship between and the cognitive correlates of several proxy measures of functional status were studied in a population with mild cognitive impairment (MCI). Participants were 51 individuals diagnosed with MCI and 51 cognitively healthy older adults (OA). Participants completed performance-based functional status tests, standardized neuropsychological tests, and performed eight activities of daily living (e.g., watered plants, filled medication dispenser) while under direct observation in a campus apartment. An informant interview about everyday functioning was also conducted. Compared to the OA control group, the MCI group performed more poorly on all proxy measures of everyday functioning. The informant-report of instrumental activities of daily living (IADL) did not correlate with the two performance-based measures; however, both the informant-report IADL and the performance-based everyday problem-solving test correlated with the direct observation measure. After controlling for age and education, cognitive predictors did not explain a significant amount of variance in the performance-based measures; however, performance on a delayed memory task was a unique predictor for the informant-report IADL, and processing speed predicted unique variance for the direct observation score. These findings indicate that differing methods for evaluating functional status are not assessing completely overlapping aspects of everyday functioning in the MCI population. PMID:24766574

  11. Cognitive correlates of functional abilities in individuals with mild cognitive impairment: comparison of questionnaire, direct observation, and performance-based measures.

    PubMed

    Schmitter-Edgecombe, Maureen; Parsey, Carolyn M

    2014-01-01

    The relationship between, and the cognitive correlates of, several proxy measures of functional status were studied in a population with mild cognitive impairment (MCI). Participants were 51 individuals diagnosed with MCI and 51 cognitively healthy older adults (OA). Participants completed performance-based functional status tests and standardized neuropsychological tests, and performed eight activities of daily living (e.g., watered plants, filled medication dispenser) while under direct observation in a campus apartment. An informant interview about everyday functioning was also conducted. Compared to the OA control group, the MCI group performed more poorly on all proxy measures of everyday functioning. The informant report of instrumental activities of daily living (IADL) did not correlate with the two performance-based measures; however, both the informant-report IADL and the performance-based everyday problem-solving test correlated with the direct observation measure. After controlling for age and education, cognitive predictors did not explain a significant amount of variance in the performance-based measures; however, performance on a delayed memory task was a unique predictor for the informant-report IADL, and processing speed predicted unique variance for the direct observation score. These findings indicate that differing methods for evaluating functional status are not assessing completely overlapping aspects of everyday functioning in the MCI population.

  12. Comparative measurement of collagen bundle orientation by Fourier analysis and semiquantitative evaluation: reliability and agreement in Masson's trichrome, Picrosirius red and confocal microscopy techniques.

    PubMed

    Marcos-Garcés, V; Harvat, M; Molina Aguilar, P; Ferrández Izquierdo, A; Ruiz-Saurí, A

    2017-08-01

    Measurement of collagen bundle orientation in histopathological samples is a widely used and useful technique in many research and clinical scenarios. Fourier analysis is the preferred method for performing this measurement, but the most appropriate staining and microscopy technique remains unclear. Some authors advocate the use of Haematoxylin-Eosin (H&E) and confocal microscopy, but there are no studies comparing this technique with other classical collagen stainings. In our study, 46 human skin samples were collected, processed for histological analysis and stained with Masson's trichrome, Picrosirius red and H&E. Five microphotographs of the reticular dermis were taken with a 200× magnification with light microscopy, polarized microscopy and confocal microscopy, respectively. Two independent observers measured collagen bundle orientation with semiautomated Fourier analysis with the Image-Pro Plus 7.0 software and three independent observers performed a semiquantitative evaluation of the same parameter. The average orientation for each case was calculated with the values of the five pictures. We analyzed the interrater reliability, the consistency between Fourier analysis and average semiquantitative evaluation and the consistency between measurements in Masson's trichrome, Picrosirius red and H&E-confocal. Statistical analysis for reliability and agreement was performed with the SPSS 22.0 software and consisted of intraclass correlation coefficient (ICC), Bland-Altman plots and limits of agreement and coefficient of variation. Interrater reliability was almost perfect (ICC > 0.8) with all three histological and microscopy techniques and always superior in Fourier analysis than in average semiquantitative evaluation. 
    Measurements were consistent between Fourier analysis by one observer and average semiquantitative evaluation by three observers, with an almost perfect agreement with Masson's trichrome and Picrosirius red techniques (ICC > 0.8) and a strong agreement with H&E-confocal (0.7 < ICC < 0.8). Comparison of measurements between the three techniques for the same observer showed an almost perfect agreement (ICC > 0.8), better with Fourier analysis than with semiquantitative evaluation (single and average). These results in nonpathological skin samples were also confirmed in a preliminary analysis in eight scleroderma skin samples. Our results show that Masson's trichrome and Picrosirius red are consistent with H&E-confocal for measuring collagen bundle orientation in histological samples and could thus be used interchangeably for this purpose. Fourier analysis is superior to average semiquantitative evaluation and should remain the preferred method. © 2017 The Authors Journal of Microscopy © 2017 Royal Microscopical Society.
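
    The Bland-Altman limits of agreement mentioned in this abstract can be sketched in a few lines of Python. This is a generic illustration rather than the SPSS analysis the authors ran; the function name and the orientation values are invented, and the conventional 1.96 multiplier assumes roughly normally distributed differences.

```python
from statistics import mean, stdev


def bland_altman(method_a, method_b):
    """Return (bias, lower limit, upper limit) for paired measurements."""
    # Work on the per-sample differences between the two methods.
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(diffs)
    sd = stdev(diffs)  # sample standard deviation of the differences
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


# Invented orientation scores for four samples measured by two methods.
fourier = [10.0, 12.0, 14.0, 16.0]
semiquantitative = [9.0, 13.0, 13.0, 17.0]
bias, lower, upper = bland_altman(fourier, semiquantitative)
print(f"bias={bias:.3f}, limits of agreement=({lower:.3f}, {upper:.3f})")
```

    A bias near zero with narrow limits indicates that the two methods can stand in for each other; wide limits indicate that individual measurements may disagree substantially even when the average difference is small.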

  13. Dentin bonding performance and interface observation of an MMA-based restorative material.

    PubMed

    Shinagawa, Junichi; Inoue, Go; Nikaido, Toru; Ikeda, Masaomi; Sadr, Alireza; Tagami, Junji

    2016-07-30

    The purpose of this study was to evaluate bonding performance and dentin interface acid resistance using a 4-META/MMA-TBB based restorative material (BF) compared to a conventional 4-META/MMA-TBB resin cement (SB), and the effect of sodium fluoride (NaF) addition to the materials. Dentin surfaces were treated with 10% citric acid-3% ferric chloride (10-3) or 4-META containing self-etching primer (TP), followed by application of BF or SB polymer powders with or without NaF, to evaluate microtensile bond strength (µTBS) in six experimental groups; 10-3/SB, 10-3/BF, TP/SB, TP/BF, TP/SB/NaF and TP/BF/NaF. SEM observation of the resin-dentin interface was performed after acid-base challenge to evaluate interfacial dentin resistance to acid attack. TP/BF showed highest µTBS, while NaF polymers decreased µTBS. TP/BF showed funnel-shaped erosion at the interface, however, NaF polymers improved acid resistance of interface. In conclusion, BF demonstrated high µTBSs and low acid-resistance at the interface. NaF addition enhanced acid resistance but decreased µTBS.

  14. Evaluating Global Emission Inventories of Biogenic Bromocarbons

    NASA Technical Reports Server (NTRS)

    Hossaini, Ryan; Mantle, H.; Chipperfield, M. P.; Montzka, S. A.; Hamer, P.; Ziska, F.; Quack, B.; Kruger, K.; Tegtmeier, S.; Atlas, E.; hide

    2013-01-01

    Emissions of halogenated very short-lived substances (VSLS) are poorly constrained. However, their inclusion in global models is required to simulate a realistic inorganic bromine (Bry) loading in both the troposphere, where bromine chemistry perturbs global oxidizing capacity, and in the stratosphere, where it is a major sink for ozone (O3). We have performed simulations using a 3-D chemical transport model (CTM) including three top-down and a single bottom-up derived emission inventory of the major brominated VSLS bromoform (CHBr3) and dibromomethane (CH2Br2). We perform the first concerted evaluation of these inventories, comparing both the magnitude and spatial distribution of emissions. For a quantitative evaluation of each inventory, model output is compared with independent long-term observations at National Oceanic and Atmospheric Administration (NOAA) ground-based stations and with aircraft observations made during the NSF (National Science Foundation) HIAPER Pole-to-Pole Observations (HIPPO) project. For CHBr3, the mean absolute deviation between model and surface observation ranges from 0.22 (38 %) to 0.78 (115 %) parts per trillion (ppt) in the tropics, depending on emission inventory. For CH2Br2, the range is 0.17 (24 %) to 1.25 (167 %) ppt. We also use aircraft observations made during the 2011 Stratospheric Ozone: Halogen Impacts in a Varying Atmosphere (SHIVA) campaign, in the tropical western Pacific. Here, the performance of the various inventories also varies significantly, but overall the CTM is able to reproduce observed CHBr3 well in the free troposphere using an inventory based on observed sea-to-air fluxes. Finally, we identify the range of uncertainty associated with these VSLS emission inventories on stratospheric bromine loading due to VSLS (Br(VSLS/y)). Our simulations show Br(VSLS/y) ranges from approximately 4.0 to 8.0 ppt depending on the inventory. 
    We report an optimized estimate at the lower end of this range (approximately 4 ppt) based on combining the CHBr3 and CH2Br2 inventories which give best agreement with the compilation of observations in the tropics.

  15. Influence of signal processing strategy in auditory abilities.

    PubMed

    Melo, Tatiana Mendes de; Bevilacqua, Maria Cecília; Costa, Orozimbo Alves; Moret, Adriane Lima Mortari

    2013-01-01

    The signal processing strategy is a parameter that may influence the auditory performance of cochlear implant users, and it is important to optimize this parameter to provide better speech perception, especially in difficult listening situations. To evaluate the individual's auditory performance using two different signal processing strategies. Prospective study with 11 prelingually deafened children with open-set speech recognition. A within-subjects design was used to compare performance with standard HiRes and HiRes 120 at three different moments. During test sessions, each subject's performance was evaluated by warble-tone sound-field thresholds and speech perception evaluation, in quiet and in noise. In quiet, children S1, S4, S5 and S7 showed better performance with the HiRes 120 strategy and children S2, S9 and S11 showed better performance with the HiRes strategy. In noise, it was also observed that some children performed better using the HiRes 120 strategy and others with HiRes. Not all children presented the same pattern of response to the different strategies used in this study, which reinforces the need to look at optimizing cochlear implant clinical programming.

  16. Long-term prospective evaluation of intestinal anastomosis using stainless steel staples in 14 dogs

    PubMed Central

    Benlloch-Gonzalez, Manuel; Gomes, Eymeric; Bouvy, Bernard; Poncet, Cyrill

    2015-01-01

    This prospective clinical study evaluated the use, complications, and clinical and ultrasonographic follow-ups of end-to-end intestinal anastomoses with skin staples in naturally occurring diseases in canine small and large intestines. Intestinal anastomoses were performed in 14 dogs and pre-, peri-, and postoperative data were recorded. Postoperative clinical and ultrasound evaluations were performed at regular intervals for 1 year. The mean time taken to construct the anastomosis was 5 min. There were no intraoperative complications. Hemorrhage and colonic stricture were the main postoperative complications. Staple loss occurred in 2 cases. Absence of wall layering and focal wall thickening were observed in all cases at each ultrasonographic follow-up. Hyperechoic fat was observed in all but 1 of the cases at month 1. Nine dogs were alive with normal digestive function at the end of the study. The skin stapler technique enabled rapid construction of consistent anastomoses with inexpensive stapling material. PMID:26130833

  17. Analyzing and Detecting Problems in Systems of Systems

    NASA Technical Reports Server (NTRS)

    Lindvall, Mikael; Ackermann, Christopher; Stratton, William C.; Sibol, Deane E.; Godfrey, Sally

    2008-01-01

    Many software systems are evolving into complex systems of systems (SoS) for which inter-system communication is mission-critical. Evidence indicates that transmission failures and performance issues are not uncommon occurrences. In a NASA-supported Software Assurance Research Program (SARP) project, we are researching a new approach addressing such problems. In this paper, we present an approach for analyzing inter-system communications with the goal of uncovering both transmission errors and performance problems. Our approach consists of a visualization and an evaluation component. While the visualization of the observed communication aims to facilitate understanding, the evaluation component automatically checks the conformance of an observed communication (actual) to a desired one (planned). The actual and the planned are represented as sequence diagrams. The evaluation algorithm checks the conformance of the actual to the planned diagram. We have applied our approach to the communication of aerospace systems and were successful in detecting and resolving even subtle and long-existing transmission problems.
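
    The abstract does not describe the evaluation algorithm itself, so the following Python sketch is only a hypothetical illustration of what checking an actual message sequence against a planned one might look like. Representing each message as a (sender, receiver, name) tuple and comparing the two sequences position by position are assumptions made for the example, not the authors' method.

```python
def check_conformance(planned, actual):
    """Compare two message sequences step by step.

    Returns a list of (index, kind, expected, observed) tuples, where kind
    is "mismatch", "missing" or "unexpected". Hypothetical helper.
    """
    issues = []
    for i, expected in enumerate(planned):
        if i >= len(actual):
            # The observed communication ended before the plan did.
            issues.append((i, "missing", expected, None))
        elif actual[i] != expected:
            issues.append((i, "mismatch", expected, actual[i]))
    # Anything observed beyond the end of the plan was not planned.
    for i in range(len(planned), len(actual)):
        issues.append((i, "unexpected", None, actual[i]))
    return issues


# Invented example: the acknowledgement is never transmitted.
planned = [("A", "B", "request"), ("B", "A", "ack"), ("A", "B", "data")]
actual = [("A", "B", "request"), ("A", "B", "data")]
issues = check_conformance(planned, actual)
for issue in issues:
    print(issue)
```

    A positional comparison like this reports every step after a dropped message as a deviation; an alignment-based comparison would localize the fault more precisely, at the cost of a more involved algorithm.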

  18. Evaluating Hydrogen Evolution and Oxidation in Alkaline Media to Establish Baselines

    DOE PAGES

    Alia, Shaun M.; Pivovar, Bryan S.

    2018-04-28

    This paper fills a significant gap in the literature for alkaline hydrogen evolution (HER) and oxidation (HOR) baseline performance, while reviewing the different variables that influence observed properties. Although high-performing HER-HOR catalysts in acidic electrolytes are too active to measure kinetics in rotating disk electrode (RDE) half-cells, under alkaline conditions RDE kinetics evaluations are relevant and half-cell performances are comparable to hydrogen pump data. This paper focuses on best practices to ensure that half-cell tests don't unnecessarily lower platinum group metal (PGM) performance or improve non-PGM performance. Specific aspects examined include experiments on PGMs minimizing the impact of impurities (electrolyte, cell material) and experiments on non-PGMs minimizing the impact from test protocols (counter electrode).

  19. Evaluating Hydrogen Evolution and Oxidation in Alkaline Media to Establish Baselines

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Alia, Shaun M.; Pivovar, Bryan S.

    This paper fills a significant gap in the literature for alkaline hydrogen evolution (HER) and oxidation (HOR) baseline performance, while reviewing the different variables that influence observed properties. Although high-performing HER-HOR catalysts in acidic electrolytes are too active to measure kinetics in rotating disk electrode (RDE) half-cells, under alkaline conditions RDE kinetics evaluations are relevant and half-cell performances are comparable to hydrogen pump data. This paper focuses on best practices to ensure that half-cell tests don't unnecessarily lower platinum group metal (PGM) performance or improve non-PGM performance. Specific aspects examined include experiments on PGMs minimizing the impact of impurities (electrolyte, cell material) and experiments on non-PGMs minimizing the impact from test protocols (counter electrode).

  20. Analysis of navigation performance for the Earth Observing System (EOS) using the TDRSS Onboard Navigation System (TONS)

    NASA Technical Reports Server (NTRS)

    Elrod, B.; Kapoor, A.; Folta, David C.; Liu, K.

    1991-01-01

    Use of the Tracking and Data Relay Satellite System (TDRSS) Onboard Navigation System (TONS) was proposed as an alternative to the Global Positioning System (GPS) for supporting the Earth Observing System (EOS) mission. Results are presented of an EOS navigation performance evaluation with respect to TONS-based orbit, time, and frequency determination (OD/TD/FD). Two TONS modes are considered: one uses scheduled TDRSS forward link service to derive one-way Doppler tracking data for OD/FD support (TONS-I); the other uses an unscheduled navigation beacon service (proposed for Advanced TDRSS) to obtain pseudorange and Doppler data for OD/TD/FD support (TONS-II). Key objectives of the analysis were to evaluate nominal performance and potential sensitivities, such as suboptimal tracking geometry, tracking contact scheduling, and modeling parameter selection. OD/TD/FD performance predictions are presented based on covariance and simulation analyses. EOS navigation scenarios and the contributions of principal error sources impacting performance are also described. The results indicate that a TONS mode can be configured to meet current and proposed EOS position accuracy requirements of 100 and 50 m, respectively.

  1. The dependability of medical students' performance ratings as documented on in-training evaluations.

    PubMed

    van Barneveld, Christina

    2005-03-01

    To demonstrate an approach to obtain an unbiased estimate of the dependability of students' performance ratings during training, when the data-collection design includes nesting of student in rater, unbalanced nest sizes, and dependent observations. In 2003, two variance components analyses of in-training evaluation (ITE) report data were conducted using urGENOVA software. In the first analysis, the dependability for the nested and unbalanced data-collection design was calculated. In the second analysis, an approach using multiple generalizability studies was used to obtain an unbiased estimate of the student variance component, resulting in an unbiased estimate of dependability. Results suggested that there is bias in estimates of the dependability of students' performance on ITEs that is attributable to the data-collection design. When the bias was corrected, the results indicated that the dependability of ratings of student performance was almost zero. The combination of the multiple generalizability studies method and the use of specialized software provides an unbiased estimate of the dependability of ratings of student performance on ITE scores for data-collection designs that include nesting of student in rater, unbalanced nest sizes, and dependent observations.

  2. 47 CFR 301.7 - Waiver of household eligibility.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... and audio segments to evaluate the performance as perceived by a human observer. For subjective...Remote control may have dedicated keys to provide direct access to closed captioning and descriptive...

  3. Racial Earnings Differentials and Performance Pay

    ERIC Educational Resources Information Center

    Heywood, John S.; O'Halloran, Patrick L.

    2005-01-01

    A comparative analysis between output-based payment and time rates payment is presented. It is observed that racial or gender earnings discrimination is more likely in time rates payment and supervisory evaluations.

  4. The relationship between temporomandibular dysfunction and head and cervical posture.

    PubMed

    Matheus, Ricardo Alves; Ramos-Perez, Flávia Maria de Moraes; Menezes, Alynne Vieira; Ambrosano, Gláucia Maria Bovi; Haiter-Neto, Francisco; Bóscolo, Frab Norberto; de Almeida, Solange Maria

    2009-01-01

    This study aimed to evaluate the possibility of any correlation between disc displacement and parameters used for evaluation of skull positioning in relation to the cervical spine: craniocervical angle, suboccipital space between C0-C1, cervical curvature and position of the hyoid bone in individuals with and without symptoms of temporomandibular dysfunction. The patients were evaluated following the guidelines set forth by RDC/TMD. Evaluation was performed by magnetic resonance imaging for establishment of disc positioning in the temporomandibular joints (TMJs) of 30 volunteer patients without temporomandibular dysfunction symptoms and 30 patients with symptoms. Evaluation of skull positioning in relation to the cervical spine was performed on lateral cephalograms achieved with the individual in natural head position. Data were submitted to statistical analysis by Fisher's exact test at 5% significance level. To measure the degree of reproducibility/agreement between surveys, the kappa (K) statistic was used. Significant differences were observed between C0-C1 measurement for both symptomatic (p=0.04) and asymptomatic (p=0.02). No statistical differences were observed regarding craniocervical angle, C1-C2 and hyoid bone position in relation to the TMJs with and without disc displacement. Although a statistically significant difference was found in the C0-C1 space, no association between these and internal temporomandibular joint disorder can be considered. Based on the results observed in this study, no direct relationship could be determined between the presence of disc displacement and the variables assessed.

  5. THE RELATIONSHIP BETWEEN TEMPOROMANDIBULAR DYSFUNCTION AND HEAD AND CERVICAL POSTURE

    PubMed Central

    Matheus, Ricardo Alves; Ramos-Perez, Flávia Maria de Moraes; Menezes, Alynne Vieira; Ambrosano, Gláucia Maria Bovi; Haiter, Francisco; Bóscolo, Frab Norberto; de Almeida, Solange Maria

    2009-01-01

    Objective: This study aimed to evaluate the possibility of any correlation between disc displacement and parameters used for evaluation of skull positioning in relation to the cervical spine: craniocervical angle, suboccipital space between C0-C1, cervical curvature and position of the hyoid bone in individuals with and without symptoms of temporomandibular dysfunction. Material and Methods: The patients were evaluated following the guidelines set forth by RDC/TMD. Evaluation was performed by magnetic resonance imaging for establishment of disc positioning in the temporomandibular joints (TMJs) of 30 volunteer patients without temporomandibular dysfunction symptoms and 30 patients with symptoms. Evaluation of skull positioning in relation to the cervical spine was performed on lateral cephalograms achieved with the individual in natural head position. Data were submitted to statistical analysis by Fisher's exact test at 5% significance level. To measure the degree of reproducibility/agreement between surveys, the kappa (K) statistic was used. Results: Significant differences were observed between C0-C1 measurement for both symptomatic (p=0.04) and asymptomatic (p=0.02). No statistical differences were observed regarding craniocervical angle, C1-C2 and hyoid bone position in relation to the TMJs with and without disc displacement. Although a statistically significant difference was found in the C0-C1 space, no association between these and internal temporomandibular joint disorder can be considered. Conclusion: Based on the results observed in this study, no direct relationship could be determined between the presence of disc displacement and the variables assessed. PMID:19466252
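
    The kappa statistic used above to measure agreement between surveys can be illustrated with a short Python sketch of Cohen's kappa for two sets of categorical ratings. The abstract does not say which kappa variant was applied, so the unweighted two-rater form is an assumption, and the function name and labels below are invented for the example.

```python
from collections import Counter


def cohen_kappa(ratings_1, ratings_2):
    """Unweighted Cohen's kappa for two equal-length lists of labels."""
    n = len(ratings_1)
    # Observed proportion of items on which the two ratings agree.
    p_observed = sum(a == b for a, b in zip(ratings_1, ratings_2)) / n
    # Agreement expected by chance, from each rating's label frequencies.
    counts_1, counts_2 = Counter(ratings_1), Counter(ratings_2)
    labels = counts_1.keys() | counts_2.keys()
    p_expected = sum(counts_1[k] * counts_2[k] for k in labels) / (n * n)
    # Note: undefined (division by zero) when chance agreement equals 1.
    return (p_observed - p_expected) / (1 - p_expected)


# Invented labels: "d" = disc displacement, "n" = normal position.
first_survey = ["d", "d", "n", "n", "d", "n", "n", "d"]
second_survey = ["d", "d", "n", "n", "n", "n", "n", "d"]
kappa = cohen_kappa(first_survey, second_survey)
print(f"kappa = {kappa:.2f}")
```

    Here the two surveys agree on 7 of 8 cases while chance alone would give 0.5, so kappa is 0.75; values near 1 indicate agreement well beyond chance and values near 0 indicate agreement no better than chance.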

  6. The chemistry-climate model ECHAM6.3-HAM2.3-MOZ1.0

    NASA Astrophysics Data System (ADS)

    Schultz, Martin G.; Stadtler, Scarlet; Schröder, Sabine; Taraborrelli, Domenico; Franco, Bruno; Krefting, Jonathan; Henrot, Alexandra; Ferrachat, Sylvaine; Lohmann, Ulrike; Neubauer, David; Siegenthaler-Le Drian, Colombe; Wahl, Sebastian; Kokkola, Harri; Kühn, Thomas; Rast, Sebastian; Schmidt, Hauke; Stier, Philip; Kinnison, Doug; Tyndall, Geoffrey S.; Orlando, John J.; Wespes, Catherine

    2018-05-01

    The chemistry-climate model ECHAM-HAMMOZ contains a detailed representation of tropospheric and stratospheric reactive chemistry and state-of-the-art parameterizations of aerosols using either a modal scheme (M7) or a bin scheme (SALSA). This article describes and evaluates the model version ECHAM6.3-HAM2.3-MOZ1.0 with a focus on the tropospheric gas-phase chemistry. A 10-year model simulation was performed to test the stability of the model and provide data for its evaluation. The comparison to observations concentrates on the year 2008 and includes total column observations of ozone and CO from IASI and OMI, Aura MLS observations of temperature, HNO3, ClO, and O3 for the evaluation of polar stratospheric processes, an ozonesonde climatology, surface ozone observations from the TOAR database, and surface CO data from the Global Atmosphere Watch network. Global budgets of ozone, OH, NOx, aerosols, clouds, and radiation are analyzed and compared to the literature. ECHAM-HAMMOZ performs well in many aspects. However, in the base simulation, lightning NOx emissions are very low, and the impact of the heterogeneous reaction of HNO3 on dust and sea salt aerosol is too strong. Sensitivity simulations with increased lightning NOx or modified heterogeneous chemistry deteriorate the comparison with observations and yield excessively large ozone budget terms and too much OH. We hypothesize that this is an impact of potential issues with tropical convection in the ECHAM model.

  7. Towards improved and more routine Earth system model evaluation in CMIP

    DOE PAGES

    Eyring, Veronika; Gleckler, Peter J.; Heinze, Christoph; ...

    2016-11-01

    The Coupled Model Intercomparison Project (CMIP) has successfully provided the climate community with a rich collection of simulation output from Earth system models (ESMs) that can be used to understand past climate changes and make projections and uncertainty estimates of the future. Confidence in ESMs can be gained because the models are based on physical principles and reproduce many important aspects of observed climate. More research is required to identify the processes that are most responsible for systematic biases and the magnitude and uncertainty of future projections so that more relevant performance tests can be developed. At the same time, there are many aspects of ESM evaluation that are well established and considered an essential part of systematic evaluation but have been implemented ad hoc with little community coordination. Given the diversity and complexity of ESM analysis, we argue that the CMIP community has reached a critical juncture at which many baseline aspects of model evaluation need to be performed much more efficiently and consistently. We provide a perspective and viewpoint on how a more systematic, open, and rapid performance assessment of the large and diverse number of models that will participate in current and future phases of CMIP can be achieved, and announce our intention to implement such a system for CMIP6. Accomplishing this could also free up valuable resources as many scientists are frequently "re-inventing the wheel" by re-writing analysis routines for well-established analysis methods. A more systematic approach for the community would be to develop and apply evaluation tools that are based on the latest scientific knowledge and observational reference, are well suited for routine use, and provide a wide range of diagnostics and performance metrics that comprehensively characterize model behaviour as soon as the output is published to the Earth System Grid Federation (ESGF). 
    The CMIP infrastructure enforces data standards and conventions for model output and documentation accessible via the ESGF, additionally publishing observations (obs4MIPs) and reanalyses (ana4MIPs) for model intercomparison projects using the same data structure and organization as the ESM output. This largely facilitates routine evaluation of the ESMs, but to be able to process the data automatically alongside the ESGF, the infrastructure needs to be extended with processing capabilities at the ESGF data nodes where the evaluation tools can be executed on a routine basis. Efforts are already underway to develop community-based evaluation tools, and we encourage experts to provide additional diagnostic codes that would enhance this capability for CMIP. And, at the same time, we encourage the community to contribute observations and reanalyses for model evaluation to the obs4MIPs and ana4MIPs archives. The intention is to produce through the ESGF a widely accepted quasi-operational evaluation framework for CMIP6 that would routinely execute a series of standardized evaluation tasks. Over time, as this capability matures, we expect to produce an increasingly systematic characterization of models which, compared with early phases of CMIP, will more quickly and openly identify the strengths and weaknesses of the simulations. This will also reveal whether long-standing model errors remain evident in newer models and will assist modelling groups in improving their models. Finally, this framework will be designed to readily incorporate updates, including new observations and additional diagnostics and metrics as they become available from the research community.

  8. Combining a wavelet transform with a channelized Hotelling observer for tumor detection in 3D PET oncology imaging

    NASA Astrophysics Data System (ADS)

    Lartizien, Carole; Tomei, Sandrine; Maxim, Voichita; Odet, Christophe

    2007-03-01

    This study evaluates new observer models for 3D whole-body Positron Emission Tomography (PET) imaging based on a wavelet sub-band decomposition and compares them with the classical constant-Q CHO model. Our final goal is to develop an original method that performs guided detection of abnormal activity foci in PET oncology imaging based on these new observer models. This computer-aided diagnostic method would greatly benefit clinicians for diagnostic purposes and biologists for mass screening of rodent populations in molecular imaging. Method: We have previously shown good correlation of the channelized Hotelling observer (CHO) using a constant-Q model with human observer performance for 3D PET oncology imaging. We propose an alternate method based on combining a CHO observer with a wavelet sub-band decomposition of the image and we compare it to the standard CHO implementation. This method performs an undecimated transform using a biorthogonal B-spline 4/4 wavelet basis to extract the feature set for input to the Hotelling observer. This work is based on simulated 3D PET images of an extended MCAT phantom with randomly located lesions. We compare three evaluation criteria: classification performance using the signal-to-noise ratio (SNR), computational efficiency and visual quality of the derived 3D maps of the decision variable λ. The SNR is estimated on a series of test images for a variable number of training images for both observers. Results: Results show that the maximum SNR is higher with the constant-Q CHO observer, especially for targets located in the liver, and that it is reached with a smaller number of training images. However, preliminary analysis indicates that the visual quality of the 3D maps of the decision variable λ is higher with the wavelet-based CHO and the computation time to derive a 3D λ-map is about 350 times shorter than for the standard CHO.
This suggests that the wavelet-CHO observer is a good candidate for use in our guided detection method.
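
    For reference, the quantities named in this abstract can be written in the standard textbook form of a channelized Hotelling observer. This is a generic sketch and not necessarily the authors' exact formulation: the channel matrix $U$, image vector $g$, class means $\bar v_0$ (lesion absent) and $\bar v_1$ (lesion present), and covariances $S_0$, $S_1$ are notation assumed here. In the wavelet variant, the columns of $U$ would presumably be wavelet sub-band features rather than constant-Q channels.

```latex
\begin{align}
  v &= U^{\mathsf T} g
      && \text{(channel outputs for image } g\text{)} \\
  S &= \tfrac{1}{2}\,(S_0 + S_1)
      && \text{(average channel covariance)} \\
  w &= S^{-1}\,(\bar v_1 - \bar v_0)
      && \text{(Hotelling template)} \\
  \lambda &= w^{\mathsf T} v
      && \text{(decision variable)} \\
  \mathrm{SNR}^2 &= (\bar v_1 - \bar v_0)^{\mathsf T} S^{-1} (\bar v_1 - \bar v_0)
      && \text{(detectability)}
\end{align}
```

    In practice the means and covariances are estimated from training images, which is why the abstract reports SNR as a function of the number of training images.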

  9. Role of observation of live cases done by Japanese experts in the acquisition of ESD skills by a western endoscopist.

    PubMed

    Draganov, Peter V; Chang, Myron; Coman, Roxana M; Wagh, Mihir S; An, Qi; Gotoda, Takuji

    2014-04-28

    To evaluate the role of observation of experts performing endoscopic submucosal dissection (ESD) in the acquisition of ESD skills. This prospective study documents the learning curve of one Western endoscopist. The study consisted of three periods. In the first period (pre-observation), the trainee performed ESDs in animal models in his home institution in the United States. The second period (observation) consisted of a visit to Japan and observation of live ESD cases done by experts. The observation of cases occurred over a 5-wk period. During the third period (post-observation), the trainee performed ESD in animal models in a similar fashion as in the first period. Three animal models were used: live 40-50 kg Yorkshire pig, explanted pig stomach model, and explanted pig rectum model. The outcomes from the ESDs done in the animal models before and after observation of live human cases (main study intervention) were compared. Statistical analysis of the data included: Fisher's exact test to compare distributions of a categorical variable, Wilcoxon rank sum test to compare distributions of a continuous variable between the two groups (pre-observation and post-observation), and Kruskal-Wallis test to evaluate the impact of lesion location and type of model (ex-vivo vs live pig) on lesion removal time. The trainee performed 38 ESDs in animal models (29 pre-observation/9 post-observation). The removal times post-observation were significantly shorter than those pre-observation (32.7 ± 15.0 min vs 63.5 ± 9.8 min, P < 0.001). To minimize the impact of improving physician skill, the 9 lesions post-observation were compared to the last 9 lesions pre-observation and the removal times remained significantly shorter (32.7 ± 15.0 min vs 61.0 ± 7.4 min, P = 0.0011). Regression analysis showed that ESD observation significantly reduced removal time when controlling for the sequence of lesion removal (P = 0.025).
Furthermore, a trend was noted towards fewer failures to remove lesions and fewer complications after the period of observation. This study did not find a significant difference in the time needed to remove lesions in different animal models. This finding could have important implications in designing training programs due to the substantial difference in cost between live animal and explanted organ models. The main limitation of this study is that it reflects the experience of a single endoscopist. Observation of experts performing ESD over a short period of time can significantly contribute to the acquisition of ESD skills.

  10. First On-Site Data Analysis System for Subaru/Suprime-Cam

    NASA Astrophysics Data System (ADS)

    Furusawa, Hisanori; Okura, Yuki; Mineo, Sogo; Takata, Tadafumi; Nakata, Fumiaki; Tanaka, Manobu; Katayama, Nobuhiko; Itoh, Ryosuke; Yasuda, Naoki; Miyazaki, Satoshi; Komiyama, Yutaka; Utsumi, Yousuke; Uchida, Tomohisa; Aihara, Hiroaki

    2011-03-01

    We developed an automated on-site quick analysis system for mosaic CCD data of Suprime-Cam, which is a wide-field camera mounted at the prime focus of the Subaru Telescope, Mauna Kea, Hawaii. The first version of the data-analysis system was constructed, and started to operate in general observations. This system is a new function of observing support at the Subaru Telescope to provide the Subaru user community with an automated on-site data evaluation, aiming at improvements of observers' productivity, especially in large imaging surveys. The new system assists the data evaluation tasks in observations by the continuous monitoring of the characteristics of every data frame during observations. The evaluation results and data frames processed by this system are also useful for reducing the data-processing time in a full analysis after an observation. The primary analysis functions implemented in the data-analysis system are composed of automated realtime analysis for data evaluation and on-demand analysis, which is executed upon request, including mosaicing analysis and flat making analysis. In data evaluation, which is controlled by the organizing software, the database keeps track of the analysis histories, as well as the evaluated values of data frames, including seeing and sky background levels; it also helps in the selection of frames for mosaicing and flat making analysis. We examined the system performance and confirmed an improvement in the data-processing time by a factor of 9 with the aid of distributed parallel data processing and on-memory data processing, which makes the automated data evaluation effective.

  11. High-definition television evaluation for remote handling task performance

    NASA Astrophysics Data System (ADS)

    Fujita, Y.; Omori, E.; Hayashi, S.; Draper, J. V.; Herndon, J. N.

    Described are experiments designed to evaluate the impact of HDTV (High-Definition Television) on the performance of typical remote tasks. The experiments described in this paper compared the performance of four operators using HDTV with their performance while using other television systems. The experiments included four television systems: (1) high-definition color television, (2) high-definition monochromatic television, (3) standard-resolution monochromatic television, and (4) standard-resolution stereoscopic monochromatic television. The stereo system accomplished stereoscopy by displaying two cross-polarized images, one reflected by a half-silvered mirror and one seen through the mirror. Observers wore spectacles with cross-polarized lenses so that the left eye received only the view from the left camera and the right eye received only the view from the right camera.

  12. Performance evaluation of Bragg coherent diffraction imaging

    NASA Astrophysics Data System (ADS)

    Öztürk, H.; Huang, X.; Yan, H.; Robinson, I. K.; Noyan, I. C.; Chu, Y. S.

    2017-10-01

    In this study, we present a numerical framework for modeling three-dimensional (3D) diffraction data in Bragg coherent diffraction imaging (Bragg CDI) experiments and evaluating the quality of obtained 3D complex-valued real-space images recovered by reconstruction algorithms under controlled conditions. The approach is used to systematically explore the performance and the detection limit of this phase-retrieval-based microscopy tool. The numerical investigation suggests that the superb performance of Bragg CDI is achieved with an oversampling ratio above 30 and a detection dynamic range above 6 orders of magnitude. The observed performance degradation subject to the data binning processes is also studied. This numerical tool can be used to optimize experimental parameters and has the potential to significantly improve the throughput of the Bragg CDI method.

  13. Quality of Life, Psychological Burden, and Sleep Quality in Patients With Brain Metastasis Undergoing Whole Brain Radiation Therapy.

    PubMed

    Teke, Fatma; Bucaktepe, Pakize; Kıbrıslı, Erkan; Demir, Melike; Ibiloglu, Aslıhan; Inal, Ali

    2016-10-01

    Patients with brain metastasis (BM) usually suffer from poor quality of life (QOL), anxiety, depression, and sleep disorders in their reduced lifespan. The aim of this study was to evaluate QOL, anxiety, depression, and sleep characteristics in patients with BM at the beginning and end of whole brain radiation therapy (WBRT) and three months after treatment. Thirty-three patients undergoing WBRT for BM were featured in this study. The authors used the Karnofsky Performance Status (KPS) scale to measure performance status, the Hospital Anxiety and Depression Scale (HADS) to evaluate anxiety and depression, the SF-36® to evaluate health-related QOL, and the Pittsburgh Sleep Quality Index to evaluate sleep disorders at the start of WBRT, the end of WBRT, and three months after WBRT. Statistically significant improvements were noted in KPS scores from baseline evaluation to the end of WBRT and to three months after WBRT. No significant differences were observed in SF-36 and HADS scores between the start and the end of WBRT. Anxiety scores were negatively correlated with survival at the end of WBRT. Overall survival was better in those who reported better sleep. WBRT improves KPS scores and does not worsen sleep quality or mood, even in patients with poor performance status. When changes in mood and sleep quality are observed, survival and QOL may improve in patients with BM; consequently, nurses should be responsive to these changes.

  14. Evaluating a more cost-efficient alternative to providing in-home feedback to parents: the use of spousal feedback.

    PubMed Central

    Harris, T A; Peterson, S L; Filliben, T L; Glassberg, M; Favell, J E

    1998-01-01

    We evaluated the contribution of spousal feedback to a parent education curriculum designed for parents of children with autism. A modified multiple baseline design across 3 husband-and-wife dyads was used to examine the effects of teaching parents to give each other feedback on their teaching performance. For 5 of 6 participants, improvement in teaching performance occurred following didactic presentations. However, additional improvement was observed for 5 participants when the spousal feedback component was implemented. PMID:9532757

  15. Cultural values and performance appraisal: assessing the effects of rater self-construal on performance ratings.

    PubMed

    Mishra, Vipanchi; Roch, Sylvia G

    2013-01-01

    Much of the prior research investigating the influence of cultural values on performance ratings has focused either on conducting cross-national comparisons among raters or on using cultural-level individualism/collectivism scales to measure the effects of cultural values on performance ratings. Recent research has shown that there is considerable within-country variation in cultural values, i.e. people in one country can be more individualistic or collectivistic in nature. Taking the latter perspective, the present study used Markus and Kitayama's (1991) conceptualization of independent and interdependent self-construals as measures of individual variations in cultural values to investigate within-culture variations in performance ratings. Results suggest that rater self-construal has a significant influence on overall performance evaluations; specifically, raters with a highly interdependent self-construal tend to show a preference for interdependent ratees, whereas raters high on independent self-construal do not show a preference for a specific type of ratee when making overall performance evaluations. Although rater self-construal significantly influenced overall performance evaluations, no such effects were observed for specific dimension ratings. Implications of these results for performance appraisal research and practice are discussed.

  16. The Role of Peer-Assisted Learning in Building Evaluative Judgement: Opportunities in Clinical Medical Education

    ERIC Educational Resources Information Center

    Tai, Joanna Hong-Meng; Canny, Benedict J.; Haines, Terry P.; Molloy, Elizabeth K.

    2016-01-01

    This study explored the contribution of peer-assisted learning (PAL) in the development of evaluative judgement capacity; the ability to understand work quality and apply those standards to appraising performance. The study employed a mixed methods approach, collecting self-reported survey data, observations of, and reflective interviews with, the…

  17. EVALUATING THE PERFORMANCE OF REGIONAL-SCALE PHOTOCHEMICAL MODELING SYSTEMS: PART II--OZONE PREDICTIONS. (R825260)

    EPA Science Inventory

    In this paper, the concept of scale analysis is applied to evaluate ozone predictions from two regional-scale air quality models. To this end, seasonal time series of observations and predictions from the RAMS3b/UAM-V and MM5/MAQSIP (SMRAQ) modeling systems for ozone were spectra...

  18. Evaluating the Impact and Determinants of Student Team Performance: Using LMS and CATME Data

    ERIC Educational Resources Information Center

    Braender, Lynn M.; Naples, Michele I.

    2013-01-01

    Practitioners find it difficult to allocate grades to individual students based on their contributions to the team project. They often use classroom observation of teamwork and student peer evaluations to differentiate an individual's grade from the group's grade, which can be subjective and imprecise. We used objective data from student activity…

  19. Teacher Evaluation Project. The Beginning Teacher Program, Intellectual Skills Development, Validity Studies of the Evaluation System, Special Instrument Development. Report for 1984-1985.

    ERIC Educational Resources Information Center

    Florida Coalition for the Development of a Performance Measurement System, Tallahassee.

    Reports, summaries, and recommendations are presented on the following research studies: (1) Beginning Teacher Studies; (2) Instructional Skills for Teaching Higher Order Thinking; (3) Development of the Conferential Observation Instrument; (4) Predictive Validity Studies Conducted to Test the Relationship Between Teacher Performance as Measured…

  20. An Evaluation of the Observer Effect on Treatment Integrity in a Day Treatment Center for Children

    ERIC Educational Resources Information Center

    Howard, Monica R.; Burke, Raymond V.; Allen, Keith D.

    2013-01-01

    Treatment integrity is an important concern in treatment centers but is often overlooked. Performance feedback is a well-established approach to improving treatment integrity, but is underused and undervalued. One way to increase its value to treatment centers may be to expose unrealized benefits on the observer who collects the performance…

  1. Assessing Mental Health First Aid Skills Using Simulated Patients

    PubMed Central

    Chen, Timothy F.; Moles, Rebekah J.; O’Reilly, Claire

    2018-01-01

    Objective. To evaluate mental health first aid (MHFA) skills using simulated patients and to compare self-reported confidence in providing MHFA with performance during simulated patient roleplays. Methods. Pharmacy students self-evaluated their confidence in providing MHFA post-training. Two mental health vignettes and an assessment rubric based on the MHFA Action Plan were developed to assess students’ observed MHFA skills during audio-recorded simulated patient roleplays. Results. There were 163 students who completed the MHFA training, of whom 88% completed self-evaluations. Between 84% and 98% of students self-reported that they agreed or strongly agreed they were confident providing MHFA. Postnatal depression (PND) and suicide vignettes were randomly assigned to 36 students. More students participating in the PND roleplay took appropriate actions, compared to those participating in the suicide roleplay. However, more students participating in the suicide roleplay assessed alcohol and/or drug use. Ten (71%) participants in the PND roleplay and six (40%) in the suicide roleplay either avoided using suicide-specific terminology completely or used multiple terms rendering their inquiry unclear. Conclusion. Self-evaluated confidence levels in providing MHFA did not always reflect observed performance. Students had difficulty addressing suicide with only half passing the suicide vignette and many avoiding suicide-specific terminology. This indicates that both self-reported and observed behaviors should be used for post-training assessments. PMID:29606711

  2. Study on the influence of ground and satellite observations on the numerical air-quality for PM10 over Romanian territory

    NASA Astrophysics Data System (ADS)

    Dumitrache, Rodica Claudia; Iriza, Amalia; Maco, Bogdan Alexandru; Barbu, Cosmin Danut; Hirtl, Marcus; Mantovani, Simone; Nicola, Oana; Irimescu, Anisoara; Craciunescu, Vasile; Ristea, Alina; Diamandi, Andrei

    2016-10-01

    The numerical forecast of particulate matter concentrations in general, and PM10 in particular, is a theme of high socio-economic relevance. The aim of this study was to investigate the impact of ground and satellite data assimilation of PM10 observations into the Weather Research and Forecasting model coupled with Chemistry (WRF-CHEM) numerical air quality model for Romanian territory. This is the first initiative of its kind for this domain of interest. Assimilation of satellite information (e.g. AOT) in air quality models is of interest due to the vast spatial coverage of the observations. Support Vector Regression (SVR) techniques are used to estimate the PM content from heterogeneous data sources, including EO products (Aerosol Optical Thickness, AOT), ground measurements and numerical model data (temperature, humidity, wind, etc.). In this study we describe the modeling framework employed and present the evaluation of the impact of the data assimilation of PM10 observations on the forecast of the WRF-CHEM model. Integrations of the WRF-CHEM model in data assimilation enabled/disabled configurations allowed the evaluation of satellite and ground data assimilation impact on the PM10 forecast performance for the Romanian territory. The model integration and evaluation were performed for two months, one in winter conditions (January 2013) and one in summer conditions (June 2013).

  3. The ICI classification for calcaneal injuries: a validation study.

    PubMed

    Frima, Herman; Eshuis, Rienk; Mulder, Paul; Leenen, Luke

    2012-06-01

    The integral classification of injuries (ICI) by Zwipp et al. has been developed as a classification system for injuries of the bones, joints, cartilage and ligaments of the foot. It follows the principles of the comprehensive classification of fractures by Müller et al. The ICI was developed for 'everyday use' and scientific purposes. Our aim was to perform a validation study for this classification system applied to calcaneal injuries. A panel of five experienced trauma and orthopaedic surgeons evaluated the ICI score in 20 calcaneal injuries. After 2 months, a second classification was performed in a different order. Inter- and intra-observer variability were evaluated by kappa statistics. Panel members were not able to evaluate capsule and ligamental injuries based on X-ray and computed tomography (CT) films. Two injuries were excluded for logistical reasons. The inter-observer agreement based on 18 injuries of bone and joints was slight; kappa 0.14 (90% confidence interval (CI): 0.05-0.22). The intra-observer agreement was fair; kappa 0.31 (90% CI: 0.22-0.41). Overall, the panel rated the system as very complicated and not practical. The ICI is a complicated classification system with slight to fair inter- and intra-observer agreement. It might not be a practical classification system for calcaneal injuries in 'everyday use' or for scientific purposes. Copyright © 2011 Elsevier Ltd. All rights reserved.

  4. Evaluation of Flagging Criteria of United States Kidney Transplant Center Performance: How to Best Define Outliers?

    PubMed

    Schold, Jesse D; Miller, Charles M; Henry, Mitchell L; Buccini, Laura D; Flechner, Stuart M; Goldfarb, David A; Poggio, Emilio D; Andreoni, Kenneth A

    2017-06-01

    Scientific Registry of Transplant Recipients report cards of US organ transplant center performance are publicly available and used for quality oversight. Low center performance (LP) evaluations are associated with changes in practice including reduced transplant rates and increased waitlist removals. In 2014, Scientific Registry of Transplant Recipients implemented new Bayesian methodology to evaluate performance, which was not adopted by the Centers for Medicare and Medicaid Services (CMS). In May 2016, CMS altered their performance criteria, reducing the likelihood of LP evaluations. Our aims were to evaluate incidence, survival rates, and volume of LP centers with Bayesian, historical (old-CMS) and new-CMS criteria using 6 consecutive program-specific reports (PSR), January 2013 to July 2015 among adult kidney transplant centers. Bayesian, old-CMS and new-CMS criteria identified 13.4%, 8.3%, and 6.1% LP PSRs, respectively. Over the 3-year period, 31.9% (Bayesian), 23.4% (old-CMS), and 19.8% (new-CMS) of centers had 1 or more LP evaluation. For small centers (<83 transplants/PSR), there were 4-fold additional LP evaluations (52 vs 13 PSRs) for 1-year mortality with Bayesian versus new-CMS criteria. For large centers (>183 transplants/PSR), there were 3-fold additional LP evaluations for 1-year mortality with Bayesian versus new-CMS criteria with median differences in observed and expected patient survival of -1.6% and -2.2%, respectively. A significant proportion of kidney transplant centers are identified as low performing with relatively small survival differences compared with expected. Bayesian criteria have significantly higher flagging rates and new-CMS criteria modestly reduce flagging. Critical appraisal of performance criteria is needed to assess whether quality oversight is meeting intended goals and whether further modifications could reduce risk aversion, more efficiently allocate resources, and increase transplant opportunities.

  5. Using a serious game to complement CPR instruction in a nurse faculty.

    PubMed

    Boada, Imma; Rodriguez-Benitez, Antonio; Garcia-Gonzalez, Juan Manuel; Olivet, Josep; Carreras, Vicenç; Sbert, Mateu

    2015-11-01

    Cardiopulmonary resuscitation (CPR) is a key first aid survival technique used to stimulate breathing and keep blood flowing to the heart. Its effective administration can significantly increase the chances of survival for victims of cardiac arrest. LISSA is a serious game designed to complement CPR teaching and also to refresh CPR skills in an enjoyable way. The game presents an emergency situation in a 3D virtual environment and the player has to save the victim by applying the CPR actions. In this paper, we describe LISSA and its evaluation in a population composed of 109 nursing undergraduate students enrolled in the Nursing degree of our university. To evaluate LISSA we performed a randomized controlled trial that compares the classical teaching methodology, composed of self-directed learning for theory plus laboratory sessions with a mannequin for practice, with the one that uses LISSA after self-directed learning for theory and before laboratory sessions with a mannequin. From our evaluation we observed that students using LISSA (Group 2 and 3) achieved significantly better learning acquisition scores than those following traditional classes (Group 1). To evaluate the differences between students of these groups we performed a paired samples t-test between Group 1 and 2 (μ1=35.67, μ2=47.50 and p<0.05) and between students of Group 1 and 3 (μ1=35.67, μ3=50.58 and p<0.05). From these tests we observed that there are significant differences in both cases. We also evaluated student performance of the main steps of the CPR protocol. Students who used LISSA performed better than those who did not. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  6. Reliability issues in active control of large flexible space structures

    NASA Technical Reports Server (NTRS)

    Vandervelde, W. E.

    1986-01-01

    Efforts in this reporting period were centered on four research tasks: design of failure detection filters for robust performance in the presence of modeling errors, design of generalized parity relations for robust performance in the presence of modeling errors, design of failure sensitive observers using the geometric system theory of Wonham, and computational techniques for evaluation of the performance of control systems with fault tolerance and redundancy management.

  7. Surface wind mixing in the Regional Ocean Modeling System (ROMS)

    NASA Astrophysics Data System (ADS)

    Robertson, Robin; Hartlipp, Paul

    2017-12-01

    Mixing at the ocean surface is key for atmosphere-ocean interactions and the distribution of heat, energy, and gases in the upper ocean. Winds are the primary force for surface mixing. To properly simulate upper ocean dynamics and the flux of these quantities within the upper ocean, models must reproduce mixing in the upper ocean. To evaluate the performance of the Regional Ocean Modeling System (ROMS) in replicating the surface mixing, the results of four different vertical mixing parameterizations were compared against observations, using the surface mixed layer depth, the temperature fields, and observed diffusivities for comparisons. The vertical mixing parameterizations investigated were Mellor-Yamada 2.5 level turbulent closure (MY), Large-McWilliams-Doney Kpp (LMD), Nakanishi-Niino (NN), and the generic length scale (GLS) schemes. This was done for one temperate site in deep water in the Eastern Pacific and three shallow water sites in the Baltic Sea. The model reproduced the surface mixed layer depth reasonably well for all sites; however, the temperature fields were reproduced well for the deep site, but not for the shallow Baltic Sea sites. In the Baltic Sea, the models overmixed the water column after a few days. Vertical temperature diffusivities were higher than those observed and did not show the temporal fluctuations present in the observations. The best performance was by NN and MY; however, MY became unstable in two of the shallow simulations with high winds. The performance of GLS was nearly as good as that of NN and MY. LMD had the poorest performance as it generated temperature diffusivities that were too high and induced too much mixing. Further observational comparisons are needed to evaluate the effects of different stratification and wind conditions and the limitations on the vertical mixing parameterizations.

  8. A Computational Framework for Quantitative Evaluation of Movement during Rehabilitation

    NASA Astrophysics Data System (ADS)

    Chen, Yinpeng; Duff, Margaret; Lehrer, Nicole; Sundaram, Hari; He, Jiping; Wolf, Steven L.; Rikakis, Thanassis

    2011-06-01

    This paper presents a novel generalized computational framework for quantitative kinematic evaluation of movement in a rehabilitation clinic setting. The framework integrates clinical knowledge and computational data-driven analysis together in a systematic manner. The framework provides three key benefits to rehabilitation: (a) the resulting continuous normalized measure allows the clinician to monitor movement quality on a fine scale and easily compare impairments across participants, (b) the framework reveals the effect of individual movement components on the composite movement performance, helping the clinician decide the training foci, and (c) the evaluation runs in real-time, which allows the clinician to constantly track a patient's progress and make appropriate adaptations to the therapy protocol. The creation of such an evaluation is difficult because of the sparse amount of recorded clinical observations, the high dimensionality of movement and high variations in subjects' performance. We address these issues by modeling the evaluation function as a linear combination of multiple normalized kinematic attributes, $y = \sum_i w_i \phi_i(x_i)$, and estimating the attribute normalization function $\phi_i(\cdot)$ by integrating distributions of idealized movement and deviated movement. The weights $w_i$ are derived from a therapist's pair-wise comparison using a modified RankSVM algorithm. We have applied this framework to evaluate upper limb movement for stroke survivors with excellent results—the evaluation results are highly correlated to the therapist's observations.
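
    The weighted-sum form of the evaluation function described in this abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: `clipped_linear` is a hypothetical stand-in for the normalization functions (which the abstract says are estimated from distributions of idealized and deviated movement), and the weights and attribute values are invented (in the paper the weights come from a modified RankSVM).

```python
from typing import Callable, List, Sequence


def clipped_linear(ideal: float, worst: float) -> Callable[[float], float]:
    """Hypothetical normalizer: 1.0 at `ideal`, 0.0 at `worst`, linear in between."""
    if ideal == worst:
        raise ValueError("ideal and worst must differ")

    def phi(value: float) -> float:
        score = (worst - value) / (worst - ideal)
        return max(0.0, min(1.0, score))  # clip to the range [0, 1]

    return phi


def composite_score(
    x: Sequence[float],
    weights: Sequence[float],
    normalizers: Sequence[Callable[[float], float]],
) -> float:
    """Weighted sum of normalized attributes: sum of w_i * phi_i(x_i)."""
    if not (len(x) == len(weights) == len(normalizers)):
        raise ValueError("x, weights and normalizers must have equal length")
    return sum(w * phi(xi) for xi, w, phi in zip(x, weights, normalizers))


# Illustrative values only (assumed, not taken from the paper):
# attribute 0 = trajectory error in cm, attribute 1 = relative speed deviation.
normalizers: List[Callable[[float], float]] = [
    clipped_linear(ideal=0.0, worst=10.0),
    clipped_linear(ideal=0.0, worst=1.0),
]
weights = [0.6, 0.4]
score = composite_score([2.5, 0.5], weights, normalizers)
print(round(score, 3))
```

    Because each normalized attribute lies in [0, 1] and the assumed weights sum to 1, the composite score also lies in [0, 1], which is one way to obtain the continuous normalized measure the abstract mentions.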

  9. Experimental evaluation of nonclassical correlations between measurement outcomes and target observable in a quantum measurement

    NASA Astrophysics Data System (ADS)

    Iinuma, Masataka; Suzuki, Yutaro; Nii, Taiki; Kinoshita, Ryuji; Hofmann, Holger F.

    2016-03-01

    In general, it is difficult to evaluate measurement errors when the initial and final conditions of the measurement make it impossible to identify the correct value of the target observable. Ozawa proposed a solution based on the operator algebra of observables which has recently been used in experiments investigating the error-disturbance trade-off of quantum measurements. Importantly, this solution makes surprisingly detailed statements about the relations between measurement outcomes and the unknown target observable. In the present paper, we investigate this relation by performing a sequence of two measurements on the polarization of a photon, so that the first measurement commutes with the target observable and the second measurement is sensitive to a complementary observable. While the initial measurement can be evaluated using classical statistics, the second measurement introduces the effects of quantum correlations between the noncommuting physical properties. By varying the resolution of the initial measurement, we can change the relative contribution of the nonclassical correlations and identify their role in the evaluation of the quantum measurement. It is shown that the most striking deviation from classical expectations is obtained at the transition between weak and strong measurements, where the competition between different statistical effects results in measurement values well outside the range of possible eigenvalues.

  10. The Pfirrmann classification of lumbar intervertebral disc degeneration: an independent inter- and intra-observer agreement assessment.

    PubMed

    Urrutia, Julio; Besa, Pablo; Campos, Mauricio; Cikutovic, Pablo; Cabezon, Mario; Molina, Marcelo; Cruz, Juan Pablo

    2016-09-01

    Grading inter-vertebral disc degeneration (IDD) is important in the evaluation of many degenerative conditions, including patients with low back pain. Magnetic resonance imaging (MRI) is considered the best imaging instrument to evaluate IDD. The Pfirrmann classification is commonly used to grade IDD; the authors describing this classification showed an adequate agreement using it; however, there has been a paucity of independent agreement studies using this grading system. The aim of this study was to perform an independent inter- and intra-observer agreement study using the Pfirrmann classification. T2-weighted sagittal images of 79 patients consecutively studied with lumbar spine MRI were classified using the Pfirrmann grading system by six evaluators (three spine surgeons and three radiologists). After a 6-week interval, the 79 cases were presented to the same evaluators in a random sequence for repeat evaluation. The intra-class correlation coefficient (ICC) and the weighted kappa (wκ) were used to determine the inter- and intra-observer agreement. The inter-observer agreement was excellent, with an ICC = 0.94 (0.93-0.95) and wκ = 0.83 (0.74-0.91). There were no differences between spine surgeons and radiologists. Likewise, there were no differences in agreement evaluating the different lumbar discs. Most differences among observers were only of one grade. Intra-observer agreement was also excellent with ICC = 0.86 (0.83-0.89) and wκ = 0.89 (0.85-0.93). In this independent study, the Pfirrmann classification demonstrated an adequate agreement among different observers and by the same observer on separate occasions. Furthermore, it allows communication between radiologists and spine surgeons.
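
    As a rough illustration of the weighted kappa statistic used above, the following sketch computes it for two raters on an ordinal five-grade scale. The abstract does not say which weighting scheme the study used, so linear disagreement weights are an assumption here, and the ratings in the usage lines are invented for illustration.

```python
def weighted_kappa(rater_a, rater_b, categories):
    """Weighted kappa for two raters on an ordinal scale.

    Uses linear disagreement weights |i - j| / (k - 1); this weighting
    is an assumption, not something stated in the abstract.
    """
    k = len(categories)
    index = {c: i for i, c in enumerate(categories)}
    n = len(rater_a)

    # Observed joint proportions of (grade by A, grade by B).
    observed = [[0.0] * k for _ in range(k)]
    for a, b in zip(rater_a, rater_b):
        observed[index[a]][index[b]] += 1.0 / n

    # Marginal proportions for each rater.
    row = [sum(observed[i]) for i in range(k)]
    col = [sum(observed[i][j] for i in range(k)) for j in range(k)]

    weighted_observed = 0.0
    weighted_expected = 0.0
    for i in range(k):
        for j in range(k):
            w = abs(i - j) / (k - 1)
            weighted_observed += w * observed[i][j]
            weighted_expected += w * row[i] * col[j]
    return 1.0 - weighted_observed / weighted_expected


# Invented ratings on the five Pfirrmann grades (1 to 5).
grades = [1, 2, 3, 4, 5]
kappa_same = weighted_kappa([1, 2, 3, 4, 5], [1, 2, 3, 4, 5], grades)
kappa_close = weighted_kappa([1, 2, 3, 4, 5], [2, 2, 3, 4, 5], grades)
```

    With these weights a one-grade disagreement is penalized less than a larger one, which fits the observation that most differences among observers were of only one grade.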

  11. UNIX-based operating systems robustness evaluation

    NASA Technical Reports Server (NTRS)

    Chang, Yu-Ming

    1996-01-01

    Robust operating systems are required for reliable computing. Techniques for robustness evaluation of operating systems not only enhance the understanding of the reliability of computer systems, but also provide valuable feedback to system designers. This thesis presents results from robustness evaluation experiments on five UNIX-based operating systems, which include Digital Equipment's OSF/1, Hewlett Packard's HP-UX, Sun Microsystems' Solaris and SunOS, and Silicon Graphics' IRIX. Three sets of experiments were performed. The methodology for evaluation tested (1) the exception handling mechanism, (2) system resource management, and (3) system capacity under high workload stress. An exception generator was used to evaluate the exception handling mechanism of the operating systems. Results included exit status of the exception generator and the system state. Resource management techniques used by individual operating systems were tested using programs designed to usurp system resources such as physical memory and process slots. Finally, the workload stress testing evaluated the effect of the workload on system performance by running a synthetic workload and recording the response time of local and remote user requests. Moderate to severe performance degradations were observed on the systems under stress.

  12. Evolution of short cognitive test performance in stroke patients with vascular cognitive impairment and vascular dementia: Baseline evaluation and follow-up

    PubMed Central

    Custodio, Nilton; Montesinos, Rosa; Lira, David; Herrera-Perez, Eder; Bardales, Yadira; Valeriano-Lorenzo, Lucia

    2017-01-01

    ABSTRACT. There is limited evidence about the progression of cognitive performance during the post-stroke stage. Objective: To assess the evolution of cognitive performance in stroke patients without vascular cognitive impairment (VCI), patients with vascular mild cognitive impairment (MCI), and patients with vascular dementia (VD). Methods: A prospective cohort of stroke outpatients from two secondary medical centers in Lima, Peru was studied. We performed standardized evaluations at definitive diagnosis (baseline evaluation), and control follow-ups at 6 and 12 months, including a battery of short cognitive tests: Clinical Dementia Rating (CDR), Addenbrooke's Cognitive Examination (ACE), and INECO Frontal Screening (IFS). Results: 152 outpatients completed the follow-up, showing progressive increase in mean score on the CDR(0.34 to 0.46), contrary to the pattern observed on the ACE and IFS (78.18 to 76.48 and 23.63 to 22.24). The box plot for the CDR test showed that VCI patients had progressive worsening (0.79 to 0.16). Conversely, this trend was not observed in subjects without VCI. The box plot for the ACE and IFS showed that, for the majority of the differentiated stroke types, both non-VCI and VCI patients had progressive worsening. Conclusion: According to both ACE and IFS results during a 1-year follow-up, the cognitive performance of stroke patients worsened, a trend which was particularly consistent in infarction-type stroke patients. PMID:29354218

  13. Spatio-temporal pattern clustering for skill assessment of the Korea Operational Oceanographic System

    NASA Astrophysics Data System (ADS)

    Kim, J.; Park, K.

    2016-12-01

    To evaluate the performance of operational forecast models in the Korea Operational Oceanographic System (KOOS), which has been developed by the Korea Institute of Ocean Science and Technology (KIOST), a skill assessment (SA) tool has been developed. It provides multiple skill metrics, including not only correlation and error skills obtained by comparing predictions and observations but also pattern clustering with numerical models, satellite data, and observations. The KOOS produces 72-hour forecast information on the atmospheric and hydrodynamic forecast variables of wind, pressure, current, tide, wave, temperature, and salinity every 12 hours by operating numerical models such as WRF, ROMS, MOM5, WW-III, and SWAN, and the SA has been conducted to evaluate the forecasts. We have been operating several kinds of numerical models, such as WRF, ROMS, MOM5, MOHID, and WW-III. Quantitative assessment of operational ocean forecast models is very important, both to provide accurate ocean forecast information to the general public and to help address ocean-related problems. In this work, we propose a method of pattern clustering using a machine learning method and GIS-based spatial analytics to evaluate the spatial distribution of numerical models and spatial observation data such as satellite and HF radar data. For the clustering, we use 10- or 15-year-long reanalysis data computed by the KOOS, ECMWF, and HYCOM to make best-matching clusters, which are classified by physical meaning with time variation, and then we compare them with forecast data. Moreover, for evaluating currents, we develop a method for extracting the dominant flow and apply it to hydrodynamic models and HF radar sea surface current data. Applying the pattern clustering method allows more accurate and effective assessment of ocean forecast models' performance by comparing not only specific observation positions, which are determined by observation stations, but also the spatio-temporal distribution of whole model areas. We believe that our proposed method will be very useful for examining and evaluating large amounts of numerical modeling data as well as satellite data.

  14. Articular cartilage grading of the knee: diagnostic performance of fat-suppressed 3D volume isotropic turbo spin-echo acquisition (VISTA) compared with 3D T1 high-resolution isovolumetric examination (THRIVE).

    PubMed

    Lee, Young Han; Hahn, Seok; Lim, Daekeon; Suh, Jin-Suck

    2017-02-01

    Background Conventionally, two-dimensional (2D) fast spin-echo (FSE) sequences have been widely used for clinical cartilage imaging as well as gradient-echo (GRE) sequences. Recently, three-dimensional (3D) volumetric magnetic resonance imaging (MRI) has been introduced with one 3D volumetric scan, and this is replacing slice-by-slice 2D MR scans. Purpose To evaluate the image quality and diagnostic performance of two 3D sequences for abnormalities of knee cartilage: fat-suppressed (FS) FSE-based 3D volume isotropic turbo spin-echo acquisition (VISTA) and GRE-based 3D T1 high-resolution isovolumetric examination (THRIVE). Material and Methods The institutional review board approved the protocol of this retrospective review. This study enrolled 40 patients (41 knees) with arthroscopically confirmed abnormalities of cartilage. All patients underwent isovoxel 3D-VISTA and 3D-THRIVE MR sequences on 3T MRI. We assessed the cartilage grade on the two 3D sequences using arthroscopy as a gold standard. Inter-observer agreement for each technique was evaluated with the intraclass correlation coefficient (ICC). Differences in the area under the curve (AUC) were compared between the 3D-THRIVE and 3D-VISTA. Results Although inter-observer agreement for both sequences was excellent, the inter-observer agreement for 3D-VISTA was higher than for 3D-THRIVE for cartilage grading in all regions of the knee. There was no significant difference in the diagnostic performance (P > 0.05) between the two sequences for detecting cartilage grade. Conclusion FSE-based 3D-VISTA images had good diagnostic performance that was comparable to GRE-based 3D-THRIVE images in the evaluation of knee cartilage, and can be used in routine knee MR protocols for the evaluation of cartilage.

  15. The first impression is what matters: a neuroaesthetic study of the cerebral perception and appreciation of paintings by Titian.

    PubMed

    Babiloni, Francesca; Rossi, Dario; Cherubino, Patrizia; Trettel, Arianna; Picconi, Daniela; Maglione, Anton Giulio; Vecchiato, Giovanni; de Vico Fallani, Fabrizio; Chavez, Mario; Babiloni, Fabio

    2015-08-01

    In this paper we measured the neuroelectrical and eye-movement activity in a group of 27 healthy subjects during their visit to a fine arts gallery in which a series of masterpieces of the Italian painter Tiziano Vecellio (also known as Titian, 1488-1576) were shown. The pictures chosen for the visit were 10 portraits and 10 of religious subjects. Each picture was observed for a minute. A mobile EEG device with an eye-tracker was used for this experiment. Evaluation of the appreciation of the pictures was performed by using the neuroelectrical approach-withdrawal index (AW). A high value of AW means high appreciation of the picture. The number of eye fixations performed by the subjects during the observation of the pictures was also analyzed. Results showed that in the examined group the AW index was significantly higher during the observation of portraits than during the observation of the religious subjects (as shown by an ANOVA performed on the AW index, p < 0.007). Interestingly, the average AW index estimated in the first 20 seconds of the observation of the pictures remains highly correlated with the AW index evaluated for the second part of the data (from 20 s to one minute) for all the 20 pictures examined (r = 0.82, p < 0.0001). In addition, the number of eye fixations performed by the subjects in the first 5 or 10 seconds of observation of the pictures that were most appreciated is significantly higher than the number of eye fixations performed on pictures that subjects did not like (p < 0.048 and p < 0.0018, respectively). This difference vanishes if the entire one-minute period of observation of the pictures is used (p = 0.54). Taken together, such results seem to suggest that the neuroelectrical correlates of the perception of "good" or "bad" pictures are rapidly formed in our brain, within the first 10-20 seconds of exposure to the picture.

  16. Evaluation and intercomparison of air quality forecasts over Korea during the KORUS-AQ campaign

    NASA Astrophysics Data System (ADS)

    Lee, Seungun; Park, Rokjin J.; Kim, Soontae; Song, Chul H.; Kim, Cheol-Hee; Woo, Jung-Hun

    2017-04-01

    We evaluate and intercompare ozone and aerosol simulations over Korea during the KORUS-AQ campaign, which was conducted in May-June 2016. Four global and regional air quality models participated in the campaign and provided daily air quality forecasts over Korea to guide aircraft flight paths for detecting air pollution events over the Korean peninsula and its nearby oceans. We first evaluate the model performance by comparing simulated and observed hourly surface ozone and PM2.5 concentrations at ground sites in Korea and find that the models successfully capture intermittent air pollution events and reproduce the daily variation of ozone and PM2.5 concentrations. However, significant underestimates of peak ozone concentrations in the afternoon are also found in most models. Among chemical constituents of PM2.5, the models typically overestimate observed nitrate aerosol concentrations and underestimate organic aerosol concentrations, although the observed mass concentrations of PM2.5 are seemingly reproduced by the models. In particular, all models used the same anthropogenic emission inventory (KU-CREATE) for daily air quality forecasts, but they show a considerable discrepancy for ozone and aerosols. Compared to individual model results, the ensemble mean of all models shows the best performance, with correlation coefficients of 0.73 for ozone and 0.57 for PM2.5. We here investigate contributing factors to the discrepancy, which will serve as guidance to improve the performance of the air quality forecast.
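
    The ensemble-mean comparison described above can be sketched in a few lines: average the forecasts of several models at each time step, then correlate the result with observations. The abstract does not describe the actual processing chain, so the function names and the toy numbers below are hypothetical and serve only to show the arithmetic.

```python
import math


def ensemble_mean(model_series):
    """Average several equally long model time series point by point."""
    return [sum(values) / len(values) for values in zip(*model_series)]


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equally long series."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


# Toy hourly values (invented, not KORUS-AQ data).
observed = [30.0, 42.0, 55.0, 61.0, 48.0]
models = [
    [28.0, 40.0, 47.0, 50.0, 45.0],
    [35.0, 39.0, 58.0, 66.0, 41.0],
]
mean_forecast = ensemble_mean(models)
r_ensemble = pearson_r(mean_forecast, observed)
```

    Averaging tends to cancel errors that are not shared between models, which is one common explanation for an ensemble mean correlating better with observations than its individual members.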

  17. Practical implementation of channelized hotelling observers: effect of ROI size

    NASA Astrophysics Data System (ADS)

    Ferrero, Andrea; Favazza, Christopher P.; Yu, Lifeng; Leng, Shuai; McCollough, Cynthia H.

    2017-03-01

    Fundamental to the development and application of channelized Hotelling observer (CHO) models is the selection of the region of interest (ROI) to evaluate. For assessment of medical imaging systems, reducing the ROI size can be advantageous. Smaller ROIs enable a greater concentration of interrogable objects in a single phantom image, thereby providing more information from a set of images and reducing the overall image acquisition burden. Additionally, smaller ROIs may promote better assessment of clinical patient images as different patient anatomies present different ROI constraints. To this end, we investigated the minimum ROI size that does not compromise the performance of the CHO model. In this study, we evaluated both simulated images and phantom CT images to identify the minimum ROI size that resulted in an accurate figure of merit (FOM) of the CHO's performance. More specifically, the minimum ROI size was evaluated as a function of the following: number of channels, spatial frequency and number of rotations of the Gabor filters, size and contrast of the object, and magnitude of the image noise. Results demonstrate that a minimum ROI size exists below which the CHO's performance is grossly inaccurate. The minimum ROI size is shown to increase with number of channels and be dictated by truncation of lower frequency filters. We developed a model to estimate the minimum ROI size as a parameterized function of the number of orientations and spatial frequencies of the Gabor filters, providing a guide for investigators to appropriately select parameters for model observer studies.

  18. Practical implementation of Channelized Hotelling Observers: Effect of ROI size.

    PubMed

    Ferrero, Andrea; Favazza, Christopher P; Yu, Lifeng; Leng, Shuai; McCollough, Cynthia H

    2017-03-01

    Fundamental to the development and application of channelized Hotelling observer (CHO) models is the selection of the region of interest (ROI) to evaluate. For assessment of medical imaging systems, reducing the ROI size can be advantageous. Smaller ROIs enable a greater concentration of interrogable objects in a single phantom image, thereby providing more information from a set of images and reducing the overall image acquisition burden. Additionally, smaller ROIs may promote better assessment of clinical patient images as different patient anatomies present different ROI constraints. To this end, we investigated the minimum ROI size that does not compromise the performance of the CHO model. In this study, we evaluated both simulated images and phantom CT images to identify the minimum ROI size that resulted in an accurate figure of merit (FOM) of the CHO's performance. More specifically, the minimum ROI size was evaluated as a function of the following: number of channels, spatial frequency and number of rotations of the Gabor filters, size and contrast of the object, and magnitude of the image noise. Results demonstrate that a minimum ROI size exists below which the CHO's performance is grossly inaccurate. The minimum ROI size is shown to increase with number of channels and be dictated by truncation of lower frequency filters. We developed a model to estimate the minimum ROI size as a parameterized function of the number of orientations and spatial frequencies of the Gabor filters, providing a guide for investigators to appropriately select parameters for model observer studies.

  19. Clinical performance of bonded ceramic inlays/onlays: A 5- to 18-year retrospective longitudinal study.

    PubMed

    Borgia Botto, Ernesto; Baró, Rosario; Borgia Botto, José Luis

    2016-08-01

    This retrospective longitudinal study evaluated the clinical performance of bonded ceramic inlays/onlays, placed by the first author in his private practice, in a 5 to 18-year period. The patients evaluated had been treated in the office for at least 7 years and were still in the practice up to year 2013. 130 randomly selected patients agreed to participate in the study. 93 bonded ceramic inlays/onlays (BCRs) were placed on posterior teeth in 47 subjects. Gender, age, tooth preparation, number, type, extent, location, quality and survival of the restorations, ceramic materials, luting resin cements, parafunctional habits, secondary caries and maintenance therapy were the variables evaluated. Cohen's kappa coefficient, on the quality analysis of the restorations, ranged from 0.78 to 1. Fisher's exact test, chi-square test, Kruskal-Wallis test and Mann-Whitney non-parametric test were indicated to analyze significant differences. At the initial examination, 87 (93.5%) restorations were in function and six failed (6.5%). 81 (93%) were rated as clinical successes. The observed mean survival time of those that remained functional was 11 years. The standard deviation was 4 years, with a 95% CI for the overall observed mean survival time (10 years-11 years, 9 months). 87 of 93 BCRs had a functional success of 93.5%, with an observed mean survival of 11 years. The clinical performance of bonded ceramic onlays was very acceptable. Bonded ceramic onlays showed a predictable, esthetic, and functional treatment, with acceptable longevity.

  20. Evaluation of the Community Multiscale Air Quality (CMAQ) Model Version 5.1

    EPA Science Inventory

    The AMAD will perform two CMAQ model simulations, one with the current publicly available version of the CMAQ model (v5.0.2) and the other with the new version of the CMAQ model (v5.1). The results of each model simulation are compared to observations and the performance of t...

  1. Are Funny Groups Good at Solving Problems? A Methodological Evaluation and Some Preliminary Results.

    ERIC Educational Resources Information Center

    Pollio, Howard R.; Bainum, Charlene Kubo

    1983-01-01

    Observed college students (N=195) divided according to sex and measures of wittiness to determine the effects of humor on problem solving in groups. Results showed that group composition was not a crucial issue in problem-solving performance, but that humorous group interaction was, and did not interfere with ongoing task performance. (LLL)

  2. Beyond game effectiveness. Part II, a qualitative study of multi-role experiential learning.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Willis, Matthew; Tucker, Eilish Marie; Raybourn, Elaine Marie

    The present paper is the second in a series published at I/ITSEC that seeks to explain the efficacy of multirole experiential learning employed to create engaging game-based training methods transitioned to the U.S. Army, U.S. Army Special Forces, Civil Affairs, and Psychological Operations teams. The first publication (I/ITSEC 2009) summarized findings from a quantitative study that investigated experiential learning in the multi-player, PC-based game module transitioned to PEO-STRI, DARWARS Ambush! NK (non-kinetic). The 2009 publication reported that participants of multi-role (Player and Reflective Observer/Evaluator) game-based training reported statistically significant learning and engagement. Additionally when the means of the two groups (Player and Reflective Observer/Evaluator) were compared, they were not statistically significantly different from each other. That is to say that both playing as well as observing/evaluating were engaging learning modalities. The Observer/Evaluator role was designed to provide an opportunity for real-time reflection and meta-cognitive learning during game play. Results indicated that this role was an engaging way to learn about communication, that participants learned something about cultural awareness, and that the skills they learned were helpful in problem solving and decision-making. The present paper seeks to continue to understand what and how users of non-kinetic game-based missions learn by revisiting the 2009 quantitative study with further investigation such as stochastic player performance analysis using latent semantic analyses and graph visualizations to correlate against human coder ratings and pre- and post-test self-analysis. The results are applicable to First-Person game-based learning systems designed to enhance trainee intercultural communication, interpersonal skills, and adaptive thinking.
In the full paper, we discuss results obtained from data collected from 78 research participants of diverse backgrounds who trained by engaging in tasks directly, as well as observing and evaluating peer performance in real-time. The goal is two-fold. One is to quantify and visualize detailed player performance data coming from game play transcription to give further understanding to the results in the 2009 I/ITSEC paper. The second is to develop a set of technologies from this quantification and visualization approach into a generalized application tool to be used to aid in future games development of player/learner models and game adaptation algorithms.

  3. Implementation of Performance Assessment in STEM (Science, Technology, Engineering, Mathematics) Education to Detect Science Process Skill

    NASA Astrophysics Data System (ADS)

    Septiani, A.; Rustaman, N. Y.

    2017-02-01

    A descriptive study about the implementation of performance assessment in STEM-based instruction was carried out to investigate the science process skills of tenth-grade vocational school students during the teaching and learning process. A number of tenth-grade agriculture students were involved as research subjects, selected through a cluster random sampling technique (n=35). Performance assessment was planned on skills during the teaching and learning process through observation and on the product resulting from their engineering practice design. The procedure conducted in this study included a thinking phase (identifying the problem and sharing ideas), a designing phase, a construction phase, and an evaluation phase. Data were collected through the use of a science process skills (SPS) test, an observation sheet on student activity, as well as tasks and rubrics for performance assessment during the instruction. Research findings show that the implementation of performance assessment in STEM education on planting media could detect students' science process skills better through individual observation than through the SPS test. It was also found that the result of performance assessment was diverse when it was correlated to each indicator of SPS (strong and positive; weak and positive).

  4. Evaluating the Performance of the Goddard Multi-Scale Modeling Framework against GPM, TRMM and CloudSat/CALIPSO Products

    NASA Astrophysics Data System (ADS)

    Chern, J. D.; Tao, W. K.; Lang, S. E.; Matsui, T.; Mohr, K. I.

    2014-12-01

    Four six-month (March-August 2014) experiments with the Goddard Multi-scale Modeling Framework (MMF) were performed to study the impacts of different Goddard one-moment bulk microphysical schemes and large-scale forcings on the performance of the MMF. Recently a new Goddard one-moment bulk microphysics scheme with four ice classes (cloud ice, snow, graupel, and frozen drops/hail) has been developed based on cloud-resolving model simulations with large-scale forcings from field campaign observations. The new scheme has been successfully implemented in the MMF, and two MMF experiments were carried out with this new scheme and the old three-ice-class (cloud ice, snow, graupel) scheme. The MMF has global coverage and can rigorously evaluate microphysics performance for different cloud regimes. The results show that the MMF with the new scheme outperformed the old one. The MMF simulations are also strongly affected by the interaction between large-scale and cloud-scale processes. Two MMF sensitivity experiments with and without nudging large-scale forcings to those of ERA-Interim reanalysis were carried out to study the impacts of large-scale forcings. The model-simulated mean and variability of surface precipitation, cloud types, cloud properties such as cloud amount, hydrometeor vertical profiles, and cloud water contents, etc. in different geographic locations and climate regimes are evaluated against GPM, TRMM, CloudSat/CALIPSO satellite observations. The Goddard MMF has also been coupled with the Goddard Satellite Data Simulation Unit (G-SDSU), a system with multi-satellite, multi-sensor, and multi-spectrum satellite simulators. The statistics of MMF simulated radiances and backscattering can be directly compared with satellite observations to assess the strengths and/or deficiencies of MMF simulations and provide guidance on how to improve the MMF and microphysics.

  5. Multi-criteria evaluation of CMIP5 GCMs for climate change impact analysis

    NASA Astrophysics Data System (ADS)

    Ahmadalipour, Ali; Rana, Arun; Moradkhani, Hamid; Sharma, Ashish

    2017-04-01

    Climate change is expected to have severe impacts on the global hydrological cycle along with the food-water-energy nexus. Currently, there are many climate models used in predicting important climatic variables. Though there have been advances in the field, there are still many problems to be resolved related to reliability, uncertainty, and computing needs, among many others. In the present work, we have analyzed the performance of 20 different global climate models (GCMs) from the Coupled Model Intercomparison Project Phase 5 (CMIP5) dataset over the Columbia River Basin (CRB) in the Pacific Northwest USA. We demonstrate a statistical multicriteria approach, using univariate and multivariate techniques, for selecting suitable GCMs to be used for climate change impact analysis in the region. Univariate methods include mean, standard deviation, coefficient of variation, relative change (variability), Mann-Kendall test, and Kolmogorov-Smirnov test (KS-test), whereas the multivariate methods used were principal component analysis (PCA), singular value decomposition (SVD), canonical correlation analysis (CCA), and cluster analysis. The analysis is performed on raw GCM data, i.e., before bias correction, for precipitation and temperature climatic variables for all the 20 models to capture the reliability and nature of the particular model at regional scale. The analysis is based on spatially averaged datasets of GCMs and observation for the period of 1970 to 2000. Ranking is provided to each of the GCMs based on the performance evaluated against gridded observational data on various temporal scales (daily, monthly, and seasonal). Results have provided insight into each of the methods and various statistical properties addressed by them employed in ranking GCMs. Further, evaluation was also performed for raw GCM simulations against different sets of gridded observational datasets in the area.
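
    One of the univariate criteria listed above, the Kolmogorov-Smirnov test, lends itself to a short sketch of how models might be ranked against observations. The abstract does not describe how the individual criteria were combined into a ranking, so the single-criterion `rank_models` helper, the model names, and the sample values below are hypothetical.

```python
from bisect import bisect_right


def ks_statistic(sample_a, sample_b):
    """Two-sample KS statistic: the largest gap between the two
    empirical cumulative distribution functions."""
    a = sorted(sample_a)
    b = sorted(sample_b)
    largest_gap = 0.0
    for value in sorted(set(a) | set(b)):
        cdf_a = bisect_right(a, value) / len(a)
        cdf_b = bisect_right(b, value) / len(b)
        largest_gap = max(largest_gap, abs(cdf_a - cdf_b))
    return largest_gap


def rank_models(observed, simulations):
    """Order model names from closest to farthest from the observed
    distribution, using the KS statistic as the only criterion."""
    distance = {
        name: ks_statistic(series, observed)
        for name, series in simulations.items()
    }
    return sorted(distance, key=distance.get)


# Invented monthly precipitation samples (not CMIP5 data).
observed = [1.0, 2.0, 3.0, 4.0]
simulations = {
    "model_a": [1.0, 2.0, 3.0, 4.0],
    "model_b": [5.0, 6.0, 7.0, 8.0],
}
ranking = rank_models(observed, simulations)
```

    A multi-criteria scheme of the kind the abstract describes would repeat this for each metric and temporal scale and then aggregate the per-metric ranks; that aggregation step is left out here.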

  6. Personality traits affect teaching performance of attending physicians: results of a multi-center observational study.

    PubMed

    Scheepers, Renée A; Lombarts, Kiki M J M H; van Aken, Marcel A G; Heineman, Maas Jan; Arah, Onyebuchi A

    2014-01-01

    Worldwide, attending physicians train residents to become competent providers of patient care. To assess adequate training, attending physicians are increasingly evaluated on their teaching performance. Research suggests that personality traits affect teaching performance, consistent with studied effects of personality traits on job performance and academic performance in medicine. However, to date, research in clinical teaching practice did not use quantitative methods and did not account for specialty differences. We empirically studied the relationship of attending physicians' personality traits with their teaching performance across surgical and non-surgical specialties. We conducted a survey across surgical and non-surgical specialties in eighteen medical centers in the Netherlands. Residents evaluated attending physicians' overall teaching performance, as well as the specific domains learning climate, professional attitude, communication, evaluation, and feedback, using the validated 21-item System for Evaluation of Teaching Qualities (SETQ). Attending physicians self-evaluated their personality traits on a 5-point scale using the validated 10-item Big Five Inventory (BFI), yielding the Five Factor model: extraversion, conscientiousness, neuroticism, agreeableness and openness. Overall, 622 (77%) attending physicians and 549 (68%) residents participated. Extraversion positively related to overall teaching performance (regression coefficient, B: 0.05, 95% CI: 0.01 to 0.10, P = 0.02). Openness was negatively associated with scores on feedback for surgical specialties only (B: -0.10, 95% CI: -0.15 to -0.05, P<0.001) and conscientiousness was positively related to evaluation of residents for non-surgical specialties only (B: 0.13, 95% CI: 0.03 to 0.22, P = 0.01). Extraverted attending physicians were consistently evaluated as better supervisors. Surgical attending physicians who display high levels of openness were evaluated as less adequate feedback-givers. Non-surgical attending physicians who were conscientious seem to be good at evaluating residents. These insights could contribute to future work on development paths of attending physicians in medical education.

  7. Personality Traits Affect Teaching Performance of Attending Physicians: Results of a Multi-Center Observational Study

    PubMed Central

    Scheepers, Renée A.; Lombarts, Kiki M. J. M. H.; van Aken, Marcel A. G.; Heineman, Maas Jan; Arah, Onyebuchi A.

    2014-01-01

    Background Worldwide, attending physicians train residents to become competent providers of patient care. To assess adequate training, attending physicians are increasingly evaluated on their teaching performance. Research suggests that personality traits affect teaching performance, consistent with studied effects of personality traits on job performance and academic performance in medicine. However, to date, research in clinical teaching practice did not use quantitative methods and did not account for specialty differences. We empirically studied the relationship of attending physicians' personality traits with their teaching performance across surgical and non-surgical specialties. Method We conducted a survey across surgical and non-surgical specialties in eighteen medical centers in the Netherlands. Residents evaluated attending physicians' overall teaching performance, as well as the specific domains learning climate, professional attitude, communication, evaluation, and feedback, using the validated 21-item System for Evaluation of Teaching Qualities (SETQ). Attending physicians self-evaluated their personality traits on a 5-point scale using the validated 10-item Big Five Inventory (BFI), yielding the Five Factor model: extraversion, conscientiousness, neuroticism, agreeableness and openness. Results Overall, 622 (77%) attending physicians and 549 (68%) residents participated. Extraversion positively related to overall teaching performance (regression coefficient, B: 0.05, 95% CI: 0.01 to 0.10, P = 0.02). Openness was negatively associated with scores on feedback for surgical specialties only (B: −0.10, 95% CI: −0.15 to −0.05, P<0.001) and conscientiousness was positively related to evaluation of residents for non-surgical specialties only (B: 0.13, 95% CI: 0.03 to 0.22, P = 0.01). Conclusions Extraverted attending physicians were consistently evaluated as better supervisors. Surgical attending physicians who display high levels of openness were evaluated as less adequate feedback-givers. Non-surgical attending physicians who were conscientious seem to be good at evaluating residents. These insights could contribute to future work on development paths of attending physicians in medical education. PMID:24844725

  8. A statistical, task-based evaluation method for three-dimensional x-ray breast imaging systems using variable-background phantoms

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Park, Subok; Jennings, Robert; Liu Haimo

    Purpose: For the last few years, development and optimization of three-dimensional (3D) x-ray breast imaging systems, such as digital breast tomosynthesis (DBT) and computed tomography, have drawn much attention from the medical imaging community, in both academia and industry. However, there is still much room for understanding how to best optimize and evaluate the devices over a large space of many different system parameters and geometries. Current evaluation methods, which work well for 2D systems, do not incorporate the depth information from the 3D imaging systems. Therefore, it is critical to develop a statistically sound evaluation method to investigate the usefulness of including depth and background-variability information in the assessment and optimization of the 3D systems. Methods: In this paper, we present a mathematical framework for a statistical assessment of planar and 3D x-ray breast imaging systems. Our method is based on statistical decision theory, in particular, making use of the ideal linear observer called the Hotelling observer. We also present a physical phantom that consists of spheres of different sizes and materials for producing an ensemble of randomly varying backgrounds to be imaged for a given patient class. Lastly, we demonstrate our evaluation method in comparing laboratory mammography and three-angle DBT systems for signal detection tasks using the phantom's projection data. We compare the variable phantom case to that of a phantom of the same dimensions filled with water, which we call the uniform phantom, based on the performance of the Hotelling observer as a function of signal size and intensity. Results: Detectability trends calculated using the variable and uniform phantom methods are different from each other for both mammography and DBT systems. 
Conclusions: Our results indicate that measuring the system's detection performance with consideration of background variability may lead to differences in system performance estimates and comparisons. For the assessment of 3D systems, to accurately determine trade-offs between image quality and radiation dose, it is critical to incorporate randomness arising from the imaging chain, including background variability, into system performance calculations.
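
    The Hotelling observer named in this abstract can be illustrated with a small sketch. It is not the authors' implementation: it assumes the textbook form, in which the observer template is the inverse of the average class covariance applied to the difference of class means, and the detectability (SNR) is the square root of that difference projected onto the template. The function names and the toy two-pixel "images" are hypothetical.

```python
import math

def mean_vector(samples):
    """Per-pixel mean over a list of equally long image vectors."""
    n = len(samples)
    return [sum(col) / n for col in zip(*samples)]

def covariance(samples, mean):
    """Unbiased sample covariance matrix of the image vectors."""
    n, d = len(samples), len(mean)
    cov = [[0.0] * d for _ in range(d)]
    for s in samples:
        dev = [x - m for x, m in zip(s, mean)]
        for i in range(d):
            for j in range(d):
                cov[i][j] += dev[i] * dev[j]
    return [[c / (n - 1) for c in row] for row in cov]

def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("covariance matrix is singular")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x

def hotelling_snr(signal_absent, signal_present):
    """Return (SNR, template) of the Hotelling observer for two image ensembles."""
    mu0, mu1 = mean_vector(signal_absent), mean_vector(signal_present)
    k0, k1 = covariance(signal_absent, mu0), covariance(signal_present, mu1)
    d = len(mu0)
    # Average the two class covariances (assumption: equal class weighting).
    k = [[0.5 * (k0[i][j] + k1[i][j]) for j in range(d)] for i in range(d)]
    delta = [b - a for a, b in zip(mu0, mu1)]
    template = solve(k, delta)  # template = K^-1 (mu1 - mu0)
    snr = math.sqrt(sum(t * x for t, x in zip(template, delta)))
    return snr, template

# Hypothetical two-pixel "images": the signal adds 2 to the first pixel.
absent = [[0, 0], [2, 0], [0, 2], [2, 2]]
present = [[2, 0], [4, 0], [2, 2], [4, 2]]
snr, template = hotelling_snr(absent, present)
```

    With real projection data the covariance matrix is far too large for a direct solve like this, which is one reason channelized variants of the observer are used in practice.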

  9. Inter- and intra-observer agreement of BI-RADS-based subjective visual estimation of amount of fibroglandular breast tissue with magnetic resonance imaging: comparison to automated quantitative assessment.

    PubMed

    Wengert, G J; Helbich, T H; Woitek, R; Kapetas, P; Clauser, P; Baltzer, P A; Vogl, W-D; Weber, M; Meyer-Baese, A; Pinker, Katja

    2016-11-01

    To evaluate the inter-/intra-observer agreement of BI-RADS-based subjective visual estimation of the amount of fibroglandular tissue (FGT) with magnetic resonance imaging (MRI), and to investigate whether FGT assessment benefits from an automated, observer-independent, quantitative MRI measurement by comparing both approaches. Eighty women with no imaging abnormalities (BI-RADS 1 and 2) were included in this institutional review board (IRB)-approved prospective study. All women underwent un-enhanced breast MRI. Four radiologists independently assessed FGT with MRI by subjective visual estimation according to BI-RADS. Automated observer-independent quantitative measurement of FGT with MRI was performed using a previously described measurement system. Inter-/intra-observer agreements of qualitative and quantitative FGT measurements were assessed using Cohen's kappa (k). Inexperienced readers achieved moderate inter-/intra-observer agreement and experienced readers a substantial inter- and perfect intra-observer agreement for subjective visual estimation of FGT. Practice and experience reduced observer-dependency. Automated observer-independent quantitative measurement of FGT was successfully performed and revealed only fair to moderate agreement (k = 0.209-0.497) with subjective visual estimations of FGT. Subjective visual estimation of FGT with MRI shows moderate intra-/inter-observer agreement, which can be improved by practice and experience. Automated observer-independent quantitative measurements of FGT are necessary to allow a standardized risk evaluation. • Subjective FGT estimation with MRI shows moderate intra-/inter-observer agreement in inexperienced readers. • Inter-observer agreement can be improved by practice and experience. • Automated observer-independent quantitative measurements can provide reliable and standardized assessment of FGT with MRI.
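
    The agreement statistic used in this abstract, Cohen's kappa, compares observed agreement between two readers with the agreement expected by chance. The sketch below is a minimal illustration of the standard two-rater formula, not the study's analysis code; the function name and the example ratings (four categories labelled a to d) are hypothetical.

```python
from collections import Counter

def cohens_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two raters labelling the same items."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("need two equally long, non-empty rating lists")
    n = len(ratings_a)
    # Observed agreement: fraction of items given the same label by both raters.
    p_observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Chance agreement: sum over labels of the product of marginal frequencies.
    freq_a, freq_b = Counter(ratings_a), Counter(ratings_b)
    p_chance = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)
    if p_chance == 1.0:
        # Kappa is undefined when both raters use one identical label throughout;
        # returning 1.0 here is a convention chosen for this sketch.
        return 1.0
    return (p_observed - p_chance) / (1.0 - p_chance)

# Hypothetical category assignments by two readers for eight cases.
reader_1 = ["a", "a", "b", "b", "c", "c", "d", "d"]
reader_2 = ["a", "a", "b", "c", "c", "c", "d", "b"]
kappa = cohens_kappa(reader_1, reader_2)
```

    In this toy example the readers agree on 6 of 8 cases (0.75) while chance agreement is 0.25, so kappa is (0.75 - 0.25) / (1 - 0.25), about 0.67.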

  10. Study on the application of the time-compressed speech in children.

    PubMed

    Padilha, Fernanda Yasmin Odila Maestri Miguel; Pinheiro, Maria Madalena Canina

    2017-11-09

    To analyze the performance of children without alteration of central auditory processing in the Time-compressed Speech Test. This is a descriptive, observational, cross-sectional study. Study participants were 22 children aged 7-11 years without central auditory processing disorders. The following instruments were used to assess whether these children presented central auditory processing disorders: Scale of Auditory Behaviors, simplified evaluation of central auditory processing, and Dichotic Test of Digits (binaural integration stage). The Time-compressed Speech Test was applied to the children without auditory changes. The participants presented better performance in the list of monosyllabic words than in the list of disyllabic words, but with no statistically significant difference. No influence on test performance was observed with respect to order of presentation of the lists and the variables gender and ear. Regarding age, difference in performance was observed only in the list of disyllabic words. The mean score of children in the Time-compressed Speech Test was lower than that of adults reported in the national literature. Difference in test performance was observed only with respect to the age variable for the list of disyllabic words. No difference was observed in the order of presentation of the lists or in the type of stimulus.

  11. Systematic review of coaching to enhance surgeons' operative performance.

    PubMed

    Min, Hyeyoun; Morales, Dianali Rivera; Orgill, Dennis; Smink, Douglas S; Yule, Steven

    2015-11-01

    There is increasing attention on the coaching of surgeons and trainees to improve performance but no comprehensive review on this topic. The purpose of this review is to summarize the quantity and the quality of studies involving surgical coaching methods and their effectiveness. We performed a systematic literature search through PubMed and PsycINFO by using predefined inclusion criteria. Evidence for main outcome categories was evaluated with the Grading of Recommendations Assessment, Development, and Evaluation (GRADE) system and the Medical Education Research Study Quality Instrument (MERSQI). Of a total of 3,063 articles, 23 met our inclusion criteria: 4 randomized controlled trials and 19 observational studies. We categorized the articles into 4 groups on the basis of the outcome studied: perception, attitude and opinion; technical skills; nontechnical skills; and performance measures. Overall strength of evidence for each outcome group was as follows: perception, attitude, and opinion (GRADE: Very Low, MERSQI: 10); technical skills (randomized controlled trials: High, 13.1; observational studies: Very Low, 11.5); nontechnical skills (Very Low, 12.4) and performance measures (Very Low, 13.6). Simulation was the most used setting for coaching; more than half of the studies deployed an experienced surgeon as a coach and showed that coaching was effective. Surgical coaching interventions have a positive impact on learners' perception and attitudes, their technical and nontechnical skills, and performance measures. Evidence of impact on patient outcomes was limited, and the quality of research studies was variable. Despite this, our systematic review of different coaching interventions will benefit future coaching strategies and implementation to enhance operative performance. Copyright © 2015 Elsevier Inc. All rights reserved.

  12. A novel spatial performance metric for robust pattern optimization of distributed hydrological models

    NASA Astrophysics Data System (ADS)

    Stisen, S.; Demirel, C.; Koch, J.

    2017-12-01

    Evaluation of performance is an integral part of model development and calibration, and it is of paramount importance when communicating modelling results to stakeholders and the scientific community. The hydrological modelling community has a comprehensive and well-tested toolbox of metrics to assess temporal model performance. In contrast, experience in evaluating spatial performance has not kept pace with the wide availability of spatial observations or with the sophisticated model codes that simulate the spatial variability of complex hydrological processes. This study aims to contribute to advancing spatial-pattern-oriented model evaluation for distributed hydrological models. This is achieved by introducing a novel spatial performance metric which provides robust pattern performance during model calibration. The promoted SPAtial EFficiency (SPAEF) metric reflects three equally weighted components: correlation, coefficient of variation and histogram overlap. This multi-component approach is necessary in order to adequately compare spatial patterns. SPAEF, its three components individually and two alternative spatial performance metrics, i.e. connectivity analysis and fractions skill score, are tested in a spatial-pattern-oriented model calibration of a catchment model in Denmark. The calibration is constrained by a remote-sensing-based spatial pattern of evapotranspiration and discharge time series at two stations. Our results stress that stand-alone metrics tend to fail to provide holistic pattern information to the optimizer, which underlines the importance of multi-component metrics. The three SPAEF components are independent, which allows them to complement each other in a meaningful way. 
This study promotes the use of bias-insensitive metrics, which allow comparing variables that are related but may differ in unit, in order to optimally exploit spatial observations made available by remote sensing platforms. We see great potential for SPAEF across environmental disciplines dealing with spatially distributed modelling.
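
    The three components of the SPAtial EFficiency metric listed in this abstract can be sketched as follows. The abstract does not state how the components are combined, so this sketch assumes one minus the Euclidean distance of (correlation, coefficient-of-variation ratio, histogram overlap) from the ideal point (1, 1, 1), with the histogram overlap computed on z-scored fields; the bin count and all function names are likewise assumptions made for illustration.

```python
import math

def _mean(values):
    return sum(values) / len(values)

def _std(values):
    m = _mean(values)
    return math.sqrt(sum((v - m) ** 2 for v in values) / len(values))

def _zscores(values):
    m, s = _mean(values), _std(values)
    return [(v - m) / s for v in values]

def _histogram(values, lo, hi, bins):
    """Counts per bin over the shared range [lo, hi]."""
    counts = [0] * bins
    width = (hi - lo) / bins
    for v in values:
        counts[min(int((v - lo) / width), bins - 1)] += 1
    return counts

def spatial_efficiency(observed, simulated, bins=100):
    """Return (score, (alpha, beta, gamma)) for two flattened spatial fields."""
    if len(observed) != len(simulated) or len(observed) < 2:
        raise ValueError("fields must have the same length (at least 2 cells)")
    mo, ms = _mean(observed), _mean(simulated)
    so, ss = _std(observed), _std(simulated)
    if 0 in (mo, ms, so, ss):
        raise ValueError("fields must have non-zero mean and spread")
    # alpha: Pearson correlation between the two patterns.
    cov = sum((o - mo) * (s - ms) for o, s in zip(observed, simulated)) / len(observed)
    alpha = cov / (so * ss)
    # beta: ratio of the coefficients of variation (simulated over observed).
    beta = (ss / ms) / (so / mo)
    # gamma: histogram intersection of the z-scored fields, which removes bias and units.
    zo, zs = _zscores(observed), _zscores(simulated)
    lo, hi = min(zo + zs), max(zo + zs)
    ho, hs = _histogram(zo, lo, hi, bins), _histogram(zs, lo, hi, bins)
    gamma = sum(min(a, b) for a, b in zip(ho, hs)) / sum(ho)
    # Assumed combination: distance from the ideal point (1, 1, 1).
    score = 1.0 - math.sqrt((alpha - 1) ** 2 + (beta - 1) ** 2 + (gamma - 1) ** 2)
    return score, (alpha, beta, gamma)

# Hypothetical six-cell fields: a perfect match and a spatially reversed pattern.
obs = [1, 2, 3, 4, 5, 6]
score_same, _ = spatial_efficiency(obs, obs)
score_reversed, parts_reversed = spatial_efficiency(obs, obs[::-1])
```

    The reversed field has the same histogram and coefficient of variation as the observation but a correlation of -1, so only the correlation component penalizes it. This is the kind of case where a histogram-only or variability-only metric would report a perfect match.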

  13. Management of children exposed to Mycobacterium tuberculosis: a public health evaluation in West Java, Indonesia

    PubMed Central

    Ruslami, Rovina; Anselmo, Melissa; Alisjahbana, Bachti; Yulianti, Neti; Sampurno, Hedy; van Crevel, Reinout; Hill, Philip C

    2013-01-01

    Abstract Objective To investigate qualitatively and quantitatively the performance of a programme for managing the child contacts of adult tuberculosis patients in Indonesia. Methods A public health evaluation framework was used to assess gaps in a child contact management programme at a lung clinic. Targets for programme performance indicators were derived from established programme indicator targets, the scientific literature and expert opinion. Compliance with tuberculosis screening, the initiation of isoniazid preventive therapy in children younger than 5 years, the accuracy of tuberculosis diagnosis and adherence to preventive therapy were assessed in 755 child contacts in two cohorts. In addition, 22 primary caregivers and 34 clinic staff were interviewed to evaluate knowledge and acceptance of child contact management. The cost to caregivers was recorded. Gaps between observed and target indicator values were quantified. Findings The gaps between observed and target performance indicators were: 82% for screening compliance; 64 to 100% for diagnostic accuracy, 50% for the initiation of preventive therapy, 54% for adherence to therapy and 50% for costs. Many staff did not have adequate knowledge of, or an appropriate attitude towards, child contact management, especially regarding isoniazid preventive therapy. Caregivers had good knowledge of screening but not of preventive therapy and had difficulty travelling to the clinic and paying costs. Conclusion The study identified widespread gaps in the performance of a child contact management system in Indonesia, all of which appear amenable to intervention. The public health evaluation framework used could be applied in other settings where child contact management is failing. PMID:24347732

  14. Measures of GCM Performance as Functions of Model Parameters Affecting Clouds and Radiation

    NASA Astrophysics Data System (ADS)

    Jackson, C.; Mu, Q.; Sen, M.; Stoffa, P.

    2002-05-01

    This abstract is one of three related presentations at this meeting dealing with several issues surrounding optimal parameter and uncertainty estimation of model predictions of climate. Uncertainty in model predictions of climate depends in part on the uncertainty produced by model approximations or parameterizations of unresolved physics. Evaluating these uncertainties is computationally expensive because one needs to evaluate how arbitrary choices for any given combination of model parameters affect model performance. Because the computational effort grows exponentially with the number of parameters being investigated, it is important to choose parameters carefully. Evaluating whether a parameter is worth investigating depends on two considerations: 1) do reasonable choices of parameter values produce a large range in model response relative to observational uncertainty? and 2) does the model response depend non-linearly on various combinations of model parameters? We have decided to narrow our attention to selecting parameters that affect clouds and radiation, as it is likely that these parameters will dominate uncertainties in model predictions of future climate. We present preliminary results of ~20 to 30 AMIPII style climate model integrations using NCAR's CCM3.10 that show model performance as functions of individual parameters controlling 1) critical relative humidity for cloud formation (RHMIN), and 2) boundary layer critical Richardson number (RICR). We also explore various definitions of model performance that include some or all observational data sources (surface air temperature and pressure, meridional and zonal winds, clouds, long and short-wave cloud forcings, etc.) and evaluate in a few select cases whether the model's response depends non-linearly on the parameter values we have selected.

  15. Analysis on mechanics response of long-life asphalt pavement at moist hot heavy loading area

    NASA Astrophysics Data System (ADS)

    Xu, Xinquan; Li, Hao; Wu, Chuanhai; Li, Shanqiang

    2018-04-01

    Based on the durability test road for semi-rigid base asphalt pavement on the Guangdong Yunluo expressway, this study compares the mechanical response of modified semi-rigid base, RCC base and inverted semi-rigid base structures in the continuous state, uses a four-unit five-parameter model to evaluate the rut depth of the asphalt pavement structures, and applies a commonly used fatigue life prediction model to evaluate the fatigue performance of the three types of asphalt pavement structure. Theoretical calculations and four years of tracking observations of the test road show that the modified semi-rigid base asphalt pavement has the smallest rut depth, the best road performance, and the best fatigue performance.

  16. Foundations of Intervention Research in Instrumental Practice

    PubMed Central

    Hatfield, Johannes L.; Lemyre, Pierre-Nicolas

    2016-01-01

    The goals of the present study are to evaluate, implement, and adapt psychological skills used in the realm of sports into music performance. This research project also aims to build foundations on how to implement future interventions to guide music students on how to optimize practice toward performance. A 2-month psychological skills intervention was provided to two students from the national music academy's bachelor program in music performance to better understand how to adapt and construct psychological skills training programs for performing music students. The program evaluated multiple intervention tools including the use of questionnaires, performance profiling, iPads, electronic practice logs, recording the perceived value of individual and combined work, as well as the effectiveness of different communication forms. Perceived effects of the intervention were collected through semi-structured interviews, observations, and logs. PMID:26834660

  17. Systematic review of the methodological quality of controlled trials evaluating Chinese herbal medicine in patients with rheumatoid arthritis

    PubMed Central

    Pan, Xin; Lopez-Olivo, Maria A; Song, Juhee; Pratt, Gregory; Suarez-Almazor, Maria E

    2017-01-01

    Objectives We appraised the methodological and reporting quality of randomised controlled clinical trials (RCTs) evaluating the efficacy and safety of Chinese herbal medicine (CHM) in patients with rheumatoid arthritis (RA). Design For this systematic review, electronic databases were searched from inception until June 2015. The search was limited to humans and non-case report studies, but was not limited by language, year of publication or type of publication. Two independent reviewers selected RCTs, evaluating CHM in RA (herbals and decoctions). Descriptive statistics were used to report on risk of bias and their adherence to reporting standards. Multivariable logistic regression analysis was performed to determine study characteristics associated with high or unclear risk of bias. Results Out of 2342 unique citations, we selected 119 RCTs including 18 919 patients: 10 108 patients received CHM alone and 6550 received one of 11 treatment combinations. A high risk of bias was observed across all domains: 21% had a high risk for selection bias (11% from sequence generation and 30% from allocation concealment), 85% for performance bias, 89% for detection bias, 4% for attrition bias and 40% for reporting bias. In multivariable analysis, fewer authors were associated with selection bias (allocation concealment), performance bias and attrition bias, and earlier year of publication and funding source not reported or disclosed were associated with selection bias (sequence generation). Studies published in non-English language were associated with reporting bias. Poor adherence to recommended reporting standards (<60% of the studies not providing sufficient information) was observed in 11 of the 23 sections evaluated. Limitations Study quality and data extraction were performed by one reviewer and cross-checked by a second reviewer. Translation to English was performed by one reviewer in 85% of the included studies. 
Conclusions Studies evaluating CHM often fail to meet expected methodological criteria, and high-quality evidence is lacking. PMID:28249848

  18. Comparisons of cloud cover evaluated from LANDSAT imagery and meteorological stations across the British Isles

    NASA Technical Reports Server (NTRS)

    Barrett, E. C. (Principal Investigator); Grant, C. K.

    1976-01-01

    The author has identified the following significant results. This stage of the study has confirmed the initial supposition that LANDSAT data could be analyzed to provide useful data on cloud amount, and that useful light would be thrown thereby on the performance of the ground observer of this aspect of the state of the sky. This study, in comparison with previous studies of a similar nature using data from meteorological satellites, has benefited greatly from the much higher resolution data provided by LANDSAT. This has permitted consideration of not only the overall performance of the surface observer in estimating total cloud cover, but also his performance under different sky conditions.

  19. Impact of workplace based assessment on doctors' education and performance: a systematic review.

    PubMed

    Miller, Alice; Archer, Julian

    2010-09-24

    To investigate the literature for evidence that workplace based assessment affects doctors' education and performance. Systematic review. The primary data sources were the databases Journals@Ovid, Medline, Embase, CINAHL, PsycINFO, and ERIC. Evidence based reviews (Bandolier, Cochrane Library, DARE, HTA Database, and NHS EED) were accessed and searched via the Health Information Resources website. Reference lists of relevant studies and bibliographies of review articles were also searched. Review methods Studies of any design that attempted to evaluate either the educational impact of workplace based assessment, or the effect of workplace based assessment on doctors' performance, were included. Studies were excluded if the sampled population was non-medical or the study was performed with medical students. Review articles, commentaries, and letters were also excluded. The final exclusion criterion was the use of simulated patients or models rather than real life clinical encounters. Sixteen studies were included. Fifteen of these were non-comparative descriptive or observational studies; the other was a randomised controlled trial. Study quality was mixed. Eight studies examined multisource feedback with mixed results; most doctors felt that multisource feedback had educational value, although the evidence for practice change was conflicting. Some junior doctors and surgeons displayed little willingness to change in response to multisource feedback, whereas family physicians might be more prepared to initiate change. Performance changes were more likely to occur when feedback was credible and accurate or when coaching was provided to help subjects identify their strengths and weaknesses. 
Four studies examined the mini-clinical evaluation exercise, one looked at direct observation of procedural skills, and three were concerned with multiple assessment methods: all these studies reported positive results for the educational impact of workplace based assessment tools. However, there was no objective evidence of improved performance with these tools. Considering the emphasis placed on workplace based assessment as a method of formative performance assessment, there are few published articles exploring its impact on doctors' education and performance. This review shows that multisource feedback can lead to performance improvement, although individual factors, the context of the feedback, and the presence of facilitation have a profound effect on the response. There is no evidence that alternative workplace based assessment tools (mini-clinical evaluation exercise, direct observation of procedural skills, and case based discussion) lead to improvement in performance, although subjective reports on their educational impact are positive.

  20. Inter and intra-observer concordance for the diagnosis of portal hypertension gastropathy.

    PubMed

    Casas, Meritxell; Vergara, Mercedes; Brullet, Enric; Junquera, Félix; Martínez-Bauer, Eva; Miquel, Mireia; Sánchez-Delgado, Jordi; Dalmau, Blai; Campo, Rafael; Calvet, Xavier

    2018-03-01

    At present there is no fully accepted endoscopic classification for the assessment of the severity of portal hypertensive gastropathy (PHG). Few studies have evaluated inter and intra-observer concordance or the degree of concordance between different endoscopic classifications. To evaluate inter and intra-observer agreement for the presence of portal hypertensive gastropathy and enteropathy using different endoscopic classifications. Patients with liver cirrhosis were included in the study. Enteroscopy was performed under sedation. The location of lesions and their severity were recorded. Images were videotaped and subsequently evaluated independently by three different endoscopists, one of whom was the initial endoscopist. The agreement between observations was assessed using the kappa index. Seventy-four patients (mean age 63.2 years, 53 males and 21 females) were included. The agreement between the three endoscopists regarding the presence or absence of PHG using the Tanoue and McCormack classifications was very low (kappa scores = 0.16 and 0.27, respectively). The current classifications of portal hypertensive gastropathy have a very low degree of intra and inter-observer agreement for the diagnosis and assessment of gastropathy severity.

  1. Channelized relevance vector machine as a numerical observer for cardiac perfusion defect detection task

    NASA Astrophysics Data System (ADS)

    Kalayeh, Mahdi M.; Marin, Thibault; Pretorius, P. Hendrik; Wernick, Miles N.; Yang, Yongyi; Brankov, Jovan G.

    2011-03-01

    In this paper, we present a numerical observer for image quality assessment, aiming to predict human observer accuracy in a cardiac perfusion defect detection task for single-photon emission computed tomography (SPECT). In medical imaging, image quality should be assessed by evaluating the human observer accuracy for a specific diagnostic task. This approach is known as task-based assessment. Such evaluations are important for optimizing and testing imaging devices and algorithms. Unfortunately, human observer studies with expert readers are costly and time-demanding. To address this problem, numerical observers have been developed as a surrogate for human readers to predict human diagnostic performance. The channelized Hotelling observer (CHO) with internal noise model has been found to predict human performance well in some situations, but does not always generalize well to unseen data. We have argued in the past that finding a model to predict human observers could be viewed as a machine learning problem. Following this approach, in this paper we propose a channelized relevance vector machine (CRVM) to predict human diagnostic scores in a detection task. We have previously used channelized support vector machines (CSVM) to predict human scores and have shown that this approach offers better and more robust predictions than the classical CHO method. The comparison of the proposed CRVM with our previously introduced CSVM method suggests that CRVM can achieve similar generalization accuracy, while dramatically reducing model complexity and computation time.

  2. Evaluation of Ohio work zone speed zones process.

    DOT National Transportation Integrated Search

    2014-06-01

    This report describes the methodology and results of analyses performed to determine the effectiveness of Ohio Department of Transportation processes for establishing work zone speed zones. Researchers observed motorists' speed choice upstream of a...

  3. Multiple Comparisons of Observation Means--Are the Means Significantly Different?

    ERIC Educational Resources Information Center

    Fahidy, T. Z.

    2009-01-01

    Several currently popular methods of ascertaining which treatment (population) means are different, via random samples obtained under each treatment, are briefly described and illustrated by evaluating catalyst performance in a chemical reactor.

  4. How Non-Linearity and Grade-Level Differences Complicate the Validation of Observation Protocols

    ERIC Educational Resources Information Center

    Lazarev, Valeriy; Newman, Denis

    2013-01-01

    Teacher evaluation is currently a major policy issue at all levels of the K-12 system driven in large part by current US Department of Education requirements. The main objective of this study is to explore the patterns of relationship between observational scores and value-added measures of teacher performance in math classrooms and the variation…

  5. Observation of T-2 and HT-2 glucosides from Fusarium sporotrichioides by liquid chromatography coupled to tandem mass spectrometry (LC-MS/MS)

    USDA-ARS's Scientific Manuscript database

    Cultures of Fusarium sporotrichioides were extracted and subjected to evaluation by high performance liquid chromatography – tandem mass spectrometry (LC-MS/MS). Along with the expected T-2 and HT-2 toxins, compounds 162 m/z higher than the toxins were observed. Fragmentation behavior of the larger ...

  6. Development of a set of activities to evaluate the arm and hand function in children with obstetric brachial plexus lesion.

    PubMed

    Boeschoten, K H; Folmer, K B; van der Lee, J H; Nollet, F

    2007-02-01

    To develop an observational instrument that can be used to evaluate the quality of arm and hand skills in daily functional activities in children with obstetric brachial plexus lesion (OBPL). A set of functional activities was constructed and standardized, and the intra-observer reliability of the assessment of this set of activities was studied. Department of Occupational Therapy and Department of Rehabilitation Medicine, VU University Medical Centre. Twenty-six children with OBPL in the age range of 4-6 years. The children were asked to perform 47 bimanual activities, which were recorded on videotape. The videotapes were scored twice by the same occupational therapist. The percentage of agreement in scoring 'hand-use', 'speed' and 'assistance' was over 80% for a substantial number of activities, indicating a strong agreement. However, in scoring 'deviations in movements and body posture' the percentage of agreement was insufficient in most activities. This set of activities has good potential for assessment of the performance of functional activities in children with OBPL. This study, however, showed a number of difficulties in observing and scoring the activities that have to be considered when developing a standardized video observation.

  7. Acquisition and improvement of human motor skills: Learning through observation and practice

    NASA Technical Reports Server (NTRS)

    Iba, Wayne

    1991-01-01

    Skilled movement is an integral part of human existence. A better understanding of motor skills and their development is a prerequisite to the construction of truly flexible intelligent agents. We present MAEANDER, a computational model of human motor behavior that uniformly addresses both the acquisition of skills through observation and the improvement of skills through practice. MAEANDER consists of a sensory-effector interface, a memory of movements, and a set of performance and learning mechanisms that let it recognize and generate motor skills. The system initially acquires such skills by observing movements performed by another agent and constructing a concept hierarchy. Given a stored motor skill in memory, MAEANDER will cause an effector to behave appropriately. All learning involves changing the hierarchical memory of skill concepts to more closely correspond to either observed experience or to desired behaviors. We evaluated MAEANDER empirically with respect to how well it acquires and improves both artificial movement types and handwritten script letters from the alphabet. We also evaluated MAEANDER as a psychological model by comparing its behavior to robust phenomena in humans and by considering the richness of the predictions it makes.

  8. OSSE Assessment of Ocean Observing System Enhancements to Improve Coupled Tropical Cyclone Intensity Prediction

    NASA Astrophysics Data System (ADS)

    Halliwell, G. R., Jr.; Mehari, M. F.; Dong, J.; Kourafalou, V.; Atlas, R. M.; Kang, H.; Le Henaff, M.

    2016-02-01

    A new ocean OSSE system validated in the tropical/subtropical Atlantic Ocean is used to evaluate ocean observing strategies during the 2014 hurricane season with the goal of improving coupled tropical cyclone forecasts. Enhancements to the existing operational ocean observing system are evaluated prior to two storms, Edouard and Gonzalo, where ocean measurements were obtained during field experiments supported by the 2013 Disaster Relief Appropriation Act. For Gonzalo, a reference OSSE is performed to evaluate the impact of two ocean gliders deployed north and south of Puerto Rico and two Alamo profiling floats deployed in the same general region during most of the hurricane season. For Edouard, a reference OSSE is performed to evaluate impacts of the pre-storm ocean profile survey conducted by NOAA WP-3D aircraft. For both storms, additional OSSEs are then conducted to evaluate more extensive seasonal and pre-storm ocean observing strategies. These include (1) deploying a larger number of synthetic ocean gliders during the hurricane season, (2) deploying pre-storm synthetic thermistor chains or synthetic profiling floats along one or more "picket fence" lines that cross projected storm tracks, and (3) designing pre-storm airborne profiling surveys to have larger impacts than the actual pre-storm survey conducted for Edouard. Impacts are evaluated based on error reduction in ocean parameters important to SST cooling and hurricane intensity such as ocean heat content and the structure of the ocean eddy field. In all cases, ocean profiles that sample both temperature and salinity down to 1000m provide greater overall error reduction than shallower temperature profiles obtained from AXBTs and thermistor chains. Large spatial coverage with multiple instruments spanning a few degrees of longitude and latitude is necessary to sufficiently reduce ocean initialization errors over a region broad enough to significantly impact predicted surface enthalpy flux into the storm. Error reduction in hurricane intensity forecasts resulting from the additional ocean observations is then assessed by initializing the ocean component of the HYCOM-HWRF coupled prediction system with analyses produced by the OSSE system.

  9. The quadrant method measuring four points is as a reliable and accurate as the quadrant method in the evaluation after anatomical double-bundle ACL reconstruction.

    PubMed

    Mochizuki, Yuta; Kaneko, Takao; Kawahara, Keisuke; Toyoda, Shinya; Kono, Norihiko; Hada, Masaru; Ikegami, Hiroyasu; Musha, Yoshiro

    2017-11-20

    The quadrant method was described by Bernard et al. and it has been widely used for postoperative evaluation of anterior cruciate ligament (ACL) reconstruction. The purpose of this research is to further develop the quadrant method measuring four points, which we named four-point quadrant method, and to compare with the quadrant method. Three-dimensional computed tomography (3D-CT) analyses were performed in 25 patients who underwent double-bundle ACL reconstruction using the outside-in technique. The four points in this study's quadrant method were defined as point1-highest, point2-deepest, point3-lowest, and point4-shallowest, in femoral tunnel position. Value of depth and height in each point was measured. Antero-medial (AM) tunnel is (depth1, height2) and postero-lateral (PL) tunnel is (depth3, height4) in this four-point quadrant method. The 3D-CT images were evaluated independently by 2 orthopaedic surgeons. A second measurement was performed by both observers after a 4-week interval. Intra- and inter-observer reliability was calculated by means of intra-class correlation coefficient (ICC). Also, the accuracy of the method was evaluated against the quadrant method. Intra-observer reliability was almost perfect for both AM and PL tunnel (ICC > 0.81). Inter-observer reliability of AM tunnel was substantial (ICC > 0.61) and that of PL tunnel was almost perfect (ICC > 0.81). The AM tunnel position was 0.13% deep, 0.58% high and PL tunnel position was 0.01% shallow, 0.13% low compared to quadrant method. The four-point quadrant method was found to have high intra- and inter-observer reliability and accuracy. This method can evaluate the tunnel position regardless of the shape and morphology of the bone tunnel aperture for use of comparison and can provide measurement that can be compared with various reconstruction methods. The four-point quadrant method of this study is considered to have clinical relevance in that it is a detailed and accurate tool for evaluating femoral tunnel position after ACL reconstruction. Case series, Level IV.

  10. Influence of socioeconomic status on trauma center performance evaluations in a Canadian trauma system.

    PubMed

    Moore, Lynne; Turgeon, Alexis F; Sirois, Marie-Josée; Murat, Valérie; Lavoie, André

    2011-09-01

    Trauma center performance evaluations generally include adjustment for injury severity, age, and comorbidity. However, disparities across trauma centers may be due to other differences in source populations that are not accounted for, such as socioeconomic status (SES). We aimed to evaluate whether SES influences trauma center performance evaluations in an inclusive trauma system with universal access to health care. The study was based on data collected between 1999 and 2006 in a Canadian trauma system. Patient SES was quantified using an ecologic index of social and material deprivation. Performance evaluations were based on mortality adjusted using the Trauma Risk Adjustment Model. Agreement between performance results with and without additional adjustment for SES was evaluated with correlation coefficients. The study sample comprised a total of 71,784 patients from 48 trauma centers, including 3,828 deaths within 30 days (4.5%) and 5,549 deaths within 6 months (7.7%). The proportion of patients in the highest quintile of social and material deprivation varied from 3% to 43% and from 11% to 90% across hospitals, respectively. The correlation between performance results with or without adjustment for SES was almost perfect (r = 0.997; 95% CI 0.995-0.998) and the same hospital outliers were identified. We observed an important variation in SES across trauma centers but no change in risk-adjusted mortality estimates when SES was added to adjustment models. Results suggest that after adjustment for injury severity, age, comorbidity, and transfer status, disparities in SES across trauma center source populations do not influence trauma center performance evaluations in a system offering universal health coverage. Copyright © 2011 American College of Surgeons. Published by Elsevier Inc. All rights reserved.

  11. A Regional Climate Model Evaluation System based on Satellite and other Observations

    NASA Astrophysics Data System (ADS)

    Lean, P.; Kim, J.; Waliser, D. E.; Hall, A. D.; Mattmann, C. A.; Granger, S. L.; Case, K.; Goodale, C.; Hart, A.; Zimdars, P.; Guan, B.; Molotch, N. P.; Kaki, S.

    2010-12-01

    Regional climate models are a fundamental tool needed for downscaling global climate simulations and projections, such as those contributing to the Coupled Model Intercomparison Projects (CMIPs) that form the basis of the IPCC Assessment Reports. The regional modeling process provides the means to accommodate higher resolution and a greater complexity of Earth System processes. Evaluation of both the global and regional climate models against observations is essential to identify model weaknesses and to direct future model development efforts focused on reducing the uncertainty associated with climate projections. However, the lack of reliable observational data and the lack of formal tools are among the serious limitations to addressing these objectives. Recent satellite observations are particularly useful as they provide a wealth of information on many different aspects of the climate system, but due to their large volume and the difficulties associated with accessing and using the data, these datasets have been generally underutilized in model evaluation studies. Recognizing this problem, NASA JPL / UCLA is developing a model evaluation system to help make satellite observations, in conjunction with in-situ, assimilated, and reanalysis datasets, more readily accessible to the modeling community. The system includes a central database to store multiple datasets in a common format and codes for calculating predefined statistical metrics to assess model performance. This allows the time taken to compare model simulations with satellite observations to be reduced from weeks to days. Early results from the use of this new model evaluation system for evaluating regional climate simulations over California/western US regions will be presented.

  12. Performance evaluation of Bragg coherent diffraction imaging

    DOE PAGES

    Ozturk, Hande; Huang, X.; Yan, H.; ...

    2017-10-03

    In this study, we present a numerical framework for modeling three-dimensional (3D) diffraction data in Bragg coherent diffraction imaging (Bragg CDI) experiments and evaluating the quality of obtained 3D complex-valued real-space images recovered by reconstruction algorithms under controlled conditions. The approach is used to systematically explore the performance and the detection limit of this phase-retrieval-based microscopy tool. The numerical investigation suggests that the superb performance of Bragg CDI is achieved with an oversampling ratio above 30 and a detection dynamic range above 6 orders. The observed performance degradation subject to the data binning processes is also studied. Furthermore, this numerical tool can be used to optimize experimental parameters and has the potential to significantly improve the throughput of the Bragg CDI method.

  13. Summer Indoor Heat Pump Water Heater Evaluation in a Hot-Dry Climate

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hoeschele, Marc; Seitzler, Matthew

    Heat pump water heaters offer a significant opportunity to improve water heating performance for the over 40% of U.S. households that heat domestic hot water using electric resistance storage water heaters. Numerous field studies have also been completed documenting performance in a variety of climates and applications. More recent evaluation efforts have focused attention on the performance of May through September 2014, with ongoing winter monitoring being sponsored by California utility partners. Summer results show favorable system performance with extrapolated annual water heating savings of 1,466 to 2,300 kWh per year, based on the observed hot water loads. Additional summer space cooling benefits savings of 121 to 135 kWh per year were projected, further increasing the water energy savings.

  14. Summer Indoor Heat Pump Water Heater Evaluation in a Hot-Dry Climate

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hoeschele, Marc; Seitzler, Matthew

    2017-05-01

    Heat pump water heaters offer a significant opportunity to improve water heating performance for the over 40% of U.S. households that heat domestic hot water using electric resistance storage water heaters. Numerous field studies have also been completed documenting performance in a variety of climates and applications. More recent evaluation efforts have focused attention on the performance of May through September 2014, with ongoing winter monitoring being sponsored by California utility partners. Summer results show favorable system performance with extrapolated annual water heating savings of 1,466 to 2,300 kWh per year, based on the observed hot water loads. Additional summer space cooling benefits savings of 121 to 135 kWh per year were projected, further increasing the water energy savings.

  15. An accurate evaluation of the performance of asynchronous DS-CDMA systems with zero-correlation-zone coding in Rayleigh fading

    NASA Astrophysics Data System (ADS)

    Walker, Ernest; Chen, Xinjia; Cooper, Reginald L.

    2010-04-01

    An arbitrarily accurate approach is used to determine the bit-error rate (BER) performance for generalized asynchronous DS-CDMA systems, in Gaussian noise with Rayleigh fading. In this paper, and the sequel, new theoretical work has been contributed which substantially enhances existing performance analysis formulations. Major contributions include: substantial computational complexity reduction, including a priori BER accuracy bounding; an analytical approach that facilitates performance evaluation for systems with arbitrary spectral spreading distributions, with non-uniform transmission delay distributions. Using prior results, augmented by these enhancements, a generalized DS-CDMA system model is constructed and used to evaluate the BER performance, in a variety of scenarios. In this paper, the generalized system modeling was used to evaluate the performance of both Walsh-Hadamard (WH) and Walsh-Hadamard-seeded zero-correlation-zone (WH-ZCZ) coding. The selection of these codes was informed by the observation that WH codes contain N spectral spreading values (0 to N - 1), one for each code sequence; while WH-ZCZ codes contain only two spectral spreading values (N/2 - 1,N/2); where N is the sequence length in chips. Since these codes span the spectral spreading range for DS-CDMA coding, by invoking an induction argument, the generalization of the system model is sufficiently supported. The results in this paper, and the sequel, support the claim that an arbitrarily accurate performance analysis for DS-CDMA systems can be evaluated over the full range of binary coding, with minimal computational complexity.
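
    The remark that WH codes contain N distinct values, 0 to N - 1, one per code sequence, can be checked with a small sketch. It assumes that the "spectral spreading value" of a sequence corresponds to its number of chip-to-chip sign changes; the abstract does not define the term, so that reading is an assumption. The codes are built with the standard Sylvester construction, and the function names are illustrative only. WH-ZCZ codes are not constructed here.

```python
def sylvester_hadamard(n):
    """Return the n x n Sylvester-type Hadamard matrix (n must be a power of two)."""
    if n < 1 or n & (n - 1):
        raise ValueError("n must be a power of two")
    h = [[1]]
    while len(h) < n:
        # Doubling step: [[H, H], [H, -H]]
        h = [row + row for row in h] + [row + [-x for x in row] for row in h]
    return h


def sign_changes(seq):
    """Count transitions between +1 and -1 along a chip sequence."""
    return sum(1 for a, b in zip(seq, seq[1:]) if a != b)


N = 8  # sequence length in chips (illustrative choice)
codes = sylvester_hadamard(N)

# Assumption: sign-change count stands in for the "spectral spreading value".
for row in codes:
    print(row, "sign changes:", sign_changes(row))

# Each count from 0 to N - 1 appears exactly once across the N code sequences.
print(sorted(sign_changes(row) for row in codes) == list(range(N)))
```

    The rows are also mutually orthogonal, which is the property that makes them usable as spreading codes in the synchronous case.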

  16. Assessing hospital disaster preparedness: a comparison of an on-site survey, directly observed drill performance, and video analysis of teamwork.

    PubMed

    Kaji, Amy H; Langford, Vinette; Lewis, Roger J

    2008-09-01

    There is currently no validated method for assessing hospital disaster preparedness. We determine the degree of correlation between the results of 3 methods for assessing hospital disaster preparedness: administration of an on-site survey, drill observation using a structured evaluation tool, and video analysis of team performance in the hospital incident command center. This was a prospective, observational study conducted during a regional disaster drill, comparing the results from an on-site survey, a structured disaster drill evaluation tool, and a video analysis of teamwork, performed at 6 911-receiving hospitals in Los Angeles County, CA. The on-site survey was conducted separately from the drill and assessed hospital disaster plan structure, vendor agreements, modes of communication, medical and surgical supplies, involvement of law enforcement, mutual aid agreements with other facilities, drills and training, surge capacity, decontamination capability, and pharmaceutical stockpiles. The drill evaluation tool, developed by Johns Hopkins University under contract from the Agency for Healthcare Research and Quality, was used to assess various aspects of drill performance, such as the availability of the hospital disaster plan, the geographic configuration of the incident command center, whether drill participants were identifiable, whether the noise level interfered with effective communication, and how often key information (eg, number of available staffed floor, intensive care, and isolation beds; number of arriving victims; expected triage level of victims; number of potential discharges) was received by the incident command center. Teamwork behaviors in the incident command center were quantitatively assessed, using the MedTeams analysis of the video recordings obtained during the disaster drill. Spearman rank correlations of the results between pair-wise groupings of the 3 assessment methods were calculated. The 3 evaluation methods demonstrated qualitatively different results with respect to each hospital's level of disaster preparedness. The Spearman rank correlation coefficient between the results of the on-site survey and the video analysis of teamwork was -0.34; between the results of the on-site survey and the structured drill evaluation tool, 0.15; and between the results of the video analysis and the drill evaluation tool, 0.82. The disparate results obtained from the 3 methods suggest that each measures distinct aspects of disaster preparedness, and perhaps no single method adequately characterizes overall hospital preparedness.
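
    The pair-wise Spearman rank correlations described above can be sketched as follows. This is a minimal illustration, not the study's analysis: the scores for the six hospitals are invented, the variable and function names are hypothetical, and ties are handled with average ranks, which is the usual convention but is not stated in the abstract.

```python
from itertools import combinations
from math import sqrt


def ranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


# Hypothetical scores for six hospitals under the three assessment methods.
methods = {
    "on-site survey": [72, 85, 64, 90, 78, 69],
    "drill evaluation tool": [55, 60, 70, 58, 66, 62],
    "video analysis": [3.1, 3.4, 4.2, 3.0, 3.9, 3.6],
}

for (name_a, a), (name_b, b) in combinations(methods.items(), 2):
    print(f"{name_a} vs {name_b}: rho = {spearman(a, b):.2f}")
```

    With only six hospitals, each coefficient rests on six ranked pairs, so individual values should be read with caution.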

  17. Temporal variations in the potential hydrological performance of extensive green roof systems

    NASA Astrophysics Data System (ADS)

    De-Ville, Simon; Menon, Manoj; Stovin, Virginia

    2018-03-01

    Existing literature provides contradictory information about variation in potential green roof hydrological performance over time. This study has evaluated a long-term hydrological monitoring record from a series of extensive green roof test beds to identify long-term evolutions and sub-annual (seasonal) variations in potential hydrological performance. Monitoring of nine differently-configured extensive green roof test beds took place over a period of 6 years in Sheffield, UK. Long-term evolutions and sub-annual trends in maximum potential retention performance were identified through physical monitoring of substrate field capacity over time. An independent evaluation of temporal variations in detention performance was undertaken through the fitting of reservoir-routing model parameters. Aggregation of the resulting retention and detention variations permitted the prediction of extensive green roof hydrological performance in response to a 1-in-30-year 1-h summer design storm for Sheffield, UK, which facilitated the comparison of multi and sub-annual hydrological performance variations. Sub-annual (seasonal) variation was found to be significantly greater than long-term evolution. Potential retention performance increased by up to 12% after 5-years, whilst the maximum sub-annual variation in potential retention was 27%. For vegetated roof configurations, a 4% long-term improvement was observed for detention performance, compared to a maximum 63% sub-annual variation. Consistent long-term reductions in detention performance were observed in unvegetated roof configurations, with a non-standard expanded-clay substrate experiencing a 45% reduction in peak attenuation over 5-years. Conventional roof configurations exhibit stable long-term hydrological performance, but are nonetheless subject to sub-annual variation.

  18. Salicylic acid for the treatment of melasma: new acquisitions for monitoring the clinical improvement.

    PubMed

    Fabbrocini, Gabriella; De Vita, Valerio; Marasca, Claudio; Palmisano, Franco; Monfrecola, Giuseppe

    2013-11-01

    The Melasma Area and Severity Index (MASI) and the Melasma Severity Score (MSS) are calculated on the basis of only a subjective clinical assessment. This raises the need to have an objective score, uniform in the evaluation by different clinicians. The purpose of this study was to establish if the images by Canfield Reveal Imager can be correlated to MASI score to better evaluate the clinical efficacy of salicylic acid 33% peeling in the treatment of melasma respect to the clinical observation. The study was a voluntary observational study. Twenty female patients affected with melasma, aged between 30 and 60 years, were included in the study. Treatment with salicylic acid 33% was performed once a month, for a total of four times. The dermatologist (Doc A) examined each patient's melasma areas using MASI score, at the face-to-face observation and at Reveal images evaluation during the first (T0) and the end point time (T4). Digital photographs were also evaluated by another experienced dermatologist (Doc B), who has never seen clinically the patients before and who evaluated MASI score by Reveal images at time T0 and T4. Student's t-test and linear regression test were performed, showing statistically significant values comparing MASI score obtained by digital photo and MASI score obtained clinically. The monitoring of the improvement by Reveal images can optimize the treatment approach and the efficacy of same dermocosmetics procedures can be revised following standard criteria. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  19. Observation strategies with the Fermi Gamma-ray Space Telescope

    NASA Astrophysics Data System (ADS)

    McEnery, Julie E.; Fermi mission Teams

    2015-01-01

    During the first few years of the Fermi mission, the default observation mode has been an all-sky survey, optimized to provide relatively uniform coverage of the entire sky every three hours. Over 95% of the mission has been performed in this observation mode. However, Fermi is capable of flexible survey mode patterns, and inertially pointed observations both of which allow increased coverage of selected parts of the sky. In this presentation, we will describe the types of observations that Fermi can make, the relative advantages and disadvantages of various observations, and provide guidelines to help Fermi users plan and evaluate non-standard observations.

  20. A simple method for low-contrast detectability, image quality and dose optimisation with CT iterative reconstruction algorithms and model observers.

    PubMed

    Bellesi, Luca; Wyttenbach, Rolf; Gaudino, Diego; Colleoni, Paolo; Pupillo, Francesco; Carrara, Mauro; Braghetti, Antonio; Puligheddu, Carla; Presilla, Stefano

    2017-01-01

    The aim of this work was to evaluate detection of low-contrast objects and image quality in computed tomography (CT) phantom images acquired at different tube loadings (i.e. mAs) and reconstructed with different algorithms, in order to find appropriate settings to reduce the dose to the patient without any image detriment. Images of supraslice low-contrast objects of a CT phantom were acquired using different mAs values. Images were reconstructed using filtered back projection (FBP), hybrid and iterative model-based methods. Image quality parameters were evaluated in terms of modulation transfer function, noise, and uniformity using two software resources. For the definition of low-contrast detectability, studies based on both human (i.e. four-alternative forced-choice test) and model observers were performed across the various images. Compared to FBP, image quality parameters were improved by using iterative reconstruction (IR) algorithms. In particular, IR model-based methods provided a 60% noise reduction and a 70% dose reduction, preserving image quality and low-contrast detectability for human radiological evaluation. According to the model observer, the diameters of the minimum detectable detail were around 2 mm (up to 100 mAs). Below 100 mAs, the model observer was unable to provide a result. IR methods improve CT protocol quality, providing a potential dose reduction while maintaining a good image detectability. Model observer can in principle be useful to assist human performance in CT low-contrast detection tasks and in dose optimisation.

  1. Content Validation and Evaluation of an Endovascular Teamwork Assessment Tool.

    PubMed

    Hull, L; Bicknell, C; Patel, K; Vyas, R; Van Herzeele, I; Sevdalis, N; Rudarakanchana, N

    2016-07-01

    To modify, content validate, and evaluate a teamwork assessment tool for use in endovascular surgery. A multistage, multimethod study was conducted. Stage 1 included expert review and modification of the existing Observational Teamwork Assessment for Surgery (OTAS) tool. Stage 2 included identification of additional exemplar behaviours contributing to effective teamwork and enhanced patient safety in endovascular surgery (using real-time observation, focus groups, and semistructured interviews of multidisciplinary teams). Stage 3 included content validation of exemplar behaviours using expert consensus according to established psychometric recommendations and evaluation of structure, content, feasibility, and usability of the Endovascular Observational Teamwork Assessment Tool (Endo-OTAS) by an expert multidisciplinary panel. Stage 4 included final team expert review of exemplars. OTAS core team behaviours were maintained (communication, coordination, cooperation, leadership, team monitoring). Of the 114 OTAS behavioural exemplars, 19 were modified, four removed, and 39 additional endovascular-specific behaviours identified. Content validation of these 153 exemplar behaviours showed that 113/153 (73.9%) reached the predetermined Item-Content Validity Index rating for teamwork and/or patient safety. After expert team review, 140/153 (91.5%) exemplars were deemed to warrant inclusion in the tool. More than 90% of the expert panel agreed that Endo-OTAS is an appropriate teamwork assessment tool with observable behaviours. Some concerns were noted about the time required to conduct observations and provide performance feedback. Endo-OTAS is a novel teamwork assessment tool, with evidence for content validity and relevance to endovascular teams. Endo-OTAS enables systematic objective assessment of the quality of team performance during endovascular procedures. Copyright © 2016. Published by Elsevier Ltd.

  2. [Work as a source of pleasure: evaluating a Psychosocial Care Center team].

    PubMed

    Glanzner, Cecília Helena; Olschowsky, Agnes; Kantorski, Luciane Prado

    2011-06-01

    The objective of this study was to evaluate the pleasure at work felt by the members of a Psychosocial Care Center team. This qualitative case study used Fourth Generation Evaluation. This study was performed in Foz do Iguaçu, Parana, Brazil, in November and December 2006. Participants were 10 team members. Data collection was performed through observation and individual interviews. The analysis was initiated at the same time as the data collection, and the final analysis was performed as per the following steps: data ordering, classification and final analysis. The following analysis themes were developed: work characteristics at the psychological care center, suffering and coping with suffering at work. During the evaluation, the participants showed pleasure and fulfillment with their work by expressing pride, fulfillment and appreciation of what they deliver. Pleasure occurs during the development of psychosocial care, because they always have the freedom to rearrange their manner of working, making it possible to develop activities and attitudes capable of giving them pleasure.

  3. Recognizing Disguised Faces: Human and Machine Evaluation

    PubMed Central

    Dhamecha, Tejas Indulal; Singh, Richa; Vatsa, Mayank; Kumar, Ajay

    2014-01-01

    Face verification, though an easy task for humans, is a long-standing open research area. This is largely due to the challenging covariates, such as disguise and aging, which make it very hard to accurately verify the identity of a person. This paper investigates human and machine performance for recognizing/verifying disguised faces. Performance is also evaluated under familiarity and match/mismatch with the ethnicity of observers. The findings of this study are used to develop an automated algorithm to verify the faces presented under disguise variations. We use automatically localized feature descriptors which can identify disguised face patches and account for this information to achieve improved matching accuracy. The performance of the proposed algorithm is evaluated on the IIIT-Delhi Disguise database that contains images pertaining to 75 subjects with different kinds of disguise variations. The experiments suggest that the proposed algorithm can outperform a popular commercial system and evaluates them against humans in matching disguised face images. PMID:25029188

  4. OSSE Evaluation of Aircraft Reconnaissance Observations and their Impact on Hurricane Analyses and Forecasts

    NASA Astrophysics Data System (ADS)

    Ryan, K. E.; Bucci, L. R.; Delgado, J.; Atlas, R. M.; Murillo, S.; Dodge, P.

    2016-12-01

    NOAA/AOML's Hurricane Research Division (HRD) annually conducts its Hurricane Field Program during which observations are collected via NOAA aircraft to improve the understanding and prediction of hurricanes. Mission experiments suggest a variety of flight patterns and sampling strategies aimed towards their respective goals described by the Intensity Forecasting Experiment (IFEX; Rogers et al., BAMS, 2006, 2013), a collaborative effort among HRD, NHC, and EMC. Evaluating the potential impact of various trade-offs in track design is valuable for determining the optimal air reconnaissance flight pattern for a prospective mission. AOML's HRD has developed a system for performing regional Observing System Simulation Experiments (OSSEs) to assess the potential impact of proposed observing systems on hurricane track and intensity forecasts and analyses. This study focuses on investigating the potential impact of proposed aircraft reconnaissance observing system designs. Aircraft instrument and flight level retrievals were simulated from a regional WRF ARW Nature Run (Nolan et al., 2013) spanning 13 days, covering the life cycle of a rapidly intensifying Atlantic tropical cyclone. The aircraft trajectories of NOAA aircraft are simulated in a variety of ways and are evaluated to examine the potential impact of aircraft reconnaissance observations on hurricane track and intensity forecasts.

  5. Factors associated with compliance to AHA/ACC performance measures in a myocardial infarction system of care in Brazil.

    PubMed

    Lana, Maria Letícia L; Beaton, Andrea Z; Brant, Luisa C C; Bozzi, Isadora C R S; de Magalhães, Osias; Castro, Luiz Ricardo de A; da Silva Júnior, Francisco César T; da Silva, José Luiz P; Ribeiro, Antonio Luiz P; Nascimento, Bruno R

    2017-08-01

    To evaluate compliance with American Heart Association/American College of Cardiology (AHA/ACC) performance measures for adults with acute myocardial infarction (AMI) and to investigate the factors associated with compliance, in an AMI System of Care in Brazil. Observational longitudinal study. A high-complexity University Hospital, part of the AMI System of Care implemented in Belo Horizonte, Brazil, in 2010. Of note, 1129 patients with ST-elevation myocardial infarction (STEMI) and non-ST-elevation myocardial infarction (NSTEMI) admitted to a single center over 36 months (between 2011 and 2014). Compliance with 13 pre-specified AHA/ACC AMI performance measures was evaluated for patients with AMI, observing exclusion criteria and appropriate numerators and denominators. Median compliance was calculated and variables independently associated with compliance rates were evaluated. Median age was 60 (51/68) years, 67.7% male, 69.8% presented with STEMI and hospital mortality was 8.7%. Median compliance with performance measures was 83% (75/88). Among patients with STEMI, 56% received reperfusion therapy. Overall, 67.3% of patients complied with ≥80% of quality measures. Factors independently associated with better compliance were later date of presentation (semester), likely reflecting ongoing training (OR = 1.19, 95% CI: 1.10-1.28, P < 0.001), male gender (OR = 1.33, 95% CI: 1.00-1.76, P < 0.046), Killip I/II on admission (OR = 1.95, 95% CI: 1.36-2.80, P < 0.001) and diagnosis of NSTEMI (OR = 5.0, 95% CI: 3.51-7.11, P < 0.001). Compliance with AHA/ACC AMI performance measures remains below target in Brazil, but the time trends observed suggest improvement. Continuing education, reduction of system delays and prioritizing high-risk groups are needed to optimize AMI systems of care and improve patient outcomes. © The Author 2017. Published by Oxford University Press in association with the International Society for Quality in Health Care. All rights reserved. 

  6. Direct observation of students during clerkship rotations: a multiyear descriptive study.

    PubMed

    Howley, Lisa D; Wilson, William G

    2004-03-01

    To determine how often students report that they are observed while performing physical examinations and taking histories during clerkship rotations. From 1999-2001, 397 students at the University of Virginia School of Medicine were asked at the end of their third year to report the number of times they had been observed by a resident or faculty member while taking histories and performing physical examinations on six rotations. Three hundred and forty-five students (87%) returned the survey instrument; of these, 322 (81%) returned instruments with complete information. On average, the majority reported that they had never been observed by a faculty member while taking a history (51%), performing a focused physical examination (54%), or a complete physical examination (81%). The majority (60%) reported that they had never been observed by a resident while performing a complete physical examination. Faculty observations occurred most frequently during the four-week family medicine rotation and least frequently during the 12-week surgery rotation. The length of the clerkship rotation was inversely related to the number of reported observations, chi(2) (5, n = 295) = 127.85, p <.000. Although alternative assessments of clinical skills are becoming more common in medical education, faculty ratings based on direct observation are still prominent. The data in this study reflect that these observations may actually be occurring quite infrequently, if at all. Decreasing the evaluative weight of faculty and resident ratings during the clerkship rotation may be necessary. Otherwise, efforts should be made to increase the validity of these ratings.

  7. Multi-objective optimization for generating a weighted multi-model ensemble

    NASA Astrophysics Data System (ADS)

    Lee, H.

    2017-12-01

    Many studies have demonstrated that multi-model ensembles generally show better skill than each ensemble member. When generating weighted multi-model ensembles, the first step is measuring the performance of individual model simulations using observations. There is a consensus on the assignment of weighting factors based on a single evaluation metric. When considering only one evaluation metric, the weighting factor for each model is proportional to a performance score or inversely proportional to an error for the model. While this conventional approach can provide appropriate combinations of multiple models, the approach confronts a big challenge when there are multiple metrics under consideration. When considering multiple evaluation metrics, it is obvious that a simple averaging of multiple performance scores or model ranks does not address the trade-off problem between conflicting metrics. So far, there seems to be no best method to generate weighted multi-model ensembles based on multiple performance metrics. The current study applies the multi-objective optimization, a mathematical process that provides a set of optimal trade-off solutions based on a range of evaluation metrics, to combining multiple performance metrics for the global climate models and their dynamically downscaled regional climate simulations over North America and generating a weighted multi-model ensemble. NASA satellite data and the Regional Climate Model Evaluation System (RCMES) software toolkit are used for assessment of the climate simulations. Overall, the performance of each model differs markedly with strong seasonal dependence. Because of the considerable variability across the climate simulations, it is important to evaluate models systematically and make future projections by assigning optimized weighting factors to the models with relatively good performance. Our results indicate that the optimally weighted multi-model ensemble always shows better performance than an arithmetic ensemble mean and may provide reliable future projections.
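
    The conventional single-metric weighting mentioned in this abstract (weights inversely proportional to each model's error) can be sketched as follows. This is a minimal illustration only, not the multi-objective method of the study; the function names and the RMSE values are hypothetical.

```python
def inverse_error_weights(errors):
    """Return weights inversely proportional to each model's error, summing to 1."""
    if any(e <= 0 for e in errors):
        raise ValueError("errors must be strictly positive")
    inverse = [1.0 / e for e in errors]
    total = sum(inverse)
    return [v / total for v in inverse]


def weighted_ensemble(simulations, weights):
    """Weighted mean across models; simulations is a list of equal-length series."""
    n_points = len(simulations[0])
    return [
        sum(w * sim[i] for w, sim in zip(weights, simulations))
        for i in range(n_points)
    ]


# Hypothetical RMSE values for three models (illustrative numbers only).
rmse = [1.0, 2.0, 4.0]
weights = inverse_error_weights(rmse)

# Hypothetical simulated series, one per model.
simulations = [[1.0, 1.0], [2.0, 2.0], [4.0, 4.0]]
ensemble = weighted_ensemble(simulations, weights)
```

    With a single metric the model with the smallest error receives the largest weight; the trade-off problem the abstract describes arises only when several such metrics disagree.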

  8. Phosphorus component in AnnAGNPS

    USGS Publications Warehouse

    Yuan, Y.; Bingner, R.L.; Theurer, F.D.; Rebich, R.A.; Moore, P.A.

    2005-01-01

    The USDA Annualized Agricultural Non-Point Source Pollution model (AnnAGNPS) has been developed to aid in evaluation of watershed response to agricultural management practices. Previous studies have demonstrated the capability of the model to simulate runoff and sediment, but not phosphorus (P). The main purpose of this article is to evaluate the performance of AnnAGNPS on P simulation using comparisons with measurements from the Deep Hollow watershed of the Mississippi Delta Management Systems Evaluation Area (MDMSEA) project. A sensitivity analysis was performed to identify input parameters whose impact is the greatest on P yields. Sensitivity analysis results indicate that the most sensitive variables of those selected are initial soil P contents, P application rate, and plant P uptake. AnnAGNPS simulations of dissolved P yield do not agree well with observed dissolved P yield (Nash-Sutcliffe coefficient of efficiency of 0.34, R2 of 0.51, and slope of 0.24); however, AnnAGNPS simulations of total P yield agree well with observed total P yield (Nash-Sutcliffe coefficient of efficiency of 0.85, R2 of 0.88, and slope of 0.83). The difference in dissolved P yield may be attributed to limitations in model simulation of P processes. Uncertainties in input parameter selections also affect the model's performance.
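
    For readers unfamiliar with the Nash-Sutcliffe coefficient of efficiency quoted above, a minimal sketch of the standard definition follows (one minus the ratio of the squared simulation error to the variance of the observations about their mean). The function name and sample values are hypothetical and are not taken from the AnnAGNPS study.

```python
def nash_sutcliffe(observed, simulated):
    """Nash-Sutcliffe efficiency: 1 is a perfect fit, 0 matches the observed mean."""
    if not observed or len(observed) != len(simulated):
        raise ValueError("series must be non-empty and of equal length")
    mean_obs = sum(observed) / len(observed)
    # Sum of squared differences between observations and simulations.
    residual = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    # Sum of squared deviations of the observations from their mean.
    spread = sum((o - mean_obs) ** 2 for o in observed)
    if spread == 0:
        raise ValueError("observed series has zero variance")
    return 1.0 - residual / spread


# Hypothetical observed yields and two hypothetical simulations.
observed = [1.0, 2.0, 3.0]
perfect = nash_sutcliffe(observed, [1.0, 2.0, 3.0])
mean_only = nash_sutcliffe(observed, [2.0, 2.0, 2.0])
```

    A simulation that merely reproduces the observed mean scores 0, which gives some context for values such as 0.34 and 0.85.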

  9. Direct metal laser sintering titanium dental implants: a review of the current literature.

    PubMed

    Mangano, F; Chambrone, L; van Noort, R; Miller, C; Hatton, P; Mangano, C

    2014-01-01

    Statement of Problem. Direct metal laser sintering (DMLS) is a technology that allows fabrication of complex-shaped objects from powder-based materials, according to a three-dimensional (3D) computer model. With DMLS, it is possible to fabricate titanium dental implants with an inherently porous surface, a key property required of implantation devices. Objective. The aim of this review was to evaluate the evidence for the reliability of DMLS titanium dental implants and their clinical and histologic/histomorphometric outcomes, as well as their mechanical properties. Materials and Methods. Electronic database searches were performed. Inclusion criteria were clinical and radiographic studies, histologic/histomorphometric studies in humans and animals, mechanical evaluations, and in vitro cell culture studies on DMLS titanium implants. Meta-analysis could be performed only for randomized controlled trials (RCTs); to evaluate the methodological quality of observational human studies, the Newcastle-Ottawa scale (NOS) was used. Results. Twenty-seven studies were included in this review. No RCTs were found, and meta-analysis could not be performed. The outcomes of observational human studies were assessed using the NOS: these studies showed medium methodological quality. Conclusions. Several studies have demonstrated the potential for the use of DMLS titanium implants. However, further studies that demonstrate the benefits of DMLS implants over conventional implants are needed.

  10. Direct Metal Laser Sintering Titanium Dental Implants: A Review of the Current Literature

    PubMed Central

    Mangano, F.; Chambrone, L.; van Noort, R.; Miller, C.; Hatton, P.; Mangano, C.

    2014-01-01

    Statement of Problem. Direct metal laser sintering (DMLS) is a technology that allows fabrication of complex-shaped objects from powder-based materials, according to a three-dimensional (3D) computer model. With DMLS, it is possible to fabricate titanium dental implants with an inherently porous surface, a key property required of implantation devices. Objective. The aim of this review was to evaluate the evidence for the reliability of DMLS titanium dental implants and their clinical and histologic/histomorphometric outcomes, as well as their mechanical properties. Materials and Methods. Electronic database searches were performed. Inclusion criteria were clinical and radiographic studies, histologic/histomorphometric studies in humans and animals, mechanical evaluations, and in vitro cell culture studies on DMLS titanium implants. Meta-analysis could be performed only for randomized controlled trials (RCTs); to evaluate the methodological quality of observational human studies, the Newcastle-Ottawa scale (NOS) was used. Results. Twenty-seven studies were included in this review. No RCTs were found, and meta-analysis could not be performed. The outcomes of observational human studies were assessed using the NOS: these studies showed medium methodological quality. Conclusions. Several studies have demonstrated the potential for the use of DMLS titanium implants. However, further studies that demonstrate the benefits of DMLS implants over conventional implants are needed. PMID:25525434

  11. Teaching leadership in trauma resuscitation: Immediate feedback from a real-time, competency-based evaluation tool shows long-term improvement in resident performance.

    PubMed

    Gregg, Shea C; Heffernan, Daithi S; Connolly, Michael D; Stephen, Andrew H; Leuckel, Stephanie N; Harrington, David T; Machan, Jason T; Adams, Charles A; Cioffi, William G

    2016-10-01

    Limited data exist on how to develop resident leadership and communication skills during actual trauma resuscitations. An evaluation tool was developed to grade senior resident performance as the team leader during full-trauma-team activations. Thirty actions that demonstrated the Accreditation Council for Graduate Medical Education core competencies were graded on a Likert scale of 1 (poor) to 5 (exceptional). These actions were grouped by their respective core competencies on 5 × 7-inch index cards. In Phase 1, baseline performance scores were obtained. In Phase 2, trauma-focused communication in-services were conducted early in the academic year, and immediate, personalized feedback sessions were performed after resuscitations based on the evaluation tool. In Phase 3, residents received only evaluation-based feedback following resuscitations. In Phase 1 (October 2009 to April 2010), 27 evaluations were performed on 10 residents. In Phase 2 (April 2010 to October 2010), 28 evaluations were performed on nine residents. In Phase 3 (October 2010 to January 2012), 44 evaluations were performed on 13 residents. Total scores improved significantly between Phases 1 and 2 (p = 0.003) and remained elevated throughout Phase 3. When analyzing performance by competency, significant improvement between Phases 1 and 2 (p < 0.05) was seen in all competencies (patient care, knowledge, system-based practice, practice-based learning) with the exception of "communication and professionalism" (p = 0.56). Statistically similar scores were observed between Phases 2 and 3 in all competencies with the exception of "medical knowledge," which showed ongoing significant improvement (p = 0.003). Directed resident feedback sessions utilizing data from a real-time, competency-based evaluation tool have allowed us to improve our residents' abilities to lead trauma resuscitations over a 30-month period. 
Given pressures to maximize clinical educational opportunities among work-hour constraints, such a model may help decrease the need for costly simulation-based training. Therapeutic study, level III.

  12. Enhanced just-in-time plus protocol for optical burst switching networks

    NASA Astrophysics Data System (ADS)

    Rodrigues, Joel J. P. C.; Gregório, José M. B.; Vasilakos, Athanasios V.

    2010-07-01

    We propose a new one-way resource reservation protocol for optical burst switching (OBS) networks, called Enhanced Just-in-Time Plus (E-JIT+). The protocol is described in detail, and its formal specification is presented, following an extended finite state machine approach. The performance evaluation of E-JIT+ is analyzed in comparison with other proposed OBS protocols (JIT+ and E-JIT) for the following network topologies: rings; degree-two, degree-three, and degree-four chordal rings; mesh-torus; NSFNET; ARPANET; FCCN-NET; and the European Optical Network. We evaluate and compare the performance of the different protocols in terms of burst loss probability, taking into account the most important OBS network parameters. It was shown that E-JIT+ performs better than available one-way resource reservation protocols for all the evaluated network topologies. Moreover, the scalability of E-JIT+ was observed, and when the network traffic increases, the burst loss probability also increases, leading to a worse network performance.

  13. MSFC Skylab Kohoutek experiments mission evaluation

    NASA Technical Reports Server (NTRS)

    1974-01-01

    The Comet Kohoutek was documented by the Skylab 4 experiments' observations. The experiment concepts, hardware, operational performance and anomalies are discussed. Experiments which viewed the comet were mainly through the SAL and ATM, but some were handheld and EVA.

  14. Space Suit Thermal Dynamics

    NASA Technical Reports Server (NTRS)

    Campbell, Anthony B.; Nair, Satish S.; Miles, John B.; Iovine, John V.; Lin, Chin H.

    1998-01-01

    The present NASA space suit (the Shuttle EMU) is a self-contained environmental control system, providing life support, environmental protection, earth-like mobility, and communications. This study considers the thermal dynamics of the space suit as they relate to astronaut thermal comfort control. A detailed dynamic lumped capacitance thermal model of the present space suit is used to analyze the thermal dynamics of the suit with observations verified using experimental and flight data. Prior to using the model to define performance characteristics and limitations for the space suit, the model is first evaluated and improved. This evaluation includes determining the effect of various model parameters on model performance and quantifying various temperature prediction errors in terms of heat transfer and heat storage. The observations from this study are being utilized in two future design efforts, automatic thermal comfort control design for the present space suit and design of future space suit systems for Space Station, Lunar, and Martian missions.

  15. [Design and validation of a scale to assess self-regulation of eating habits in Mexican university students].

    PubMed

    Campos-Uscanga, Yolanda; Lagunes Córdoba, Roberto; Morales-Romero, Jaime; Romo-González, Tania

    2015-03-01

    Healthy eating habits promote wellness and prevent disease; however, despite the intention to change a bad habit, people often fail in their attempts. This is because making a change requires not only knowledge about proper nutrition but also self-regulation skills that allow a person to observe, evaluate, and take action, with constant motivation throughout the process. The objective of this study was to design and validate an instrument to evaluate the level of self-regulation of eating habits in college students. Sixty-two items were written and evaluated by four expert judges. Two applications of the instrument were carried out with 487 subjects. An unweighted least squares factor analysis with direct Oblimin rotation was performed. Items that loaded on more than one factor were discarded, as were those with a factor loading less than 0.40 or a communality less than 0.30. The resulting instrument comprised 14 items grouped into three factors, which explained 46.9% of the variance: self-reaction, self-observation and self-evaluation. Cronbach's alpha yielded a high reliability coefficient (α = 0.874). The results show that the scale is a valid and reliable tool to measure self-regulation of eating habits in college students. Its applications include the diagnosis of a population and the evaluation of interventions aimed at improving nutrition, based on the assumption that the processes of change require sustained self-regulation skills in people.

  16. Earthworms and tree roots: A model study of the effect of preferential flow paths on runoff generation and groundwater recharge in steep, saprolitic, tropical lowland catchments

    NASA Astrophysics Data System (ADS)

    Cheng, Yanyan; Ogden, Fred L.; Zhu, Jianting

    2017-07-01

    Preferential flow paths (PFPs) affect the hydrological response of humid tropical catchments but have not received sufficient attention. We consider PFPs created by tree roots and earthworms in a near-surface soil layer in steep, humid, tropical lowland catchments and hypothesize that observed hydrological behaviors can be better captured by reasonably considering PFPs in this layer. We test this hypothesis by evaluating the performance of four different physically based distributed model structures without and with PFPs in different configurations. Model structures are tested both quantitatively and qualitatively using hydrological, geophysical, and geochemical data both from the Smithsonian Tropical Research Institute Agua Salud Project experimental catchment(s) in Central Panama and other sources in the literature. The performance of different model structures is evaluated using runoff Volume Error and three Nash-Sutcliffe efficiency measures against observed total runoff, stormflows, and base flows along with visual comparison of simulated and observed hydrographs. Two of the four proposed model structures which include both lateral and vertical PFPs are plausible, but the one with explicit simulation of PFPs performs the best. A small number of vertical PFPs that fully extend below the root zone allow the model to reasonably simulate deep groundwater recharge, which plays a crucial role in base flow generation. Results also show that the shallow lateral PFPs are the main contributor to the observed high flow characteristics. Their number and size distribution are found to be more important than the depth distribution. Our model results are corroborated by geochemical and geophysical observations.

  17. New method for evaluating high-quality fog protective coatings

    NASA Astrophysics Data System (ADS)

    Czeremuszkin, Grzegorz; Latreche, Mohamed; Mendoza-Suarez, Guillermo

    2011-05-01

    Fogging is commonly observed when humid-warm air contacts the cold surface of a transparent substrate, e.g. eyewear lenses, making the observed image blurred and hazy. To protect from fogging, the lens inner surfaces are protected with Anti-Fog coatings, which render them hydrophilic and induce water vapor condensation as a smooth, thin and invisible film, which uniformly flows down on the lens as the condensation progresses. Coatings differ in protection level, aging kinetics, and susceptibility to contamination. Some perform acceptably in limited conditions, beyond which the condensing water film becomes unstable, nonuniform, and scatters light or shows refractory distortions, both affecting the observed image. Quantifying the performance of Anti-Fog coated lenses is difficult: they may not show classical fogging and the existing testing methods, based on fog detection, are therefore inapplicable. The presented method for evaluating and quantifying AF properties is based on characterizing light scattering on lenses exposed to controlled humidity and temperature. Changes in intensity of laser light scattered at low angles (1, 2, 4 and 8 degrees), observed during condensation of water on lenses, provide information on the swelling of Anti-Fog coatings, formation of uniform water film, going from an unstable to a steady state, and on the coalescence of discontinuous films. Real time observations/measurements allow for better understanding of factors controlling fogging and fog preventing phenomena. The method is especially useful in the development of new coatings for military-, sport-, and industrial protective eyewear as well as for medical and automotive applications. It allows for differentiating between coatings showing acceptable, good, and excellent performance.

  18. Data Container Study for Handling Array-based Data Using Rasdaman, Hive, Spark, and MongoDB

    NASA Astrophysics Data System (ADS)

    Xu, M.; Hu, F.; Yu, M.; Scheele, C.; Liu, K.; Huang, Q.; Yang, C. P.; Little, M. M.

    2016-12-01

    Geoscience communities have come up with various big data storage solutions, such as Rasdaman and Hive, to address the grand challenges for massive Earth observation data management and processing. To examine the readiness of current solutions in supporting big Earth observation, we propose to investigate and compare four popular data container solutions: Rasdaman, Hive, Spark, and MongoDB. Using different types of spatial and non-spatial queries, datasets stored in common scientific data formats (e.g., NetCDF and HDF), and two applications (i.e. dust storm simulation data mining and MERRA data analytics), we systematically compare and evaluate the features and performance of these four data containers in terms of data discovery and access. The computing resources (e.g. CPU, memory, hard drive, network) consumed while performing various queries and operations are monitored and recorded for the performance evaluation. The initial results show that 1) Rasdaman has the best performance for queries on statistical and operational functions, and supports the NetCDF data format better than HDF; 2) Rasdaman clustering configuration is more complex than the others; 3) Hive performs better on single pixel extraction from multiple images; and 4) except for the single pixel extractions, Spark performs better than Hive and its performance is close to Rasdaman. A comprehensive report will detail the experimental results, and compare their pros and cons regarding system performance, ease of use, accessibility, scalability, compatibility, and flexibility.

  19. NACP Synthesis: Evaluating modeled carbon state and flux variables against multiple observational constraints (Invited)

    NASA Astrophysics Data System (ADS)

    Thornton, P. E.; Nacp Site Synthesis Participants

    2010-12-01

    The North American Carbon Program (NACP) synthesis effort includes an extensive intercomparison of modeled and observed ecosystem states and fluxes performed with multiple models across multiple sites. The participating models span a range of complexity and intended application, while the participating sites cover a broad range of natural and managed ecosystems in North America, from the subtropics to arctic tundra, and coastal to interior climates. A unique characteristic of this collaborative effort is that multiple independent observations are available at all sites: fluxes are measured with the eddy covariance technique, and standard biometric and field sampling methods provide estimates of standing stock and annual production in multiple categories. In addition, multiple modeling approaches are employed to make predictions at each site, varying, for example, in the use of diagnostic vs. prognostic leaf area index. Given multiple independent observational constraints and multiple classes of model, we evaluate the internal consistency of observations at each site, and use this information to extend previously derived estimates of uncertainty in the flux observations. Model results are then compared with all available observations and models are ranked according to their consistency with each type of observation (high frequency flux measurement, carbon stock, annual production). We demonstrate a range of internal consistency across the sites, and show that some models which perform well against one observational metric perform poorly against others. We use this analysis to construct a hypothesis for combining eddy covariance, biometrics, and other standard physiological and ecological measurements which, as data collection proceeded over several years, would present an increasingly challenging target for next generation models.

  20. Subjective evaluation of next-generation video compression algorithms: a case study

    NASA Astrophysics Data System (ADS)

    De Simone, Francesca; Goldmann, Lutz; Lee, Jong-Seok; Ebrahimi, Touradj; Baroncini, Vittorio

    2010-08-01

    This paper describes the details and the results of the subjective quality evaluation performed at EPFL, as a contribution to the effort of the Joint Collaborative Team on Video Coding (JCT-VC) for the definition of the next-generation video coding standard. The performance of 27 coding technologies has been evaluated with respect to two H.264/MPEG-4 AVC anchors, considering high definition (HD) test material. The test campaign involved a total of 494 naive observers and took place over a period of four weeks. While similar tests have been conducted as part of the standardization process of previous video coding technologies, the test campaign described in this paper is by far the most extensive in the history of video coding standardization. The obtained subjective quality scores show high consistency and support an accurate comparison of the performance of the different coding solutions.

  1. Sliding Mode Control of Real-Time PNU Vehicle Driving Simulator and Its Performance Evaluation

    NASA Astrophysics Data System (ADS)

    Lee, Min Cheol; Park, Min Kyu; Yoo, Wan Suk; Son, Kwon; Han, Myung Chul

    This paper introduces an economical and effective full-scale driving simulator for the study of human sensibility and the development of new vehicle parts and their control. Real-time robust control that accurately reproduces various vehicle motions can be a difficult task because the motion platform is a complex nonlinear system. This study proposes a sliding mode controller with a perturbation compensator using an observer-based fuzzy adaptive network (FAN). The control algorithm is designed to solve the chattering problem of sliding mode control and to select adequate fuzzy parameters for the perturbation compensator. To evaluate the trajectory control performance of the proposed approach, tracking control of the developed simulator, named PNUVDS, is carried out experimentally. The driving performance of the simulator is then evaluated using the perception and sensibility of several drivers in various driving conditions.

  2. Efficient Comparison between Windows and Linux Platform Applicable in a Virtual Architectural Walkthrough Application

    NASA Astrophysics Data System (ADS)

    Thubaasini, P.; Rusnida, R.; Rohani, S. M.

    This paper describes Linux, an open source platform used to develop and run a virtual architectural walkthrough application. It proposes some qualitative reflections and observations on the nature of Linux in the context of Virtual Reality (VR) and on the most popular and important claims associated with the open source approach. The ultimate goal of this paper is to measure and evaluate the performance of Linux used to build the virtual architectural walkthrough and to develop a proof of concept based on the results obtained through this project. In addition, this study reveals the benefits of using Linux in the field of virtual reality and presents a basic comparison and evaluation of Windows- and Linux-based operating systems. The Windows platform is used as a baseline to evaluate the performance of Linux. The performance of Linux is measured based on three main criteria: frame rate, image quality and mouse motion.

  3. Performance of vegetation indices from Landsat time series in deforestation monitoring

    NASA Astrophysics Data System (ADS)

    Schultz, Michael; Clevers, Jan G. P. W.; Carter, Sarah; Verbesselt, Jan; Avitabile, Valerio; Quang, Hien Vu; Herold, Martin

    2016-10-01

    The performance of Landsat time series (LTS) of eight vegetation indices (VIs) was assessed for monitoring deforestation across the tropics. Three sites were selected based on differing remote sensing observation frequencies, deforestation drivers and environmental factors. The LTS of each VI was analysed using the Breaks For Additive Season and Trend (BFAST) Monitor method to identify deforestation. A robust reference database was used to evaluate the performance regarding spatial accuracy, sensitivity to observation frequency and combined use of multiple VIs. The canopy cover sensitive Normalized Difference Fraction Index (NDFI) was the most accurate. Among those tested, wetness related VIs (Normalized Difference Moisture Index (NDMI) and the Tasselled Cap wetness (TCw)) were spatially more accurate than greenness related VIs (Normalized Difference Vegetation Index (NDVI) and Tasselled Cap greenness (TCg)). When VIs were fused on feature level, spatial accuracy was improved and overestimation of change reduced. NDVI and NDFI produced the most robust results when observation frequency varies.
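
    Two of the indices named in this abstract share the usual normalized-difference form: NDVI = (NIR - Red) / (NIR + Red) and NDMI = (NIR - SWIR) / (NIR + SWIR). A minimal per-pixel sketch follows; the function names and reflectance values are hypothetical, and the abstract does not state which Landsat bands or preprocessing the authors used.

```python
def normalized_difference(a, b):
    """Return (a - b) / (a + b), or NaN where the denominator is zero."""
    denominator = a + b
    if denominator == 0:
        return float("nan")
    return (a - b) / denominator


def ndvi(nir, red):
    """Normalized Difference Vegetation Index from NIR and red reflectance."""
    return normalized_difference(nir, red)


def ndmi(nir, swir):
    """Normalized Difference Moisture Index from NIR and shortwave-infrared reflectance."""
    return normalized_difference(nir, swir)


# Hypothetical surface reflectance values for one pixel.
vegetation_index = ndvi(nir=0.5, red=0.1)
moisture_index = ndmi(nir=0.4, swir=0.2)
```

    Both indices fall between -1 and 1 for non-negative reflectance, so a time series of either can be screened for abrupt drops of the kind a break-detection method looks for.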

  4. An observational analysis of surgical team compliance with perioperative safety practices after crew resource management training.

    PubMed

    France, Daniel J; Leming-Lee, Susie; Jackson, Tom; Feistritzer, Nancye R; Higgins, Michael S

    2008-04-01

    Acknowledging the need to improve team communication and coordination among health care providers, health care administrators and improvement officers have been quick to endorse and invest in aviation crew resource management (CRM). Despite the increased interest in CRM, there exist limited data on the effectiveness of CRM in changing team behavior and performance in clinical settings. Direct observational analyses were performed on 30 surgical teams (15 neurosurgery cases and 15 cardiac cases) to evaluate surgical team compliance with integrated safety and CRM practices after extensive CRM training. Observed surgical teams were compliant with only 60% of the CRM and perioperative safety practices emphasized in the training program. The results highlight many of the challenges the health care industry faces in its efforts to adapt CRM from aviation to medicine. Additional research is needed to develop and test new team training methods and performance feedback mechanisms for clinical teams.

  5. Computer controlled performance mapping of thermionic converters: effect of collector, guard-ring potential imbalances on the observed collector current-density, voltage characteristics and limited range performance map of an etched-rhenium, niobium planar converter

    NASA Technical Reports Server (NTRS)

    Manista, E. J.

    1972-01-01

    The effect of collector, guard-ring potential imbalance on the observed collector-current-density J, collector-to-emitter voltage V characteristic was evaluated in a planar, fixed-space, guard-ringed thermionic converter. The J,V characteristic was swept in a period of 15 msec by a variable load. A computerized data acquisition system recorded test parameters. The results indicate minimal distortion of the J,V curve in the power output quadrant for the nominal guard-ring circuit configuration. Considerable distortion, along with a lowering of the ignited-mode striking voltage, was observed for the configuration with the emitter shorted to the guard ring. A limited-range performance map of an etched-rhenium, niobium, planar converter was obtained by using an improved computer program for the data acquisition system.

  6. Ames life science telescience testbed evaluation

    NASA Technical Reports Server (NTRS)

    Haines, Richard F.; Johnson, Vicki; Vogelsong, Kristofer H.; Froloff, Walt

    1989-01-01

    Eight surrogate spaceflight mission specialists participated in a real-time evaluation of remote coaching using the Ames Life Science Telescience Testbed facility. This facility consisted of three remotely located nodes: (1) a prototype Space Station glovebox; (2) a ground control station; and (3) a principal investigator's (PI) work area. The major objective of this project was to evaluate the effectiveness of telescience techniques and hardware to support three realistic remote coaching science procedures: plant seed germinator charging, plant sample acquisition and preservation, and remote plant observation with ground coaching. Each scenario was performed by a subject acting as flight mission specialist, interacting with a payload operations manager and a principal investigator expert. All three groups were physically isolated from each other yet linked by duplex audio and color video communication channels and networked computer workstations. Workload ratings were made by the flight and ground crewpersons immediately after completing their assigned tasks. Time to complete each scientific procedural step was recorded automatically. Two expert observers also made performance ratings and various error assessments. The results are presented and discussed.

  7. Design and evaluation of a trilateral shared-control architecture for teleoperated training robots.

    PubMed

    Shamaei, Kamran; Kim, Lawrence H; Okamura, Allison M

    2015-08-01

    Multilateral teleoperated robots can be used to train humans to perform complex tasks that require collaborative interaction and expert supervision, such as laparoscopic surgical procedures. In this paper, we explain the design and performance evaluation of a shared-control architecture that can be used in trilateral teleoperated training robots. The architecture includes dominance and observation factors inspired by the determinants of motor learning in humans, including observational practice, focus of attention, feedback and augmented feedback, and self-controlled practice. Toward the validation of such an architecture, we (1) verify the stability of a trilateral system by applying Llewellyn's criterion on a two-port equivalent architecture, and (2) demonstrate that system transparency remains generally invariant across relevant observation factors and movement frequencies. In a preliminary experimental study, a dyad of two human users (one novice, one expert) collaborated on the control of a robot to follow a trajectory. The experiment showed that the framework can be used to modulate the efforts of the users and adjust the source and level of haptic feedback to the novice user.

  8. Performance evaluation of an agent-based occupancy simulation model

    DOE PAGES

    Luo, Xuan; Lam, Khee Poh; Chen, Yixing; ...

    2017-01-17

    Occupancy is an important factor driving building performance. Static and homogeneous occupant schedules, commonly used in building performance simulation, contribute to issues such as performance gaps between simulated and measured energy use in buildings. Stochastic occupancy models have been recently developed and applied to better represent spatial and temporal diversity of occupants in buildings. However, there is very limited evaluation of the usability and accuracy of these models. This study used measured occupancy data from a real office building to evaluate the performance of an agent-based occupancy simulation model: the Occupancy Simulator. The occupancy patterns of various occupant types were first derived from the measured occupant schedule data using statistical analysis. Then the performance of the simulation model was evaluated and verified based on (1) whether the distribution of observed occupancy behavior patterns follows the theoretical ones included in the Occupancy Simulator, and (2) whether the simulator can reproduce a variety of occupancy patterns accurately. Results demonstrated the feasibility of applying the Occupancy Simulator to simulate a range of occupancy presence and movement behaviors for regular types of occupants in office buildings, and to generate stochastic occupant schedules at the room and individual occupant levels for building performance simulation. For future work, model validation is recommended, which includes collecting and using detailed interval occupancy data of all spaces in an office building to validate the simulated occupant schedules from the Occupancy Simulator.

  9. Performance evaluation of an agent-based occupancy simulation model

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Luo, Xuan; Lam, Khee Poh; Chen, Yixing

    Occupancy is an important factor driving building performance. Static and homogeneous occupant schedules, commonly used in building performance simulation, contribute to issues such as performance gaps between simulated and measured energy use in buildings. Stochastic occupancy models have been recently developed and applied to better represent spatial and temporal diversity of occupants in buildings. However, there is very limited evaluation of the usability and accuracy of these models. This study used measured occupancy data from a real office building to evaluate the performance of an agent-based occupancy simulation model: the Occupancy Simulator. The occupancy patterns of various occupant types were first derived from the measured occupant schedule data using statistical analysis. Then the performance of the simulation model was evaluated and verified based on (1) whether the distribution of observed occupancy behavior patterns follows the theoretical ones included in the Occupancy Simulator, and (2) whether the simulator can reproduce a variety of occupancy patterns accurately. Results demonstrated the feasibility of applying the Occupancy Simulator to simulate a range of occupancy presence and movement behaviors for regular types of occupants in office buildings, and to generate stochastic occupant schedules at the room and individual occupant levels for building performance simulation. For future work, model validation is recommended, which includes collecting and using detailed interval occupancy data of all spaces in an office building to validate the simulated occupant schedules from the Occupancy Simulator.

  10. Bringing the skills laboratory home: an affordable webcam-based personal trainer for developing laparoscopic skills.

    PubMed

    Kobayashi, Sow Alfred; Jamshidi, Ramin; O'Sullivan, Patricia; Palmer, Barnard; Hirose, Shinjiro; Stewart, Lygia; Kim, Edward Hyung

    2011-01-01

    The purpose of this work was to develop a more flexible system of laparoscopic surgery training with demonstrated effectiveness and construct validity. A personal, portable, durable laparoscopic trainer can be designed at low cost. The evaluation of expert surgeons on this device will reveal technical superiority over novices. With practice, novice surgeons can improve their performance significantly as measured by scores derived from performing skills with this training device. Prospective trial with observation and intervention components. The first aspect was observational comparison of novice and expert performance. The second was a prospective static-group comparison with pretest/posttest single-sample design. Tertiary-care academic medical center with affiliated general surgery residency. A total of 21 junior surgical residents and 5 experienced operators. Performance was assessed by the 5 tasks in the McGill Inanimate System for Training and Evaluation of Laparoscopic Skills (MISTELS): pegboard transfer, pattern cutting, placement of ligating loop, extracorporeal knotting, and intracorporeal knotting. Each task was assessed for accuracy and speed. Expert surgeons scored significantly higher than novices on total score and 4 of the 5 MISTELS tasks (peg transfer, pattern cut, extracorporeal knot, and intracorporeal knot). After 4 months of home-based training, the novices improved in total score and 3 of the 5 tasks (peg transfer, pattern cut, and extracorporeal knot). A low-cost personal laparoscopic training device can be built by individual residents. With their use, residents can significantly improve performance in important surgical skills. Evaluation of the system supports its validity. Copyright © 2011 Association of Program Directors in Surgery. Published by Elsevier Inc. All rights reserved.

  11. Measuring the development of insight by dental health professionals in training using workplace-based assessment.

    PubMed

    Prescott-Clements, L E; van der Vleuten, C P M; Schuwirth, L; Gibb, E; Hurst, Y; Rennie, J S

    2011-08-01

    For health professionals, the development of insight into their performance is vital for safe practice, professional development and self-regulation. This study investigates whether the development of dental trainees' insight, when provided with external feedback on performance, can be assessed using a single criterion on a simple global ratings form such as the Longitudinal Evaluation of Performance or Mini Clinical Evaluation Exercise. Postgraduate dental trainees (N = 139) were assessed using this tool on a weekly basis for 6 months. Regression analysis of the data was carried out using SPSS, and a short trainer questionnaire was implemented to investigate feasibility. Ratings for insight were shown to increase with time in a similar manner to the growth observed in other essential skills. The gradient of the slope for growth of insight was slightly less than that of the other observed skills. Trainers were mostly positive about the new criterion assessing trainees' insight, although the importance of training for trainers in this process was highlighted. Our data suggest that practitioners' insight into their performance can be developed with experience and regular feedback. However, this is most likely a complex skill dependent on a number of intrinsic and external factors. The development of trainees' insight into their performance can be assessed using a single criterion on a simple global ratings form. The process involves no additional burden on evaluators in terms of their time or cost, and promotes best practice in the provision of feedback for trainees. © 2011 John Wiley & Sons A/S.

  12. Evaluation in Appalachian pasture systems of the 1996 (update 2000) National Research Council model for weaning cattle.

    PubMed

    Whetsell, M S; Rayburn, E B; Osborne, P I

    2006-05-01

    This study was conducted to evaluate the accuracy of the National Research Council's (2000) Nutrient Requirements of Beef Cattle computer model when used to predict calf performance during on-farm pasture or dry-lot weaning and backgrounding. Calf performance was measured on 22 farms in 2002 and 8 farms in 2003 that participated in West Virginia Beef Quality Assurance Sale marketing pools. Calves were weaned on pasture (25 farms) or dry-lot (5 farms) and fed supplemental hay, haylage, ground shell corn, soybean hulls, or a commercial concentrate. Concentrates were fed at a rate of 0.0 to 1.5% of BW. The National Research Council (2000) model was used to predict ADG of each group of calves observed on each farm. The model error was measured by calculating residuals (predicted ADG minus observed ADG). Predicted animal performance was determined using level 1 of the model. Results show that, when using normal on-farm pasture sampling and forage analysis methods, the model error for ADG is high and the model did not accurately predict the performance of steers or heifers fed high-forage pasture-based diets; the predicted ADG was lower (P < 0.05) than the observed ADG. The estimated intake of low-producing animals was similar to the expected DMI, but for the greater-producing animals it was not. The NRC (2000) beef model may more accurately predict on-farm animal performance in pastured situations if feed analysis values reflect the energy value of the feed, account for selective grazing, and relate empty BW and shrunk BW to NDF.
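
    The residual definition used in this abstract (predicted ADG minus observed ADG) can be illustrated with a minimal sketch. The function name and the ADG values below are hypothetical and are not data from the study; a negative mean residual corresponds to the model underpredicting gain, as the abstract reports.

```python
from statistics import mean


def adg_residuals(predicted, observed):
    """Return residuals defined as predicted ADG minus observed ADG."""
    if len(predicted) != len(observed):
        raise ValueError("predicted and observed must have the same length")
    return [p - o for p, o in zip(predicted, observed)]


# Hypothetical ADG values (kg/day) for four calf groups; NOT data from the study.
predicted_adg = [0.60, 0.75, 0.50, 0.90]
observed_adg = [0.80, 0.85, 0.70, 1.00]

residuals = adg_residuals(predicted_adg, observed_adg)
mean_bias = mean(residuals)  # negative value: the model underpredicts on average
```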

  13. Global-scale regionalization of hydrological model parameters using streamflow data from many small catchments

    NASA Astrophysics Data System (ADS)

    Beck, Hylke; de Roo, Ad; van Dijk, Albert; McVicar, Tim; Miralles, Diego; Schellekens, Jaap; Bruijnzeel, Sampurno; de Jeu, Richard

    2015-04-01

    Motivated by the lack of large-scale model parameter regionalization studies, a large set of 3328 small catchments (< 10000 km²) around the globe was used to set up and evaluate five model parameterization schemes at global scale. The HBV-light model was chosen because of its parsimony and flexibility to test the schemes. The catchments were calibrated against observed streamflow (Q) using an objective function incorporating both behavioral and goodness-of-fit measures, after which the catchment set was split into subsets of 1215 donor and 2113 evaluation catchments based on the calibration performance. The donor catchments were subsequently used to derive parameter sets that were transferred to similar grid cells based on a similarity measure incorporating climatic and physiographic characteristics, thereby producing parameter maps with global coverage. Overall, there was a lack of suitable donor catchments for mountainous and tropical environments. The schemes with spatially-uniform parameter sets (EXP2 and EXP3) achieved the worst Q estimation performance in the evaluation catchments, emphasizing the importance of parameter regionalization. The direct transfer of calibrated parameter sets from donor catchments to similar grid cells (scheme EXP1) performed best, although there was still a large performance gap between EXP1 and HBV-light calibrated against observed Q. The schemes with parameter sets obtained by simultaneously calibrating clusters of similar donor catchments (NC10 and NC58) performed worse than EXP1. The relatively poor Q estimation performance achieved by two (uncalibrated) macro-scale hydrological models suggests there is considerable merit in regionalizing the parameters of such models. The global HBV-light parameter maps and ancillary data are freely available via http://water.jrc.ec.europa.eu.

  14. A comparative and experimental evaluation of performance of stocked diploid and triploid brook trout

    USGS Publications Warehouse

    Budy, Phaedra E.; Thiede, G.P.; Dean, A.; Olsen, D.; Rowley, G.

    2012-01-01

    Despite numerous negative impacts, nonnative trout are still being stocked to provide economically and socially valuable sport fisheries in western mountain lakes. We evaluated relative performance and potential differences in feeding strategy and competitive ability of triploid versus diploid brook trout Salvelinus fontinalis in alpine lakes, as well as behavioral and performance differences of diploid and triploid brook trout in two controlled experimental settings: behavioral experiments in the laboratory and performance evaluations in ponds. Across lakes, catch per unit effort (CPUE) and relative weight (Wr) were not significantly different between ploidy levels. Mean sizes were also similar between ploidy levels except in two of the larger lakes where diploids attained slightly larger sizes (approximately 20 mm longer). We observed no significant differences between diploids and triploids in diet, diet preference, or trophic structure. Similarly, growth and condition did not differ between ploidy levels in smaller-scale pond experiments, and aggressive behavior did not differ between ploidy levels (fed or unfed fish trials) in the laboratory. Independent of ploidy level, the relative performance of brook trout varied widely among lakes, a pattern that appeared to be a function of lake size or a factor that covaries with lake size such as temperature regime or carrying capacity. In summary, we observed no significant differences in the relative performance of brook trout from either ploidy level across a number of indices, systems, and environmental conditions, nor any indication that one group is more aggressive or a superior competitor than the other. Collectively, these results suggest that triploid brook trout will offer a more risk-averse and promising management opportunity when they are stocked to these lakes and elsewhere to simultaneously meet the needs for the sport fishery and conservation objectives.

  15. Accuracy of remote burn scar evaluation via live video-conferencing technology.

    PubMed

    Cai, Lawrence Z; Caceres, Maria; Dangol, Mohan Krishna; Nakarmi, Kiran; Rai, Shankar Man; Chang, James; Gibran, Nicole S; Pham, Tam N

    2016-12-05

    Telemedicine in outpatient burn care, particularly in burn scar management, may provide cost-effective care and comes highly rated by patients. However, an effective scar scale using both video and photographic elements has not been validated. The purpose of this study is to test the reliability of the Patient and Observer Scar Assessment Scale (POSAS) using live video-conferencing. A prospective study was conducted with individuals with healed burn scars in Kathmandu, Nepal. Three independent observers assessed 85 burn scars from 17 subjects, using the Observer portion to evaluate vascularity, pigmentation, thickness, relief, pliability, surface area, and overall opinion. The on-site observer was physically present with the subjects and used a live videoconferencing application to show the scars to two remote observers in the United States. Subjects used the Patient portion to evaluate the scar that they believed had the worst appearance and the greatest impact on function. The single-rater reliability of the Observer scale was acceptable (ICC>0.70) in overall opinion, thickness, pliability, and surface area. The average-rater reliability for three observers was acceptable (ICC>0.70) for all parameters except for vascularity. When comparing Patients' and Observers' overall opinion scores, patients consistently reported worse opinion. Evaluation of burn scars using the Patient and Observer Scar Assessment Scale can be accurately performed via live videoconferencing and presents an opportunity to expand access to burn care to rural communities, particularly in low- and middle-income countries, where patients face significant access barriers to appropriate follow-up care. Copyright © 2016 Elsevier Ltd and ISBI. All rights reserved.

  16. Nurses' evaluation of physicians' non-clinical performance in emergency departments: advantages, disadvantages and lessons learned.

    PubMed

    Alameddine, Mohamad; Mufarrij, Afif; Saliba, Miriam; Mourad, Yara; Jabbour, Rima; Hitti, Eveline

    2015-02-27

    Peer evaluation is increasingly used as a method to assess physicians' interpersonal and communication skills. We report on experience with soliciting registered nurses' feedback on physicians' non-clinical performance in the ED of a large academic medical center in Lebanon. We utilized a secondary analysis of a de-identified database of ED nurses' assessment of physicians' non-clinical performance coupled with an evaluation of interventions carried out as a result of this evaluation. The database was compiled as part of quality/performance improvement initiatives using a cross-sectional design to survey registered nurses working at the ED. The survey instrument included open ended and closed ended questions assessing physicians' communication, professionalism and leadership skills. Three episodes of evaluation were carried out over an 18 month period. Physicians were provided with a communication training carried out after the first cycle of evaluation and a detailed feedback on their assessment by nurses after each evaluation cycle. A paired t-test was carried out to compare mean evaluation scores between the three cycles of evaluation. Thematic analysis of nurses' qualitative comments was carried out. A statistically significant increase in the averages of skills was observed between the first and second evaluations, followed by a significant decrease in the averages of the three skills between the second and third evaluations. Personalized feedback to ED physicians and communication training initially contributed to a significant positive impact on improving ED physicians' non-clinical skills as perceived by the ED nurses. Yet, gains achieved were lost upon reaching the third cycle of evaluation. However, the thematic analysis of the nurses' qualitative responses portrays a decrease in concerns across the various dimensions of non-clinical performance. 
    Nurses' evaluation of the non-clinical performance of physicians has the potential of improving communication, professionalism and leadership skills amongst physicians. For improvement to be realized in a sustainable manner, such programs may need to be offered in a staged and incremental manner over a long period of time with proper dedication of resources and timely monitoring and evaluation of outcomes. Department directors need to be trained on providing peer evaluation feedback in a constructive manner.

  17. Statistical properties of a utility measure of observer performance compared to area under the ROC curve

    NASA Astrophysics Data System (ADS)

    Abbey, Craig K.; Samuelson, Frank W.; Gallas, Brandon D.; Boone, John M.; Niklason, Loren T.

    2013-03-01

    The receiver operating characteristic (ROC) curve has become a common tool for evaluating diagnostic imaging technologies, and the primary endpoint of such evaluations is the area under the curve (AUC), which integrates sensitivity over the entire false positive range. An alternative figure of merit for ROC studies is expected utility (EU), which focuses on the relevant region of the ROC curve as defined by disease prevalence and the relative utility of the task. However, if this measure is to be used, it must also have desirable statistical properties to keep the burden of observer performance studies as low as possible. Here, we evaluate effect size and variability for EU and AUC. We use two observer performance studies recently submitted to the FDA to compare the EU and AUC endpoints. The studies were conducted using the multi-reader multi-case methodology in which all readers score all cases in all modalities. ROC curves from the study were used to generate both the AUC and EU values for each reader and modality. The EU measure was computed assuming an iso-utility slope of 1.03. We find mean effect sizes, the reader averaged difference between modalities, to be roughly 2.0 times as big for EU as AUC. The standard deviation across readers is roughly 1.4 times as large, suggesting better statistical properties for the EU endpoint. In a simple power analysis of paired comparison across readers, the utility measure required 36% fewer readers on average to achieve 80% statistical power compared to AUC.
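
    As a rough illustration of the two endpoints compared in this abstract, the sketch below computes a trapezoidal AUC and an expected-utility value from a set of ROC operating points. The abstract does not state the EU formula; the sketch assumes one common formulation in which EU is the maximum of TPF - beta * FPF along the ROC curve, with beta the iso-utility slope (1.03 in the abstract). The function names and operating points are hypothetical and are not the study's data.

```python
def auc_trapezoid(fpf, tpf):
    """Area under an empirical ROC curve by the trapezoidal rule.

    fpf and tpf are matched lists of operating points sorted by increasing FPF.
    """
    area = 0.0
    for i in range(1, len(fpf)):
        area += (fpf[i] - fpf[i - 1]) * (tpf[i] + tpf[i - 1]) / 2.0
    return area


def expected_utility(fpf, tpf, beta):
    """ASSUMED definition: maximum of TPF - beta * FPF over the operating points.

    For a piecewise-linear ROC curve this maximum is attained at a vertex,
    so checking the listed points is sufficient.
    """
    return max(t - beta * f for f, t in zip(fpf, tpf))


# Hypothetical operating points for one reader and one modality.
fpf = [0.0, 0.1, 0.3, 0.6, 1.0]
tpf = [0.0, 0.5, 0.8, 0.95, 1.0]

auc = auc_trapezoid(fpf, tpf)
eu = expected_utility(fpf, tpf, beta=1.03)
```

    With a slope near 1, the EU value is driven by operating points at low to moderate false positive fractions, whereas AUC weights the entire false positive range equally.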

  18. Practical implementation of Channelized Hotelling Observers: Effect of ROI size

    PubMed Central

    Yu, Lifeng; Leng, Shuai; McCollough, Cynthia H.

    2017-01-01

    Fundamental to the development and application of channelized Hotelling observer (CHO) models is the selection of the region of interest (ROI) to evaluate. For assessment of medical imaging systems, reducing the ROI size can be advantageous. Smaller ROIs enable a greater concentration of interrogable objects in a single phantom image, thereby providing more information from a set of images and reducing the overall image acquisition burden. Additionally, smaller ROIs may promote better assessment of clinical patient images as different patient anatomies present different ROI constraints. To this end, we investigated the minimum ROI size that does not compromise the performance of the CHO model. In this study, we evaluated both simulated images and phantom CT images to identify the minimum ROI size that resulted in an accurate figure of merit (FOM) of the CHO’s performance. More specifically, the minimum ROI size was evaluated as a function of the following: number of channels, spatial frequency and number of rotations of the Gabor filters, size and contrast of the object, and magnitude of the image noise. Results demonstrate that a minimum ROI size exists below which the CHO’s performance is grossly inaccurate. The minimum ROI size is shown to increase with number of channels and be dictated by truncation of lower frequency filters. We developed a model to estimate the minimum ROI size as a parameterized function of the number of orientations and spatial frequencies of the Gabor filters, providing a guide for investigators to appropriately select parameters for model observer studies. PMID:28943699

  19. Low power arcjet performance

    NASA Technical Reports Server (NTRS)

    Curran, Francis M.; Sarmiento, Charles J.

    1990-01-01

    An experimental investigation was performed to evaluate arc jet operation at low power. A standard, 1 kW, constricted arc jet was run using nozzles with three different constrictor diameters. Each nozzle was run over a range of current and mass flow rates to explore stability and performance in the low power engine. A standard pulse-width modulated power processor was modified to accommodate the high operating voltages required under certain conditions. Stable, reliable operation at power levels below 0.5 kW was obtained at efficiencies between 30 and 40 percent. The operating range was found to be somewhat dependent on constrictor geometry at low mass flow rates. Quasi-periodic voltage fluctuations were observed at the low power end of the operating envelope. The nozzle insert geometry was found to have little effect on the performance of the device. The observed performance levels show that specific impulse levels above 350 seconds can be obtained at the 0.5 kW power level.

  20. Low power arcjet performance

    NASA Technical Reports Server (NTRS)

    Curran, Francis M.; Sarmiento, Charles J.

    1990-01-01

    An experimental investigation was performed to evaluate arcjet operation at low power. A standard, 1 kW, constricted arcjet was run using nozzles with three different constrictor diameters. Each nozzle was run over a range of current and mass flow rates to explore stability and performance in the low power regime. A standard pulse-width modulated power processor was modified to accommodate the high operating voltages required under certain conditions. Stable, reliable operation at power levels below 0.5 kW was obtained at efficiencies between 30 and 40 percent. The operating range was found to be somewhat dependent on constrictor geometry at low mass flow rates. Quasi-periodic voltage fluctuations were observed at the low power end of the operating envelope. The nozzle insert geometry was found to have little effect on the performance of the device. The observed performance levels show that specific impulse levels above 350 seconds can be obtained at the 0.5 kW power level.

  1. Triage of the abnormal Papanicolaou smear in pregnancy.

    PubMed

    Apgar, B S; Zoschnick, L B

    1998-06-01

    Triage of the abnormal Papanicolaou smear in pregnancy requires colposcopic evaluation and directed biopsy. If histologic cervical intraepithelial neoplasia is confirmed, the patient can be managed with observation and can be re-evaluated in the postpartum period. If evidence of microinvasion is present, conization must be performed. For patients with invasive disease, a delay in therapy until fetal maturity is achieved does not compromise survival.

  2. Do health economic evaluations using observational data provide reliable assessment of treatment effects?

    PubMed Central

    2013-01-01

    Economic evaluation in modern health care systems is seen as a transparent scientific framework that can be used to advance progress towards improvements in population health at the best possible value. Despite the perceived superiority that trial-based studies have in terms of internal validity, economic evaluations often employ observational data. In this review, the interface between econometrics and economic evaluation is explored, with emphasis placed on highlighting methodological issues relating to the evaluation of cost-effectiveness within a bivariate framework. Studies that satisfied the eligibility criteria exemplified the use of matching, regression analysis, propensity scores, instrumental variables, as well as difference-in-differences approaches. All studies were reviewed and critically appraised using a structured template. The findings suggest that although state-of-the-art econometric methods have the potential to provide evidence on the causal effects of clinical and policy interventions, their application in economic evaluation is subject to a number of limitations. These range from no credible assessment of key assumptions and scarce evidence regarding the relative performance of different methods, to lack of reporting of important study elements, such as a summary outcome measure and its associated sampling uncertainty. Further research is required to better understand the ways in which observational data should be analysed in the context of the economic evaluation framework. PMID:24229445

  3. Evaluation of two disinfection/sterilization methods on silicon rubber-based composite finishing instruments.

    PubMed

    Lacerda, Vánia A; Pereira, Leandro O; Hirata JUNIOR, Raphael; Perez, Cesar R

    2015-12-01

    To evaluate the effectiveness of disinfection/sterilization methods and their effects on polishing capacity, micromorphology, and composition of two different composite finishing and polishing instruments. Two brands of finishing and polishing instruments (Jiffy and Optimize) were analyzed. For the antimicrobial test, 60 points (30 of each brand) were used for polishing composite restorations and submitted to three different groups of disinfection/sterilization methods: none (control), autoclaving, and immersion in peracetic acid for 60 minutes. The in vitro tests were performed to evaluate the polishing performance on resin composite disks (Amelogen) using a 3D scanner (Talyscan) and to evaluate the effects on the points' surface composition (XRF) and micromorphology (MEV) after completing a polishing and sterilizing routine five times. Both sterilization/disinfection methods were efficient against oral cultivable organisms and no deleterious modification was observed to point surface.

  4. NASA trend analysis procedures

    NASA Technical Reports Server (NTRS)

    1993-01-01

    This publication is primarily intended for use by NASA personnel engaged in managing or implementing trend analysis programs. 'Trend analysis' refers to the observation of current activity in the context of the past in order to infer the expected level of future activity. NASA trend analysis was divided into 5 categories: problem, performance, supportability, programmatic, and reliability. Problem trend analysis uncovers multiple occurrences of historical hardware or software problems or failures in order to focus future corrective action. Performance trend analysis observes changing levels of real-time or historical flight vehicle performance parameters such as temperatures, pressures, and flow rates as compared to specification or 'safe' limits. Supportability trend analysis assesses the adequacy of the spaceflight logistics system; example indicators are repair-turn-around time and parts stockage levels. Programmatic trend analysis uses quantitative indicators to evaluate the 'health' of NASA programs of all types. Finally, reliability trend analysis attempts to evaluate the growth of system reliability based on a decreasing rate of occurrence of hardware problems over time. Procedures for conducting all five types of trend analysis are provided in this publication, prepared through the joint efforts of the NASA Trend Analysis Working Group.

  5. Effect of shoe type on descending a curb.

    PubMed

    George, Juff; Heller, Michelle; Kuzel, Michael

    2012-01-01

    The aim of this study was to evaluate the effect of shoe type on the performance of women during curb descent. Performance during curb stepping may be explained by biomechanical research that has evaluated the kinematics of overground walking and stair ascent and descent. Studies have reported that women exhibit performance differences when wearing high heels, flip flops and sneakers during overground walking and stair ascent and descent. Thus, in addition to features of the curb, the type of shoe being worn may also affect performance. Although several studies have investigated curb stepping, no known studies have investigated the effects of different types of footwear on curb descent performance. This research was conducted in a real-world environment where participants wore three different types of shoes and performed a series of activities that involved curb stepping. The subjects were videotaped while descending a curb, allowing for observation of changes in gait parameters. Results of this study indicate that wearing high heels leads to performance differences as compared to wearing flip flops or sneakers.

  6. Evaluation of rainfall simulations over West Africa in dynamically downscaled CMIP5 global circulation models

    NASA Astrophysics Data System (ADS)

    Akinsanola, A. A.; Ajayi, V. O.; Adejare, A. T.; Adeyeri, O. E.; Gbode, I. E.; Ogunjobi, K. O.; Nikulin, G.; Abolude, A. T.

    2018-04-01

    This study presents an evaluation of the ability of the Rossby Centre Regional Climate Model (RCA4), driven by nine global circulation models (GCMs), to skilfully reproduce the key features of rainfall climatology over West Africa for the period 1980-2005. The seasonal climatology and annual cycle of the RCA4 simulations were assessed over three homogeneous subregions of West Africa (Guinea coast, Savannah, and Sahel) and evaluated using observed precipitation data from the Global Precipitation Climatology Project (GPCP). Furthermore, the model output was evaluated using a wide range of statistical measures. The interseasonal and interannual variability of the RCA4 were further assessed over the subregions and the whole of the West Africa domain. Results indicate that the RCA4 captures the spatial and interseasonal rainfall pattern adequately but exhibits a weak performance over the Guinea coast. Findings from the interannual rainfall variability indicate that the model performance is better over the larger West Africa domain than the subregions. The largest difference across the RCA4 simulated annual rainfall was found in the Sahel. Results from the Mann-Kendall test showed no significant trend for the 1980-2005 period in annual rainfall either in GPCP observation data or in the model simulations over West Africa. In many aspects, the RCA4 simulation driven by HadGEM2-ES performs best over the region. The use of the multimodel ensemble mean resulted in an improved representation of rainfall characteristics over the study domain.
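    The abstract above relies on the Mann-Kendall test to check for a trend in annual rainfall. The following is a minimal sketch of that test, not the authors' implementation: the function name `mann_kendall` is hypothetical, the variance formula assumes no tied values, and the p-value uses the usual normal approximation with a continuity correction.

```python
import math
from statistics import NormalDist


def mann_kendall(series):
    """Return (S, z, two-sided p) for the Mann-Kendall trend test.

    Assumes no tied values, so the variance carries no tie correction.
    """
    n = len(series)
    s = 0
    # S counts concordant minus discordant pairs over all i < j.
    for i in range(n - 1):
        for j in range(i + 1, n):
            diff = series[j] - series[i]
            s += (diff > 0) - (diff < 0)
    var_s = n * (n - 1) * (2 * n + 5) / 18.0
    # Continuity correction: shift S one unit towards zero.
    if s > 0:
        z = (s - 1) / math.sqrt(var_s)
    elif s < 0:
        z = (s + 1) / math.sqrt(var_s)
    else:
        z = 0.0
    p = 2.0 * (1.0 - NormalDist().cdf(abs(z)))
    return s, z, p
```

    Applied to a series of annual totals, a large p-value would correspond to the "no significant trend" outcome reported above; any input series used with this sketch is illustrative only.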

  7. Towards General Evaluation of Intelligent Systems: Lessons Learned from Reproducing AIQ Test Results

    NASA Astrophysics Data System (ADS)

    Vadinský, Ondřej

    2018-03-01

    This paper attempts to replicate the results of evaluating several artificial agents using the Algorithmic Intelligence Quotient test originally reported by Legg and Veness. Three experiments were conducted: one using default settings, one in which the action space was varied, and one in which the observation space was varied. While the performance of freq, Q0, Qλ, and HLQλ corresponded well with the original results, the resulting values differed when using MC-AIXI. Varying the observation space seems to have no qualitative impact on the results as reported, while (contrary to the original results) varying the action space seems to have some impact. An analysis of the impact of modifying parameters of MC-AIXI on its performance in the default settings was carried out with the help of data mining techniques used to identify highly performing configurations. Overall, the Algorithmic Intelligence Quotient test seems to be reliable; however, as a general artificial intelligence evaluation method it has several limits. The test is dependent on the chosen reference machine and also sensitive to changes to its settings. It brings out some differences among agents; however, since they are limited in size, the test setting may not yet be sufficiently complex. A demanding parameter sweep is needed to thoroughly evaluate configurable agents, which, together with the test format, further highlights the computational requirements of an agent. These and other issues are discussed in the paper along with proposals suggesting how to alleviate them. An implementation of some of the proposals is also demonstrated.

  8. Evaluation of results in aesthetic plastic surgery: preliminary observations on mammaplasty.

    PubMed

    Ferreira, M C

    2000-12-01

    Aesthetic plastic surgery has received wide public attention in the past few years. Expectations of patients regarding results have been exaggerated; the real place and medical importance of the procedures are still not clear because of a lack of more objective evidence. This study discusses the difficulties encountered related to the scientific evaluation of the aesthetic operations and proposes alternatives for assessment. A frequently performed procedure, reduction mammaplasty, is presented as an example, with its specific evaluation.

  9. Assessment of Work Performance (AWP)--development of an instrument.

    PubMed

    Sandqvist, Jan L; Törnquist, Kristina B; Henriksson, Chris M

    2006-01-01

    Adequate work assessments are a matter of importance both for individuals and society [5,29,31,38,40,46,52]. However, there is a lack of adequate and reliable instruments for use in work rehabilitation [14,15,20,21,31,44]. The purpose of this study was to develop and evaluate an observation instrument for assessing work performance, the AWP (Assessment of Work Performance). The purpose of the 14-item instrument is to assess the individual's observable working skills in three different areas: motor skills, process skills, and communication and interaction skills. This article describes the development and results of preliminary testing of the AWP. The testing indicates a satisfactory face validity and utility for the AWP and supports further research and testing of the instrument.

  10. Reproductive health services in Malawi: an evaluation of a quality improvement intervention.

    PubMed

    Rawlins, Barbara J; Kim, Young-Mi; Rozario, Aleisha M; Bazant, Eva; Rashidi, Tambudzai; Bandazi, Sheila N; Kachale, Fannie; Sanghvi, Harshad; Noh, Jin Won

    2013-01-01

    The aim of this study was to evaluate the impact of a quality improvement initiative in Malawi on reproductive health service quality and related outcomes. (1) Post-only quasi-experimental design comparing observed service quality at intervention and comparison health facilities, and (2) a time-series analysis of service statistics. Sixteen of Malawi's 23 district hospitals, half of which had implemented the Performance and Quality Improvement (PQI) intervention for reproductive health at the time of the study. A total of 98 reproductive health-care providers (mostly nurse-midwives) and 139 patients seeking family planning (FP), antenatal care (ANC), labour and delivery (L&D), or postnatal care (PNC) services. Health facility teams implemented a performance and quality improvement (PQI) intervention over a 3-year period. Following an external observational assessment of service quality at baseline, facility teams analysed performance gaps, designed and implemented interventions to address weaknesses, and conducted quarterly internal assessments to assess progress. Facilities qualified for national recognition by complying with at least 80% of reproductive health clinical standards during an external verification assessment. Key measures include facility readiness to provide quality care, observed health-care provider adherence to clinical performance standards during service delivery, and trends in service utilisation. Intervention facilities were more likely than comparison facilities to have the needed infrastructure, equipment, supplies, and systems in place to offer reproductive health services. Observed quality of care was significantly higher at intervention than comparison facilities for PNC and FP. Compared with other providers, those at intervention facilities scored significantly higher on client assessment and diagnosis in three service areas, on clinical management and procedures in two service areas, and on counselling in one service area. Service statistics suggest that the PQI intervention increased the number of Caesarean sections, but showed no impact on other indicators of service utilisation and skilled care. The PQI intervention showed a positive impact on the quality of reproductive health services. The effects of the intervention on service utilisation had likely not yet been fully realized, since none of the facilities had achieved national recognition before the evaluation. Staff turnover needs to be reduced to maximise the effectiveness of the intervention. The PQI intervention evaluated here offers an effective way to improve the quality of health services in low-resource settings and should continue to be scaled up in Malawi. Copyright © 2011 Elsevier Ltd. All rights reserved.

  11. [New trends in the evaluation of mathematics learning disabilities. The role of metacognition].

    PubMed

    Miranda-Casas, A; Acosta-Escareño, G; Tarraga-Minguez, R; Fernández, M I; Rosel-Remírez, J

    2005-01-15

    The current trends in the evaluation of mathematics learning disabilities (MLD), based on cognitive and empirical models, are oriented towards combining procedures involving the criteria and the evaluation of cognitive and metacognitive processes, associated to performance in mathematical tasks. The objective of this study is to analyse the metacognitive skills of prediction and evaluation in performing maths tasks and to compare metacognitive performance among pupils with MLD and younger pupils without MLD, who have the same level of mathematical performance. Likewise, we analyse these pupils' desire to learn. Subjects and methods. We compare a total of 44 pupils from the second cycle of primary education (8-10 years old) with and without mathematics learning disabilities. Significant differences are observed between pupils with and without mathematics learning disabilities in their capacity to predict and assess all of the tasks evaluated. As regards their 'desire to learn', no significant differences were found between pupils with and without MLD, which indicated that those with MLD assess their chances of successfully performing maths tasks in the same way as those without MLD. Finally, the findings reveal a similar metacognitive profile in pupils with MLD and the younger pupils with no mathematics learning disabilities. In future studies we consider it important to analyse the influence of the socio-affective belief system in the use of metacognitive skills.

  12. Optimization of medical imaging display systems: using the channelized Hotelling observer for detecting lung nodules: experimental study

    NASA Astrophysics Data System (ADS)

    Platisa, Ljiljana; Vansteenkiste, Ewout; Goossens, Bart; Marchessoux, Cédric; Kimpe, Tom; Philips, Wilfried

    2009-02-01

    Medical-imaging systems are designed to aid medical specialists in a specific task. Therefore, the physical parameters of a system need to optimize the task performance of a human observer. This requires measurements of human performance in a given task during the system optimization. Typically, psychophysical studies are conducted for this purpose. Numerical observer models have been successfully used to predict human performance in several detection tasks. In particular, the task of signal detection using a channelized Hotelling observer (CHO) in simulated images has been widely explored. However, there are few studies on clinically acquired images that also contain anatomic noise. In this paper, we investigate the performance of a CHO in the task of detecting lung nodules in real radiographic images of the chest. To evaluate variability introduced by the limited available data, we employ a commonly used study of a multi-reader multi-case (MRMC) scenario. It accounts for both case and reader variability. Finally, we use the "one-shot" methods to estimate the MRMC variance of the area under the ROC curve (AUC). The obtained AUC compares well to those reported for a human observer study on a similar data set. Furthermore, the "one-shot" analysis implies a fairly consistent performance of the CHO with the variance of AUC below 0.002. This indicates promising potential for numerical observers in optimization of medical imaging displays and encourages further investigation on the subject.
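    The figure of merit in the abstract above is the area under the ROC curve (AUC). As a hedged illustration, the sketch below computes the nonparametric (Mann-Whitney) AUC from observer test statistics. It does not implement the channelized Hotelling observer or the "one-shot" MRMC variance estimate, and the function name `empirical_auc` and the example scores are assumptions made for illustration.

```python
def empirical_auc(signal_scores, noise_scores):
    """Nonparametric AUC: probability that a signal-present score
    exceeds a signal-absent score, counting ties as one half."""
    wins = 0.0
    for s in signal_scores:
        for n in noise_scores:
            if s > n:
                wins += 1.0
            elif s == n:
                wins += 0.5
    return wins / (len(signal_scores) * len(noise_scores))


# Illustrative observer outputs (not data from the study).
nodule_present = [0.9, 0.8, 0.4]
nodule_absent = [0.5, 0.3, 0.2]
auc = empirical_auc(nodule_present, nodule_absent)
```

    An AUC of 0.5 corresponds to chance performance and 1.0 to perfect separation of the two classes.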

  13. Performance evaluation of the QIAGEN EZ1 DSP Virus Kit with Abbott RealTime HIV-1, HBV and HCV assays.

    PubMed

    Schneider, George J; Kuper, Kevin G; Abravaya, Klara; Mullen, Carolyn R; Schmidt, Marion; Bunse-Grassmann, Astrid; Sprenger-Haussels, Markus

    2009-04-01

    Automated sample preparation systems must meet the demands of routine diagnostics laboratories with regard to performance characteristics and compatibility with downstream assays. In this study, the performance of QIAGEN EZ1 DSP Virus Kit on the BioRobot EZ1 DSP was evaluated in combination with the Abbott RealTime HIV-1, HCV, and HBV assays, followed by thermal cycling and detection on the Abbott m2000rt platform. The following performance characteristics were evaluated: linear range and precision, sensitivity, cross-contamination, effects of interfering substances and correlation. Linearity was observed within the tested ranges (for HIV-1: 2.0-6.0 log copies/ml, HCV: 1.3-6.9 log IU/ml, HBV: 1.6-7.6 log copies/ml). Excellent precision was obtained (inter-assay standard deviation for HIV-1: 0.06-0.17 log copies/ml (>2.17 log copies/ml), HCV: 0.05-0.11 log IU/ml (>2.09 log IU/ml), HBV: 0.03-0.07 log copies/ml (>2.55 log copies/ml)), with good sensitivity (95% hit rates for HIV-1: 50 copies/ml, HCV: 12.5 IU/ml, HBV: 10 IU/ml). No cross-contamination was observed, as well as no negative impact of elevated levels of various interfering substances. In addition, HCV and HBV viral load measurements after BioRobot EZ1 DSP extraction correlated well with those obtained after Abbott m2000sp extraction. This evaluation demonstrates that the QIAGEN EZ1 DSP Virus Kit provides an attractive solution for fully automated, low throughput sample preparation for use with the Abbott RealTime HIV-1, HCV, and HBV assays.

  14. The SPAtial EFficiency metric (SPAEF): multiple-component evaluation of spatial patterns for optimization of hydrological models

    NASA Astrophysics Data System (ADS)

    Koch, Julian; Cüneyd Demirel, Mehmet; Stisen, Simon

    2018-05-01

    The process of model evaluation is not only an integral part of model development and calibration but also of paramount importance when communicating modelling results to the scientific community and stakeholders. The modelling community has a large and well-tested toolbox of metrics to evaluate temporal model performance. In contrast, spatial performance evaluation has not kept pace with the wide availability of spatial observations or with the sophisticated model codes simulating the spatial variability of complex hydrological processes. This study makes a contribution towards advancing spatial-pattern-oriented model calibration by rigorously testing a multiple-component performance metric. The promoted SPAtial EFficiency (SPAEF) metric reflects three equally weighted components: correlation, coefficient of variation and histogram overlap. This multiple-component approach is found to be advantageous in order to achieve the complex task of comparing spatial patterns. SPAEF, its three components individually and two alternative spatial performance metrics, i.e. connectivity analysis and fractions skill score, are applied in a spatial-pattern-oriented model calibration of a catchment model in Denmark. Results suggest the importance of multiple-component metrics because stand-alone metrics tend to fail to provide holistic pattern information. The three SPAEF components are found to be independent, which allows them to complement each other in a meaningful way. In order to optimally exploit spatial observations made available by remote sensing platforms, this study suggests applying bias insensitive metrics which further allow for a comparison of variables which are related but may differ in unit. This study applies SPAEF in the hydrological context using the mesoscale Hydrologic Model (mHM; version 5.8), but we see great potential across disciplines related to spatially distributed earth system modelling.
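    The abstract above names three equally weighted SPAEF components but gives no formula. The sketch below assumes the commonly cited form SPAEF = 1 - sqrt((alpha - 1)^2 + (beta - 1)^2 + (gamma - 1)^2), with alpha the Pearson correlation, beta the ratio of coefficients of variation (simulated over observed), and gamma the histogram intersection of the z-scored fields. That form, the function names, and the default of 10 bins are assumptions rather than details confirmed by this record. Fields are passed as flat lists of cell values and must have non-zero mean and spread.

```python
import math
from statistics import mean, pstdev


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    mx, my = mean(x), mean(y)
    num = sum((a - mx) * (b - my) for a, b in zip(x, y))
    den = math.sqrt(sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y))
    return num / den


def zscores(values):
    """Standardise values to zero mean and unit (population) std."""
    m, s = mean(values), pstdev(values)
    return [(v - m) / s for v in values]


def histogram(values, lo, hi, bins):
    """Count values in `bins` equal-width bins spanning [lo, hi]."""
    counts = [0] * bins
    width = (hi - lo) / bins
    for v in values:
        idx = min(int((v - lo) / width), bins - 1)  # top edge goes in last bin
        counts[idx] += 1
    return counts


def spaef(obs, sim, bins=10):
    """Assumed SPAEF form: 1 minus the distance of (alpha, beta, gamma) from (1, 1, 1)."""
    alpha = pearson(obs, sim)
    beta = (pstdev(sim) / mean(sim)) / (pstdev(obs) / mean(obs))
    z_obs, z_sim = zscores(obs), zscores(sim)
    lo, hi = min(z_obs + z_sim), max(z_obs + z_sim)
    k = histogram(z_obs, lo, hi, bins)
    l = histogram(z_sim, lo, hi, bins)
    gamma = sum(min(a, b) for a, b in zip(k, l)) / sum(k)
    return 1.0 - math.sqrt((alpha - 1) ** 2 + (beta - 1) ** 2 + (gamma - 1) ** 2)
```

    Under this form a perfect match scores 1. Because all three components are unchanged when the simulated field is multiplied by a positive constant, the score is insensitive to that kind of bias, in line with the bias-insensitive metrics the abstract recommends.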

  15. Evidence for -Gz Adaptation Observed with Wearable Biosensors During High Performance Jet Flight.

    PubMed

    Rice, G Merrill; Snider, Dallas; Moore, Jeffrey L; Lavan, J Timothy; Folga, Rich; VanBrunt, Thomas B

    2016-12-01

    Few studies have evaluated physiological responses to high acceleration forces during actual flight and to our knowledge no normative data has been acquired by technologies such as wearable biosensors during high performance jet aircraft operations. In-flight physiological data from an FDA cleared portable triaxial accelerometer and bio-sensor were observed from five active duty F-18 pilots of the Naval Flight Demonstration Squadron (Blue Angels). Of the five pilots, three were formation pilots who flew lower G profiles and two were solo pilots who flew higher G profiles. Physiological parameters monitored were heart rate, respiratory rate, temperature, caloric expenditure, and duration of exposure to levels of acceleration. Evaluated were 25 practice demonstration flights; 9 flights were excluded secondary to incomplete or inaccurate physiological data. We observed no significant bradycardia during a total of 189 maneuvers which met inclusion criteria for push-pull events (PPE) or isolated -Gz exposures. Further analysis of 73 PPE revealed an overall significant rise in HR following the PPE, where mean heart rate was 106 (95% CI, 100:112) at the beginning of the push and 129 (95% CI, 123:135) following the pull. A majority of the flights monitored provided reliable physiological data. Initial data suggests, contrary to currently held aeromedical doctrine, maneuvers such as the "push-pull" do not evoke vasovagal based bradycardic responses in aerobatic pilots. Possible explanations for these findings are sympathetic nervous system activation through adaptation and/or sustained isometric resistance from control inputs, both of which are areas of future research for our team. Rice GM, Snider D, Moore JL, Lavan JT, Folga R, VanBrunt TB. Evidence for -Gz adaptation observed with wearable biosensors during high performance jet flight. Aerosp Med Hum Perform. 2016; 87(12):996-1003.

  16. Design and flight performance evaluation of the Mariners 6, 7, and 9 short-circuit current, open-circuit voltage transducers

    NASA Technical Reports Server (NTRS)

    Patterson, R. E.

    1973-01-01

    The purpose of the short-circuit voltage transducer is to provide engineering data to aid the evaluation of array performance during flight. The design, fabrication, calibration, and in-flight performance of the transducers onboard the Mariner 6, 7 and 9 spacecraft are described. No significant differences were observed in the in-flight electrical performance of the three transducers. The transducers did experience significant losses due to coverslides or adhesive darkening, increased surface reflection, or spectral shifts within the coverslide assembly. Mariner 6, 7 and 9 transducers showed non-cell current degradations of 3-1/2%, 3%, and 4%, respectively at Mars encounter and 6%, 3%, and 4-1/2%, respectively at end of mission. Mariner 9 Solar Array Test 2 showed 3-1/2% current degradation while the transducer showed 4-1/2% degradation.

  17. Signal detection theory and methods for evaluating human performance in decision tasks

    NASA Technical Reports Server (NTRS)

    Obrien, Kevin; Feldman, Evan M.

    1993-01-01

    Signal Detection Theory (SDT) can be used to assess decision making performance in tasks that are not commonly thought of as perceptual. SDT takes into account both the sensitivity and biases in responding when explaining the detection of external events. In the standard SDT tasks, stimuli are selected in order to reveal the sensory capabilities of the observer. SDT can also be used to describe performance when decisions must be made as to the classification of easily and reliably sensed stimuli. Numbers are stimuli that are minimally affected by sensory processing and can belong to meaningful categories that overlap. Multiple studies have shown that the task of categorizing numbers from overlapping normal distributions produces performance predictable by SDT. These findings are particularly interesting in view of the similarity between the task of the categorizing numbers and that of determining the status of a mechanical system based on numerical values that represent sensor readings. Examples of the use of SDT to evaluate performance in decision tasks are reviewed. The methods and assumptions of SDT are shown to be effective in the measurement, evaluation, and prediction of human performance in such tasks.
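    The abstract above separates sensitivity from response bias. A minimal sketch of the two standard equal-variance Gaussian SDT measures follows: sensitivity d' = z(H) - z(F) and criterion c = -(z(H) + z(F)) / 2, where H is the hit rate and F the false-alarm rate. The function name `sdt_measures`, the log-linear correction, and the example counts are assumptions made for illustration and are not taken from the report.

```python
from statistics import NormalDist


def sdt_measures(hits, misses, false_alarms, correct_rejections):
    """Return (d_prime, criterion) under the equal-variance Gaussian model."""
    # Log-linear correction (add 0.5 to each count) keeps both rates
    # strictly between 0 and 1 so the z-transform stays finite.
    hit_rate = (hits + 0.5) / (hits + misses + 1.0)
    fa_rate = (false_alarms + 0.5) / (false_alarms + correct_rejections + 1.0)
    z = NormalDist().inv_cdf  # inverse of the standard normal CDF
    d_prime = z(hit_rate) - z(fa_rate)
    criterion = -0.5 * (z(hit_rate) + z(fa_rate))
    return d_prime, criterion


# Illustrative counts: an operator judging whether sensor readings
# indicate a fault (signal) or normal operation (noise).
d_prime, criterion = sdt_measures(hits=80, misses=20,
                                  false_alarms=20, correct_rejections=80)
```

    A criterion near zero indicates an unbiased observer, while a positive value indicates a conservative tendency to report "no fault".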

  18. Time-resolved speckle effects on the estimation of laser-pulse arrival times

    NASA Technical Reports Server (NTRS)

    Tsai, B.-M.; Gardner, C. S.

    1985-01-01

    A maximum-likelihood (ML) estimator of the pulse arrival in laser ranging and altimetry is derived for the case of a pulse distorted by shot noise and time-resolved speckle. The performance of the estimator is evaluated for pulse reflections from flat diffuse targets and compared with the performance of a suboptimal centroid estimator and a suboptimal Bar-David ML estimator derived under the assumption of no speckle. In the large-signal limit the accuracy of the estimator was found to improve as the width of the receiver observational interval increases. The timing performance of the estimator is expected to be highly sensitive to background noise when the received pulse energy is high and the receiver observational interval is large. Finally, in the speckle-limited regime the ML estimator performs considerably better than the suboptimal estimators.

  19. Selecting among competing models of electro-optic, infrared camera system range performance

    USGS Publications Warehouse

    Nichols, Jonathan M.; Hines, James E.; Nichols, James D.

    2013-01-01

    Range performance is often the key requirement around which electro-optical and infrared camera systems are designed. This work presents an objective framework for evaluating competing range performance models. Model selection based on the Akaike’s Information Criterion (AIC) is presented for the type of data collected during a typical human observer and target identification experiment. These methods are then demonstrated on observer responses to both visible and infrared imagery in which one of three maritime targets was placed at various ranges. We compare the performance of a number of different models, including those appearing previously in the literature. We conclude that our model-based approach offers substantial improvements over the traditional approach to inference, including increased precision and the ability to make predictions for some distances other than the specific set for which experimental trials were conducted.
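    The abstract above selects among competing models with Akaike's Information Criterion. As a hedged sketch, AIC = 2k - 2 ln L for a model with k fitted parameters and maximised likelihood L, and Akaike weights turn AIC differences into relative support. The function names and the log-likelihood values below are hypothetical and are not results from the study.

```python
import math


def aic(log_likelihood, n_params):
    """Akaike's Information Criterion; lower values indicate a better trade-off."""
    return 2 * n_params - 2 * log_likelihood


def akaike_weights(aic_values):
    """Relative support for each model, normalised to sum to 1."""
    best = min(aic_values)
    rel = [math.exp(-0.5 * (a - best)) for a in aic_values]
    total = sum(rel)
    return [r / total for r in rel]


# Hypothetical maximised log-likelihoods and parameter counts for
# three candidate range performance models.
candidates = {"model_a": (-120.4, 2), "model_b": (-118.9, 3), "model_c": (-118.7, 5)}
scores = {name: aic(ll, k) for name, (ll, k) in candidates.items()}
weights = dict(zip(scores, akaike_weights(list(scores.values()))))
best_model = min(scores, key=scores.get)
```

    The penalty term 2k is what prevents the most heavily parameterised model from winning on fit alone.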

  20. On the use and the performance of software reliability growth models

    NASA Technical Reports Server (NTRS)

    Keiller, Peter A.; Miller, Douglas R.

    1991-01-01

    We address the problem of predicting future failures for a piece of software. The number of failures occurring during a finite future time interval is predicted from the number of failures observed during an initial period of usage by using software reliability growth models. Two different methods for using the models are considered: straightforward use of individual models, and dynamic selection among models based on goodness-of-fit and quality-of-prediction criteria. Performance is judged by the relative error of the predicted number of failures over future finite time intervals relative to the number of failures eventually observed during the intervals. Six of the former models and eight of the latter are evaluated, based on their performance on twenty data sets. Many open questions remain regarding the use and the performance of software reliability growth models.

  1. Expanded Awareness of Student Performance: A Case Study in Applied Ethnographic Monitoring in a Bilingual Classroom. Sociolinguistic Working Paper Number 60.

    ERIC Educational Resources Information Center

    Carrasco, Robert L.

    The case study of the use of a classroom observation technique to evaluate the abilities and performance of a bilingual kindergarten student previously assessed as a low achiever is described. There are three objectives: to show the validity of the ethnographic monitoring technique, to show the value of teachers as collaborating researchers, and…

  2. The association between self-perceived proficiency of personal protective equipment and objective performance: An observational study during a bioterrorism simulation drill.

    PubMed

    Fogel, Itay; David, Osant; Balik, Chaya H; Eisenkraft, Arik; Poles, Lion; Shental, Omri; Kassirer, Michael; Brosh-Nissimov, Tal

    2017-11-01

    The recent Ebola virus disease outbreak emphasized the potential misuse of personal protective equipment (PPE) by health care workers (HCWs) during such an event. We aimed to compare self-perceived proficiency of PPE use and objective performance, and identify predictors of low compliance and PPE misuse. An observational study combined with subjective questionnaires was carried out during a bioterror simulation drill. Forty-two observers evaluated performance under PPE. Mistakes were recorded and graded using a structured observational format and were correlated with the subjective questionnaires and with demographic parameters. One hundred seventy-eight HCWs from community clinics and hospitals were included. The mean self-perceived proficiency was high (6.1 out of 7), mean level of comfort was moderate (4.0 out of 7), and mean objective performance was intermediate (9.5 out of 13). There was no correlation between comfort and objective performance scores. Self-perceived proficiency correlated with donning and continuous performance with PPE but not with doffing. Clinic personnel performed better than personnel in hospitals (40.3% vs 67.8% with 3 or more mistakes, respectively; P = .001). Demographic characteristics had no correlation with objective or self-perceived performance. Self-perceived proficiency is a poor predictor of appropriate PPE use. The results suggest poor awareness of the possibility of PPE misuse. Copyright © 2017 Association for Professionals in Infection Control and Epidemiology, Inc. Published by Elsevier Inc. All rights reserved.

  3. Market behavior and performance of different strategy evaluation schemes

    NASA Astrophysics Data System (ADS)

    Baek, Yongjoo; Lee, Sang Hoon; Jeong, Hawoong

    2010-08-01

    Strategy evaluation schemes are a crucial factor in any agent-based market model, as they determine the agents’ strategy preferences and consequently their behavioral pattern. This study investigates how the strategy evaluation schemes adopted by agents affect their performance in conjunction with the market circumstances. We observe the performance of three strategy evaluation schemes, the history-dependent wealth game, the trend-opposing minority game, and the trend-following majority game, in a stock market where the price is exogenously determined. The price is either directly adopted from the real stock market indices or generated with a Markov chain of order ≤2. Each scheme’s success is quantified by average wealth accumulated by the traders equipped with the scheme. The wealth game, as it learns from the history, shows relatively good performance unless the market is highly unpredictable. The majority game is successful in a trendy market dominated by long periods of sustained price increase or decrease. On the other hand, the minority game is suitable for a market with persistent zigzag price patterns. We also discuss the consequence of implementing finite memory in the scoring processes of strategies. Our findings suggest under which market circumstances each evaluation scheme is appropriate for modeling the behavior of real market traders.

  4. Market behavior and performance of different strategy evaluation schemes.

    PubMed

    Baek, Yongjoo; Lee, Sang Hoon; Jeong, Hawoong

    2010-08-01

    Strategy evaluation schemes are a crucial factor in any agent-based market model, as they determine the agents' strategy preferences and consequently their behavioral pattern. This study investigates how the strategy evaluation schemes adopted by agents affect their performance in conjunction with the market circumstances. We observe the performance of three strategy evaluation schemes, the history-dependent wealth game, the trend-opposing minority game, and the trend-following majority game, in a stock market where the price is exogenously determined. The price is either directly adopted from the real stock market indices or generated with a Markov chain of order ≤2. Each scheme's success is quantified by average wealth accumulated by the traders equipped with the scheme. The wealth game, as it learns from the history, shows relatively good performance unless the market is highly unpredictable. The majority game is successful in a trendy market dominated by long periods of sustained price increase or decrease. On the other hand, the minority game is suitable for a market with persistent zigzag price patterns. We also discuss the consequence of implementing finite memory in the scoring processes of strategies. Our findings suggest under which market circumstances each evaluation scheme is appropriate for modeling the behavior of real market traders.

  5. Geant4 Computing Performance Benchmarking and Monitoring

    DOE PAGES

    Dotti, Andrea; Elvira, V. Daniel; Folger, Gunter; ...

    2015-12-23

    Performance evaluation and analysis of large scale computing applications is essential for optimal use of resources. As detector simulation is one of the most compute intensive tasks and Geant4 is the simulation toolkit most widely used in contemporary high energy physics (HEP) experiments, it is important to monitor Geant4 through its development cycle for changes in computing performance and to identify problems and opportunities for code improvements. All Geant4 development and public releases are being profiled with a set of applications that utilize different input event samples, physics parameters, and detector configurations. Results from multiple benchmarking runs are compared to previous public and development reference releases to monitor CPU and memory usage. Observed changes are evaluated and correlated with code modifications. Besides the full summary of call stack and memory footprint, a detailed call graph analysis is available to Geant4 developers for further analysis. The set of software tools used in the performance evaluation procedure, both in sequential and multi-threaded modes, include FAST, IgProf and Open|Speedshop. In conclusion, the scalability of the CPU time and memory performance in multi-threaded application is evaluated by measuring event throughput and memory gain as a function of the number of threads for selected event samples.

  6. The Linear Programming to evaluate the performance of Oral Health in Primary Care.

    PubMed

    Colussi, Claudia Flemming; Calvo, Maria Cristina Marino; Freitas, Sergio Fernando Torres de

    2013-01-01

    To show the use of Linear Programming to evaluate the performance of Oral Health in Primary Care. This study used data from 19 municipalities in the state of Santa Catarina that participated in the state evaluation in 2009 and have more than 50,000 inhabitants. A total of 40 indicators were evaluated, calculated using Microsoft Excel 2007, and converted to the interval [0, 1] in ascending order (one indicating the best situation and zero indicating the worst situation). Applying the Linear Programming technique, municipalities were assessed and compared with one another according to a performance curve named the "quality estimated frontier". Municipalities included in the frontier were classified as excellent. Indicators were aggregated into synthetic indicators. The majority of municipalities not included in the quality frontier (values other than 1.0) had values lower than 0.5, indicating poor performance. The model applied to the municipalities of Santa Catarina assessed municipal management and local priorities rather than the goals imposed by pre-defined parameters. In the final analysis three municipalities were included in the "perceived quality frontier". The Linear Programming technique made it possible to identify gaps that must be addressed by city managers to enhance actions taken. It also made it possible to observe each municipality's performance and compare results among similar municipalities.

  7. Tactile orientation perception: an ideal observer analysis of human psychophysical performance in relation to macaque area 3b receptive fields

    PubMed Central

    Peters, Ryan M.; Staibano, Phillip

    2015-01-01

    The ability to resolve the orientation of edges is crucial to daily tactile and sensorimotor function, yet the means by which edge perception occurs is not well understood. Primate cortical area 3b neurons have diverse receptive field (RF) spatial structures that may participate in edge orientation perception. We evaluated five candidate RF models for macaque area 3b neurons, previously recorded while an oriented bar contacted the monkey's fingertip. We used a Bayesian classifier to assign each neuron a best-fit RF structure. We generated predictions for human performance by implementing an ideal observer that optimally decoded stimulus-evoked spike counts in the model neurons. The ideal observer predicted a saturating reduction in bar orientation discrimination threshold with increasing bar length. We tested 24 humans on an automated, precision-controlled bar orientation discrimination task and observed performance consistent with that predicted. We next queried the ideal observer to discover the RF structure and number of cortical neurons that best matched each participant's performance. Human perception was matched with a median of 24 model neurons firing throughout a 1-s period. The 10 lowest-performing participants were fit with RFs lacking inhibitory sidebands, whereas 12 of the 14 higher-performing participants were fit with RFs containing inhibitory sidebands. Participants whose discrimination improved as bar length increased to 10 mm were fit with longer RFs; those who performed well on the 2-mm bar, with narrower RFs. These results suggest plausible RF features and computational strategies underlying tactile spatial perception and may have implications for perceptual learning. PMID:26354318

  8. Observational and Modeling Studies of Clouds and the Hydrological Cycle

    NASA Technical Reports Server (NTRS)

    Somerville, Richard C. J.

    1997-01-01

    Our approach involved validating parameterizations directly against measurements from field programs, and using this validation to tune existing parameterizations and to guide the development of new ones. We have used a single-column model (SCM) to make the link between observations and parameterizations of clouds, including explicit cloud microphysics (e.g., prognostic cloud liquid water used to determine cloud radiative properties). Surface and satellite radiation measurements were used to provide an initial evaluation of the performance of the different parameterizations. The results of this evaluation were then used to develop improved cloud and cloud-radiation schemes, which were tested in GCM experiments.

  9. The role of nitrogen availability in land-atmosphere interactions: a systematic evaluation of carbon-nitrogen coupling in a global land surface model using plot-level nitrogen fertilization experiments

    NASA Astrophysics Data System (ADS)

    Thomas, R. Q.; Goodale, C. L.; Bonan, G. B.; Mahowald, N. M.; Ricciuto, D. M.; Thornton, P. E.

    2010-12-01

    Recent research from global land surface models emphasizes the important role of nitrogen cycling on global climate, via its control on the terrestrial carbon balance. Despite the implications of nitrogen cycling on global climate predictions, the research community has not performed a systematic evaluation of nitrogen cycling in global models. Here, we present such an evaluation for one global land model, CLM-CN. In the evaluation we simulated 45 plot-scale nitrogen-fertilization experiments distributed across 33 temperate and boreal forest sites. Model predictions were evaluated against field observations by comparing the vegetation and soil carbon responses to the additional nitrogen. Aggregated across all experiments, the model predicted a larger vegetation carbon response and a smaller soil carbon response than observed; the responses partially offset each other, leading to a slightly larger total ecosystem carbon response than observed. However, the model-observation agreement improved for vegetation carbon when the sites with observed negative carbon responses to nitrogen were excluded, which may be because the model lacks mechanisms whereby nitrogen additions increase tree mortality. Among experiments, the vegetation carbon responses of younger forests and boreal forests were less than predicted, and those of mature forests (> 40 years old) were greater than predicted. Specific to the CLM-CN, this study used a systematic evaluation to identify key areas to focus model development, especially soil carbon-nitrogen interactions and boreal forest nitrogen cycling. Applicable to the modeling community, this study demonstrates a standardized protocol for comparing carbon-nitrogen interactions among global land models.

  10. Correlation between a 2D Channelized Hotelling Observer and Human Observers in a Low-contrast Detection Task with Multi-slice Reading in CT

    PubMed Central

    Yu, Lifeng; Chen, Baiyu; Kofler, James M.; Favazza, Christopher P.; Leng, Shuai; Kupinski, Matthew A.; McCollough, Cynthia H.

    2017-01-01

    Purpose Model observers have been successfully developed and used to assess the quality of static 2D CT images. However, radiologists typically read images by paging through multiple 2D slices (i.e. multi-slice reading). The purpose of this study was to correlate human and model observer performance in a low-contrast detection task performed using both 2D and multi-slice reading, and to determine if the 2D model observer still correlates well with human observer performance in multi-slice reading. Methods A phantom containing 18 low-contrast spheres (6 sizes × 3 contrast levels) was scanned on a 192-slice CT scanner at 5 dose levels (CTDIvol = 27, 13.5, 6.8, 3.4, and 1.7 mGy), each repeated 100 times. Images were reconstructed using both filtered-backprojection (FBP) and an iterative reconstruction (IR) method (ADMIRE, Siemens). A 3D volume of interest (VOI) around each sphere was extracted and placed side-by-side with a signal-absent VOI to create a 2-alternative forced choice (2AFC) trial. Sixteen 2AFC studies were generated, each with 100 trials, to evaluate the impact of radiation dose, lesion size and contrast, and reconstruction methods on object detection. In total, 1600 trials were presented to both model and human observers. Three medical physicists acted as human observers and were allowed to page through the 3D volumes to make a decision for each 2AFC trial. The human observer performance was compared with the performance of a multi-slice channelized Hotelling observer (CHO_MS), which integrates multi-slice image data, and with the performance of previously validated CHO, which operates on static 2D images (CHO_2D). For comparison, the same 16 2AFC studies were also performed in a 2D viewing mode by the human observers and compared with the multi-slice viewing performance and the two CHO models.
Results Human observer performance was well correlated with the CHO_2D performance in the 2D viewing mode (Pearson product-moment correlation coefficient R=0.972, 95% confidence interval (CI): 0.919 to 0.990) and with the CHO_MS performance in the multi-slice viewing mode (R=0.952, 95% CI: 0.865 to 0.984). The CHO_2D performance, calculated from the 2D viewing mode, also had a strong correlation with human observer performance in the multi-slice viewing mode (R=0.957, 95% CI: 0.879 to 0.985). Human observer performance varied between the multi-slice and 2D modes. One reader performed better in the multi-slice mode (p=0.013); whereas the other two readers showed no significant difference between the two viewing modes (p=0.057 and p=0.38). Conclusions A 2D CHO model is highly correlated with human observer performance in detecting spherical low contrast objects in multi-slice viewing of CT images. This finding provides some evidence for the use of a simpler, 2D CHO to assess image quality in clinically relevant CT tasks where multi-slice viewing is used. PMID:28555878
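
    The abstract above does not say how the confidence intervals on the Pearson coefficients were obtained. A minimal sketch, assuming the usual Fisher z-transformation and assuming that each of the sixteen 2AFC studies contributes one data point (n = 16), is given below. The function name `pearson_ci` is hypothetical.

```python
import math


def pearson_ci(r, n, z_crit=1.959964):
    """Approximate 95% CI for a Pearson r via the Fisher z-transformation.

    Assumptions (not stated in the abstract): Fisher's method, n paired points.
    """
    z = math.atanh(r)                 # transform r to an approximately normal scale
    se = 1.0 / math.sqrt(n - 3)       # standard error on the z scale
    lower = math.tanh(z - z_crit * se)
    upper = math.tanh(z + z_crit * se)
    return lower, upper


if __name__ == "__main__":
    for r in (0.972, 0.952):
        lo, hi = pearson_ci(r, n=16)
        print(f"R = {r}: {lo:.3f} to {hi:.3f}")
```

    Under these assumptions the intervals come out close to the reported ones (about 0.919 to 0.990 for R = 0.972, and about 0.864 to 0.984 for R = 0.952), which suggests, but does not prove, that a comparable method was used.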

  11. Correlation between a 2D channelized Hotelling observer and human observers in a low-contrast detection task with multislice reading in CT.

    PubMed

    Yu, Lifeng; Chen, Baiyu; Kofler, James M; Favazza, Christopher P; Leng, Shuai; Kupinski, Matthew A; McCollough, Cynthia H

    2017-08-01

    Model observers have been successfully developed and used to assess the quality of static 2D CT images. However, radiologists typically read images by paging through multiple 2D slices (i.e., multislice reading). The purpose of this study was to correlate human and model observer performance in a low-contrast detection task performed using both 2D and multislice reading, and to determine if the 2D model observer still correlates well with human observer performance in multislice reading. A phantom containing 18 low-contrast spheres (6 sizes × 3 contrast levels) was scanned on a 192-slice CT scanner at five dose levels (CTDIvol = 27, 13.5, 6.8, 3.4, and 1.7 mGy), each repeated 100 times. Images were reconstructed using both filtered-backprojection (FBP) and an iterative reconstruction (IR) method (ADMIRE, Siemens). A 3D volume of interest (VOI) around each sphere was extracted and placed side-by-side with a signal-absent VOI to create a 2-alternative forced choice (2AFC) trial. Sixteen 2AFC studies were generated, each with 100 trials, to evaluate the impact of radiation dose, lesion size and contrast, and reconstruction methods on object detection. In total, 1600 trials were presented to both model and human observers. Three medical physicists acted as human observers and were allowed to page through the 3D volumes to make a decision for each 2AFC trial. The human observer performance was compared with the performance of a multislice channelized Hotelling observer (CHO_MS), which integrates multislice image data, and with the performance of previously validated CHO, which operates on static 2D images (CHO_2D). For comparison, the same 16 2AFC studies were also performed in a 2D viewing mode by the human observers and compared with the multislice viewing performance and the two CHO models.
Human observer performance was well correlated with the CHO_2D performance in the 2D viewing mode [Pearson product-moment correlation coefficient R = 0.972, 95% confidence interval (CI): 0.919 to 0.990] and with the CHO_MS performance in the multislice viewing mode (R = 0.952, 95% CI: 0.865 to 0.984). The CHO_2D performance, calculated from the 2D viewing mode, also had a strong correlation with human observer performance in the multislice viewing mode (R = 0.957, 95% CI: 0.879 to 0.985). Human observer performance varied between the multislice and 2D modes. One reader performed better in the multislice mode (P = 0.013); whereas the other two readers showed no significant difference between the two viewing modes (P = 0.057 and P = 0.38). A 2D CHO model is highly correlated with human observer performance in detecting spherical low contrast objects in multislice viewing of CT images. This finding provides some evidence for the use of a simpler, 2D CHO to assess image quality in clinically relevant CT tasks where multislice viewing is used. © 2017 American Association of Physicists in Medicine.

  12. GEM-AQ, an On-line Global Multiscale Chemical Weather System: Model Description and Evaluation of Gas Phase Chemistry Processes

    NASA Astrophysics Data System (ADS)

    Neary, L.; Kaminski, J. W.; Struzewska, J.; Ainslie, B.; McConnell, J. C.

    2007-12-01

    Tropospheric chemistry and air quality processes were implemented on-line in the Global Environmental Multiscale model. The integrated model, GEM-AQ, has been developed as a platform to investigate chemical weather at scales from global to urban. On the global scale, the model was exercised for five years (2001-2005) to evaluate its ability to simulate seasonal variations and regional distributions of trace gases such as ozone, nitrogen dioxide and carbon monoxide. The model results are compared with observations from satellites, aircraft measurement campaigns and balloon sondes. The same model has also been evaluated on the regional (~15km resolution) and urban scale (~3km resolution). A simulation of the formation and transport of photooxidants during the European heat wave of 2006 was performed and compared with surface observations throughout central and eastern Europe. The complex topographic region of the Lower Fraser Valley in British Columbia was the focus of another model evaluation during the PACIFIC 2001 field campaign. Comparison of model results with observations during this period will be shown.

  13. OSSE Evaluation of Prospective Aircraft Reconnaissance Flight Patterns and their Impact on Hurricane Forecasts

    NASA Astrophysics Data System (ADS)

    Ryan, K. E.; Bucci, L. R.; Christophersen, H.; Atlas, R. M.; Murillo, S.; Dodge, P.

    2015-12-01

    Each year, NOAA/AOML's Hurricane Research Division (HRD) conducts its Hurricane field Program in which observations are collected via NOAA aircraft to improve the understanding and prediction of hurricanes. Mission experiments suggest a variety of flight patterns and sampling strategies aimed towards their respective goals described by the Intensity Forecasting Experiment (IFEX; Rogers et al., BAMS, 2006, 2013), a collaborative effort among HRD, NHC, and EMC. Evaluating the potential impact of various trade-offs in design is valuable for determining the optimal air reconnaissance flight pattern for a given prospective mission. AOML's HRD has developed a system for performing regional Observing System Simulation Experiments (OSSEs) to assess the potential impact of proposed observing systems on hurricane track and intensity forecasts and analyses. This study focuses on investigating the potential impact of proposed aircraft reconnaissance observing system designs. Aircraft instrument and flight level retrievals were simulated from a regional WRF ARW Nature Run (Nolan et al., 2013) spanning 13 days, covering the life cycle of a rapidly intensifying Atlantic tropical cyclone. The aircraft trajectories are simulated in a variety of ways and are evaluated to investigate the potential impact of aircraft reconnaissance observations on hurricane track and intensity forecasts.

  14. Scale Issues in Air Quality Modeling Policy Support

    EPA Science Inventory

    This study examines the issues relating to the use of regional photochemical air quality models for evaluating their performance in reproducing the spatio-temporal features embedded in the observations and for designing emission control strategies needed to achieve compliance wit...

  15. AWOS Sensor Evaluation : Transmissometer, Forward-Scatter Meter and Lidar Ceilometer

    DOT National Transportation Integrated Search

    1984-01-01

    Ceiling and visibility measurements are included in an Automatic Weather Observing System (AWOS) which is intended to satisfy the needs of aviation. The performance of one ceilometer and two visibility sensors was examined to determine whether they c...

  16. Salt Brine Blending to Optimize Deicing and Anti-Icing Performance and Cost Effectiveness : Phase III

    DOT National Transportation Integrated Search

    2017-11-01

    An evaluation of deicers and anti-icers and plowing, in parallel conditions on actual pavements to assess intuitions based on observations and anecdotal evidence. - Anti-icer persistence - Deicer effectiveness - Plow effectiveness - Pavement study of...

  17. Staging a Reflective Capstone Course to Transition PharmD Graduates to Professional Life

    PubMed Central

    Hobson, Eric H.; Spinelli, Alisa J.

    2015-01-01

    Objective. To develop and implement a capstone course that would allow students to reflect on their development as a professional, assess and share their achievement of the college’s outcomes, complete a professional portfolio, establish a continuing professional development plan, and prepare to enter the pharmacy profession. Design. Students were required to complete a hybrid course built around 4 online and inclass projects during the final semester of the curriculum. Assessment. Faculty used direct measures of learning, such as reading student portfolios and program outcome reflections, evaluating professional development plans, and directly observing each student in a video presentation. All projects were evaluated using standardized rubrics. Since 2012, all graduating students met the course’s minimum performance requirements. Conclusion. The course provided an opportunity for student-based summative evaluation, direct observation of student skills, and documentation of outcome completion as a means of evaluating readiness to enter the profession. PMID:25741030

  18. Development and evaluation of a decision-based simulation for assessment of team skills.

    PubMed

    Andrew, Brandon; Plachta, Stephen; Salud, Lawrence; Pugh, Carla M

    2012-08-01

    There is a need to train and evaluate a wide variety of nontechnical surgical skills. The goal of this project was to develop and evaluate a decision-based simulation to assess team skills. The decision-based exercise used our previously validated Laparoscopic Ventral Hernia simulator and a newly developed team evaluation survey. Five teams of 3 surgical residents (N = 15) were tasked with repairing a 10 × 10-cm right upper quadrant hernia. During the simulation, independent observers (N = 6) completed a 6-item survey assessing: (1) work quality; (2) communication; and (3) team effectiveness. After the simulation, team members self-rated their performance by using the same survey. Survey reliability revealed a Cronbach's alpha of r = .811. Significant differences were found when we compared team members' (T) and observers' (O) ratings for communication (T = 4.33/5.00 vs O = 3.00/5.00, P < .01) and work quality (T = 4.33/5.00 vs O = 3.33/5.00, P < .05). The team with the greatest survey ratings was the only group to successfully complete the task. The team evaluation survey had good reliability and correlated with task performance on the simulator. Our current and previous work provides strong evidence that nontechnical and team related skills can be assessed without simulating a crisis situation. Copyright © 2012 Mosby, Inc. All rights reserved.
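
    The abstract above reports survey reliability as Cronbach's alpha. As a minimal sketch of how that statistic is usually computed, the function below takes one row of item scores per respondent. The name `cronbach_alpha` and the sample ratings are hypothetical and are not data from the study.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(rows[0])
    # Sample variance of each item across respondents.
    item_vars = [variance([row[j] for row in rows]) for j in range(k)]
    # Sample variance of each respondent's total score.
    total_var = variance([sum(row) for row in rows])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


if __name__ == "__main__":
    # Hypothetical ratings: four respondents, two items.
    ratings = [[1, 2], [2, 1], [3, 4], [4, 3]]
    print(round(cronbach_alpha(ratings), 3))
```

    Alpha rises toward 1 as the items vary together, which is why it is read as an internal-consistency measure.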

  19. Evaluating Air-Quality Models: Review and Outlook.

    NASA Astrophysics Data System (ADS)

    Weil, J. C.; Sykes, R. I.; Venkatram, A.

    1992-10-01

    Over the past decade, much attention has been devoted to the evaluation of air-quality models with emphasis on model performance in predicting the high concentrations that are important in air-quality regulations. This paper stems from our belief that this practice needs to be expanded to 1) evaluate model physics and 2) deal with the large natural or stochastic variability in concentration. The variability is represented by the root-mean-square fluctuating concentration (σc) about the mean concentration (C) over an ensemble, that is, a given set of meteorological, source, etc. conditions. Most air-quality models used in applications predict C, whereas observations are individual realizations drawn from an ensemble. For σc ≳ C, large residuals exist between predicted and observed concentrations, which confuse model evaluations. This paper addresses ways of evaluating model physics in light of the large σc; the focus is on elevated point-source models. Evaluation of model physics requires the separation of the mean model error (the difference between the predicted and observed C) from the natural variability. A residual analysis is shown to be an effective way of doing this. Several examples demonstrate the usefulness of residuals as well as correlation analyses and laboratory data in judging model physics. In general, σc models and predictions of the probability distribution of the fluctuating concentration (c′), p(c′), are in the developmental stage, with laboratory data playing an important role. Laboratory data from point-source plumes in a convection tank show that p(c′) approximates a self-similar distribution along the plume center plane, a useful result in a residual analysis. At present, there is one model, ARAP, that predicts C, σc, and p(c′) for point-source plumes. This model is more computationally demanding than other dispersion models (for C only) and must be demonstrated as a practical tool. However, it predicts an important quantity for applications: the uncertainty in the very high and infrequent concentrations. The uncertainty is large and is needed in evaluating operational performance and in predicting the attainment of air-quality standards.

  20. An Independent Inter- and Intraobserver Agreement Evaluation of the AOSpine Subaxial Cervical Spine Injury Classification System.

    PubMed

    Urrutia, Julio; Zamora, Tomas; Yurac, Ratko; Campos, Mauricio; Palma, Joaquin; Mobarec, Sebastian; Prada, Carlos

    2017-03-01

    An agreement study. The aim of this study was to perform an independent interobserver and intraobserver agreement assessment of the AOSpine subaxial cervical spine injury classification system. The AOSpine subaxial cervical spine injury classification system was recently described. It showed substantial inter- and intraobserver agreement in the study describing it; however, an independent evaluation has not been performed. Anteroposterior and lateral radiographs, computed tomography scans, and magnetic resonance imaging of 65 patients with acute traumatic subaxial cervical spine injuries were selected and classified using the morphologic grading of the subaxial cervical spine injury classification system by 6 evaluators (3 spine surgeons and 3 orthopedic surgery residents). After a 6-week interval, the 65 cases were presented to the same evaluators in a random sequence for repeat evaluation. The kappa coefficient (κ) was used to determine the inter- and intraobserver agreement. The interobserver agreement was substantial when considering the fracture main types (A, B, C, or F), with κ = 0.61 (0.57-0.64), but moderate when considering the subtypes: κ = 0.57 (0.54-0.60). The intraobserver agreement was substantial considering the fracture types, with κ = 0.68 (0.62-0.74) and considering subtypes, κ = 0.62 (0.57-0.66). No significant differences were observed between spine surgeons and orthopedic residents in the overall inter- and intraobserver agreement, or in the inter- and intraobserver agreement of specific A, B, C, or F type of injuries. This classification allows adequate agreement among different observers and by the same observer on separate occasions. Future prospective studies should determine whether this classification allows surgeons to decide the best treatment for patients with subaxial cervical spine injuries. Level of Evidence: 3.
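
    The abstract above reports agreement as a kappa coefficient but does not say which variant was used for its six evaluators. As a minimal sketch of the underlying idea, the function below computes Cohen's kappa for two raters only; a multi-rater statistic such as Fleiss' kappa may well be what the study used. The name `cohen_kappa` and the sample labels are hypothetical.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa: chance-corrected agreement between two raters."""
    n = len(rater_a)
    # Proportion of cases on which the two raters gave the same label.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance from each rater's label frequencies.
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    return (observed - expected) / (1 - expected)


if __name__ == "__main__":
    # Hypothetical fracture main types assigned to four cases.
    first = ["A", "A", "B", "B"]
    second = ["A", "A", "B", "C"]
    print(round(cohen_kappa(first, second), 3))
```

    The wording in the abstract (0.57 "moderate", 0.61 "substantial") matches the commonly used Landis and Koch bands, in which 0.41 to 0.60 is moderate and 0.61 to 0.80 is substantial.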

  1. Evaluation of the wave measurement in a stormy sea by the Along-Track interferometry SAR

    NASA Astrophysics Data System (ADS)

    Kojima, S.

    2015-12-01

    NICT developed the along-track interferometry SAR (AT-InSAR) system in 2011 to detect moving cars and ships and to measure sea surface velocity. Preliminary experiments with a moving truck and ship confirmed that the system performance met its specifications. In addition, a method to estimate the wave height from the sea surface velocity measured by the AT-InSAR was developed. A preliminary wave height observation was performed in a calm sea, and it was confirmed that the wave height could be estimated from the measured sea surface velocity. The purpose of this study is to check the capability of the AT-InSAR to observe ocean waves in a stormy sea. Therefore, an ocean wave observation was performed under low atmospheric pressure. The observation area was the sea surface 10 km off the coast of Kushiro, in southeastern Hokkaido, Japan, on 4 March 2015. The wind speed was 8-10 m/s during the observation, and the significant wave height and period were 1.5 m and 6.0 s. The observation was performed in 2 directions and the accuracy of the estimation results was checked. The significant wave height and period measured by the AT-InSAR agreed with those measured by the wave gauge located close to the observation area. In addition, it was confirmed that there were no irregular wave heights in the distribution of the estimated wave height. As a result, it became clear that the AT-InSAR could observe the wave height in a stormy sea.

  2. Methodologic Guide for Evaluating Clinical Performance and Effect of Artificial Intelligence Technology for Medical Diagnosis and Prediction.

    PubMed

    Park, Seong Ho; Han, Kyunghwa

    2018-03-01

    The use of artificial intelligence in medicine is currently an issue of great interest, especially with regard to the diagnostic or predictive analysis of medical images. Adoption of an artificial intelligence tool in clinical practice requires careful confirmation of its clinical utility. Herein, the authors explain key methodology points involved in a clinical evaluation of artificial intelligence technology for use in medicine, especially high-dimensional or overparameterized diagnostic or predictive models in which artificial deep neural networks are used, mainly from the standpoints of clinical epidemiology and biostatistics. First, statistical methods for assessing the discrimination and calibration performances of a diagnostic or predictive model are summarized. Next, the effects of disease manifestation spectrum and disease prevalence on the performance results are explained, followed by a discussion of the difference between evaluating the performance with use of internal and external datasets, the importance of using an adequate external dataset obtained from a well-defined clinical cohort to avoid overestimating the clinical performance as a result of overfitting in high-dimensional or overparameterized classification model and spectrum bias, and the essentials for achieving a more robust clinical evaluation. Finally, the authors review the role of clinical trials and observational outcome studies for ultimate clinical verification of diagnostic or predictive artificial intelligence tools through patient outcomes, beyond performance metrics, and how to design such studies. © RSNA, 2018.
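
    The abstract above mentions statistical methods for assessing discrimination without naming them. One widely used discrimination measure is the area under the ROC curve (AUC). A minimal sketch of it follows, computed as the probability that a randomly chosen positive case scores higher than a randomly chosen negative case. The name `auc` and the sample scores are hypothetical, and the article may cover other measures.

```python
def auc(scores_pos, scores_neg):
    """Area under the ROC curve via pairwise comparison (Mann-Whitney form)."""
    wins = 0.0
    for p in scores_pos:
        for q in scores_neg:
            if p > q:
                wins += 1.0      # positive case ranked above negative case
            elif p == q:
                wins += 0.5      # ties count as half
    return wins / (len(scores_pos) * len(scores_neg))


if __name__ == "__main__":
    # Hypothetical model outputs for diseased and non-diseased cases.
    print(auc([0.9, 0.8], [0.1, 0.8]))
```

    An AUC of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation. It says nothing about calibration, which has to be checked separately.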

  3. Production and evaluation of measuring equipment for share viscosity of polymer melts included nanofiller with injection molding machine

    NASA Astrophysics Data System (ADS)

    Kameda, Takao; Sugino, Naoto; Takei, Satoshi

    2016-10-01

    A shear viscosity measurement device was produced to evaluate the injection molding workability of high-performance resins. By measuring with the plasticization cylinder of the injection molding machine, observation was possible at shear rates from 10 to 10000 [1/sec], higher than those covered by a rotary rheometer. The measurement results extrapolated those obtained with the rotary rheometer.

  4. Statistical Inference and Reverse Engineering of Gene Regulatory Networks from Observational Expression Data

    PubMed Central

    Emmert-Streib, Frank; Glazko, Galina V.; Altay, Gökmen; de Matos Simoes, Ricardo

    2012-01-01

    In this paper, we present a systematic and conceptual overview of methods for inferring gene regulatory networks from observational gene expression data. Further, we discuss two classic approaches to infer causal structures and compare them with contemporary methods by providing a conceptual categorization thereof. We complement the above by surveying global and local evaluation measures for assessing the performance of inference algorithms. PMID:22408642

  5. Distraction versus Intensity: The Importance of Exercise Classes for Cognitive Performance in School.

    PubMed

    Wollseiffen, Petra; Vogt, Tobias; Strüder, Heiko K; Schneider, Stefan

    2018-01-01

    The aim of this study was to compare the influence of a class of aerobic exercise and an art class on brain cortical activity and possible effects on cognitive performance. Electroencephalography was used to record the electrocortical activity of 16 schoolchildren (8-10 years old) before and after an aerobic exercise class and an art class. Performance in a standardized test of educational attainment (VERA-3) was assessed following both classes. A significant decrease in cortical activity was detected in all 4 lobes after exercise but not after art classes (p < 0.05). No changes in cognitive performance were observed after exercise and art classes. In this study, cortical activity was reduced after an exercise class but no effect on cognitive performance was observed. Hence, the neurophysiological effect of exercise should be further evaluated regarding different kinds of cognitive performance: creativity, knowledge acquisition as well as the outlasting effects of exercise on academic achievement. © 2017 The Author(s) Published by S. Karger AG, Basel.

  6. An Experimental Study of the Effect of Out-of-the-Window Cues on Training Novice Pilots on a Flight Simulator

    NASA Technical Reports Server (NTRS)

    Khan, M. Javed; Rossi, Marcia; Heath, Bruce; Ali, Syed F.; Ward, Marcus

    2006-01-01

    The effects of out-of-the-window cues on learning a straight-in landing approach and a level 360° turn by novice pilots on a flight simulator have been investigated. The treatments consisted of training with and without visual cues as well as with different densities of visual cues. The performance of the participants was then evaluated through similar but more challenging tasks. It was observed that the participants in the landing study who trained with visual cues performed more poorly than those who trained without the cues. However, those who trained with a faded-cues sequence performed slightly better than those who trained without visual cues. In the level turn study it was observed that those who trained with the visual cues performed better than those who trained without visual cues. The study also showed that those participants who trained with a lower density of cues performed better than those who trained with a higher density of visual cues.

  7. Requirements for developing a regional monitoring capacity for aerosols in Europe within EMEP.

    PubMed

    Kahnert, Michael; Lazaridis, Mihalis; Tsyro, Svetlana; Torseth, Kjetil

    2004-07-01

    The European Monitoring and Evaluation Programme (EMEP) has been established to provide information to Parties to the Convention on Long Range Transboundary Air Pollution on deposition and concentration of air pollutants, as well as on the quantity and significance of long-range transmission of pollutants and transboundary fluxes. To achieve its objectives with the required scientific credibility and technical underpinning, a close integration of the programme's main elements is performed. These elements are emission inventories, chemical transport modelling, and the monitoring of atmospheric chemistry and deposition fluxes, which further are integrated towards abatement policy development. A critical element is the air pollution monitoring that is performed across Europe with a focus not only on health effect aspects and compliance monitoring, but also on process studies and source receptor relationships. Without a strong observational basis a predictive modelling capacity cannot be developed and validated. Thus the modelling success strongly depends on the quality and quantity of available observations. Particulate matter (PM) is a relatively recent addition to the EMEP monitoring programme, and the network for PM mass observations is still evolving. This article presents the current status of EMEP aerosol observations, followed by a critical evaluation in view of EMEP's main objectives and its model development requirements. Specific recommendations are given for improving the PM monitoring programme within EMEP.

  8. A New Method for the Evaluation and Prediction of Base Stealing Performance.

    PubMed

    Bricker, Joshua C; Bailey, Christopher A; Driggers, Austin R; McInnis, Timothy C; Alami, Arya

    2016-11-01

    Bricker, JC, Bailey, CA, Driggers, AR, McInnis, TC, and Alami, A. A new method for the evaluation and prediction of base stealing performance. J Strength Cond Res 30(11): 3044-3050, 2016-The purposes of this study were to evaluate a new method using electronic timing gates to monitor base stealing performance in terms of reliability, differences between it and traditional stopwatch-collected times, and its ability to predict base stealing performance. Twenty-five healthy collegiate baseball players performed maximal effort base stealing trials with a right- and a left-handed pitcher. An infrared electronic timing system was used to calculate the reaction time (RT) and total time (TT), whereas coaches' times (CT) were recorded with digital stopwatches. Reliability of the TGM was evaluated with intraclass correlation coefficients (ICCs) and coefficient of variation (CV). Differences between the TGM and traditional CT were calculated with paired samples t tests and Cohen's d effect size estimates. Base stealing performance predictability of the TGM was evaluated with Pearson's bivariate correlations. Acceptable relative reliability was observed (ICCs 0.74-0.84). Absolute reliability measures were acceptable for TT (CVs = 4.4-4.8%), but measures were elevated for RT (CVs = 32.3-35.5%). Statistical and practical differences were found between TT and CT (right p = 0.00, d = 1.28 and left p = 0.00, d = 1.49). The TGM TT seems to be a decent predictor of base stealing performance (r = -0.49 to -0.61). The authors recommend using the TGM used in this investigation for athlete monitoring because it was found to be reliable, seems to be more precise than traditional CT measured with a stopwatch, provides an additional variable of value (RT), and may predict future performance.

  9. Perceived Sleep Quality, Mood States, and Their Relationship With Performance Among Brazilian Elite Athletes During a Competitive Period.

    PubMed

    Brandt, Ricardo; Bevilacqua, Guilherme G; Andrade, Alexandro

    2017-04-01

    Brandt, R, Bevilacqua, GG, and Andrade, A. Perceived sleep quality, mood states, and their relationship with performance among Brazilian elite athletes during a competitive period. J Strength Cond Res 31(4): 1033-1039, 2017-We described the perceived sleep quality and mood states of elite athletes during a competitive period, and clarified their relationship to athletes' sport performance. Participants were 576 Brazilian elite athletes (404 men and 172 women) of individual and team sports. Mood states were evaluated using the Brunel Mood Scale, whereas perceived sleep quality was evaluated using a single question ("How would you evaluate the quality of your sleep in the last few days?"). Evaluations of mood state and sleep quality were performed up to 60 minutes before national and international sports competitions began. Descriptive and inferential statistics (including logistic regression) were used to evaluate the relationship of sleep quality and mood states with performance (i.e., winning or losing). Athletes typically had good sleep quality and mood states similar to the Iceberg profile (i.e., high vigor and low tension, depression, anger, fatigue, and mental confusion). The Wald test revealed that sleep, anger, tension, and vigor predicted athletes' performance. Specifically, poor sleep quality and low vigor and anger decreased the odds of winning, whereas higher tension increased these odds. The Hosmer-Lemeshow test indicated that the results were sufficiently generalizable. Overall, we observed a significant relationship between sleep and mood states, which in turn both significantly influenced athletes' sports performance. Thus, coaching staff and athletes should monitor athletes' sleep quality before competitions to ensure athletes are in the optimal condition for performance.

  10. Results from the VALUE perfect predictor experiment: process-based evaluation

    NASA Astrophysics Data System (ADS)

    Maraun, Douglas; Soares, Pedro; Hertig, Elke; Brands, Swen; Huth, Radan; Cardoso, Rita; Kotlarski, Sven; Casado, Maria; Pongracz, Rita; Bartholy, Judit

    2016-04-01

    Until recently, the evaluation of downscaled climate model simulations has typically been limited to surface climatologies, including long term means, spatial variability and extremes. But these aspects are often, at least partly, tuned in regional climate models to match observed climate. The tuning issue is of course particularly relevant for bias corrected regional climate models. In general, a good performance of a model for these aspects in present climate therefore does not imply a good performance in simulating climate change. It is now widely accepted that, to increase our confidence in climate change simulations, it is necessary to evaluate how climate models simulate relevant underlying processes. In other words, it is important to assess whether downscaling is right for the right reason. Therefore, VALUE has carried out a broad process-based evaluation study based on its perfect predictor experiment simulations: the downscaling methods are driven by ERA-Interim data over the period 1979-2008, and reference observations are given by a network of 85 meteorological stations covering all European climates. More than 30 methods participated in the evaluation. In order to compare statistical and dynamical methods, only variables provided by both types of approaches could be considered. This limited the analysis to conditioning local surface variables on variables from driving processes that are simulated by ERA-Interim. We considered the following types of processes: at the continental scale, we evaluated the performance of downscaling methods for positive and negative North Atlantic Oscillation, Atlantic ridge and blocking situations. At synoptic scales, we considered Lamb weather types for selected European regions such as Scandinavia, the United Kingdom, the Iberian Peninsula or the Alps. At regional scales we considered phenomena such as the Mistral, the Bora or the Iberian coastal jet. 
Such process-based evaluation helps to attribute biases in surface variables to underlying processes and ultimately to improve climate models.

  11. A satellite simulator for TRMM PR applied to climate model simulations

    NASA Astrophysics Data System (ADS)

    Spangehl, T.; Schroeder, M.; Bodas-Salcedo, A.; Hollmann, R.; Riley Dellaripa, E. M.; Schumacher, C.

    2017-12-01

    Climate model simulations have to be compared against observation-based datasets in order to assess their skill in representing precipitation characteristics. Here we use a satellite simulator for TRMM PR in order to evaluate simulations performed with MPI-ESM (Earth system model of the Max Planck Institute for Meteorology in Hamburg, Germany) within the MiKlip project (https://www.fona-miklip.de/, funded by the Federal Ministry of Education and Research in Germany). While classical evaluation methods focus on geophysical parameters such as precipitation amounts, the application of the satellite simulator enables an evaluation in the instrument's parameter space, thereby reducing uncertainties on the reference side. The CFMIP Observation Simulator Package (COSP) provides a framework for the application of satellite simulators to climate model simulations. The approach requires the introduction of sub-grid cloud and precipitation variability. Radar reflectivities are obtained by applying Mie theory, with the microphysical assumptions being chosen to match the atmosphere component of MPI-ESM (ECHAM6). The results are found to be sensitive to the methods used to distribute the convective precipitation over the sub-grid boxes. Simple parameterization methods are used to introduce sub-grid variability of convective clouds and precipitation. In order to constrain uncertainties, a comprehensive comparison with sub-grid scale convective precipitation variability deduced from TRMM PR observations is carried out.

  12. Echocardiography-guided or "sided" pericardiocentesis.

    PubMed

    Degirmencioglu, Aleks; Karakus, Gultekin; Güvenc, Tolga Sinan; Pinhan, Osman; Sipahi, Ilke; Akyol, Ahmet

    2013-10-01

    Echocardiography-guided pericardiocentesis is the first choice method for relieving cardiac tamponade, but the exact role of the echocardiography at the moment of the puncture is still controversial. In this report, detailed echocardiographic evaluation was performed in 21 consecutive patients with cardiac tamponade just before the pericardiocentesis. Appropriate needle position was determined according to the probe position using imaginary x, y, and z axes. Pericardiocentesis was performed successfully using this technique without simultaneous echocardiography and no complications were observed. We concluded that bedside echocardiography with detailed evaluation of the puncture site and angle is enough for pericardiocentesis instead of real time guiding. © 2013, Wiley Periodicals, Inc.

  13. Infrared upconversion for astronomical applications. [laser applications to astronomical spectroscopy of infrared spectra

    NASA Technical Reports Server (NTRS)

    Abbas, M. M.; Kostiuk, T.; Ogilvie, K. W.

    1975-01-01

    The performance of an upconversion system is examined for observation of astronomical sources in the low to middle infrared spectral range. Theoretical values for the performance parameters of an upconversion system for astronomical observations are evaluated in view of the conversion efficiencies, spectral resolution, field of view, minimum detectable source brightness and source flux. Experimental results of blackbody measurements and molecular absorption spectrum measurements using a lithium niobate upconverter with an argon-ion laser as the pump are presented. Estimates of the expected optimum sensitivity of an upconversion device which may be built with the presently available components are given.

  14. Interaction Metrics for Feedback Control of Sound Radiation from Stiffened Panels

    NASA Technical Reports Server (NTRS)

    Cabell, Randolph H.; Cox, David E.; Gibbs, Gary P.

    2003-01-01

    Interaction metrics developed for the process control industry are used to evaluate decentralized control of sound radiation from bays on an aircraft fuselage. The metrics are applied to experimentally measured frequency response data from a model of an aircraft fuselage. The purpose is to understand how coupling between multiple bays of the fuselage can destabilize or limit the performance of a decentralized active noise control system. The metrics quantitatively verify observations from a previous experiment, in which decentralized controllers performed worse than centralized controllers. The metrics do not appear to be useful for explaining control spillover which was observed in a previous experiment.

  15. Operational model evaluation for particulate matter in Europe and North America in the context of AQMEII

    NASA Astrophysics Data System (ADS)

    Solazzo, Efisio; Bianconi, Roberto; Pirovano, Guido; Matthias, Volker; Vautard, Robert; Moran, Michael D.; Wyat Appel, K.; Bessagnet, Bertrand; Brandt, Jørgen; Christensen, Jesper H.; Chemel, Charles; Coll, Isabelle; Ferreira, Joana; Forkel, Renate; Francis, Xavier V.; Grell, Georg; Grossi, Paola; Hansen, Ayoe B.; Miranda, Ana Isabel; Nopmongcol, Uarporn; Prank, Marje; Sartelet, Karine N.; Schaap, Martijn; Silver, Jeremy D.; Sokhi, Ranjeet S.; Vira, Julius; Werhahn, Johannes; Wolke, Ralf; Yarwood, Greg; Zhang, Junhua; Rao, S. Trivikrama; Galmarini, Stefano

    2012-06-01

    Ten state-of-the-science regional air quality (AQ) modeling systems have been applied to continental-scale domains in North America and Europe for full-year simulations of 2006 in the context of Air Quality Model Evaluation International Initiative (AQMEII), whose main goals are model inter-comparison and evaluation. Standardised modeling outputs from each group have been shared on the web-distributed ENSEMBLE system, which allows statistical and ensemble analyses to be performed. In this study, the one-year model simulations are inter-compared and evaluated with a large set of observations for ground-level particulate matter (PM10 and PM2.5) and its chemical components. Modeled concentrations of gaseous PM precursors, SO2 and NO2, have also been evaluated against observational data for both continents. Furthermore, modeled deposition (dry and wet) and emissions of several species relevant to PM are also inter-compared. The unprecedented scale of the exercise (two continents, one full year, fifteen modeling groups) allows for a detailed description of AQ model skill and uncertainty with respect to PM. Analyses of PM10 yearly time series and mean diurnal cycle show a large underestimation throughout the year for the AQ models included in AQMEII. The possible causes of PM bias, including errors in the emissions and meteorological inputs (e.g., wind speed and precipitation), and the calculated deposition are investigated. Further analysis of the coarse PM components, PM2.5 and its major components (SO4, NH4, NO3, elemental carbon), have also been performed, and the model performance for each component evaluated against measurements. Finally, the ability of the models to capture high PM concentrations has been evaluated by examining two separate PM2.5 episodes in Europe and North America. A large variability among models in predicting emissions, deposition, and concentration of PM and its precursors during the episodes has been found. 
Major challenges still remain with regard to identifying and eliminating the sources of PM bias in the models. Although PM2.5 was found to be much better estimated by the models than PM10, no model was found to consistently match the observations for all locations throughout the entire year.

  16. Evaluation of Long-Term Cloud-Resolving Model Simulations Using Satellite Radiance Observations and Multi-Frequency Satellite Simulators

    NASA Technical Reports Server (NTRS)

    Matsui, Toshihisa; Zeng, Xiping; Tao, Wei-Kuo; Masunaga, Hirohiko; Olson, William S.; Lang, Stephen

    2008-01-01

    This paper proposes a methodology known as the Tropical Rainfall Measuring Mission (TRMM) Triple-Sensor Three-step Evaluation Framework (T3EF) for the systematic evaluation of precipitating cloud types and microphysics in a cloud-resolving model (CRM). T3EF utilizes multi-frequency satellite simulators and novel statistics of multi-frequency radiance and backscattering signals observed from the TRMM satellite. Specifically, T3EF compares CRM and satellite observations in the form of combined probability distributions of precipitation radar (PR) reflectivity, polarization-corrected microwave brightness temperature (Tb), and infrared Tb to evaluate the candidate CRM. T3EF is used to evaluate the Goddard Cumulus Ensemble (GCE) model for cases involving the South China Sea Monsoon Experiment (SCSMEX) and Kwajalein Experiment (KWAJEX). This evaluation reveals that the GCE properly captures the satellite-measured frequencies of different precipitating cloud types in the SCSMEX case but underestimates the frequencies of deep convective and deep stratiform types in the KWAJEX case. Moreover, the GCE tends to simulate excessively large and abundant frozen condensates in deep convective clouds as inferred from the overestimated GCE-simulated radar reflectivities and microwave Tb depressions. Unveiling the detailed errors in the GCE's performance provides the best direction for model improvements.

  17. Multidetector-Row Computed Tomography in the Evaluation of Transjugular Intrahepatic Portosystemic Shunt Performed with Expanded-Polytetrafluoroethylene-Covered Stent-Graft

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Fanelli, Fabrizio, E-mail: fabrizio.fanelli@uniroma1.it; Bezzi, Mario; Bruni, Antonio

    2011-02-15

    We assessed, in a prospective study, the efficacy of multidetector spiral computed tomography (MDCT) in the evaluation of transjugular intrahepatic portosystemic shunt (TIPS) patency in patients treated with the Viatorr (Gore, Flagstaff, AZ) expanded-polytetrafluoroethylene (e-PTFE)-covered stent-graft. Eighty patients who underwent TIPS procedure using the Viatorr self-expanding e-PTFE stent-graft were evaluated at follow-up of 1, 3, 6, and 12 months with clinical and laboratory tests as well as ultrasound-color Doppler (USCD) imaging. In case of varices, upper gastrointestinal endoscopy was also performed. In addition, the shunt was evaluated using MDCT at 6 and 12 months. In all cases of abnormal findings and discrepancy between MDCT and USCD, invasive control venography was performed. MDCT images were acquired before and after injection of intravenous contrast media on the axial plane and after three-dimensional reconstruction using different algorithms. MDCT was successfully performed in all patients. No artefacts correlated to the Viatorr stent-graft were observed. A missing correlation between USCD and MDCT was noticed in 20 of 80 (25%) patients. Invasive control venography confirmed shunt patency in 16 (80%) cases and shunt malfunction in 4 (20%) cases. According to these data, MDCT sensitivity was 95.2%; specificity was 96.6%; and positive (PPV) and negative predictive values (NPV) were 90.9 and 98.2%, respectively. USCD sensitivity was 90%; specificity was 75%; and PPV and NPV were 54.5 and 95.7%, respectively. A high correlation (K value = 0.85) between MDCT and invasive control venography was observed. On the basis of these results, MDCT shows superior sensitivity and specificity compared with USCD in those patients in whom TIPS was performed with the Viatorr stent-graft. MDCT can be considered a valid tool in the follow-up of these patients.
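
    The four accuracy figures quoted in this abstract (sensitivity, specificity, PPV, NPV) all derive from a 2x2 table of test result against reference standard. The following is a minimal sketch of that arithmetic, not the authors' analysis: the class name, function name, and the counts in the usage note are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class ConfusionCounts:
    """Counts from a 2x2 table of index test vs. reference standard."""
    tp: int  # test positive, reference positive
    fp: int  # test positive, reference negative
    fn: int  # test negative, reference positive
    tn: int  # test negative, reference negative


def diagnostic_metrics(c: ConfusionCounts) -> dict:
    """Return the standard diagnostic accuracy ratios as fractions (0..1)."""
    return {
        "sensitivity": c.tp / (c.tp + c.fn),  # detected among true positives
        "specificity": c.tn / (c.tn + c.fp),  # cleared among true negatives
        "ppv": c.tp / (c.tp + c.fp),          # correct among positive calls
        "npv": c.tn / (c.tn + c.fn),          # correct among negative calls
    }
```

    For example, with the illustrative counts `ConfusionCounts(tp=18, fp=3, fn=2, tn=77)` the function gives a sensitivity of 18/20 and a specificity of 77/80. Note that PPV and NPV, unlike sensitivity and specificity, depend on how common the condition is in the sample.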

  18. Wave and Wind Model Performance Metrics Tools

    NASA Astrophysics Data System (ADS)

    Choi, J. K.; Wang, D. W.

    2016-02-01

    Continual improvements and upgrades of Navy ocean wave and wind models are essential to the assurance of battlespace environment predictability of ocean surface wave and surf conditions in support of Naval global operations. Thus, constant verification and validation of model performance is equally essential to assure the progress of model developments and maintain confidence in the predictions. Global and regional scale model evaluations may require large areas and long periods of time. For observational data to compare against, altimeter winds and waves along the tracks from past and current operational satellites as well as moored/drifting buoys can be used for global and regional coverage. Using data and model runs in previous trials such as the planned experiment, the Dynamics of the Adriatic in Real Time (DART), we demonstrated the use of accumulated altimeter wind and wave data over several years to obtain an objective evaluation of the performance of the SWAN (Simulating Waves Nearshore) model running in the Adriatic Sea. The assessment provided detailed performance of wind and wave models by using cell-averaged statistical variables maps with spatial statistics including slope, correlation, and scatter index to summarize model performance. Such a methodology is easily generalized to other regions and at global scales. Operational technology currently used by subject matter experts evaluating the Navy Coastal Ocean Model and the Hybrid Coordinate Ocean Model can be expanded to evaluate wave and wind models using tools developed for ArcMAP, a GIS application developed by ESRI. Recent inclusion of altimeter and buoy data into a format through the Naval Oceanographic Office's (NAVOCEANO) quality control system and the netCDF standards applicable to all model output makes it possible for the fusion of these data and direct model verification. 
Also, procedures were developed for the accumulation of match-ups of modelled and observed parameters to form a data base with which statistics are readily calculated, for the short or long term. Such a system has potential for a quick transition to operations at NAVOCEANO.
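
    The summary statistics named in this abstract (slope, correlation, scatter index) can be computed from a set of model/observation match-ups. The sketch below is illustrative only and rests on assumptions the abstract does not confirm: the slope is taken as the least-squares regression of model on observation, and the scatter index as RMSE divided by the mean observation (other definitions exist, for example using the bias-removed error). The function name is hypothetical.

```python
import math


def verification_stats(model, obs):
    """Summary statistics for paired model/observation match-ups.

    Assumed definitions (one common convention, not necessarily the
    one used in the abstract):
      slope         least-squares slope of model regressed on obs
      correlation   Pearson correlation coefficient
      scatter_index RMSE divided by the mean observed value
    """
    n = len(obs)
    if n != len(model) or n < 2:
        raise ValueError("need at least two paired values")
    mean_m = sum(model) / n
    mean_o = sum(obs) / n
    # Sums of squares and cross-products about the means
    s_oo = sum((o - mean_o) ** 2 for o in obs)
    s_mm = sum((m - mean_m) ** 2 for m in model)
    s_om = sum((o - mean_o) * (m - mean_m) for o, m in zip(obs, model))
    rmse = math.sqrt(sum((m - o) ** 2 for m, o in zip(model, obs)) / n)
    return {
        "bias": mean_m - mean_o,
        "slope": s_om / s_oo,
        "correlation": s_om / math.sqrt(s_oo * s_mm),
        "scatter_index": rmse / mean_o,
    }
```

    Applied per grid cell to the accumulated match-ups, a function of this kind would yield the sort of cell-averaged statistics maps the abstract describes.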

  19. Development and evaluation of a pliable biological valved conduit. Part II: Functional and hemodynamic evaluation.

    PubMed

    Sung, H W; Witzel, T H; Hata, C; Tu, R; Shen, S H; Lin, D; Noishiki, Y; Tomizawa, Y; Quijano, R C

    1993-04-01

    Many congenital cardiac malformations may require a valved conduit for the reconstruction of the right ventricular outflow tract. In spite of many endeavors made in the last 25 years, the clinical results of right ventricular outflow tract reconstruction with currently available valved conduits are still not satisfactory. Specific problems encountered clinically include suboptimal hemodynamic performance, conduit kinking or compression, and fibrous peeling from the luminal surface. To address these deficiencies, we undertook the development of a biological valved conduit: a bovine external jugular vein graft with a retained native valve cross-linked with a diglycidyl ether (DE). This study, using a canine model, aimed to evaluate the functional and hemodynamic performance of this newly developed valved conduit. Three 14 mm conduits, implanted as bypass grafts, right ventricle to pulmonary artery, were evaluated. The evaluation was conducted with a noninvasive color Doppler flow mapping system at pre-implantation, immediately post implantation, one- and three-months post implantation, and prior to retrieval (five-months post implantation). The two-dimensional tomographic inspection of the leaflet motion at various periods post implantation showed that the valvular leaflets in the DE treated conduit were quite pliable. No cardiac failure or valvular dysfunction was observed in any of the studied cases. The color Doppler flow mapping study demonstrated that the valve in the DE treated conduit was competent, with no conduit kinking or compression observed in any of the three cases. The spectral Doppler velocity study showed that the transvalvular pressure gradients of the DE treated conduit were minimal as compared to those of the currently available conduits. In conclusion, from the functional and hemodynamic performance points of view, this newly developed valved conduit is superior to those currently available.

  20. Phenobarbital in intensive care unit pediatric population: predictive performances of population pharmacokinetic model.

    PubMed

    Marsot, Amélie; Michel, Fabrice; Chasseloup, Estelle; Paut, Olivier; Guilhaumou, Romain; Blin, Olivier

    2017-10-01

    An external evaluation of the phenobarbital population pharmacokinetic model described by Marsot et al. was performed in a pediatric intensive care unit. Model evaluation is an important issue for dose adjustment. This external evaluation should allow confirming the proposed dosage adaptation and extending these recommendations to the entire intensive care pediatric population. External evaluation of the published phenobarbital population pharmacokinetic model of Marsot et al. was carried out in a new retrospective dataset of 35 patients hospitalized in a pediatric intensive care unit. The published population pharmacokinetic model was implemented in NONMEM 7.3. Predictive performance was assessed by quantifying bias and inaccuracy of model prediction. Normalized prediction distribution errors (NPDE) and visual predictive check (VPC) were also evaluated. A total of 35 infants were studied with a mean age of 33.5 weeks (range: 12 days-16 years) and a mean weight of 12.6 kg (range: 2.7-70.0 kg). The model predicted the observed phenobarbital concentrations with a reasonable bias and inaccuracy. The median prediction error was 3.03% (95% CI: -8.52 to 58.12%), and the median absolute prediction error was 26.20% (95% CI: 13.07-75.59%). No trends in NPDE and VPC were observed. The model previously proposed by Marsot et al. in neonates hospitalized in intensive care unit was externally validated for IV infusion administration. The model-based dosing regimen was extended to all pediatric intensive care unit patients to optimize treatment. Due to inter- and intravariability in the pharmacokinetic model, this dosing regimen should be combined with therapeutic drug monitoring. © 2017 Société Française de Pharmacologie et de Thérapeutique.
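
    The bias and inaccuracy measures reported in this abstract, the median prediction error and the median absolute prediction error, can be illustrated with a short calculation. This is a sketch under an assumption: the per-sample prediction error is taken as 100 x (predicted - observed) / observed, a common convention that the abstract itself does not spell out. The function names are hypothetical.

```python
from statistics import median


def prediction_errors(predicted, observed):
    """Relative prediction error in percent for each paired concentration.

    Assumes PE = 100 * (predicted - observed) / observed; observed
    values must be non-zero.
    """
    return [100.0 * (p - o) / o for p, o in zip(predicted, observed)]


def bias_and_inaccuracy(predicted, observed):
    """Return (median PE, median absolute PE) as percentages.

    The median PE measures bias (systematic over- or under-prediction);
    the median absolute PE measures inaccuracy (typical error size).
    """
    pe = prediction_errors(predicted, observed)
    return median(pe), median([abs(e) for e in pe])
```

    Medians are used rather than means so that a few extreme errors do not dominate the summary. A model can show a small bias while still having a large inaccuracy if over- and under-predictions cancel out.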

  1. Pulmonary tumor measurements from x-ray computed tomography in one, two, and three dimensions.

    PubMed

    Villemaire, Lauren; Owrangi, Amir M; Etemad-Rezai, Roya; Wilson, Laura; O'Riordan, Elaine; Keller, Harry; Driscoll, Brandon; Bauman, Glenn; Fenster, Aaron; Parraga, Grace

    2011-11-01

    We evaluated the accuracy and reproducibility of three-dimensional (3D) measurements of lung phantoms and patient tumors from x-ray computed tomography (CT) and compared these to one-dimensional (1D) and two-dimensional (2D) measurements. CT images of three spherical and three irregularly shaped tumor phantoms were evaluated by three observers who performed five repeated measurements. Additionally, three observers manually segmented 29 patient lung tumors five times each. Follow-up imaging was performed for 23 tumors and response criteria were compared. For a single subject, imaging was performed on nine occasions over 2 years to evaluate multidimensional tumor response. To evaluate measurement accuracy, we compared imaging measurements to ground truth using analysis of variance. For estimates of precision, intraobserver and interobserver coefficients of variation and intraclass correlations (ICC) were used. Linear regression and Pearson correlations were used to evaluate agreement and tumor response was descriptively compared. For spherical shaped phantoms, all measurements were highly accurate, but for irregularly shaped phantoms, only 3D measurements were in high agreement with ground truth measurements. All phantom and patient measurements showed high intra- and interobserver reproducibility (ICC >0.900). Over a 2-year period for a single patient, there was disagreement between tumor response classifications based on 3D measurements and those generated using 1D and 2D measurements. Tumor volume measurements were highly reproducible and accurate for irregular, spherical phantoms and patient tumors with nonuniform dimensions. Response classifications obtained from multidimensional measurements suggest that 3D measurements provide higher sensitivity to tumor response. Copyright © 2011 AUR. Published by Elsevier Inc. All rights reserved.

  2. Human interaction with robotic systems: performance and workload evaluations.

    PubMed

    Reinerman-Jones, L; Barber, D J; Szalma, J L; Hancock, P A

    2017-10-01

    We first tested the effect of differing tactile informational forms (i.e. directional cues vs. static cues vs. dynamic cues) on objective performance and perceived workload in a collaborative human-robot task. A second experiment evaluated the influence of task load and informational message type (i.e. single words vs. grouped phrases) on that same collaborative task. In both experiments, the relationship of personal characteristics (attentional control and spatial ability) to performance and workload was also measured. In addition to objective performance and self-report of cognitive load, we evaluated different physiological responses in each experiment. Results showed a performance-workload association for directional cues, message type and task load. EEG measures however, proved generally insensitive to such task load manipulations. Where significant EEG effects were observed, right hemisphere amplitude differences predominated, although unexpectedly these latter relationships were negative. Although EEG measures were partially associated with performance, they appear to possess limited utility as measures of workload in association with tactile displays. Practitioner Summary: As practitioners look to take advantage of innovative tactile displays in complex operational realms like human-robotic interaction, associated performance effects are mediated by cognitive workload. Despite some patterns of association, reliable reflections of operator state can be difficult to discern and employ as the number, complexity and sophistication of these respective measures themselves increase.

  3. Verification of NWP Cloud Properties using A-Train Satellite Observations

    NASA Astrophysics Data System (ADS)

    Kucera, P. A.; Weeks, C.; Wolff, C.; Bullock, R.; Brown, B.

    2011-12-01

    Recently, the NCAR Model Evaluation Tools (MET) has been enhanced to incorporate satellite observations for the verification of Numerical Weather Prediction (NWP) cloud products. We have developed tools that match fields spatially (both in the vertical and horizontal dimensions) to compare NWP products with satellite observations. These matched fields provide diagnostic evaluation of cloud macro attributes such as vertical distribution of clouds, cloud top height, and the spatial and seasonal distribution of cloud fields. For this research study, we have focused on using CloudSat, CALIPSO, and MODIS observations to evaluate cloud fields for a variety of NWP fields and derived products. We have selected cases ranging from large, mid-latitude synoptic systems to well-organized tropical cyclones. For each case, we matched the observed cloud field with gridded model and/or derived product fields. CloudSat and CALIPSO observations and model fields were matched and compared in the vertical along the orbit track. MODIS data and model fields were matched and compared in the horizontal. We then use MET to compute the verification statistics to quantify the performance of the models in representing the cloud fields. In this presentation we will give a summary of our comparison and show verification results for both synoptic and tropical cyclone cases.

  4. The WACMOS-ET project – Part 1: Tower-scale evaluation of four remote-sensing-based evapotranspiration algorithms

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Michel, D.; Jimenez, C.; Miralles, D. G.

    The WAter Cycle Multi-mission Observation Strategy – EvapoTranspiration (WACMOS-ET) project has compiled a forcing data set covering the period 2005–2007 that aims to maximize the exploitation of European Earth Observations data sets for evapotranspiration (ET) estimation. The data set was used to run four established ET algorithms: the Priestley–Taylor Jet Propulsion Laboratory model (PT-JPL), the Penman–Monteith algorithm from the MODerate resolution Imaging Spectroradiometer (MODIS) evaporation product (PM-MOD), the Surface Energy Balance System (SEBS) and the Global Land Evaporation Amsterdam Model (GLEAM). In addition, in situ meteorological data from 24 FLUXNET towers were used to force the models, with results from both forcing sets compared to tower-based flux observations. Model performance was assessed on several timescales using both sub-daily and daily forcings. The PT-JPL model and GLEAM provide the best performance for both satellite- and tower-based forcing as well as for the considered temporal resolutions. Simulations using the PM-MOD were mostly underestimated, while the SEBS performance was characterized by a systematic overestimation. In general, all four algorithms produce the best results in wet and moderately wet climate regimes. In dry regimes, the correlation and the absolute agreement with the reference tower ET observations were consistently lower. While ET derived with in situ forcing data agrees best with the tower measurements (R² = 0.67), the agreement of the satellite-based ET estimates is only marginally lower (R² = 0.58). Results also show similar model performance at daily and sub-daily (3-hourly) resolutions. Overall, our validation experiments against in situ measurements indicate that there is no single best-performing algorithm across all biome and forcing types. 
In conclusion, an extension of the evaluation to a larger selection of 85 towers (model inputs resampled to a common grid to facilitate global estimates) confirmed the original findings.« less

  5. The WACMOS-ET project – Part 1: Tower-scale evaluation of four remote-sensing-based evapotranspiration algorithms

    DOE PAGES

    Michel, D.; Jimenez, C.; Miralles, D. G.; ...

    2016-02-23

    The WAter Cycle Multi-mission Observation Strategy – EvapoTranspiration (WACMOS-ET) project has compiled a forcing data set covering the period 2005–2007 that aims to maximize the exploitation of European Earth Observations data sets for evapotranspiration (ET) estimation. The data set was used to run four established ET algorithms: the Priestley–Taylor Jet Propulsion Laboratory model (PT-JPL), the Penman–Monteith algorithm from the MODerate resolution Imaging Spectroradiometer (MODIS) evaporation product (PM-MOD), the Surface Energy Balance System (SEBS) and the Global Land Evaporation Amsterdam Model (GLEAM). In addition, in situ meteorological data from 24 FLUXNET towers were used to force the models, with results from both forcing sets compared to tower-based flux observations. Model performance was assessed on several timescales using both sub-daily and daily forcings. The PT-JPL model and GLEAM provide the best performance for both satellite- and tower-based forcing as well as for the considered temporal resolutions. Simulations using the PM-MOD were mostly underestimated, while the SEBS performance was characterized by a systematic overestimation. In general, all four algorithms produce the best results in wet and moderately wet climate regimes. In dry regimes, the correlation and the absolute agreement with the reference tower ET observations were consistently lower. While ET derived with in situ forcing data agrees best with the tower measurements (R² = 0.67), the agreement of the satellite-based ET estimates is only marginally lower (R² = 0.58). Results also show similar model performance at daily and sub-daily (3-hourly) resolutions. Overall, our validation experiments against in situ measurements indicate that there is no single best-performing algorithm across all biome and forcing types. In conclusion, an extension of the evaluation to a larger selection of 85 towers (model inputs resampled to a common grid to facilitate global estimates) confirmed the original findings.

  6. An integrated approach of AHP and DEMATEL methods in evaluating the criteria of auto spare parts industry

    NASA Astrophysics Data System (ADS)

    Wu, Hsin-Hung; Tsai, Ya-Ning

    2012-11-01

    This study uses both the analytic hierarchy process (AHP) and the decision-making trial and evaluation laboratory (DEMATEL) method to evaluate the criteria in the auto spare parts industry in Taiwan. Traditionally, AHP does not consider indirect effects for each criterion and assumes that criteria are independent, without addressing the interdependence among them. Thus, the importance computed by AHP can be viewed as a short-term improvement opportunity. By contrast, the DEMATEL method not only evaluates the importance of criteria but also depicts the causal relations among them. The causal diagrams suggest that improvements based on cause-oriented criteria might raise performance effectively and efficiently over the long term. As a result, the major advantage of integrating the AHP and DEMATEL methods is that the decision maker can continuously improve suppliers' performance from both short-term and long-term viewpoints.

  7. Observational studies are complementary to randomized controlled trials.

    PubMed

    Grootendorst, Diana C; Jager, Kitty J; Zoccali, Carmine; Dekker, Friedo W

    2010-01-01

    Randomized controlled trials (RCTs) are considered the gold standard study design to investigate the effect of health interventions, including treatment. However, in some situations, it may be unnecessary, inappropriate, impossible, or inadequate to perform an RCT. In these special situations, well-designed observational studies, including cohort and case-control studies, may provide an alternative to doing nothing in order to obtain estimates of treatment effect. It should be noted that such studies should be performed with caution and correctly. The aims of this review are (1) to explain why RCTs are considered the optimal study design to evaluate treatment effects, (2) to describe the situations in which an RCT is not possible and observational studies are an adequate alternative, and (3) to explain when randomization is not needed and can be approximated in observational studies. Examples from the nephrology literature are used for illustration. Copyright 2009 S. Karger AG, Basel.

  8. In-flight calibration and performance evaluation of the fixed head star trackers for the solar maximum mission

    NASA Technical Reports Server (NTRS)

    Thompson, R. H.; Gambardella, P. J.

    1980-01-01

    The Solar Maximum Mission (SMM) spacecraft provides an excellent opportunity for evaluating attitude determination accuracies achievable with tracking instruments such as fixed head star trackers (FHSTs). As a part of its payload, SMM carries a highly accurate fine pointing Sun sensor (FPSS). The FPSS provides an independent check of the pitch and yaw parameters computed from observations of stars in the FHST field of view. A method to determine the alignment of the FHSTs relative to the FPSS using spacecraft data is applied. Two methods that were used to determine distortions in the 8 degree by 8 degree field of view of the FHSTs using spacecraft data are also presented. The attitude determination accuracy of the in-flight calibrated FHSTs is evaluated.

  9. The impact of green roof ageing on substrate characteristics and hydrological performance

    NASA Astrophysics Data System (ADS)

    De-Ville, Simon; Menon, Manoj; Jia, Xiaodong; Reed, George; Stovin, Virginia

    2017-04-01

    Green roofs contribute to stormwater management through the retention of rainfall and the detention of runoff. However, there is very limited knowledge concerning the evolution of green roof hydrological performance with system age. This study presents a non-invasive technique which allows for repeatable determination of key substrate characteristics over time, and evaluates the impact of observed substrate changes on hydrological performance. The physical properties of 12 green roof substrate cores have been evaluated using non-invasive X-ray microtomography (XMT) imaging. The cores comprised three replicates of two contrasting substrate types at two different ages: unused virgin samples; and 5-year-old samples from existing green roof test beds. Whilst significant structural differences (density, pore and particle sizes, tortuosity) between virgin and aged samples of a crushed brick substrate were observed, these differences did not significantly affect hydrological characteristics (maximum water holding capacity and saturated hydraulic conductivity). A contrasting substrate based upon a light expanded clay aggregate experienced increases in the number of fine particles and pores over time, which led to increases in maximum water holding capacity of 7%. In both substrates, the saturated hydraulic conductivity estimated from the XMT images was lower in aged compared with virgin samples. Comparisons between physically-derived and XMT-derived substrate hydrological properties showed that similar values and trends in the data were identified, confirming the suitability of the non-invasive XMT technique for monitoring changes in engineered substrates over time. The observed effects of ageing on hydrological performance were modelled as two distinct hydrological processes, retention and detention. Retention performance was determined via a moisture-flux model using physically-derived values of virgin and aged maximum water holding capacity. 
Increased water holding capacity with age increases the potential for retention performance. However, seasonal variations in retention performance greatly exceed those associated with the observed age-related increases in water holding capacity (+72% vs +7% respectively). Detention performance was determined via an unsaturated-flow finite element model, using van Genuchten parameters and XMT-derived values of saturated hydraulic conductivity. Reduced saturated hydraulic conductivity increases detention performance. For a 1-hour 30-year design storm, the peak runoff was found to be 33% lower for the aged brick-based substrate compared with its virgin counterpart.

  10. Implementation research to improve quality of maternal and newborn health care, Malawi.

    PubMed

    Brenner, Stephan; Wilhelm, Danielle; Lohmann, Julia; Kambala, Christabel; Chinkhumba, Jobiba; Muula, Adamson S; De Allegri, Manuela

    2017-07-01

    To evaluate the impact of a performance-based financing scheme on maternal and neonatal health service quality in Malawi. We conducted a non-randomized controlled before and after study to evaluate the effects of district- and facility-level performance incentives for health workers and management teams. We assessed changes in the facilities' essential drug stocks, equipment maintenance and clinical obstetric care processes. Difference-in-difference regression models were used to analyse effects of the scheme on adherence to obstetric care treatment protocols and provision of essential drugs, supplies and equipment. We observed 33 health facilities (23 intervention facilities and 10 control facilities) and 401 pregnant women across four districts. The scheme improved the availability of both functional equipment and essential drug stocks in the intervention facilities. We observed positive effects with respect to drug procurement and clinical care activities at non-intervention facilities, likely in response to improved district management performance. Birth assistants' adherence to clinical protocols improved across all studied facilities as district health managers supervised and coached clinical staff more actively. Despite nation-wide stock-outs and extreme health worker shortages, facilities in the study districts managed to improve maternal and neonatal health service quality by overcoming bottlenecks related to supply procurement, equipment maintenance and clinical performance. To strengthen and reform health management structures, performance-based financing may be a promising approach to sustainable improvements in quality of health care.

  11. The Modified, Multi-patient Observed Simulated Handoff Experience (M-OSHE): Assessment and Feedback for Entering Residents on Handoff Performance.

    PubMed

    Gaffney, Sean; Farnan, Jeanne M; Hirsch, Kristen; McGinty, Michael; Arora, Vineet M

    2016-04-01

    Despite the identification of transfer of patient responsibility as a Core Entrustable Professional Activity for Entering Residency, rigorous methods to evaluate incoming residents' ability to give a verbal handoff of multiple patients are lacking. Our purpose was to implement a multi-patient, simulation-based curriculum to assess verbal handoff performance. The setting was Graduate Medical Education (GME) orientation at an urban, academic medical center. Eighty-four incoming residents from four residency programs participated in the study. The curriculum featured an online training module and a multi-patient observed simulated handoff experience (M-OSHE). Participants verbally "handed off" three mock patients of varying acuity and were evaluated by a trained "receiver" using an expert-informed, five-item checklist. Prior handoff experience in medical school was associated with higher checklist scores (23% none vs. 33% either third OR fourth year vs. 58% third AND fourth year, p = 0.021). Prior training was associated with prioritization of patients based on acuity (12% no training vs. 38% prior training, p = 0.014). All participants agreed that the M-OSHE realistically portrayed a clinical setting. The M-OSHE is a promising strategy for teaching and evaluating entering residents' ability to give verbal handoffs of multiple patients. Prior training and more handoff experience were associated with higher performance, which suggests that additional handoff training in medical school may be of benefit.

  12. Evaluation of large-eddy simulations forced with mesoscale model output for a multi-week period during a measurement campaign

    NASA Astrophysics Data System (ADS)

    Heinze, Rieke; Moseley, Christopher; Böske, Lennart Nils; Muppa, Shravan Kumar; Maurer, Vera; Raasch, Siegfried; Stevens, Bjorn

    2017-06-01

    Large-eddy simulations (LESs) of a multi-week period during the HD(CP)2 (High-Definition Clouds and Precipitation for advancing Climate Prediction) Observational Prototype Experiment (HOPE) conducted in Germany are evaluated with respect to mean boundary layer quantities and turbulence statistics. Two LES models are used in a semi-idealized setup through forcing with mesoscale model output to account for the synoptic-scale conditions. Evaluation is performed based on the HOPE observations. Mean boundary layer characteristics such as the boundary layer depth agree in principle with observations. Simulating shallow-cumulus layers in agreement with the measurements poses a challenge for both LES models. Variance profiles agree satisfactorily with lidar measurements. The results depend on how the forcing data stemming from mesoscale model output are constructed. The mean boundary layer characteristics become less sensitive if the averaging domain for the forcing is large enough to filter out mesoscale fluctuations.

  13. Iris alterations after DSAEK.

    PubMed

    Del Hierro Zarzuelo, A; Boto de Los Bueis, A

    2016-09-01

    To evaluate a series of cases that developed iris changes after Descemet's stripping automated endothelial keratoplasty (DSAEK). Retrospective study of eyes that developed iris abnormalities, such as pupil ovalisation, iris atrophy, iridocorneal synechiae, mydriatic pupil, and pigmentary changes after DSAEK in a tertiary hospital. In a series of the first 32 DSAEK procedures performed, new single or mixed iris alterations were observed in 12 eyes (37.5%). Iris-corneal synechiae were observed in 7 eyes, corectopias in 9 eyes, iris atrophy in 3 cases, and one case developed an areflexic mydriatic pupil. Long-term pigment dispersion at the edge of the lenticule was observed in 12 eyes. The alterations occurred after three months from the surgery. In the evaluation of the associated factors, malignant glaucoma had occurred in 1 case, and 2 eyes had required a second surgery, one a re-DSAEK and the other removal of the intraocular lens due to lens opacification. Two cases had a shallow anterior chamber. No relationship was found between the thickness of the peripheral lenticule and the presence of synechiae. Iris changes after DSAEK are possible. A discussion is presented on the relationship between increased intraocular pressure due to air in the anterior chamber and ischaemia and secondary alterations in the iris. Copyright © 2016 Sociedad Española de Oftalmología. Published by Elsevier España, S.L.U. All rights reserved.

  14. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Schoellnast, Helmut; Monette, Sebastien; Ezell, Paula C.

    To evaluate the effects of irreversible electroporation (IRE) on the rectum wall after IRE applied adjacent to the rectum. CT-guided IRE adjacent to the rectum wall was performed in 11 pigs; a total of 44 lesions were created. In five pigs, ablations were performed without a water-filled endorectal coil (group A); in six pigs, ablation was performed with the coil to avoid displacement of the rectum wall (group B). The pigs were killed after 7-15 days and the rectums were harvested for pathological evaluation. There was no evidence of perforation on gross postmortem examination. Perirectal muscle lesions were observed in 18 of 20 ablations in group A and in 21 of 24 ablations in group B. Inflammation and fibrosis of the muscularis propria was observed in ten of 18 lesions in group A and in ten of 21 lesions in group B. In group A, findings were limited to the external layer of the muscularis propria except for one lesion; in group B, findings were transmural in all cases. Transmural necrosis with marked suppurative mucosal inflammation was observed in seven of 21 lesions in group B and in no lesion in group A. IRE-ablation adjacent to the rectum may be uneventful if the rectum wall is mobile and able to contract. IRE-ablation of the rectum may be harmful if the rectum wall is fixed adjacent to the IRE-probe.

  15. Training Feedback Handbook. Research Product 83-7.

    ERIC Educational Resources Information Center

    Burnside, Billy L.; And Others

    This handbook is designed to assist training developers and evaluators in structuring their collection of feedback data. Addressed first are various methods for collecting feedback data, including informal feedback, existing unit performance records, questionnaires, structured interviews, systematic observation, and testing. The next chapter, a…

  16. Evaluation and implementation of BMPs for NCDOT's highway and industrial facilities : final report, May 2006.

    DOT National Transportation Integrated Search

    2006-05-01

    This research has provided NCDOT with (1) scientific observations to validate the pollutant removal performance of selected structural BMPs, (2) a database management option for BMP monitoring and non-monitoring sites, (3) pollution prevention pl...

  17. Developing Leadership Content Knowledge during School Leader Preparation

    ERIC Educational Resources Information Center

    Carver, Cynthia L.

    2012-01-01

    This instructional module describes a performance assessment designed to equip prospective principals with the knowledge and skill needed to evaluate curriculum, observe and assess instruction, interact meaningfully with teachers about instructional decision-making, and design professional learning opportunities that enhance student learning…

  18. Lesion detection performance of cone beam CT images with anatomical background noise: single-slice vs. multi-slice human and model observer study

    NASA Astrophysics Data System (ADS)

    Han, Minah; Jang, Hanjoo; Baek, Jongduk

    2018-03-01

    We investigate lesion detectability and its trends for different noise structures in single-slice and multislice CBCT images with anatomical background noise. Anatomical background noise is modeled using a power law spectrum of breast anatomy. A spherical signal with a 2 mm diameter is used to model a lesion. CT projection data are acquired by forward projection and reconstructed by the Feldkamp-Davis-Kress algorithm. To generate different noise structures, two types of reconstruction filters (Hanning and Ram-Lak weighted ramp filters) are used in the reconstruction, and the transverse and longitudinal planes of the reconstructed volume are used for detectability evaluation. To evaluate single-slice images, the central slice, which contains the maximum signal energy, is used. To evaluate multislice images, the central nine slices are used. Detectability is evaluated using human and model observer studies. For the model observer, a channelized Hotelling observer (CHO) with dense difference-of-Gaussian (D-DOG) channels is used. For all noise structures, detectability by a human observer is higher for multislice images than for single-slice images, and the degree of detectability increase in multislice images depends on the noise structure. Variation in detectability for different noise structures is reduced in multislice images, but detectability trends are not much different between single-slice and multislice images. The CHO with D-DOG channels predicts detectability by a human observer well for both single-slice and multislice images.

  19. Global Characterization of Protein Altering Mutations in Prostate Cancer

    DTIC Science & Technology

    2011-08-01

    prevalence of candidate cancer genes observed here in prostate cancer. (3) Perform integrative analyses of somatic mutation with gene expression and copy...analyses of somatic mutation with gene expression and copy number change data collected on the same samples. Body This is a “synergy” project between...However, to perform initial verification/validation studies, we have evaluated the mutation calls for several genes discovered initially by the

  20. An evaluation of hand hygiene in an intensive care unit: Are visitors a potential vector for pathogens?

    PubMed

    Birnbach, David J; Rosen, Lisa F; Fitzpatrick, Maureen; Arheart, Kristopher L; Munoz-Price, L Silvia

    2015-01-01

    Patients in an intensive care unit (ICU) are frequently immunocompromised and might be highly susceptible to infection. Visitors to an ICU who do not adequately clean their hands could carry pathogenic organisms, resulting in risk to a vulnerable patient population. This observational study identifies pathogens carried on the hands of visitors into an ICU and investigates the effect of hand hygiene. Two observers, one stationed outside and one inside the ICU, evaluated whether visitors performed hand hygiene at any of the wall-mounted alcohol-based hand sanitizer dispensers prior to reaching a patient's room. Upon reaching a patient's room, the dominant hand of all of the participants was cultured. Of the 55 participating visitors, 35 did not disinfect their hands. Among the cultures of those who failed to perform hand hygiene, eight cultures grew Gram-negative rods and one grew methicillin-resistant Staphylococcus aureus. Of the cultures of the 20 individuals who performed hand hygiene, 14 (70%) had no growth on the cultures, and the remaining six (30%) showed only the usual skin flora. The visitors who do not perform hand hygiene might carry pathogens that pose a risk to ICU patients. Copyright © 2015 King Saud Bin Abdulaziz University for Health Sciences. Published by Elsevier Ltd. All rights reserved.
